Scaling Systems with Compute: The Rise of AI Agents

Unlocking the power of AI, I'm here to share a groundbreaking idea that could revolutionize how we build systems that scale with compute.

  • 1. Speaker has been working on Learning Management Systems (LMS) for four years.
  • 2. Started focusing on AI agent company when ChatGPT was released.
  • 3. Initially, they used GPD2 models but found them to be frustratingly stupid with small context windows and poor reasoning capabilities.
  • 4. They wrote lots of code around these models to make them work reliably.
  • 5. As models became smarter, they had to delete more of the custom code written earlier.
  • 6. Observed patterns in building agents that can scale with intelligence.
  • 7. Built a structure extraction library called Jsonformer.
  • 8. Wrote code around models as they were too stupid to work with JSON.
  • 9. Core agenda: convey one idea through various examples and demos.
  • 10. Idea: systems that scale with compute beat systems that don't.
  • 11. Rigid, fixed, deterministic systems can think or use more compute when needed.
  • 12. Exponentials are rare and should be taken advantage of when found.
  • 13. Examples from history: chess, go, computer vision, Atari games.
  • 14. General method always ends up winning as it scales out search capabilities.
  • 15. Ramp is a finance platform that helps businesses manage expenses, payments, procurement, and travel efficiently.
  • 16. AI is used extensively in the product to automate routine tasks.
  • 17. Discussing an agent called Switching Report: a checklist for transactions from third-party card providers when people onboard to Ramp.
  • 18. Want to support arbitrary CSV formats from other platforms with crazy schemas.
  • 19. Three approaches to solving the problem: manual code, LMS with scripting, and giving the entire CSV to an LLM for interpretation.
  • 20. The third approach works well when run 50 times in parallel, generalizing across different formats.
  • 21. Although it requires significantly more compute, engineer time is scarce and the cost of running the system is lower than manual processing per transaction.
  • 22. Different approaches can be represented as arrows: Black Arrow (classical compute), Blue Arrows (fuzzy land with neuron nets and weird matrix multiplication).
  • 23. Ramp codebase is moving towards an approach where the LLM decides when to break into classical land and write some code or use pandas/Python code.
  • 24. More blue arrow usage in a company's codebase can help directly by leveraging exponential trends without much effort from their end.

Source: AI Engineer via YouTube

❓ What do you think? What are your thoughts on the ideas shared in this video? Feel free to share your thoughts in the comments!