Maximizing Value with AI Assistants at Jane Street: Challenges in Utilizing Large Language Models within an OCaml-centric Development Environment

Join me as we explore the innovative approach to large language models at Jane Street, where our team is maximizing value through custom models, editor integrations, and a focus on developer tools.

  • 1. John Kzi works at Jane Street's AI Assistant team, which maximizes the value of large language models (LLMs) for the company.
  • 2. He has spent his career in Dev tools, previously working at GitHub and other companies.
  • 3. Jane Street uses OCaml as their primary development platform, a functional language that is powerful but obscure and not widely used.
  • 4. They have built their own build systems, distributed build environment, code review system (Iron), and even store their monorepo in Mercurial instead of Git.
  • 5. The team's dream is to apply LLMs to various parts of their development flow, like resolving merge conflicts or figuring out reviewers for features.
  • 6. John will discuss their approach to LLMs at Jane Street, focusing on custom models, editor integrations, and evaluating model performance.
  • 7. Training models can be expensive and time-consuming but is essential for creating useful LLMs.
  • 8. Jane Street built a Code Evaluation Service (CES) that applies the model's code changes to a base revision and checks if it compiles and passes tests.
  • 9. CES also helps align the model's abilities with human ideas of good code during the reinforcement learning phase.
  • 10. The real test of models is whether they work for humans, so Jane Street builds editor integrations to expose these models to developers.
  • 11. Their AI development environment (Aid) handles prompt construction, context building, and build status; it sits as a sidecar application on developers' machines.
  • 12. Aid's architecture allows them to swap in new models, make changes to the context-building process, add support for new editors, and integrate domain-specific tools.
  • 13. Aid also enables A/B testing different approaches by sending portions of the company to different models and comparing acceptance rates.
  • 14. The team is working on applying LLMs in various ways within editors and large-scale multi-agent workflows, as well as working with reasoning models more often.
  • 15. Jane Street's approach focuses on keeping things pluggable, laying a strong foundation, and building ways for the rest of the company to add domain-specific tooling on top of it.

Source: AI Engineer via YouTube

❓ What do you think? What are your thoughts on the ideas shared in this video? Feel free to share your thoughts in the comments!