Mistral AI CEO Arthur Mensch sat down with investor Elad Gil at Figma's offices to cover ground that matters: how Mistral built and shipped its first model, Mistral 7B, in four months starting from zero GPUs and scaling to 500, with a team of roughly four to five people. Mensch, formerly a staff research scientist at DeepMind, co-founded Mistral with Guillaume Lample and Timothée Lacroix from Meta's LLaMA project after GPT's breakout signaled an opening. The company's open-core strategy, releasing Mistral 7B and Mixtral 8x7B publicly while selling commercial models and a hosted platform, is the structural bet worth understanding in full.

The conversation gets specific on why small models won: LLaMA 7B existed but was not good enough, and Mistral saw a community ready to build on something better. Fast inference, low time-to-first-token, and cheap high-throughput operation were the actual selling points, not scale for its own sake. On the enterprise side, Mensch identifies two distinct go-to-market tracks: financial services firms accessed partly through cloud partnerships including a disclosed deal with Microsoft Azure, and developer-native companies reached directly through the platform.

The roadmap discussion is brief but pointed. Mistral is shipping new open-source models in both generalist and vertical-specific flavors, adding fine-tuning features to the platform, and iterating on a chat assistant called Le Chat that Mensch describes as 'ChatGPT version zero.' The full conversation covers EU regulatory context, the organizational logic behind five-person AI squads, and what enterprises actually need versus what the market assumes. Read it for the operational details, not just the positioning.

[READ ORIGINAL →]