As AI systems become more capable, speech is fast becoming the default way we communicate with machines. French AI startup Mistral has jumped into the audio race with its first open audio model, aiming to challenge the dominance of walled-off corporate systems with an open-weight alternative.
On Tuesday, Mistral announced the release of Voxtral, its first family of audio models aimed at businesses.
The company is pitching Voxtral as the first open model capable of delivering “truly usable speech intelligence in production.”
In other words, no longer will developers have to choose between a cheap, open system that fumbles transcriptions and doesn’t really understand what’s being said, and one that functions well, but is closed, leaving developers with a higher bill and less control over deployment.
For businesses, that means Voxtral offers an affordable alternative that the company claims is “less than half the price” of comparable solutions.

Mistral says Voxtral can transcribe up to 30 minutes of audio. Thanks to its LLM backbone, Mistral Small 3.1, it can understand up to 40 minutes of audio, allowing users to ask questions about the content, generate summaries, or turn voice commands into real-time actions like calling APIs or running functions. Voxtral is also multilingual, with the ability to transcribe and understand languages including English, Spanish, French, Portuguese, Hindi, German, Dutch, and Italian.
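For developers with recordings longer than the quoted 30-minute transcription limit, one plausible approach is to split the audio into segments before sending it off. The sketch below only plans the segment boundaries. The function name `plan_segments` and the choice to cut at fixed intervals are illustrative assumptions, not anything Mistral has described.

```python
# Hypothetical helper: plan fixed-length segments for a long recording.
# The 30-minute default mirrors the transcription limit quoted in the article.
MAX_TRANSCRIBE_SECONDS = 30 * 60


def plan_segments(total_seconds, max_seconds=MAX_TRANSCRIBE_SECONDS):
    """Return (start, end) offsets in seconds, each spanning at most max_seconds."""
    if total_seconds < 0 or max_seconds <= 0:
        raise ValueError("total_seconds must be >= 0 and max_seconds must be > 0")

    segments = []
    start = 0
    while start < total_seconds:
        end = min(start + max_seconds, total_seconds)
        segments.append((start, end))
        start = end
    return segments


if __name__ == "__main__":
    # A 75-minute recording needs three segments under this scheme.
    for start, end in plan_segments(75 * 60):
        print(f"{start // 60:>3} min to {end // 60:>3} min")
```

Cutting at fixed offsets can split a sentence in half, so a real pipeline would probably prefer to cut at silences or overlap the segments slightly.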
The company is offering up two variants of its “speech understanding models.” The first, Voxtral Small, has 24 billion parameters for production-scale deployments, and is competitive with ElevenLabs Scribe, GPT-4o-mini, and Gemini 2.5 Flash.
The second, Voxtral Mini, has 3 billion parameters for local and edge deployments. There’s also an ultra-cheap, stripped-down, fast API version of the 3-billion-parameter model called Voxtral Mini Transcribe, which is optimized for transcription-only use cases and promises to outperform OpenAI Whisper for less than half the price.
Users can try Voxtral for free by downloading the models from Hugging Face or testing them in Mistral’s chatbot Le Chat. Integrating the API into applications starts at $0.001 per minute, according to the company.
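To put the quoted starting price in perspective, here is a rough back-of-the-envelope calculation. It assumes a flat $0.001 per minute of audio with no volume tiers, minimums, or per-model differences, none of which the article specifies. The function name is illustrative.

```python
from decimal import Decimal

# Starting API price quoted in the article. Actual rates may differ by model.
PRICE_PER_MINUTE_USD = Decimal("0.001")


def estimate_cost_usd(audio_minutes, price_per_minute=PRICE_PER_MINUTE_USD):
    """Estimate API cost in USD for a given number of audio minutes."""
    if audio_minutes < 0:
        raise ValueError("audio_minutes must be non-negative")
    # Decimal avoids binary floating-point rounding on tiny per-minute prices.
    return Decimal(str(audio_minutes)) * price_per_minute


if __name__ == "__main__":
    for minutes in (30, 1_000, 60_000):
        print(f"{minutes:>6} minutes: ${estimate_cost_usd(minutes)}")
```

At that rate, 1,000 hours of audio (60,000 minutes) would cost about $60.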
The launch comes a month after Mistral announced Magistral, its first family of reasoning models that work through problems step-by-step for improved reliability.
Mistral, one of the top AI firms in Europe, is well known for advocating open source AI models. Earlier this month, TechCrunch reported that the company is in talks to raise up to $1 billion in equity from investors like Abu Dhabi’s MGX fund.
