/
© 2026 RiffOn. All rights reserved.

Get your free personalized podcast brief

We scan new podcasts and send you the top 5 insights daily.

  1. Cheeky Pint
  2. The world of voice AI, with Mati Staniszewski of ElevenLabs
The world of voice AI, with Mati Staniszewski of ElevenLabs

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint · Apr 14, 2026

ElevenLabs cofounder Mati Staniszewski on the tech behind voice AI, scaling to $450M+ ARR, and closing the gap to achieve a voice Turing test.

"Cascaded" Voice AI Models (Speech-to-Text-to-Speech) Outperform Direct Speech-to-Speech for Enterprise

While direct speech-to-speech models are faster (lower latency), they are less reliable and "dumber." ElevenLabs bets on a "cascaded" approach that uses text as an intermediate layer, providing greater accuracy, visibility, and control—features that are critical for most enterprise applications.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

ElevenLabs' AI Models Develop "Britishness" as an Emergent Property, Not a Hardcoded Parameter

Early voice models required hardcoding parameters like accent or emotion. Modern models, like those from ElevenLabs, learn these nuances contextually from data, allowing complex traits like a specific accent to emerge naturally without being explicitly programmed.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

Employee "Agency" Is the Most Critical Trait for Success in an AI-Driven Company

In a world where AI handles routine tasks, the most valuable human contribution is the initiative to solve problems independently. ElevenLabs prioritizes hiring for "agency," seeing it as the ultimate amplifier for an individual's impact, regardless of their seniority. High-agency people are the winners of the AI era.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

ElevenLabs Subsidizes New AI Models to Accelerate Adoption and User Feedback

Counterintuitively, instead of charging a premium for their latest and most powerful models, ElevenLabs often makes them economically attractive, sometimes at cost. This strategy encourages widespread use, generates crucial feedback for refinement, and showcases what's possible, creating a powerful distribution and learning mechanism.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

ElevenLabs' Core Speech-to-Text Product Originated as an Internal Tool for Data Labeling

The company needed a high-quality speech-to-text model to annotate its own training data because existing market solutions were inadequate. This internal necessity evolved into a successful, customer-facing product, demonstrating the value of building tools to solve your own critical problems.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

Person-Specific Voice Transcription Models Are a Solvable, Near-Term Problem

Current transcription models use a global approach, often struggling with individual accents. ElevenLabs states that models fine-tuned on a specific person's voice (e.g., from an hour of audio) are not a distant research challenge but a solvable problem and an imminent product release, promising superhuman accuracy.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

AI-Native Companies Like ElevenLabs Operate With a 15+ Direct Report Span of Control

Traditional management suggests a span of control around 8. By leveraging AI and fostering high agency, ElevenLabs builds a much flatter organization where leaders, including the co-founders, manage over 15 direct reports each. This structure increases speed and reduces bureaucracy.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

AI-First Companies Embed "Tech Leads" into Non-Technical Teams to Drive Automation

To maximize AI's impact, ElevenLabs places dedicated technical resources directly within non-technical departments like operations and talent acquisition. This embedded 'tech lead' is responsible for identifying and building automation, upskilling the team, and bridging the gap between business needs and technical capabilities.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

Voice Interfaces Elicit More Open-Ended and Detailed Feedback Than Traditional Web Forms

When ElevenLabs replaced a web form with a voice agent for lead capture, users were not only more likely to complete the process but also provided far more detailed, open-ended information about their use cases. This reveals a richer layer of customer intent that text-based forms often miss.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

A "Deployment Gap" Explains Why Consumer Voice AI Lags Years Behind Cutting-Edge Models

Despite incredible advances, everyday voice experiences (like on phones or in cars) feel dated. The lag isn't due to technology but a "deployment gap," where large companies are slow to integrate the latest models into consumer hardware and software, creating a disconnect between what's possible and what's available.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago

Voice Model Size Plateaus for Specific Tasks Like Audiobook Narration

Unlike LLMs, where performance often scales with size, specific voice AI applications appear to have an optimal parameter count. For tasks like audiobook narration, ElevenLabs believes it has found the size sweet spot, where making models larger yields diminishing returns on quality, suggesting different scaling laws for specialized AI.

The world of voice AI, with Mati Staniszewski of ElevenLabs thumbnail

The world of voice AI, with Mati Staniszewski of ElevenLabs

Cheeky Pint·15 hours ago