Build voice-first experiences with AI speech recognition, voice agents, dictation, and real-time translation.
The most popular tools in this category, ranked by media coverage and activity.
Your easy, private intelligent assistant for voice-controlled tasks.
A visual assistant for the blind and low vision community.
Meta's generative AI model for speech synthesis and editing.
Connect with volunteers for real-time visual assistance.
The most realistic and expressive AI voice.
ElevenLabs is a leading AI research company focused on speech synthesis. They offer advanced tools to generate incredibly realistic and expressive AI voices, making them a top choice for creators and developers. Their technology powers a wide range of applications, from audiobooks to game development.
Microsoft's platform for speech AI.
The language learning app that gets you speaking.
AI-powered translation, voice, and API.
Agentic AI-powered Contact Center & Communications Platform

Voice AI for clearer calls and smarter meetings.
Krisp is a Voice AI platform offering industry-leading noise cancellation for clearer calls and smarter meetings. It features an AI Note Taker to summarize meetings, Accent AI for real-time accent conversion, and solutions for call centers. Audio processing is done locally for enhanced privacy.
AI Notetaker, Transcription, and Insights.
AI voice generation platform, acquired by Meta and shut down.
Reads anything aloud with natural voices and AI.
Free real-time AI voice changer and soundboard for PC & Mac.
Enterprise Voice AI: STT, TTS & Agent APIs.
AI Audio & Video Translation

Improve English speaking
ELSA Speak is an AI-powered English coach that helps users improve their speaking skills and pronunciation through real-world conversations. It provides instant feedback and personalized learning paths. Improve your English speaking skills and pronounce English like an American.
AI models to transcribe and understand speech.
14 / 385