Speech recognition, synthesis, and audio intelligence.
OpenAI's multilingual speech recognition model. Accurate transcription across 99 languages.
Port of OpenAI Whisper to pure C/C++. CPU-first with optional GPU acceleration. Works on Mac, Linux,β¦
Suno's open-source text-to-audio model. Generates highly realistic speech, music, and sound effects.
Open-source text-to-speech with high-quality neural voices and voice cloning support.
Meta's open-source text-to-speech model. Generate natural voice from text prompts.