Logo of the AI Catalog website
Logo of the AI tool "Deepgram"

Deepgram

Voice AI API

Deepgram provides voice AI APIs for speech to text, text to speech, and unified voice agents that operate in real time or batch for scalable applications.

Visit

Alternatives to Deepgram

Logo of the Voicebox AI tool

Open Source Voice Cloning Desktop App

Voicebox is a local-first, open-source voice cloning desktop application powered by the Qwen3-TTS model. It provides professional-grade tools like a timeline-based editor and Whisper-powered transcriptions to generate natural-sounding speech entirely on your machine without cloud subscriptions.

More AI Tools for Audio & Music

View All
Logo of the ElevenLabs AI tool

Voice AI Platform

ElevenLabs is a leading voice AI platform that provides realistic text-to-speech, voice cloning, and audio generation tools for creators, developers, and enterprises.

Logo of the AIVA AI tool

AI Music Generation Assistant

AIVA is an AI music composition assistant that creates emotional soundtracks for creative projects, offering deep customization and copyright ownership for professionals.

Logo of the Chatterbox AI tool

Open-Source Text-to-Speech Models

Chatterbox is a collection of high-performance, open-source text-to-speech models optimized for low-latency voice agents and creative narration. The platform features native support for paralinguistic tags like coughing and laughing to produce highly realistic and expressive audio output.

More AI Tools for Infrastructure & Cloud

View All
Logo of the Wiro AI AI tool

AI APIs for Developers

Wiro AI provides a unified API platform giving developers instant access to hundreds of AI models for image, video, and audio generation without managing infrastructure.

Logo of the Baseten AI tool

AI Inference Platform

Baseten is a highly scalable inference platform that allows companies to deploy, serve, and scale open-source and custom AI models with high performance and reliability.

Discover Other Tools

View All
Logo of the Vibe Typer AI tool

AI Voice Typing

Vibe Typer is an AI-powered voice typing tool for Windows and Linux that transcribes speech to text and refines it using AI for clear, professional writing in any app.

Logo of the Wispr Flow AI tool

Effortless Voice Dictation

Wispr Flow is a voice dictation tool that uses AI to convert speech into clear, formatted text instantly across any application, adapting to your style and auto-editing as you speak.