Deepgram

Voice AI API

Deepgram provides voice AI APIs for speech to text, text to speech, and unified voice agents that operate in real time or batch for scalable applications.

Visit

Alternatives to Deepgram

Resemble AI

Generative Voice AI and Deepfake Detection

Resemble AI is an enterprise-grade platform that enables users to create highly expressive synthetic voices and detect sophisticated audio deepfakes. It provides advanced tools for voice cloning, real-time speech synthesis, and verifiable watermarking to ensure digital trust and security.

🎵 Audio & Music ⚖️ Legal & Compliance ☁️ Infrastructure & Cloud

Eachlabs

AI Model Marketplace and Workflow Engine

Eachlabs is an AI model marketplace and workflow engine that enables developers to integrate over 300 image, video, and voice models via a single API, featuring a drag-and-drop builder for rapid app development.

📚 Libraries ☁️ Infrastructure & Cloud 🎨 Image Generation 🎬 Video & Animation 🎵 Audio & Music

Auphonic

AI Sound Engineer

Auphonic is an automated audio post production service that improves sound quality through leveling, noise reduction, filtering, speech to text, and workflow automation.

🎵 Audio & Music ☁️ Infrastructure & Cloud

Voicebox

Open Source Voice Cloning Desktop App

Voicebox is a local-first, open-source voice cloning desktop application powered by the Qwen3-TTS model. It provides professional-grade tools like a timeline-based editor and Whisper-powered transcriptions to generate natural-sounding speech entirely on your machine without cloud subscriptions.

🎵 Audio & Music 🔓 Open Source

Loop Text to Speech

AI Voice Assistant and Smart Notetaker

Loop Text to Speech is an AI-powered voice assistant that provides instant spoken answers, web searches, and cross-language translations. It features specialized models for capturing notes, summarizing content, and practicing speech to streamline workflows for students and professionals.

📱 Mobile Apps 🎵 Audio & Music ⚡ Productivity 🔌 Built-in AI

Daily

Voice, Video and AI Infrastructure

Daily provides real-time voice, video, and AI infrastructure for developers to build ultra-low-latency conversational AI agents and multimodal experiences.

☁️ Infrastructure & Cloud 🎬 Video & Animation

More AI Tools for Audio & Music

View All

ElevenLabs

Voice AI Platform

ElevenLabs is a leading voice AI platform that provides realistic text-to-speech, voice cloning, and audio generation tools for creators, developers, and enterprises.

🎵 Audio & Music

Illuminate

AI Audio Discussion Generator

Illuminate transforms research papers and complex content into engaging, AI-generated audio discussions adapted to your learning preferences. This experimental tool explores new ways to foster learning and is currently optimized for computer science topics.

🎵 Audio & Music 🔬 Research & Science 🎓 Education 🧪 Fun & Experiments

AIVA

AI Music Generation Assistant

AIVA is an AI music composition assistant that creates emotional soundtracks for creative projects, offering deep customization and copyright ownership for professionals.

🎵 Audio & Music

Descript

All-in-One Video & Podcast Editor

Descript is an all-in-one editor that makes editing audio and video as easy as editing a text doc, featuring AI voice cloning, transcription, and studio sound.

🎬 Video & Animation 🎵 Audio & Music

Chatterbox

Open-Source Text-to-Speech Models

Chatterbox is a collection of high-performance, open-source text-to-speech models optimized for low-latency voice agents and creative narration. The platform features native support for paralinguistic tags like coughing and laughing to produce highly realistic and expressive audio output.

🔓 Open Source 🎵 Audio & Music 📚 Libraries

MusicFX

AI Music Creation

MusicFX is an experimental AI tool from Google Labs that allows users to generate custom music loops and beats using text prompts.

🎵 Audio & Music 🧪 Fun & Experiments

More AI Tools for Infrastructure & Cloud

View All

Mux

Video API for developers

Mux is a developer-first video API that simplifies building video streaming and AI workflows, offering features like auto-captioning and content understanding.

☁️ Infrastructure & Cloud 🎬 Video & Animation

RAGFlow

Open-Source RAG Engine And Agent Platform

RAGFlow is an open-source Retrieval-Augmented Generation engine designed to build a superior context layer for enterprise AI agents. It features high-precision hybrid search and visual workflows for orchestrating unified AI agents seamlessly.

☁️ Infrastructure & Cloud 🦾 AI Agents 🔓 Open Source

Wiro AI

AI APIs for Developers

Wiro AI provides a unified API platform giving developers instant access to hundreds of AI models for image, video, and audio generation without managing infrastructure.

☁️ Infrastructure & Cloud

Lambda

AI Compute Cloud

Lambda provides high-performance cloud computing infrastructure and workstations specifically designed and optimized for deep learning training and inference.

☁️ Infrastructure & Cloud

Baseten

AI Inference Platform

Baseten is a highly scalable inference platform that allows companies to deploy, serve, and scale open-source and custom AI models with high performance and reliability.

☁️ Infrastructure & Cloud

SambaNova

Full-Stack AI Platform

SambaNova Systems offers a full-stack AI platform, including high-performance hardware and software, optimized for running large AI models efficiently.

☁️ Infrastructure & Cloud 📟 Hardware

Discover Other Tools

View All

Vibe Typer

AI Voice Typing

Vibe Typer is an AI-powered voice typing tool for Windows and Linux that transcribes speech to text and refines it using AI for clear, professional writing in any app.

✍️ Writing & Copy

Wispr Flow

Effortless Voice Dictation

Wispr Flow is a voice dictation tool that uses AI to convert speech into clear, formatted text instantly across any application, adapting to your style and auto-editing as you speak.

✍️ Writing & Copy

Plivo

Voice AI Agents

Plivo is a communications platform that enables businesses to build AI agents for voice, SMS, and WhatsApp to automate customer engagement and support.

🎧 Customer Support 📈 Marketing & SEO