Audio on FOSS Engineer

Voicebox - Local AI Voice Studio for Speech, Dictation, and Agents

Sat, 06 Jun 2026 10:50:00 +0200

Voicebox is a local AI voice studio for cloning voices, generating speech, dictating into apps, transcribing captures, adding effects, composing stories, and giving AI agents voices through REST and MCP. It ships a Tauri desktop app, FastAPI backend, web UI, Docker setup, and multiple local TTS/STT engines.

Voicebox

Sat, 06 Jun 2026 00:00:00 +0000

Voicebox is a local AI voice studio with Docker support, REST endpoints, and MCP tools for giving agents a voice.

Chatterbox - Local Open-Source Text-to-Speech by Resemble AI

Fri, 05 Jun 2026 11:45:00 +0200

Chatterbox is Resemble AI’s MIT-licensed open-source text-to-speech toolkit. It ships Python APIs, Gradio demos, English and multilingual models, voice conversion, Turbo inference, paralinguistic tags, and built-in Perth watermarking. It is not a Docker-first self-hosted app; it is a local ML package where GPU access matters.