Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
Software synthesizer based on the SoundFont 2 specifications
The open-source voice synthesis studio powered by Qwen3-TTS
Sonic Pi is your free code-based music creation and performance tool
A multi-system chiptune tracker compatible with DefleMask modules
Functional programming language for signal processing
Collaborative programmable music
Open speech-to-speech models and pipelines by Hugging Face
Controllable & emotion-expressive zero-shot TTS
Free, open-source speech synthesizer for Russian and other languages
Offline text-to-speech synthesis for Python
Capable of understanding text, audio, vision, and video
Open Source Speech Language Model
Stable Diffusion for real-time music generation (web app)
Translate a video from one language to another and embed dubbing
Transforming Multimodal Content into Captivating Multilingual Audio
Industrial-level controllable zero-shot text-to-speech system
Framework for building real-time voice and multimodal AI agents
Flash + AIR sound effects generator. Based on Sfxr.
A Systematic Framework for Interactive World Modeling
Swift audio synthesis, processing, & analysis platform