Audio foundation model excelling in audio understanding
Large Audio Language Model built for natural interactions
Audio Plugin for Audio to MIDI transcription using deep learning
Official Python inference and LoRA trainer package
Audiocraft is a library for audio processing and generation
Audio player that can play common audio formats
A lightning fast audio upsampler
Python Audio Analysis Library: Feature Extraction, Classification
HLS.js is a JavaScript library that plays HLS in browsers
A Family of Open Sourced Music Foundation Models
Simple and Fast Multimedia Library
A powerhouse of audio functionality for macOS, iOS, and tvOS
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A tweak to enhance Spotify experience
Multilingual speech recognition and audio understanding model
Open-source multi-speaker long-form text-to-speech model
Code for openai.fm, a demo for the OpenAI Speech API
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Extract audio and video content and organize it into a Markdown note
Stable diffusion for real-time music generation (web app)
The missing YouTube Music macOS app
Oboe is a C++ library that makes it easy to build high-performance
s&box is a modern game engine, built on Valve's Source 2
A nearly-live implementation of OpenAI's Whisper
Synchronized Translation for Videos