dsh54054

Follow

Vae dsh54054

Follow

2 followers · 2 following

12:23 (UTC -12:00)

Stars

yu-haoyuan / CosyVoice

Forked from FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 16 2 Updated Dec 30, 2025

BytedanceSpeech / seed-tts-eval

Python 1,551 143 Updated Jun 14, 2024

xingchensong / FlashCosyVoice

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python 246 25 Updated Feb 25, 2026

yxuansu / Contrastive_Search_Is_What_You_Need

[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation

Python 123 8 Updated Mar 5, 2023

srvk / eesen

The official repository of the Eesen project

C++ 835 339 Updated May 23, 2019

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,672 464 Updated Apr 20, 2025

pengzhendong / audio-pipeline

Python 23 7 Updated Oct 17, 2024

fengyizhu / ChatTTS-VLLM

A generative speech model for daily dialogue.

Python 16 2 Updated Aug 21, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,559 343 Updated Jun 21, 2025

pengzhendong / tts-pqe

Perceptual Quality Estimator for Speech

Python 4 3 Updated Jul 10, 2024

computerhistory / AlexNet-Source-Code

This package contains the original 2012 AlexNet code.

Cuda 2,862 372 Updated Mar 12, 2025

qi-hua / async_cosyvoice

使用vllm加速cosyvoice2的推理

Jupyter Notebook 491 64 Updated Apr 26, 2025

wenet-e2e / speech-synthesis-paper

List of speech synthesis papers.

1,070 123 Updated Jul 24, 2023

pengzhendong / streaming-asr

One command to start a streaming ASR server.

Python 12 2 Updated Oct 2, 2024

k2-fsa / k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,328 234 Updated Mar 9, 2026

Mddct / cosyvoice2-flow-optimized

faster inference

28 1 Updated Jan 20, 2025

zhilengjun / ComfyUI-FunAudioLLM_V2

Forked from SpenserCai/ComfyUI-FunAudioLLM

Comfyui custom node for FunAudioLLM include CosyVoice2, SenseVoice and InspireMusic

Python 15 2 Updated Jan 9, 2026

SpenserCai / ComfyUI-FunAudioLLM

Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice

Python 93 11 Updated Nov 27, 2024

aliutkus / speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,048 175 Updated Jul 5, 2023

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,495 2,349 Updated Mar 16, 2026

pengzhendong / streaming-vocos

Streaming Vocos

Python 30 5 Updated Jun 10, 2025

pengzhendong / streaming-ChatTTS

Jupyter Notebook 23 3 Updated Oct 30, 2024

lovemefan / SenseVoice.cpp

Port of Funasr's Sense-voice model in C/C++

C 539 70 Updated Dec 19, 2025

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 748 100 Updated Feb 27, 2026

peilongchencc / My-GLM-4-Voice

Forked from zai-org/GLM-4-Voice

ubuntu 系统下 GLM-4-Voice 部署经验分享

Python 18 Updated Oct 31, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,749 805 Updated Mar 25, 2026

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,542 6,175 Updated Feb 9, 2026

fishaudio / fish-speech

SOTA Open Source TTS

Python 29,223 2,461 Updated Apr 6, 2026

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,311 2,114 Updated Apr 4, 2026

zai-org / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 3,170 279 Updated Dec 5, 2024