Skip to content
View dsh54054's full-sized avatar
  • 12:23 (UTC -12:00)

Block or report dsh54054

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 16 2 Updated Dec 30, 2025

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python 246 25 Updated Feb 25, 2026

[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation

Python 123 8 Updated Mar 5, 2023

The official repository of the Eesen project

C++ 835 339 Updated May 23, 2019

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,672 464 Updated Apr 20, 2025
Python 23 7 Updated Oct 17, 2024

A generative speech model for daily dialogue.

Python 16 2 Updated Aug 21, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,559 343 Updated Jun 21, 2025

Perceptual Quality Estimator for Speech

Python 4 3 Updated Jul 10, 2024

This package contains the original 2012 AlexNet code.

Cuda 2,862 372 Updated Mar 12, 2025

使用vllm加速cosyvoice2的推理

Jupyter Notebook 491 64 Updated Apr 26, 2025

List of speech synthesis papers.

1,070 123 Updated Jul 24, 2023

One command to start a streaming ASR server.

Python 12 2 Updated Oct 2, 2024

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,328 234 Updated Mar 9, 2026

faster inference

28 1 Updated Jan 20, 2025

Comfyui custom node for FunAudioLLM include CosyVoice2, SenseVoice and InspireMusic

Python 15 2 Updated Jan 9, 2026

Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice

Python 93 11 Updated Nov 27, 2024

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,048 175 Updated Jul 5, 2023

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,495 2,349 Updated Mar 16, 2026

Streaming Vocos

Python 30 5 Updated Jun 10, 2025
Jupyter Notebook 23 3 Updated Oct 30, 2024

Port of Funasr's Sense-voice model in C/C++

C 539 70 Updated Dec 19, 2025

Text Normalization & Inverse Text Normalization

Python 748 100 Updated Feb 27, 2026

ubuntu 系统下 GLM-4-Voice 部署经验分享

Python 18 Updated Oct 31, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,749 805 Updated Mar 25, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,542 6,175 Updated Feb 9, 2026

SOTA Open Source TTS

Python 29,223 2,461 Updated Apr 6, 2026

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,311 2,114 Updated Apr 4, 2026

GLM-4-Voice | 端到端中英语音对话模型

Python 3,170 279 Updated Dec 5, 2024
Next