ag027592

🎯

Focusing

Huang-Cheng Chou ag027592

🎯

Focusing

I gained my Ph.D. degree in the Department of Electrical Engineering at National Tsing University, Taiwan. My research interest is Speech Emotion Recognition.

16 followers · 23 following

https://sail.usc.edu/
Taiwan
15:10 (UTC -12:00)
https://www.linkedin.com/in/huangchougchou/
https://scholar.google.com/citations?user=_d7pcs4AAAAJ&hl

singaporetrip Public template

JavaScript Updated Apr 6, 2026
gstack Public
Forked from garrytan/gstack

Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA

TypeScript MIT License Updated Apr 4, 2026
OpenHarness Public
Forked from HKUDS/OpenHarness

"OpenHarness: Open Agent Harness"

Python MIT License Updated Apr 3, 2026
cc-mini Public
Forked from e10nMa2k/cc-mini

Ultra-light Harness scaffolding for AI agents, a mini version of claude code

Python Updated Apr 3, 2026
pyannote-audio Public
Forked from pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook MIT License Updated Mar 26, 2026
academic-research-skills Public
Forked from voidful/academic-skills

TeX MIT License Updated Mar 19, 2026
jhcodec Public
Forked from jhcodec843/jhcodec

Python MIT License Updated Mar 9, 2026
TorchCode Public
Forked from duoan/TorchCode

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 1 Updated Mar 7, 2026
sail25-workshop-page Public
Forked from algoseer/sail25-workshop-page

Workshop page source

HTML Updated Jan 15, 2026
MeanVC Public
Forked from ASLP-lab/MeanVC

A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows

Python Apache License 2.0 Updated Jan 8, 2026
sam-audio Public
Forked from facebookresearch/sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python Other Updated Jan 5, 2026
Do-You-Hear-What-I-Mean Public
Forked from nerfies/nerfies.github.io

Website for ICASSP 2026 Submission, Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems

JavaScript Updated Dec 23, 2025
VibeVoice Public
Forked from microsoft/VibeVoice

Open-Source Frontier Voice AI

Python MIT License Updated Dec 5, 2025
torchcodec Public
Forked from meta-pytorch/torchcodec

PyTorch media decoding and encoding

Python BSD 3-Clause "New" or "Revised" License Updated Nov 11, 2025
PaddleOCR Public
Forked from PaddlePaddle/PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python Apache License 2.0 Updated Oct 19, 2025
ml-switchboard-affect Public
Forked from apple/ml-switchboard-affect

Python Other Updated Sep 2, 2025
LLMEval-3 Public
Forked from llmeval/LLMEval-Fair

中文大语言模型评测第三期

Updated Aug 12, 2025
higgs-audio Public
Forked from boson-ai/higgs-audio

Text-audio foundation model from Boson AI

Python Apache License 2.0 Updated Jul 27, 2025
ModernBERT Public
Forked from AnswerDotAI/ModernBERT

Bringing BERT into modernity via both architecture changes and scaling

Python Apache License 2.0 Updated Jun 30, 2025
SoulChat2.0 Public
Forked from scutcyr/SoulChat2.0

Psychological Counselor's Digital Twin Framework（心理咨询师数字孪生框架）

Python Apache License 2.0 Updated Jun 10, 2025
Awesome-Model-Merging-Methods-Theories-Applications Public
Forked from EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

Updated Jun 6, 2025
moshi Public
Forked from kyutai-labs/moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python Apache License 2.0 Updated Jun 5, 2025
speech-trident Public
Forked from ga642381/speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

Updated Jun 3, 2025
ConspEmoLLM Public
Forked from lzw108/ConspEmoLLM

Conspiracy Theory Detection using an Emotion-Based Large Language Model

Python MIT License Updated May 30, 2025
chatterbox Public
Forked from resemble-ai/chatterbox

SoTA open-source TTS

Python MIT License Updated May 29, 2025
vox-profile-release Public
Forked from tiantiaf0627/vox-profile-release

Vox-Profile Benchmark

Python Apache License 2.0 Updated May 23, 2025
Meta-PerSER Public
Forked from Jeffabcd/Meta-PerSER

Code publication for Meta-PerSER

Python Updated May 22, 2025
group_affect Public
Forked from sp-uhh/group_affect

Jupyter Notebook MIT License Updated May 14, 2025
emotionalTTS-labeling-instruction Public

HTML 1 Updated May 8, 2025
child-adult-diarization Public
Forked from usc-sail/child-adult-diarization

public child-adult speaker diarization/classification model and codes

Python Updated Apr 24, 2025

Huang-Cheng Chou ag027592

singaporetrip Public template

Uh oh!

gstack Public

Uh oh!

OpenHarness Public

Uh oh!

cc-mini Public

Uh oh!

pyannote-audio Public

Uh oh!

academic-research-skills Public

Uh oh!

jhcodec Public

Uh oh!

TorchCode Public

Uh oh!

sail25-workshop-page Public

Uh oh!

MeanVC Public

Uh oh!

sam-audio Public

Uh oh!

Do-You-Hear-What-I-Mean Public

Uh oh!

VibeVoice Public

Uh oh!

torchcodec Public

Uh oh!

PaddleOCR Public

Uh oh!

ml-switchboard-affect Public

Uh oh!

LLMEval-3 Public

Uh oh!

higgs-audio Public

Uh oh!

ModernBERT Public

Uh oh!

SoulChat2.0 Public

Uh oh!

Awesome-Model-Merging-Methods-Theories-Applications Public

Uh oh!

moshi Public

Uh oh!

speech-trident Public

Uh oh!

ConspEmoLLM Public

Uh oh!

chatterbox Public

Uh oh!

vox-profile-release Public

Uh oh!

Meta-PerSER Public

Uh oh!

group_affect Public

Uh oh!

emotionalTTS-labeling-instruction Public

Uh oh!

child-adult-diarization Public

Uh oh!