- https://sail.usc.edu/
- Taiwan
- UTC -12:00
- https://www.linkedin.com/in/huangchougchou/
- https://scholar.google.com/citations?user=_d7pcs4AAAAJ&hl
- gstack Public
  Forked from garrytan/gstack
  Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
  TypeScript MIT License Updated Apr 4, 2026
- OpenHarness Public
  Forked from HKUDS/OpenHarness
  "OpenHarness: Open Agent Harness"
  Python MIT License Updated Apr 3, 2026
- cc-mini Public
  Forked from e10nMa2k/cc-mini
  Ultra-light Harness scaffolding for AI agents, a mini version of claude code
  Python Updated Apr 3, 2026
- pyannote-audio Public
  Forked from pyannote/pyannote-audio
  Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
  Jupyter Notebook MIT License Updated Mar 26, 2026
- academic-research-skills Public
  Forked from voidful/academic-skills
  TeX MIT License Updated Mar 19, 2026
- TorchCode Public
  Forked from duoan/TorchCode
  🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
- sail25-workshop-page Public
  Forked from algoseer/sail25-workshop-page
  Workshop page source
  HTML Updated Jan 15, 2026
- MeanVC Public
  Forked from ASLP-lab/MeanVC
  A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
  Python Apache License 2.0 Updated Jan 8, 2026
- sam-audio Public
  Forked from facebookresearch/sam-audio
  The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
  Python Other Updated Jan 5, 2026
- Do-You-Hear-What-I-Mean Public
  Forked from nerfies/nerfies.github.io
  Website for ICASSP 2026 Submission, Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems
  JavaScript Updated Dec 23, 2025
- VibeVoice Public
  Forked from microsoft/VibeVoice
  Open-Source Frontier Voice AI
  Python MIT License Updated Dec 5, 2025
- torchcodec Public
  Forked from meta-pytorch/torchcodec
  PyTorch media decoding and encoding
  Python BSD 3-Clause "New" or "Revised" License Updated Nov 11, 2025
- PaddleOCR Public
  Forked from PaddlePaddle/PaddleOCR
  Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
  Python Apache License 2.0 Updated Oct 19, 2025
- ml-switchboard-affect Public
  Forked from apple/ml-switchboard-affect
  Python Other Updated Sep 2, 2025
- higgs-audio Public
  Forked from boson-ai/higgs-audio
  Text-audio foundation model from Boson AI
  Python Apache License 2.0 Updated Jul 27, 2025
- ModernBERT Public
  Forked from AnswerDotAI/ModernBERT
  Bringing BERT into modernity via both architecture changes and scaling
  Python Apache License 2.0 Updated Jun 30, 2025
- SoulChat2.0 Public
  Forked from scutcyr/SoulChat2.0
  Psychological Counselor's Digital Twin Framework (心理咨询师数字孪生框架)
  Python Apache License 2.0 Updated Jun 10, 2025
- Awesome-Model-Merging-Methods-Theories-Applications Public
  Forked from EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications
  Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
  Updated Jun 6, 2025
- moshi Public
  Forked from kyutai-labs/moshi
  Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
  Python Apache License 2.0 Updated Jun 5, 2025
- speech-trident Public
  Forked from ga642381/speech-trident
  Awesome speech/audio LLMs, representation learning, and codec models
  Updated Jun 3, 2025
- ConspEmoLLM Public
  Forked from lzw108/ConspEmoLLM
  Conspiracy Theory Detection using an Emotion-Based Large Language Model
  Python MIT License Updated May 30, 2025
- chatterbox Public
  Forked from resemble-ai/chatterbox
  SoTA open-source TTS
  Python MIT License Updated May 29, 2025
- vox-profile-release Public
  Forked from tiantiaf0627/vox-profile-release
  Vox-Profile Benchmark
  Python Apache License 2.0 Updated May 23, 2025
- Meta-PerSER Public
  Forked from Jeffabcd/Meta-PerSER
  Code publication for Meta-PerSER
  Python Updated May 22, 2025
- group_affect Public
  Forked from sp-uhh/group_affect
  Jupyter Notebook MIT License Updated May 14, 2025
- child-adult-diarization Public
  Forked from usc-sail/child-adult-diarization
  public child-adult speaker diarization/classification model and codes
  Python Updated Apr 24, 2025