George0828Zhang

A complete academic research Skill suite. Supports Claude Code, ChatGPT / Codex CLI, and Gemini CLI.

TeX 26 6 Updated Apr 4, 2026

A lightweight wake word detection engine. Train custom, high-accuracy models with minimal effort.

Python 55 9 Updated Apr 6, 2026

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python 725 100 Updated Apr 26, 2024

A framework for efficient model inference with omni-modality models

Python 4,167 714 Updated Apr 8, 2026

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,681 58 Updated Mar 9, 2026

Convert English or transliterated words to katakana

Python 13 3 Updated Sep 1, 2018

A Python script that converts Romaji to Hiragana and/or Katakana

Python 7 1 Updated Feb 16, 2017

NanoGPT (124M) in 2 minutes

Python 5,070 702 Updated Mar 29, 2026

A fusion of a linear layer and a cross-entropy loss, written for PyTorch in Triton.

Python 75 6 Updated Aug 2, 2024

A nearly-live implementation of OpenAI's Whisper.

Python 3,948 542 Updated Mar 17, 2026
Python 58 5 Updated Dec 2, 2024

Faster Whisper transcription with CTranslate2

Python 22,011 1,783 Updated Nov 19, 2025

Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Python 110 10 Updated Mar 30, 2025

A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.

Python 36 4 Updated Feb 10, 2024

Collection of articles, books, videos and other things I found useful for those interested in the topic.

298 6 Updated Jul 29, 2020

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 1,256 101 Updated Jun 29, 2025
Jupyter Notebook 86 4 Updated Nov 3, 2025

Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra

Python 16 5 Updated Dec 10, 2024

Inference of resemble denoiser

Python 30 5 Updated Mar 11, 2024

Various speech datasets made available to the public

Jupyter Notebook 133 15 Updated Dec 13, 2024

[WIP] Scripts for fine-tuning Whisper

Python 221 30 Updated May 29, 2023

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 34,966 4,298 Updated Aug 6, 2024

Arduino RFID Library for MFRC522

C++ 3,000 1,496 Updated Jan 4, 2026

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 2,208 228 Updated Oct 29, 2025
TypeScript 71 24 Updated Dec 10, 2025
20 14 Updated Jun 19, 2023

Download books from pubu.com.tw without buying them

Python 7 3 Updated Aug 27, 2023

Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"

Python 442 46 Updated Feb 2, 2022

Google Research

Jupyter Notebook 37,674 8,380 Updated Apr 8, 2026