Stars
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
ChatLM-Chinese-0.2B, a 0.2B-parameter Chinese conversational model. Open-sources the complete pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports downstream SFT fine-tuning, with a triple-based information extraction fine-tuning example.
🚀🚀 Train a 26M-parameter GPT entirely from scratch in just 2 hours!
🚀 Train a 26M-parameter vision multimodal VLM from scratch in just 1 hour!
Qwen3.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
A PyTorch & Keras implementation and demo of Fastformer.
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
📹 A more flexible framework that generates videos at any resolution and creates videos from images.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
woct0rdho / SageAttention
Forked from thu-ml/SageAttention. Fork of SageAttention for Windows wheels and easy installation.
A lightweight image/video generation inference framework.
SystemPanic / vllm-windows
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels).
A unified inference and post-training framework for accelerated video generation.
nalexand / Wan2.2
Forked from Wan-Video/Wan2.2. 8GB-optimized Wan2.2 T2V/I2V-A14B; long video generation, 30 sec+.
Wan: Open and Advanced Large-Scale Video Generative Models