Skip to content
View wangmengzhi's full-sized avatar

Block or report wangmengzhi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Mini_RWKV_V7_LM

Python 60 5 Updated Jan 26, 2026

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 454 39 Updated Nov 2, 2025

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,679 189 Updated Apr 20, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 41,072 4,966 Updated Feb 6, 2026

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 6,701 736 Updated Feb 4, 2026

Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.

2,032 106 Updated Mar 2, 2026

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++ 14,523 2,234 Updated Mar 12, 2026

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,099 1,866 Updated Mar 7, 2026

AndroidStudio中文插件(官方修改版本)

1,600 62 Updated Jan 24, 2026

Longformer: The Long-Document Transformer

Python 2,190 289 Updated Feb 8, 2023

A pytorch &keras implementation and demo of Fastformer.

Jupyter Notebook 192 29 Updated Sep 22, 2022

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

Python 1,970 189 Updated Jan 30, 2026
Python 595 41 Updated Feb 13, 2026
Python 4,620 370 Updated Feb 13, 2026

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 201 16 Updated Jul 29, 2025

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 759 70 Updated Mar 6, 2026
Python 10,474 695 Updated Feb 9, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,597 298 Updated Mar 12, 2026

Model compression for ONNX

Python 99 9 Updated Mar 1, 2026

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,959 147 Updated Mar 12, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 105,725 12,138 Updated Mar 12, 2026

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,619 200 Updated Jul 12, 2024

Fork of SageAttention for Windows wheels and easy installation

Cuda 739 59 Updated Feb 15, 2026

Light Image Video Generation Inference Framework

Python 2,055 165 Updated Mar 12, 2026

Official repository for LTX-Video

Python 9,516 899 Updated Jan 5, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)

Python 342 35 Updated Mar 12, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,150 279 Updated Mar 12, 2026

8GB optimized WAN2.2 T2V/I2V-A14B, long video generation 30 sec+

Python 9 2 Updated Oct 25, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,613 1,765 Updated Mar 5, 2026
Next