Starred repositories

From scratch implementation of a vision language model in pure PyTorch

Jupyter Notebook 255 31 Updated May 6, 2024
Jupyter Notebook 711 57 Updated Dec 6, 2025

a repo for moe papers and systems aggregation

8 3 Updated Jan 13, 2022

Compares the results of several LoRA fine-tuning approaches on Qwen3-VL-4B-Instruct, Qwen's latest multimodal image-text model, and deploys it with LangChain + RAG + multi-agent.

Python 29 7 Updated Dec 14, 2025
Python 177 12 Updated Jul 22, 2024

[SIGIR'24] The official implementation code of MOELoRA.

Python 189 23 Updated Jul 22, 2024

MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)

Jupyter Notebook 1 Updated Jul 1, 2025

MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)

Jupyter Notebook 46 2 Updated Jul 1, 2025

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detecti…

Java 654 368 Updated Dec 19, 2025

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,667 85 Updated Mar 8, 2024

Data and software for building the ACL Anthology.

Python 690 379 Updated Mar 13, 2026

[AAAI 2026] Coverage by Chinese WeChat official accounts

Python 149 12 Updated Feb 5, 2026

Repo of "LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving"

10 1 Updated Mar 10, 2026

[CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding

Python 42 Updated Mar 3, 2026
Python 98 4 Updated Jul 24, 2025

New generation of CLIP with strong fine-grained discrimination capability, ICML 2025

Python 556 33 Updated Oct 27, 2025

[CVPR26] GeoMotion: Rethinking Motion Segmentation via Latent 4D Geometry

Python 21 2 Updated Mar 5, 2026

Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning

Python 41 1 Updated Mar 6, 2026

[CVPR2026] UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling

14 2 Updated Feb 25, 2026

[CVPR 2026] VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Python 66 5 Updated Mar 10, 2026

[CVPR 2026] SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

9 1 Updated Feb 23, 2026

This is the official repository for GeoDiv: Framework for Measuring Geographical Diversity in Text-to-Image Models (ICLR 2026)

Python 1 1 Updated Feb 26, 2026

When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters (CVPR 2026)

Python 3 1 Updated Feb 28, 2026

Plug-and-Play Diffusion Meets ADMM: Dual-Variable Coupling for Robust Medical Image Reconstruction

Python 8 1 Updated Feb 27, 2026

official repository of GeoDiff4D

12 2 Updated Mar 6, 2026

Open Ended Medical Reinforcement Learning

Python 34 4 Updated Feb 27, 2026
Python 2 2 Updated Mar 1, 2026

Official Implementation of "Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?" (CVPR 2026)

10 1 Updated Feb 25, 2026