vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python · Apache License 2.0 · Updated Mar 14, 2026
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python · BSD 3-Clause "New" or "Revised" License · Updated Jan 7, 2026
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Python · Apache License 2.0 · Updated Jan 5, 2026
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python · Apache License 2.0 · Updated Dec 26, 2025
incubator-brpc Public
Forked from apache/brpc
Industrial-grade RPC framework used throughout Baidu, with 1,000,000+ instances and thousands kinds of services. "brpc" means "better RPC".
C++ · Apache License 2.0 · Updated Jan 19, 2022
ics-pa Public
Forked from NJU-ProjectN/ics-pa
The wrapper repo for NJU ICS PA.
Shell · Updated May 15, 2021
x86-64-minimal-JIT-compiler-Cpp Public
Forked from sol-prog/x86-64-minimal-JIT-compiler-Cpp
Writing a minimal x86-64 JIT compiler in C++
C++ · GNU General Public License v3.0 · Updated Apr 28, 2018