Stars
The best OSS video generation models, created by Genmo
fast-stable-diffusion + DreamBooth
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Using Low-rank adaptation to quickly fine-tune diffusion models.
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
Stable Diffusion implemented from scratch in PyTorch
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
[EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!
CUDA accelerated rasterization of gaussian splatting
CoreNet: A library for training deep neural networks
You like pytorch? You like micrograd? You love tinygrad! ❤️
Experiments for efforts to train a new and improved t5
Open-Sora: Democratizing Efficient Video Production for All
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Lifting ControlNet for Generalized Depth Conditioning


