Masks sensitive data and secrets before they reach AI
LLM101n: Let's build a Storyteller
PyTorch code and models for V-JEPA 2 self-supervised learning from video
PyTorch code and models for V-JEPA self-supervised learning from video
Fast UTF-8 codepoint sets for Zig
Official codebase for I-JEPA
PyTorch implementation of MAE
World's first open source data quality & data preparation project
Robust BERT-based model for English with improved MLM training
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks