14-stage Fusion Pipeline for LLM token compression
Persistent context and multi-instance coordination
Real-time Claude Code usage monitor with predictions and warnings
OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving
Context management for Claude Code. Hooks maintain state via ledgers
Real-time multi-AI collaboration: Claude, Codex & Gemini
Privacy enhanced BitTorrent client with P2P content discovery
An Open Source text-to-speech system built by inverting Whisper
Build your own Cowork, AI Scientist and other SoTA Agents
Visual Causal Flow
The highest-scoring AI memory system ever benchmarked
Uncertainty Quantification for Language Models, is a Python package
RAG Search API
Fast backend for long-term AI user memory via structured profiles
Internet-scale Neural Networks
AGiXT is a dynamic AI Automation Platform
Controllable & emotion-expressive zero-shot TTS
Performance-optimized AI inference on your GPUs
"VideoRAG: Chat with Your Videos
Spark-TTS Inference Code
Convert codebases into structured prompts optimized for LLM analysis
Faster and easier training and deployments
95% token savings. 155x faster queries. 16 languages
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A course of learning LLM inference serving on Apple Silicon