Multi-Agent Research System

A sophisticated multi-agent research system that uses Instructor for structured LLM outputs and Exa.ai for neural web search.

Features

🤖 Multi-Agent Architecture: Lead researcher decomposes queries and coordinates parallel subagents
🔍 Neural Search: Integrates Exa.ai for semantic web search
📊 Structured Outputs: Uses Instructor to get typed, validated responses from LLMs
💾 Persistent Memory: Stores research plans and intermediate results
🔄 Iterative Refinement: Subagents evaluate and refine their searches
⚡ Parallel Execution: Multiple agents work simultaneously for faster results

Quick Start

Installation

# Clone the repository
git clone <repository-url>
cd researcher

# Install with uv (recommended)
pip install uv
uv sync

# Or with pip
pip install -e .

Environment Setup

# Required for LLM decomposition
export ANTHROPIC_API_KEY="your-anthropic-key"

# Required for web search (optional - will use mock data without it)
export EXA_API_KEY="your-exa-key"

Run Examples

# Simple research query
uv run python examples/simple_research.py

# Compare AI frameworks
uv run python examples/comparative_research.py

# Find recent AI developments
uv run python examples/time_bounded_research.py

# Academic paper research
uv run python examples/academic_research.py

# Demo with mock data (no API keys needed)
uv run python examples/demo_with_mocks.py

Architecture

The system implements a hierarchical multi-agent architecture:

User Query → Lead Researcher → Query Decomposition (via Instructor)
                ↓
        Parallel Subagents → Iterative Search → Result Synthesis
                ↓
        Memory Storage → Citation Addition → Final Report

Key Components

LeadResearcherV2: Orchestrates the research process using Instructor for structured task decomposition
ResearchSubagent: Performs iterative searches with self-evaluation
Exa.ai Integration: High-quality neural search for web content
Memory System: SQLite-backed persistence for research data
Tool Registry: Pluggable architecture for adding new data sources

Usage

import asyncio
from src.researcher.agents.lead_v2 import LeadResearcherV2, LeadResearcherConfig
from src.researcher.agents.subagent import ResearchSubagent
from src.researcher.agents.base import AgentContext
from src.researcher.memory.base import InMemoryStorage, ResearchMemory
from src.researcher.tools.base import ToolRegistry
from src.researcher.tools.exa_search import ExaSearchTool

async def research():
    # Setup
    memory = ResearchMemory(InMemoryStorage())
    registry = ToolRegistry()
    registry.register(ExaSearchTool(), category="search")
    
    # Configure
    config = LeadResearcherConfig(
        max_subagents=3,
        parallel_execution=True
    )
    
    # Create lead researcher
    lead = LeadResearcherV2(
        memory=memory,
        tool_registry=registry,
        subagent_class=ResearchSubagent,
        config=config
    )
    
    # Run research
    context = AgentContext(
        query="What are the latest advances in AI agents?",
        objective="Comprehensive overview of AI agent technology"
    )
    
    result = await lead.run(context)
    print(result.output)

asyncio.run(research())

How It Works

Query Decomposition: The lead researcher uses Instructor to break down your query into structured subtasks
Parallel Research: Multiple subagents work simultaneously on different aspects
Iterative Search: Each subagent evaluates results and refines searches
Synthesis: Results are combined into a comprehensive report with citations

Documentation

See the docs/ directory for comprehensive documentation including:

Architecture details
API reference
Best practices
Troubleshooting guide

Testing

# Run all tests
uv run pytest

# Run with coverage
uv run pytest --cov=src

# Run specific test file
uv run pytest tests/unit/test_lead_researcher_v2.py -v

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

License

MIT License - see LICENSE for details.

Acknowledgments

Built with Instructor for reliable structured outputs
Powered by Exa.ai for neural search capabilities
Inspired by Anthropic's multi-agent research architecture

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
docs		docs
examples		examples
src/researcher		src/researcher
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_IMPLEMENTATION.md		README_IMPLEMENTATION.md
example.py		example.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Agent Research System

Features

Quick Start

Installation

Environment Setup

Run Examples

Architecture

Key Components

Usage

How It Works

Documentation

Testing

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent Research System

Features

Quick Start

Installation

Environment Setup

Run Examples

Architecture

Key Components

Usage

How It Works

Documentation

Testing

Contributing

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages