Crawl a website starting from a URL, find relevant pages
AI-ready web crawler that extracts and structures website content
CLI tool to extract (meta)data from PDF and manipulate PDF files
Turn entire websites into LLM-ready markdown or structured data
Lightweight library for scraping web-sites with LLMs
ExtractThinker is a Document Intelligence library for LLMs
A high-quality tool for convert PDF to Markdown and JSON
ContextGem: Effortless LLM extraction from documents
PDF Parser for AI-ready data. Automate PDF accessibility
A Python tool to help extracting information from structured PDFs
OSINT reconnaissance tool for IP, domain, email, and username lookups
A distributed job server
AI Browser Agent is an advanced Browser AI tool
Extract internal monitoring data from application logs
Skill for installing full networking capabilities for Claude Code
A library for audio and music analysis, feature extraction
To extract main article from given URL with Node.js
All-in-one Python web reconnaissance tool for fast target analysis
Python tool for crawling and extracting structured data from news site
Python & command-line tool to gather text on the Web
Tools to build web AI agents that can authenticate
Automate browser-based workflows with LLMs and Computer Vision
Desktop tool for collecting and exporting Xiaohongshu post data
Open-source platform for extracting structured data from documents