Open Source: The PDF extraction engine for the AI era

The PDF Engine for RAG Pipelines

Feed your LLMs clean, structured data. EdgeParse extracts headings, tables, lists, and reading order from any PDF in milliseconds, with zero ML dependencies. Built in Rust.

pip install edgeparse
0 ML dependencies
Works with: Python, Node.js, Rust, CLI
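
To make "structured data for RAG" concrete, here is a minimal sketch of what a pipeline might do with extracted headings, tables, and lists once they arrive in reading order. It does not call EdgeParse: the `Element` and `Chunk` types and the `chunk_by_heading` helper are hypothetical names invented for illustration, and the real output schema may look different.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Element:
    # Hypothetical element shape, NOT EdgeParse's actual output schema.
    kind: str  # "heading", "paragraph", "table", or "list"
    text: str


@dataclass
class Chunk:
    heading: str
    body: List[str] = field(default_factory=list)


def chunk_by_heading(elements: List[Element]) -> List[Chunk]:
    """Group elements (assumed to be in reading order) under the nearest preceding heading."""
    chunks: List[Chunk] = []
    current = Chunk(heading="")
    for el in elements:
        if el.kind == "heading":
            # Close the previous chunk if it holds anything, then start a new one.
            if current.heading or current.body:
                chunks.append(current)
            current = Chunk(heading=el.text)
        else:
            current.body.append(el.text)
    if current.heading or current.body:
        chunks.append(current)
    return chunks


# Hand-written sample input standing in for extracted PDF content.
elements = [
    Element("heading", "Results"),
    Element("paragraph", "Latency dropped."),
    Element("table", "| run | ms |"),
    Element("heading", "Limits"),
    Element("list", "- scanned pages"),
]
chunks = chunk_by_heading(elements)
```

Each resulting chunk keeps its heading alongside its body, so an embedding or retrieval step can index sections rather than arbitrary character windows.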