Plain Text Output
Overview
Section titled “Overview”The --format text output produces clean plain text following the document’s reading order, with no formatting markup.
import edgeparse
text = edgeparse.convert("document.pdf", format="text")edgeparse document.pdf -f textFeatures
Section titled “Features”- Reading order — text follows natural reading sequence
- Tables — tab-separated columns
- Headings — preserved as standalone lines
- Minimal noise — no formatting artifacts
Example Output
Section titled “Example Output”Annual Report 2024
Revenue grew 23% year-over-year, driven by expansion in the enterprise segment.
Financial Summary
Quarter Revenue ($M) YoY GrowthQ1 2024 142.3 +18%Q2 2024 156.7 +22%Q3 2024 168.9 +25%Q4 2024 179.2 +27%