Skip to content

Plain Text Output

The --format text output produces clean plain text following the document’s reading order, with no formatting markup.

import edgeparse
text = edgeparse.convert("document.pdf", format="text")
Terminal window
edgeparse document.pdf -f text
  • Reading order — text follows natural reading sequence
  • Tables — tab-separated columns
  • Headings — preserved as standalone lines
  • Minimal noise — no formatting artifacts
Annual Report 2024
Revenue grew 23% year-over-year, driven by expansion in the enterprise segment.
Financial Summary
Quarter Revenue ($M) YoY Growth
Q1 2024 142.3 +18%
Q2 2024 156.7 +22%
Q3 2024 168.9 +25%
Q4 2024 179.2 +27%