Convert PDF tables to structured data

Rows, columns & totals Clean JSON output DB or BI tool ready
The Problem

Data in PDF tables is inaccessible to your systems

Financial reports, sales data, regulatory filings, and product catalogs are frequently distributed as PDFs with tables. Copying data manually into spreadsheets is slow and error-prone. Existing PDF parsers return garbled text, not structured rows.

For teams that need to run analysis or feed data into BI tools, this means hours of cleanup work before any data is usable.

Copy-pasting table data from PDFs takes hours per report

Basic PDF-to-text tools return scrambled, unusable output

BI tools can't connect to raw PDFs without manual export

Dozens of PDFs per month means recurring manual work

The Solution

Table rows returned as clean JSON arrays

Define the columns you want. Submit a PDF containing a table. Get back a typed rows array with one object per row, column-keyed and ready to insert.

01

Define table columns

Tell Parselyze the column names and data types expected in the table.

02

Submit the PDF

Any PDF format. Single or multi-page. Works even when tables span multiple pages.

03

Get rows as JSON

Each row is a JSON object with column-keyed fields. Totals extracted separately.

Real Example

Extraction output for a sales report table

Sample PDF table — Q3 2025 Sales Report
table_result.json
{
  "table_title": "Q3 2025 Sales Report",
  "columns": ["Product", "Units Sold", "Revenue (USD)"],
  "rows": [
    { "Product": "Widget A", "Units Sold": 120, "Revenue": 48000 },
    { "Product": "Widget B", "Units Sold": 85, "Revenue": 21250 },
    { "Product": "Widget C", "Units Sold": 200, "Revenue": 60000 }
  ],
  "totals": {
    "Units Sold": 405,
    "Revenue": 129250
  }
}

Typical workflows

Report Ingestion

Extract tables from financial or operational reports and load them directly into a database.

Data Pipelines

Feed extracted table rows into BI tools like Tableau, Power BI, or Metabase.

Excel Replacement

Stop manually copying table data from PDFs. Parse once, export anywhere.

Regulatory Filings

Extract structured data from regulatory PDFs for analysis or compliance tracking.

How to Integrate

From PDF to database in one step

Submit the PDF via the API, receive the rows array in the response, and insert directly into your database or stream to your BI tool. No intermediate cleanup required.

Ready to integrate?

SDK examples, REST API reference, webhook handler, and cURL samples are all on the developer page.

Developer integration guide

Unlock the data trapped in your PDFs

50 pages/month free · No credit card required