PDF Invoice to JSON

Convert any PDF invoice to structured JSON in seconds. Vendor, amounts, due date, and line items — any format, single API call.

Native PDF & scanned Any supplier layout Free plan available
Supported Formats

One API — every PDF invoice format

Whether you receive digitally generated PDFs from SaaS platforms or scanned paper invoices, the same API call handles them all.

Native PDF

Digitally generated PDFs from any invoicing software, ERPs, or billing platforms.

Scanned PDF

Paper invoices scanned to PDF, including low-res or skewed documents.

JPEG / PNG / WEBP

Invoice photos from scanner apps or mobile captures.

Multi-page

Invoices spanning several pages with line-item tables split across pages.

Output Example

What you get back

POST your invoice PDF. Within seconds, the API returns a complete JSON object with every field extracted from the document:

invoice_numberinvoice_datedue_datevendor_namevendor_addressbill_tocurrencysubtotaltax_amounttotal_amountpayment_termsline_items[]

Need custom fields? Define your own in the Template Builder — payment terms, IBAN, PO number, cost center, or any invoice-specific value.

invoice.json
{
  "invoice_number": "INV-2024-00891",
  "invoice_date":   "2024-07-15",
  "due_date":       "2024-08-14",
  "vendor_name":    "Globex Supplies Ltd",
  "currency":       "EUR",
  "subtotal":       840.00,
  "tax_amount":     168.00,
  "total_amount":   1008.00,
  "payment_terms":  "Net 30",
  "line_items": [
    {
      "description": "Office supplies — Q3",
      "qty":         3,
      "unit_price":  280.00,
      "total":       840.00
    }
  ]
}
Quick Start

Convert a PDF invoice to JSON in 3 steps

1

Create an invoice template

In the Parselyze dashboard, define the fields you want extracted: vendor_name, invoice_number, total_amount, line_items, etc.

2

POST your PDF invoice

Use the REST API (any language) or the Node.js SDK. Pass the invoice file path, URL, or binary buffer.

3

Receive structured JSON

The API response contains fully typed extracted fields ready for your database, accounting system, or automation workflow.

Frequently asked questions

All about converting PDF invoices to JSON.

How do I convert a PDF invoice to JSON?

Use the Parselyze REST API or Node.js SDK. Create an invoice template in the dashboard, then POST your invoice PDF to the documents endpoint. The API returns structured JSON containing all extracted fields within seconds.

Which PDF invoice formats does Parselyze support?

Parselyze converts native PDF invoices, scanned invoice PDFs, and invoice images (JPG, PNG, WEBP, TIFF) to JSON. Multi-page invoices and documents from any supplier layout are supported.

Is custom training required to parse my invoice PDFs?

No. Parselyze uses zero-shot AI extraction that adapts to any invoice layout. You define the field names once in the Template Builder. No training data or machine learning expertise is needed.

What fields are included in the JSON output?

The JSON response includes invoice_number, vendor_name, vendor_address, invoice_date, due_date, currency, subtotal, tax_amount, total_amount, payment_terms, and a line_items array with description, qty, unit_price, and total per row.

How fast is the PDF invoice to JSON conversion?

Synchronous extraction for standard single-page PDF invoices typically completes in under 3 seconds. For large multi-page files or bulk processing, the async job endpoint is recommended, with results delivered via webhook.

Start converting PDF invoices to JSON today

50 pages/month free · No credit card required