PDF Invoice to JSON
Convert any PDF invoice to structured JSON in seconds. Vendor, amounts, due date, and line items — any format, single API call.
One API — every PDF invoice format
Whether you receive digitally generated PDFs from SaaS platforms or scanned paper invoices, the same API call handles them all.
Native PDF
Digitally generated PDFs from any invoicing software, ERPs, or billing platforms.
Scanned PDF
Paper invoices scanned to PDF, including low-res or skewed documents.
JPEG / PNG / WEBP
Invoice photos from scanner apps or mobile captures.
Multi-page
Invoices spanning several pages with line-item tables split across pages.
What you get back
POST your invoice PDF. Within seconds, the API returns a complete JSON object with every field extracted from the document:
invoice_numberinvoice_datedue_datevendor_namevendor_addressbill_tocurrencysubtotaltax_amounttotal_amountpayment_termsline_items[]Need custom fields? Define your own in the Template Builder — payment terms, IBAN, PO number, cost center, or any invoice-specific value.
{ "invoice_number": "INV-2024-00891", "invoice_date": "2024-07-15", "due_date": "2024-08-14", "vendor_name": "Globex Supplies Ltd", "currency": "EUR", "subtotal": 840.00, "tax_amount": 168.00, "total_amount": 1008.00, "payment_terms": "Net 30", "line_items": [ { "description": "Office supplies — Q3", "qty": 3, "unit_price": 280.00, "total": 840.00 } ] }
Convert a PDF invoice to JSON in 3 steps
Create an invoice template
In the Parselyze dashboard, define the fields you want extracted: vendor_name, invoice_number, total_amount, line_items, etc.
POST your PDF invoice
Use the REST API (any language) or the Node.js SDK. Pass the invoice file path, URL, or binary buffer.
Receive structured JSON
The API response contains fully typed extracted fields ready for your database, accounting system, or automation workflow.
Frequently asked questions
All about converting PDF invoices to JSON.
How do I convert a PDF invoice to JSON?
Use the Parselyze REST API or Node.js SDK. Create an invoice template in the dashboard, then POST your invoice PDF to the documents endpoint. The API returns structured JSON containing all extracted fields within seconds.
Which PDF invoice formats does Parselyze support?
Parselyze converts native PDF invoices, scanned invoice PDFs, and invoice images (JPG, PNG, WEBP, TIFF) to JSON. Multi-page invoices and documents from any supplier layout are supported.
Is custom training required to parse my invoice PDFs?
No. Parselyze uses zero-shot AI extraction that adapts to any invoice layout. You define the field names once in the Template Builder. No training data or machine learning expertise is needed.
What fields are included in the JSON output?
The JSON response includes invoice_number, vendor_name, vendor_address, invoice_date, due_date, currency, subtotal, tax_amount, total_amount, payment_terms, and a line_items array with description, qty, unit_price, and total per row.
How fast is the PDF invoice to JSON conversion?
Synchronous extraction for standard single-page PDF invoices typically completes in under 3 seconds. For large multi-page files or bulk processing, the async job endpoint is recommended, with results delivered via webhook.
Start converting PDF invoices to JSON today
50 pages/month free · No credit card required