The new Parse API is a redesign, not a version bump. The request you send changes very little, but the response has a new shape. Plan to rewrite the code that reads the parse response; there is no field-for-field mapping from the generally available (GA) Parse API. This guide covers what changed and what to update. For the full response reference, see How Parse Works.
This is a Preview release. Don’t use it in production. See Important to Know for details. To learn more about the generally available (GA) version of ADE, go to ADE Overview.
Confirm Your Input Formats
The Preview accepts PDFs and images only. If your pipeline depends on Office formats supported by the GA Parse API (Word, PowerPoint, spreadsheets), keep those workflows on the GA Parse API. See Supported File Types.
What Changes at a Glance
- Your request: mostly the same. A new endpoint, two removed parameters, and one new `options` parameter.
- Your response: new. Flat `chunks` become a hierarchical `structure` tree, grounding coordinates change units, and the Markdown for visuals is simpler.
- Other ADE APIs: ADE Extract and ADE Section expect the GA response shape. If your workflow chains Parse with either, keep those workflows on the GA Parse API for now.
Update Your Request
The request contract is close to the GA API. Update the endpoint and adjust a few parameters.

| Aspect | GA Parse API | Preview Parse API |
|---|---|---|
| Endpoint | POST /v1/ade/parse | POST /v2/ade/parse |
| Authentication | Bearer API key | Bearer API key (unchanged) |
| File input | document or document_url | document or document_url (unchanged) |
| Model selection | model form field | model form field (unchanged) |
| Page or section splitting | split form field | Removed. Use options.pages to select pages to process. |
| Figure prompts | custom_prompts form field | Removed. |
| Output customization | Not functional | options: select pages, set table format, toggle captions, control grounding detail. |

For the available settings, see the `options` schema.
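
The parameter changes in the table can be sketched as a small helper that rewrites GA-style form fields for the Preview endpoint. This is a minimal sketch, not the official client: `migrate_form_fields` is a hypothetical name, the model value is a placeholder, and sending `options` as a JSON-encoded form field is an assumption to check against the `options` schema.

```python
import json

# GA-only form fields that the Preview endpoint no longer accepts.
REMOVED_FIELDS = {"split", "custom_prompts"}


def migrate_form_fields(ga_fields, options=None):
    """Return Preview-style form fields built from GA-style ones."""
    fields = {k: v for k, v in ga_fields.items() if k not in REMOVED_FIELDS}
    if options:
        # Assumption: `options` travels as a JSON-encoded form field.
        fields["options"] = json.dumps(options)
    return fields


ga_fields = {
    "document_url": "https://example.com/report.pdf",
    "model": "example-model",  # placeholder, not a real model name
    "split": "page",           # illustrative GA value; dropped below
}
preview_fields = migrate_form_fields(
    ga_fields,
    options={"elements": {"table": {"format": "html"}}},
)
print(preview_fields)
```

The fields that the table marks as unchanged (`document_url`, `model`) pass through untouched, so only the endpoint path and the removed parameters need attention in calling code.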
Rework Your Response Handling
This is where the work is. The top-level response fields are different, and the concepts behind them changed.
Chunks Are Now Elements
The GA API returns a flat `chunks` array, where each chunk carries its own `markdown` and inline `grounding`. The Preview API replaces this with a hierarchical `structure` tree: a document whose children are pages, and each page’s children are the elements on that page. Tables nest their cells (`td` and `th`) as children.
Elements no longer carry their own Markdown or coordinates. Instead, each element has a `span` (a `[start, end)` range into the top-level `markdown` string) and an `id` you use to look up its location in the top-level `grounding` map.
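
As a rough illustration of the tree walk, the sketch below iterates pages and elements, slices each element's text out of the shared Markdown, and looks up its grounding by `id`. The `response` dict is a hand-written stand-in, not real API output; key names beyond those described above (such as `type` and `box`) and the helper names are assumptions.

```python
# Hand-written stand-in for a Preview response. Keys such as "type" and
# "box" are assumptions made for this sketch.
response = {
    "markdown": "# Title\n\nHello world.",
    "structure": {
        "type": "document",
        "children": [
            {
                "type": "page",
                "children": [
                    {"id": "e1", "type": "title", "span": [0, 7]},
                    {"id": "e2", "type": "text", "span": [9, 21]},
                ],
            }
        ],
    },
    "grounding": {
        "e1": {"box": [72, 72, 300, 100]},
        "e2": {"box": [72, 120, 540, 140]},
    },
}


def iter_elements(structure):
    """Yield (page_index, element) for every element on every page."""
    for page_index, page in enumerate(structure.get("children", [])):
        for element in page.get("children", []):
            yield page_index, element


def element_text(markdown, element):
    """Slice an element's text out of the top-level Markdown string."""
    start, end = element["span"]
    return markdown[start:end]


rows = [
    (
        page_index,
        el["id"],
        element_text(response["markdown"], el),
        response["grounding"].get(el["id"]),
    )
    for page_index, el in iter_elements(response["structure"])
]
for row in rows:
    print(row)
```

This walk stops at page-level elements. Table cells are nested one level deeper, so code that needs `td` and `th` nodes would have to recurse into each element's own `children`.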
Grounding Moved and Changed
In the GA API, grounding lived both inline on each chunk and in a top-level `grounding` map. In the Preview API, grounding lives only in the top-level `grounding` map, keyed by element `id`. Three things changed about the values:
- Coordinates are integers in page units. The GA API used normalized floats from 0 to 1. The Preview API uses integer points (`pt`) for PDFs and integer pixels (`px`) for images, as `[left, top, right, bottom]`. To draw or compare boxes, read the page’s `width`, `height`, and `unit` from `structure` rather than assuming a 0-to-1 range.
- Line-level detail through `parts`. Each grounding entry adds a `parts` array. For `text` and `marginalia`, there is one part per visual line, so you can highlight or extract at the line level. Other element types return an empty `parts` array.
- No confidence score. The GA `confidence` and `low_confidence_spans` fields are removed.
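
If downstream code still expects GA-style normalized boxes, one option is to divide each coordinate by the page dimensions. The sketch below assumes the page's `width` and `height` have already been read from `structure`; `normalize_box` is a hypothetical helper and the sample values are made up.

```python
def normalize_box(box, page_width, page_height):
    """Convert [left, top, right, bottom] in page units to 0-to-1 floats."""
    left, top, right, bottom = box
    return [
        left / page_width,
        top / page_height,
        right / page_width,
        bottom / page_height,
    ]


# Made-up page metadata: a US Letter PDF page measured in points.
page = {"width": 612, "height": 792, "unit": "pt"}
box = [153, 198, 459, 396]

normalized = normalize_box(box, page["width"], page["height"])
print(normalized)  # [0.25, 0.25, 0.75, 0.5]
```

The same arithmetic applies to `px` boxes on images, because the box and the page dimensions share a unit. Normalizing per page also keeps boxes comparable across documents that mix PDFs and images.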
Markdown Is Cleaner
The GA API wrapped visual elements in non-standard tags (for example, `<::logo: ...::>`) and embedded long generated descriptions. The Preview API uses standard Markdown:
- Visual elements use image syntax. Figures and logos render as `![label]`, where the label is a short description or a classification subtype such as `![CHART]`. See Figure and Attestation Labels.
- Page breaks are explicit. A `<!-- page -->` comment separates each page’s content (absent in single-page documents).
- Tables and math are standardized. Tables use pipe syntax by default (set `options.elements.table.format="html"` for HTML), and math uses `$$...$$` (block) and `$...$` (inline).
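
The conventions above can be handled with plain string operations. The sketch below splits a made-up Markdown string on the page-break comment and collects image labels per page; the sample text and the regular expression are illustrative assumptions, not part of the API.

```python
import re

PAGE_BREAK = "<!-- page -->"
# Matches the label inside ![label]; assumes labels contain no "]".
IMAGE_LABEL = re.compile(r"!\[([^\]]*)\]")

markdown = (
    "Quarterly report\n\n![CHART]\n\n"
    "<!-- page -->\n\n"
    "| a | b |\n|---|---|\n| 1 | 2 |\n\n![company logo]"
)

pages = [part.strip() for part in markdown.split(PAGE_BREAK)]
labels_per_page = [IMAGE_LABEL.findall(page) for page in pages]
print(labels_per_page)  # [['CHART'], ['company logo']]
```

Splitting and stripping produce new strings, so element spans no longer line up with the pieces. Keep the original Markdown string around for any span-based slicing.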
Spans Are the New Link Between Outputs
The GA API duplicated Markdown onto every chunk. The Preview API returns the Markdown once, as the top-level `markdown` string, and every element points into it with a `span` of Unicode code point offsets `[start, end)`. To get an element’s text, slice the Markdown string with its span. This is the main mental shift: `structure` and `grounding` describe positions in one shared Markdown document rather than carrying their own copies of the text.
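
Because spans count Unicode code points, they can be used directly as Python string indices, but they need converting for consumers that index by UTF-16 code units (JavaScript or Java strings, for example). The sketch below shows both cases on a made-up string; `to_utf16_offset` is a hypothetical helper.

```python
# Made-up Markdown containing one character outside the Basic Multilingual
# Plane (U+1F600), which occupies two UTF-16 code units.
markdown = "Paid \U0001F600 in full"
span = [7, 14]

# Python strings index by code point, so the span slices directly.
text = markdown[span[0]:span[1]]
print(text)  # in full


def to_utf16_offset(s, code_point_offset):
    """Translate a code point offset into a UTF-16 code unit offset."""
    return len(s[:code_point_offset].encode("utf-16-le")) // 2


utf16_span = [to_utf16_offset(markdown, offset) for offset in span]
print(utf16_span)  # [8, 15]
```

The offsets shift by one after the emoji because it counts as one code point but two UTF-16 code units. For text limited to the Basic Multilingual Plane, the two offset systems agree.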
What Was Removed
These GA fields and parameters have no equivalent in the Preview API:
- `chunks` and the per-chunk `markdown` (use `structure` and `span` slices instead)
- `splits` and the `split` parameter
- `custom_prompts`
- `confidence` and `low_confidence_spans`
- Normalized 0-to-1 coordinates
- Inline per-chunk `grounding`
Migration Checklist
- Point requests at `POST /v2/ade/parse`.
- Remove the `split` and `custom_prompts` parameters; add `options` if you need page selection or output control.
- Stop reading `chunks`. Walk `structure.children` (pages) and their `children` (elements) instead.
- Look up each element’s location by `id` in the top-level `grounding` map.
- Rescale bounding boxes: switch from normalized floats to integer `pt`/`px`, using the page’s `width`, `height`, and `unit`.
- Slice the top-level `markdown` with each element’s `span` to recover its text.
- Remove any logic that depends on `confidence` or `low_confidence_spans`.
- Update visual-element handling for `![label]` image syntax instead of `<:: ... ::>` tags.