Overview
When you run the API, the response includes anextraction_metadata field that contains reference IDs linking each extracted field back to specific chunks in the response. These reference IDs allow you to pinpoint the exact document location where each piece of data was found.
This workflow requires both and responses:
- The response contains the grounding information (page numbers and bounding box coordinates)
- The response contains the extracted data and reference IDs that connect back to those locations
Scenario and Materials
- Parse this PDF: Pay Stub
- Extract these fields: Employee Name and Gross Pay
Scripts
These examples require the Python or TypeScript client library. Before running a script, set your API key and install the library and any required dependencies.

