Important Considerations (Supported Features)
The library supports:- These APIs: , ADE Parse Jobs, , and
- Setting the
- The legacy API
Install the Library
Set the API Key as an Environment Variable
To use the library, first generate an API key. Save the key to a.zshrc file or another secure location on your computer. Then export the key as an environment variable.
VISION_AGENT_API_KEY environment variable:
For more information about API keys and alternate methods for setting the API key, go to API Key.
Use with EU Endpoints
By default, the library uses the US endpoints. If your API key is from the EU endpoint, set theenvironment parameter to eu when initializing the client.
For more information about using in the EU, go to European Union (EU).
Parse: Getting Started
Theparse method converts documents into structured Markdown with chunk and grounding metadata. Use these examples as guides to get started with parsing with the library.
Parse Local Files
Use thedocument parameter to parse files from your filesystem. Pass the file as a read stream using fs.createReadStream().
Parse Remote URLs
Use thedocument parameter with fetch() to parse files from remote URLs (http, https).
Set Parameters
Theparse method accepts optional parameters to customize parsing behavior. To see all available parameters, go to ADE Parse API.
Pass these parameters directly to the parse() method.
Parse Jobs
TheparseJobs resource enables you to asynchronously parse documents that are up to 1,000 pages or 1 GB.
For more information about parse jobs, go to Parse Large Files (Parse Jobs).
Here is the basic workflow for working with parse jobs:
- Start a parse job.
- Copy the
job_idin the response. - Get the results from the parsing job with the
job_id.
List Parse Jobs
To list all async parse jobs associated with your API key, run this code:Work with Parse Response Data
Access all text chunks:Split: Getting Started
Thesplit method classifies and separates a parsed document into multiple sub-documents based on Split Rules you define. Use these examples as guides to get started with splitting with the library.
Pass Markdown Content
The library supports a few methods for passing the Markdown content for splitting:
- Split data directly from the parse response
- Split data from a local Markdown file
- Split data from a Markdown file at a remote URL:
markdown: await fetch("https://example.com/file.md")
name: The Split Type name (required)description: Additional context about what this Split Type represents (optional)identifier: A field that makes each instance unique, used to create separate splits (optional)
Split from Parse Response
After parsing a document, you can pass the Markdown string directly from theParseResponse to the split method without saving it to a file.
Split from Markdown Files
If you already have a Markdown file (from a previous parsing operation), you can split it directly. Use themarkdown parameter for local Markdown files or the markdown parameter with fetch() for remote Markdown files.
Set Parameters
Thesplit method accepts optional parameters to customize split behavior. To see all available parameters, go to ADE Split API.
Split Output
Thesplit method returns a SplitResponse object with the following fields:
splits: Array ofSplitobjects, each containing:classification: The Split Type name assigned to this sub-documentidentifier: The unique identifier value (ornullif no identifier was specified)pages: Array of zero-indexed page numbers that belong to this splitmarkdowns: Array of Markdown content strings, one for each page
metadata: Processing information (credit usage, duration, filename, job ID, page count, version)
Work with Split Response Data
Access all splits by classification:Extract: Getting Started
Theextract method extracts structured data from Markdown content using extraction schemas. Use these examples as guides to get started with extracting with the library.
Pass Markdown Content
The library supports a few methods for passing the Markdown content for extraction:
- Extract data directly from the parse response
- Extract data from a local Markdown file
- Extract data from a Markdown file at a remote URL:
markdown: await fetch("https://example.com/file.md")
Extract from Parse Response
After parsing a document, you can pass the markdown string directly from theParseResponse to the extract method without saving it to a file.
Extract from Markdown Files
If you already have a Markdown file (from a previous parsing operation), you can extract data directly from it. Use themarkdown parameter with fs.createReadStream() for local Markdown files or with fetch() for remote Markdown files.
Extraction with Zod
Use Zod schemas to define your extraction schema in a type-safe way. Zod provides TypeScript type inference and runtime validation for your extracted data. To use Zod with the library, installzod:
zod, run extraction with the library:
Extraction with JSON Schema (Inline)
Define your extraction schema directly as a JSON string in your script.Extraction with JSON Schema File
Load your extraction schema from a separate JSON file for better organization and reusability. For example, here is thepay_stub_schema.json file:
Extract Nested Subfields
Define nested Zod schemas to extract hierarchical data from documents. This approach organizes related information under meaningful section names. The schemas with nested fields must be defined before the main extraction schema. Otherwise you may get an error that the schemas with the nested fields are not defined. For example, to extract data from the Patient Details and Emergency Contact Information sections in this Medical Form, define separate schemas for each section, then combine them in a main schema.Extract Variable-Length Data with List Objects
Use Zod’sz.array() to extract repeatable data structures when you don’t know how many items will appear. Common examples include line items in invoices, transaction records, or contact information for multiple people.
For example, to extract variable-length wire instructions and line items from this Wire Transfer Form, use z.array(DescriptionItemSchema) for line items and z.array(WireInstructionSchema) for wire transfer details.
Link Extracted Data to Document Locations
Use the references IDs fromextraction_metadata to find the exact location where data was extracted in the source document. This is useful for visual validation, quality assurance, or building confidence scores.
Sample Scripts for Common Use Cases
Parse a Directory of Documents
Async Parse: Processing Multiple Documents Concurrently
Use concurrent parsing when you need to process many lightweight documents (such as invoices, receipts, or forms) efficiently. The TypeScript library’s methods are already asynchronous, allowing you to send multiple parse requests concurrently using JavaScript’sPromise.all() or Promise.allSettled(). This significantly reduces total processing time compared to sequential requests.
The concurrent approach lets you send multiple requests in parallel. While one document is being processed, another request can be sent. The API server handles the actual document processing in the background.
To avoid exceeding the pages per hour limits and receiving 429 errors, use a concurrency control library like p-limit to limit the number of simultaneous requests.
Basic Concurrent Parsing
Concurrent Parsing with Rate Limiting
To control concurrency and avoid rate limits, use thep-limit library:
Save Parsed Output
Use this script to save the parsed output to JSON and Markdown files for downstream processing.Visualize Parsed Chunks: Draw Bounding Boxes
Use this script to visualize parsed chunks by drawing color-coded bounding boxes on your document. Each chunk type uses a distinct color, making it easy to see how the document was parsed. The script identifies chunk types and table cells. For PDFs, the script creates a separate annotated PNG for each page (page_1_annotated.png, page_2_annotated.png). For image files, the script creates a single page_annotated.png.
The image below shows an example output with bounding boxes drawn on the first page of a PDF:


