Parse Large Files (Parse Jobs)

The ADE Parse Jobs API enables you to parse large documents that exceed the size limits of the standard ADE Parse API.

Monitor Parse Jobs

You can monitor parse jobs with these APIs:

ADE Get Parse Jobs: Get the status for a specific parse job.
ADE List Parse Jobs: List all parse jobs associated with your API key.

Rate Limits for ADE Parse Jobs

The ADE Parse Jobs API allows you to parse large documents. The following table shows the limits for the ADE Parse Jobs API.

Maximum File Size	Maximum Pages
1 GB	6,000 pages

To see the rate limits for all APIs, go to Rate Limits.

API Reference

To learn more, go to the reference pages for the Parse Jobs APIs:

Save Parsed Output to a URL

When calling the ADE Parse Jobs endpoint, you can use the output_save_url parameter to save the parsed Markdown to a specified URL instead of returning it in the API response. This is useful for managing large documents, integrating with your existing storage workflow, or complying with data retention policies.

When Parsed Output Is Saved to a URL

The parsed Markdown is saved to a URL in these scenarios:

You specify the output_save_url parameter: The Markdown is saved to your specified URL.
The parsed Markdown exceeds 1 MB: The Markdown is automatically saved to a presigned S3 URL generated by . The URL expires after 1 hour, but time you call the ADE Get Parse Jobs API, a new presigned URL is generated.

Behavior when the output is saved to a URL:

The output_url field in the API response contains the URL where the Markdown is stored.
The data field in the API response is None.

URL Requirements

If you specify the output_save_url parameter, your URL must meet these requirements:

The URL must be a public or presigned URL that explicitly allows PUT or CREATE operations (depending on the provider).
Tested storage providers: Amazon S3, Azure Blob Storage, and Google Cloud Storage. Other storage providers may also work.
The API cannot access private URLs, such as folders in Google Drive.

Example: Use Amazon S3 Presigned URLs

If you use Amazon S3, you can generate a presigned URL and provide it as the output_save_url value. Presigned URLs grant temporary access to your S3 bucket without requiring authentication in the API request. For more information about presigned URLs with Amazon S3, go to the Amazon documentation. Here is a sample script that creates a presigned URL and uses it for a parsing job:

import requests
import json

headers = {
    'Authorization': 'Bearer YOUR_API_KEY'
}

url = 'https://api.va.landing.ai/v1/ade/parse/jobs'

# Prepare the request payload
output_save_url = generate_presigned_url(...)
files = {'document_url': 'https://...', 'output_save_url': output_save_url}

response = requests.post(url, files=files, headers=headers)
print(response.json())

ZDR Requirements

When zero data retention (ZDR) is enabled, you must configure the following parameters to ensure that does not store the document content:

Pass your document in the document_url parameter. You cannot use the document parameter with ZDR enabled.
Include the output_save_url parameter. This ensures that the parsed content is saved to your specified URL instead of being returned in the API response. To learn how to configure this parameter, go to Save Parsed Output to a URL.

Workflow Overview

Parse a document with the ADE Parse Jobs API.
Copy the job_id in the API response.
To get results from the parsing job, call the ADE Get Parse Jobs API with the job_id.
The parsed content is returned as Markdown in data.markdown, or as a URL in output_url (in which case data is None). For more information, go to Save Parsed Output to a URL.
If you need to extract fields:
1. Create an extraction schema.
2. Send the Markdown to the API.

End-to-End Workflow: Parse and Extract the Output

This tutorial walks you through how to parse a document with the ADE Parse Jobs API and then extract a subset of fields from it using the API. For the sake of simplicity, we use a 2-page PDF in this example, with the intention that you will use larger documents in your own use case. We provide a separate script for each endpoint, so you can choose to skip the extraction steps if you don’t need them. Scenario and materials:

Parse this PDF: MRI Report
Extract these fields: Exam Date and Procedure

1. Create Parse Job & Get Job ID

First, run the script below to create a parse job.

import requests
import json

headers = {
    'Authorization': 'Bearer YOUR_API_KEY'
}

url = 'https://api.va.landing.ai/v1/ade/parse/jobs'

# Upload a document
document = open('mri-report.pdf', 'rb')
files = {'document': document}
data = {'model': 'dpt-2-latest'}

response = requests.post(url, files=files, data=data, headers=headers)
print(response.json())

This returns the job_id:

{'job_id': 'cmfx34ewm0000hyoqkh9dzd8n'}

2. Use job_id to Get Parsing Results

Use the job_id to get the parsing status. If parsing is complete, the code saves the parsed output (the content in the data.markdown attribute) to a Markdown file. You will then use this Markdown file to extract the fields in the next step.

import requests

headers = {
    'Authorization': 'Bearer YOUR_API_KEY'
}
url = f'https://api.va.landing.ai/v1/ade/parse/jobs/{jobId}'

response = requests.get(url, headers=headers)
response_data = response.json()

# Print the full response
print(response_data)

# Check if job is completed
if response_data.get('status') == 'completed':
    # Check if markdown content is available in data
    if 'data' in response_data and response_data['data'].get('markdown'):
        markdown_content = response_data['data']['markdown']
        
        # Save markdown content to file
        with open('markdown-mri-report.md', 'w', encoding='utf-8') as f:
            f.write(markdown_content)
        
        print("\nMarkdown content saved to a Markdown file.")
    
    # Check if output_url is available instead
    elif response_data.get('output_url'):
        print("Use the Markdown file specified in `output_url`.")
    
    else:
        print("No Markdown content or `output_url` found in the completed job response.")
else:
    print(f"\nJob status: {response_data.get('status', 'unknown')}.")

If the parsing is complete, the response will look like the response below. The Markdown file will be similar to this one: Markdown for MRI Report.

{'job_id': 'cmfx7w34q0000wcrdaaz45miu', 'status': 'completed', 'received_at': 1758671983, 'progress': 1.0, 'org_id': None, 'version': 'latest', 'data': {'markdown': '<a id=\'3c3677d1-5dba-4f48-91fd-719663f870b4\'></a>\n\nSummary : This image is a logo for the website USARAD.com, featuring stylized text over a blue globe with network-like lines.\n\nlogo: USARAD.com\n\nCompany Name & Tagline :\n  • The logo displays "USARAD.com" as the main text.\n  • "USA" is rendered in white with a blue outline and a pattern of stars.\n  • "RAD" is in bold red uppercase letters.\n  • ".com" is in smaller white text with a blue outline, positioned to the right of "RAD".\n\nGraphic Elements :\n  • The background is a blue globe with visible continents (North and South America, parts of Europe and Africa).\n  • The globe features light blue intersecting lines, suggesting global connectivity or a network.\n  • The logo has a slight shadow beneath the globe, giving a three-dimensional effect.\n\nLayout & Placement :\n  • "USARAD.com" text is superimposed across the front of the globe, slightly angled.\n  • The text is centered horizontally on the globe.\n\nAnalysis :\n  • The logo combines patriotic elements (stars, "USA") with a global and technological theme (networked globe), likely representing a US-based radiology or medical imaging company with international reach.\n\n<a id=\'613fcdb6-82a6-455a-9401-cb0dbebbb975\'></a>\n\n2601 E. Oakland Park Blvd, Suite 102 Ft. Lauderdale, FL  33306 Phone: 888.886.5238  Fax: 888.886.5221\n\n<a id=\'13878c67-394d-472b-8e96-4e43ccf6a490\'></a>\n\nPatient: DOE, JOHN\nMRN : JD4USARAD\nReferring Physician: DR. DAVID LIVESEY\n\n<a id=\'c36c0270-60e4-445f-9a11-bb93c63aa136\'></a>\n\nExam Date:  05/20/2010\nDOB:  01/01/1961\nFAX:  (305) 418-8166\n\n<a id=\'92665435-32ff-4d46-b63f-12684f782dbb\'></a>\n\nMRI OF THE LUMBAR SPINE WITH AND WITHOUT CONTRAST\n\n<a id=\'bb927ff0-78f5-4d00-87cb-50dda3446792\'></a>\n\nPROCEDURE:\nMRI of lumbosacral spine without/with IV contrast.\n\n<a id=\'04136993-4ee0-4503-b8f1-b28edc82719f\'></a>\n\nINDICATION:\nRadiculopathy post L2-L3 fusion, question incomplete fusion. Persistent symptoms.\n\n<a id=\'06366b0f-d1f3-4373-98f5-a872b608f00a\'></a>\n\nCOMPARISON:\nNone.\n\n<a id=\'09ba7c03-fd6d-4888-92ec-826d4c94f18e\'></a>\n\nTECHNIQUE:\nMultiplanar and multi-sequence imaging of the lumbosacral spine without/with intravenous contrast using a 0.3T MRI scanner.\n\n<a id=\'e692bd72-8408-4f78-8737-89b1191d09b0\'></a>\n\nFINDINGS:\nPostoperative findings of posterior intrapedicular spinal fusion at L2-L3 noted. The L2-L3 disk is preserved. Enhancing peridural fibrosis noted at L2-L3 level mildly deforming the thecal sac with dominant extrinsic impression on the right lateral thecal sac. Non enhancing cystic foci noted along the posterior elements representing small pseudomeningoceles. Postoperative fusion and laminectomy noted at L4-L5 level with osseous fusion anteriorly. Osseous hypertrophy of the posterior elements noted at L4 and L5. Lumbar lordosis is decreased. Multilevel endplate, disk and facet degenerative changes noted. Conus medullaris terminates at approximately mid L1 vertebral body level.\n\n<a id=\'b08f50d5-9f82-49f1-bee3-2b32b2ff5812\'></a>\n\nL1-L2 shows moderate broad-based disc bulging contributing to mild to moderate left greater than right neuroforamina narrowing. Spinal canal is grossly patent. Approximately 2 mm L1 on L2 retrolisthesis noted.\n\n<a id=\'f783b847-239e-4879-8ef9-6b0875fdd81b\'></a>\n\nL2-L3 shows moderate nonenhancing bi foraminal broad-based disk bulging contributing to mild-to-moderate right greater than left neural foramina narrowing. Moderate acquired spinal canal stenosis noted due to enhancing peridural fibrosis with asymmetric more focal extrinsic impression on the right lateral ventral thecal sac. Negligible spondylolisthesis of L2 on L3 noted.\n\n<a id=\'b26c305d-8971-4447-a9f3-3e86c475adaa\'></a>\n\nL3-L4 level shows mild disk desiccation and height loss. Extraforaminal focal annular tears noted on both sides. Spinal canal and foramina are patent.\n\n<a id=\'ea7169ff-e7dd-49f3-a2ad-7aa66c6fad20\'></a>\n\nPage 1 of 2\n\n<a id=\'0a069015-2af6-4308-a105-1f2810e17732\'></a>\n\nSummary : This is a logo for USARAD.com, featuring the company name overlaid on a stylized globe.\n\nlogo: USARAD.com\n\nCompany Name & Tagline :\n  • Text “USARAD.com” is prominently displayed.\n  • “USA” is in white with a blue star pattern.\n  • “RAD” is in bold red.\n  • “.com” is in white italicized font.\n\nGraphic Elements :\n  • Background is a blue globe with visible continents (North America, South America, Africa, Europe).\n  • The globe has light blue intersecting lines suggesting global connectivity or network.\n\nDesign & Placement :\n  • The text “USARAD.com” wraps around the front of the globe, slightly angled.\n  • The globe is shaded with a gradient, giving a 3D effect.\n  • No additional text, icons, or taglines are present.\n\nAnalysis :\n  • The logo uses a globe and network lines to convey international reach and technological connectivity.\n  • The colour scheme (blue, white, red) aligns with the US national colours, reinforcing the “USA” identity.\n  • The bold “RAD” likely refers to radiology, emphasizing the company’s focus.\n\n<a id=\'c0521088-587c-4c06-9fa0-beea0b017c5d\'></a>\n\n2601 E. Oakland Park Blvd, Suite 102 Ft. Lauderdale, FL 33306 Phone: 888.886.5238  Fax: 888.886.5221\n__________________________________________________________________________________\n\n<a id=\'6fe54f5d-7fd5-4925-a4df-e1a5d48ce9cc\'></a>\n\nPatient: DOE, JOHN\n\nMRN : JD4USARAD\n\nReferring Physician: DR. DAVID LIVESEY\n\n<a id=\'dbd818a0-25d4-45c2-a27f-cc521de5f6ab\'></a>\n\nExam Date:  05/20/2010\nDOB:  01/01/1961\nFAX:  (305) 418-8166\n\n<a id=\'92036285-8d2d-4cf3-ae6a-4976f786406d\'></a>\n\nL4-L5 level shows postoperative findings with partial fusion anteriorly with linear hyper intense signal in the remaining intervertebral disk space. Spinal canal and foramina are patent. No gross thecal sac deformity noted. Bilateral laminectomies noted.\n\n<a id=\'ecc73c56-fbe8-4b7b-b83c-31c9422349d3\'></a>\n\nL5-S1 level shows subtle left central broad-based disk protrusion. Spinal canal and foramina are patent. No gross thecal sac deformity. Bilateral laminectomies noted.\nFerromagnetic susceptibility artifact noted along the mid posterior back spanning from L2 through S2.\n\n<a id=\'1b0984f2-73a9-4ca3-b999-86f153dffffb\'></a>\n\nNo suspicious prevertebral or posterior paraspinal soft tissue signal abnormality noted.\nMild subchondral sclerosis of the included sacroiliac joints noted.\nIncidental note of overdistended bladder.\n\n<a id=\'94e15f89-6ee7-430a-b673-4e2cc55b7930\'></a>\n\nIMPRESSION:\n1. Postoperative findings of posterior spinal intrapedicular fusion at L2-L3 level. L2-L3 shows moderate nonenhancing bi foraminal broad-based disk bulging contributing to mild-to-moderate right greater than left neural foramina narrowing. Moderate acquired spinal canal stenosis noted due to enhancing peridural fibrosis with asymmetric more focal extrinsic impression on the right lateral ventral thecal sac. Negligible spondylolisthesis of L2 on L3 noted. Non enhancing cystic foci noted along the posterior elements representing small pseudomeningoceles.\n\n<a id=\'209f768a-c8bf-48da-9f04-fc1d69c3501b\'></a>\n\n2. Postoperative fusion and laminectomy noted at L4-L5 levels with osseous fusion anteriorly. Osseous hypertrophy of the posterior elements noted at L4 and L5.\n\n<a id=\'19ed3606-f896-47a6-9f91-d1431a93c1ae\'></a>\n\n3. L5-S1 level shows subtle left central broad-based disk protrusion. Spinal canal and foramina are patent. No gross thecal sac deformity.\n\n<a id=\'d9f80cfa-7a41-4855-9308-d95d0132d65e\'></a>\n\nNote that the stability of findings cannot be determined in the absence of prior imaging for comparison/correlation.\nCorrelation with prior imaging is advised to document stability of findings described.\n\n<a id=\'79c68561-1362-45a5-a77a-a7a067cff381\'></a>\n\n-Electronically Signed by: RADIOLOGIST, ADMIN  on\n\n<a id=\'c50ff2eb-8377-413a-90b0-21a457f2cea7\'></a>\n\n06/07/2010 1:22:01 PM\n\n<a id=\'f4472384-fa63-40d6-b07f-ee87c675bded\'></a>\n\nPage 2 of 2', 'chunks': [{'markdown': 'Summary : This image is a logo for the website USARAD.com, featuring stylized text over a blue globe with network-like lines.\n\nlogo: USARAD.com\n\nCompany Name & Tagline :\n  • The logo displays "USARAD.com" as the main text.\n  • "USA" is rendered in white with a blue outline and a pattern of stars.\n  • "RAD" is in bold red uppercase letters.\n  • ".com" is in smaller white text with a blue outline, positioned to the right of "RAD".\n\nGraphic Elements :\n  • The background is a blue globe with visible continents (North and South America, parts of Europe and Africa).\n  • The globe features light blue intersecting lines, suggesting global connectivity or a network.\n  • The logo has a slight shadow beneath the globe, giving a three-dimensional effect.\n\nLayout & Placement :\n  • "USARAD.com" text is superimposed across the front of the globe, slightly angled.\n  • The text is centered horizontally on the globe.\n\nAnalysis :\n  • The logo combines patriotic elements (stars, "USA") with a global and technological theme (networked globe), likely representing a US-based radiology or medical imaging company with international reach.', 'type': 'figure', 'id': '3c3677d1-5dba-4f48-91fd-719663f870b4', 'grounding': {'box': {'left': 0.40283289551734924, 'top': 0.005662549287080765, 'right': 0.5821667909622192, 'bottom': 0.12223149836063385}, 'page': 0}}, {'markdown': '2601 E. Oakland Park Blvd, Suite 102 Ft. Lauderdale, FL  33306 Phone: 888.886.5238  Fax: 888.886.5221', 'type': 'text', 'id': '613fcdb6-82a6-455a-9401-cb0dbebbb975', 'grounding': {'box': {'left': 0.10308089852333069, 'top': 0.12418577075004578, 'right': 0.886221170425415, 'bottom': 0.1434800624847412}, 'page': 0}}, {'markdown': 'Patient: DOE, JOHN\nMRN : JD4USARAD\nReferring Physician: DR. DAVID LIVESEY', 'type': 'text', 'id': '13878c67-394d-472b-8e96-4e43ccf6a490', 'grounding': {'box': {'left': 0.053654029965400696, 'top': 0.15234115719795227, 'right': 0.45202648639678955, 'bottom': 0.2294374406337738}, 'page': 0}}, {'markdown': 'Exam Date:  05/20/2010\nDOB:  01/01/1961\nFAX:  (305) 418-8166', 'type': 'text', 'id': 'c36c0270-60e4-445f-9a11-bb93c63aa136', 'grounding': {'box': {'left': 0.5455445051193237, 'top': 0.152063250541687, 'right': 0.8537962436676025, 'bottom': 0.22956213355064392}, 'page': 0}}, {'markdown': 'MRI OF THE LUMBAR SPINE WITH AND WITHOUT CONTRAST', 'type': 'text', 'id': '92665435-32ff-4d46-b63f-12684f782dbb', 'grounding': {'box': {'left': 0.05383077263832092, 'top': 0.2380325347185135, 'right': 0.6541917324066162, 'bottom': 0.26444464921951294}, 'page': 0}}, {'markdown': 'PROCEDURE:\nMRI of lumbosacral spine without/with IV contrast.', 'type': 'text', 'id': 'bb927ff0-78f5-4d00-87cb-50dda3446792', 'grounding': {'box': {'left': 0.055629804730415344, 'top': 0.27314722537994385, 'right': 0.5007871389389038, 'bottom': 0.3144368529319763}, 'page': 0}}, {'markdown': 'INDICATION:\nRadiculopathy post L2-L3 fusion, question incomplete fusion. Persistent symptoms.', 'type': 'text', 'id': '04136993-4ee0-4503-b8f1-b28edc82719f', 'grounding': {'box': {'left': 0.0549871027469635, 'top': 0.32615983486175537, 'right': 0.7879728078842163, 'bottom': 0.36981821060180664}, 'page': 0}}, {'markdown': 'COMPARISON:\nNone.', 'type': 'text', 'id': '06366b0f-d1f3-4373-98f5-a872b608f00a', 'grounding': {'box': {'left': 0.054171472787857056, 'top': 0.38120588660240173, 'right': 0.2069973349571228, 'bottom': 0.42326685786247253}, 'page': 0}}, {'markdown': 'TECHNIQUE:\nMultiplanar and multi-sequence imaging of the lumbosacral spine without/with intravenous contrast using a 0.3T MRI scanner.', 'type': 'text', 'id': '09ba7c03-fd6d-4888-92ec-826d4c94f18e', 'grounding': {'box': {'left': 0.054425954818725586, 'top': 0.436515748500824, 'right': 0.9229190349578857, 'bottom': 0.4970395565032959}, 'page': 0}}, {'markdown': 'FINDINGS:\nPostoperative findings of posterior intrapedicular spinal fusion at L2-L3 noted. The L2-L3 disk is preserved. Enhancing peridural fibrosis noted at L2-L3 level mildly deforming the thecal sac with dominant extrinsic impression on the right lateral thecal sac. Non enhancing cystic foci noted along the posterior elements representing small pseudomeningoceles. Postoperative fusion and laminectomy noted at L4-L5 level with osseous fusion anteriorly. Osseous hypertrophy of the posterior elements noted at L4 and L5. Lumbar lordosis is decreased. Multilevel endplate, disk and facet degenerative changes noted. Conus medullaris terminates at approximately mid L1 vertebral body level.', 'type': 'text', 'id': 'e692bd72-8408-4f78-8737-89b1191d09b0', 'grounding': {'box': {'left': 0.05330219864845276, 'top': 0.5104570388793945, 'right': 0.9221934080123901, 'bottom': 0.6804394721984863}, 'page': 0}}, {'markdown': 'L1-L2 shows moderate broad-based disc bulging contributing to mild to moderate left greater than right neuroforamina narrowing. Spinal canal is grossly patent. Approximately 2 mm L1 on L2 retrolisthesis noted.', 'type': 'text', 'id': 'b08f50d5-9f82-49f1-bee3-2b32b2ff5812', 'grounding': {'box': {'left': 0.053708285093307495, 'top': 0.695064127445221, 'right': 0.9130266904830933, 'bottom': 0.7541750073432922}, 'page': 0}}, {'markdown': 'L2-L3 shows moderate nonenhancing bi foraminal broad-based disk bulging contributing to mild-to-moderate right greater than left neural foramina narrowing. Moderate acquired spinal canal stenosis noted due to enhancing peridural fibrosis with asymmetric more focal extrinsic impression on the right lateral ventral thecal sac. Negligible spondylolisthesis of L2 on L3 noted.', 'type': 'text', 'id': 'f783b847-239e-4879-8ef9-6b0875fdd81b', 'grounding': {'box': {'left': 0.05392366647720337, 'top': 0.7686383128166199, 'right': 0.929722249507904, 'bottom': 0.8474408984184265}, 'page': 0}}, {'markdown': 'L3-L4 level shows mild disk desiccation and height loss. Extraforaminal focal annular tears noted on both sides. Spinal canal and foramina are patent.', 'type': 'text', 'id': 'b26c305d-8971-4447-a9f3-3e86c475adaa', 'grounding': {'box': {'left': 0.05406838655471802, 'top': 0.8609051704406738, 'right': 0.9318066835403442, 'bottom': 0.9026641845703125}, 'page': 0}}, {'markdown': 'Page 1 of 2', 'type': 'marginalia', 'id': 'ea7169ff-e7dd-49f3-a2ad-7aa66c6fad20', 'grounding': {'box': {'left': 0.4115692377090454, 'top': 0.947119951248169, 'right': 0.5217132568359375, 'bottom': 0.9693078994750977}, 'page': 0}}, {'markdown': 'Summary : This is a logo for USARAD.com, featuring the company name overlaid on a stylized globe.\n\nlogo: USARAD.com\n\nCompany Name & Tagline :\n  • Text “USARAD.com” is prominently displayed.\n  • “USA” is in white with a blue star pattern.\n  • “RAD” is in bold red.\n  • “.com” is in white italicized font.\n\nGraphic Elements :\n  • Background is a blue globe with visible continents (North America, South America, Africa, Europe).\n  • The globe has light blue intersecting lines suggesting global connectivity or network.\n\nDesign & Placement :\n  • The text “USARAD.com” wraps around the front of the globe, slightly angled.\n  • The globe is shaded with a gradient, giving a 3D effect.\n  • No additional text, icons, or taglines are present.\n\nAnalysis :\n  • The logo uses a globe and network lines to convey international reach and technological connectivity.\n  • The colour scheme (blue, white, red) aligns with the US national colours, reinforcing the “USA” identity.\n  • The bold “RAD” likely refers to radiology, emphasizing the company’s focus.', 'type': 'figure', 'id': '0a069015-2af6-4308-a105-1f2810e17732', 'grounding': {'box': {'left': 0.41032475233078003, 'top': 0.012856528162956238, 'right': 0.5895509123802185, 'bottom': 0.1273379623889923}, 'page': 1}}, {'markdown': '2601 E. Oakland Park Blvd, Suite 102 Ft. Lauderdale, FL 33306 Phone: 888.886.5238  Fax: 888.886.5221\n__________________________________________________________________________________', 'type': 'text', 'id': 'c0521088-587c-4c06-9fa0-beea0b017c5d', 'grounding': {'box': {'left': 0.10729825496673584, 'top': 0.12740609049797058, 'right': 0.8849425315856934, 'bottom': 0.1450037658214569}, 'page': 1}}, {'markdown': 'Patient: DOE, JOHN\n\nMRN : JD4USARAD\n\nReferring Physician: DR. DAVID LIVESEY', 'type': 'text', 'id': '6fe54f5d-7fd5-4925-a4df-e1a5d48ce9cc', 'grounding': {'box': {'left': 0.0529191792011261, 'top': 0.15201373398303986, 'right': 0.4523107707500458, 'bottom': 0.22950796782970428}, 'page': 1}}, {'markdown': 'Exam Date:  05/20/2010\nDOB:  01/01/1961\nFAX:  (305) 418-8166', 'type': 'text', 'id': 'dbd818a0-25d4-45c2-a27f-cc521de5f6ab', 'grounding': {'box': {'left': 0.5443383455276489, 'top': 0.15190471708774567, 'right': 0.85552978515625, 'bottom': 0.2291509062051773}, 'page': 1}}, {'markdown': 'L4-L5 level shows postoperative findings with partial fusion anteriorly with linear hyper intense signal in the remaining intervertebral disk space. Spinal canal and foramina are patent. No gross thecal sac deformity noted. Bilateral laminectomies noted.', 'type': 'text', 'id': '92036285-8d2d-4cf3-ae6a-4976f786406d', 'grounding': {'box': {'left': 0.05147621035575867, 'top': 0.25379863381385803, 'right': 0.9402420520782471, 'bottom': 0.31494930386543274}, 'page': 1}}, {'markdown': 'L5-S1 level shows subtle left central broad-based disk protrusion. Spinal canal and foramina are patent. No gross thecal sac deformity. Bilateral laminectomies noted.\nFerromagnetic susceptibility artifact noted along the mid posterior back spanning from L2 through S2.', 'type': 'text', 'id': 'ecc73c56-fbe8-4b7b-b83c-31c9422349d3', 'grounding': {'box': {'left': 0.05230274796485901, 'top': 0.3278675973415375, 'right': 0.9103895425796509, 'bottom': 0.4070744812488556}, 'page': 1}}, {'markdown': 'No suspicious prevertebral or posterior paraspinal soft tissue signal abnormality noted.\nMild subchondral sclerosis of the included sacroiliac joints noted.\nIncidental note of overdistended bladder.', 'type': 'text', 'id': '1b0984f2-73a9-4ca3-b999-86f153dffffb', 'grounding': {'box': {'left': 0.05356493592262268, 'top': 0.4204942584037781, 'right': 0.8166326284408569, 'bottom': 0.47966456413269043}, 'page': 1}}, {'markdown': 'IMPRESSION:\n1. Postoperative findings of posterior spinal intrapedicular fusion at L2-L3 level. L2-L3 shows moderate nonenhancing bi foraminal broad-based disk bulging contributing to mild-to-moderate right greater than left neural foramina narrowing. Moderate acquired spinal canal stenosis noted due to enhancing peridural fibrosis with asymmetric more focal extrinsic impression on the right lateral ventral thecal sac. Negligible spondylolisthesis of L2 on L3 noted. Non enhancing cystic foci noted along the posterior elements representing small pseudomeningoceles.', 'type': 'text', 'id': '94e15f89-6ee7-430a-b673-4e2cc55b7930', 'grounding': {'box': {'left': 0.0519845187664032, 'top': 0.4928928017616272, 'right': 0.9365636110305786, 'bottom': 0.6290770173072815}, 'page': 1}}, {'markdown': '2. Postoperative fusion and laminectomy noted at L4-L5 levels with osseous fusion anteriorly. Osseous hypertrophy of the posterior elements noted at L4 and L5.', 'type': 'text', 'id': '209f768a-c8bf-48da-9f04-fc1d69c3501b', 'grounding': {'box': {'left': 0.052049219608306885, 'top': 0.6408091187477112, 'right': 0.8776929378509521, 'bottom': 0.6834933161735535}, 'page': 1}}, {'markdown': '3. L5-S1 level shows subtle left central broad-based disk protrusion. Spinal canal and foramina are patent. No gross thecal sac deformity.', 'type': 'text', 'id': '19ed3606-f896-47a6-9f91-d1431a93c1ae', 'grounding': {'box': {'left': 0.05212289094924927, 'top': 0.6950728893280029, 'right': 0.921973705291748, 'bottom': 0.7395064830780029}, 'page': 1}}, {'markdown': 'Note that the stability of findings cannot be determined in the absence of prior imaging for comparison/correlation.\nCorrelation with prior imaging is advised to document stability of findings described.', 'type': 'text', 'id': 'd9f80cfa-7a41-4855-9308-d95d0132d65e', 'grounding': {'box': {'left': 0.054182469844818115, 'top': 0.7513603568077087, 'right': 0.8439146876335144, 'bottom': 0.8123989701271057}, 'page': 1}}, {'markdown': '-Electronically Signed by: RADIOLOGIST, ADMIN  on', 'type': 'text', 'id': '79c68561-1362-45a5-a77a-a7a067cff381', 'grounding': {'box': {'left': 0.054914891719818115, 'top': 0.8308022618293762, 'right': 0.45613688230514526, 'bottom': 0.850782573223114}, 'page': 1}}, {'markdown': '06/07/2010 1:22:01 PM', 'type': 'text', 'id': 'c50ff2eb-8377-413a-90b0-21a457f2cea7', 'grounding': {'box': {'left': 0.6882219910621643, 'top': 0.830761194229126, 'right': 0.8706592917442322, 'bottom': 0.8496735095977783}, 'page': 1}}, {'markdown': 'Page 2 of 2', 'type': 'marginalia', 'id': 'f4472384-fa63-40d6-b07f-ee87c675bded', 'grounding': {'box': {'left': 0.41190385818481445, 'top': 0.9478314518928528, 'right': 0.5217553973197937, 'bottom': 0.9693039059638977}, 'page': 1}}], 'splits': [{'class': 'full', 'identifier': 'full', 'pages': [0, 1], 'markdown': '<a id=\'3c3677d1-5dba-4f48-91fd-719663f870b4\'></a>\n\nSummary : This image is a logo for the website USARAD.com, featuring stylized text over a blue globe with network-like lines.\n\nlogo: USARAD.com\n\nCompany Name & Tagline :\n  • The logo displays "USARAD.com" as the main text.\n  • "USA" is rendered in white with a blue outline and a pattern of stars.\n  • "RAD" is in bold red uppercase letters.\n  • ".com" is in smaller white text with a blue outline, positioned to the right of "RAD".\n\nGraphic Elements :\n  • The background is a blue globe with visible continents (North and South America, parts of Europe and Africa).\n  • The globe features light blue intersecting lines, suggesting global connectivity or a network.\n  • The logo has a slight shadow beneath the globe, giving a three-dimensional effect.\n\nLayout & Placement :\n  • "USARAD.com" text is superimposed across the front of the globe, slightly angled.\n  • The text is centered horizontally on the globe.\n\nAnalysis :\n  • The logo combines patriotic elements (stars, "USA") with a global and technological theme (networked globe), likely representing a US-based radiology or medical imaging company with international reach.\n\n<a id=\'613fcdb6-82a6-455a-9401-cb0dbebbb975\'></a>\n\n2601 E. Oakland Park Blvd, Suite 102 Ft. Lauderdale, FL  33306 Phone: 888.886.5238  Fax: 888.886.5221\n\n<a id=\'13878c67-394d-472b-8e96-4e43ccf6a490\'></a>\n\nPatient: DOE, JOHN\nMRN : JD4USARAD\nReferring Physician: DR. DAVID LIVESEY\n\n<a id=\'c36c0270-60e4-445f-9a11-bb93c63aa136\'></a>\n\nExam Date:  05/20/2010\nDOB:  01/01/1961\nFAX:  (305) 418-8166\n\n<a id=\'92665435-32ff-4d46-b63f-12684f782dbb\'></a>\n\nMRI OF THE LUMBAR SPINE WITH AND WITHOUT CONTRAST\n\n<a id=\'bb927ff0-78f5-4d00-87cb-50dda3446792\'></a>\n\nPROCEDURE:\nMRI of lumbosacral spine without/with IV contrast.\n\n<a id=\'04136993-4ee0-4503-b8f1-b28edc82719f\'></a>\n\nINDICATION:\nRadiculopathy post L2-L3 fusion, question incomplete fusion. Persistent symptoms.\n\n<a id=\'06366b0f-d1f3-4373-98f5-a872b608f00a\'></a>\n\nCOMPARISON:\nNone.\n\n<a id=\'09ba7c03-fd6d-4888-92ec-826d4c94f18e\'></a>\n\nTECHNIQUE:\nMultiplanar and multi-sequence imaging of the lumbosacral spine without/with intravenous contrast using a 0.3T MRI scanner.\n\n<a id=\'e692bd72-8408-4f78-8737-89b1191d09b0\'></a>\n\nFINDINGS:\nPostoperative findings of posterior intrapedicular spinal fusion at L2-L3 noted. The L2-L3 disk is preserved. Enhancing peridural fibrosis noted at L2-L3 level mildly deforming the thecal sac with dominant extrinsic impression on the right lateral thecal sac. Non enhancing cystic foci noted along the posterior elements representing small pseudomeningoceles. Postoperative fusion and laminectomy noted at L4-L5 level with osseous fusion anteriorly. Osseous hypertrophy of the posterior elements noted at L4 and L5. Lumbar lordosis is decreased. Multilevel endplate, disk and facet degenerative changes noted. Conus medullaris terminates at approximately mid L1 vertebral body level.\n\n<a id=\'b08f50d5-9f82-49f1-bee3-2b32b2ff5812\'></a>\n\nL1-L2 shows moderate broad-based disc bulging contributing to mild to moderate left greater than right neuroforamina narrowing. Spinal canal is grossly patent. Approximately 2 mm L1 on L2 retrolisthesis noted.\n\n<a id=\'f783b847-239e-4879-8ef9-6b0875fdd81b\'></a>\n\nL2-L3 shows moderate nonenhancing bi foraminal broad-based disk bulging contributing to mild-to-moderate right greater than left neural foramina narrowing. Moderate acquired spinal canal stenosis noted due to enhancing peridural fibrosis with asymmetric more focal extrinsic impression on the right lateral ventral thecal sac. Negligible spondylolisthesis of L2 on L3 noted.\n\n<a id=\'b26c305d-8971-4447-a9f3-3e86c475adaa\'></a>\n\nL3-L4 level shows mild disk desiccation and height loss. Extraforaminal focal annular tears noted on both sides. Spinal canal and foramina are patent.\n\n<a id=\'ea7169ff-e7dd-49f3-a2ad-7aa66c6fad20\'></a>\n\nPage 1 of 2\n\n<a id=\'0a069015-2af6-4308-a105-1f2810e17732\'></a>\n\nSummary : This is a logo for USARAD.com, featuring the company name overlaid on a stylized globe.\n\nlogo: USARAD.com\n\nCompany Name & Tagline :\n  • Text “USARAD.com” is prominently displayed.\n  • “USA” is in white with a blue star pattern.\n  • “RAD” is in bold red.\n  • “.com” is in white italicized font.\n\nGraphic Elements :\n  • Background is a blue globe with visible continents (North America, South America, Africa, Europe).\n  • The globe has light blue intersecting lines suggesting global connectivity or network.\n\nDesign & Placement :\n  • The text “USARAD.com” wraps around the front of the globe, slightly angled.\n  • The globe is shaded with a gradient, giving a 3D effect.\n  • No additional text, icons, or taglines are present.\n\nAnalysis :\n  • The logo uses a globe and network lines to convey international reach and technological connectivity.\n  • The colour scheme (blue, white, red) aligns with the US national colours, reinforcing the “USA” identity.\n  • The bold “RAD” likely refers to radiology, emphasizing the company’s focus.\n\n<a id=\'c0521088-587c-4c06-9fa0-beea0b017c5d\'></a>\n\n2601 E. Oakland Park Blvd, Suite 102 Ft. Lauderdale, FL 33306 Phone: 888.886.5238  Fax: 888.886.5221\n__________________________________________________________________________________\n\n<a id=\'6fe54f5d-7fd5-4925-a4df-e1a5d48ce9cc\'></a>\n\nPatient: DOE, JOHN\n\nMRN : JD4USARAD\n\nReferring Physician: DR. DAVID LIVESEY\n\n<a id=\'dbd818a0-25d4-45c2-a27f-cc521de5f6ab\'></a>\n\nExam Date:  05/20/2010\nDOB:  01/01/1961\nFAX:  (305) 418-8166\n\n<a id=\'92036285-8d2d-4cf3-ae6a-4976f786406d\'></a>\n\nL4-L5 level shows postoperative findings with partial fusion anteriorly with linear hyper intense signal in the remaining intervertebral disk space. Spinal canal and foramina are patent. No gross thecal sac deformity noted. Bilateral laminectomies noted.\n\n<a id=\'ecc73c56-fbe8-4b7b-b83c-31c9422349d3\'></a>\n\nL5-S1 level shows subtle left central broad-based disk protrusion. Spinal canal and foramina are patent. No gross thecal sac deformity. Bilateral laminectomies noted.\nFerromagnetic susceptibility artifact noted along the mid posterior back spanning from L2 through S2.\n\n<a id=\'1b0984f2-73a9-4ca3-b999-86f153dffffb\'></a>\n\nNo suspicious prevertebral or posterior paraspinal soft tissue signal abnormality noted.\nMild subchondral sclerosis of the included sacroiliac joints noted.\nIncidental note of overdistended bladder.\n\n<a id=\'94e15f89-6ee7-430a-b673-4e2cc55b7930\'></a>\n\nIMPRESSION:\n1. Postoperative findings of posterior spinal intrapedicular fusion at L2-L3 level. L2-L3 shows moderate nonenhancing bi foraminal broad-based disk bulging contributing to mild-to-moderate right greater than left neural foramina narrowing. Moderate acquired spinal canal stenosis noted due to enhancing peridural fibrosis with asymmetric more focal extrinsic impression on the right lateral ventral thecal sac. Negligible spondylolisthesis of L2 on L3 noted. Non enhancing cystic foci noted along the posterior elements representing small pseudomeningoceles.\n\n<a id=\'209f768a-c8bf-48da-9f04-fc1d69c3501b\'></a>\n\n2. Postoperative fusion and laminectomy noted at L4-L5 levels with osseous fusion anteriorly. Osseous hypertrophy of the posterior elements noted at L4 and L5.\n\n<a id=\'19ed3606-f896-47a6-9f91-d1431a93c1ae\'></a>\n\n3. L5-S1 level shows subtle left central broad-based disk protrusion. Spinal canal and foramina are patent. No gross thecal sac deformity.\n\n<a id=\'d9f80cfa-7a41-4855-9308-d95d0132d65e\'></a>\n\nNote that the stability of findings cannot be determined in the absence of prior imaging for comparison/correlation.\nCorrelation with prior imaging is advised to document stability of findings described.\n\n<a id=\'79c68561-1362-45a5-a77a-a7a067cff381\'></a>\n\n-Electronically Signed by: RADIOLOGIST, ADMIN  on\n\n<a id=\'c50ff2eb-8377-413a-90b0-21a457f2cea7\'></a>\n\n06/07/2010 1:22:01 PM\n\n<a id=\'f4472384-fa63-40d6-b07f-ee87c675bded\'></a>\n\nPage 2 of 2', 'chunks': ['3c3677d1-5dba-4f48-91fd-719663f870b4', '613fcdb6-82a6-455a-9401-cb0dbebbb975', '13878c67-394d-472b-8e96-4e43ccf6a490', 'c36c0270-60e4-445f-9a11-bb93c63aa136', '92665435-32ff-4d46-b63f-12684f782dbb', 'bb927ff0-78f5-4d00-87cb-50dda3446792', '04136993-4ee0-4503-b8f1-b28edc82719f', '06366b0f-d1f3-4373-98f5-a872b608f00a', '09ba7c03-fd6d-4888-92ec-826d4c94f18e', 'e692bd72-8408-4f78-8737-89b1191d09b0', 'b08f50d5-9f82-49f1-bee3-2b32b2ff5812', 'f783b847-239e-4879-8ef9-6b0875fdd81b', 'b26c305d-8971-4447-a9f3-3e86c475adaa', 'ea7169ff-e7dd-49f3-a2ad-7aa66c6fad20', '0a069015-2af6-4308-a105-1f2810e17732', 'c0521088-587c-4c06-9fa0-beea0b017c5d', '6fe54f5d-7fd5-4925-a4df-e1a5d48ce9cc', 'dbd818a0-25d4-45c2-a27f-cc521de5f6ab', '92036285-8d2d-4cf3-ae6a-4976f786406d', 'ecc73c56-fbe8-4b7b-b83c-31c9422349d3', '1b0984f2-73a9-4ca3-b999-86f153dffffb', '94e15f89-6ee7-430a-b673-4e2cc55b7930', '209f768a-c8bf-48da-9f04-fc1d69c3501b', '19ed3606-f896-47a6-9f91-d1431a93c1ae', 'd9f80cfa-7a41-4855-9308-d95d0132d65e', '79c68561-1362-45a5-a77a-a7a067cff381', 'c50ff2eb-8377-413a-90b0-21a457f2cea7', 'f4472384-fa63-40d6-b07f-ee87c675bded']}], 'metadata': {'filename': 'mri-report.pdf', 'org_id': None, 'page_count': 2, 'duration_ms': 16204, 'credit_usage': 6.0, 'job_id': 'cmfx7w34q0000wcrdaaz45miu', 'version': 'latest'}}, 'output_url': None, 'metadata': {'filename': 'mri-report.pdf', 'org_id': None, 'page_count': 2, 'duration_ms': 17000, 'credit_usage': 6.0, 'job_id': 'cmfx7w34q0000wcrdaaz45miu', 'version': 'latest'}, 'failure_reason': None}

Markdown content saved to a Markdown file.

3. Create a JSON Extraction Schema

As a reminder, we want to extract these fields from the MRI Report: Exam Date and Procedure. To do this, create a JSON extraction schema that identifies these fields. We will use this JSON file when we run the ADE Extract API in the next step. We’ve created the JSON schema below for you to use. Copy and save it as a JSON file.

{
"type": "object",
  "title": "Extracted Exam Information Schema",
  "required": [
    "exam_date",
    "procedure"
  ],
  "properties": {
    "exam_date": {
      "type": "string",
      "title": "Exam Date",
      "format": "YYYY-MM-DD",
      "pattern": "^(0[1-9]|1[0-2])/([0-2][0-9]|3[01])/\\d{4}$",
      "description": "The date on which the exam was performed, formatted as mm/dd/yyyy."
    },
    "procedure": {
      "type": "string",
      "title": "Procedure Name",
      "description": "The name of the procedure performed as stated in the report."
    }
  }
}

To learn more about extraction schemas and how to create them, go to Overview: Extract Data.

4. Extract Fields from Markdown

Now that we have the parsed output in a Markdown file and a JSON extraction schema, we’re ready to extract these fields: Exam Date and Procedure. To do this, run the script below.

import requests

headers = {
    'Authorization': 'Bearer YOUR_API_KEY'
}

url = 'https://api.va.landing.ai/v1/ade/extract'

# Read the schema file as string
with open('schema-mri-report.json', 'r') as f:
    schema_content = f.read()

# Prepare files and data
files = {'markdown': open('markdown-mri-report.md', 'rb')}
data = {'schema': schema_content, 'model': 'extract-latest'}

# Run extraction
response = requests.post(url, files=files, data=data, headers=headers)

# Return the results
print(response.json())

The extracted fields and other metadata are included in the API response:

{
   'extraction':{
      'exam_date':'05/20/2010',
      'procedure':'MRI of lumbosacral spine without/with IV contrast.'
   },
   'extraction_metadata':{
      'exam_date':{
         'chunk_references':[
            'c36c0270-60e4-445f-9a11-bb93c63aa136',
            'dbd818a0-25d4-45c2-a27f-cc521de5f6ab'
         ],
         'value':'05/20/2010'
      },
      'procedure':{
         'chunk_references':[
            'bb927ff0-78f5-4d00-87cb-50dda3446792'
         ],
         'value':'MRI of lumbosacral spine without/with IV contrast.'
      }
   },
   'metadata':{
      'filename':'markdown-mri-report.md',
      'org_id':None,
      'duration_ms':2312,
      'credit_usage':1.7,
      'job_id':'phwc2sbcax1w1g9scchcmpshe',
      'version':None
   }
}

Run Parse Jobs with Our Libraries

Click one of the tiles below to learn how to run Parse Jobs with our libraries.

Python Library

Run Parse Jobs with our Python library.

TypeScript Library

Run Parse Jobs with our TypeScript library.

The legacy agentic-doc library does not support Parse Jobs.

Get Started

Client Libraries

Parsing

Split

Extraction

General

Security

Administration

Agentic Document Extraction on Snowflake

Legacy ADE

Legacy Python Library

More from LandingAI

Parse Large Files (Parse Jobs)

Monitor Parse Jobs

Rate Limits for ADE Parse Jobs

API Reference

Save Parsed Output to a URL

When Parsed Output Is Saved to a URL

URL Requirements

Example: Use Amazon S3 Presigned URLs

ZDR Requirements

Workflow Overview

End-to-End Workflow: Parse and Extract the Output

1. Create Parse Job & Get Job ID

2. Use job_id to Get Parsing Results

3. Create a JSON Extraction Schema

4. Extract Fields from Markdown

Run Parse Jobs with Our Libraries

Python Library

TypeScript Library

Get Started

Client Libraries

Parsing

Split

Extraction

General

Security

Administration

Agentic Document Extraction on Snowflake

Legacy ADE

Legacy Python Library

More from LandingAI

​Monitor Parse Jobs

​Rate Limits for ADE Parse Jobs

​API Reference

​Save Parsed Output to a URL

​When Parsed Output Is Saved to a URL

​URL Requirements

​Example: Use Amazon S3 Presigned URLs

​ZDR Requirements

​Workflow Overview

​End-to-End Workflow: Parse and Extract the Output

​1. Create Parse Job & Get Job ID

​2. Use job_id to Get Parsing Results

​3. Create a JSON Extraction Schema

​4. Extract Fields from Markdown

​Run Parse Jobs with Our Libraries

Python Library

TypeScript Library

Monitor Parse Jobs

Rate Limits for ADE Parse Jobs

API Reference

Save Parsed Output to a URL

When Parsed Output Is Saved to a URL

URL Requirements

Example: Use Amazon S3 Presigned URLs

ZDR Requirements

Workflow Overview

End-to-End Workflow: Parse and Extract the Output

1. Create Parse Job & Get Job ID

2. Use job_id to Get Parsing Results

3. Create a JSON Extraction Schema

4. Extract Fields from Markdown

Run Parse Jobs with Our Libraries