Skip to main content
When you call the API endpoint (https://api.va.landing.ai/v1/ade/parse), the API response includes markdown fields at three levels:
  • Top-level markdown field: Contains the complete parsed document content as Markdown
  • Chunk-level markdown fields: Each object in the chunks array includes its own markdown field containing only the content for that specific chunk
  • Split-level markdown fields: Each object in the splits array includes a markdown field containing the content for that specific section of the document
All markdown fields use the same formatting and include embedded HTML anchor tags that link the content to specific chunks in the chunks array. These anchors enable you to trace content back to its location in the original document.

Markdown Fields in Context

To better understand how these markdown fields work together, let’s look at the parsing response for this pallet label: Pallet Label Below is the full parsing response for this document with the markdown fields highlighted. Notice that each chunk’s HTML anchor tag (the <a id='...'> element) appears consistently across all markdown fields. This consistency means you can reference any chunk by its ID, whether you’re working with the complete document Markdown, a specific split, or an individual chunk.
Markdown Fields
{
  "markdown": "<a id='69a645fe-8617-4be8-b66f-68d7788755c0'></a>\n\nSKU\nWH-2847-BLK\n\n<a id='bdaf2198-0fdc-4c94-be54-a49ca796ea0f'></a>\n\nQUANTITY\n\n48 Units\n\n<a id='65fe085e-b8c8-46e0-848c-f0c32ccf16e8'></a>\n\n<::scan_code: Barcode\n\nThis is a clear, well-defined linear barcode with distinct black bars on a white background, showing no visible quality issues.::>\n\n<a id='9e551c40-3eaa-4d77-a219-50b7f4a57e05'></a>\n\n2847 0000 4812\n---\n",
  "chunks": [
    {
      "markdown": "<a id='69a645fe-8617-4be8-b66f-68d7788755c0'></a>\n\nSKU\nWH-2847-BLK",
      "type": "text",
      "id": "69a645fe-8617-4be8-b66f-68d7788755c0",
      "grounding": {
        "box": {
          "left": 0.06628379225730896,
          "top": 0.09509064257144928,
          "right": 0.4664609134197235,
          "bottom": 0.2678269147872925
        },
        "page": 0
      }
    },
    {
      "markdown": "<a id='bdaf2198-0fdc-4c94-be54-a49ca796ea0f'></a>\n\nQUANTITY\n\n48 Units",
      "type": "text",
      "id": "bdaf2198-0fdc-4c94-be54-a49ca796ea0f",
      "grounding": {
        "box": {
          "left": 0.06750398874282837,
          "top": 0.3068091869354248,
          "right": 0.3204823434352875,
          "bottom": 0.47341251373291016
        },
        "page": 0
      }
    },
    {
      "markdown": "<a id='65fe085e-b8c8-46e0-848c-f0c32ccf16e8'></a>\n\n<::scan_code: Barcode\n\nThis is a clear, well-defined linear barcode with distinct black bars on a white background, showing no visible quality issues.::>",
      "type": "scan_code",
      "id": "65fe085e-b8c8-46e0-848c-f0c32ccf16e8",
      "grounding": {
        "box": {
          "left": 0.11207544803619385,
          "top": 0.5374823212623596,
          "right": 0.8830909729003906,
          "bottom": 0.8660760521888733
        },
        "page": 0
      }
    },
    {
      "markdown": "<a id='9e551c40-3eaa-4d77-a219-50b7f4a57e05'></a>\n\n2847 0000 4812\n---\n",
      "type": "text",
      "id": "9e551c40-3eaa-4d77-a219-50b7f4a57e05",
      "grounding": {
        "box": {
          "left": 0.28996485471725464,
          "top": 0.9088556170463562,
          "right": 0.7057502269744873,
          "bottom": 0.9888754487037659
        },
        "page": 0
      }
    }
  ],
  "splits": [
    {
      "class": "page",
      "identifier": "page_0",
      "pages": [
        0
      ],
      "markdown": "<a id='69a645fe-8617-4be8-b66f-68d7788755c0'></a>\n\nSKU\nWH-2847-BLK\n\n<a id='bdaf2198-0fdc-4c94-be54-a49ca796ea0f'></a>\n\nQUANTITY\n\n48 Units\n\n<a id='65fe085e-b8c8-46e0-848c-f0c32ccf16e8'></a>\n\n<::scan_code: Barcode\n\nThis is a clear, well-defined linear barcode with distinct black bars on a white background, showing no visible quality issues.::>\n\n<a id='9e551c40-3eaa-4d77-a219-50b7f4a57e05'></a>\n\n2847 0000 4812\n---\n",
      "chunks": [
        "69a645fe-8617-4be8-b66f-68d7788755c0",
        "bdaf2198-0fdc-4c94-be54-a49ca796ea0f",
        "65fe085e-b8c8-46e0-848c-f0c32ccf16e8",
        "9e551c40-3eaa-4d77-a219-50b7f4a57e05"
      ]
    }
  ],
  "grounding": {
    "69a645fe-8617-4be8-b66f-68d7788755c0": {
      "box": {
        "left": 0.06628379225730896,
        "top": 0.09509064257144928,
        "right": 0.4664609134197235,
        "bottom": 0.2678269147872925
      },
      "page": 0,
      "type": "chunkText"
    },
    "bdaf2198-0fdc-4c94-be54-a49ca796ea0f": {
      "box": {
        "left": 0.06750398874282837,
        "top": 0.3068091869354248,
        "right": 0.3204823434352875,
        "bottom": 0.47341251373291016
      },
      "page": 0,
      "type": "chunkText"
    },
    "65fe085e-b8c8-46e0-848c-f0c32ccf16e8": {
      "box": {
        "left": 0.11207544803619385,
        "top": 0.5374823212623596,
        "right": 0.8830909729003906,
        "bottom": 0.8660760521888733
      },
      "page": 0,
      "type": "chunkScanCode"
    },
    "9e551c40-3eaa-4d77-a219-50b7f4a57e05": {
      "box": {
        "left": 0.28996485471725464,
        "top": 0.9088556170463562,
        "right": 0.7057502269744873,
        "bottom": 0.9888754487037659
      },
      "page": 0,
      "type": "chunkText"
    }
  },
  "metadata": {
    "filename": "pallet-label.png",
    "org_id": null,
    "page_count": 1,
    "duration_ms": 4226,
    "credit_usage": 3,
    "job_id": "y2xvbwlqc1p9ynwfkx4tx7q7q",
    "version": "dpt-2-20250919"
  }
}

Markdown Structure

The markdown field includes a parsed chunk or a sequence of chunks. Each chunk begins with an HTML anchor tag containing a unique identifier, followed by the chunk content. For example, the following markdown field contains two chunks:
  • A figure chunk (ID: 4c29090b-b75e-4d5f-95b6-24a7d5668486) with a description of the image
  • A text chunk (ID: ae2e4e41-9443-4fb5-bced-199915f97dec) containing formatted address information
"markdown": "<a id='4c29090b-b75e-4d5f-95b6-24a7d5668486'></a>\n\n<::An illustration of a sun with eight rays extending outwards.: figure::>\n\n<a id='ae2e4e41-9443-4fb5-bced-199915f97dec'></a>\n\n**Eliza Smith**\n123 Main St.\nMountain View, CA 94041"

Anchor Tags

Each chunk begins with an HTML anchor tag containing the chunk’s unique identifier. The id attribute contains the UUID that matches the corresponding entry in the chunks array, enabling you to trace content back to its location in the original document.
<a id='chunk-uuid'></a>

Content Format by Chunk Type

The Markdown content format varies based on the chunk type:

Text-Based Chunks

For text-based chunks (text, marginalia), content appears as standard Markdown text:
<a id='chunk-uuid'></a>

## Heading Text

Paragraph content with **bold** and *italic* formatting.

Image-Based Chunks

Image-based chunks (figure, logo, card, attestation, scan_code) use a special delimiter format that wraps the caption or description:
<a id='chunk-uuid'></a>

<::Caption or description of the visual element: figure::>

Table Chunks

Table chunks (table) appear as HTML table markup. Most table elements include unique id attributes. These IDs use the format {page_number}-{base62_sequential_number}, where the page number starts at 0 and the sequential number increments for each element within the page. If a page contains multiple tables, the ID numbering continues sequentially across all tables on that page. This ID system allows you to trace individual cells, rows, and tables back to their locations in the original document.
<a id='chunk-uuid'></a>

<table id="0-1">
<tr><td id="0-2">Product</td><td id="0-3">Revenue</td></tr>
<tr><td id="0-4">Hardware</td><td id="0-5">15,230</td></tr>
<tr><td id="0-6">Software</td><td id="0-7">8,540</td></tr>
<tr><td id="0-8">Services</td><td id="0-9">12,750</td></tr>
<tr><td id="0-a">Licensing</td><td id="0-b">6,920</td></tr>
</table>

Spreadsheets

When you parse spreadsheets, data is identified as table chunks, and embedded images or charts are identified as figure chunks. Table chunks appear as HTML table markup. Most table elements include unique id attributes. These IDs use the format {tab_name}-{cell_reference}, where the tab name is the name of the spreadsheet tab and the cell reference uses standard spreadsheet notation (column letter followed by row number, such as A1, B2, or C3). The table itself uses a range-based ID format: {tab_name}-{start_cell}:{end_cell} (for example, Sheet 1-A1:B4). This ID system allows you to trace individual cells back to their locations in the original spreadsheet. For example, here is a screenshot of a spreadsheet, followed by the Markdown output. Spreadsheet
{
  "markdown": "<a id='Sheet 1-A1:B4-chunk'></a>\n\n<table id='Sheet 1-A1:B4'>\n  <tr>\n    <td id='Sheet 1-A1'>Program</td>\n    <td id='Sheet 1-B1'>Interest Rate</td>\n  </tr>\n  <tr>\n    <td id='Sheet 1-A2'>15 Year Fixed-Rate Mortgage</td>\n    <td id='Sheet 1-B2'>0.05125</td>\n  </tr>\n  <tr>\n    <td id='Sheet 1-A3'>30 Year Fixed-Rate Mortgage</td>\n    <td id='Sheet 1-B3'>0.05875</td>\n  </tr>\n  <tr>\n    <td id='Sheet 1-A4'>10/1 ARM</td>\n    <td id='Sheet 1-B4'>0.05625</td>\n  </tr>\n</table>",
  "type": "table",
  "id": "Sheet 1-A1:B4",
  "grounding": null
    }
For a list of supported spreadsheet types, go to Supported File Types.

Chunk Separators

Chunks are separated by double newlines (\n\n), except for the final chunk in the document.

How do I find the Markdown response for the ADE Parse Jobs API?

If you call the ADE Parse Jobs API, the API responds with the job_id. The parsing results, including the markdown field, are returned when you check the parsing job status with the Get Async Job Status API.

Markdown for Legacy API

When you call the legacy API (https://api.va.landing.ai/v1/tools/agentic-document-analysis), the response structure differs from the Parse API: Key Differences:
  • Single markdown field: The response contains only one markdown field (no chunk-level or split-level fields)
  • Different format: Instead of HTML anchor tags, chunk metadata appears as inline HTML comments after each chunk’s content
  • Same core information: The markdown field still contains the complete parsed document content and chunk IDs for reference
Despite the format differences, you can still trace content back to specific chunks using the IDs embedded in the HTML comments.

Markdown Field in Context (Legacy API)

To better understand the markdown formatting for the legacy API, let’s look at the parsing response for this pallet label: Pallet Label Below is the full parsing response for this document with the markdown field highlighted.
Markdown Fields for Legacy API
{
    "data": {
        "markdown": "SKU\nWH-2847-BLK <!-- text, from page 0 (l=0.071,t=0.098,r=0.464,b=0.263), with ID 426a3506-01d2-4e91-b201-af5f344501a7 -->\n\nQUANTITY\n48 Units <!-- text, from page 0 (l=0.072,t=0.308,r=0.312,b=0.474), with ID 2eeea9d5-17c7-4ea6-9373-b0da9e1f411d -->\n\nSummary : This image displays a barcode consisting of a series of vertical black bars of varying thickness and spacing, set against a white background. No numeric or alphanumeric string is visible below the bars.\n\nbarcode:\n  Barcode Details :\n    • No visible numeric or alphanumeric string.\n    • Barcode type not explicitly indicated; appears to be a 1D linear barcode (possibly Code 128, EAN-13, or similar).\n    • Approximately 50 vertical bars, alternating thick and thin, with consistent height.\n    • Quiet zones (blank margins) present on both left and right sides.\n    • Barcode is horizontally oriented, spanning the width of the image.\n    • No additional text, logo, or annotation present.\n\nDimensions & Placement :\n    • Barcode occupies the central horizontal region of the image.\n    • Bars are evenly spaced and extend vertically from top to bottom within the barcode area.\n\nAnalysis :\n    • The barcode is designed for machine readability, but without a visible numeric/alphanumeric string or further context, the encoded data cannot be determined.\n    • The absence of human-readable text suggests this is a generic or template barcode image, not tied to a specific product or identifier. <!-- figure, from page 0 (l=0.111,t=0.546,r=0.882,b=0.861), with ID 5ac2f680-cb7e-4a6c-8d07-617bd65cb978 -->\n\n2847    0000    4812 <!-- text, from page 0 (l=0.296,t=0.918,r=0.702,b=0.985), with ID 31cafe3e-8f00-4d3f-8e5d-f17451e4ec7a -->",
        "extracted_schema": null,
        "extraction_metadata": null,
        "chunks": [
            {
                "text": "SKU\nWH-2847-BLK",
                "grounding": [
                    {
                        "box": {
                            "l": 0.07070992887020111,
                            "t": 0.09801134467124939,
                            "r": 0.46398574113845825,
                            "b": 0.26285043358802795
                        },
                        "page": 0
                    }
                ],
                "chunk_type": "text",
                "chunk_id": "426a3506-01d2-4e91-b201-af5f344501a7",
                "rotation_angle": 0.0
            },
            {
                "text": "QUANTITY\n48 Units",
                "grounding": [
                    {
                        "box": {
                            "l": 0.0722845047712326,
                            "t": 0.3077216148376465,
                            "r": 0.3118019700050354,
                            "b": 0.47360438108444214
                        },
                        "page": 0
                    }
                ],
                "chunk_type": "text",
                "chunk_id": "2eeea9d5-17c7-4ea6-9373-b0da9e1f411d",
                "rotation_angle": 0.0
            },
            {
                "text": "Summary : This image displays a barcode consisting of a series of vertical black bars of varying thickness and spacing, set against a white background. No numeric or alphanumeric string is visible below the bars.\n\nbarcode:\n  Barcode Details :\n    • No visible numeric or alphanumeric string.\n    • Barcode type not explicitly indicated; appears to be a 1D linear barcode (possibly Code 128, EAN-13, or similar).\n    • Approximately 50 vertical bars, alternating thick and thin, with consistent height.\n    • Quiet zones (blank margins) present on both left and right sides.\n    • Barcode is horizontally oriented, spanning the width of the image.\n    • No additional text, logo, or annotation present.\n\nDimensions & Placement :\n    • Barcode occupies the central horizontal region of the image.\n    • Bars are evenly spaced and extend vertically from top to bottom within the barcode area.\n\nAnalysis :\n    • The barcode is designed for machine readability, but without a visible numeric/alphanumeric string or further context, the encoded data cannot be determined.\n    • The absence of human-readable text suggests this is a generic or template barcode image, not tied to a specific product or identifier.",
                "grounding": [
                    {
                        "box": {
                            "l": 0.11109775304794312,
                            "t": 0.5458672046661377,
                            "r": 0.8822200894355774,
                            "b": 0.861375093460083
                        },
                        "page": 0
                    }
                ],
                "chunk_type": "figure",
                "chunk_id": "5ac2f680-cb7e-4a6c-8d07-617bd65cb978",
                "rotation_angle": 0.0
            },
            {
                "text": "2847    0000    4812",
                "grounding": [
                    {
                        "box": {
                            "l": 0.29578033089637756,
                            "t": 0.91764235496521,
                            "r": 0.7021114826202393,
                            "b": 0.9852532148361206
                        },
                        "page": 0
                    }
                ],
                "chunk_type": "text",
                "chunk_id": "31cafe3e-8f00-4d3f-8e5d-f17451e4ec7a",
                "rotation_angle": 0.0
            }
        ]
    },
    "errors": [],
    "extraction_error": null,
    "metadata": {
        "parse_id": "vtw5hpu9v4t7rqefz7ncdxefs",
        "page_count": 1,
        "processed_at": "2025-10-15T20:20:07.202060+00:00",
        "pages_processed": 1,
        "processing_time_ms": 4385,
        "filename": "pallet-label.png",
        "user_id": "a0k90chtdkyd",
        "pages_rotation_angles": {
            "0": 0.0
        }
    }
}

Markdown Structure (Legacy API)

The markdown field contains a sequence of chunks. Each chunk consists of the content followed by an HTML comment containing metadata. For example, the following markdown field contains two chunks:
  • A text chunk (ID: 426a3506-01d2-4e91-b201-af5f344501a7) with SKU information
  • A text chunk (ID: 2eeea9d5-17c7-4ea6-9373-b0da9e1f411d) with quantity information
SKU
WH-2847-BLK <!-- text, from page 0 (l=0.071,t=0.098,r=0.464,b=0.263), with ID 426a3506-01d2-4e91-b201-af5f344501a7 -->

QUANTITY
48 Units <!-- text, from page 0 (l=0.072,t=0.308,r=0.312,b=0.474), with ID 2eeea9d5-17c7-4ea6-9373-b0da9e1f411d -->

HTML Comment Structure

Each HTML comment contains three pieces of information that map to the corresponding chunk object:
InformationMaps toDescriptionExample
Chunk typechunk_typeThe type of contenttext, figure, table
GroundinggroundingPage number (starting at 0) and bounding box coordinates (l=left, t=top, r=right, b=bottom)from page 0 (l=0.071,t=0.098,r=0.464,b=0.263)
Chunk IDchunk_idUnique identifier for the chunkwith ID 426a3506-01d2-4e91-b201-af5f344501a7