Logo Airparser Knowledge Base Logo Airparser Knowledge Base
  • Back to Airparser.com
  • Blog
  • Contact Us
Logo Airparser Knowledge Base Logo Airparser Knowledge Base
  • Back to Airparser.com
  • Blog
  • Contact Us
  • Getting started
    • ✅ What is Airparser
    • 📄 Use Cases
    • ❓ General Help & FAQ
    • 🤔 What's the difference between Parsio and Airparser?
  • Importing Documents
    • 📥 Data Import and Supported Formats
    • 📝 Creating an Extraction Schema
  • Data Extraction
    • ⚙️ LLM Engines: Text & Vision
    • 🔗 Extracting URLs From an Email or HTML Document
    • 📦 Extracting XML and XML ADF
    • 🚧 Parsing Tips and How to Fix Common Issues
  • Data Export & Integrations
    • 🏷️ Meta Fields
    • 📄 Download to File
    • 📄 Exporting Data to Google Sheets
    • 🔁 Webhooks
    • 🔁 Zapier Integration
    • 🔁 Make Integration
  • Post-processing
    • 🐍 Data Post-processing
    • 🐍 Examples and Code Snippets
    • 🐍 Loops in Python
  • Account & Billing
    • 👫 Invite Members to Your Team
  • Public API
    • 👨‍💻 Public API
  1. Overview
  2. Data Extraction

Data Extraction

  • ⚙️ LLM Engines: Text & Vision

    Text Engine Historically, Airparser has utilized a single engine, which we refer to as the "Text engine". When users upload documents (such as emails, PDFs, scanned images, or Word documents), we execute a complex series of data pre-processing, preparati ...

  • 🔗 Extracting URLs From an Email or HTML Document

    By default, Airparser doesn't always parse hidden URLs of links, buttons, and images. You can activate the email and image parsing features from the Inbox Settings page > "Advanced Settings" section. Reparse your documents to see changes. Alternati ...

  • 📦 Extracting XML and XML ADF

    Airparser supports automatic XML extraction from emails and documents. This feature allows you to extract structured XML data without needing an extraction schema or using large language models (LLMs). If your email or document already contains structure ...

  • 🚧 Parsing Tips and How to Fix Common Issues

    Why are only some pages of my document being parsed? The LLM-powered parser operates within a defined context window, which limits the maximum document size it can process for data extraction. Currently, a key limitation of all LLM engines is their inab ...

© 2025 Airparser Knowledge Base
  • Terms
  • Data protection
  • GDPR