Airparser uses Large Language Model (LLM) engines to extract structured data from documents.Currently, the Vision engine is the default engine for all new inboxes. You can still switch to the Text engine if it better fits your use case.Below is an overvie ...
Airparser allows you to select the parsing model for each Inbox independently. This means you can fine-tune the parsing performance for different document types without affecting other inboxes.You can change the model at any time from the I ...
Human-in-the-loop review lets you verify and correct extracted data before it reaches your webhooks and integrations. Documents matching your conditions are held with the Review status until a team member approves them β so incorrect data never silently f ...
Page processing lets you choose which pages of an incoming PDF Airparser should process. The rest are discarded before parsing β and before any document split.This is useful when your PDFs contain pages you never need, such as blank backs of scanned sheet ...
Document split allows you to break a multi-page PDF into several separate documents and parse each one individually. This is useful when multiple documents β such as invoices, contracts, or statements β arrive bundled together in a single file.How to conf ...
By default, Airparser does not always extract hidden URLs inside clickable text, buttons, or images. These links often exist in the underlying HTML code of the email, so you may need to enable URL parsing explicitly.βοΈ Step 1 β Enable URL Parsing (Text En ...
Airparser supports automatic XML extraction from emails and documents. This feature allows you to extract structured XML data without needing an extraction schema or using large language models (LLMs). If your email or document already contains structure ...
Can I choose whether Airparser processes the email body or attachments?Yes. In each inbox you can control exactly what Airparser processes:Process emails and attachments (default) β parses both the email body and all attachments.Process emails, skip attac ...