How to use Lido's Data Extraction Solution: The Basics
🧾 How to Extract Data from PDFs into Spreadsheets with Lido
🎥 Watch the quick tutorial (2 min)
If your team handles a high volume of PDFs — invoices, forms, receipts, or reports — you know how time-consuming and error-prone manual entry can be.
With Lido, you can automate the entire process. This guide shows you how to extract structured data (like invoice numbers, vendor names, and amounts) directly into spreadsFeaturedAdding AI Columns
How AI Columns Work
You can add any data field to the data extractor that doesn’t exist in the source file (e.g., Invoice Category, Expense Type, GL Code). These appear in your sheet but never need to be present in the uploaded document.
Each AI column runs when a document is processed. Common examples of AI column uses:
Classification – “Classify the expense descriptions into the closest of these 4 categories: food, gas, travel, other"
Conditional coding – "If status column contaiPopularSuper-powered Extra Instructions (i.e., Directives)
Directives are deterministic instructions you add in the Extra Instructions section of your Data Extractor to fine‑tune extraction.
Each directive has a specific syntax and must appear on its own line at the bottom of the Extra Instructions.
File Extractor
@parallel:true / @parallel:false
What is this?
Controls whether the extractor treats each page separately or processes the whole document as one unit.
Use when
@parallel:true: Each page stands on its owPopularHow to process documents stored in OneDrive
Check out this video!!:PopularPost-Data Extraction: How to Push Data to Google Sheet
Check out this video!!:PopularAdding context files to the data extractor
You can upload external documents to give Lido more context about how your data should be extracted.
This is useful when your extraction rules already exist somewhere else, or when you want to show examples instead of re-typing instructions.
What Are Context Documents?
Context documents are reference files that help guide how Lido interprets and extracts data from your PDFs, emails, or other inputs.
Instead of writing long instructions in the prompt, you can simply attach:
An eSome readersHow to extract data from PDFs and documents
Every way to extract structured data from PDFs, images, scans, and emails — the spreadsheet UI, formulas, workflows, and the API — and how to choose the right one.Few readersAutomating the Data Extractor
Automating the Data Extractor
Automatically import files from Google Drive, OneDrive, or Email.
You can automatically fetch new files from a connected Google Drive folder, OneDrive folder, or email. To automatically fetch new files from your connected source, turn on the Process Files toggle.
Automatically process new files from connected drive folder
When the Process Files toggle isPopularAutomate extraction with workflows
This article walks through turning a one-off extractor into a hands-free pipeline that runs every time a new document arrives.Few readersSend extracted data to email or Slack
Once you've extracted data from a document, you usually want to do something with it — notify a team in Slack, email it to a manager, attach the original file, or push it to multiple places at once. This article covers the common notification patterns.Few readers