Let's clear something up before we start. When folks search for how to export PDF to Excel, they almost always mean the opposite of what the phrase literally suggests. They have a PDF (a bank statement, an invoice, a sales report, a scraped table) and they want that data sitting inside an Excel workbook so they can sort it, filter it, sum it, and chart it. That's the job we're solving here.
The good news? You've got six solid ways to pull data out of a PDF and into Excel, ranging from a built-in Excel feature you may not know exists to OCR tools that read scanned paper. The right method depends on the type of PDF, your software, and how often you'll do this.
If you've ever opened a PDF, copied a table, pasted it into Excel, and watched the whole thing land in one column, you're not alone. PDFs are designed for visual fidelity, not data exchange. They lock cells into pixel positions instead of preserving rows and columns. So the methods we'll cover all have one thing in common: they reverse-engineer the visual layout back into structured data Excel can actually use.
The fastest way to export PDF to Excel is Excel's built-in Get Data from PDF feature (Excel 2019 or Microsoft 365). Go to Data → Get Data → From File → From PDF, pick your file, then load the table you want. For scanned PDFs, run OCR first using Adobe Acrobat or Google Drive, then use the same method. For deeper background, check our how to convert PDF to Excel walkthrough.
Before you pick a method, you need to know what kind of PDF you're dealing with. PDFs aren't all built the same way, and the type drives which tools will work for you. A bank statement generated by online banking software is a totally different beast from the same statement printed and scanned. The data might look identical to your eyes, but to Excel one is a cooperative friend and the other is a wall of pixels.
Spend ten seconds figuring out which type you have before you try any method. It will save you from wasting time on tools that physically cannot read your file.
How can you tell which type you've got? Open the PDF, try to highlight a row of text with your cursor. If text gets selected, it's native. If your cursor draws a box but nothing highlights, it's a scanned image and you'll need OCR before any conversion tool will recognize the data. Form PDFs are obvious because they have visible empty fields you can click into and type in.
One more quick test: open the PDF and use Ctrl+F to search for a word you can see on the page. If search finds it, the file has a real text layer. If search comes up empty even though the word is right there in front of you, you're looking at an image. That same test tells your conversion tool whether it has anything to work with.
Now to the methods. We'll go through six in order from fastest to most flexible. Try them top-down โ the first one that works for your file is the one to stick with. There's no point reaching for Adobe Acrobat or a Python script if Excel can do the job in 30 seconds.
Method 1: Excel's Built-In "Get Data from PDF". This is buried under the Data tab and most people don't know it exists. It uses Power Query under the hood to scan a PDF, identify tables, and let you load them into a worksheet. Microsoft added it back in 2020 and it's quietly become one of the most useful features in modern Excel. The accuracy on simple tables is genuinely impressive.
You need Excel 2019 or Microsoft 365 (desktop). Excel for the Web doesn't include this feature.
Then click Get Data → From File → From PDF.
A loading bar appears while Excel scans the file for tables.
On the left you'll see Table001, Table002, and so on. On the right is a preview of each table.
Tick the boxes next to the tables. Click Transform Data to clean it up first, or Load to drop it straight into a sheet.
Spot-check totals and row counts against the PDF. Power Query sometimes splits tables that span pages.
That's it. For a clean text-based PDF with simple tables, you'll be done in under a minute. For more conversion options and edge cases, our broader PDF to Excel converter guide goes deeper.
One thing to watch: when Excel detects multiple tables on a page, it'll list them as Table001, Table002, Table003 and so on. Sometimes what you wanted as one continuous table got split into three. You can fix that in Power Query with an Append step, which stacks tables on top of each other. We'll cover that in Method 5.
Method 2: Adobe Acrobat Export. If you own Adobe Acrobat Pro, this is the most reliable option for complex layouts. It costs around $14.99 to $19.99 a month, but the conversion engine handles multi-page tables, merged cells, and odd formatting better than just about any tool out there. Adobe built the PDF format and they understand its quirks better than anyone else.
Right-click the PDF and choose Open with → Adobe Acrobat. If you don't have Acrobat Pro, the free Adobe Reader won't show export options. You can grab a 7-day free trial from Adobe's site to test it.
In the right-hand panel click Export PDF. Choose Spreadsheet as the format, then Microsoft Excel Workbook (.xlsx). You can also pick legacy XLS or XML if you need them.
Click the gear icon next to Spreadsheet. You can choose to put each table on its own worksheet, detect numeric formats, run OCR on scanned pages, and pick a language. Set what you need and click OK.
Click Export, choose a save location, and Acrobat does the rest. Open the resulting XLSX in Excel and check that totals match the source. Mismatches usually point to merged cells or columns that ran together.
Acrobat shines on bank statements, multi-page financial reports, and anything with currency symbols or negative numbers in parentheses. The tradeoff is the subscription cost. If you only need it once, grab the 7-day free trial. If you need it monthly, the subscription pays for itself the first time it saves you an hour of manual cleanup.
The XLSX export from Acrobat usually preserves more of the original formatting than Excel's built-in tool. Currency symbols stay attached, dates parse correctly more often, and merged cells are handled with more grace. It's not perfect, but it's a noticeable step up for messy files.
Method 3: Online PDF to Excel Converters. Sites like Smallpdf, ILovePDF, PDF2Go, and PDFTables let you drop a file in the browser and download an Excel version. They're convenient and most have a free tier (usually one or two files a day, capped at a small page count). Premium tiers run $5 to $10 a month for unlimited conversions.
One big caveat with online tools: they upload your file to a third-party server. For a public report, no problem. For an HR record, payroll data, or anything with personally identifiable information, skip the online route. Many free converters keep your files on their servers for 24 hours or longer, and that's an audit nightmare waiting to happen. Read the privacy policy before you upload anything important.
That said, the conversion quality on online tools has gotten genuinely good. PDFTables in particular has a strong reputation for handling weird layouts that confuse Excel and Acrobat. If you're working on a non-sensitive file and the desktop methods choke, an online converter is worth a try as a fallback.
Method 4: The Copy and Paste Hack. Sometimes you don't need any tool at all. Open the PDF, drag-select the rows you want, hit Ctrl+C (or Cmd+C on Mac), then paste into Excel. Use Paste Special → Text and Excel will fire up the Text Import Wizard so you can pick column delimiters. It's the most low-tech method on the list and yet it sometimes beats fancy tools when you only need a handful of rows.
Method 5: Power Query for Power Users. Power Query is the engine behind Method 1, but you can do much more than the default load gives you. You can combine multiple PDF tables into one, transform on import (rename columns, split text, filter rows), and refresh the whole thing with a button click whenever the PDF gets updated. Think of it as a recipe Excel saves and re-runs whenever you ask.
This shines on monthly recurring jobs. Set it up once, and next month you swap in the new PDF, hit Refresh, and your worksheet rebuilds itself. No more manual export, no more re-applying the same cleanup steps. The query remembers everything you did the first time and replays it.
To get there, instead of clicking Load in the Power Query Navigator, click Transform Data. That opens the Power Query Editor. From there you can rename columns by double-clicking the header, change types via the small data-type icon, filter rows, split columns by delimiter, merge tables together, and a hundred other operations. Click Close & Load when you're done and Excel saves both the data and the recipe.
Method 6: OCR for Scanned PDFs. If you tried to highlight text and got nothing, your PDF is an image. No converter will read it as data until you run OCR (Optical Character Recognition). Adobe Acrobat has OCR built in (Tools → Scan & OCR → Recognize Text). Free options include Microsoft OneNote (paste an image and right-click → Copy Text from Picture), Google Drive (upload PDF, right-click → Open with Google Docs), and Tesseract on the command line. Each has tradeoffs in accuracy, speed, and ease of use.
For a quick free OCR pass, Google Drive is hard to beat. Upload your scanned PDF, right-click and choose Open with → Google Docs. Google's OCR engine processes the file and returns a Doc with the extracted text. Copy and paste that into Excel, or save the Doc as a Word file and use Word's table-detection features. It's not perfect with complex layouts but for plain text and simple tables it punches well above its weight.
Once OCR has converted the image to selectable text, save the PDF and run it through Method 1 or Method 2. Quality depends heavily on the scan: 300 DPI clean black-and-white scans give near-perfect results; faded photocopies and skewed phone snaps will need cleanup. If your source is bad, no OCR engine on earth will save you โ you'll be retyping numbers by hand.
One pro tip: before you OCR, straighten any skewed pages. Most PDF tools have a deskew option. A page that's tilted by even 2 or 3 degrees can throw off character recognition badly, while a deskewed version of the same page reads almost perfectly. It's a 30-second step that often pays for itself ten times over.
So which method should you use? Here's a quick decision flow you can run in your head before you even open the file. The whole point is to skip methods that won't work for your file type and software stack, and land on the right one in about 10 seconds.
Try selecting a row of text. If it highlights, move to Step 2. If it doesn't, run OCR first using Adobe, Google Drive, or Tesseract.
Yes? Try Get Data from PDF first โ it's free and fast. No? Jump to Adobe Acrobat or an online converter.
Yes? Build a Power Query that you can refresh. No? A one-off load is fine.
Spot-check a few totals against the source PDF. Power Query and Acrobat both occasionally drop or duplicate rows on weird layouts.
Now let's talk about what doesn't translate well. Even the best tool struggles with certain PDF layouts, and knowing the limits saves you frustration. Simple tables โ columns of numbers, rows of text, clear borders โ convert beautifully. The trouble starts when designers get creative with layout, when reports were laid out in InDesign rather than generated from a spreadsheet, or when the original data was already a mess before it became a PDF.
Here's a useful rule of thumb. The closer the PDF is to its original spreadsheet, the better the conversion will be. A PDF that started life as an Excel file and was simply printed to PDF is the easy case. A PDF that was assembled in Word with manually-typed tables, or laid out in a design tool, is the hard case. You'll have to manage your expectations accordingly.
If your file has a few of those problem features, plan for cleanup time after the import. Don't expect a tool to solve every layout โ it can't. Many users go the other way too; if you need to send your spreadsheet out as a polished file, our convert Excel to PDF guide walks through the reverse direction with formatting tips.
After the import, you'll often hit a few common pitfalls. These show up so consistently that we've made a checklist of cleanup steps you should run on every freshly-imported PDF data set. Treat it as a routine โ five minutes of cleanup beats hours of head-scratching when your SUM formulas return zero because every number is secretly a text string.
The biggest gotcha is numbers stored as text. Excel's SUM, AVERAGE, and other math functions silently ignore text cells, so your grand total can come back wildly wrong with no error message. Always validate your imported totals against a known good number from the source PDF before you trust the data for anything important.
Speed matters too if you're doing this often. Here's a rough sense of how long each method takes for a 100-page PDF โ these aren't exact but they'll set expectations. Real-world times depend on your CPU, the complexity of the PDF, and how much cleanup you do afterward. A simple invoice converts almost instantly with any tool; a 200-page financial report with merged cells will take longer no matter what you use.
What about Excel for the Web? Sorry, no luck. The browser version of Excel doesn't include Get Data from PDF or Power Query, so you'll need either the desktop app or one of the other methods. Microsoft has signaled they may bring some of these features to the web version eventually but as of today it's a desktop-only feature. If you only have web access, your options narrow to Adobe Acrobat (also has a web version), online converters, or the copy-paste hack.
Mac users do have the built-in option as of M365 Mac 2021 โ same workflow as Windows, same Data tab path. The keyboard shortcuts and menu structure match the Windows experience pretty closely. If you're on an older standalone Mac version like Excel 2019 for Mac, you don't have access; upgrade to a Microsoft 365 subscription to get it.
For sensitive data, privacy is the call you have to make. Don't upload payroll, medical records, or financial PDFs to free online converters โ read their terms, many keep files for 24 hours minimum. If you're under HIPAA, GDPR, or any compliance regime, stick to desktop tools that process locally. The risk of a data breach from a free conversion site is small but non-zero, and the consequences for regulated data are big enough that the math doesn't favor uploading.
Want to automate the whole pipeline? You've got options that scale way beyond manual exports. If you're processing dozens of PDFs a week, the time you spend on automation will pay back many times over compared to clicking through Power Query each time.
Power Automate (Microsoft 365) can watch an inbox for an attached PDF, trigger an extract action, save the data into a SharePoint Excel file, and email a confirmation โ all without you touching it. The setup takes maybe an hour for a typical workflow but then runs forever in the background.
VBA macros can drive Adobe Acrobat through its API to bulk-export hundreds of PDFs overnight. You write the macro once, point it at a folder, and let it grind. Great for batches of structured invoices or statements that all follow the same template.
Python with libraries like pdfplumber, tabula-py, or PyPDF2 gives you total control if you're comfortable with code. It's great for building a recurring data pipeline that grabs files from S3 or a network share, extracts tables, transforms them, and pushes the results into a database or warehouse. The learning curve is steeper but the ceiling is much higher.
Looking for free alternatives to Adobe Acrobat? Try PDFgear, PDF-XChange Editor, or Foxit Reader. They each have free tiers with export features (sometimes limited to a few pages). Quality varies; test on a sample file before committing to one for production work. PDFgear in particular has gotten strong reviews lately for its export quality, and it's genuinely free with no upsells crammed into the workflow.
The catch with most free alternatives is that you'll hit a feature wall eventually. They'll handle conversion fine, but if you need OCR, batch processing, or table merging, you'll bump into a paywall. For occasional use the free tier is plenty. For daily work, the math usually favors paying for either Acrobat Pro or a Microsoft 365 subscription.
And finally, the errors. Some of these will catch you off guard the first time but they're all fixable. None of these problems mean your tool is broken; they just mean PDFs are tricky and you need to know which knob to turn.
Your PDF is almost certainly a scanned image, not a text-based file. Run OCR (Adobe Tools → Recognize Text, Google Drive → Open with Docs, or Tesseract) and try the conversion again.
The conversion picked the wrong delimiter. In Excel, select the column → Data → Text to Columns → Delimited → tick the right separator (comma, tab, space, or pipe). Preview the result before clicking Finish.
You'll see a green triangle in the corner of each cell. Click the warning icon and choose Convert to Number. For a whole column, multiply by 1 in a helper column, then paste-special as values back over the original.
The PDF has merged cells or uses graphics where you expected tables. Try a different method (Adobe usually handles complex layouts better than Excel built-in). If nothing works, you may need to manually reconstruct the layout in Excel.
Use =DATEVALUE(A1) in a helper column to parse, then paste-values back. If DATEVALUE returns #VALUE!, the cell is text in a regional format Excel doesn't recognize โ try TEXT-to-COLUMNS with the Date type set explicitly.
One more reminder before we wrap up: the workflow runs both ways. Once you've cleaned up your data and built reports, you'll often need to share them as PDFs again. Our Excel to PDF guide covers print areas, page breaks, and hyperlink preservation for that direction. Many people end up doing both โ pulling source data from PDFs, working with it in Excel, then publishing polished PDF reports back out.
Pick a method, validate your results, and you'll never dread a PDF data dump again. Most jobs we've seen converted in under a minute once you know the path. The trick is matching the method to the file type and your software stack. Text-based PDF + Excel 365 = built-in tool. Scanned PDF = OCR first. Recurring monthly job = Power Query. Sensitive data = desktop only. Once you internalize that mapping, the choice becomes automatic.
If you take just one habit away from this guide, make it validating totals. Every method we covered occasionally drops, duplicates, or misreads cells. A two-minute spot-check against the source PDF catches 95% of those errors before they become a problem in your downstream report. The fastest tool in the world isn't worth much if you're handing off bad data โ and a bit of paranoia at the validation step is one of the cheapest insurance policies you can buy.