Excel File Format Guide: XLSX, XLS, XLSM, CSV, and When to Use Each
Master every excel file format including XLSX, XLS, XLSM, XLSB, and CSV. Learn when to use each, conversion tips, and compatibility rules.

The excel file format you choose has more impact on your workbook than most users realize, affecting file size, formula compatibility, macro support, sharing safety, and even whether colleagues can open your spreadsheet at all. Whether you are saving a quick budget tracker with a how to create drop down list in excel validation rule, archiving a finance model with embedded VBA, or exporting data to a partner system, the extension you select determines how Excel reads, writes, and protects your work. Understanding the differences saves hours of troubleshooting.
Microsoft Excel has shipped dozens of file extensions across its history, from the original BIFF binary files of the early 1990s to the modern Open XML standards introduced in Excel 2007. The shift to XLSX marked a turning point because it replaced opaque binary blobs with a transparent ZIP archive of XML parts, allowing third-party tools to parse Excel data without owning a Microsoft license. That single architectural change powered the entire modern ecosystem of reporting, BI, and automation tools.
Most American office workers encounter only three or four extensions in daily life: XLSX for everyday spreadsheets, XLSM for files containing macros, CSV for raw data interchange, and occasionally XLSB for very large workbooks. Yet edge cases appear constantly. A vendor sends you an XLS file from a legacy ERP, a colleague exports an ODS from LibreOffice, or a Python script writes XLSX files that open with a strange warning. Each format carries different rules and quirks.
This guide walks through every relevant excel file format you are likely to encounter in 2026, from the dominant XLSX standard down to obscure but still-supported types like XLT templates and XLA add-ins. We cover size limits, feature support, security implications, and practical conversion workflows. By the end you will know exactly which extension to choose when you press Save As, and you will recognize the warning signs that suggest you have the wrong format for the task at hand.
We also dig into the format wars that matter for power users. Should you use XLSX or XLSB for a 200 MB pivot model? Is CSV truly UTF-8 safe when you double-click it in Excel on Windows? Does Google Sheets preserve XLSM macros when you re-download? These questions have concrete answers, and getting them right keeps your data intact. The wrong choice can silently strip formulas, corrupt Unicode characters, or break entire integrations.
Excel is the most widely used analytics tool on the planet, with Microsoft reporting roughly 1.1 billion users across consumer and commercial channels. That scale means file format decisions ripple far beyond your desktop. Choosing XLSX over XLS keeps your file safe from a malicious legacy macro, while choosing CSV over XLSX strips formatting your accountant might depend on. Treating file format as a strategic choice rather than a default click separates novice users from professionals.
Throughout this guide we reference real-world examples, version compatibility matrices, and concrete byte-level differences. Whether you are a financial analyst archiving quarterly close packs, a developer generating reports from a Python pipeline, or a student preparing a college assignment, you will find practical guidance for picking the right extension and avoiding the silent data losses that ruin so many otherwise solid workbooks.
Excel File Formats by the Numbers

Quick Reference: Every Excel Extension You Will Meet
The default modern workbook format since Excel 2007. Based on Open XML and stored as a ZIP container of XML parts. Supports up to 1,048,576 rows, no macros, and is safe to share with external recipients.
Identical structure to XLSX but with embedded VBA macros. Required whenever your workbook contains Sub procedures, custom functions, or form code. Triggers a security warning when opened to prevent malicious code execution.
Binary Excel workbook. Smaller and faster than XLSX for large datasets because it skips XML parsing. Excellent for files with hundreds of thousands of rows, though less compatible with non-Microsoft tools and parsers.
The pre-2007 legacy binary format limited to 65,536 rows and 256 columns. Still common in older ERPs and bank exports. Microsoft warns against opening unknown XLS files because they can contain dangerous Excel 4.0 macros.
Plain-text comma-separated values. Universally compatible but strips all formatting, formulas, charts, and multiple sheets. The format of choice for data interchange with databases, Python, R, and ETL pipelines.
The history of the excel file format mirrors the broader evolution of personal computing. Excel debuted on the Macintosh in 1985 with a proprietary binary format, then ported to Windows in 1987. For two decades Microsoft iterated on the Binary Interchange File Format, commonly called BIFF, with each Excel version bumping the BIFF revision number. By BIFF8, used in Excel 97 through 2003, the XLS file had become a sprawling collection of binary streams stored inside a Microsoft Compound Document container similar to a tiny FAT file system.
The 2007 release fundamentally restructured everything. Microsoft introduced the Office Open XML standard, eventually ratified as ECMA-376 and ISO/IEC 29500. Suddenly an Excel workbook was a ZIP archive containing XML files describing worksheets, styles, themes, and shared strings. You could rename any XLSX file to .zip, extract it, and read the XML directly. This change unlocked a generation of libraries like openpyxl, ExcelJS, Apache POI, and SheetJS that read and write Excel files without any Microsoft software installed.
The decision split the format into siblings. XLSX is the default macro-free workbook. XLSM is the macro-enabled twin, identical except for an additional vbaProject.bin binary stream and a different content type registration. XLTX and XLTM hold templates. XLAM contains add-ins. XLSB takes a different path entirely, storing the same logical content in a packed binary format that loads dramatically faster than parsing thousands of XML elements.
While Excel evolved, the broader spreadsheet ecosystem developed alternatives. The OpenDocument Spreadsheet format, ODS, became the default in LibreOffice and OpenOffice. Google Sheets stores files in its own proprietary cloud format but exports cleanly to XLSX. Apple Numbers uses its own bundle format but can export to XLSX as well. Each of these formats trades fidelity for portability, and round-tripping a complex workbook through any of them risks dropping conditional formatting, slicers, or pivot caches.
Excel also supports a long tail of import and export formats that most users never see. XPS and PDF appear under Save As for sharing read-only views. The HTML and MHT options let you publish a sheet as a web page, though the results look dated by modern standards. DBF, the dBase database format, remains in the list for GIS shapefile compatibility. XML Spreadsheet 2003 lingers from a brief experiment with text-based formats before XLSX arrived. Each entry exists for a niche, but the niche is real.
Recent versions have added Power Query and Power Pivot capabilities that store additional metadata inside the workbook package. When you build a data model in Excel 2016 or later, that model becomes a binary cache embedded in the XLSX. This is why a workbook with a data model can balloon to 100 megabytes even though the visible sheets look small. The format remains XLSX, but the payload tells a richer story than a simple grid of cells.
Looking ahead, Microsoft has been steadily pushing the cloud-native experience of Excel for the Web. Files saved to OneDrive or SharePoint use the same XLSX format on disk but gain co-authoring capabilities through a separate sync protocol. This has not changed the file format itself, which suggests XLSX will remain the dominant standard for at least another decade. Understanding it deeply is a worthwhile investment, much like learning excellent family dogs behaviors of frozen panes for navigation.
XLSX vs XLSB vs XLSM: Which Excel File Format Wins
XLSX is the gold standard for everyday spreadsheets. It uses Open XML inside a ZIP container, making it readable by virtually every modern tool from Python to PowerShell to mobile apps. The format supports the full 1,048,576-row grid, every native chart type, conditional formatting, slicers, pivot tables, and Power Query connections. It is also the format your IT department prefers because it cannot contain macros and therefore cannot execute malicious code.
The drawbacks are real but minor. XML parsing is slower than binary loading, so workbooks larger than 50 megabytes can take noticeable seconds to open and save. Compression depends on the contents, but text-heavy sheets often shrink to a quarter of their raw size while floating-point heavy data does not compress as well. For 95 percent of users the right answer is always XLSX, including when you build a workbook with a colleges of excellence level of pivot complexity.

Should You Default to XLSX for Every New Workbook
- +Universal compatibility across Excel 2007 and every later version on Windows, Mac, and Web
- +No macro security warnings, making it safe to email and share externally
- +Open XML structure is transparent and parseable by free libraries in dozens of languages
- +Supports the full feature set including pivot tables, slicers, Power Query, and data models
- +ZIP-based compression keeps file sizes reasonable for most datasets
- +Works seamlessly with OneDrive and SharePoint co-authoring
- +Recoverable when corrupted because each XML part is independent
- −Slower to open and save than XLSB for very large workbooks
- −Cannot contain VBA macros, forcing you to use XLSM instead
- −XML overhead inflates files containing many small formulas
- −Some legacy enterprise systems still require XLS exports
- −Pivot caches in data models can grow surprisingly large
- −Email gateways occasionally strip embedded objects during scanning
- −No support for some experimental preview features before they ship
Excel File Format Conversion Checklist
- ✓Confirm the destination application supports your chosen format before sending the file
- ✓Save a backup copy in your source format in case conversion strips features
- ✓Check for macros and convert XLSM to XLSX only after extracting any required automation
- ✓Verify Unicode characters survive when exporting to CSV by choosing UTF-8 encoding
- ✓Test pivot tables and slicers in the converted file because some downgrades silently drop them
- ✓Compress images inside the workbook before saving to reduce final file size
- ✓Inspect the file in the destination version of Excel before sharing widely
- ✓Document the format choice in your file naming convention or folder structure
- ✓Rebuild any chart that loses formatting after a round-trip through ODS or Google Sheets
- ✓Confirm that conditional formatting rules and data validation lists survive the conversion
Switch to XLSB when files exceed 50 MB
If your workbook crosses 50 megabytes or contains more than 250,000 formula cells, saving as XLSB instead of XLSX can cut file size by 40 percent and reduce load time by half. The trade-off is reduced compatibility with non-Microsoft tools, so use XLSB for internal heavy lifting and export to XLSX or CSV when sharing externally.
Security considerations vary dramatically between excel file format types and deserve careful attention from anyone handling sensitive data. The legacy XLS format inherited from BIFF8 supports Excel 4.0 macros, an ancient scripting capability that predates VBA and has become a favorite of malware authors. Modern Excel still executes these macros for compatibility, which is why opening an unknown XLS file from a strange sender is genuinely dangerous. Treat any unsolicited XLS attachment as suspicious until proven otherwise.
XLSM and XLSB also carry macro risks, but Microsoft has significantly hardened the experience. Files downloaded from the internet or received via email are tagged with a mark-of-the-web flag that disables macros by default. Users see a security banner and must explicitly trust the file before code runs. Enterprise administrators can enforce digital signing requirements, restrict trusted locations, or block macro execution entirely through Group Policy. Configure these settings before allowing widespread distribution of macro-enabled workbooks.
XLSX is generally considered safe because the format cannot contain VBA macros at all. However, XLSX files can include other risky payloads like external data connections, dynamic data exchange links, and embedded OLE objects. Power Query connections execute on file open and can pull from arbitrary HTTP endpoints if the trust prompt is accepted. For maximum safety, open unknown XLSX files in Protected View first and inspect the data connections through the Queries and Connections pane before clicking Enable Editing.
Compatibility across Excel versions is mostly forward but partially backward. Anything you save as XLSX in Excel 365 will open in Excel 2007 with minor differences. The catch is that newer features like dynamic array formulas, XLOOKUP, LAMBDA, and modern chart types simply will not work in older versions. A workbook saved with XLOOKUP becomes a static _xlfn prefix nightmare when opened in Excel 2016. Always check the Compatibility Checker under File, Info before sharing with users on older versions.
Cross-platform compatibility introduces additional quirks. Excel for Mac handles XLSX excellently but has historically struggled with certain VBA features, particularly anything involving Windows-specific APIs or ActiveX controls. Excel for the Web supports a broad subset of features but cannot run macros at all. Excel for iOS and Android are read and light-edit experiences. When designing a workbook for distribution, identify the target platforms first and test on each before assuming universal compatibility.
CSV files deserve their own security discussion because they are deceptively simple. A malicious CSV can embed formulas like =cmd|'/c calc'!A1 that execute on open in Excel, a technique called CSV injection. Always sanitize CSVs received from external sources by prefixing cells beginning with equal signs, plus signs, minus signs, or at signs with a single quote. Modern Excel offers some protection through the Trust Center, but defense in depth remains the right approach for any data pipeline.
Encryption deserves attention too. Excel supports AES-128 and AES-256 password protection for XLSX, XLSM, and XLSB files starting in Office 2007 and 2010 respectively. The encryption is strong when paired with a long random password, but worksheet protection without workbook encryption is trivially bypassed by anyone willing to rename the file to .zip and edit the XML. Use the File, Info, Protect Workbook, Encrypt with Password option for genuinely sensitive data, not just sheet-level protection.

Saving an XLSX file as XLS strips any data beyond row 65,536 and column 256 without warning in some versions. Always verify the row and column counts after downgrading, and prefer CSV over XLS when working with truly legacy systems that cannot read modern formats.
Practical workflows for managing excel file format choices start with understanding your audience and toolchain. For internal team collaboration on OneDrive or SharePoint, default to XLSX with co-authoring enabled. For one-off data exports to engineering or data science teams, default to UTF-8 CSV. For finance models with heavy VBA, use XLSM and store the file in a trusted SharePoint library where macros run without prompts. For massive analytical workbooks that only your team will open, XLSB delivers the best performance. These four defaults cover most situations cleanly.
Conversion between formats is mostly handled through File, Save As, but power users should learn the command-line and scripted alternatives. PowerShell with the ImportExcel module can read and write XLSX without Excel installed. Python with openpyxl, pandas, or xlsxwriter handles the same task. The unoconv utility based on LibreOffice can convert between virtually any pair of spreadsheet formats. Building format conversion into your data pipeline as a deterministic script beats relying on humans to click Save As correctly every time.
When you receive an unfamiliar format, identify it by checking the file signature rather than trusting the extension. XLSX, XLSM, and XLSB start with the bytes 50 4B 03 04, identifying them as ZIP archives. Legacy XLS files start with D0 CF 11 E0, the OLE Compound Document signature. CSV files have no signature and rely on extension and content. Tools like file on macOS and Linux or TrID on Windows can identify formats accurately when extensions lie. A simple excellent face wash habit of verifying before opening prevents many problems.
For long-term archival, choose formats based on expected lifetime. CSV is the safest archival format because plain text never goes obsolete, but you lose formulas, formatting, and multi-sheet structure. XLSX is the next best choice because Open XML is an ISO standard with multiple independent implementations. Avoid XLSB for archives because the binary format depends on Microsoft tooling. Avoid XLS for anything new because the format is being slowly deprecated and may eventually lose support entirely.
Migration projects often involve moving thousands of XLS files to XLSX. Excel itself can batch convert via VBA or Power Automate, but a smarter approach is to use a server-side tool. Apache POI in Java, openpyxl plus xlrd in Python, or the OfficeOpenXml package in PowerShell can crawl a folder tree, open each XLS, save it as XLSX, and log any conversion issues. Always test on a representative sample first because edge cases like password-protected files, broken external links, and corrupted streams require manual intervention.
Cloud-based workflows introduce additional considerations. Google Sheets imports XLSX with high fidelity but stores in its own format internally, and re-exporting to XLSX produces a different but logically equivalent file. Microsoft 365 keeps XLSX as the on-disk format throughout the cloud round-trip, preserving every feature. If you must interoperate between Microsoft and Google ecosystems regularly, standardize on CSV for data interchange and accept that visual formatting will not survive the trip.
Finally, build naming conventions that reflect format intent. Filenames like Sales_Model_v3.xlsm, Q4_Close_v2.xlsb, and customers_export_2026-05-19.csv tell the reader exactly what to expect before opening. Avoid generic names like book1.xlsx that obscure both content and purpose. Coupling clear filenames with consistent format choices produces a shared file repository where new team members can find what they need quickly and avoid format-induced confusion.
Final practical tips for working with excel file format choices in 2026 begin with the principle of least surprise. Choose formats that match the expectations of your audience and tooling, even if a different format would technically be more efficient. A finance partner who has used XLSX for fifteen years does not want to receive an XLSB just because it loads faster, and a data engineer who automates ingestion from CSV does not want to suddenly receive XLSX with merged cells and embedded charts. Predictability beats cleverness.
Standardize defaults at the team level. In Excel, navigate to File, Options, Save and set the Save Files in This Format dropdown to your team standard. For most office teams the right answer is Excel Workbook (xlsx). For analytical teams that share heavy models internally, XLSB might be the better default. Document this choice in your team wiki so new joiners adopt the same convention from day one rather than developing their own ad hoc patterns.
Train your team to recognize file format warning signs. A file size that suddenly drops by 90 percent after a save probably means features were stripped. A workbook that takes 30 seconds to open might be an XLSX that should be an XLSB. An XLSM that triggers security warnings on every open suggests the file is downloaded from outside and could benefit from being copied to a trusted location or signed with a digital certificate. Awareness of these signals prevents silent data quality issues.
When in doubt, run the built-in Compatibility Checker. Found under File, Info, Check for Issues, this tool flags features that will not survive a downgrade to older Excel versions. Run it before saving as XLS, before sending a workbook to a partner using Excel 2010, or before exporting to any non-Microsoft tool. The checker is not perfect, but it catches the obvious problems like dynamic array formulas, slicers on non-pivot tables, and modern chart types that older versions cannot render.
Keep your Excel version current. Microsoft 365 receives format-related fixes and enhancements every few months, and being on an older version increases your risk of compatibility surprises when interacting with files generated by newer versions. If your organization is locked to a specific perpetual version like Office 2019 or Office 2021, document which features are off-limits and build templates that only use compatible features. Standardized templates eliminate the entire class of compatibility incidents that otherwise drain time.
Practice with realistic file format scenarios through targeted quizzes and exercises. Excel certifications like the Microsoft Office Specialist exam include format-related questions, and even basic literacy tests for office workers often probe whether candidates understand the difference between XLSX and CSV. Treating format knowledge as a skill worth practicing pays off in faster troubleshooting, cleaner data pipelines, and fewer awkward conversations with IT when something breaks during a critical deadline.
Above all, remember that the excel file format you choose is a communication decision, not just a technical one. It signals to recipients what they can expect, what they can do with the file, and what risks they should consider. Treating it with the same care you would apply to choosing the right email subject line or document title elevates your professional standard and reduces friction across every collaboration. Format choice is a small detail with outsized impact on the daily work of millions of spreadsheet users.
Excel Questions and Answers
About the Author
Attorney & Bar Exam Preparation Specialist
Yale Law SchoolJames R. Hargrove is a practicing attorney and legal educator with a Juris Doctor from Yale Law School and an LLM in Constitutional Law. With over a decade of experience coaching bar exam candidates across multiple jurisdictions, he specializes in MBE strategy, state-specific essay preparation, and multistate performance test techniques.