🎓 Find and Delete Duplicates in Excel: Complete Guide 2026

Learning how to find and delete duplicates in Excel is one of the most valuable data-cleaning skills you can build, whether you manage customer lists, financial records, inventory data, or survey responses. Duplicate values inflate totals, distort averages, corrupt VLOOKUP results, and break dashboards. Microsoft Excel offers several built-in tools — Remove Duplicates, conditional formatting, COUNTIF formulas, advanced filters, and Power Query — each suited to a different situation. This guide walks through every method with step-by-step instructions, real examples, and the trade-offs of each approach so you can clean data confidently.

The fastest method is the built-in Remove Duplicates command on the Data tab. It scans the columns you select, keeps the first occurrence of each unique row, and permanently deletes the rest. You can apply it to a single column to enforce uniqueness on email addresses, or to multiple columns when only the combination of values defines a duplicate. Before you click the button, however, it is critical to back up your worksheet because the action cannot be undone after the file is saved and closed.

For situations where you want to inspect duplicates before deleting them, conditional formatting is the safer choice. Highlighting repeated values in red lets you scan the dataset, decide which rows to keep, and remove duplicates manually. This visual approach pairs well with the remove duplicates excel feature because it gives you a preview of how many records will disappear. It is also useful when duplicates are partial — for example, the same customer name with slightly different spellings or extra spaces.

Excel formulas such as COUNTIF, COUNTIFS, and the newer UNIQUE function give you programmatic control. COUNTIF flags how many times each value appears, UNIQUE returns a deduplicated list as a dynamic array spill, and FILTER combined with COUNTIF can extract only the rows that repeat. Power Query, available in Excel 2016 and later, removes duplicates from imported tables non-destructively and refreshes automatically when source data changes, which is ideal for recurring reports.

Knowing which method to choose depends on three things: whether you need to preserve the original data, whether the duplicates are exact or fuzzy, and whether the cleanup is one-time or repeatable. Throughout this guide you will see specific keystrokes, formula syntax, and real-world examples — including how to clean a sales pipeline of repeated leads, deduplicate an email subscriber list, and remove duplicate transactions from a bank export. By the end you will have a complete toolkit for handling duplicates in any spreadsheet.

Before diving into the methods, take a moment to define what a duplicate means in your dataset. Two rows might look identical to the eye but differ by an invisible trailing space, a different capitalization, or a date stored as text in one row and a serial number in another. Cleaning those inconsistencies first with TRIM, UPPER, or VALUE will dramatically improve the accuracy of every deduplication technique covered below and prevent the frustrating outcome of seeing duplicates remain after you thought they were removed.

Whether you are a beginner using Excel for the first time or an analyst maintaining enterprise workbooks, mastering duplicate management saves hours of manual work and protects the integrity of every report you build. The techniques below work across Excel 2016, 2019, 2021, Microsoft 365, and Excel for the Web, with notes on platform-specific differences where they matter. Bookmark this page as a reference and revisit each method as your data-cleaning needs grow more sophisticated and your spreadsheets get larger.

Duplicate Management in Excel by the Numbers

📊

1M+

Rows Excel Can Process

⏱️

<3 sec

Remove Duplicates Speed

🎯

Built-in Methods

🔄

2016+

Power Query Native Support

⚠️

100%

Action is Irreversible

Try Free Excel Practice: Find and Delete Duplicates Skills

Five Methods to Remove Duplicates in Excel

🖱️

Located on the Data tab, this one-click tool permanently deletes duplicate rows based on the columns you select. Fastest method for clean, structured tables where you do not need to preserve the original.

🎨

Highlights duplicate cells with color so you can review before deleting. Found under Home > Conditional Formatting > Highlight Cells Rules > Duplicate Values. Ideal for visual inspection of small to medium datasets.

📐

COUNTIF flags repeats with a count column, while UNIQUE (Microsoft 365) returns a clean list. FILTER with COUNTIF extracts only duplicates. Non-destructive and dynamic — original data is preserved.

🔍

Under Data > Advanced, choose Copy to another location and check Unique records only. Outputs a deduplicated copy to a new range without altering the source data, which is safer than the Remove Duplicates button.

⚡

Load data into Power Query, right-click columns, and choose Remove Duplicates. Refreshes automatically when source data updates. Best for recurring imports, large datasets, and repeatable cleaning workflows.

The Remove Duplicates button is the most direct way to find and delete duplicates in Excel. Begin by clicking any cell inside your data range, then go to the Data tab on the ribbon and click Remove Duplicates in the Data Tools group. Excel will open a dialog box showing every column in the selected table with checkboxes. The columns you check define what counts as a duplicate — if you check only Email, Excel deletes any row whose email matches an earlier row even if the names differ. If you check all columns, only fully identical rows are removed.

One common mistake is forgetting to include the header row. The dialog has a My data has headers checkbox at the top right. If your table has column titles in row 1, make sure this is checked so Excel does not treat the headers as data. After confirming your choices, click OK and Excel will display a summary popup telling you how many duplicates were found and removed and how many unique values remain. This summary disappears as soon as you click OK, so screenshot it if you need an audit trail.

For more complex scenarios, you can combine Remove Duplicates with sorting. Sort your data first by the column that should win — for example, by date descending — so the most recent record is at the top. Because Remove Duplicates always keeps the first occurrence, sorting effectively lets you choose which version of a duplicate survives. This trick is essential when deduplicating customer records where you want to keep the most recent contact information rather than the oldest entry that appears first in the original file.

A frequent source of confusion is that Excel treats values as duplicates only when they match exactly, character for character. Leading or trailing spaces, different capitalizations of email domains, and curly quotes versus straight quotes all prevent matches. Run TRIM and LOWER on text columns before deduplication, or use Find and Replace to strip non-printing characters. If your data was pasted from a website, hidden characters like non-breaking spaces (Alt+0160) are very common and invisible to the naked eye, yet they make otherwise identical entries appear different to Excel.

The Remove Duplicates button works on Tables (Ctrl+T) as well as regular ranges. When applied to a Table, the structure is preserved and totals or calculated columns recalculate automatically. This is especially handy when your data is part of a larger workflow that feeds PivotTables or charts. The Table version also supports working with how to merge cells in excel scenarios where you have flattened merged headers before deduplication — never try to remove duplicates while merged cells exist in the data range because Excel will throw an error.

If you only want to remove duplicates from a single column without deleting the entire row, the Remove Duplicates button is not the right tool because it always operates on full rows. Instead, copy the column to a new sheet, run Remove Duplicates there, and bring the result back. Alternatively, use the UNIQUE function in Microsoft 365 to spill the deduplicated values into an adjacent column. These approaches preserve the rest of your dataset while giving you a clean list of unique entries to work with separately.

Finally, always save your workbook with a new name before running Remove Duplicates on critical data. The Undo command works within a session, but once you close and reopen the file the deletion is permanent. Many analysts adopt a convention of keeping a Raw sheet untouched and doing all cleaning on a duplicate Working sheet. This habit pays dividends the first time a stakeholder asks where a missing record went — you can show them the original, audit the cleaning rules, and reproduce the deduplication transparently when questions arise.

Microsoft Excel Practice Test Questions

Prepare for the Microsoft Excel exam with our free practice test modules. Each quiz covers key topics to help you pass on your first try.

Microsoft Excel Excel Basic and Advance

Microsoft Excel Exam Questions covering Excel Basic and Advance. Master Microsoft Excel Test concepts for certification prep.

Microsoft Excel Excel Formulas

Free Microsoft Excel Practice Test featuring Excel Formulas. Improve your Microsoft Excel Exam score with mock test prep.

Microsoft Excel Excel Functions

Microsoft Excel Mock Exam on Excel Functions. Microsoft Excel Study Guide questions to pass on your first try.

Microsoft Excel Excel MCQ

Microsoft Excel Test Prep for Excel MCQ. Practice Microsoft Excel Quiz questions and boost your score.

Microsoft Excel Excel

Microsoft Excel Questions and Answers on Excel. Free Microsoft Excel practice for exam readiness.

Microsoft Excel Excel Trivia

Microsoft Excel Mock Test covering Excel Trivia. Online Microsoft Excel Test practice with instant feedback.

Microsoft Excel Advanced Data Analysis Tools

Free Microsoft Excel Quiz on Advanced Data Analysis Tools. Microsoft Excel Exam prep questions with detailed explanations.

Microsoft Excel Advanced Formula and Macro...

Microsoft Excel Practice Questions for Advanced Formula and Macro Creation. Build confidence for your Microsoft Excel certification exam.

Microsoft Excel Advanced Formulas and Macros

Microsoft Excel Test Online for Advanced Formulas and Macros. Free practice with instant results and feedback.

Microsoft Excel Basic and Advance Question...

Microsoft Excel Study Material on Basic and Advance Questions and Answers. Prepare effectively with real exam-style questions.

Microsoft Excel Creating and Managing Charts

Free Microsoft Excel Test covering Creating and Managing Charts. Practice and track your Microsoft Excel exam readiness.

Microsoft Excel Data Visualization with Ch...

Microsoft Excel Exam Questions covering Data Visualization with Charts. Master Microsoft Excel Test concepts for certification prep.

Microsoft Excel Formulas and Functions

Free Microsoft Excel Practice Test featuring Formulas and Functions. Improve your Microsoft Excel Exam score with mock test prep.

Microsoft Excel Formulas and Functions App...

Microsoft Excel Mock Exam on Formulas and Functions Application. Microsoft Excel Study Guide questions to pass on your first try.

Microsoft Excel Formulas Questions and Ans...

Microsoft Excel Test Prep for Formulas Questions and Answers. Practice Microsoft Excel Quiz questions and boost your score.

Microsoft Excel Functions Questions and An...

Microsoft Excel Questions and Answers on Functions Questions and Answers. Free Microsoft Excel practice for exam readiness.

Microsoft Excel Managing Data Cells and Ra...

Microsoft Excel Mock Test covering Managing Data Cells and Ranges. Online Microsoft Excel Test practice with instant feedback.

Microsoft Excel Managing Tables and Data

Free Microsoft Excel Quiz on Managing Tables and Data. Microsoft Excel Exam prep questions with detailed explanations.

Microsoft Excel Managing Tables and Table ...

Microsoft Excel Practice Questions for Managing Tables and Table Data. Build confidence for your Microsoft Excel certification exam.

Microsoft Excel Managing Worksheets and Wo...

Microsoft Excel Test Online for Managing Worksheets and Workbooks. Free practice with instant results and feedback.

Microsoft Excel MCQ Questions and Answers

Microsoft Excel Study Material on MCQ Questions and Answers. Prepare effectively with real exam-style questions.

Microsoft Excel Questions and Answers

Free Microsoft Excel Test covering Questions and Answers. Practice and track your Microsoft Excel exam readiness.

Microsoft Excel Trivia Questions and Answers

Microsoft Excel Exam Questions covering Trivia Questions and Answers. Master Microsoft Excel Test concepts for certification prep.

Microsoft Excel Workbook and Worksheet Man...

Free Microsoft Excel Practice Test featuring Workbook and Worksheet Management. Improve your Microsoft Excel Exam score with mock test prep.

Formula Approaches to Remove Duplicates Excel Tasks

📋 COUNTIF Method

The COUNTIF function counts how many times a value appears in a range, making it the simplest way to flag duplicates without deleting anything. In a helper column next to your data, enter =COUNTIF($A$2:$A$1000,A2). Any cell returning a value greater than 1 indicates a duplicate. You can then filter the helper column to show only rows where the count is 2 or higher and review or delete them manually based on business rules.

For multi-column duplicate detection, switch to COUNTIFS. The formula =COUNTIFS($A$2:$A$1000,A2,$B$2:$B$1000,B2) counts rows where both column A and column B match. This is essential when uniqueness is defined by a combination such as first name plus last name plus email. The helper column approach is non-destructive, easy to audit, and works in every version of Excel from 2007 onward, making it a universally compatible technique for any data analyst.

📋 UNIQUE Function

Available in Microsoft 365 and Excel 2021, the UNIQUE function returns a deduplicated list as a dynamic array. Simply type =UNIQUE(A2:A1000) into an empty cell and Excel spills the unique values down the column automatically. You can wrap it with SORT to alphabetize: =SORT(UNIQUE(A2:A1000)). The function updates instantly when the source range changes, making it perfect for live dashboards and summary tables that always need a current unique list.

UNIQUE also accepts a second argument for column-wise comparison and a third argument exactly_once that returns only values appearing exactly one time. =UNIQUE(A2:A1000,,TRUE) returns items with no duplicates at all — useful for finding records that appear only once in a log. Combine UNIQUE with FILTER to extract distinct rows across multiple columns: =UNIQUE(FILTER(A2:C1000,A2:A1000<>"")) gives you a clean unique table without empties or repeats.

📋 FILTER + COUNTIF

To extract only the duplicate rows from your dataset, combine FILTER with COUNTIF. The formula =FILTER(A2:C1000,COUNTIF(A2:A1000,A2:A1000)>1) returns every row where the value in column A appears more than once. This is the inverse of UNIQUE and is invaluable when you need to investigate duplicates rather than remove them — for example, identifying which customers have placed multiple orders or which transactions have been recorded twice in a bank import.

You can also reverse the logic with =FILTER(A2:C1000,COUNTIF(A2:A1000,A2:A1000)=1) to return only rows with truly unique values in column A. Both formulas spill dynamically, recalculate when source data changes, and never alter the original. This formula-driven workflow pairs beautifully with the how to create a drop down list in excel approach for letting users toggle between viewing duplicates, uniques, or all records via a single cell selector.

Remove Duplicates Button vs Formula Approach: Which is Better?

Pros

Remove Duplicates button is the fastest method for one-time cleanup
Works on any version of Excel from 2007 forward without add-ins
Handles multi-column duplicate definitions through simple checkboxes
Provides an instant summary of how many rows were removed
Integrates with Tables so structure and formatting are preserved
Combines with sorting to control which duplicate version survives
Requires zero formula knowledge — ideal for beginners and non-technical users

Cons

Action is permanent once the workbook is saved and closed
Cannot preview duplicates before deletion without a separate step
Fails silently on hidden whitespace, case mismatches, and special characters
Does not work on merged cells — must unmerge first
No audit trail of which specific rows were removed
Cannot deduplicate a single column without affecting the rest of the row
Not refreshable — must be re-run manually whenever new data arrives

Pre-Cleanup Checklist Before You Remove Duplicates Excel Data

Save a backup copy of the workbook with a clear version-dated name

Convert the data range into an official Excel Table using Ctrl+T

Run TRIM on text columns to strip leading and trailing whitespace

Use Find and Replace to remove non-breaking spaces and curly quotes

Standardize capitalization with UPPER, LOWER, or PROPER as appropriate

Verify all date columns are stored as dates, not text strings

Unmerge any merged cells inside the data range to avoid errors

Decide which columns define a duplicate and document the rule clearly

Sort by date or priority so the version you want to keep appears first

Add a helper column with COUNTIF to preview the duplicate count first

Always preview duplicates before deleting them

Before clicking Remove Duplicates, add a COUNTIF helper column and filter for counts greater than one. This three-second preview shows exactly which rows will disappear and protects you from accidentally deleting legitimate records. Many costly data errors trace back to silent removal of rows that only looked like duplicates but actually represented separate transactions, customers, or events.

Conditional formatting is the visual counterpart to formula-based duplicate detection and one of the most beginner-friendly ways to spot repeated values without writing a single formula. Select the range you want to inspect, go to the Home tab, click Conditional Formatting, choose Highlight Cells Rules, and then Duplicate Values. Excel opens a small dialog letting you pick the fill color and font color for duplicates. Click OK and every repeated value lights up immediately, giving you an at-a-glance map of where the problems live in your worksheet.

This method shines when you need to investigate before deciding what to do. Unlike the Remove Duplicates button, conditional formatting changes nothing about the underlying data — it merely paints cells. You can sort or filter by color to group all the duplicates together, review each one in context, and choose which to delete manually. This level of control is essential in financial reconciliation, where a duplicate-looking entry might actually be a legitimate repeat transaction that should never be removed without human judgment.

Conditional formatting can also highlight unique values rather than duplicates by changing the dropdown in the same dialog from Duplicate to Unique. This is useful when you want to find one-off records — for example, employees who logged in only once, or products that sold only a single unit. Combining duplicate and unique formats in two different colors creates a powerful visual diagnostic that instantly classifies every cell in your dataset into one of three states: repeated, unique, or empty.

For more advanced highlighting, use a formula-based conditional format. Choose New Rule, then Use a formula to determine which cells to format, and enter =COUNTIF($A$2:$A$1000,A2)>1. This gives you the same result as the built-in duplicate rule but with full control over the range and comparison logic. You can extend the formula to multiple columns with COUNTIFS, or check duplicates across non-adjacent sheets by referencing external ranges, enabling cross-workbook duplicate detection that the built-in rule cannot perform.

One limitation to remember is that conditional formatting only flags exact duplicates. If your data contains the same name with different capitalization (john smith versus John Smith), the built-in rule treats them as different. Workaround: add a helper column with =LOWER(TRIM(A2)) and apply the conditional format to that helper instead. This normalization technique reveals fuzzy duplicates that would otherwise hide in plain sight and cause downstream reporting errors that are nearly impossible to trace back to their root cause.

Performance is another consideration. Conditional formatting recalculates with every change to the workbook, so applying it to ranges of 100,000+ cells can noticeably slow Excel. If you experience lag, limit the format to just the columns you actively need to monitor, or convert the highlighted result into static cell colors using a small macro once the cleanup is complete. Removing the conditional format afterward also helps keep file sizes manageable and prevents the format from compounding when rows are copied and pasted elsewhere.

Finally, conditional formatting works wonderfully alongside Excel's how to freeze a row in excel feature when inspecting large duplicate sets. Freezing the header row keeps column titles visible as you scroll through hundreds of highlighted cells, making it easier to interpret which field caused the duplicate flag. Pair this with filtering by color and you have a complete visual review workstation that scales from a few dozen rows up to tens of thousands without ever modifying the original data.

Power Query is the most powerful way to find and delete duplicates in Excel when your data comes from external sources or needs to be cleaned repeatedly. Available in Excel 2016 and later under Data > Get & Transform Data, Power Query loads tables, CSVs, databases, and web sources into a dedicated editor where you can apply transformations that record themselves as steps. Once you remove duplicates in Power Query, you can refresh the query whenever the source updates and the deduplication runs automatically without any further intervention from you.

To get started, click a cell inside your data and choose Data > From Table/Range. Excel opens the Power Query Editor with your data loaded. Select the column or columns that define a duplicate by Ctrl-clicking their headers, right-click, and choose Remove Duplicates. Power Query keeps the first occurrence of each unique combination and discards the rest. Click Close & Load to push the cleaned table back into Excel as a new worksheet or to refresh the existing destination, leaving your original raw data sheet completely untouched throughout the process.

One of Power Query's biggest advantages is the M language step list on the right side of the editor. Every transformation — including remove duplicates excel operations — appears as a discrete step you can click, reorder, edit, or delete. This makes the cleanup process completely transparent and auditable. If a stakeholder asks why a particular row was removed, you can replay the steps in order and demonstrate exactly how the result was produced, which is invaluable for compliance, audits, and reproducible analysis workflows.

Power Query also handles fuzzy duplicates better than native Excel tools. Before removing duplicates, you can add cleaning steps like Trim, Clean, Uppercase, or Replace Values directly in the editor. These transformations run before the duplicate check, so john smith and JOHN SMITH become the same value automatically. For even more sophisticated matching, the Group By feature with fuzzy matching options (available in newer Power Query versions) can collapse near-duplicates within a similarity threshold you control, useful for matching customer names with typos.

For recurring reports, save the query and refresh it whenever new source data arrives. Right-click the loaded table in your worksheet and choose Refresh, or set Power Query to refresh automatically every time the file opens via Query Properties. This automation transforms a manual cleaning task into a one-click process — perfect for weekly sales reports, monthly bank reconciliations, or any workflow where the same kind of duplicates appear in fresh data again and again, eliminating hours of repetitive work each reporting cycle.

Power Query also pairs beautifully with VLOOKUP and other lookup functions. After deduplicating your reference table in Power Query, the cleaned table becomes a reliable source for your vlookup excel formulas elsewhere in the workbook. Because lookups return only the first match, having a deduplicated lookup table prevents the silent wrong-result bugs that plague spreadsheets where the lookup range contains hidden duplicates. The combination of Power Query cleaning plus VLOOKUP retrieval is a foundational pattern in professional Excel work.

The learning curve for Power Query is steeper than the Remove Duplicates button, but the payoff is enormous. Once you build your first refreshable cleaning query, you will never want to go back to manual deletion for recurring tasks. Microsoft has invested heavily in Power Query across the entire Microsoft 365 suite, so the skills transfer directly to Power BI, Power Automate, and SQL Server Analysis Services, making it one of the highest-leverage skills you can develop in the modern data analyst toolkit and well worth the initial investment.

Practice Excel Formulas: Test Your VLOOKUP and UNIQUE Skills

Practical duplicate cleanup is rarely as simple as clicking a button — real datasets come with quirks that require thoughtful preparation. Before you remove a single row, take ten minutes to profile your data. Count total rows, count distinct values in each key column with COUNTA and SUMPRODUCT, and scan for obvious anomalies like empty cells or extremely long strings. This quick reconnaissance reveals whether you are dealing with a simple exact-match problem or a fuzzy-matching challenge that needs Power Query, helper formulas, or even manual review.

When working with customer or contact lists, treat email addresses as the canonical unique identifier whenever possible. Names can repeat legitimately (two different John Smiths), addresses can vary in format, and phone numbers can be entered with or without dashes. Email is the closest thing to a guaranteed unique key in most consumer datasets. Normalize emails to lowercase with =LOWER(TRIM(A2)) before deduplication and your match rate will climb significantly, eliminating dozens or hundreds of false negatives that the raw data would have hidden.

For financial or transactional data, define duplicates by a composite key such as date plus amount plus account number. Two transactions with the same amount on the same day to the same account are almost certainly a duplicate import, while two transactions with the same amount on different days are usually legitimate. Use the Remove Duplicates dialog with all three columns checked, or build a concatenation helper =A2&"|"&B2&"|"&C2 and apply COUNTIF to that single combined column. This composite-key technique scales reliably to millions of rows.

Inventory and product catalog cleanup often requires keeping the most recent record rather than the first. Sort by Last Updated descending before running Remove Duplicates so the newest version of each SKU survives. If you need to keep the row with the highest value (like the most recent price), Power Query's Group By feature handles this elegantly: group by SKU, then aggregate other columns with Max or All Rows operations. This pattern is far more reliable than trying to coordinate sorting tricks with the Remove Duplicates button on complex multi-column datasets.

Document every deduplication you perform. Add a comment or cell note recording the date, the columns used to define duplicates, and the row count before and after. This minimal audit trail saves enormous amounts of time when questions arise weeks later. Many organizations now require this kind of documentation for any data transformation that affects reported numbers, and building the habit early in your Excel career signals professionalism and protects you from being blamed for changes you did not actually make to the underlying data.

Finally, build a personal duplicate-cleanup playbook with the three or four methods you use most. For most analysts, that means a quick Remove Duplicates button workflow for ad hoc cleanup, a COUNTIF helper for previewing and auditing, conditional formatting for visual review, and a Power Query for any recurring report. Having these patterns memorized turns a frustrating chore into a thirty-second task and lets you focus your mental energy on the actual analysis your stakeholders are waiting for, not the data plumbing that nobody else sees or appreciates.

Excel Questions and Answers

How do I find and delete duplicates in Excel quickly?

The fastest method is the Remove Duplicates button on the Data tab. Click any cell in your data, go to Data > Remove Duplicates, check the columns that define a duplicate, and click OK. Excel keeps the first occurrence of each unique row and deletes the rest. Always save a backup first because the action becomes permanent once you save and close the workbook.

Can I remove duplicates in Excel without deleting the original data?

Yes. Use Advanced Filter under Data > Advanced with the Copy to another location option and Unique records only checked, which outputs a deduplicated copy to a new range. Alternatively, use the UNIQUE function in Microsoft 365 (=UNIQUE(A2:A1000)) to spill unique values into a new column, or load the data into Power Query and remove duplicates there non-destructively.

Why does Remove Duplicates leave some duplicates behind?

Excel treats values as duplicates only when they match exactly, character for character. Hidden trailing spaces, different capitalization, non-breaking spaces, or text-formatted numbers prevent matches. Clean your data first with TRIM, LOWER, and Find/Replace to strip invisible characters. Convert text-stored numbers to true numbers with VALUE. After normalization, run Remove Duplicates again and the hidden duplicates will be caught properly.

How do I highlight duplicates without deleting them?

Use conditional formatting. Select your range, click Home > Conditional Formatting > Highlight Cells Rules > Duplicate Values, choose a color, and click OK. Excel paints every repeated cell with the color you picked, leaving the data unchanged. You can then sort or filter by color to review duplicates in context and decide manually which ones to keep or delete based on business rules.

Can I remove duplicates based on multiple columns?

Yes. In the Remove Duplicates dialog, check every column that should be part of the duplicate definition. A row is considered a duplicate only when all checked columns match an earlier row. For example, checking First Name, Last Name, and Email means two rows are duplicates only when all three fields are identical. Uncheck columns whose values can legitimately differ between rows that should still count as the same record.

What is the difference between unique and distinct values?

Unique values appear exactly one time in the dataset — no repeats at all. Distinct values are one copy of each value regardless of how many times it appears. The UNIQUE function returns distinct values by default, but with the third argument set to TRUE (=UNIQUE(A2:A1000,,TRUE)) it returns only truly unique values. Remove Duplicates and Advanced Filter produce distinct lists, not unique lists.

How do I remove duplicates in Excel using a formula?

In Microsoft 365 or Excel 2021, use =UNIQUE(A2:A1000) to spill a deduplicated list into a new column. In older versions, add a helper column with =COUNTIF($A$2:A2,A2) and filter for values equal to 1 — those are the first occurrences. Copy and paste them as values to a new sheet. The formula approach preserves the original data and updates dynamically when source values change.

Does Remove Duplicates work on merged cells?

No. Excel will refuse to run Remove Duplicates on a range containing merged cells and will display an error message. Unmerge all cells in the data range first by selecting the range and clicking Home > Merge & Center to toggle merging off. After deduplication, you can reapply merging if needed for presentation, though it is generally better to avoid merged cells in any data range used for analysis.

How can I remove duplicates in Excel automatically when new data is added?

Use Power Query. Load your data via Data > From Table/Range, remove duplicates in the editor by right-clicking columns and choosing Remove Duplicates, then Close & Load. Whenever source data changes, right-click the loaded table and choose Refresh to re-run the deduplication automatically. You can also enable refresh on file open via Query Properties so the cleanup runs every time you open the workbook.

Is there an undo for Remove Duplicates in Excel?

Yes, but only within the current session. Press Ctrl+Z immediately after running Remove Duplicates to undo it. Once you save and close the workbook, the Undo history is cleared and the deletion becomes permanent. Always save a dated backup copy of the file before running Remove Duplicates on important data, or use non-destructive methods like Advanced Filter, UNIQUE, or Power Query that preserve the original.