Excel Practice Test

โ–ถ

Learning how to clean data in Excel is one of the most valuable spreadsheet skills you can develop in 2026, whether you are preparing a sales report, building a financial model, or auditing customer records. Raw exports from CRMs, accounting tools, and survey platforms almost always arrive with inconsistent capitalization, stray spaces, duplicate rows, mismatched date formats, and missing values. Without a structured cleanup process, even powerful functions like vlookup excel formulas, pivot tables, and conditional logic will return incorrect results that quietly distort decisions.

The good news is that Excel ships with a deep toolkit specifically designed for tidying messy datasets. Features like Remove Duplicates, Text to Columns, Flash Fill, Find and Replace, TRIM, CLEAN, PROPER, and Power Query collectively handle ninety percent of real-world cleaning tasks. When combined thoughtfully, they transform a chaotic export into an analysis-ready table in minutes rather than hours. The trick is knowing which tool to reach for and in what order.

Most analysts approach cleaning reactively, fixing problems as they appear in charts or reports. A better approach treats data cleaning as a deliberate first step before any analysis begins. By following a repeatable workflow, you eliminate the rework loops that plague spreadsheet teams: building a dashboard, finding bad numbers, tracing them back to the source, cleaning, and rebuilding the dashboard. A disciplined cleanup workflow pays dividends every single time you open a new file.

This guide walks through the entire process from initial inspection to final validation. You will learn how to scan a worksheet for problems, remove duplicates safely, standardize text and dates, handle missing values, split combined columns, validate inputs, and audit your changes. Along the way we cover both manual point-and-click methods and formula-driven approaches that scale to thousands of rows. Each technique includes the keyboard shortcut, ribbon path, and underlying logic so you can pick the method that fits your skill level.

Modern Excel users have an even stronger tool at their disposal: Power Query, also called Get & Transform Data. Power Query records every cleaning step as a repeatable recipe, which means next month's export can be cleaned with one click. We cover the most useful Power Query transformations later in the article, including merging tables, unpivoting columns, and removing nulls in bulk. If you process the same report regularly, mastering Power Query alone will save dozens of hours per quarter.

Before diving in, organize your workspace. Always work on a copy of the raw file, never the original, and keep a backup sheet named Raw inside the workbook so you can compare before and after. Add a Notes column or hidden sheet to log every transformation you apply. This audit trail is critical when colleagues question a figure or when you need to reproduce the cleanup six months later. With that foundation in place, let us look at exactly what clean data looks like and the metrics that prove your spreadsheet is ready for analysis.

By the end of this guide, you will have a personal cleaning checklist, an understanding of when to use formulas versus features versus Power Query, and the confidence to take any messy CSV and turn it into a trustworthy analytical asset. Whether you are a beginner discovering how to merge cells in Excel for the first time or a power user automating workflows, the techniques here scale with your needs.

Data Cleaning by the Numbers

โฑ๏ธ
60%
Of analyst time
๐Ÿ“Š
1M+
Rows per sheet
๐Ÿ”„
12
Core cleaning tools
โš ๏ธ
88%
Of spreadsheets
โšก
10x
Faster cleanup
Test Your How to Clean Data in Excel Skills

The Six-Step Data Cleaning Workflow

๐Ÿ”

Duplicate the raw sheet, freeze the header row, and scan for obvious issues like merged cells, empty columns, and inconsistent formatting before you change anything.

๐Ÿ—‘๏ธ

Use Data tab Remove Duplicates to eliminate exact matches, then apply COUNTIF or conditional formatting to catch near-duplicates from typos and extra spaces.

๐Ÿ”ค

Apply TRIM to strip whitespace, CLEAN to remove non-printing characters, and PROPER, UPPER, or LOWER to enforce consistent capitalization across name and address fields.

๐Ÿ“…

Convert text-formatted dates to true date serial numbers using DATEVALUE, force numeric columns into Number format, and audit for negative values or outliers.

โ“

Decide whether to delete, fill with a default, or leave blank. Use Go To Special to select all empty cells at once and apply a consistent treatment.

โœ…

Add Data Validation rules to prevent future errors, run a row count comparison against the raw sheet, and document every transformation in a notes tab.

Duplicates are the single most common problem in real-world spreadsheets and often the easiest to fix once you spot them. Excel offers three complementary approaches: the Remove Duplicates feature, Advanced Filter with Unique Records Only, and formula-based detection using COUNTIF or COUNTIFS. Each method has a place in a complete cleanup workflow. The built-in Remove Duplicates tool, found on the Data tab, scans selected columns and deletes rows that match exactly across every chosen field, making it perfect for obvious cases.

The danger with Remove Duplicates is that it acts permanently and silently. If your customer table has Jane Smith spelled three different ways with extra spaces, the feature will not catch them as duplicates because the strings are technically different. Always run TRIM and CLEAN across text columns first, then convert everything to a consistent case, before applying Remove Duplicates. This pre-processing alone catches the majority of near-duplicate rows that would otherwise slip through and inflate counts.

For situations where you want to identify duplicates without deleting them, use conditional formatting. Select the column, choose Home, Conditional Formatting, Highlight Cells Rules, Duplicate Values, and Excel paints every repeat in a color of your choosing. This visual approach lets you review duplicates manually, which is critical when working with financial records or unique identifiers where accidental deletion would create real business problems. Pair this with a COUNTIF formula in a helper column for an exact count of each duplicate.

Blank rows and blank cells require different treatment than duplicates. Press Ctrl plus G to open Go To, click Special, select Blanks, and Excel highlights every empty cell in your selection. From there you can delete entire rows, fill with zero, or copy a value down. For partially empty rows, where some fields are populated and others are not, decide on a business rule first. Some workflows demand that incomplete records be flagged rather than removed, especially when the missing field is non-critical metadata.

Whitespace is the silent killer of lookups and joins. A trailing space at the end of an account ID will cause an exact-match VLOOKUP to return N/A even when the value visibly looks identical. The TRIM function strips leading, trailing, and double internal spaces, leaving only single spaces between words. Wrap every text column you plan to use as a lookup key in TRIM, paste the results as values, and replace the original column. For datasets that include text-to-Excel conversion you can also explore how to convert text to Excel for cleaner imports.

Non-printing characters such as line breaks, tabs, and Unicode whitespace also break formulas and filters. The CLEAN function removes the first 32 ASCII non-printing characters, while the SUBSTITUTE function targets specific characters by code. Combine CLEAN with TRIM in a single formula: TRIM CLEAN A2. For stubborn Unicode characters like the non-breaking space, character 160, use SUBSTITUTE A2 CHAR 160 with a regular space first, then TRIM. Building this two-step formula into a helper column catches almost every invisible character problem.

Empty columns left over from exports clutter your view and confuse pivot tables. Right-click the column header and choose Delete, or select multiple columns first by Ctrl-clicking headers. Similarly, hidden rows from earlier filters can survive into your cleaned dataset. Always click Data, Clear, and remove any active filters before counting rows. A quick way to verify clean state is to compare the row count from the status bar with the value Excel reports in a COUNTA formula across your key column. Discrepancies signal hidden or filtered data.

FREE Excel Basic and Advance Questions and Answers
Practice core Excel skills from formatting and filters to advanced cleanup with detailed answer explanations.
FREE Excel Formulas Questions and Answers
Drill TRIM, CLEAN, SUBSTITUTE, and lookup formulas with realistic data-cleaning scenarios and worked solutions.

Text and Whitespace Tools Including How to Merge Cells in Excel

๐Ÿ“‹ TRIM and CLEAN

The TRIM function is the workhorse of text cleanup. It removes every leading and trailing space and collapses internal runs of spaces to a single space. Type equals TRIM A2 in a helper column, drag it down, then paste-as-values back over the original column. For deeply messy imports, wrap TRIM around CLEAN to also remove tabs, line breaks, and other non-printing characters that survive copy-paste from PDFs and web pages.

Beware of the Unicode non-breaking space, character 160, which TRIM does not touch. To handle it, use SUBSTITUTE first to replace CHAR 160 with a regular space, then nest that inside TRIM and CLEAN. The full formula looks like TRIM CLEAN SUBSTITUTE A2 CHAR 160 space. This combination handles ninety-nine percent of whitespace problems you will ever encounter in commercial datasets.

๐Ÿ“‹ Case Functions

Three functions standardize text case: UPPER converts everything to capitals, LOWER converts everything to lowercase, and PROPER capitalizes the first letter of each word. PROPER is ideal for names and addresses, while UPPER suits codes and identifiers. Apply them in a helper column, verify the results look correct, then paste as values back over the source. Note that PROPER cannot distinguish between Mc and Mac surnames, so review names manually.

For more complex transformations, Flash Fill is often faster than formulas. Type the desired output for the first two rows, then press Ctrl plus E, and Excel guesses the pattern and fills the rest. Flash Fill handles splitting names, formatting phone numbers, and extracting substrings without writing a single formula. Always spot-check the first ten and last ten rows to confirm Excel inferred the correct pattern.

๐Ÿ“‹ Merge and Split

To combine columns, use the ampersand operator or the CONCAT function. Equals A2 ampersand space ampersand B2 joins first and last name with a space between. The TEXTJOIN function adds a delimiter argument and an ignore-empty flag, which is invaluable when some source cells are blank. For learners researching how to merge cells in Excel, note that Merge and Center on the Home ribbon merges cells visually but should be avoided in data tables because it breaks sorting and filtering.

To split combined columns, use Text to Columns on the Data tab. Choose Delimited for comma, tab, or custom characters, or Fixed Width for positional data. The Flash Fill alternative also works for splitting. For repeated cleanup of the same import format, Power Query Split Column gives you a refreshable recipe that adapts to new rows automatically without re-running the wizard.

Manual Cleaning Versus Power Query Automation

Pros

  • Power Query records every step as a repeatable recipe you can rerun on next month's file
  • Handles datasets larger than Excel's worksheet limit by loading directly to the data model
  • Native support for merging tables, unpivoting, and grouping without complex formulas
  • Visual interface makes transformations easy to audit and explain to colleagues
  • Refreshable connections pull live data from CSV folders, databases, and web sources
  • Errors are surfaced row by row in a preview pane so you can fix issues before loading
  • Built-in functions for date parsing, number formatting, and text cleanup save formula time

Cons

  • Steeper initial learning curve than manual ribbon-based cleaning techniques
  • Refresh times can be slow for very large or heavily transformed queries
  • Some transformations are case-sensitive and require explicit type conversion
  • Cannot edit loaded results directly in the worksheet without breaking the refresh chain
  • Older Excel versions before 2016 require a separate add-in installation
  • Custom M language code is required for advanced transformations beyond the GUI
FREE Excel Functions Questions and Answers
Practice TRIM, CLEAN, SUBSTITUTE, VLOOKUP and other essential functions for data-cleaning workflows.
FREE Excel MCQ Questions and Answers
Multiple-choice questions covering Excel features, shortcuts, and best practices for cleaning spreadsheets.

Complete Data Cleaning Checklist

Create a backup copy of the raw file and a Raw sheet inside the workbook before changing anything
Freeze the header row and convert the dataset to a structured Excel Table with Ctrl plus T
Run TRIM and CLEAN across every text column to strip whitespace and non-printing characters
Apply consistent case using PROPER, UPPER, or LOWER based on the field type
Use Find and Replace to fix known typos and replace abbreviations with full terms
Remove exact duplicates with Data, Remove Duplicates after standardizing text
Highlight remaining near-duplicates with conditional formatting and review manually
Convert text-formatted numbers to true numbers using the VALUE function or paste-multiply by one
Force date columns into a consistent serial-number format with DATEVALUE
Use Go To Special, Blanks to find and handle missing values according to your business rule
Add Data Validation rules to prevent future invalid entries in critical columns
Document every transformation in a notes tab and verify final row counts match expectations
Convert your range to a Table first

Before applying any cleaning steps, select your data and press Ctrl plus T to convert it to an Excel Table. Tables automatically extend formulas to new rows, give every column a named header, and integrate seamlessly with Power Query. This single step prevents most of the silent reference errors that plague large cleanup projects.

Dates and numbers cause more silent cleanup failures than any other data type because they often look correct visually while being stored as text. A date column imported from a CSV may display as 03/15/2026 yet refuse to sort chronologically or feed into a date filter. To diagnose, check the column alignment: numbers and dates align right by default, while text aligns left. If your dates hug the left edge, Excel is treating them as strings and you need to convert them before any time-based analysis works correctly.

The simplest conversion is to select the column, choose Data, Text to Columns, click Next twice, choose the Date radio button, and pick the source format such as MDY or DMY. Excel parses the strings into true date serial numbers in a single pass. For formula-driven conversion, use DATEVALUE for date-only strings and a combination of DATEVALUE and TIMEVALUE for date-time strings. The result is a number you can then format using Format Cells, Date, to display in whatever style your report requires.

Numbers stored as text exhibit the same alignment clue and break SUM, AVERAGE, and almost every aggregation. Excel often shows a small green triangle in the corner of the cell. Click the warning icon and choose Convert to Number, or select the entire column and use the multiplication trick: type 1 in a blank cell, copy it, select the text-formatted column, right-click Paste Special, choose Multiply, and Excel forces every cell to its numeric equivalent. The VALUE function works too but requires a helper column.

Outliers and impossible values deserve scrutiny before you analyze. A salary column with a value of 9999999 is almost certainly a placeholder, not a real number. Use MIN and MAX to scan each numeric column, then filter for values outside expected ranges. Conditional formatting with a color scale highlights extremes visually. For financial datasets, the standard deviation formula in Excel helps quantify how far outliers sit from the mean and decide whether they reflect real variance or data-entry errors.

Currency and percentage formatting deserve special attention. A column labeled Revenue may contain raw numbers in one row and strings like dollar sign 1,234.56 in another. Use Find and Replace to strip dollar signs, commas, and parentheses, then apply Number formatting uniformly. Percentages are particularly tricky because 0.15 and 15 percent are mathematically identical only if you understand which storage convention applies. Always confirm by checking a known total against the raw source after conversion.

Negative numbers stored with trailing minus signs or surrounded by parentheses also need conversion. Use a SUBSTITUTE chain to remove trailing minus signs and convert parentheses to negatives, then wrap the result in VALUE. For accounting exports, this single transformation often unblocks an entire downstream model. Save the conversion formula in a personal macro or Power Query step so you do not rebuild it every month when finance sends a fresh export with the same quirks.

Finally, time zone and locale issues haunt international datasets. Dates may arrive in DMY format when your Excel locale expects MDY, silently swapping March 4 and April 3. Always inspect a few known dates after import and compare against the source. When importing CSVs, the Power Query Locale option lets you specify the source format explicitly, eliminating ambiguity. Document the assumed locale in your notes tab so future maintainers do not unknowingly re-import with the wrong setting.

Power Query, branded as Get and Transform Data on the Data tab, is the single most powerful cleaning tool in modern Excel and the one most users underutilize. It records every transformation as a step in an editable recipe, so when next month's report arrives, you click Refresh and Excel re-runs every cleanup operation automatically. This eliminates the rebuild-from-scratch cycle that consumes hours of analyst time. The interface uses a clear left-to-right pane showing source, applied steps, and live preview, making it easy to audit each change.

To start, click Data, Get Data, From File, and choose your source format. Power Query opens its editor in a separate window with a preview of the first thousand rows. From there, every common cleanup task has a ribbon button: Remove Duplicates, Remove Errors, Remove Rows, Replace Values, Trim, Clean, Change Type, Split Column, and dozens more. Each click adds a recorded step to the right-hand pane, which you can rename, reorder, or delete. When you are satisfied, click Close and Load to push the cleaned table into a worksheet.

One of Power Query's most underrated features is the ability to combine files from a folder. Point it at a directory of monthly exports and it stacks them into a single unified table, applying the same transformations to each. Add a new file to the folder, hit Refresh, and the new rows flow through. This pattern alone replaces hours of manual copy-paste work. For learners researching how to merge tables across worksheets, Power Query Merge Queries provides a SQL-style join experience without writing code.

Unpivoting is another Power Query superpower. Many real-world reports arrive in wide format with months or product names as column headers. Analysis tools prefer long format with one row per observation. Select the columns you want to unpivot, right-click, and choose Unpivot Columns. Power Query restructures the data into a tidy two-column attribute-value layout instantly. The reverse operation, Pivot Column, recreates wide format when you need to present results back to stakeholders.

For really large datasets that exceed the worksheet row limit of just over one million, load Power Query results directly to the data model instead of a worksheet. The data model handles tens of millions of rows efficiently and integrates with pivot tables through PowerPivot. This bypass solves one of the most frustrating constraints in classic Excel. To enable, choose Close and Load To and select Only Create Connection plus Add this data to the Data Model.

Custom columns let you write small expressions in M, Power Query's formula language. The syntax differs from worksheet formulas but follows similar logic. For example, a custom column with the expression Text.Trim and Text.Upper combined standardizes both whitespace and case in one step. Even without learning M deeply, the built-in transformations cover the vast majority of cleaning needs. Investing one afternoon in Power Query basics pays back within the first week.

Finally, document your queries with the description field on each step and a clear query name. Six months from now, you will not remember why a particular Replace Values step exists. Inline documentation makes the recipe self-explanatory and protects against accidental deletion. Treat your queries as code: name them clearly, comment liberally, and version-control the workbook when possible.

Practice VLOOKUP Excel and Formula Skills

Beyond features and formulas, successful data cleaning depends on habits. The most productive Excel users follow a consistent personal workflow regardless of the dataset. They always start by duplicating the source, converting to a Table, freezing headers, and saving a checkpoint. They scan top and bottom rows, sort each column alphabetically and numerically to surface outliers, and check the status bar count and sum against expected totals. These five-minute rituals catch problems early when they are cheap to fix.

Keyboard shortcuts dramatically accelerate cleanup. Ctrl plus T converts to Table, Ctrl plus shift plus L toggles filters, Ctrl plus G opens Go To, Ctrl plus E runs Flash Fill, Ctrl plus semicolon inserts today's date, and Ctrl plus shift plus colon inserts the current time. Memorizing even five of these shortcuts compounds time savings across every project. Pair them with Quick Access Toolbar buttons for Remove Duplicates, Text to Columns, and Data Validation to keep your most-used tools one click away.

Data Validation rules prevent dirty data from entering in the first place. Select a column, choose Data, Data Validation, and set criteria such as Whole Number between zero and one hundred, Date after today, or List with allowed values pulled from a reference range. When a user tries to enter an invalid value, Excel rejects it or shows a warning. For shared spreadsheets, this prevents the bulk of the messiness that cleanup workflows are designed to fix. Pair Data Validation with how to create a drop down list in Excel using the List criterion for the cleanest user experience.

Conditional formatting plus formulas creates a live audit dashboard. Add a column with COUNTIF to flag duplicates, IFERROR to surface broken lookups, and ISBLANK to highlight gaps. Apply color rules so problems jump off the screen. This passive monitoring lets you spot regressions instantly when new data lands without manually re-running every check. Over time, your spreadsheet becomes self-policing, alerting you to issues without conscious effort.

For collaborative teams, naming conventions matter enormously. Adopt a consistent style for sheets such as Raw, Clean, Lookup, and Output, and column headers in lowercase with underscores. Avoid spaces and special characters in header names because they complicate references in Power Query and PivotTables. A small style guide pinned to the team wiki prevents the slow drift toward chaos that affects any spreadsheet maintained by more than one person over more than a year.

Version control deserves more attention than it typically gets. Save numbered versions before major transformations, and consider using OneDrive or SharePoint version history to roll back if needed. For mission-critical workbooks, an external version-control habit such as weekly archives in a dated folder protects against silent corruption. Combine this with a changelog tab inside the workbook listing date, author, and a one-line description of every meaningful change. For more advanced workflows, browse a complete Excel functions list to identify formulas worth incorporating into your standard cleaning recipe.

Finally, learn to stop. There is a diminishing return on additional cleaning effort. Once the data supports the decision the report is meant to inform, additional polish wastes time. Define a clear threshold of acceptable quality at the start of each project: how many missing values you can tolerate, how precisely dates must be parsed, how many edge cases are worth chasing. Walking away cleanly is as important a skill as the cleanup itself, and it separates productive analysts from perfectionists who never ship.

FREE Excel Questions and Answers
Comprehensive Excel practice test covering data cleaning, formulas, formatting, and reporting workflows.
FREE Excel Trivia Questions and Answers
Fun Excel trivia covering shortcuts, history, functions, and lesser-known features for spreadsheet enthusiasts.

Excel Questions and Answers

What is the fastest way to clean data in Excel?

The fastest workflow is to convert the range to a Table with Ctrl plus T, apply TRIM and CLEAN across text columns, use Remove Duplicates on the Data tab, convert text-formatted numbers and dates to true types, and handle blanks with Go To Special. For recurring cleanups, record the steps in Power Query so you can refresh next month's file with a single click instead of repeating the manual process.

How do I remove duplicates in Excel without losing data?

Always duplicate the sheet first so the original survives. Select the data, click Data, Remove Duplicates, and check only the columns that define a true duplicate. Excel removes the second and later occurrences and keeps the first. To preview duplicates before deleting, use conditional formatting Highlight Cells Rules Duplicate Values, or add a COUNTIF helper column to see exactly how many copies of each value exist.

What does TRIM do in Excel?

TRIM removes all leading and trailing spaces from a text string and collapses internal runs of multiple spaces into a single space. It is the standard first step for cleaning imported text. Use it in a helper column with equals TRIM A2, then paste the results as values back over the original column. Note that TRIM does not remove non-breaking spaces character 160, so combine it with SUBSTITUTE for stubborn whitespace.

How do I fix dates stored as text in Excel?

Select the column, choose Data, Text to Columns, click Next twice, select the Date radio button, and choose the correct source format such as MDY or DMY. Click Finish and Excel converts the strings to true date serial numbers. Alternatively, use the DATEVALUE function in a helper column. Verify by checking that the resulting values right-align and respond correctly to date sorting and filtering.

Can Power Query clean data automatically?

Yes. Power Query records every cleaning step as a refreshable recipe. Load your source, apply transformations like Remove Duplicates, Replace Values, Change Type, and Trim through the ribbon, then click Close and Load. When new data arrives, hit Refresh and Excel re-runs every step automatically. Power Query also combines files from a folder, merges tables, and unpivots wide reports into long format without writing any code.

How do I find blank cells in Excel?

Select the data range, press Ctrl plus G to open Go To, click Special, choose Blanks, and click OK. Excel highlights every empty cell in your selection. From there you can delete the rows, fill with a value by typing then pressing Ctrl plus Enter, or apply conditional formatting to flag them. For ongoing monitoring, add an ISBLANK or COUNTBLANK formula to track gaps as new data lands.

Why does VLOOKUP return N slash A even when the value looks identical?

The most common cause is invisible whitespace. A trailing space on either the lookup value or the lookup column causes an exact-match VLOOKUP to fail because the strings differ by one character. Wrap both sides in TRIM, paste as values, and the lookup will work. Other causes include mismatched data types numbers stored as text on one side, case sensitivity issues with EXACT-based lookups, and non-printing characters that CLEAN removes.

How do I merge two columns into one in Excel?

Use the ampersand operator or the CONCAT function. Equals A2 ampersand space ampersand B2 joins two cells with a space between. TEXTJOIN goes further by accepting a delimiter and an ignore-empty flag, which prevents extra spaces when source cells are blank. For visual cell merging in headers, use Home, Merge and Center, but avoid this inside data tables because merged cells break sorting, filtering, and pivot table operations.

What is Flash Fill and when should I use it?

Flash Fill recognizes patterns in your edits and auto-fills the rest of a column. Type the desired result for the first two rows, press Ctrl plus E, and Excel guesses the pattern. It excels at splitting names, formatting phone numbers, extracting substrings, and reformatting dates. Always spot-check the first ten and last ten results because Flash Fill occasionally misinterprets patterns when the source data contains unusual edge cases.

How can I prevent dirty data from entering my spreadsheet?

Use Data Validation on critical columns. Select the column, choose Data, Data Validation, and set criteria such as Whole Number, Date, Decimal, or List. Configure an error alert with a clear message so users understand why their entry was rejected. Combine Data Validation with conditional formatting for visual feedback, and protect the worksheet so users cannot bypass the rules. This proactive approach prevents most of the messiness that cleaning workflows are designed to fix.
โ–ถ Start Quiz