🧠 SQL Skills Checklist: Beginner to Advanced Guide 2026

You can spend a year reading SQL tutorials and still freeze the first time someone slides an interview laptop in front of you. The reason isn't talent. It's that SQL is a layered skill — and most learning paths jumble the layers together until you can't tell what you actually know from what you've just seen once. A checklist fixes that. Walk down it honestly. Mark what you can do cold, what you can do with a reference open, and what still looks like wizardry. You'll know exactly where to spend the next two weeks.

This guide lays out a SQL skills checklist by level — beginner, intermediate, advanced — and then points you at the practice sites that train each layer best.

We'll cover the core syntax everyone needs (SELECT, WHERE, ORDER BY, JOINs, GROUP BY), the intermediate moves that separate juniors from mids (CTEs, window functions, CASE WHEN logic), and the advanced territory that shows up in senior interviews (recursive CTEs, indexes, query plans, isolation levels). We'll also map which keywords and patterns to memorize for a SQL coding test, where to find the best cheat sheets, and how to translate "data analyst skills math statistics persuasion sql" job descriptions into a study plan that actually fits your timeline.

One framing note before we start. SQL skills aren't binary. You don't "know JOINs" — you know inner joins comfortably, you can do left joins with one foot dragging, and you've never written a self-join under pressure. The checklist below uses three tiers — confident, working, unfamiliar — so you can rate yourself honestly. Take a notebook. Score each item. Whatever lands in "unfamiliar" becomes your study queue for the next sprint.

SQL Skills Coverage at a Glance

Beginner skills (SELECT to GROUP BY)

Intermediate skills (CTEs, windows)

Advanced skills (plans, isolation)

Practice sites worth your time

Start at the beginner tier. This is the layer most self-taught learners skim over because it feels obvious — and then they discover during a coding test that they can't write a clean GROUP BY without a syntax error. Boring fundamentals win interviews. Spend a week here even if you think you're past it. The payoff is automatic syntax recall under pressure, which frees your working memory for the harder parts of the question.

Beginner SQL covers single-table querying and basic multi-table work. You should be able to SELECT specific columns, filter rows with WHERE using comparison operators and IN/BETWEEN/LIKE, sort with ORDER BY ascending and descending, and limit results with LIMIT or TOP depending on the dialect. Aggregate functions — COUNT, SUM, AVG, MIN, MAX — come paired with GROUP BY and HAVING for filtering after aggregation. Most beginner SQL questions on coding tests live entirely inside this layer. Master it and you'll handle maybe 40% of any interview's question bank without breaking a sweat.

The other half of the beginner tier is INNER JOIN and LEFT JOIN. Forget RIGHT JOIN for now — almost nobody uses it in practice, and you can always rewrite it as a LEFT JOIN with the tables flipped. What you need cold is the ON syntax, how to choose join keys, how to handle nullable columns in the right table, and the classic gotcha of WHERE clauses on left-joined tables nullifying the outer join behavior.

That last one trips up roughly one in three candidates in technical screens. If you can explain why WHERE b.col IS NULL behaves differently from putting the same condition in the ON clause, you're past the beginner threshold.

What counts as a beginner SQL skill?

Beginner SQL = anything you can write against a single table or a clean two-table inner join without opening a reference. That includes SELECT, WHERE, ORDER BY, LIMIT, basic aggregates with GROUP BY and HAVING, and INNER/LEFT JOIN with simple ON conditions. If you're stopping to look up syntax, it's not beginner yet — keep drilling.

Intermediate SQL is where the layering really starts. This tier is what mid-level data analysts and backend developers use every day, and it's where most coding tests for analyst and engineer roles concentrate their hardest questions. Subqueries come first. Correlated and non-correlated. You should be able to use a subquery in the SELECT clause, in the FROM clause as a derived table, and in the WHERE clause with IN, EXISTS, or comparison operators. The EXISTS pattern in particular shows up constantly — and writing it correctly with proper correlation is a marker that you've moved past beginner.

Then come CTEs. WITH name AS (...) — the syntax that makes long queries actually readable. CTEs are functionally similar to subqueries but they shine when you need to reference the same derived dataset more than once, or when the query logic has three or four logical steps that would be impossible to follow inline.

Practice writing a CTE that builds a daily aggregation, then a second CTE that ranks rows within each day, then a final SELECT that filters the ranked output. That's the canonical three-CTE pattern that shows up in interview questions like "find the top customer by revenue each month."

Window functions are the next big layer. ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD, SUM OVER, and the PARTITION BY / ORDER BY syntax. These are the single most-tested concept in modern SQL interviews, especially for data analyst and data engineer roles. You should be able to explain the difference between ROW_NUMBER and RANK without looking it up — ROW_NUMBER assigns unique sequential integers, RANK assigns the same integer to ties and skips the next value, DENSE_RANK assigns the same integer to ties but doesn't skip. Memorize that and the typical "find the second highest salary" question becomes a 30-second answer.

Three Skill Tiers — What's Inside Each

🔴 Beginner Tier

SELECT, WHERE, ORDER BY, LIMIT. Aggregate functions with GROUP BY and HAVING. INNER JOIN and LEFT JOIN with clean ON conditions. The skills you should never need to look up — this is your syntax muscle memory layer.

🟠 Intermediate Tier

Subqueries (correlated and not), CTEs with multiple stages, window functions ROW_NUMBER/RANK/DENSE_RANK, LAG/LEAD, CASE WHEN logic, UNION and UNION ALL. The layer where most analyst and engineer interview questions live.

🟡 Advanced Tier

Recursive CTEs, query execution plans, indexes and when to use them, partitioning strategies, transaction isolation levels (READ COMMITTED, REPEATABLE READ, SERIALIZABLE), deadlock troubleshooting. Senior-engineer territory.

🟢 Bonus — Dialect Fluency

Knowing one dialect (PostgreSQL, MySQL, SQL Server, BigQuery, Snowflake) deeply matters more than knowing all of them shallowly. Pick one, get fluent, then learn the differences between yours and the next one you'll meet.

CASE WHEN is the third intermediate skill that separates juniors from mids. You'll use it constantly to bucket continuous values into categories, to handle missing data with sensible defaults, and to build pivot-style aggregations when you don't have access to the PIVOT keyword. The pattern SUM(CASE WHEN condition THEN value ELSE 0 END) is so common in production analytics queries that you should write it from memory in under ten seconds. If it takes longer, drill it.

UNION and UNION ALL round out the intermediate tier. The distinction here trips people up — UNION deduplicates, UNION ALL doesn't. UNION ALL is almost always what you want for performance reasons unless you have a specific need to remove duplicates. Knowing which to reach for, and why, signals that you've thought about how SQL actually executes. Add column-list alignment, NULL handling across the unioned sets, and ordering of the final combined output, and you have the full UNION skill in your kit.

Now — advanced SQL. This is the tier where you stop being a SQL user and start being someone who actually understands how the database thinks. Recursive CTEs come first. The pattern WITH RECURSIVE cte AS (anchor query UNION ALL recursive query referencing cte) lets you traverse hierarchical data, build sequences, walk graphs, and solve problems that would otherwise require procedural logic. Org charts, category trees, dependency graphs, gap-and-island analysis — all recursive CTE territory.

Where to Practice Each Skill Tier

📋 Beginner Sites

SQLZoo is the classic beginner-friendly site — short interactive lessons, immediate feedback, and zero setup. W3Schools' Try SQL editor is even simpler but lighter on conceptual depth. Mode Analytics' SQL Tutorial walks through a clean curriculum with real-world business questions and a connected query engine. Start with SQLZoo for syntax drills, then move to Mode for business context. By the end of two weeks of consistent work across both sites you should clear the beginner tier.

📋 Intermediate Sites

HackerRank's SQL track has a strong mid-tier section covering joins, subqueries, and window functions, with difficulty ratings on every problem. LeetCode SQL has roughly 250 problems across easy/medium/hard tiers — the medium ones are the sweet spot for intermediate practice. StrataScratch is built specifically around real interview questions from companies like Airbnb, Amazon, and Meta, which is unbeatable for analyst-track interview prep. Pick one site, work through 50+ problems, then switch to the next for variety.

📋 Advanced Sites

Advanced practice is harder to find on a single platform — most sites top out at intermediate. The best path is to combine LeetCode's hard SQL problems with reading the PostgreSQL or SQL Server documentation cover-to-cover on topics like window function frames, recursive CTE semantics, and isolation levels. DataCamp's advanced SQL career track has decent coverage of window functions and stored procedures. For query plan reading, nothing beats running EXPLAIN on your own queries in a real database.

📋 Coding Test Prep

Most SQL coding tests for interviews use a HackerRank or CoderPad-style interface. Practice on the actual platform if you know which one your target company uses. HackerRank SQL is the most common — running through their full SQL Intermediate certification track is excellent preparation for any timed test. Mode Analytics also offers a free SQL skills test you can use as a benchmark. Time yourself. The clock pressure changes how you think.

Indexes are the next advanced skill. You need to understand what a B-tree index actually does, when adding one helps a query and when it hurts insert performance, what a covering index is, and why composite indexes are order-sensitive. The classic interview question — "this query is slow, what would you check?" — usually has "the right index isn't being used" somewhere in the answer. Run EXPLAIN on a query without an index, add the index, run EXPLAIN again, and watch the plan change. That hands-on exercise teaches you more about indexes than any tutorial.

Query execution plans are the second advanced pillar. Every major database has its own way of showing the plan — PostgreSQL has EXPLAIN ANALYZE, SQL Server has the graphical Execution Plan, MySQL has EXPLAIN with its own format. Learning to read one teaches you to read all of them. Look for sequential scans on large tables (bad), missing index usage (often bad), nested loop joins on huge inputs (sometimes bad), and hash joins or merge joins where appropriate. Knowing when a plan is reasonable and when it's pathological is what separates a senior SQL practitioner from a strong intermediate.

Partitioning is the next layer up. Table partitioning by date range, by hash, or by list — when does it help and when is it a maintenance nightmare? Most analysts don't need to design partitioned tables, but understanding the concept matters for working with them in data warehouses like BigQuery or Snowflake, where partition pruning can make a query 50x faster or 50x slower depending on whether your WHERE clause hits the partition key. Read your platform's partitioning docs once. Bookmark them.

Try a Free SQL Practice Test

Transaction isolation levels round out the advanced tier — and they're the topic most candidates skip entirely. The four standard levels are READ UNCOMMITTED, READ COMMITTED, REPEATABLE READ, and SERIALIZABLE. Each protects against a different concurrency anomaly: dirty reads, non-repeatable reads, phantom reads. PostgreSQL's default is READ COMMITTED. SQL Server's default is also READ COMMITTED. MySQL's InnoDB defaults to REPEATABLE READ. Knowing the defaults and the trade-offs comes up in senior interviews and in real production work whenever two transactions touch the same row.

Deadlock detection and resolution is a related advanced skill. A deadlock happens when two transactions hold locks that the other one wants. The database resolves it by killing one transaction — usually whichever did less work. Understanding lock ordering, how to read a deadlock graph, and how to redesign queries to avoid the problem entirely is the kind of knowledge that gets you hired as a senior database engineer rather than as a SQL analyst. Most analyst-track interviews won't probe this far, but knowing it impresses everyone in the room.

Now — cheat sheets. Every SQL learner eventually builds or downloads one, and most of the public ones are terrible. The ones worth keeping are the ones that show syntax patterns rather than vocabulary lists. "SUM, AVG, COUNT" is not a cheat sheet — that's a glossary.

A real cheat sheet shows you the full skeleton of a window function call with PARTITION BY and ORDER BY and an OVER() frame clause, with three example use cases below. The PostgreSQL official documentation has the best SQL window function reference in existence. Print the relevant pages. Keep them next to your laptop.

Your Self-Rated SQL Skills Checklist

Beginner — Write SELECT, WHERE, ORDER BY, and LIMIT from memory without any reference, including LIKE patterns, IN lists, and BETWEEN ranges

Beginner — Run GROUP BY queries with multiple aggregate functions and use HAVING to filter the aggregated results correctly

Beginner — Write INNER JOIN and LEFT JOIN with clear ON conditions and explain why a WHERE clause on a left-joined nullable column kills the outer join

Intermediate — Write correlated and non-correlated subqueries fluently, including EXISTS, IN, and NOT EXISTS patterns

Intermediate — Build a three-stage CTE that aggregates, ranks, then filters — the canonical "top N per group" structure

Intermediate — Use ROW_NUMBER, RANK, and DENSE_RANK correctly and explain the difference between them under interview pressure

Intermediate — Apply CASE WHEN inside aggregate functions to build pivot-style output without using the PIVOT keyword

Advanced — Write a recursive CTE that walks a hierarchy (org chart, category tree, dependency graph) and terminates correctly

Advanced — Read an EXPLAIN ANALYZE output and identify the costliest operation, plus explain whether an index would help

Advanced — Choose the right isolation level for a given workload and explain the trade-off between consistency and concurrency

Coding test ready — Solve 50+ medium difficulty problems on HackerRank, LeetCode, or StrataScratch within their time limits

Coding test ready — Memorize keyword precedence (SELECT, FROM, WHERE, GROUP BY, HAVING, ORDER BY, LIMIT) and execution order

A word on the job description gauntlet — "data analyst skills math statistics persuasion sql" and its variants. When recruiters list four or five skills like that, SQL is almost always the gating filter. They'll happily train you on the business communication side if your SQL is sharp. They won't hire you for the SQL role if your queries don't run.

So if you're allocating prep time across that skill bundle, give SQL twice as much weight as anything else in the list. Statistics matters too — but most analyst roles only need descriptive statistics and basic hypothesis testing concepts, which take a long weekend to refresh. SQL takes months to build properly.

Practice site strategy matters more than people realize. Don't bounce between four platforms. Pick one. Work through fifty problems systematically. Then switch when the marginal learning per problem drops off. HackerRank is the right starter for most learners because it has the cleanest difficulty ramp. LeetCode SQL is the right next stop because its medium and hard problems closely resemble what shows up in FAANG-style interviews. StrataScratch is the right finisher because its problems are pulled from actual interview question banks at real companies — and many include the original company name as context.

For ongoing practice once you've cleared the beginner and intermediate tiers, switch tactics. Stop drilling on artificial questions and start writing queries against real data.

Find a free dataset on Kaggle, load it into PostgreSQL locally, and force yourself to answer real business questions about it. "What was the day-over-day revenue change for the top five products in the last 90 days, ranked by retention rate?" That kind of question forces you to integrate window functions, CTEs, and JOINs in ways no canned exercise can replicate. The transition from solving problems to designing them is where intermediate SQL becomes advanced SQL.

SQL Practice Sites — Pros and Cons

Pros

HackerRank SQL has the cleanest difficulty ramp and best free-tier coverage for beginners through intermediates
LeetCode SQL mirrors the question style used in FAANG-level technical screens
StrataScratch pulls from real interview question banks at companies like Airbnb and Amazon
SQLZoo is free, browser-based, and covers fundamentals interactively
Mode Analytics offers a connected query engine with real business datasets
DataCamp has structured tracks with progress tracking and certificates

Cons

HackerRank's advanced section is thinner than its intermediate — you'll outgrow it
LeetCode's free tier locks many problems behind premium subscriptions
StrataScratch requires a paid subscription for most of its highest-value content
SQLZoo doesn't track progress or build toward a certification credential
DataCamp's interactive lessons run on a sandboxed engine, not a real database
None of these sites teach query plan reading or isolation levels well — you need supplemental docs

One last skill that doesn't fit neatly in any tier — dialect fluency. SQL is a standard, but every database vendor has its own quirks. PostgreSQL has rich window function support and array types. MySQL has a slightly different LIMIT syntax and weaker recursive CTE support depending on the version. SQL Server uses TOP instead of LIMIT and has TRY_CAST instead of NULL-returning casts. BigQuery has standard SQL plus its own data type extensions. Snowflake leans on the standard but adds variant types for semi-structured data.

Pick one dialect to be fluent in. For most analyst roles that's PostgreSQL or SQL Server. For data engineer roles in modern stacks it's increasingly Snowflake or BigQuery. Once you're fluent in one, learning a second takes a long weekend — you just read the dialect differences page and write a few queries to feel out the edge cases. Don't try to learn three dialects simultaneously. You'll end up with shallow knowledge of all three and deep knowledge of none.

The advanced interview question banks live mostly on the data engineer and data analyst tracks. "Advanced SQL interview questions for data analyst" pulls up question sets that focus on window functions, CTEs, and complex aggregations. "Advanced SQL interview questions for data engineer" pulls up sets that lean harder into performance, indexing, and pipeline-style queries. "Advanced SQL interview questions for experienced professionals" usually means senior-level questions covering query plans, isolation, and architectural trade-offs. Tailor your prep to the actual role you're targeting. A senior data engineer interview will probe parts of SQL that an analyst interview never touches, and vice versa.

Start Your Free SQL Practice Now

Building the skills checklist into a real study plan is the part most people skip. Don't. Take the checklist above, rate yourself honestly on every item, and turn the "unfamiliar" entries into a queue. Spend one hour per day on the easiest "unfamiliar" item until it becomes "working," then move to the next. Once everything is at least "working," cycle back to the top and push everything to "confident." That's two passes through the list — maybe six weeks of consistent daily practice — and it covers the entire SQL skill space for analyst and engineer roles.

If a SQL coding test is on your calendar in the next month, change the strategy. Don't try to push all your unfamiliar items to confident in time. Instead, prioritize the items most likely to appear. Window functions are tested in roughly 70% of analyst coding tests. CTEs in 60%. Subqueries in 80%. JOIN gotchas in nearly all of them.

Recursive CTEs in maybe 15%. Isolation levels essentially never on a coding test, though they come up in oral interviews afterward. Spend your prep on the high-probability items and accept that you might see one question you can't fully solve. That's fine. Brute-force the rest and submit clean code.

Final thought — the SQL skills market rewards depth over breadth. A candidate who's confident in 80% of the intermediate tier and competent in two-thirds of the advanced tier will outperform one who's seen every advanced feature once but can't write a clean window function under pressure. Drill the basics until they're automatic. Drill the intermediate layer until you can structure a three-CTE query in your sleep. Sample the advanced layer until you understand what each concept does and why it matters. That mix wins interviews and lets you ship real production queries on day one of any new job.

SQL Questions and Answers

What SQL skills do I need for a data analyst job?

Confident command of SELECT, WHERE, ORDER BY, and aggregates with GROUP BY and HAVING. Comfortable use of INNER JOIN and LEFT JOIN with multiple tables. Working knowledge of CTEs, subqueries, and the core window functions — ROW_NUMBER, RANK, LAG, LEAD, and SUM OVER. CASE WHEN logic for bucketing and pivot-style aggregations. If you can handle those without opening a reference, you'll pass most data analyst SQL screens. Anything advanced is bonus credit but rarely required at the analyst tier.

How long does it take to learn intermediate SQL?

With consistent daily practice — about one hour per day on real problems — most learners reach solid intermediate fluency in 8 to 12 weeks starting from beginner. The pacing assumes you're working through structured problems on HackerRank or LeetCode rather than just reading tutorials. Reading without writing queries is the slowest possible path. Pick one practice site, set a daily target of 3 to 5 problems, and check off the skills on your checklist as you become confident with each one.

Which is the best site to practice SQL for interviews?

StrataScratch is the strongest pick if you can afford the subscription — its problems are pulled directly from real interview question banks at companies like Airbnb, Amazon, and Meta. LeetCode SQL is the best free-tier option for FAANG-style preparation, especially its medium-difficulty problems. HackerRank SQL has the cleanest beginner-to-intermediate ramp. Most successful candidates use two of these in sequence rather than one alone — typically HackerRank first, then LeetCode or StrataScratch for the harder material.

Are window functions really that important?

Yes. Window functions show up in roughly 70% of modern SQL coding tests for analyst and data engineer roles and are the single most-tested concept after basic joins and aggregations. ROW_NUMBER, RANK, DENSE_RANK, LAG, LEAD, and SUM OVER with PARTITION BY are the six you need cold. If you can't write a window function from memory or explain the difference between ROW_NUMBER and RANK without thinking, you have a clear study target. Drill them until they're automatic.

Do I need to learn recursive CTEs?

For analyst roles, mostly no — they appear in maybe 10 to 15% of analyst interview questions and rarely on coding tests. For data engineer roles, especially senior ones, yes — they come up regularly when interviewers want to test whether you can solve hierarchical data problems without resorting to procedural code. The good news is that the recursive CTE pattern is small once you've seen it. One afternoon of focused practice on org chart traversal and gap-and-island problems usually covers the interview territory.

What's the fastest way to prep for a SQL coding test?

Drill the high-probability skills first — joins, aggregations with GROUP BY, window functions, CTEs, and CASE WHEN logic. Time yourself on each problem to build clock pressure tolerance. Most coding tests run 60 to 90 minutes with 4 to 8 questions, so practicing under time pressure matters more than the absolute number of problems you solve. If you have one week, do 50 medium problems on HackerRank. If you have two weeks, add 25 more on LeetCode medium. Skip recursive CTEs and isolation levels unless the role description specifically requires them.

Should I memorize a SQL keywords cheat sheet?

Memorize syntax patterns, not vocabulary lists. A useful cheat sheet shows you the full skeleton of a window function call with PARTITION BY, ORDER BY, and frame clauses — plus three real-world examples — rather than just listing the names of aggregate functions. The PostgreSQL official docs have the strongest reference material for window functions in particular. Print the relevant pages and keep them next to your laptop until the patterns become automatic.

What separates advanced SQL from intermediate?

Intermediate SQL means you can solve complex problems using CTEs, window functions, and JOINs. Advanced SQL means you also understand how the database executes those queries — query plans, indexes, partitioning, isolation levels — and can optimize them when they're slow. Most analyst roles top out at intermediate. Data engineer and database engineer roles regularly probe the advanced tier, especially during senior-level interviews. Plan your prep around the actual role you're targeting rather than trying to cover everything equally.