Question 1

What are ranking window functions?

Accepted Answer

Ranking functions assign a position number to each row within its partition, based on the window's . The main ones are , , , and , plus the distribution functions  and . All ranking functions require an  in the  clause — without an order, "rank" has no meaning. Rule of thumb: ranking functions number rows by an ordering; they always need  in .

Question 2

What does ROW_NUMBER() do?

Accepted Answer

assigns a unique, sequential integer to each row within the partition, in the window's  order —  with no ties and no gaps. Even rows with equal ordering values get distinct numbers (the tie-break is arbitrary unless you add more  columns). It's the go-to for pagination, deduplication, and top-N-per-group. Rule of thumb:  = a unique 1,2,3 sequence per partition, no ties — use it when you need exactly one row per position.

Question 3

What is the difference between RANK() and DENSE_RANK()?

Accepted Answer

Both give tied rows the same rank, but they differ in what comes next: -  leaves gaps after ties — if two rows tie at 1, the next is 3. -  leaves no gaps — after a tie at 1, the next is 2. Rule of thumb:  skips numbers after ties (like Olympic ranking);  keeps them consecutive.

Question 4

How does ROW_NUMBER() differ from RANK() and DENSE_RANK()?

Accepted Answer

The key difference is how ties are handled: -  — always unique; tied rows get different numbers (arbitrary   order among the tie). -  — tied rows get the same rank, then a gap. -  — tied rows get the same rank, no gap. Rule of thumb: choose  for unique positions, / when ties should share a position (gaps vs no gaps).

Question 5

What does NTILE() do?

Accepted Answer

divides the ordered rows of a partition into n roughly equal buckets and labels each row with its bucket number . It's used for quartiles, deciles, percentile bands, and bucketing. If the row count doesn't divide evenly, the earlier buckets get one extra row. Rule of thumb:  splits ordered rows into n balanced groups — use it for quartiles/deciles and even distribution.

Question 6

What does PERCENT_RANK() compute?

Accepted Answer

returns the relative rank of a row as a value between  and : . The first row is always ; the last is . It tells you what fraction of rows rank below the current one. It's useful for percentile-style comparisons (e.g. "this salary is higher than 80% of others"). Rule of thumb:  = where a row sits on a 0–1 scale relative to the rest; first row 0, last row 1.

Question 7

What does CUME_DIST() compute and how does it differ from PERCENT_RANK()?

Accepted Answer

(cumulative distribution) returns the fraction of rows with a value less than or equal to the current row: . It ranges in . The difference from : -  = count of rows ≤ current / total (includes the current row). -  =  (excludes current; first row is 0). Rule of thumb:  answers "what proportion are at or below me?";  answers "what's my relative rank position 0–1?".

Question 8

Why do ranking functions require an ORDER BY in the OVER clause?

Accepted Answer

Ranking is meaningless without a defined order — the function needs to know by what to rank. So , , , , etc. all require  inside ; omitting it is an error in most databases. (Aggregate windows like  don't need , but ranking functions do.) Rule of thumb: every ranking function needs  in  to define the ranking criterion.

Question 9

How do you select the top N rows per group?

Accepted Answer

The classic pattern: number rows within each partition with a ranking function in a CTE/subquery, then filter on that number in the outer query (window functions can't go in ). Use  for "exactly N rows" or / to include ties at the cutoff. Rule of thumb: rank in a CTE, filter  — pick  for an exact N,  to keep ties.

Question 10

How do you find the Nth highest value using ranking functions?

Accepted Answer

Rank the rows descending, then filter for the Nth rank. Use  when you want the Nth distinct value (so duplicate values count once). With  you'd get the 3rd row, not the 3rd distinct value; with  you'd risk gaps.  is the safe choice for "Nth highest distinct." Rule of thumb: for the Nth highest distinct value, .

Question 11

How do you remove duplicate rows using ROW_NUMBER()?

Accepted Answer

Partition by the columns that define a duplicate, number the rows, and keep only  (deleting or excluding the rest). The  decides which duplicate is the "keeper" (e.g. newest). Rule of thumb:  partitioned by the dup key, keep , remove  — the standard dedup pattern.

Question 12

How can ROW_NUMBER() be used for pagination?

Accepted Answer

Number the ordered rows, then select the slice for a page. This was the classic pagination method before / and is still used in SQL Server pre-2012. Modern engines often prefer , but  pagination works everywhere and pairs well with deterministic ordering. Note both get slow at deep offsets — keyset pagination scales better. Rule of thumb:  +  slices pages; for large datasets prefer keyset pagination over deep offsets.

Question 13

How do you make ROW_NUMBER() deterministic when ordering values tie?

Accepted Answer

always produces unique numbers, but when the  values tie, the assignment among tied rows is arbitrary and can change between runs. Add a tie-breaker column (ideally a unique key) to the  to make it deterministic. Without the tie-break, paginating or deduplicating can return inconsistent results across executions. Rule of thumb: append a unique column to the window  so tied rows get a stable, repeatable order.

Question 14

How does PARTITION BY affect ranking functions?

Accepted Answer

makes the rank restart at 1 for each group. Without it, ranking runs across the entire result set as one partition. So an employee can be rank 1 in their department even if they're not the highest-paid company-wide. Rule of thumb: add  to rank within groups (rank resets per group); omit it to rank globally.

Question 15

Why can't you filter directly on a ranking function in WHERE?

Accepted Answer

Window functions — including ranking functions — are evaluated after , so the rank doesn't exist yet when  runs. Referencing it there is an error. Rule of thumb: ranks are computed after filtering — always wrap them in a CTE or subquery before filtering.

Question 16

What happens with NTILE() when rows don't divide evenly?

Accepted Answer

When the row count isn't divisible by ,  makes the first buckets one row larger than the later ones. For example, 10 rows into  gives buckets of sizes 4, 3, 3. This guarantees buckets differ in size by at most one, with the extras front-loaded. Rule of thumb:  front-loads the remainder — earlier buckets get the extra rows when the count doesn't divide evenly.

Question 17

How do you choose between ROW_NUMBER, RANK, and DENSE_RANK?

Accepted Answer

Pick based on how you want ties handled: - Need exactly one row per position (pagination, dedup, "the single latest")   → . - Ties should share a rank with gaps (standings where 2 golds means no silver)   → . - Ties should share a rank without gaps (Nth distinct value, dense tiers)   → . Rule of thumb: unique → ; ties-with-gaps → ; ties-no-gaps → .

Question 18

How can ranking/distribution functions help compute a median?

Accepted Answer

A median is the value at the 50th percentile. You can approximate it with /, but most databases offer the dedicated ordered-set aggregate / (a  function, related to window analytics).  interpolates between rows;  returns an actual data value. Rule of thumb: for a median, prefer  over hand-rolling it from rank functions.

Ranking Functions Interview Questions & Answers

What are ranking window functions?

What does ROW_NUMBER() do?

What is the difference between RANK() and DENSE_RANK()?

How does ROW_NUMBER() differ from RANK() and DENSE_RANK()?

What does NTILE() do?

What does PERCENT_RANK() compute?

What does CUME_DIST() compute and how does it differ from PERCENT_RANK()?

Why do ranking functions require an ORDER BY in the OVER clause?

How do you select the top N rows per group?

How do you find the Nth highest value using ranking functions?

How do you remove duplicate rows using ROW_NUMBER()?

How do you make ROW_NUMBER() deterministic when ordering values tie?

How does PARTITION BY affect ranking functions?

Why can't you filter directly on a ranking function in WHERE?

What happens with NTILE() when rows don't divide evenly?

How do you choose between ROW_NUMBER, RANK, and DENSE_RANK?

How can ranking/distribution functions help compute a median?

More ways to practice

What are ranking window functions?

What does ROW_NUMBER() do?

What is the difference between RANK() and DENSE_RANK()?

How does ROW_NUMBER() differ from RANK() and DENSE_RANK()?

What does NTILE() do?

What does PERCENT_RANK() compute?

What does CUME_DIST() compute and how does it differ from PERCENT_RANK()?

Why do ranking functions require an ORDER BY in the OVER clause?

How do you select the top N rows per group?

How do you find the Nth highest value using ranking functions?

How do you remove duplicate rows using ROW_NUMBER()?

How can ROW_NUMBER() be used for pagination?

How do you make ROW_NUMBER() deterministic when ordering values tie?

How does PARTITION BY affect ranking functions?

Why can't you filter directly on a ranking function in WHERE?

What happens with NTILE() when rows don't divide evenly?

How do you choose between ROW_NUMBER, RANK, and DENSE_RANK?

How can ranking/distribution functions help compute a median?

More Window Functions interview questions

More ways to practice