Q: Compare HashSet, LinkedHashSet and TreeSet.

The classic side-by-side every interviewer expects: | Feature | | | | | ------- | --------- | --------------- | --------- | | Backed by | | | (red-black tree) | | Ordering | none | insertion order | sorted (natural/comparator) | | // | O(1) avg | O(1) avg | O(log n) | | allowed | one | one | no (NPE with natural order) | | Needs / | yes | yes | uses / | | Extra navigation | no | no | yes (//…) | None of the three is thread-safe. Default to ; upgrade only when you specifically need order or sorting.

Question 1

What is the Set interface and what is its core contract?

Accepted Answer

A  is a  that holds no duplicate elements — adding an element already present is a no-op and  returns . There's no index and (for the general contract) no positional access; you check membership with , which is the operation sets are built to make fast. "Duplicate" is defined by **** (and  for hash-based sets, or / for sorted sets) — not by reference identity. That definition of equality is the heart of every Set question.

Question 2

How does a Set decide that two elements are duplicates?

Accepted Answer

It depends on the implementation, and this is the single most important Set detail in interviews: | Set | Duplicate test | | --- | -------------- | | ,  |  to find the bucket, then  within it | |  |  (or the supplied ) —  is ignored | The trap: a  uses comparison, not , so an element can be dropped as a "duplicate" even though  says it's different (and vice versa) — keep your  consistent with equals to avoid surprises.

Question 3

How does HashSet work internally?

Accepted Answer

A  is just a thin wrapper around a * — each element is stored as a key, and all keys map to the same dummy  value object. So everything you know about  (buckets, load factor, treeified bins) applies directly. That's why  gives average O(1) //, has no ordering*, and depends entirely on good /. If hashes collide badly, performance degrades toward O(n) (or O(log n) once a bucket treeifies).

Question 4

How is LinkedHashSet different from HashSet?

Accepted Answer

extends  but is backed by a *, so it keeps a doubly-linked list threading through the entries. The effect: iteration follows insertion order while keeping 's O(1) operations. Cost is a slightly larger memory footprint (the extra prev/next links). Reach for it whenever you need dedup but predictable, reproducible ordering* — e.g. removing duplicates from a list without scrambling it.

Question 5

How does TreeSet work and what ordering does it give?

Accepted Answer

A  is backed by a *, a red-black tree (self-balancing binary search tree). Elements are kept in sorted order — natural ordering via , or a  you pass to the constructor — and core operations are O(log n) rather than O(1). Because it's a tree, you get range and neighbour queries* for free (, , , , …). The trade-off vs : slower point lookups but ordered iteration and rich navigation.

Question 6

What does TreeSet require of its elements?

Accepted Answer

Elements must be mutually comparable — either they implement  (natural ordering) or you supply a . Without one of those, the very first  of a non-comparable type throws  at runtime. Note the failure is not at construction time — an empty  is happy; it only blows up when it needs to compare an element it can't order.

Question 7

Compare HashSet, LinkedHashSet and TreeSet.

Accepted Answer

The classic side-by-side every interviewer expects:

Feature	`HashSet`	`LinkedHashSet`	`TreeSet`
Backed by	`HashMap`	`LinkedHashMap`	`TreeMap` (red-black tree)
Ordering	none	insertion order	sorted (natural/comparator)
`add`/`contains`/`remove`	O(1) avg	O(1) avg	O(log n)
`null` allowed	one `null`	one `null`	no (NPE with natural order)
Needs `equals`/`hashCode`	yes	yes	uses `compareTo`/`Comparator`
Extra navigation	no	no	yes (`first`/`floor`/`subSet`…)

// pick by need:
new HashSet<>();        // fastest, order doesn't matter
new LinkedHashSet<>();  // dedup, keep insertion order
new TreeSet<>();        // need sorted iteration or range queries

None of the three is thread-safe. Default to HashSet; upgrade only when you specifically need order or sorting.

Question 8

What is the difference between SortedSet and NavigableSet?

Accepted Answer

is the older interface guaranteeing sorted iteration and offering , , , , , and .  (Java 6+) extends it with neighbour lookups and reverse-order views.  implements both. In practice you usually just declare the variable as  (or ) to get the full method set —  alone lacks the handy /// queries.

Question 9

What do floor, ceiling, higher and lower do on a NavigableSet?

Accepted Answer

They find the closest element to a target. The distinction is inclusive vs exclusive of the target itself:

Method	Returns
`floor(e)`	greatest element ≤ e
`ceiling(e)`	smallest element ≥ e
`lower(e)`	greatest element strictly < e
`higher(e)`	smallest element strictly > e

NavigableSet<Integer> s = new TreeSet<>(List.of(10, 20, 30));
s.floor(20);   // 20  (<=)
s.lower(20);   // 10  (strictly <)
s.ceiling(20); // 20  (>=)
s.higher(20);  // 30  (strictly >)
s.floor(5);    // null — nothing <= 5

All return null when no such element exists, so callers must null-check. These run in O(log n) and are exactly why you reach for a TreeSet over a HashSet for "nearest neighbour" or range problems.

Question 10

What do headSet, tailSet and subSet return, and are they live views?

Accepted Answer

They return range views of a sorted set — not copies.  is everything below ,  is everything from  up, and  is the half-open range . The  overloads let you toggle the inclusive/exclusive bounds. They are backed by the original set: changes flow both ways, and adding an element outside the view's bounds throws . Wrap in a  if you need an independent snapshot.

Question 11

How do you iterate a TreeSet in reverse order?

Accepted Answer

Use  (a reverse-ordered view) or . Both walk the tree from largest to smallest without copying or re-sorting. Because  is a live view, mutations on it affect the original set and vice versa. It's the idiomatic alternative to constructing a  with  when you only need reverse iteration occasionally.

Question 12

Can a TreeSet contain null?

Accepted Answer

No — with natural ordering, adding  throws ***, because the set must call  on the element and  can't happen. (A /, by contrast, allows a single .) Even a null-tolerant  (e.g. ) only helps for non-first* elements; the first  on an empty natural-order  still throws. Treat  as null-hostile.

Question 13

What is EnumSet and why is it so fast?

Accepted Answer

is a specialized  for enum types only. Internally it's a bit vector — a single  (or array of s for big enums) where each bit represents one constant. That makes operations near-instant and the memory footprint tiny. Iteration follows the enum's declaration order. It's created via factory methods (, , , , ) rather than . Whenever your set elements are enum constants,  beats  on every axis — speed, memory, and clarity.

Question 14

How do you make a Set thread-safe?

Accepted Answer

The standard // are not synchronized. Options, from cheap to concurrent: - * — wraps every method in a lock; you   must still manually synchronize when iterating. -  — a concurrent hash set backed by   ; high-throughput, no external locking, weakly consistent   iteration. -  — copies the backing array on every write; great   for tiny, read-heavy sets, terrible for write-heavy ones. For most concurrent code, * is the right choice — it scales far better than a globally locked wrapper.

Question 15

When would you use CopyOnWriteArraySet?

Accepted Answer

(backed by a ) makes a fresh copy of the whole array on every mutation. Reads and iteration are lock-free and see a stable snapshot, but each / is O(n). It shines for small, read-mostly, rarely-mutated sets — the textbook case being event-listener registries. Because it scans the array,  is O(n), so it's a poor fit for large or write-heavy sets, where  wins.

Question 16

How do you compute the union of two sets?

Accepted Answer

Copy one set and  the other — duplicates are dropped automatically by the Set contract.  is the union operator. Always copy first () so you don't mutate the original  — a common bug is calling  and silently changing the caller's set.

Question 17

How do you compute the intersection of two sets?

Accepted Answer

keeps only the elements present in both sets.  is the intersection operator. For performance, copy and iterate the smaller set against the larger one — the cost is roughly O(min(a, b)) lookups, and lookups are O(1) on a .

Question 18

How do you compute the difference between two sets?

Accepted Answer

strips out every element that also appears in the other set, leaving the difference (elements in  but not ). So the trio is:  = union,  = intersection,  = difference. A symmetric difference (in either but not both) is just union minus intersection, or two s combined.

Question 19

Why can mutating an element break a HashSet?

Accepted Answer

A  places each element in a bucket chosen by its  at insertion time. If you then mutate a field that / depend on, the element's hash changes but it stays in the old bucket — so the set can no longer find it. That's why Set (and Map-key) elements should be immutable, or at least never mutated on their / fields while stored. It's the same reason  and the wrapper types — immutable — make perfect set elements.

Question 20

What does Set.of return and what happens with duplicates?

Accepted Answer

(Java 9+) returns a small, immutable set. Any mutating call (, , ) throws , and — unlike a normal  that silently ignores dups — passing a duplicate to the factory throws . It also rejects  (NPE) and has an unspecified iteration order that can vary between runs. Use it for compact, never-changing constant sets; use / when you need order or mutability.

Question 21

What ordering guarantees do the different Sets give during iteration?

Accepted Answer

Each implementation makes a different promise — knowing them prevents a class of "works on my machine" bugs:

Set	Iteration order
`HashSet`	no guarantee (can even change between JVM runs)
`LinkedHashSet`	insertion order
`TreeSet`	sorted (natural or comparator)
`EnumSet`	enum declaration order
`Set.of(...)`	unspecified, may vary per run

// never rely on HashSet order:
for (String s : new HashSet<>(List.of("a", "b", "c"))) { /* any order */ }

The rule of thumb: if your output or tests depend on order, don't use HashSet — choose LinkedHashSet for insertion order or TreeSet for sorted order.

Question 22

Why is HashSet.contains so much faster than List.contains?

Accepted Answer

hashes the element, jumps straight to one bucket, and checks only the few items there — average O(1).  has no index into its data, so it scans from the front comparing each element — O(n). This is why a frequent pattern is to dump a list into a  first when you'll do many membership checks. The rule of thumb: need fast "is it in there?" — use a , not a .

Set Implementations Interview Questions & Answers

What is the Set interface and what is its core contract?

How does a Set decide that two elements are duplicates?

How does HashSet work internally?

How is LinkedHashSet different from HashSet?

How does TreeSet work and what ordering does it give?

What does TreeSet require of its elements?

Compare HashSet, LinkedHashSet and TreeSet.

What is the difference between SortedSet and NavigableSet?

What do floor, ceiling, higher and lower do on a NavigableSet?

What do headSet, tailSet and subSet return, and are they live views?

How do you iterate a TreeSet in reverse order?

Can a TreeSet contain null?

What is EnumSet and why is it so fast?

How do you make a Set thread-safe?

When would you use CopyOnWriteArraySet?

How do you compute the union of two sets?

How do you compute the intersection of two sets?

How do you compute the difference between two sets?

Why can mutating an element break a HashSet?

What does Set.of return and what happens with duplicates?

What ordering guarantees do the different Sets give during iteration?

Why is HashSet.contains so much faster than List.contains?

More ways to practice

Set	Duplicate test
`HashSet`, `LinkedHashSet`	`hashCode()` to find the bucket, then `equals()` within it
`TreeSet`	`compareTo()` (or the supplied `Comparator`) — `equals` is ignored

What is the Set interface and what is its core contract?

How does a Set decide that two elements are duplicates?

How does HashSet work internally?

How is LinkedHashSet different from HashSet?

How does TreeSet work and what ordering does it give?

What does TreeSet require of its elements?

Compare HashSet, LinkedHashSet and TreeSet.

What is the difference between SortedSet and NavigableSet?

What do floor, ceiling, higher and lower do on a NavigableSet?

What do headSet, tailSet and subSet return, and are they live views?

How do you iterate a TreeSet in reverse order?

Can a TreeSet contain null?

What is EnumSet and why is it so fast?

How do you make a Set thread-safe?

When would you use CopyOnWriteArraySet?

How do you compute the union of two sets?

How do you compute the intersection of two sets?

How do you compute the difference between two sets?

Why can mutating an element break a HashSet?

What does Set.of return and what happens with duplicates?

What ordering guarantees do the different Sets give during iteration?

Why is HashSet.contains so much faster than List.contains?

More Collections interview questions

More ways to practice