Question 1

What is a Stream in Java and how does it differ from a collection?

Accepted Answer

A `Stream` is not a data structure; it is a pipeline that carries elements from a source (a collection, array, generator, file…) through a series of operations and produces a result. It stores nothing; it just describes a computation over the source.

Key differences from a collection:

| Collection | Stream |
| ---------- | ------ |
| Stores elements in memory | Holds no data; pulls from a source |
| Eagerly built | Lazily evaluated |
| Iterated externally (you loop) | Iterated internally (the stream loops) |
| Reusable | Single-use (consumed once) |
| Can be modified | Source is never mutated |

So you use a collection to *hold* data and a stream to *process* it declaratively (for example `filter` → `map` → `collect`) instead of writing explicit loops.

Question 2

What are the common ways to create a Stream?

Accepted Answer

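Streams come from many sources: collections, arrays, explicit values, numeric ranges, generator functions, and files. `Stream.generate` and the two-argument `Stream.iterate` are infinite, so always pair them with a short-circuiting op like `limit`. `Files.lines` returns a stream backed by an open file, so close it (use try-with-resources).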
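The sources mentioned above can be sketched as follows. This is a minimal illustration, not an exhaustive list; the class name `StreamSources` and its method names are hypothetical, chosen only for this example.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;
import java.util.stream.Stream;

// Hypothetical helper class; each method shows one way to obtain a stream.
class StreamSources {
    // From a collection.
    static long fromCollection(List<String> items) {
        return items.stream().count();
    }

    // From an array (here a primitive int[] gives an IntStream).
    static int fromArray(int[] values) {
        return Arrays.stream(values).sum();
    }

    // From a numeric range; the end bound is exclusive.
    static List<Integer> fromRange(int start, int end) {
        return IntStream.range(start, end).boxed().collect(Collectors.toList());
    }

    // Infinite stream from iterate, bounded by limit.
    static List<Integer> powersOfTwo(int n) {
        return Stream.iterate(1, x -> x * 2).limit(n).collect(Collectors.toList());
    }

    // Infinite stream from generate, bounded by limit.
    static List<String> repeated(String s, int n) {
        return Stream.generate(() -> s).limit(n).collect(Collectors.toList());
    }

    // File-backed stream; try-with-resources closes the underlying file.
    static long countLines(Path file) throws IOException {
        try (Stream<String> lines = Files.lines(file)) {
            return lines.count();
        }
    }
}
```

Without the `limit` call, the two infinite variants would never finish collecting.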

Question 3

What is the difference between intermediate and terminal operations?

Accepted Answer

A stream pipeline is a source, zero or more intermediate operations, and exactly one terminal operation.

| | Intermediate | Terminal |
| --- | ------------ | -------- |
| Returns | another `Stream` | a value / side-effect (not a stream) |
| Evaluation | lazy: does nothing yet | eager: triggers the work |
| Count per pipeline | any number | exactly one |
| Examples | `filter`, `map`, `sorted`, `distinct`, `limit`, `peek` | `collect`, `forEach`, `reduce`, `count`, `findFirst`, `toArray` |

Without a terminal operation nothing executes; the intermediate steps are just recorded.

Question 4

What does it mean that streams are lazy?

Accepted Answer

Laziness means intermediate operations are not evaluated when they are called — they only run when a terminal operation demands elements. The pipeline then processes elements one at a time, vertically (each element flows through all stages before the next starts), not stage-by-stage over the whole collection. Laziness enables two big wins: fusion (multiple ops run in one pass) and short-circuiting (the pipeline can stop early without touching every element).
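The evaluation order described above can be made visible with a small trace. This is a sketch only: the class `LazinessDemo` is hypothetical, and the logging side effects inside the lambdas exist purely to show when each stage runs (they are not a recommended style).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical demo class: records when each pipeline stage sees an element.
class LazinessDemo {
    static List<String> trace() {
        List<String> log = new ArrayList<>();
        Stream<Integer> pipeline = Stream.of(1, 2, 3)
                .filter(x -> { log.add("filter " + x); return x % 2 == 1; })
                .map(x -> { log.add("map " + x); return x * 10; });
        log.add("built"); // the pipeline is only described so far; no lambda has run
        List<Integer> out = pipeline.collect(Collectors.toList()); // terminal op triggers the work
        log.add("result " + out);
        return log;
    }
}
```

The trace starts with `built`, then shows element 1 passing through `filter` and `map` before element 2 is even filtered, which is the vertical, one-element-at-a-time processing.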

Question 5

What is short-circuiting in a stream pipeline?

Accepted Answer

A short-circuiting operation can produce a result (or stop the pipeline) without processing every element. Combined with laziness, this lets streams work on infinite sources and quit as soon as the answer is known. Short-circuiting terminal ops: `anyMatch`, `allMatch`, `noneMatch`, `findFirst`, `findAny`. Short-circuiting intermediate op: `limit`. `allMatch` stops at the first element that fails, `anyMatch` at the first that passes; they rarely scan the whole stream.
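A minimal sketch of short-circuiting on an infinite source follows. The class `ShortCircuitDemo` and its methods are hypothetical; the counter inside `peek` is there only to show how few elements are inspected on a sequential stream.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.stream.Stream;

// Hypothetical demo class: both methods terminate even though the source is infinite.
class ShortCircuitDemo {
    // Returns how many elements were inspected before anyMatch found the target.
    static int inspectedUntilMatch(int target) {
        AtomicInteger inspected = new AtomicInteger();
        boolean found = Stream.iterate(1, x -> x + 1)
                .peek(x -> inspected.incrementAndGet())
                .anyMatch(x -> x == target); // stops at the first match
        return found ? inspected.get() : -1;
    }

    // findFirst stops as soon as one square exceeds the threshold.
    static int firstSquareAbove(int threshold) {
        return Stream.iterate(1, x -> x + 1)
                .map(x -> x * x)
                .filter(sq -> sq > threshold)
                .findFirst()
                .get();
    }
}
```

Both methods assume a positive `target` or a reachable threshold; with an unreachable condition an infinite source would keep running.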

Question 6

What does the filter operation do?

Accepted Answer

`filter` is an intermediate operation that keeps only elements matching a `Predicate` (a function returning `boolean`); the rest are dropped. It does not change the element type, only how many pass through. `filter` is stateless and lazy, so several filters fuse into one pass. Rule of thumb: `filter` decides *whether* an element survives; `map` decides *what* it becomes.

Question 7

What does the map operation do?

Accepted Answer

`map` is an intermediate operation that applies a `Function` to each element and replaces it with the result: a one-to-one transformation. It can change the element type (for example `String` → `Integer`). `map` never changes the number of elements (N in, N out); it only transforms each one. When the mapping produces another *collection or stream* per element and you want them flattened, you need `flatMap` instead.
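The division of labor between filtering and mapping can be sketched in one small pipeline. The class `FilterMapDemo` and the method `lengthsOfLongWords` are hypothetical names used only for illustration.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical demo class: filter chooses which words survive, map changes String to Integer.
class FilterMapDemo {
    static List<Integer> lengthsOfLongWords(List<String> words, int minLength) {
        return words.stream()
                .filter(w -> w.length() >= minLength) // fewer elements, same type
                .map(String::length)                  // same count, new type
                .collect(Collectors.toList());
    }
}
```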

Question 8

What is flatMap and when do you use it instead of map?

Accepted Answer

`flatMap` maps each element to a stream and then flattens all those streams into one. Use it when the mapping function yields multiple values per element (a list, an `Optional`, a nested stream) and you want a single flat stream rather than a stream-of-streams. Mental model: use `map` for one-to-one (N→N), `flatMap` for one-to-many that you want merged (N→M). Splitting sentences into words is the classic `flatMap` case.
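The sentence-splitting case can be sketched as below. The class `FlatMapDemo` is hypothetical, and splitting on a single space is a simplifying assumption about the input.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical demo class contrasting map (N to N) with flatMap (N to M).
class FlatMapDemo {
    // flatMap: each sentence becomes a stream of words, merged into one flat stream.
    static List<String> words(List<String> sentences) {
        return sentences.stream()
                .flatMap(s -> Arrays.stream(s.split(" ")))
                .collect(Collectors.toList());
    }

    // map: one result per sentence, so the element count is unchanged.
    static List<Integer> wordCounts(List<String> sentences) {
        return sentences.stream()
                .map(s -> s.split(" ").length)
                .collect(Collectors.toList());
    }
}
```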

Question 9

What do distinct and sorted do, and what is special about them?

Accepted Answer

`distinct` removes duplicates (by `equals`/`hashCode`); `sorted` orders elements (natural order, or a supplied `Comparator`). Both are intermediate but stateful: they must see elements before they can emit. `sorted` is a full barrier: it buffers the entire stream before producing any output, so it cannot short-circuit and breaks on infinite streams. Both add memory and ordering overhead, so apply `filter` *before* them to shrink the work.
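A short sketch of the ordering advice above: filter first, then deduplicate, then sort. The class `DistinctSortedDemo` and its methods are hypothetical.

```java
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical demo class: the cheap stateless filter runs before the stateful stages.
class DistinctSortedDemo {
    static List<String> uniqueSorted(List<String> input) {
        return input.stream()
                .filter(s -> !s.isEmpty()) // shrink the work first
                .distinct()                // uses equals/hashCode
                .sorted()                  // natural order; buffers all remaining elements
                .collect(Collectors.toList());
    }

    static List<String> uniqueSortedDescending(List<String> input) {
        return input.stream()
                .filter(s -> !s.isEmpty())
                .distinct()
                .sorted(Comparator.reverseOrder()) // supplied Comparator
                .collect(Collectors.toList());
    }
}
```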

Question 10

What do limit and skip do?

Accepted Answer

`limit(n)` truncates the stream to the first `n` elements; `skip(n)` discards the first `n` and keeps the rest. Together they give pagination-style slicing. `limit` is short-circuiting; it's what tames infinite streams. On an ordered stream both are deterministic, even in parallel; on an unordered stream they may pick *any* n elements. `limit` on an ordered parallel stream can actually hurt performance because it must honor encounter order.
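Pagination-style slicing can be sketched as follows. The class `PagingDemo` is hypothetical, and a zero-based `pageIndex` over the natural numbers is an assumption made for the example.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical demo class: skip drops earlier pages, limit bounds the infinite source.
class PagingDemo {
    static List<Integer> page(int pageIndex, int pageSize) {
        return Stream.iterate(1, x -> x + 1)        // infinite: 1, 2, 3, ...
                .skip((long) pageIndex * pageSize)  // discard the preceding pages
                .limit(pageSize)                    // short-circuits the infinite stream
                .collect(Collectors.toList());
    }
}
```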

Question 11

What is peek used for and why is it controversial?

Accepted Answer

`peek` is an intermediate operation that runs a side-effecting action on each element as it flows past, then passes the element through unchanged. Its intended use is debugging: logging what moves through each stage. It's controversial because (1) it's lazy, so elements skipped by short-circuiting never reach `peek`, and (2) using it to *mutate* state is an anti-pattern. Rule of thumb: use `peek` only for observation, never for logic.

Question 12

What is reduce and what are its three forms?

Accepted Answer

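`reduce` combines all elements into a single result by repeatedly applying a binary operator. It comes in three overloads: `reduce(BinaryOperator<T> accumulator)`, which returns an `Optional<T>` because there is no identity; `reduce(T identity, BinaryOperator<T> accumulator)`; and `reduce(U identity, BiFunction<U, ? super T, U> accumulator, BinaryOperator<U> combiner)`. The identity must be a true no-op (`0` for sum, `1` for product). The *combiner* merges partial results from parallel sub-streams, so it's required when the accumulator's result type differs from the element type.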
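The three overloads can be sketched side by side. The class `ReduceForms` and its method names are hypothetical, introduced only to give each overload a runnable example.

```java
import java.util.List;
import java.util.Optional;

// Hypothetical demo class: one method per reduce overload.
class ReduceForms {
    // 1. reduce(BinaryOperator): no identity, so the result is an Optional.
    static Optional<Integer> max(List<Integer> xs) {
        return xs.stream().reduce(Integer::max);
    }

    // 2. reduce(identity, BinaryOperator): 0 is the no-op value for addition.
    static int sum(List<Integer> xs) {
        return xs.stream().reduce(0, Integer::sum);
    }

    // 3. reduce(identity, accumulator, combiner): result type (Integer)
    //    differs from the element type (String), so a combiner is needed.
    static int totalLength(List<String> words) {
        return words.stream().reduce(0, (len, w) -> len + w.length(), Integer::sum);
    }
}
```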

Question 13

How do count, min, and max work as terminal operations?

Accepted Answer

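`count()` returns the number of elements as a `long`. `min` and `max` take a `Comparator` and return an `Optional` (empty if the stream is empty). `min`/`max` return `Optional` precisely because there's no sensible value for an empty stream; handle it with `Optional` methods such as `orElse` or `ifPresent`. (Note: since Java 9 the implementation may skip executing the pipeline for `count()` if it can compute the size directly from the source, so side effects in earlier stages may not run.)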

Question 14

What is the difference between anyMatch, allMatch and noneMatch?

Accepted Answer

All three are short-circuiting terminal operations that take a `Predicate` and return a `boolean`:

- `anyMatch`: `true` if at least one element matches (stops at the first match).
- `allMatch`: `true` if every element matches (stops at the first failure).
- `noneMatch`: `true` if no element matches (stops at the first match).

Watch the empty-stream edge cases (vacuous truth): on an empty stream `allMatch` and `noneMatch` return `true`, while `anyMatch` returns `false`.
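The empty-stream behavior is easy to check with a small sketch. The class `MatchDemo` and its method names are hypothetical.

```java
import java.util.List;

// Hypothetical demo class: one method per matching operation.
class MatchDemo {
    static boolean anyNegative(List<Integer> xs) {
        return xs.stream().anyMatch(x -> x < 0);
    }

    static boolean allPositive(List<Integer> xs) {
        return xs.stream().allMatch(x -> x > 0);
    }

    static boolean noneZero(List<Integer> xs) {
        return xs.stream().noneMatch(x -> x == 0);
    }
}
```

Called with an empty list, `allPositive` and `noneZero` return `true` while `anyNegative` returns `false`, so guard against empty input when "all match" must imply "at least one exists".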

Question 15

What is the difference between findFirst and findAny?

Accepted Answer

Both return an `Optional` holding some element (or empty), and both short-circuit. The difference matters mainly in parallel streams:

- `findFirst` returns the first element in encounter order.
- `findAny` returns any element, whichever a worker thread finds first. It frees the runtime from honoring order, so it can be faster in parallel.

On a sequential stream they usually return the same element in practice, although `findAny` makes no such guarantee. Use `findAny` when you genuinely don't care *which* match you get and want maximum parallel performance.

Question 16

How do you turn a stream back into a collection or array?

Accepted Answer

The main terminal operation is `collect` with a `Collector`. There are also direct helpers such as `toList()` and `toArray()`. `toList()` (Java 16+) is the concise modern choice but returns an *unmodifiable* list; use `collect(Collectors.toCollection(ArrayList::new))` if you need to mutate the result. Pass a generator (`String[]::new`) to `toArray` to get a typed array instead of `Object[]`. Deeper collector recipes (grouping, joining) live on the Collectors page.
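A minimal sketch of the mutable, unmodifiable, and array results follows. The class `CollectDemo` is hypothetical. As an assumption made so the example also compiles on Java 10 to 15, `Collectors.toUnmodifiableList()` stands in for `Stream.toList()`; both produce a list that rejects modification.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical demo class: three ways to leave a stream.
class CollectDemo {
    // Explicitly mutable result backed by an ArrayList.
    static List<String> mutableUpper(List<String> in) {
        return in.stream()
                .map(String::toUpperCase)
                .collect(Collectors.toCollection(ArrayList::new));
    }

    // Unmodifiable result (stand-in for Stream.toList() on Java 16+).
    static List<String> unmodifiableUpper(List<String> in) {
        return in.stream()
                .map(String::toUpperCase)
                .collect(Collectors.toUnmodifiableList());
    }

    // Typed array via a generator instead of Object[].
    static String[] toTypedArray(List<String> in) {
        return in.stream().toArray(String[]::new);
    }
}
```

Calling `add` on the list from `unmodifiableUpper` throws `UnsupportedOperationException`, while the list from `mutableUpper` accepts it.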

Question 17

What are IntStream, LongStream and DoubleStream and why use them?

Accepted Answer

They are specialized primitive streams that avoid the boxing overhead of `Stream<Integer>`/`Stream<Long>`/`Stream<Double>`. Because elements are raw primitives, they add numeric terminal ops that the object stream lacks. Convert with `mapToInt`/`mapToLong`/`mapToDouble` to enter a primitive stream, and `boxed()` or `mapToObj` to go back to an object stream. Prefer primitive streams for heavy numeric work: they're faster and offer numeric helpers such as `sum`, `average`, and `summaryStatistics` for free.
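Entering and leaving a primitive stream can be sketched as below. The class `PrimitiveStreamDemo` and its methods are hypothetical, and returning `0.0` for an empty input is an arbitrary choice made for the example.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

// Hypothetical demo class: object stream to IntStream and back.
class PrimitiveStreamDemo {
    // mapToInt enters an IntStream, which offers sum() directly.
    static int totalLength(List<String> words) {
        return words.stream().mapToInt(String::length).sum();
    }

    // average() returns an OptionalDouble because the stream may be empty.
    static double averageLength(List<String> words) {
        return words.stream().mapToInt(String::length).average().orElse(0.0);
    }

    // boxed() converts the IntStream back to a Stream<Integer>.
    static List<Integer> squares(int n) {
        return IntStream.rangeClosed(1, n)
                .map(x -> x * x)
                .boxed()
                .collect(Collectors.toList());
    }
}
```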

Question 18

What is the difference between stateless and stateful operations?

Accepted Answer

A stateless operation processes each element independently of the others (`filter`, `map`, `flatMap`, `peek`); it needs no memory of what came before. A stateful operation must consider other elements to produce its output (`distinct`, `sorted`, `limit`, `skip`). Why it matters: stateful ops may buffer the stream (extra memory), can act as barriers that prevent short-circuiting (`sorted` on an infinite stream never finishes), and are harder to parallelize. Keep pipelines stateless where you can.

Question 19

Why can't a stream be reused, and what happens if you try?

Accepted Answer

A stream can be traversed only once. Once any operation, intermediate or terminal, has been invoked on a stream object, that object is linked or consumed; operating on it again throws `IllegalStateException`. Streams are single-use because they hold no data and may be backed by I/O or infinite generators, so re-traversal isn't generally possible. If you need to process the data twice, re-create the stream from the *source* (`list.stream()` again) or use a `Supplier<Stream<T>>` that builds a fresh one on demand.
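Both the failure and the supplier workaround can be sketched briefly. The class `ReuseDemo` and its method names are hypothetical.

```java
import java.util.function.Supplier;
import java.util.stream.Stream;

// Hypothetical demo class: a second traversal fails, a Supplier avoids the problem.
class ReuseDemo {
    // Returns true if reusing a consumed stream throws IllegalStateException.
    static boolean secondTraversalFails() {
        Stream<String> s = Stream.of("a", "b");
        s.count(); // first terminal operation consumes the stream
        try {
            s.count(); // second use of the same stream object
            return false;
        } catch (IllegalStateException e) {
            return true;
        }
    }

    // Each get() builds a fresh stream, so the data can be processed twice.
    static long[] countTwice(Supplier<Stream<String>> source) {
        long all = source.get().count();
        long nonEmpty = source.get().filter(x -> !x.isEmpty()).count();
        return new long[] {all, nonEmpty};
    }
}
```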

Question 20

What are parallel streams and when do they help or hurt?

Accepted Answer

A parallel stream splits its source and processes chunks concurrently on the common ForkJoinPool, then merges the partial results. You opt in with `parallelStream()` on a collection or `parallel()` on an existing stream. They help when: the data set is large, the per-element work is genuinely expensive (CPU-bound), the source splits cheaply (arrays, `ArrayList`), and operations are stateless. They hurt when: the data is small, the source is hard to split (`LinkedList`, I/O streams), elements are cheap (split/merge cost dominates), or you rely on order. Rule of thumb: stay sequential by default and only parallelize after measuring a real win.

Question 21

Why should you avoid side effects and stateful lambdas in streams?

Accepted Answer

Stream operations should be pure: they should depend only on their input and not mutate shared state. A stateful lambda that reads or writes outside variables breaks under parallelism and even under reordering, producing non-deterministic or corrupt results. Also avoid modifying the stream's source during iteration (doing so can throw `ConcurrentModificationException`). Rule of thumb: never accumulate into an external collection from `forEach`/`peek`; express the result with `collect` or `reduce`, which are designed to be safe even in parallel.
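The rule of thumb can be sketched by contrasting the two styles. The class `SideEffectDemo` is hypothetical; `evensUnsafe` is included only as a counter-example and should not be copied.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

// Hypothetical demo class: external accumulation versus collect.
class SideEffectDemo {
    // Anti-pattern, shown for contrast only: ArrayList is not thread-safe,
    // so parallel forEach may lose elements, reorder them, or throw.
    static List<Integer> evensUnsafe(int n) {
        List<Integer> out = new ArrayList<>();
        IntStream.rangeClosed(1, n).parallel()
                .filter(x -> x % 2 == 0)
                .forEach(out::add);
        return out;
    }

    // Preferred: collect manages per-thread containers and merges them,
    // preserving encounter order for this ordered source.
    static List<Integer> evensSafe(int n) {
        return IntStream.rangeClosed(1, n).parallel()
                .filter(x -> x % 2 == 0)
                .boxed()
                .collect(Collectors.toList());
    }
}
```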


