Q: What are the main integer types and when do you choose each?

All major databases offer a family of fixed-size integers: | Type | Bytes | Range (~) | Use when… | |---|---|---|---| | | 2 | ±32 k | small lookup codes, status flags | | / | 4 | ±2.1 B | most surrogate keys, counters | | | 8 | ±9.2 × 10¹⁸ | high-volume tables, distributed IDs | Rule of thumb: default to for PKs; switch to if you expect more than ~1 billion rows or use globally distributed IDs (snowflakes, UUIDs stored as numbers).

Question 1

Why does picking the right data type matter?

Accepted Answer

Choosing the right type affects storage size, query performance, and data integrity. A correct type rejects bad data at insert time (the database enforces the constraint for free) and lets the engine use internal optimizations (integer comparisons are faster than string comparisons; a  column can use date arithmetic natively). Rule of thumb: choose the narrowest type that correctly represents every valid value — it saves space, speeds up indexes, and keeps invalid data out automatically.

Question 2

What are the main integer types and when do you choose each?

Accepted Answer

All major databases offer a family of fixed-size integers:

Type	Bytes	Range (~)	Use when…
`SMALLINT`	2	±32 k	small lookup codes, status flags
`INT` / `INTEGER`	4	±2.1 B	most surrogate keys, counters
`BIGINT`	8	±9.2 × 10¹⁸	high-volume tables, distributed IDs

-- Postgres auto-increment shorthand
id SERIAL PRIMARY KEY          -- alias for INT + sequence
id BIGSERIAL PRIMARY KEY       -- alias for BIGINT + sequence

-- Standard SQL (Postgres 10+, MySQL 8, SQL Server)
id INT GENERATED ALWAYS AS IDENTITY PRIMARY KEY

Rule of thumb: default to INT for PKs; switch to BIGINT if you expect more than ~1 billion rows or use globally distributed IDs (snowflakes, UUIDs stored as numbers).

Question 3

What is the difference between NUMERIC/DECIMAL and FLOAT/REAL?

Accepted Answer

/  store exact values using binary-coded decimal arithmetic. They never introduce rounding errors and are required for money, tax rates, or any value where "0.10 + 0.20 = 0.30" must hold exactly.  /  /  are IEEE-754 floating-point types. They are faster and more compact but introduce tiny rounding errors, making them unsuitable for financial calculations. Rule of thumb: use  for money and anything that will be summed or compared for equality; use / for scientific measurements where small rounding is acceptable.

Question 4

What is the difference between CHAR, VARCHAR, and TEXT?

Accepted Answer

- * — fixed-length, always  characters, right-padded with   spaces. Trailing spaces are ignored in comparisons in most databases.   Useful only for truly fixed-width codes (country codes, ISO currency codes). -  — variable-length up to  characters. The limit is a   declaration; values shorter than  use less storage. -  — unlimited-length character string (no declared max). Postgres   treats  and  identically at the storage level. MySQL and   SQL Server have different performance trade-offs for very large TEXT values. Rule of thumb:* use  only for fixed-width codes,  for fields with a meaningful business-length cap (email, username), and  for free-form content.

Question 5

What date and time types does SQL offer and how do they differ?

Accepted Answer

| Type | Stores | Timezone-aware? | |---|---|---| |  | year-month-day | No | |  | hour-min-sec | No ( in Postgres) | |  | date + time | No | |  (Postgres) /  (SQL Server) | date + time | Yes — stored as UTC, displayed in session tz | Rule of thumb: always store timestamps with time-zone awareness ( in Postgres,  in SQL Server). Store  alone only when the time component is meaningless (birthdays, holidays).

Question 6

How do databases handle boolean values?

Accepted Answer

Postgres has a native  type that accepts / (and aliases like /, /). MySQL lacks a native boolean — / is an alias for , where  = false and any non-zero = true. SQL Server uses  (0 or 1, no native / literal). Rule of thumb: in Postgres use ; in MySQL use ; in SQL Server use . In all cases, enforce  to keep the flag unambiguous.

Question 7

What is NULL and how does three-valued logic work in SQL?

Accepted Answer

means unknown / missing / not applicable — it is not zero, not empty string, not . SQL uses three-valued logic: a comparison involving  evaluates to , which acts like  in  clauses (the row is excluded). Rule of thumb: never compare with ; always use  / . Use  to substitute a default before comparison.

Question 8

What are UUIDs and when should you use them as primary keys?

Accepted Answer

A UUID (Universally Unique Identifier) is a 128-bit value, usually written as . It is collision-resistant without a central coordinator, making it ideal for distributed inserts and exposing IDs in APIs (not predictable like an integer sequence). Downsides: random UUIDs (v4) cause index fragmentation because inserts scatter across the B-tree. UUIDv7 (time-ordered) mitigates this. Rule of thumb: prefer integer PKs for internal tables; use UUIDs when rows are created across multiple nodes or when IDs are exposed externally and must not be guessable.

Question 9

When should you store data as JSON in a relational column?

Accepted Answer

JSON columns let you persist semi-structured, schema-flexible data (event payloads, third-party API responses) alongside relational data. Postgres's  stores a parsed binary representation — indexable with GIN, fast to query. MySQL 8+ and SQL Server 2016+ also support JSON but store it as text with helper functions. Rule of thumb: use JSON columns for truly variable structures that would otherwise require dozens of nullable columns or a separate EAV table. If you find yourself querying the same JSON key in every  clause, extract it into a proper column.

Question 10

What are ENUM types and what are their trade-offs?

Accepted Answer

An * restricts a column to a predefined list of string labels, enforcing a domain constraint at the type level. Postgres stores  as a user-defined type; MySQL stores it internally as an integer but displays the label. Trade-offs: - ✅ Compact storage, enforced domain, readable values. - ❌ Adding a new label requires  (Postgres) or  (MySQL), which can lock the table. - ❌ Harder to manage via migrations; lookup tables are more flexible. Rule of thumb:* use  for short, stable lists (< 10 values, rarely changing); use a lookup/reference table when the list is large or frequently updated.

Question 11

How do you choose precision and scale for NUMERIC(p, s)?

Accepted Answer

= total significant digits,  = digits to the right of the decimal point. | Value | Type | p | s | |---|---|---|---| |  | price | 7 | 2 | |  | rate | 7 | 6 | |  | balance | 11 | 4 | Postgres and SQL Server will raise an error if a value exceeds the declared precision. MySQL silently rounds or truncates. Rule of thumb: set  to the number of decimal places your business logic requires; set  to  plus the number of digits you expect to the left of the decimal, then add a few digits of headroom.

Question 12

What is the difference between SERIAL and GENERATED AS IDENTITY?

Accepted Answer

Both auto-generate ascending integer PKs, but they differ in SQL standard compliance and control:

SERIAL (Postgres-specific) creates a sequence and sets a DEFAULT nextval(...) on the column. It is an alias, not a type — the column's real type is INTEGER. Users can still INSERT an explicit value, bypassing the sequence.
GENERATED ALWAYS AS IDENTITY (SQL:2003 standard, Postgres 10+, SQL Server, MySQL 8+) formally declares the column as identity-generated. GENERATED ALWAYS prevents manual inserts; GENERATED BY DEFAULT allows them.

-- Old Postgres style
id SERIAL PRIMARY KEY

-- Standard SQL (preferred)
id INT GENERATED ALWAYS AS IDENTITY PRIMARY KEY

Rule of thumb: prefer GENERATED ALWAYS AS IDENTITY for new schemas — it is portable and prevents accidental sequence skips from manual inserts.

Question 13

What is the best way to store monetary values in SQL?

Accepted Answer

Use * (or a higher scale for currencies with sub-cent precision). Never use  — floating-point arithmetic makes  equal , which causes reconciliation errors. Some teams store money as a  of the smallest unit (cents, pence) and convert to a decimal only in the application layer — this avoids any numeric type ambiguity. Rule of thumb:* use  in SQL; if you also need speed at very high throughput, use  cents and divide by 100 in the application. Document which approach you use in the column comment.

Question 14

When would you store binary data in a SQL column?

Accepted Answer

Binary columns ( in Postgres, / in MySQL/SQL Server) store raw byte sequences — images, PDFs, encrypted values, hashes. In practice, storing large blobs directly in the database bloats the table, slows backups, and is rarely optimal compared to object storage (S3, GCS) with only a URL or key in the DB. Rule of thumb: store binary data in the database only when it is small (< 1 MB), must be transactionally consistent with other columns, or access patterns demand it. Otherwise, use object storage and keep a reference key.

Question 15

What is implicit type casting and why can it be dangerous?

Accepted Answer

Implicit casting (coercion) happens when the database silently converts a value from one type to another to satisfy a comparison or expression. This can cause index scans to degrade into full-table scans if the cast prevents the engine from using the index on the original column. Rule of thumb: always compare like types. Mismatched types in  predicates are a common source of unexpected full-table scans — check with  when in doubt.

Question 16

When should you use array columns (Postgres) instead of a child table?

Accepted Answer

Postgres supports  columns that hold a list of any base type (, , ). They can save a join for read-heavy denormalized patterns but lose referential integrity and are harder to index and update partially. Rule of thumb: use arrays when the list is small, ordered, read far more than written, and does not need referential integrity or per-element queries. Use a child table when you need FK constraints, ordering, or per-row metadata.

Data Types Interview Questions & Answers

Why does picking the right data type matter?

What are the main integer types and when do you choose each?

What is the difference between NUMERIC/DECIMAL and FLOAT/REAL?

What is the difference between CHAR, VARCHAR, and TEXT?

What date and time types does SQL offer and how do they differ?

How do databases handle boolean values?

What is NULL and how does three-valued logic work in SQL?

What are UUIDs and when should you use them as primary keys?

When should you store data as JSON in a relational column?

What are ENUM types and what are their trade-offs?

How do you choose precision and scale for NUMERIC(p, s)?

What is the difference between SERIAL and GENERATED AS IDENTITY?

What is the best way to store monetary values in SQL?

When would you store binary data in a SQL column?

What is implicit type casting and why can it be dangerous?

When should you use array columns (Postgres) instead of a child table?

More ways to practice

Type	Stores	Timezone-aware?
`DATE`	year-month-day	No
`TIME`	hour-min-sec	No (`TIMETZ` in Postgres)
`TIMESTAMP`	date + time	No
`TIMESTAMPTZ` (Postgres) / `DATETIMEOFFSET` (SQL Server)	date + time	Yes — stored as UTC, displayed in session tz

Why does picking the right data type matter?

What are the main integer types and when do you choose each?

What is the difference between NUMERIC/DECIMAL and FLOAT/REAL?

What is the difference between CHAR, VARCHAR, and TEXT?

What date and time types does SQL offer and how do they differ?

How do databases handle boolean values?

What is NULL and how does three-valued logic work in SQL?

What are UUIDs and when should you use them as primary keys?

When should you store data as JSON in a relational column?

What are ENUM types and what are their trade-offs?

How do you choose precision and scale for NUMERIC(p, s)?

What is the difference between SERIAL and GENERATED AS IDENTITY?

What is the best way to store monetary values in SQL?

When would you store binary data in a SQL column?

What is implicit type casting and why can it be dangerous?

When should you use array columns (Postgres) instead of a child table?

More Schema & Data Types interview questions

More ways to practice