Question 1

What is Uvicorn and why is it the recommended server for FastAPI?

Accepted Answer

Uvicorn is a fast ASGI server built around uvloop (a C-accelerated asyncio event loop) and httptools (a C-based HTTP parser), both of which are optional extras. FastAPI requires an ASGI server because it's built on Starlette, which uses the ASGI protocol.

Key flags:

- --reload: hot reload for development
- --workers N: multiple processes (a single worker means one event loop)
- --host 0.0.0.0: listen on all interfaces

Rule of thumb: use uvicorn[standard] in production for the uvloop speedup; use plain uvicorn in Docker to keep the image small (uvloop has C deps).
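To make the ASGI requirement concrete, here is a minimal sketch of a raw ASGI application, the kind of callable an ASGI server drives and that Starlette and FastAPI wrap. The names asgi_app and call_once and the response text are illustrative assumptions, and this sketch handles only plain HTTP requests.

```python
import asyncio


async def asgi_app(scope, receive, send):
    """A bare ASGI application: one coroutine call per connection."""
    if scope["type"] != "http":
        return  # this sketch only handles HTTP requests
    body = f"hello from {scope['path']}".encode()
    # First message: status line and headers.
    await send({
        "type": "http.response.start",
        "status": 200,
        "headers": [
            (b"content-type", b"text/plain"),
            (b"content-length", str(len(body)).encode()),
        ],
    })
    # Second message: the response body.
    await send({"type": "http.response.body", "body": body})


async def call_once(path):
    """Drive the app by hand, playing the role the server normally plays."""
    sent = []

    async def receive():
        return {"type": "http.request", "body": b"", "more_body": False}

    async def send(message):
        sent.append(message)

    scope = {"type": "http", "method": "GET", "path": path, "headers": []}
    await asgi_app(scope, receive, send)
    return sent


messages = asyncio.run(call_once("/ping"))
print(messages[0]["status"], messages[1]["body"])
```

Saved in a module (say a hypothetical raw_app.py), the same callable could be served with uvicorn raw_app:asgi_app; FastAPI produces an object with this same calling convention.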

Question 2

Why combine Gunicorn with Uvicorn workers and how do you configure it?

Accepted Answer

Uvicorn alone handles one event loop per process. Gunicorn is a battle-tested process manager that handles worker lifecycle (respawning crashed workers, graceful restarts, signal handling). Together they give you:

- Gunicorn's robust process management.
- Uvicorn's async event loop per worker.

pip install gunicorn
gunicorn app.main:app \
    -k uvicorn.workers.UvicornWorker \
    --workers 4 \
    --bind 0.0.0.0:8000 \
    --timeout 120

UvicornWorker replaces Gunicorn's default sync worker with an async one.

Rule of thumb: use Gunicorn + UvicornWorker for traditional deployments on VMs/bare metal; use Uvicorn directly in Kubernetes where the orchestrator handles pod restarts.
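The same options can also live in a Gunicorn config file, which is a plain Python module. The sketch below mirrors the command-line flags shown above; the file name gunicorn.conf.py is a common convention and an assumption here.

```python
# gunicorn.conf.py: a Gunicorn config file is an ordinary Python module.

# Address and port to listen on (same as --bind).
bind = "0.0.0.0:8000"

# Worker class (same as -k): each worker process runs a Uvicorn event loop.
worker_class = "uvicorn.workers.UvicornWorker"

# Number of worker processes (same as --workers).
workers = 4

# Seconds a worker may stay silent before Gunicorn restarts it (same as --timeout).
timeout = 120
```

It would then be started with gunicorn -c gunicorn.conf.py app.main:app, which keeps deployment settings in version control instead of in a long shell command.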

Question 3

How many workers should you run per server for a FastAPI app?

Accepted Answer

The classic formula is (2 × CPU cores) + 1. However, FastAPI is async: a single worker handles many concurrent requests through the event loop. For I/O-bound apps (most web APIs), 2–4 workers per machine is often sufficient:

| App type | Worker count |
|----------|-------------|
| Pure async I/O | 2–4 per machine |
| Mixed sync/async | 2 × cores |
| CPU-bound | 1 per core (use multiprocessing separately) |

Rule of thumb: start with a worker count from the table above; profile under load and reduce it if workers share limited resources (DB connections, RAM).
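The guidance above can be turned into a small helper for picking a starting point. This is a sketch: the function names, the app-type labels, and the choice to clamp the core count into the 2–4 range for pure async apps are assumptions made for illustration.

```python
import os


def classic_formula(cores):
    """The classic sizing formula: (2 x cores) + 1."""
    return 2 * cores + 1


def suggest_workers(app_type, cores=None):
    """Return a starting worker count for a hypothetical app-type label."""
    cores = cores or os.cpu_count() or 1
    if app_type == "async_io":
        # Pure async I/O: 2-4 per machine (assumption: clamp cores into that range).
        return min(4, max(2, cores))
    if app_type == "mixed":
        # Mixed sync/async: 2 x cores.
        return 2 * cores
    if app_type == "cpu_bound":
        # CPU-bound: 1 per core.
        return cores
    raise ValueError(f"unknown app type: {app_type!r}")


for kind in ("async_io", "mixed", "cpu_bound"):
    print(kind, suggest_workers(kind, cores=8))
```

The result is only a first guess; load testing decides the final number.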

Question 4

What is the difference between concurrency and parallelism in the context of FastAPI?

Accepted Answer

- Concurrency: multiple tasks make progress by interleaving on a single CPU (one event loop thread handles thousands of waiting I/O operations).
- Parallelism: multiple tasks run simultaneously on multiple CPUs (multiple Uvicorn worker processes, each with its own event loop).

FastAPI gives you concurrency within a single worker via async def handlers. You get parallelism by running multiple workers.

CPU-bound code (heavy computation) occupies a core and blocks that worker's event loop; neither concurrency nor more async code helps. Use a thread/process pool or a task queue (a thread pool only helps when the work releases the GIL).

Rule of thumb: async handlers add concurrency (better I/O throughput per worker); more workers add parallelism (better CPU utilisation).
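A small standard-library sketch can illustrate the concurrency half of this distinction. The functions fake_io and cpu_heavy are hypothetical stand-ins for a real I/O call and real computation, and asyncio.to_thread requires Python 3.9 or later.

```python
import asyncio
import time


async def fake_io(delay):
    """Stand-in for a database or HTTP call: waits without using the CPU."""
    await asyncio.sleep(delay)
    return delay


def cpu_heavy(n):
    """Stand-in for CPU-bound work: holds its thread until it finishes."""
    return sum(i * i for i in range(n))


async def main():
    start = time.perf_counter()
    # Ten 0.1 s waits interleave on a single event loop thread.
    await asyncio.gather(*(fake_io(0.1) for _ in range(10)))
    elapsed = time.perf_counter() - start

    # Blocking work is moved off the event loop thread so the loop can keep
    # serving other tasks while it runs.
    total = await asyncio.to_thread(cpu_heavy, 100_000)
    return elapsed, total


elapsed, total = asyncio.run(main())
print(f"10 waits of 0.1 s finished in {elapsed:.2f} s")
```

Run sequentially, the ten waits would take about a second; interleaved on one loop they finish in roughly the time of one. For sustained CPU-bound work, a process pool or task queue is the better fit, as noted above.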

Question 5

How do you enable hot reload in Uvicorn for development?

Accepted Answer

Pass the --reload flag on the command line, or use the Python API (better for IDEs) by calling uvicorn.run with reload=True. The --reload-dir option restricts watching to the given directory, avoiding false reloads when unrelated files or test outputs change.

Rule of thumb: never use --reload in production; it adds overhead and restarts the process whenever a watched file changes.
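As a sketch of the Python API route, the snippet below keeps the development options in one place and starts Uvicorn from a function. The import string app.main:app follows the layout used elsewhere in this document; the app directory name, the port, and the names DEV_OPTIONS and run_dev_server are assumptions.

```python
# Development-only server options (hypothetical values).
DEV_OPTIONS = {
    "host": "127.0.0.1",
    "port": 8000,
    "reload": True,           # restart the server when watched files change
    "reload_dirs": ["app"],   # only watch the application package
}


def run_dev_server():
    """Start Uvicorn with auto-reload; intended for local development only."""
    # Imported inside the function so that importing this module never
    # starts a server or requires Uvicorn to be installed.
    import uvicorn

    # Reload mode needs the app as an import string, not an object.
    uvicorn.run("app.main:app", **DEV_OPTIONS)
```

Calling run_dev_server() from an IDE run configuration gives the same behaviour as the command-line flag, with breakpoints available.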

Question 6

What is the recommended Dockerfile structure for a FastAPI application?

Accepted Answer

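Use a multi-stage build for smaller images: a builder stage installs the dependencies, and a slim final stage copies in only the installed packages and the application source.

Rule of thumb: use --no-cache-dir in pip installs to keep image size down; copy the dependency manifest (for example requirements.txt) before the source code so the dependency layer stays cached.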

Question 7

How do you add a health check endpoint in FastAPI for container orchestration?

Accepted Answer

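Expose separate liveness and readiness endpoints. The Kubernetes livenessProbe uses the liveness endpoint; the readinessProbe uses the readiness endpoint.

Rule of thumb: readiness should check actual dependencies (DB, cache); liveness should only check that the process is alive, because a failing liveness probe makes Kubernetes kill and restart the container.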
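A minimal sketch of the two endpoints might look like the following. The paths /health/live and /health/ready, the check_database stub, and the helper names are assumptions for illustration, not values taken from the original answer.

```python
import asyncio


async def check_database():
    """Hypothetical dependency check; replace with a real ping such as SELECT 1."""
    await asyncio.sleep(0)
    return True


async def readiness_report(checks):
    """Run every dependency check and report which ones passed."""
    results = {}
    for name, check in checks.items():
        try:
            results[name] = bool(await check())
        except Exception:
            # A check that raises counts as a failed dependency.
            results[name] = False
    return all(results.values()), results


def create_app():
    """Application factory; FastAPI is imported here to keep the checks testable alone."""
    from fastapi import FastAPI, Response

    app = FastAPI()

    @app.get("/health/live")
    async def live():
        # Liveness: no dependency checks, only proves the process responds.
        return {"status": "alive"}

    @app.get("/health/ready")
    async def ready(response: Response):
        ok, results = await readiness_report({"database": check_database})
        if not ok:
            response.status_code = 503  # tells the orchestrator to stop routing traffic
        return {"ready": ok, "checks": results}

    return app
```

Keeping the liveness handler free of dependency checks means a slow database leads to traffic being withheld rather than to a restart loop.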

Question 8

How does FastAPI/Uvicorn handle graceful shutdown?

Accepted Answer

When Uvicorn receives SIGTERM (sent by Kubernetes, Docker, or Gunicorn):

1. It stops accepting new connections.
2. It waits for in-flight requests to complete (up to --timeout-graceful-shutdown seconds).
3. It calls the lifespan shutdown code (after yield).
4. It closes the event loop and exits.

In Kubernetes, set a preStop hook to delay pod termination so the load balancer routes traffic away before the pod stops accepting connections.

Rule of thumb: always set --timeout-graceful-shutdown to slightly less than Kubernetes' terminationGracePeriodSeconds so in-flight requests can finish before the pod is forcibly killed.
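Step 3 refers to the lifespan handler: code before yield runs at startup, code after yield runs at shutdown. The sketch below shows that shape using only the standard library and drives it by hand; the events list and the connection-pool wording are illustrative assumptions.

```python
import asyncio
from contextlib import asynccontextmanager

events = []


@asynccontextmanager
async def lifespan(app):
    # Startup: runs once before the server begins handling requests.
    events.append("startup: open connection pool")
    yield
    # Shutdown: runs after in-flight requests have finished.
    events.append("shutdown: close connection pool")


async def simulate_server():
    """Stand-in for the server: enter the lifespan, serve, then exit it."""
    async with lifespan(None):
        events.append("serving requests")


asyncio.run(simulate_server())
print(events)
```

In a real application the same function would be passed as FastAPI(lifespan=lifespan), and the server would enter and exit it around the serving phase.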

Question 9

How do you pass environment variables to a FastAPI app running in Docker?

Accepted Answer

Pass them at container run time (don't bake secrets into the image) with docker run -e, or load them from a file with --env-file. In Docker Compose, use the environment or env_file keys. In Kubernetes, use a Secret (base64-encoded) for sensitive values.

Rule of thumb: never embed production secrets in the Docker image; always inject them at runtime from secrets management.
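On the application side, the injected variables still have to be read and validated. The following is a minimal standard-library sketch; the variable names DATABASE_URL and DEBUG and the Settings class are hypothetical.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    database_url: str
    debug: bool = False


def load_settings(environ=None):
    """Build settings from environment variables, failing fast if one is missing."""
    environ = os.environ if environ is None else environ
    try:
        database_url = environ["DATABASE_URL"]  # required, injected at runtime
    except KeyError:
        raise RuntimeError("DATABASE_URL must be set at runtime") from None
    # Optional flag: accept a few common truthy spellings.
    debug = environ.get("DEBUG", "false").lower() in {"1", "true", "yes"}
    return Settings(database_url=database_url, debug=debug)


settings = load_settings({"DATABASE_URL": "postgresql://db/app", "DEBUG": "1"})
print(settings)
```

Failing at startup when a required variable is absent surfaces a misconfigured container immediately instead of on the first request.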

Question 10

How do you serve multiple FastAPI apps from a single Uvicorn process?

Accepted Answer

Mount sub-applications using Starlette's mount() method, which attaches a complete ASGI app under a path prefix.

Rule of thumb: mount versioned sub-apps when versions differ so significantly that sharing middleware or the OpenAPI schema would be confusing.
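A sketch of versioned sub-apps mounted under one parent is shown below. The /v1 and /v2 prefixes, the /items route, and the payloads are assumptions chosen for illustration.

```python
def create_app():
    """Application factory that mounts two versioned sub-apps under one parent."""
    # Imported inside the factory so that importing this module does not
    # require FastAPI to be installed.
    from fastapi import FastAPI

    api_v1 = FastAPI(title="API v1")
    api_v2 = FastAPI(title="API v2")

    @api_v1.get("/items")
    def list_items_v1():
        # Hypothetical v1 payload: a bare list.
        return ["a", "b"]

    @api_v2.get("/items")
    def list_items_v2():
        # Hypothetical v2 payload: a wrapped list with a count.
        return {"items": ["a", "b"], "count": 2}

    app = FastAPI(title="Gateway")
    # Each mounted app keeps its own routes, middleware, and OpenAPI schema.
    app.mount("/v1", api_v1)
    app.mount("/v2", api_v2)
    return app
```

With this layout, a single process started with uvicorn's --factory option would serve /v1/items and /v2/items, and each sub-app would publish its own documentation under its prefix.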

Question 11

How do you run Uvicorn with SSL/TLS in development?

Accepted Answer

Pass --ssl-keyfile and --ssl-certfile with the paths to the private key and the certificate.

In production, don't terminate TLS in Uvicorn; put Nginx, Caddy, or an ALB in front. TLS termination in the reverse proxy lets you automate Let's Encrypt certificates, handle certificate rotation without restarting Uvicorn, and offload TLS overhead.

Rule of thumb: Uvicorn TLS is fine for internal service-to-service encryption or local dev HTTPS; for public-facing production, use a reverse proxy for TLS.
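The same two options are available as keyword arguments to uvicorn.run. The sketch below adds a small pre-flight check for the certificate files; the file paths, the port, and the helper names are assumptions for a local setup.

```python
from pathlib import Path

# Hypothetical locations of a locally generated development certificate.
TLS_OPTIONS = {
    "host": "127.0.0.1",
    "port": 8443,
    "ssl_keyfile": "certs/localhost-key.pem",
    "ssl_certfile": "certs/localhost.pem",
}


def missing_tls_files(options):
    """Return the configured key/cert paths that do not exist on disk."""
    keys = ("ssl_keyfile", "ssl_certfile")
    return [options[k] for k in keys if not Path(options[k]).is_file()]


def run_https_dev_server():
    """Start Uvicorn with TLS for local development."""
    missing = missing_tls_files(TLS_OPTIONS)
    if missing:
        raise FileNotFoundError(f"missing TLS files: {missing}")
    # Imported here so the check above can be used without Uvicorn installed.
    import uvicorn

    uvicorn.run("app.main:app", **TLS_OPTIONS)
```

Checking for the files first gives a clearer error than a failure deep inside the server's TLS setup.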
