URL Shortener (TinyURL)

A URL shortener has a simple outer interface, but underneath it is a case about a very hot read path, short-ID generation, and scaling the mapping store.

The chapter is useful because it forces you to separate the write path from the read path: what to cache, where hot keys appear, and how to keep the ID generator from becoming a new failure point.

For interviews and architecture discussions, it is valuable because it quickly reveals whether you can spot read-write asymmetry, identify the real bottleneck, and avoid premature complexity.

Control Plane

Focus on policy, limits, routing, and stable edge behavior under variable load.

Data Path

Keep latency and throughput predictable while traffic and burst pressure increase.

Failure Modes

Cover fail-open/fail-close behavior, graceful degradation, and safe fallback paths.

Ops Ready

Show monitoring for saturation, retry storms, and practical operational guardrails.

URL Shortener (TinyURL, bit.ly) is a classic system design case. It looks simple because the surface area is tiny: create a short link and resolve it later. Underneath, though, it quickly turns into a discussion about compact IDs, extremely hot read paths, caching, and mapping storage that must scale with traffic.

That is why the case works so well in interviews: even on a small system, you still have to discuss latency, throughput, availability, and the cost of mistakes on the most popular user path.

Chapter 8

Alex Xu: URL Shortener

Detailed analysis in the book System Design Interview

Читать обзор

Why a URL shortening service matters

Convenience

A short link is easier to dictate, drop into a post, and fit into an SMS where every character counts.

Analytics

Every redirect passes through the service, so it surfaces clicks, user geography, and traffic sources.

Control

A link can still be disabled, capped with an expiration time, or locked behind a password after it ships.

Requirements

Functional

FR1
Creating a short link from a long URL
FR2
Redirect via short link to original URL
FR3
Link lifetime (TTL, optional)
FR4
Custom alias for the link (optional)

Non-functional

NFR1
100M new URLs per day
NFR2
10:1 read-to-write ratio → 1B redirects per day
NFR3
Redirect latency < 100 ms
NFR4
99.9% availability

Quick scale estimate

Traffic

Write: 100M/day = 1,160 QPS
Read: 1B/day = 11,600 QPS
Peak: ~23,000 QPS (2x average)

Storage

Avg URL size: 500 bytes
100M × 500B = 50GB/day
5 years: 50GB × 365 × 5 ≈ 90TB

Length of the short URL

How many characters are needed for a unique identifier? We use base62 (a-z, A-Z, 0-9):

Length	Combinations	URLs over 5 years
6 characters	62⁶ = 56.8B	Not enough
7 characters	62⁷ = 3.5T	✓ Enough
8 characters	62⁸ = 218T	With reserve

Conclusion: 7 base62 characters give 3.5 trillion combinations. At 100M URLs per day, that lasts for about 96 years.

ID generation strategies

1
Hash + Collision Resolution

Take MD5/SHA256 of the URL, keep the first 7 characters, and check whether a collision occurs.

✓ Pros:

Deterministic (same URL = same hash)
No central point of failure

✗ Cons:

Collisions require retry + DB lookup
Harder to support custom aliases

2
Unique ID Generator + Base62
Recommended

Generate a unique numeric ID, then convert it to base62.

✓ Pros:

Guaranteed unique (no collisions)
Simple logic
Easy to support custom aliases

✗ Cons:

Requires a dedicated ID generator service
Identical URLs can give different short URLs

Options for generating IDs

Auto-increment DB

A simple solution with an auto-increment primary key.

⚠️ Becomes a single point of failure and scales poorly

Multi-master DB

Two servers: one generates even IDs and the other generates odd IDs.

✓ Easy to scale at first, but limited by the number of writer nodes

UUID

128-bit unique identifier generated on the client.

⚠️ Too long at 36 characters, which defeats the point of a short link

Snowflake ID
Recommended

64-bit ID: timestamp + datacenter + machine + sequence number.

✓ Distributed, time-sortable, and compact

Snowflake

Twitter/X: Snowflake ID

Detailed analysis of the ID generation algorithm

Читать обзор

High-level architecture

Architecture map

Choose a path to highlight it in the diagram

Client

Browser / App

Load Balancer

Edge routing

URL Service

Stateless API

Read path

Cache

Redis

Database

PostgreSQL / Cassandra

Write path

ID Generator

Snowflake

Database

PostgreSQL / Cassandra

Client

Browser / App

Load Balancer

Edge routing

URL Service

Stateless API

Cache

Redis

ID Generator

Snowflake

Database

PostgreSQL / Cassandra

Note

A cache miss is shown as a dashed line to the database.

Write flow

Read flow

Write Path

1. The client sends a long URL
2. The ID generator produces a unique identifier
3. Convert the ID to base62 and produce the short URL
4. Store the mapping in the database
5. Return the short URL to the client

Read Path

1. The client requests the short URL
2. Check the cache (Redis) first
3. On a cache miss, query the database
4. Refresh the cache
5. Return HTTP 301/302

301 vs 302 redirects

301 Moved Permanently

The browser caches the redirect, so later requests may go straight to the target URL.

✓ Less load on the server

✗ Much harder to keep full click analytics on the service side

302 Found
Recommended

The browser does not cache the redirect, so every click still passes through the service.

✓ Full click analytics

✓ You can change the target URL

Data model

urls table

Column	Type	Description
short_url	VARCHAR(7)	Primary key, base62 encoded
original_url	TEXT	Original long URL
user_id	BIGINT	Link creator (optional)
created_at	TIMESTAMP	Creation date
expires_at	TIMESTAMP	TTL (null = unlimited)

Deep Dive

Database Internals

Indexes, B-trees, and optimization for read-heavy workloads

Читать обзор

Caching strategy

At a 10:1 read-to-write ratio, without a cache every redirect hits the database, and it becomes the bottleneck on the hot path. Redis keeps the hottest short URLs close to the application and takes the bulk of that load off the database.

Strategy

Cache-aside: read from the cache first, then fall back to the database on a miss
LRU eviction: evict rarely used URLs
Write-through: write to the cache immediately when a short link is created

Cache Size

20% daily reads × avg URL size

= 200M × 500B = 100GB

→ Redis cluster with replication

CDN

Content Delivery Network

Geo-distributed caching for global systems

Читать обзор

Choosing a database

PostgreSQL

✓ ACID guarantees
✓ Easy to use
✓ Good for moderate traffic
✗ Horizontal scaling is more difficult

Cassandra / DynamoDB
For scale

✓ Linear horizontal scaling
✓ High availability without a single point of failure
✓ Optimized for write-heavy workloads
✗ Eventually consistent

What to emphasize in an interview

In an interview, the point is not just to draw a diagram. You want to make the trade-offs explicit: why this ID strategy fits the case, how caching changes the read path, and where the service gives up flexibility in exchange for speed, simplicity, or lower cost.

What to show clearly

• How base62 works and why 7 characters are enough

• What trade-offs exist between hashing and an ID generator

• Why 301 and 302 affect analytics differently

• Why caching is the main lever in a read-heavy system

Frequent follow-up questions

• How do you handle duplicate URLs?

• How do you support custom aliases?

• How do you delete expired URLs?

• How do you protect the service from abuse?

References

Twitter Engineering — Snowflake: distributed unique ID generation (twitter-archive, GitHub)Bitly Engineering — Software Scalability Explained (Bitly blog, 2023)Alex Xu — Design a URL Shortener, System Design Interview (ByteByteGo)Hello Interview — Design a URL Shortener Like Bitly (problem breakdown)

Related chapters

Design principles for scalable systems - gives baseline intuition for latency and throughput and helps evaluate URL shortener trade-offs as traffic grows.
Caching strategies: Cache-Aside, Read-Through, Write-Through, Write-Back - goes deeper on the read path: hit rate, invalidation, and the right caching pattern for redirects.
Rate Limiter - covers protecting shorten/redirect APIs from abuse, bots, and burst traffic.
Replication and sharding - shows how to scale storage for short-to-long URL mappings as dataset size and QPS grow.
API Gateway - adds the outer edge layer: routing, authentication, rate limiting, and boundary policies for a public API.
CDN - extends the topic to global delivery and edge caching so redirects stay fast across regions.
System Design Interview — An Insider's Guide - contains the classic TinyURL interview breakdown with a step-by-step solution flow.

Why a URL shortening service matters

Convenience

Analytics

Control

Requirements

Functional

Non-functional

Quick scale estimate

Traffic

Storage

Length of the short URL

ID generation strategies

1Hash + Collision Resolution

2Unique ID Generator + Base62Recommended

Options for generating IDs

Auto-increment DB

Multi-master DB

UUID

Snowflake IDRecommended

High-level architecture

Architecture map

Write Path

Read Path

301 vs 302 redirects

301 Moved Permanently

302 FoundRecommended

Data model

urls table

Caching strategy

Strategy

Cache Size

Choosing a database

PostgreSQL

Cassandra / DynamoDBFor scale

What to emphasize in an interview

What to show clearly

Frequent follow-up questions

References

Related chapters

1
Hash + Collision Resolution

2
Unique ID Generator + Base62
Recommended

Snowflake ID
Recommended

302 Found
Recommended

Cassandra / DynamoDB
For scale