System Design Space
Knowledge graphSettings

Updated: May 4, 2026 at 8:57 PM

Cassandra: The Definitive Guide (short summary)

hard

This Cassandra book becomes especially useful once you stop seeing it as just an AP label from a CAP diagram and start reasoning through its real write, storage, and failure model.

In engineering practice, it connects tunable consistency, data partitioning, LSM-style storage, and query modeling to product demands where availability and linear write growth matter more than universal querying.

In interviews and architecture discussions, this chapter is strong because it lets you state Cassandra's limits honestly: it solves some classes of problems extremely well, but needs complementary designs where exploratory queries or strict consistency are expected.

Practical value of this chapter

AP-first mindset

Use Cassandra where availability and linear write scaling under network partitions are primary requirements.

Data model by query

Design tables from business query paths; partition-key quality directly affects latency and load balance.

Operational discipline

Treat repair, compaction tuning, and tombstone monitoring as mandatory parts of architecture operations.

Interview limitations

Call out constraints clearly: ad-hoc querying and strict consistency often need complementary patterns.

Original

Telegram: Book Cube

Original post with analysis of Cassandra.

Перейти на сайт

Cassandra: The Definitive Guide, 3rd Edition

Authors: Jeff Carpenter, Eben Hewitt
Publisher: O'Reilly Media, Inc.
Length: 426 pages

Short summary of Cassandra's wide-column data model, Bigtable and Dynamo roots, leaderless architecture, token ring, tunable consistency, and LSM-style storage.

Original

Apache Cassandra is one of the best-known NoSQL databases. It combines Google Bigtable's wide-column model with Amazon Dynamo's leaderless distribution ideas: peer nodes, replication, and per-operation consistency choices. This chapter explains where that architecture shines and where its trade-offs become design constraints.

Origin: Bigtable and Dynamo

Google Bigtable

From Bigtable, Cassandra took:

  • A wide-column data model
  • LSM-style storage with MemTable, SSTable, and background compaction
  • Sequential commit-log writes before data is flushed to disk

Amazon Dynamo

From Dynamo, Cassandra inherited:

  • Consistent hashing for key distribution
  • A gossip protocol for sharing node state
  • Tunable consistency for reads and writes

Architecture visualization

Ring Topology

ABCDEFToken Ring0 - 100

Consistent hashing

Choose a key to see how it is distributed across the ring (RF=3):

Replication Factor = 3

Each key is stored on 3 nodes: primary node and the next 2 clockwise nodes.

Write Path

  1. Client -> any node (coordinator)
  2. Coordinator computes a token for the key
  3. Token selects the primary range owner and RF-1 replicas
  4. Write is sent to the required replicas in parallel
Primary Node
Replica Nodes
Gossip Protocol

Related chapter

CAP theorem

The core distributed-systems constraint: consistency, availability, and tolerance to network partitions.

Читать обзор

Cassandra and the CAP theorem

AP

Availability + Partition Tolerance

Cassandra is AP by default

During a network partition, Cassandra prefers to remain available, even if a read may observe an older version of the data. That makes it useful when refusing writes is more dangerous than temporary replica divergence.

Tunable Consistency

Cassandra lets you choose the consistency level for each request. With QUORUM or ALL, behavior moves closer to CP, but latency and availability pay the price.

Related chapter

PACELC theorem

CAP extended with trade-offs during normal operation.

Читать обзор

Cassandra and PACELC

PA/EL

Partition → Availability, Else → Latency

Cassandra is often treated as a PA/EL system

if P (Partition)

Under a partition, Cassandra preserves availability: the reachable side of the cluster can still accept requests.

else E (Normal)

In normal operation, Cassandra usually chooses low latency: a request does not have to wait for every replica.

This is why Cassandra's consistency model is not only a response to partitions. Even when the network is healthy, it can exchange some strictness for faster responses and resilience to individual node failures.

Related chapter

Consistency and idempotency

Practical consistency models, idempotent operations and trade-offs.

Читать обзор

Consistency levels

Cassandra offers a range of consistency levels, from highly available writes to reads and writes confirmed by a majority of replicas:

ANY

The write succeeds even if it is only stored temporarily for an unavailable replica.

Min. latency
ONE

A single replica response is enough. Fast, but a read may observe stale data.

QUORUM

Requires responses from a majority of replicas (RF/2 + 1). A practical balance between latency and consistency.

Recommended
ALL

Requires responses from all replicas. Maximum consistency, lowest availability.

SERIAL

Linearizability through Paxos. Used for lightweight transactions and conditional CAS operations.

LWT

Strong consistency formula: If W + R > RF, where W is the write consistency level, R is the read consistency level, and RF is the replication factor, then a read must intersect with at least one replica that acknowledged the latest write.

Related chapter

Replication and sharding

How to choose a sharding strategy, maintain shard balance and scale writing/reading.

Читать обзор

Architecture

LSM-style storage

1. The write is first recorded in the commit log for crash recovery

2. Fresh data accumulates in MemTable

3. A flush writes immutable SSTable files to disk

4. Background compaction merges tables and removes obsolete versions

Data distribution

• Consistent hashing maps key → token → node

• Virtual nodes help spread ranges more evenly

• NetworkTopologyStrategy places replicas with rack and data-center topology in mind

• Replication factor defines how many copies are stored

Gossip protocol

• Nodes exchange state without a central coordinator

Phi Accrual Failure Detector

• No single point of failure

• Each node periodically gossips with a few neighbors

Fault tolerance

• Hinted handoff stores writes temporarily for unavailable replicas

• Read repair fixes replica divergence during reads

• Anti-entropy repair reconciles replicas in the background

• Merkle trees make large-range comparison efficient

When Cassandra fits and when it gets in the way

✓ Good fit

  • Time-series data: IoT, metrics, logs
  • Large write streams with predictable key-based reads
  • Geographically distributed systems
  • When availability matters more than strict consistency
  • Activity feeds and event queues with known query paths

✗ Not the best choice

  • ACID transactions across multiple records
  • Complex joins and analytical exploration
  • Frequently changing data schemas
  • Small amounts of data (<100GB)
  • Strict read-latency goals for complex filters

History

A detailed history of Cassandra can be found in the chapter "Cassandra: architecture and trade-offs" — the link leads directly to the timeline.

Today, Cassandra is used in large products, including Netflix, Apple, and Uber, when teams need high write throughput, geographic distribution, and resilience to individual node failures.

Key takeaways

  • 1.Cassandra is an AP system in CAP terms and PA/EL in PACELC terms: its strength is availability and low latency with explicit consistency trade-offs
  • 2.Tunable consistency lets you choose the latency and consistency cost separately for reads and writes
  • 3.The leaderless architecture keeps nodes equal, while gossip helps them exchange cluster state
  • 4.LSM-style storage gives high write throughput through sequential I/O and background compaction
  • 5.Cassandra fits write-heavy workloads, time-series data, and geographically distributed systems

Related chapters

  • CAP theorem - Core framework for understanding why Cassandra prioritizes availability under partitions and how that affects system-level SLAs.
  • PACELC theorem - Extension of CAP that explains Cassandra latency-versus-consistency trade-offs even in healthy network conditions.
  • Cassandra: architecture and trade-offs - Practical overview of Cassandra evolution, architecture constraints, and production operating patterns.
  • Replication and sharding - Operational guidance for replication factor, data placement, and rebalancing in large Cassandra deployments.
  • Jepsen and consistency models - How fault-injection testing validates distributed guarantees and what it means for real Cassandra consistency behavior.
  • Database Selection Framework - Selection framework to justify Cassandra for write-heavy and geo-distributed workloads in modern architectures.

Where to find the book

Enable tracking in Settings