Distributed Systems
23 chapters
This page contains all chapters in this theme. Open chapters in sequence or use this page as a section map.
Why are distributed systems and consistency needed?
Original Content · easy · Introductory chapter: why we need consistency models, consensus, and failure handling.
CAP theorem
Original Content · medium · A fundamental limitation of distributed systems: consistency, availability, and partition tolerance. History, misconceptions, ACID vs BASE.
PACELC theorem
Original Content · hard · Extending CAP: trade-offs between latency and consistency during normal operation, when there is no partition. System classification: PA/EL, PC/EC, PA/EC, PC/EL.
Clock Synchronization in Distributed Systems
Original Content · medium · Time synchronization in practice: physical vs logical clocks, NTP/PTP, the impact of clock skew, and architectural safeguards against clock drift.
Leader Election: patterns and implementations
Original Content · medium · How to design leader election: leases, quorum, failover, split-brain protection, and practical implementations on Raft/ZooKeeper/etcd/Kubernetes.
Consensus: Paxos and Raft
Original Content · expert · How systems agree on a single value: quorums, two-phase Paxos, and leader-centric Raft.
Leslie Lamport: Causality, Paxos and Engineering Thinking
Documentary · hard · How Lamport's ideas (happens-before, logical clocks, Paxos, TLA+) grew out of physics and why they are critical for modern distributed systems.
Distributed Transactions: 2PC and 3PC
Original Content · hard · Practical analysis of distributed transactions: coordinator, prepare/commit phases, failure modes, blocking trade-offs, and alternatives via Saga/outbox.
Jepsen and consistency models
Original Content · expert · A distributed systems testing project: hierarchy of consistency models, Serializable vs Linearizable, known findings.
Testing Distributed Systems
Original Content · hard · A practical approach to testing distributed systems: chaos engineering, contract testing, and integration testing at scale.
Designing Data-Intensive Applications (short summary)
Book Summary · hard · Analysis of the book by Martin Kleppmann: data models, replication, partitioning, transactions, batch and stream processing.
Distributed Systems: Principles and Paradigms (short summary)
Book Summary · expert · The seminal work of Tanenbaum and van Steen: architectures, coordination, consistency, fault tolerance, and security.
Google Global Network: Evolution and Architectural Principles for the AI Age
Original Content · hard · Evolution of the Google network from the internet, streaming, and cloud eras to the AI era: WAN as the new LAN, multi-shard design, Protective ReRoute, intent-driven programmability, and autonomous operations.
Streaming Data (short summary)
Book Summary · hard · Andrew Psaltis on stream processing: Collection/Queue/Analysis tiers, delivery semantics, data windows, stream algorithms.
Kafka: The Definitive Guide (short summary)
Book Summary · medium · A distributed stream processing platform: producers, consumers, partitions, replication, delivery semantics, and Kafka Streams.
Kappa Architecture: stream-first alternative to Lambda
Original Content · hard · A single streaming pipeline without a separate batch layer: immutable log, materialized views, replay/backfill, and comparison with Lambda.
Data Pipeline / ETL / ELT Architecture
Original Content · medium · How to design a data pipeline: batch + streaming, ETL vs ELT, orchestration, data quality, recovery, cost control, and operational reliability.
Apache Iceberg: table architecture in the data lake
Original Content · hard · Practical analysis of Apache Iceberg: snapshots, manifests, ACID in the data lake, schema evolution, hidden partitioning, time travel, and the place of Tableflow in the streaming pipeline.
Big Data (short summary)
Book Summary · hard · Nathan Marz on Lambda Architecture: batch/serving/speed layers, data immutability, HyperLogLog, and practical examples.
Data Mesh in Action
Book Summary · hard · A practical guide to adopting data mesh: domain ownership, data as a product, federated computational governance, self-serve platforms, and an MVP in one month.
Brief overview of the T-Bank data platform
Original Content · hard · Evolution of T-Bank's data platform: from DWH approaches to Lakehouse, key areas of the platform, scale, and practical architectural conclusions.
Data platforms: How to build them in 2025 - interview with Nikolay Golov
Documentary · hard · Research Insights Made Simple #6: centralization vs federation, data mesh in practice, OLTP/MPP limitations, and the evolution of data platforms.
Local-First Software: Taking Back Control of Data
Documentary · easy · A short documentary about the local-first approach: offline experience, synchronization, and user control of data.