Every design below has an animated diagram, step-by-step walkthrough, component swap, and in-browser stress simulator. Click any title to read the full walkthrough, or open the live interactive version.
These are the same problems asked at Google, Meta, Amazon, Netflix, Apple, Uber, Stripe, and Airbnb. Every page has a step-by-step walkthrough, capacity estimates, and an in-browser stress simulator — no other free resource combines all three.
Alex Xu, System Design Interview Vol 1, Chapter 5; Karger et al. 1997 'Consistent Hashing and Random Trees'; Amazon Dynamo paper (DeCandia et al. 2007) · 8 components · 3 operations
Problem: Design a distributed KV cluster that spreads keys across N shards, routes requests to the owning shard in O(1), replicates for durability, and rebalances with minimal key movement when nodes are added or removed.
Key components: Client SDK, Coordinator, Consistent-Hash Ring, Shard A (Storage Node), Shard B (Storage Node)
Alex Xu, System Design Interview Vol 1, Chapter 6 (pages 87–109); Dynamo paper (DeCandia et al. 2007); Cassandra & BigTable architectures. · 8 components · 3 operations
Problem: Design an always-writable, horizontally scaled key-value store with tunable consistency: N replicas, configurable W/R quorums, vector-clock conflict resolution, gossip membership, and anti-entropy repair.
Alex Xu, System Design Interview Vol 1, Chapter 7 (pages 110–118); Twitter Snowflake blog (2010) · 6 components · 3 operations
Problem: Design a system that generates globally unique, roughly time-sortable 64-bit IDs at very high throughput across many machines, without a central bottleneck on the write path.
Key components: Client (Service or User Request), Application Server, Snowflake Generator (in-process), Coordinator (ZooKeeper / etcd), ID Usage DB (downstream consumer)
Alex Xu, System Design Interview Vol 2, Chapter 7; Stripe idempotency docs; OpenTable and Booking.com engineering blogs · 9 components · 4 operations
Problem: Design a hotel reservation system (Booking.com / Expedia / direct hotel chain) that takes rooms from search, through hold, payment, and confirmed booking, without overselling.
Alex Xu, System Design Interview Vol 1, Chapter 10; Apple Push Notification service docs; Twilio Programmable Messaging SLAs; SendGrid reliability guide · 12 components · 3 operations
Problem: Design a system that delivers notifications across push (APNs/FCM), SMS, and email channels at scale, with reliable retries, rate limiting, per-user preferences, and minute-scale latency for transactional messages.
Alex Xu, System Design Interview Vol 2, Chapter 11 · 10 components · 4 operations
Problem: Design a payment system that charges customers via external PSPs (Stripe/Braintree), keeps an auditable ledger, supports refunds, and reconciles with the PSP daily.
Key components: Client (Merchant checkout / mobile app), Load Balancer, Payment Service API, Idempotency Store (Redis), Payment Orchestrator
Alex Xu, System Design Interview Vol 2, Chapter 10 · 9 components · 3 operations
Problem: Design a real-time global leaderboard for a mobile game: top-100 worldwide, top-K by region, and a given player's rank with surrounding neighbors, all updated in near real-time.
Key components: Game Client, Load Balancer, Score Ingest API, Leaderboard Read API, Redis Sorted Set (ZSET)
Alex Xu, System Design Interview Vol 2, Chapter 9 · 9 components · 3 operations
Problem: Design a distributed, highly durable object storage service like Amazon S3 supporting PUT / GET / DELETE of arbitrary-size objects with versioning.
Alex Xu, System Design Interview Vol 1, Chapter 1 · 11 components · 3 operations
Problem: Evolve a single-server web app into a multi-tier, multi-region architecture that serves millions of users with low latency and high availability.
Alex Xu, System Design Interview Vol 2, Chapter 13 · 9 components · 4 operations
Problem: Design a stock exchange with a matching engine that processes limit orders, a market-data publisher that broadcasts book updates at 100K ticks/s, and a batched clearing/settlement pipeline.
Key components: Client / Broker, Load Balancer, Order Gateway, Order Validation / Risk, Matching Engine (per-symbol shard)
Alex Xu, System Design Interview Vol 1, Chapter 9; 'Mercator: A Scalable, Extensible Web Crawler' (Heydon & Najork, 1999); Google 'Crawling and Indexing' docs · 11 components · 3 operations
Problem: Design a polite, scalable web crawler that ingests billions of URLs, respects robots.txt, and avoids both URL-level and content-level duplicates.
Key components: Seed URLs, URL Frontier (priority queue), HTML Downloader, DNS Resolver + Cache, Content Parser