Design a high-scale system and load-test the bottleneck
Design alone is theory. You design a high-scale system (ride-sharing, streaming, or payments-style) and then prove your central assumption by building and load-testing the critical path.
What you'll build
A complete architecture with capacity planning and failure analysis, plus a working prototype of the critical path that you load-test to validate (or correct) the design.
See how we teach, before you sign up
You don't just get code dumped on you. Every starter file and every solution is explained line-by-line, in plain English. Here's one real file from this project:
services:
postgres:
image: postgres:16
environment:
POSTGRES_PASSWORD: dev
POSTGRES_DB: matcher
ports: ["5432:5432"]
command: ["postgres", "-c", "max_connections=200"]
redis:
image: redis:7
ports: ["6379:6379"]
command: ["redis-server", "--maxmemory", "512mb", "--maxmemory-policy", "allkeys-lru"]
Reading this file
image: postgres:16Using a real Postgres container, not an in-memory fake, so behavior under load matches production.max_connections=200The connection limit is a common bottleneck, set it explicitly so the load test reveals when you hit it.--maxmemory-policy ... allkeys-lruEvicting least-recently-used keys when memory fills mirrors how a real cache behaves under pressure.
Real backing stores so behavior under load is representative, never in-memory fakes.
That's 1 of 10 explained code blocks in this single project.
The build, milestone by milestone
- 1
Design the full system
6 guided stepsThe full design is the map you’ll test against. Without quantified capacity targets, the later load test has nothing to confirm or refute.
- 2
Identify the bottleneck
5 guided stepsEvery system has one component that fails first under load. Finding it on paper, and the assumption you’re betting on, tells you exactly what to prototype and test.
- 3
Build the critical path
5 guided stepsA prototype that doesn’t mirror the real bottleneck’s constraints produces numbers you can’t trust. Realistic enough to measure is the bar, not feature-complete.
- 4
Load-test & revise
5 guided stepsPushing the system until it fails is the only way to know your real ceiling. The design revision afterward is the evidence you can reason from measurement, not hope.
- 5
Break it on purpose: chaos & failover
5 guided stepsYour failure-mode table is a hypothesis until you’ve watched the system fail. Chaos under load is the only way to know your mitigations work and to measure real recovery time, not the one you wrote down.
- 6
Write the blameless postmortem
5 guided stepsThe senior skill isn’t avoiding failure, it’s learning from it in writing without blaming people. A blameless postmortem of your own load/chaos run is the artifact that proves you reason from evidence and ship fixes.
What's inside when you start
You'll walk away with
This is portfolio-grade. Build it free.
Sign up to unlock every milestone step-by-step, the code skeletons, full reference solutions, and checkable tasks, with your progress saved as you build.
Start building