The three pillars of observability
Logs, metrics, and traces - why all three matter
12 min 150 XP 2 lessons

Cardinality: the metric that breaks your metrics
Why adding one label can destroy your monitoring platform
18 min 260 XP 3 lessons

Lights on, nobody's home
Green dashboards that lie, phantom health, and zombie services
16 min 220 XP 3 lessons

Percentiles save lives. Averages lie.
Why p99 matters more than mean, and how to think in distributions
15 min 200 XP 3 lessons

How to SLO properly
SLIs that matter, error budgets that work, and alerting with teeth
20 min 260 XP 3 lessons

Chaos engineering 101
Breaking things on purpose before they break you
14 min 180 XP 3 lessons

Circuit breakers and bulkheads
Preventing cascading failures with proven patterns
20 min 300 XP 3 lessons

Instrumenting backend latency the right way
APIs, databases, streams - where time actually goes and how to measure it
22 min 300 XP 3 lessons

Trace-driven debugging
Finding what metrics and logs can't tell you, one span at a time
20 min 280 XP 2 lessons

Error rate beyond HTTP status codes
What counts as an error, why your 0.1% hides real failures, and how to track errors correctly
18 min 240 XP 2 lessons

Building a reliability strategy
From reactive firefighting to proactive engineering - the full playbook
28 min 380 XP 4 lessons

Terraform and Infrastructure as Code
Declarative infrastructure, state management, and drift detection
25 min 340 XP 3 lessons

Canary, Blue-Green, and Progressive Delivery
Deploying with confidence using traffic shifting and rollback patterns
24 min 320 XP 3 lessons

AI and LLM Observability
Monitoring AI agents, LLM performance, and model behavior in production
28 min 380 XP 4 lessons