SRE Learning Hub

Fourteen interactive modules on observability and resilience, covering percentiles vs. averages, lights-on-nobody's-home failures, SLOs, chaos engineering, and circuit breakers. No certification fluff, just what matters at 3 a.m.

The three pillars of observability

Cardinality: the metric that breaks your metrics

Lights on, nobody's home

Percentiles save lives. Averages lie.

How to SLO properly

Chaos engineering 101

Circuit breakers and bulkheads

Instrumenting backend latency the right way

Trace-driven debugging

Error rate beyond HTTP status codes

Building a reliability strategy

Terraform and Infrastructure as Code

Canary, Blue-Green, and Progressive Delivery

AI and LLM Observability