NGINX Down After Deployment
A recent push broke the web server. Users are seeing 502s. Restore service before the team notices.
CloudDrill simulates real incidents across Linux, Kubernetes, AWS, Terraform, CI/CD, and FedRAMP environments so engineers can build troubleshooting skills that certifications don't measure.
Most platforms prepare you to answer questions about cloud.
CloudDrill prepares you to fix it when it breaks.
Each challenge is based on outages and interview scenarios that cloud engineers actually face — not toy examples.
A recent push broke the web server. Users are seeing 502s. Restore service before the team notices.
The API deployment is caught in a restart loop after a config change. Identify the root cause and restore the service.
An application can't connect to its database after a VPC change. The network path is broken somewhere.
State drift is causing a plan that would destroy and recreate live infrastructure. Prevent downtime without losing state.
A newly built container starts and exits with code 1 before the app initializes. Debug and identify the cause.
An auditor has requested evidence for the Audit Events control. Collect, format, and submit compliant evidence.
Track progress across tracks, pick up challenges where you left off, and drill the exact skills interviewers test.
Every challenge you complete moves your readiness score. When you walk into an interview, you'll know exactly where you stand — and so will your interviewer.
Join the waitlist. Shape what gets built. Your answers directly determine which labs we build first.
We'll be in touch when beta access opens.
If you said yes to an interview, expect a calendar link within the week.