February 20, 2024
I used to think ops just meant fixing whatever broke overnight. But over time, I started to see it differently. It’s really about building systems that are easier to understand and harder to break. I started with basic things like Sidekiq tuning and Redis cleanup. Later, I got more into Docker and Kubernetes. I also started using tools like Terraform to make setups repeatable. What changed wasn’t just the tech — I got more serious about handling failure the right way.
I like knowing what a system is doing at all times. Good metrics and logs make me feel confident. Moving to Kubernetes helped me keep things consistent and cut down on messy config. It also made me think more about how services should start, stop, and talk to each other. These days, I try to add visibility early and test edge cases before shipping. I’ve learned that solid infrastructure isn’t overhead — it makes everything else faster and safer.
Ops can feel like a never-ending list of fires if you let it. But if you invest early — in logs, alerts, health checks — things start to calm down. I try to build systems that tell you what’s wrong before users do. It’s not always fun work, but it saves time and stress down the line.

Will McCracken
Will McCracken is a seasoned fullstack engineer with 8+ years of experience in Ruby, JavaScript, and DevOps. He's worked on e-discovery platforms, job queue orchestration, and infrastructure automation. Will's writing distills technical challenges into readable reflections, with an emphasis on simplicity, observability, and trade-off thinking.