About GoLiveApp
GoLiveApp is a technical blog for DevOps engineers and SREs who run production systems on Kubernetes. Every guide here is written to solve a specific operational problem — not to introduce concepts.
What we publish
Production Kubernetes is full of gaps between "the docs say it works" and "it works at 3am under load with real traffic." GoLiveApp covers those gaps: the YAML settings most teams skip, the failure modes that only show up at scale, and the operational patterns that separate stable systems from ones that page you on weekends.
Topics
- Kubernetes deployments — rolling updates, zero-downtime configuration, probes, graceful shutdown
- Cluster resilience — Pod Disruption Budgets, topology spread, node affinity, taints and tolerations
- AWS EKS operations — cluster upgrades, rollbacks, managed node groups, Fargate, Auto Mode
- Autoscaling — HPA, Cluster Autoscaler, Karpenter, cost-aware scaling
- SRE practices — SLOs, error budgets, on-call runbooks, incident response
- Cost optimization — Spot instances, resource right-sizing, namespace quotas
- Observability — metrics, logs, traces, alerting, dashboards
Who this is for
Engineers who are already running Kubernetes and want to run it better. These guides assume familiarity with Kubernetes basics — pods, deployments, services. They don't explain what a container is.
Affiliate disclosure
Some posts contain affiliate links to tools and services we recommend. If you click a link and make a purchase, we may earn a commission at no extra cost to you. We only link to tools that are genuinely useful for the topic being covered. See our Privacy Policy for more details.
Contact
Questions, corrections, or topic suggestions: [email protected]