About GoLiveApp

GoLiveApp is a technical blog for DevOps engineers and SREs who run production systems on Kubernetes. Every guide here is written to solve a specific operational problem — not to introduce concepts.

What we publish

Production Kubernetes is full of gaps between "the docs say it works" and "it works at 3am under load with real traffic." GoLiveApp covers those gaps: the YAML settings most teams skip, the failure modes that only show up at scale, and the operational patterns that separate stable systems from ones that page you on weekends.

Topics

Kubernetes deployments — rolling updates, zero-downtime configuration, probes, graceful shutdown
Cluster resilience — Pod Disruption Budgets, topology spread, node affinity, taints and tolerations
AWS EKS operations — cluster upgrades, rollbacks, managed node groups, Fargate, Auto Mode
Autoscaling — HPA, Cluster Autoscaler, Karpenter, cost-aware scaling
SRE practices — SLOs, error budgets, on-call runbooks, incident response
Cost optimization — Spot instances, resource right-sizing, namespace quotas
Observability — metrics, logs, traces, alerting, dashboards

Who this is for

Engineers who are already running Kubernetes and want to run it better. These guides assume familiarity with Kubernetes basics — pods, deployments, services. They don't explain what a container is.

Affiliate disclosure

Some posts contain affiliate links to tools and services we recommend. If you click a link and make a purchase, we may earn a commission at no extra cost to you. We only link to tools that are genuinely useful for the topic being covered. See our Privacy Policy for more details.

Contact

Questions, corrections, or topic suggestions: [email protected]