A production incident involving intermittent connection failures and pod restarts under normal traffic patterns.
Situation
You're the on-call engineer at a SaaS platform. Production pods are restarting intermittently — not crashing, just cycling. Users are hitting sporadic 502s. The Kubernetes dashboard shows restarts climbing, but resource metrics look normal. No recent deploys.
Stakes
What do you do first?