All ~32 containers on Server 1 crashed simultaneously; the suspected cause was resource exhaustion. No data was lost.
12:45 PM — Resolved
All containers were batch-restarted successfully. Monitoring has been increased, and memory limits have been adjusted to prevent recurrence.
12:10 PM — Identified
Root cause identified as resource exhaustion from orphaned containers. Cleanup initiated.
12:00 PM — Investigating
Received reports of unresponsive containers. Investigating server health.