High latency, packet loss, and connectivity failures are complex to diagnose. OnCallReady traces the network path, isolates the faulty segment, flushes stale routing state, and reroutes traffic to healthy endpoints automatically.
Fires on network performance alerts indicating degraded connectivity. Typical triggers: "Network latency 840ms (threshold: 200ms)", "Packet loss 18% on prod-vpc", "Service unreachable from us-east-2", "High error rate on external API calls".
Executes traceroute, ping, and MTR from affected host to key destinations. Maps where in the path latency or loss begins. Determines if it's internal (VPC routing) or external (ISP/CDN).
Clears ARP tables and routing cache on affected hosts. Resolves stale routes that can cause traffic black-holing after network topology changes or instance replacements.
If the NIC shows errors or the route table is corrupt, performs a controlled interface bounce. Uses the management network if available to maintain connectivity during the operation.
If degradation is upstream (CDN edge, transit provider), updates load balancer backend weights to route away from the affected AZ or region. Traffic shifts in under 5 seconds.
Confirms round-trip time returns to baseline (<50ms internal, <200ms external). Verifies packet loss is below 0.1%. Marks resolved or escalates with full traceroute output for ISP escalation.
OnCallReady diagnoses and fixes network degradation in seconds. See it demonstrated live.