Reports until 12:45, Thursday 21 March 2024
LHO General
nyath.maxwell@LIGO.ORG - posted 12:45, Thursday 21 March 2024 (76600)
Juniper SRX Router Outage
At 11:15AM the Juniper SRX was found to have stopped communicating on the 198.129.208.0/24 and 198.129.209/24 GC Compute subnets respectively. This was found to have been configured as residing on LAG ae0 at the router to LAG ae3 on the switch and vlans 3 and 4. All routes were found to have traversed this link in common. Physical reset of interfaces, by physically removing fiber and SFP+ Transceivers from the Juniper SRX-4100 at ports xe-0/0/2 and xe-0/0/3, waited 30 seconds, reseated the transceivers, and reconnected the fiber. Immediate restoration of lost services was observed. There was no observable information in the message logs on the SRX or on the CORE switch to indicate any problem. Interface reset was diagnosed by observable traffic flows. No other information is available. 

Outage secured at 12:30PM 3/21/2024