Ryan C, Dave:
At 20:14:41 Tue 25 Feb 2025 PST we had a corner station issue which put the IOP models for h1sush34 and h1lsc0 into a DACKILL state. First attachment shows the status before any resets had been issued, second shows overview and the iop models after DIAG_RESET and DAQ_CRC_RESET had been issued.
There were no dmesg messages on either h1sush34 and h1lsc0 from today. I checked that both of these front ends had their full complement of cards in their IO Chassis.
In prep for restarting the IOP models, which requires restarting all the models on these front ends, Ryan put HAM3,4 into a safe state. I engaged the SWWD bypass on the SEI system for HAM 3,4.
After restarting the models on h1sush34 and h1lsc0 they came back with no issues.
The initial cause of the crash is not clear. Timing of the IOP STATE_WORDs going offline suggests sush34 crashed first glitching lsc0 at that time, but lsc0 didn't go fully offline until 6 seconds later.
restart log
20:38:26 h1sush34 h1iopsush34
20:42:20 h1lsc0 h1ioplsc0
20:43:09 h1lsc0 h1lsc
20:43:33 h1lsc0 h1lscaux
20:43:51 h1lsc0 h1sqz
20:44:12 h1lsc0 h1ascsqzfc
20:44:40 h1sush34 h1susmc2
20:45:01 h1sush34 h1suspr2
20:45:20 h1sush34 h1sussr2