Here is the current state of the H1 DAQ.
h1fw0 and h1fw1 have been completely stable for several weeks, and following the code fix on Wednesday 9/7 the frames written by both are 100% identical. These systems have the same hardware and are running the same code, but h1fw0 occasionally asks for retransmissions and h1fw0 never does. The OS install is slightly different between the machines, and we will try cloning fw0 from fw1 next week.
h1fw2 is a front end computer, running U12 with a local disk system for frames. It was running the original daqd code, but went unstable after the Sat 9/10 power outage. We upgraded it to the rcg3.2 daqd this afternoon to see if this improves stability. It is now connected to UPS power.
Now that h1fw2 has the new set of EPICS diagnostics channels I have expanded the DAQ overview medm screen to show these.
as of monday morning, fw2 has been running 2.8 days. Looks like Jonathan's code has made it stable. It is still a mystery why the power outage apparently caused the instability of the old code (it has been power cycled since then).