The upper E18 in the MSR (h1fw0's RAID) was sounding an audible alarm this morning. The management web interface shows that controller0 on this unit has, or had, an over-temp event, though its current temperature of 45C is the same as that of h1fw1's RAID, which is not in alarm.
Trending the MSR MAX temp over three days does not show much variation. RACK1 shows a slightly elevated temperature of 25C (75F) at 8am PST this morning.
I've silenced the audible alarm and I am working on resetting the latched alarm.
raid-msr-e18-0-0 error log confirms the 08:19 PST timing of the event:
0000:C0 01-Dec-2002 at 00:08:19:(E): A failure of controller 1, ID 000402C34787 has been detected
On further investigation, this is most probably not an over-temp alarm. I can find no logs for this event (the one posted above is the only entry in the error log, but it is from the 1st of December). There are no red LEDs on the rear of the E18, but there are two red STAT LEDs on the front of the unit (one on each side). Since the RAID is still operational and no disks have failed, we have decided to hand this over to Dan for further investigation.