Last Tuesday (24th Nov) Jim and I modified the monit on h1hwinj1 machine such that when it restarts the psinject process it smoothly ramps the excitation amplitude over a time period of 10 seconds. We manually started the new system on Tuesday and since then there have been no crashes of psinject until the last 24 hours. There have been 4 stops (with subsequent automatic restarts) in the past 24 hours, each stop was logged as being due to the error:
SIStrAppend() error adding data to stream: Block time is already past
Here are the start and crash times (all times PST). Monit automatic restarts are maked with an asterix
time of start | time of crash |
Tue 11/24 14:55:47 | Sun 11/29 17:15:56 |
Sun 11/29 17:16:00* | Mon 11/30 00:00:14 |
Mon 11/30 00:01:13* | Mon 11/30 13:09:07 |
Mon 11/30 13:09:36* | Mon 11/30 13:12:43 |
Mon 11:30 13:13:39* | still running |