H1SUSEX front end computer and IO chassis were rebooted this morning to deal with the issue posted by Corey. Richard / Peter
Same failure mode on h1susex today as h1seiex had yesterday. Therefore we were not able to take h1susex out of the Dophin fabric, and so all dolphin connected models were glitched after the reboot of h1susex.
Richard power cycled h1susex and its IO Chassis. I killed all models on h1seiex and h1iscex, and then started all models on these computers. No IRIG-B timing excursions. Cleared IPC and CRC errors, Corey reset the SWWD. IFO recovery has started.
here are the front end computer uptimes (times since last reboot) ran at 10:02 this morning. The longest any machine has ran is 210 days since the site power outage 30 Sep 2016.
h1psl0 up 131 days, 18:38, 0 users, load average: 0.37, 0.13, 0.10
h1seih16 up 210 days, 3:00, 0 users, load average: 0.11, 0.14, 0.05
h1seih23 up 210 days, 3:00, 0 users, load average: 0.62, 1.59, 1.37
h1seih45 up 210 days, 3:00, 0 users, load average: 0.38, 1.31, 1.17
h1seib1 up 210 days, 3:00, 0 users, load average: 0.02, 0.04, 0.01
h1seib2 up 210 days, 3:00, 0 users, load average: 0.02, 0.08, 0.04
h1seib3 up 210 days, 3:00, 0 users, load average: 0.00, 0.05, 0.06
h1sush2a up 210 days, 3:00, 0 users, load average: 1.64, 0.59, 0.56
h1sush2b up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1sush34 up 210 days, 3:00, 0 users, load average: 0.00, 0.03, 0.00
h1sush56 up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1susb123 up 210 days, 3:00, 0 users, load average: 0.17, 1.07, 1.10
h1susauxh2 up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1susauxh34 up 117 days, 17:07, 0 users, load average: 0.08, 0.02, 0.01
h1susauxh56 up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1susauxb123 up 210 days, 2:07, 0 users, load average: 0.00, 0.00, 0.00
h1oaf0 up 164 days, 20:16, 0 users, load average: 0.10, 0.24, 0.23
h1lsc0 up 207 days, 41 min, 0 users, load average: 0.06, 0.57, 0.65
h1asc0 up 210 days, 3:00, 0 users, load average: 1.03, 1.82, 1.80
h1pemmx up 210 days, 3:53, 0 users, load average: 0.05, 0.02, 0.00
h1pemmy up 210 days, 3:53, 0 users, load average: 0.00, 0.00, 0.00
h1susauxey up 205 days, 23:41, 0 users, load average: 0.07, 0.02, 0.00
h1susey up 210 days, 3:06, 0 users, load average: 0.14, 0.04, 0.01
h1seiey up 206 days, 51 min, 0 users, load average: 0.00, 0.03, 0.00
h1iscey up 210 days, 3:07, 0 users, load average: 0.04, 0.21, 0.20
h1susauxex up 210 days, 3:16, 0 users, load average: 0.00, 0.00, 0.00
h1susex up 3:06, 0 users, load average: 0.00, 0.00, 0.00
h1seiex up 1 day, 2:57, 0 users, load average: 0.00, 0.00, 0.00
h1iscex up 177 days, 21:59, 0 users, load average: 0.08, 0.33, 0.24
Here is the list of free RAM on the front end computers in kB:
h1psl0 4130900
h1seih16 4404868
h1seih23 4023316
h1seih45 4024452
h1seib1 4754280
h1seib2 4763256
h1seib3 4753960
h1sush2a 4009216
h1sush2b 5160476
h1sush34 4389108
h1sush56 4400720
h1susb123 4013144
h1susauxh2 5338804
h1susauxh34 5351172
h1susauxh56 5350144
h1susauxb123 5339900
h1oaf0 9102096*
h1lsc0 4065228
h1asc0 3988536
h1pemmx 5358464
h1pemmy 5358352
h1susauxey 5352196
h1susey 64277012~
h1seiey 4758336
h1iscey 4117788
h1susauxex 5349644
h1susex 64301796~
h1seiex 4769840
h1iscex 4138204
* oaf has 12GB
~ end station sus have 66GB
At 12:44utc (5:44amPDT):
(attached is a screenshot of all the WHITE screens we have for EX.)
Tally of activities taken to recover
Noticed some new YELLOW on the Guardian Overview (and Intent Bit) medm. It's related to a Time Ramp for the HEPI IPS' (i.e. H1:HPI-ETMX_IPS_[H/V]P_TRAMP). The other (6) channels have 30sec, but the setpoints for these two is currently 0. Leaving this for the SEI crew to remedy (perhaps next time we drop out of OBSERVING.
Attached is a screen shot showing the medms in question.
TITLE: 04/28 Owl Shift: 07:00-15:00 UTC (00:00-08:00 PST), all times posted in UTC
STATE of H1: Observing at 61Mpc
OUTGOING OPERATOR: Nutsinee
CURRENT ENVIRONMENT:
Wind: 9mph
Primary useism: 0.02 μm/s
Secondary useism: 0.11 μm/s (at 50 percentile)
QUICK SUMMARY:
H1 range hovering just above 60Mpc & winds are slightly high at the end stations.
TITLE: 04/28 Eve Shift: 23:00-07:00 UTC (16:00-00:00 PST), all times posted in UTC
STATE of H1: Observing at 63Mpc
INCOMING OPERATOR: Corey
SHIFT SUMMARY: One lockloss. *Probably* caused by the Noise Eater (see attached plot). No issue recovering.
Got a NPRO out of range message on Diag Main so I went in LVEA and toggle the Noise Eater. Not sure if the noise eater went bad and cause the lockloss or the other way round. Anyway, resume locking now.
Verbal Alarm complained about Type Error then crashed. There were similar errors in the log last night when seismic FE tripped.
Back to Observe 06:09 UTC
Been Observing for 5.5 hours. A big tour group in the control room early in the evening. Wind seems to be dying down. No issue to report.
J. Kissel
I've gathered a full set of transfer functions to measure the IFO sensing and actuator functions. All looks quite normal, detailed analysis to come.
The data lives here:
Sensing Function Measurements
2017-04-27_H1DARM_OLGTF_4to1200Hz_25min.xml
2017-04-27_H1_PCAL2DARMTF_4to1200Hz_8min.xml
2017-04-27_H1_PCAL2DARMTF_BB_5to1000Hz_0p25BW_250avgs_5min.xml
2017-04-27_H1_OMCDCPDSUM_to_DARMIN1.xml <-- new measurement that captures the [mA/ct]
scale of the digital portion of the sensing function
Actuation Function Measurements
UIM: 2017-04-27_H1SUSETMY_L1_iEXC2DARM_25min.xml
2017-04-27_H1SUSETMY_L1_PCAL2DARM_8min.xml
PUM: 2017-04-27_H1SUSETMY_L2_iEXC2DARM_17min.xml
2017-04-27_H1SUSETMY_L2_PCAL2DARM_8min.xml
TST: 2017-04-27_H1SUSETMY_L3_iEXC2DARM_8min.xml
2017-04-27_H1SUSETMY_L3_PCAL2DARM_8min.xml
All has been committed to the Calibration SVN.
These graphs compare the STS2-C instrument with ITMY (STS2-B.) Roam2 is ~24" -X-Y of the +X-Y (NE) leg of the WBSC8 (H2 ITMY) Chamber. The graphs show velocity ASDs of both instruments during Calm (<5mph) and Windy (20+mph) Periods. Also shown is the calm and windy period coherence. Plots are segregated by dof. The calm period begins at 1300utc 26 April; the windy period begins at 000utc 26 April. The wind direction was generally from the S to SSW (from +Y and -X.)
Comparing the similar plots of alog 35186, a few things may be concluded:
Despite the calm period being even calmer here at Roam2, the X axis noise is greater than at Roam1 (drawing coming soon) by ~x10. Likewise during the windy period.
The Y axis comparison has the calm winds producing similar noise on the two seismometers with the windy period elevating the noise more at the Roam2 position by a few.
The Z axis response also indicates a noisier response during higher winds.
The coherence plot (not shown in 35186 alog) is uhhhh, well, depending on freq and dof, shows either more or less coherence between the instruments. I could report specifics or details but just enjoy the silence and see for your self.
If these comparisons are valid and I haven't screwed anything up, the conclusion is, this location is not nearly as good as the current ITMY or the Roam1 position, and, the chamber is not providing any pinning against tilting of the seismometer.
TITLE: 04/27 Day Shift: 15:00-23:00 UTC (08:00-16:00 PST), all times posted in UTC
STATE of H1: Observing at 56Mpc
INCOMING OPERATOR: Nutsinee
SHIFT SUMMARY: Started rough after EX sei crash, but running smooth now
LOG:
The big event was the recovery after the EX seismic front end crashed (see Jeff and Dave's logs for details). Otherwise a few people went to the mid stations today, but we were a little distracted with recovery.
Post the 7 day OpLev trends.
SUM: ITMY - No data after 04/24 - Zoomed the plot way out and could find a bottom.
ETMY - Big jump after the 04/25 maintenance window. This is a result of Jason's tune up during the window.
Yaw: ITMX - Near the 10urad line. Trend line is flat.
Pitch: SR3 - Near the 10urad line. Trend line is flat.
Everything looks normal with the trends. ITMx yaw is beginning to get close to the 10 µrad limit, and will likely need re-centering in the near future.
Regarding ITMy SUM, it changed due to my swap of the laser (reported here). Unfortunately it's not entirely out of the woods yet, as the laser is showing signs of low output power. While the SUM signal doesn't show up on Jeff's plot above (not sure if this is due to something in the script or some odd artifact of the OS change with the control room workstations, or something else entirely), it is there; DetChar summary page for ITMy oplev shows it, as well as the oplev MEDM screen. The oplev seems to be functioning normally, just looks like its output power is low. I plan on investigating this during the 5/2/2017 maintenance window.
J. Kissel, J. Warner, J. Driggers, J. Oberling, C. Gray
Executive Summary
Running an old, infrequently used "ditherAlign" script to recover green spots after gross X arm misalignment (i.e. because of the SEI front-end failure early this morning; see LHO aLOG 35824, 35834) caused more than usual trouble regaining the X ARM ALS angular control. After slowly working / stumbling our way through identifying the problems by performing "the usual" troubleshooting (i.e. G1602280), we were able to return to full initial alignment and lock acquisition and achieved nominal low noise by 20:00 UTC.
Total down time -- 7 hours from 2017-04-27 13:00 to 20:00 UTC.
Lessons Learned
- There are four places any given operator goes for information to diagnose a problem when alone on evening / night shifts:
(1) Jenne's H1 Troubleshooting Presentation: G1602280
(2) The OPS Wiki Troubleshooting Page: Trouble Shooting the IFO
(3) The OPS Wiki Useful Scripts Page: Useful Scripts For Operators
(4) Nutsinee has her own Trouble Shooting Page: Nutsinee's Page
When an operator has just restarted shifting after a month off, and nothing's gone wrong for that operator in a while, you forget even the location of resources let alone which resource to use. It will be a giant effort to merge these documents, but we could at least work to link all of them to the other.
Jim recommends we banish (2), we update and maintain (3), and acknowledge that (4) is not cannon if used by others who are not Nutsinee. Jenne acknowledges that (1) has several pending updates, and will add a few things from today's experience.
- It's important to have such a "ditherAlign" script that rescues us from a gross arm misalignment in which we've lost spots. But we have those events so infrequently that the script suffers from bit rot between uses (e.g. it uses the TDS library, and we just upgraded the control room to Debian8, which doesn't support the TDS library). As such, we should upgrade, debug and fix this script and update the associated documentation. However, after looking at it, it's a beast of a spaghetti monster. Also a giant effort.
- When we get such a gross misalignment, we should not expect *any* operator to be able to fix the problem (let alone diagnose it) quickly or by themselves. It took all of the authors patiently sitting through the problem, picking up clues, trying this and that, looking in 15 different places (only possible with 3-4 pairs of eyes) for us to solved the problem, and later identify it. We should just expect this after we lose a seismic front-end.
- Since we've moved toward the O2 model of "do not call commissioners if you have a problem," operators have, in general, become reticent to call if there's a problem, especially on owl shifts -- and that call list is Keita. Further, in the era of 71 hour locks and 80-90% duty cycle, commissioners and detector engineers are far less regularly in the control room. Yet further, shift changes are also a really tough point in the chain of communication and on the day operator. Not only do they have the stress of a mid-night failure from which they don't have all the information, but the gets compounded with the phones ringing, everyone coming in asking what's wrong and/or is it fixed, and not knowing who can actually stay to help. I make this last statement with no proposed solution, but merely to expose what happens these days during an un-identified mid-night failure mode and to encourage patience and cooperation by all.
Detailed Timeline
- Seismic front-end crashed
- After seismic computer and platform recovery (see LHO aLOG 35824, 35834), we did not see any spot on the ALS X Green camera.
- After some manual tries (unclear if any restoration to slider / oplev / OSEM values was done), an infrequently used script to recover green spots after gross arm misalignment,
/opt/rtcds/userapps/release/asc/h1/scripts/ditherAlign.py
was run on both TMSX (twice) and ITMX. These scripts failed, and left a whole bunch of stuff in a bad alignment state, namely
- all X arm optics (ITMX, ETMX, TMSX) were aligned to a bad location,
- the ITM *misalignment* offsets were changed, and
- The green camera (CAM) Reference OFFSETS were changed.
Some, but not all recovery was made, because the users weren't aware of everything this script touched.
(And before you cry "but the SDF system!" remember that there are lots of DIFFs that appear in down snaps that are usually overlooked because things work out in the end.)
(Corey departs, Jim Arrives)
- After a restoration of the TMSX alignment sliders to the previous observation stretch's OSEM location, we regained spots on the camera. Jim then heroically pushed the ETMX and ITMX alignment around until he recorved *decent* arm cavity transmission.
- As is standard practice, he then went for through an initial alignment of the X ARM (INITIAL ALIGNMENT state on ALS_XARM) which turns on automatic alignment, including green WFS and green Camera (CAM) ITMX spot restoration. However, because the end station alignment was still far enough off that the WFS error signals were too large, and the green CAM references had been errantly changed by the ditherAlign script the WFS and CAM servos blew up after every attempt to automatically close them as normal.
(Jeff Arrives)
- After further efforts to manually increase the transmission to reduce the WFS and CAM error signals without success, we remembered one has to clear WFS / SUS offsets if/when/after they drive optics into the weeds.
(Jenne Arrives)
- Having cleared the weeds, we were able to close the green YAW WFS 1 & 2 loops that control TMSX and ETMX. To do so, we needed to set the ALS_XARM guaridan to ENABLE_WFS, and force the triggering of the alignment servos to be ALWAYS ON, i.e. set
- H1:ALS-X_WFS_TRIG_THRESH_ON and H1:ALS-X_WFS_DOF_FM_TRIG_THRESH_ON to -0.1 (i.e. anything below zero),
- Flip to the master gain switch (H1:ALS-X_WFS_SWITCH) to ON.
However, with the triggering forced ON, that meant that if the arm lost lock, we'd need to quickly turn off the inputs to the WFS loops we had under control so more weeds would not grow. Recall from G1602280, successful closure of these loops was only possible if the error signals were less than ~1000 and preferably better than 500 [ct]. We were using Jenne's premade StripTool template for the WFS error signals,
/ligo/home/jenne.driggers/Templates/Strip/Green_Y_AlignErrorSigs.stp
(Jason Arrives)
- With TMS and ETMX under control in YAW, we tried slowly moving ITMX in yaw (with the green WFS) to reduce the CAM YAW error signal (at this point both were in the 0.7 range, and we want better than 0.1). As we (very slowly) moved the ITM (so that the end-station optics' WFS could follow), we realized that reducing the CAM error signal made MICH fringes on the AS AIR camera look worse, implying that we were doing the wrong thing. So, we reverted the ITM location, and went to close the Pitch loops.
- Closing WFS A / DOF1 / ETMX pitch loop was relatively easy, but even though the error signals were sufficiently small, closing the WFS B / DOF 2 / TMSX loop would start growing weeds and break. After several tries, we realized we may be on the wrong side of the WFS A / DOF1 / ETMX PDH error signal hump, so we pushed the ETM back in the oppposite direction, indeed went over the hump, and began to reduce the error signal again. After that discovery, both PIT DOFs clsoed nicely.
- Now having ALL WFS end X optics controlled DOFS closed, and the green arm transmission up in the high 0.9s to 1, we again went back to ITMX to reduce the CAM error signal. This again made the AS AIR spot / MICH fringes look like crap, and reduced the green arm transmission. We [incorrectly] concluded that the cameras must have moved, and remeasured and set the CAM offsets.
- At this point we were able to resume regular initial alignment. All went well, until SRC_ALIGN, where we saw excess fringing on the AS port camera. Jenne knew this was because some optic was not entirely *misaligned* so we trended the *misalignment offsets* in the M0_TEST bank of ITMX, and found they were wrong by half. Restoring these allowed us to complete initial alignment as normal.
- All the rest of normal lock acquisition worked swimmingly.
- Upon arrival at nominal low noise, we began checking for SDF differences -- and it was only then that we started to put the pieces together that the driveAlign script had messed with everything listed above.
While updating the FMCS MEDM screens for the migration to the BACNet IOC I noticed that the alarm status for chiller pump 2 at end Y (H0:FMC-EY_CY_ALARM_2) has been active (value = 2) for at least a month. Is this normal?
This is NOT a chiller pump alarm. It is a "Chiller 2" alarm. There are two devices which are supplying chilled water for the building HVAC - The Chiller is a refrigeration machine which cools the water and there is a separate chilled water pump (CWP) which circulates the water through the chiller and up to the building.
This chiller has two cooling circuits - one has a known fault - hence the alarm.
To confuse the issue further there are two chillers and two chilled water pumps at the end station - this provides us redundancy in case of failure.
The critical alarm is the "Chilled Water Supply Temperature". This temperature is currently normal.
I added two band stops to the CHARD loops, which have reduced the CHARD drives from 15-25 Hz by about a factor of 10. ASC noise should no longer be limiting our DARM sensitivity above 15Hz, the noise is only slightly better from 15-20 Hz.
The first attachment shows the slight improvement in the DARM noise, the difference in the drives (the cut offs I added were 2nd order elliptic bandstops), and the reduction in the coherence between the ASC drives and DARM. For CHARD Y the first screenshot shows the loop measurement before I added the bandstop, the bandstop should have reduced the phase at the upper ugf (2.5 Hz) by about 5 degrees. For CHARD P I reduced the gain by about 3dB, the third screenshot shows the before measurement in blue, a measurement after I reduced the gain but before I added the cut off in red. For CHARD the cut off only reduced the phase at the upper ugf of 3 Hz by 6 degrees, we are left with almost 50 degrees of phase margin.
I also re ran the noise budget injections for CHARD, DHARD, MICH, SRCL, PRCL and IMC PZT jitter. There only real change is that the ASC noise is lower, and there is a larger gap between the sum and the measured noise at 25 Hz. I am not able to download GDS strain from this afternoon, so I will post a noise budget when I can get data.
Was there a change to H1:ASC-CHARD_P_GAIN from -0.10 to -0.14? (This came up as an SDF Diff for the next lock.)
Yes, Corey, sorry I forgot to load the guardian.
The first attachment is the noise budget with measurements from yesterday. You can see that the broad lump that we blame on beam size jitter is worse, there is a gap between the measured noise and the sum of the predicted noises from 300-800 Hz which was not present in the noise budget from early Jan (here) Looking at the summary pages, you can see that this has happened in the last week. (April 18th compared to yesterday). Kiwamu and I had a look at some alignment sensors, and at first glance it doesn't seem like we've had an unusual alignment change this week. We asked Aidan to check the Hartmann data to see if there has been a change in absorption on ITMX.
The linear jitter is also slowly getting worse, which you can see by comparing the 350 Hz peak to January. The next two attached pngs are screenshots of the jitter transfer functions measured yesterday using the IMC PZT. You can compare these to measurements from mid Feb and see that the coupling is about 50% worse for yaw and almost a factor fo 2 worse for pit.
The 4th attachment shows a comparison of the coherence between DARM and the IMC WFS DC signals for February to earlier today. We now have broad coherence between IMC WFS B pit and darm, which I don''t think I have seen before even when we had a broad lump of noise in DARM before our pre O2 alignment change.
The last attachement shows coherences between DARM and the bullseye PD on the PSL.
WP 6577 Dave B., Carlos P., Bubba G., John W., Patrick T. I have migrated a subset of the EPICS channels provided by the FMCS IOC on h0epics to an IOC I created on fmcs-epics-cds. The IOC on fmcs-epics-cds connects to the BACNet server that Apollo has installed as part of the FMCS upgrade. The channels that I migrated have been taken over by this upgrade and can no longer be read out by the server that the IOC on h0epics reads from. The fmcs-epics-cds computer connects to the slow controls network (10.105.0.1) on eth0 and the BACNet network (10.2.0.1) on eth1. It is running Debian 8. The IOC on h0epics is started from the target directory /ligo/lho/h0/target/h0fmcs (https://cdswiki.ligo-wa.caltech.edu/wiki/h0fmcs). I commented out the appropriate channels from the fmcs.db and chiller.db files in the db directory of this path and restarted this IOC. I made no changes to the files in svn. The IOC on fmcs-epics-cds uses code from SNS: http://ics-web.sns.ornl.gov/webb/BACnet/ and resides in /home/cdsadmin/BACnet_R0-8. This is a local directory on fmcs-epics-cds. This IOC is started as cdsadmin: > ssh cdsadmin@10.105.0.112 cdsadmin@fmcs-epics-cds: screen Hit Enter cdsadmin@fmcs-epics-cds: cd /home/cdsadmin/BACnet_R0-8/iocBoot/e2b-ioc/ cdsadmin@fmcs-epics-cds: ../../bin/linux-x86_64/epics2bacnet st.cmd Hit CTRL-a then 'd' Issues: I came to realize during this migration that the logic behind the binary input channels is different in BACNet. In BACNet a value of 0 corresponds to 'inactive' and a value of 1 corresponds to 'active'. In the server being migrated from a value of 0 corresponds to 'invalid'. This was verified for the reverse osmosis alarm: H0:FMC-CS_WS_RO_ALARM. In the BACNet server it reads as 0 or 'inactive' when not in alarm. When John W. forced it into alarm it read as 1 or 'active'. I believe Dave has updated his cell phone alarm notifier to match this. A similar situation exists for the state of the chiller pumps. In the server being migrated from a value of 1 appears to correspond to 'OFF' and a value of 2 appears to correspond to 'ON'. It has not been verified, but I believe in the BACNet server a value of 0 corresponds to 'OFF' and a value of 1 corresponds to 'ON'. The pump status for each building is calculated by looking at the state of the pumps. The calculation in the database for the IOC being migrated from appears to be such that as long as one pump is running the status is 'OPERATIONAL'. If no pump is running the status is 'FAILED'. I need to double check this with John or Bubba. I updated the corresponding calc records in the database for the BACNet IOC to match this. In the server being migrated from, channels that are read by BACNet as binary inputs and binary outputs are read as analog inputs. I changed these in the database for the BACNet IOC to binary inputs and set the ONAM to 'active' and the ZNAM to 'inactive'. The alarm levels also need to be updated. They are currently set through the autoburt snapshot files that contain a separate channel for each alarm field. The autoburt request file has to be updated for the binary channels to have channels for .ZSV and .OSV instead of .HSV, .LSV, etc. So currently there is no control room alarm set for the binary channels, including the reverse osmosis alarm. I also need to update the medm screens to take account of this change. Also, there is an invalid alarm on the control room alarm station computer for the mid X air handler reheat temperature. Looking on the BACNet FMCS server this channel actually does appear to be genuinely invalid. It should be noted that this BACNet IOC is a temporary install until an OPC server is installed on the BACNet server. I would like to leave the permit for this work open until the FMCS upgrade is complete and all the channels have been migrated to the BACNet IOC.
I updated the autoBurt.req file.
I have set the alarm on the RO channel: caput H0:FMC-CS_WS_RO_ALARM.OSV MAJOR
I have updated the medm screens.
Note: The 'ARCH = linux-x86' line in the Makefile under 'BACnet_R0-8/iocBoot/e2b-ioc' had to be changed to 'ARCH = linux-x86_64' in the code copied from SNS.
Evan G., Robert S. Looking back at Keith R.'s aLOGs documenting a changes happening on March 14 (see 35146, 35274, and 35328), we found that one cause seems to be the shuttering of the OpLev lasers on March 14. Right around this time, 17:00 UTC on March 14 at EY and 16:07 UTC at EX, there is an increase in line activity. The correlated cause is Travis' visit to the end station to take images of the Pcal spot positions. The images are taken using the Pcal camera system and needs the OpLevs to be shuttered so that a clean image can be taken without the light contamination. We spoke with Travis and he explained that he disconnected the USB interface between the DSLR and the ethernet adapter, and used a laptop to directly take images. Around this time, the lines seem to get worse in the magnetometer channels (see, for example, the plots attached to Keith's aLOG 35328). After establishing this connection, we went to the end stations to turn off the ethernet adapters for the Pcal cameras (the cameras are blocked anyway, so this active connection is not needed). I made some magnetometer spectra before and after this change (see attached). This shows that a number of lines in the magnetometers are reduced or are now down in the noise. Hopefully this will mitigate some of the recent reports of combs in h(t). We also performed a short test turning off another ethernet adapter for the H1 illuminator and PD. This was turned off at 20:05:16 18/04/2014 UTC and turned back on at 20:09:56 UTC. I'll post another aLOG with this investigation as well.
Good work! That did a lot of good in DARM. Attached are spectra in which many narrow lines went away or were reduced (comparing 22 hours of FScan SFTs before the change (Apr 18) with 10 hours of SFTs after the change (Apr 19). We will need to collect much more data to verify that all of the degradation that began March 14 has been mitigated, but this first look is very promising - many thanks! Fig 1: 20-50 Hz Fig 2: 50-100 Hz Fig 3: 100-200 Hz
Attached are post-change spectra using another 15 hours of FScan SFTs since yesterday. Things continue to look good. Fig 1: 20-50 Hz Fig 2: 50-100 Hz Fig 3: 100-200 Hz
Correction: the date is 18/04/2017 UTC.
Another follow-up with more statistics. The mitigation from turning off the ethernet adapter continues to be confirmed with greater certainty. Figures 1-3 show spectra from pre-March 14 (1210 hours), a sample of post-March 14 data (242 hours) and post-April 18 (157 hours) for 20-50 Hz, 50-100 Hz and 100-200 Hz. With enough post-April 18 statistics, one can also look more closely at the difference between pre-March 14 and and post-April 18. Figures 4-6 and 7-9 show such comparisons with different orderings and threrefore different overlays of the curves. It appears there are lines in the post-April 18 data that are stronger than in the pre-March 14 data and lines in the earlier data that are not present in the recent data. Most notably, 1-Hz combs with +0.25-Hz and 0.50-Hz offsets from integers have disappeared. Narrow low-frequency lines that are distinctly stronger in recent data include these frequencies: 21.4286 Hz 22.7882 Hz - splitting of 0.0468 Hz 27.4170 Hz 28.214 Hz 28.6100 Hz - PEM in O1 31.4127 Hz and 2nd harmonic at 62.8254 Hz 34.1840 Hz 34.909 Hz (absent in earlier data) 41.8833 Hz 43.409 Hz (absent in earlier data) 43.919 Hz 45.579 Hz 46.9496 Hz 47.6833 Hz 56.9730 Hz 57.5889 Hz 66.7502 Hz (part of 1 Hz comb in O1) 68.3677 Hz 79.763 Hz 83.315 Hz 83.335 Hz 85.7139 Hz 85.8298 Hz 88.8895 Hz 91.158 Hz 93.8995 Hz 95.995 Hz (absent in earlier data) 107.1182 Hz 114.000 Hz (absent in earlier data) Narrow low-frequency lines in the earlier data that no longer appear include these frequencies: 20.25 Hz - 50.25 Hz (1-Hz comb wiped out!) 24.50 Hz - 62.50 Hz (1-Hz comb wiped out!) 29.1957 Hz 29.969 Hz Note that I'm not claiming change points occurred for the above lines on March 14 (as I did for the original set of lines flagged) or on April 18. I'm merely noting a difference in average line strengths before March 14 vs after April 18. Change points could have occurred between March 14 and April 18, shortly before March 14 or shortly after April 18.
To pin down better when the two 1-Hz combs disappeared from DARM, I checked Ansel's handy-dandy comb tracker and found the answer immediately. The two attached figures (screen grabs) show the summed power in the teeth of those combs. The 0.5-Hz offset comb is elevated before March 14, jumps up after March 14 and drops down to normal after April 18. The 0.25-Hz offset comb is highly elevated before March 14, jumps way up after March 14 and drops down to normal after April 18. These plots raise the interesting question of what was done on April 18 that went beyond the mitigation of the problems triggered on March 14. Figure 1 - Strength of 1-Hz comb (0.5-Hz offset) vs time (March 14 is day 547 after 9/15/2014, April 18 is day 582) Figure 2 - Strength of 1-Hz comb (0.25-Hz offset) vs time