aLIGO LHO Logbook

H1 SUS (SUS)

peter.king@LIGO.ORG - posted 07:12, Friday 28 April 2017 - last comment - 11:10, Friday 28 April 2017(35861)

H1SUSEX reboot

H1SUSEX front end computer and IO chassis were rebooted this morning to deal with the issue posted by Corey.



Richard / Peter

Images attached to this report

Comments related to this report

david.barker@LIGO.ORG - 07:24, Friday 28 April 2017 (35862)

Link

Same failure mode on h1susex today as h1seiex had yesterday. Therefore we were not able to take h1susex out of the Dophin fabric, and so all dolphin connected models were glitched after the reboot of h1susex.

Richard power cycled h1susex and its IO Chassis. I killed all models on h1seiex and h1iscex, and then started all models on these computers. No IRIG-B timing excursions. Cleared IPC and CRC errors, Corey reset the SWWD. IFO recovery has started.

david.barker@LIGO.ORG - 10:32, Friday 28 April 2017 (35866)

Link

here are the front end computer uptimes (times since last reboot) ran at 10:02 this morning. The longest any machine has ran is 210 days since the site power outage 30 Sep 2016.

h1psl0          up 131 days, 18:38, 0 users, load average: 0.37, 0.13, 0.10
h1seih16        up 210 days, 3:00, 0 users, load average: 0.11, 0.14, 0.05
h1seih23        up 210 days, 3:00, 0 users, load average: 0.62, 1.59, 1.37
h1seih45        up 210 days, 3:00, 0 users, load average: 0.38, 1.31, 1.17
h1seib1         up 210 days, 3:00, 0 users, load average: 0.02, 0.04, 0.01
h1seib2         up 210 days, 3:00, 0 users, load average: 0.02, 0.08, 0.04
h1seib3         up 210 days, 3:00, 0 users, load average: 0.00, 0.05, 0.06
h1sush2a        up 210 days, 3:00, 0 users, load average: 1.64, 0.59, 0.56
h1sush2b        up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1sush34        up 210 days, 3:00, 0 users, load average: 0.00, 0.03, 0.00
h1sush56        up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1susb123       up 210 days, 3:00, 0 users, load average: 0.17, 1.07, 1.10
h1susauxh2      up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1susauxh34     up 117 days, 17:07, 0 users, load average: 0.08, 0.02, 0.01
h1susauxh56     up 210 days, 3:00, 0 users, load average: 0.00, 0.00, 0.00
h1susauxb123    up 210 days, 2:07, 0 users, load average: 0.00, 0.00, 0.00
h1oaf0          up 164 days, 20:16, 0 users, load average: 0.10, 0.24, 0.23
h1lsc0          up 207 days, 41 min, 0 users, load average: 0.06, 0.57, 0.65
h1asc0          up 210 days, 3:00, 0 users, load average: 1.03, 1.82, 1.80
h1pemmx         up 210 days, 3:53, 0 users, load average: 0.05, 0.02, 0.00
h1pemmy         up 210 days, 3:53, 0 users, load average: 0.00, 0.00, 0.00
h1susauxey      up 205 days, 23:41, 0 users, load average: 0.07, 0.02, 0.00
h1susey         up 210 days, 3:06, 0 users, load average: 0.14, 0.04, 0.01
h1seiey         up 206 days, 51 min, 0 users, load average: 0.00, 0.03, 0.00
h1iscey         up 210 days, 3:07, 0 users, load average: 0.04, 0.21, 0.20
h1susauxex      up 210 days, 3:16, 0 users, load average: 0.00, 0.00, 0.00
h1susex         up 3:06, 0 users, load average: 0.00, 0.00, 0.00
h1seiex         up 1 day, 2:57, 0 users, load average: 0.00, 0.00, 0.00
h1iscex         up 177 days, 21:59, 0 users, load average: 0.08, 0.33, 0.24

david.barker@LIGO.ORG - 11:10, Friday 28 April 2017 (35868)

Link

Here is the list of free RAM on the front end computers in kB:

h1psl0       4130900
h1seih16   4404868
h1seih23   4023316
h1seih45   4024452
h1seib1       4754280
h1seib2       4763256
h1seib3       4753960
h1sush2a   4009216
h1sush2b   5160476
h1sush34   4389108
h1sush56   4400720
h1susb123   4013144
h1susauxh2   5338804
h1susauxh34   5351172
h1susauxh56   5350144
h1susauxb123   5339900
h1oaf0       9102096*
h1lsc0       4065228
h1asc0       3988536
h1pemmx       5358464
h1pemmy       5358352
h1susauxey   5352196
h1susey        64277012~
h1seiey       4758336
h1iscey       4117788
h1susauxex   5349644
h1susex        64301796~
h1seiex       4769840
h1iscex       4138204

* oaf has 12GB
~ end station sus have 66GB

H1 CDS (CDS, SUS)

Link

corey.gray@LIGO.ORG - posted 06:01, Friday 28 April 2017 - last comment - 07:46, Friday 28 April 2017(35857)

12:44 Out Of Observing: EX SUS FrontEnds DOWN!!

(I'm almost cutting & pasting from what happened almost 24hrs ago! Yesterday it was EX SEI, Today it is EX SUS!)

At 12:44utc (5:44amPDT):

Received Verbal Alarms:
- TypeError with ETMX RMS WD,
- SUS ETMX watch dog tripped,
- SUS ETMX SW watchdog tripped
- SUS TMSX SW watchdog tripped
- SUS TMSX watch dog tripped
CDS Overview WHITE for H1IOPSUSEX, H1SUSETMX, H1SUSTMSX, SUSETMXPI
All through this, ETMx SUS & TMS systems remained damped and H1 is still in Nominal Low Noise, but am OUT OF OBSERVING.
Flagging this as CORRECTIVE MAINTENANCE
Called LLO Operator (Doug L) to notify them of our status.
Will be down on the order of hours (ie. trip to EX to restart frontends & other work)
Will make sure NOT to touch the DITHER scripts! (thanks, for the alog to Jeff & DAY crew)
Could not ssh into h1susex
Richard just arrived and we'll run through what we did yesterday morning. I will let him contact Dave when he's ready.

(attached is a screenshot of all the WHITE screens we have for EX.)

Images attached to this report

Comments related to this report

corey.gray@LIGO.ORG - 07:46, Friday 28 April 2017 (35860)

Link

Tally of activities taken to recover

Richard (& I) had trouble logging into EX Frontends (and any other ones); this was due to the controls password being changed.
Dave called in.
13:42 Richard heading out to EX
13:42 Since we'll probably be rebooting the EX World, taking any Guardian Nodes I can access to a SAFE state (i.e. SEI, HEPI, etc.)
- Since I did not see an obvious "SAFE" state on the SEI/HEPI/ISI Guardian Nodes (or anything in the SEI wiki), I called Hugh and he suggested I take the EX SEI Guardian Node to OFFLINE.
1351 h1susex booted by Richard
14:05 Richard back to corner station (Dave must still be working on getting us back & GREEN at EX).
14:11 Dave restarting models
When we lost EX SUS, the SUS & TMS bias sliders shifted. Used Time Machine to find out values from ~5:05amPDT, but also trending Oplev signals (for etmx) & M1_DAMP pit & yaw signals (for tmsx). The sliders shifted during the reboot this morning (and yesterday morning).
- 14:35 Starting with no light on the ALSx camera---tweaking sliders to restore signals above has brought back a spot on the ALSx camera!
- (NO DITHER scripts used this morning!)
- OK, working on getting stuff to a decent state for Jim, so that hopefully all he'll have to do is an Initial Alignment.

H1 SEI (SEI)

Link

corey.gray@LIGO.ORG - posted 05:43, Friday 28 April 2017 (35856)

EX HEPI SPM Diffs

Noticed some new YELLOW on the Guardian Overview (and Intent Bit) medm. It's related to a Time Ramp for the HEPI IPS' (i.e. H1:HPI-ETMX_IPS_[H/V]P_TRAMP). The other (6) channels have 30sec, but the setpoints for these two is currently 0. Leaving this for the SEI crew to remedy (perhaps next time we drop out of OBSERVING.

Attached is a screen shot showing the medms in question.

Images attached to this report

LHO General

Link

corey.gray@LIGO.ORG - posted 00:36, Friday 28 April 2017 (35854)

Transition To OWL

TITLE: 04/28 Owl Shift: 07:00-15:00 UTC (00:00-08:00 PST), all times posted in UTC
STATE of H1: Observing at 61Mpc
OUTGOING OPERATOR: Nutsinee
CURRENT ENVIRONMENT:
    Wind: 9mph
    Primary useism: 0.02 μm/s
    Secondary useism: 0.11 μm/s (at 50 percentile)
QUICK SUMMARY:

H1 range hovering just above 60Mpc & winds are slightly high at the end stations.

H1 General

Link

nutsinee.kijbunchoo@LIGO.ORG - posted 00:23, Friday 28 April 2017 (35853)

Ops Eve Shift Summary

TITLE: 04/28 Eve Shift: 23:00-07:00 UTC (16:00-00:00 PST), all times posted in UTC

STATE of H1: Observing at 63Mpc

INCOMING OPERATOR: Corey

SHIFT SUMMARY: One lockloss. *Probably* caused by the Noise Eater (see attached plot). No issue recovering.

Images attached to this report

H1 General

Link

nutsinee.kijbunchoo@LIGO.ORG - posted 22:34, Thursday 27 April 2017 - last comment - 23:10, Thursday 27 April 2017(35851)

Lockloss 05:19 UTC

Got a NPRO out of range message on Diag Main so I went in LVEA and toggle the Noise Eater. Not sure if the noise eater went bad and cause the lockloss or the other way round. Anyway, resume locking now.

Verbal Alarm complained about Type Error then crashed. There were similar errors in the log last night when seismic FE tripped.

Comments related to this report

nutsinee.kijbunchoo@LIGO.ORG - 23:10, Thursday 27 April 2017 (35852)

Link

Back to Observe 06:09 UTC

H1 General

Link

nutsinee.kijbunchoo@LIGO.ORG - posted 21:05, Thursday 27 April 2017 (35850)

Ops Eve Midshift Summary

Been Observing for 5.5 hours. A big tour group in the control room early in the evening. Wind seems to be dying down. No issue to report.

H1 CAL

Link

jeffrey.kissel@LIGO.ORG - posted 17:10, Thursday 27 April 2017 (35849)

2017-04-27 New Full Calibration Measurement Suite

J. Kissel

I've gathered a full set of transfer functions to measure the IFO sensing and actuator functions. All looks quite normal, detailed analysis to come.

The data lives here:
Sensing Function Measurements
    2017-04-27_H1DARM_OLGTF_4to1200Hz_25min.xml
    2017-04-27_H1_PCAL2DARMTF_4to1200Hz_8min.xml
    2017-04-27_H1_PCAL2DARMTF_BB_5to1000Hz_0p25BW_250avgs_5min.xml
    2017-04-27_H1_OMCDCPDSUM_to_DARMIN1.xml                         <-- new measurement that captures the [mA/ct] 
                                                                        scale of the digital portion of the sensing function

Actuation Function Measurements
    UIM: 2017-04-27_H1SUSETMY_L1_iEXC2DARM_25min.xml
         2017-04-27_H1SUSETMY_L1_PCAL2DARM_8min.xml

    PUM: 2017-04-27_H1SUSETMY_L2_iEXC2DARM_17min.xml
         2017-04-27_H1SUSETMY_L2_PCAL2DARM_8min.xml

    TST: 2017-04-27_H1SUSETMY_L3_iEXC2DARM_8min.xml
         2017-04-27_H1SUSETMY_L3_PCAL2DARM_8min.xml

All has been committed to the Calibration SVN.

H1 SEI

Link

hugh.radkins@LIGO.ORG - posted 16:41, Thursday 27 April 2017 (35848)

STS2-C (HAM5) at Roam2 (very close to BSC8 Chamber Leg) during calm and windy times

These graphs compare the STS2-C instrument with ITMY (STS2-B.) Roam2 is ~24" -X-Y of the +X-Y (NE) leg of the WBSC8 (H2 ITMY) Chamber. The graphs show velocity ASDs of both instruments during Calm (<5mph) and Windy (20+mph) Periods. Also shown is the calm and windy period coherence. Plots are segregated by dof. The calm period begins at 1300utc 26 April; the windy period begins at 000utc 26 April. The wind direction was generally from the S to SSW (from +Y and -X.)

Comparing the similar plots of alog 35186, a few things may be concluded:

Despite the calm period being even calmer here at Roam2, the X axis noise is greater than at Roam1 (drawing coming soon) by ~x10. Likewise during the windy period.

The Y axis comparison has the calm winds producing similar noise on the two seismometers with the windy period elevating the noise more at the Roam2 position by a few.

The Z axis response also indicates a noisier response during higher winds.

The coherence plot (not shown in 35186 alog) is uhhhh, well, depending on freq and dof, shows either more or less coherence between the instruments. I could report specifics or details but just enjoy the silence and see for your self.

If these comparisons are valid and I haven't screwed anything up, the conclusion is, this location is not nearly as good as the current ITMY or the Roam1 position, and, the chamber is not providing any pinning against tilting of the seismometer.

Images attached to this report

H1 General

Link

jim.warner@LIGO.ORG - posted 16:03, Thursday 27 April 2017 (35846)

Shift Summary

TITLE: 04/27 Day Shift: 15:00-23:00 UTC (08:00-16:00 PST), all times posted in UTC
STATE of H1: Observing at 56Mpc
INCOMING OPERATOR: Nutsinee
SHIFT SUMMARY: Started rough after EX sei crash, but running smooth now
LOG:
The big event was the recovery after the EX seismic front end crashed (see Jeff and Dave's logs for details). Otherwise a few people went to the mid stations today, but we were a little distracted with recovery.

H1 AOS (AOS, SEI, SUS)

Link

jeffrey.bartlett@LIGO.ORG - posted 15:41, Thursday 27 April 2017 - last comment - 16:04, Thursday 27 April 2017(35845)

OpLev 7-Day Trends

   Post the 7 day OpLev trends. 

SUM: ITMY - No data after 04/24 - Zoomed the plot way out and could find a bottom.
     ETMY - Big jump after the 04/25 maintenance window. This is a result of Jason's tune up during the window. 

Yaw: ITMX - Near the 10urad line. Trend line is flat.

Pitch: SR3 - Near the 10urad line. Trend line is flat.

Images attached to this report

Comments related to this report

jason.oberling@LIGO.ORG - 16:04, Thursday 27 April 2017 (35847)

Link

Everything looks normal with the trends. ITMx yaw is beginning to get close to the 10 µrad limit, and will likely need re-centering in the near future.

Regarding ITMy SUM, it changed due to my swap of the laser (reported here). Unfortunately it's not entirely out of the woods yet, as the laser is showing signs of low output power. While the SUM signal doesn't show up on Jeff's plot above (not sure if this is due to something in the script or some odd artifact of the OS change with the control room workstations, or something else entirely), it is there; DetChar summary page for ITMy oplev shows it, as well as the oplev MEDM screen. The oplev seems to be functioning normally, just looks like its output power is low. I plan on investigating this during the 5/2/2017 maintenance window.

H1 OpsInfo (FRS, ISC, SEI, SUS)

Link

jeffrey.kissel@LIGO.ORG - posted 15:16, Thursday 27 April 2017 (35839)

Recovery from SEI Front-End Shut Down

J. Kissel, J. Warner, J. Driggers, J. Oberling, C. Gray

Executive Summary
    Running an old, infrequently used "ditherAlign" script to recover green spots after gross X arm misalignment (i.e. because of the SEI front-end failure early this morning; see LHO aLOG 35824, 35834) caused more than usual trouble regaining the X ARM ALS angular control. After slowly working / stumbling our way through identifying the problems by performing "the usual" troubleshooting (i.e. G1602280), we were able to return to full initial alignment and lock acquisition and achieved nominal low noise by 20:00 UTC. 

Total down time -- 7 hours from 2017-04-27 13:00 to 20:00 UTC.

Lessons Learned
- There are four places any given operator goes for information to diagnose a problem when alone on evening / night shifts:
        (1) Jenne's H1 Troubleshooting Presentation: G1602280
        (2) The OPS Wiki Troubleshooting Page: Trouble Shooting the IFO
        (3) The OPS Wiki Useful Scripts Page: Useful Scripts For Operators
        (4) Nutsinee has her own Trouble Shooting Page: Nutsinee's Page
When an operator has just restarted shifting after a month off, and nothing's gone wrong for that operator in a while, you forget even the location of resources let alone which resource to use. It will be a giant effort to merge these documents, but we could at least work to link all of them to the other.
        Jim recommends we banish (2), we update and maintain (3), and acknowledge that (4) is not cannon if used by others who are not Nutsinee. Jenne acknowledges that (1) has several pending updates, and will add a few things from today's experience.

- It's important to have such a "ditherAlign" script that rescues us from a gross arm misalignment in which we've lost spots. But we have those events so infrequently that the script suffers from bit rot between uses (e.g. it uses the TDS library, and we just upgraded the control room to Debian8, which doesn't support the TDS library). As such, we should upgrade, debug and fix this script and update the associated documentation. However, after looking at it, it's a beast of a spaghetti monster. Also a giant effort.

- When we get such a gross misalignment, we should not expect *any* operator to be able to fix the problem (let alone diagnose it) quickly or by themselves. It took all of the authors patiently sitting through the problem, picking up clues, trying this and that, looking in 15 different places (only possible with 3-4 pairs of eyes) for us to solved the problem, and later identify it. We should just expect this after we lose a seismic front-end. 

- Since we've moved toward the O2 model of "do not call commissioners if you have a problem," operators have, in general, become reticent to call if there's a problem, especially on owl shifts -- and that call list is Keita. Further, in the era of 71 hour locks and 80-90% duty cycle, commissioners and detector engineers are far less regularly in the control room. Yet further, shift changes are also a really tough point in the chain of communication and on the day operator. Not only do they have the stress of a mid-night failure from which they don't have all the information, but the gets compounded with the phones ringing, everyone coming in asking what's wrong and/or is it fixed, and not knowing who can actually stay to help. I make this last statement with no proposed solution, but merely to expose what happens these days during an un-identified mid-night failure mode and to encourage patience and cooperation by all.

Detailed Timeline
- Seismic front-end crashed

- After seismic computer and platform recovery (see LHO aLOG 35824, 35834), we did not see any spot on the ALS X Green camera.

- After some manual tries (unclear if any restoration to slider / oplev / OSEM values was done), an infrequently used script to recover green spots after gross arm misalignment, 
     /opt/rtcds/userapps/release/asc/h1/scripts/ditherAlign.py
was run on both TMSX (twice) and ITMX. These scripts failed, and left a whole bunch of stuff in a bad alignment state, namely 
    - all X arm optics (ITMX, ETMX, TMSX) were aligned to a bad location, 
    - the ITM *misalignment* offsets were changed, and 
    - The green camera (CAM) Reference OFFSETS were changed. 
Some, but not all recovery was made, because the users weren't aware of everything this script touched. 
(And before you cry "but the SDF system!" remember that there are lots of DIFFs that appear in down snaps that are usually overlooked because things work out in the end.)

(Corey departs, Jim Arrives)

- After a restoration of the TMSX alignment sliders to the previous observation stretch's OSEM location, we regained spots on the camera. Jim then heroically pushed the ETMX and ITMX alignment around until he recorved *decent* arm cavity transmission.

- As is standard practice, he then went for through an initial alignment of the X ARM (INITIAL ALIGNMENT state on ALS_XARM) which turns on automatic alignment, including green WFS and green Camera (CAM) ITMX spot restoration. However, because the end station alignment was still far enough off that the WFS error signals were too large, and the green CAM references had been errantly changed by the ditherAlign script the WFS and CAM servos blew up after every attempt to automatically close them as normal.

(Jeff Arrives)

- After further efforts to manually increase the transmission to reduce the WFS and CAM error signals without success, we remembered one has to clear WFS / SUS offsets if/when/after they drive optics into the weeds. 

(Jenne Arrives)

- Having cleared the weeds, we were able to close the green YAW WFS 1 & 2 loops that control TMSX and ETMX. To do so, we needed to set the ALS_XARM guaridan to ENABLE_WFS, and force the triggering of the alignment servos to be ALWAYS ON, i.e. set
    - H1:ALS-X_WFS_TRIG_THRESH_ON and H1:ALS-X_WFS_DOF_FM_TRIG_THRESH_ON to -0.1 (i.e. anything below zero), 
    - Flip to the master gain switch (H1:ALS-X_WFS_SWITCH) to ON.
However, with the triggering forced ON, that meant that if the arm lost lock, we'd need to quickly turn off the inputs to the WFS loops we had under control so more weeds would not grow. Recall from G1602280, successful closure of these loops was only possible if the error signals were less than ~1000 and preferably better than 500 [ct]. We were using Jenne's premade StripTool template for the WFS error signals,
    /ligo/home/jenne.driggers/Templates/Strip/Green_Y_AlignErrorSigs.stp

(Jason Arrives)

- With TMS and ETMX under control in YAW, we tried slowly moving ITMX in yaw (with the green WFS) to reduce the CAM YAW error signal (at this point both were in the 0.7 range, and we want better than 0.1). As we (very slowly) moved the ITM (so that the end-station optics' WFS could follow), we realized that reducing the CAM error signal made MICH fringes on the AS AIR camera look worse, implying that we were doing the wrong thing. So, we reverted the ITM location, and went to close the Pitch loops.

- Closing WFS A / DOF1 / ETMX pitch loop was relatively easy, but even though the error signals were sufficiently small, closing the WFS B / DOF 2 / TMSX loop would start growing weeds and break. After several tries, we realized we may be on the wrong side of the WFS A / DOF1 / ETMX PDH error signal hump, so we pushed the ETM back in the oppposite direction, indeed went over the hump, and began to reduce the error signal again. After that discovery, both PIT DOFs clsoed nicely.

- Now having ALL WFS end X optics controlled DOFS closed, and the green arm transmission up in the high 0.9s to 1, we again went back to ITMX to reduce the CAM error signal. This again made the AS AIR spot / MICH fringes look like crap, and reduced the green arm transmission. We [incorrectly] concluded that the cameras must have moved, and remeasured and set the CAM offsets.

- At this point we were able to resume regular initial alignment. All went well, until SRC_ALIGN, where we saw excess fringing on the AS port camera. Jenne knew this was because some optic was not entirely *misaligned* so we trended the *misalignment offsets* in the M0_TEST bank of ITMX, and found they were wrong by half. Restoring these allowed us to complete initial alignment as normal. 

- All the rest of normal lock acquisition worked swimmingly. 

- Upon arrival at nominal low noise, we began checking for SDF differences -- and it was only then that we started to put the pieces together that the driveAlign script had messed with everything listed above.

LHO FMCS

Link

patrick.thomas@LIGO.ORG - posted 14:18, Thursday 27 April 2017 - last comment - 06:10, Friday 28 April 2017(35843)

Chiller pump at end Y in alarm?

While updating the FMCS MEDM screens for the migration to the BACNet IOC I noticed that the alarm status for chiller pump 2 at end Y (H0:FMC-EY_CY_ALARM_2) has been active (value = 2) for at least a month. Is this normal?

Non-image files attached to this report

endY_pump2.pdf

Comments related to this report

john.worden@LIGO.ORG - 06:10, Friday 28 April 2017 (35859)

Link

This is NOT a chiller pump alarm. It is a "Chiller 2" alarm. There are two devices which are supplying chilled water for the building HVAC - The Chiller is a refrigeration machine which cools the water and there is a separate chilled water pump (CWP) which circulates the water through the chiller and up to the building.

This chiller has two cooling circuits - one has a known fault - hence the alarm.

To confuse the issue further there are two chillers and two chilled water pumps at the end station - this provides us redundancy in case of failure.

The critical alarm is the "Chilled Water Supply Temperature". This temperature is currently normal.

H1 ISC

Link

sheila.dwyer@LIGO.ORG - posted 17:13, Wednesday 26 April 2017 - last comment - 17:53, Thursday 27 April 2017(35810)

CHARD cut offs

I added two band stops to the CHARD loops, which have reduced the CHARD drives from 15-25 Hz by about a factor of 10. ASC noise should no longer be limiting our DARM sensitivity above 15Hz, the noise is only slightly better from 15-20 Hz.

The first attachment shows the slight improvement in the DARM noise, the difference in the drives (the cut offs I added were 2nd order elliptic bandstops), and the reduction in the coherence between the ASC drives and DARM. For CHARD Y the first screenshot shows the loop measurement before I added the bandstop, the bandstop should have reduced the phase at the upper ugf (2.5 Hz) by about 5 degrees. For CHARD P I reduced the gain by about 3dB, the third screenshot shows the before measurement in blue, a measurement after I reduced the gain but before I added the cut off in red. For CHARD the cut off only reduced the phase at the upper ugf of 3 Hz by 6 degrees, we are left with almost 50 degrees of phase margin.

I also re ran the noise budget injections for CHARD, DHARD, MICH, SRCL, PRCL and IMC PZT jitter. There only real change is that the ASC noise is lower, and there is a larger gap between the sum and the measured noise at 25 Hz. I am not able to download GDS strain from this afternoon, so I will post a noise budget when I can get data.

Images attached to this report

Comments related to this report

corey.gray@LIGO.ORG - 05:31, Thursday 27 April 2017 (35821)ISC

Link

Was there a change to H1:ASC-CHARD_P_GAIN from -0.10 to -0.14? (This came up as an SDF Diff for the next lock.)

sheila.dwyer@LIGO.ORG - 17:53, Thursday 27 April 2017 (35838)

Link

Yes, Corey, sorry I forgot to load the guardian.

The first attachment is the noise budget with measurements from yesterday. You can see that the broad lump that we blame on beam size jitter is worse, there is a gap between the measured noise and the sum of the predicted noises from 300-800 Hz which was not present in the noise budget from early Jan (here) Looking at the summary pages, you can see that this has happened in the last week. (April 18th compared to yesterday). Kiwamu and I had a look at some alignment sensors, and at first glance it doesn't seem like we've had an unusual alignment change this week. We asked Aidan to check the Hartmann data to see if there has been a change in absorption on ITMX.

The linear jitter is also slowly getting worse, which you can see by comparing the 350 Hz peak to January. The next two attached pngs are screenshots of the jitter transfer functions measured yesterday using the IMC PZT. You can compare these to measurements from mid Feb and see that the coupling is about 50% worse for yaw and almost a factor fo 2 worse for pit.

The 4th attachment shows a comparison of the coherence between DARM and the IMC WFS DC signals for February to earlier today. We now have broad coherence between IMC WFS B pit and darm, which I don''t think I have seen before even when we had a broad lump of noise in DARM before our pre O2 alignment change.

The last attachement shows coherences between DARM and the bullseye PD on the PSL.

Images attached to this comment

Non-image files attached to this comment

Noise_budget_April262017.fig

H1 CDS

Link

patrick.thomas@LIGO.ORG - posted 23:26, Tuesday 25 April 2017 - last comment - 00:49, Tuesday 08 August 2017(35792)

Migration of FMCS EPICS channels to BACNet IOC

WP 6577

Dave B., Carlos P., Bubba G., John W., Patrick T.

I have migrated a subset of the EPICS channels provided by the FMCS IOC on h0epics to an IOC I created on fmcs-epics-cds. The IOC on fmcs-epics-cds connects to the BACNet server that Apollo has installed as part of the FMCS upgrade. The channels that I migrated have been taken over by this upgrade and can no longer be read out by the server that the IOC on h0epics reads from. The fmcs-epics-cds computer connects to the slow controls network (10.105.0.1) on eth0 and the BACNet network (10.2.0.1) on eth1. It is running Debian 8.

The IOC on h0epics is started from the target directory /ligo/lho/h0/target/h0fmcs (https://cdswiki.ligo-wa.caltech.edu/wiki/h0fmcs). I commented out the appropriate channels from the fmcs.db and chiller.db files in the db directory of this path and restarted this IOC. I made no changes to the files in svn.

The IOC on fmcs-epics-cds uses code from SNS: http://ics-web.sns.ornl.gov/webb/BACnet/ and resides in /home/cdsadmin/BACnet_R0-8. This is a local directory on fmcs-epics-cds. This IOC is started as cdsadmin:

> ssh cdsadmin@10.105.0.112
cdsadmin@fmcs-epics-cds: screen
Hit Enter
cdsadmin@fmcs-epics-cds: cd /home/cdsadmin/BACnet_R0-8/iocBoot/e2b-ioc/
cdsadmin@fmcs-epics-cds: ../../bin/linux-x86_64/epics2bacnet st.cmd
Hit CTRL-a then 'd'

Issues:

I came to realize during this migration that the logic behind the binary input channels is different in BACNet. In BACNet a value of 0 corresponds to 'inactive' and a value of 1 corresponds to 'active'. In the server being migrated from a value of 0 corresponds to 'invalid'. This was verified for the reverse osmosis alarm: H0:FMC-CS_WS_RO_ALARM. In the BACNet server it reads as 0 or 'inactive' when not in alarm. When John W. forced it into alarm it read as 1 or 'active'. I believe Dave has updated his cell phone alarm notifier to match this.

A similar situation exists for the state of the chiller pumps. In the server being migrated from a value of 1 appears to correspond to 'OFF' and a value of 2 appears to correspond to 'ON'. It has not been verified, but I believe in the BACNet server a value of 0 corresponds to 'OFF' and a value of 1 corresponds to 'ON'. The pump status for each building is calculated by looking at the state of the pumps. The calculation in the database for the IOC being migrated from appears to be such that as long as one pump is running the status is 'OPERATIONAL'. If no pump is running the status is 'FAILED'. I need to double check this with John or Bubba. I updated the corresponding calc records in the database for the BACNet IOC to match this.

In the server being migrated from, channels that are read by BACNet as binary inputs and binary outputs are read as analog inputs. I changed these in the database for the BACNet IOC to binary inputs and set the ONAM to 'active' and the ZNAM to 'inactive'. The alarm levels also need to be updated. They are currently set through the autoburt snapshot files that contain a separate channel for each alarm field. The autoburt request file has to be updated for the binary channels to have channels for .ZSV and .OSV instead of .HSV, .LSV, etc. So currently there is no control room alarm set for the binary channels, including the reverse osmosis alarm.

I also need to update the medm screens to take account of this change.

Also, there is an invalid alarm on the control room alarm station computer for the mid X air handler reheat temperature. Looking on the BACNet FMCS server this channel actually does appear to be genuinely invalid.

It should be noted that this BACNet IOC is a temporary install until an OPC server is installed on the BACNet server.

I would like to leave the permit for this work open until the FMCS upgrade is complete and all the channels have been migrated to the BACNet IOC.

Comments related to this report

patrick.thomas@LIGO.ORG - 14:33, Wednesday 26 April 2017 (35806)

Link

I updated the autoBurt.req file.

patrick.thomas@LIGO.ORG - 16:56, Wednesday 26 April 2017 (35811)

Link

I have set the alarm on the RO channel: caput H0:FMC-CS_WS_RO_ALARM.OSV MAJOR

patrick.thomas@LIGO.ORG - 14:19, Thursday 27 April 2017 (35844)

Link

I have updated the medm screens.

patrick.thomas@LIGO.ORG - 00:49, Tuesday 08 August 2017 (38061)

Link

Note: The 'ARCH = linux-x86' line in the Makefile under 'BACnet_R0-8/iocBoot/e2b-ioc' had to be changed to 'ARCH = linux-x86_64' in the code copied from SNS.

H1 DetChar (DetChar)

Link

evan.goetz@LIGO.ORG - posted 15:52, Tuesday 18 April 2017 - last comment - 06:11, Friday 28 April 2017(35640)

Turned off Pcal camera ethernet adapter -- maybe mitigates a ~1 Hz comb?

Evan G., Robert S.

Looking back at Keith R.'s aLOGs documenting a changes happening on March 14 (see 35146, 35274, and 35328), we found that one cause seems to be the shuttering of the OpLev lasers on March 14. Right around this time, 17:00 UTC on March 14 at EY and 16:07 UTC at EX, there is an increase in line activity.

The correlated cause is Travis' visit to the end station to take images of the Pcal spot positions. The images are taken using the Pcal camera system and needs the OpLevs to be shuttered so that a clean image can be taken without the light contamination. We spoke with Travis and he explained that he disconnected the USB interface between the DSLR and the ethernet adapter, and used a laptop to directly take images. Around this time, the lines seem to get worse in the magnetometer channels (see, for example, the plots attached to Keith's aLOG 35328).

After establishing this connection, we went to the end stations to turn off the ethernet adapters for the Pcal cameras (the cameras are blocked anyway, so this active connection is not needed). I made some magnetometer spectra before and after this change (see attached). This shows that a number of lines in the magnetometers are reduced or are now down in the noise.

Hopefully this will mitigate some of the recent reports of combs in h(t). 

We also performed a short test turning off another ethernet adapter for the H1 illuminator and PD. This was turned off at 20:05:16 18/04/2014 UTC and turned back on at 20:09:56 UTC. I'll post another aLOG with this investigation as well.

Images attached to this report

Comments related to this report

keith.riles@LIGO.ORG - 13:46, Wednesday 19 April 2017 (35667)DetChar

Link

Good work! That did a lot of good in DARM. Attached are spectra in which many narrow lines went 
away or were reduced (comparing 22 hours of FScan SFTs before the change (Apr 18) with 10 hours of
SFTs after the change (Apr 19). We will need to collect much more data to verify that all of the 
degradation that began March 14 has been mitigated, but this first look is very promising - many thanks!

Fig 1: 20-50 Hz 
Fig 2: 50-100 Hz
Fig 3: 100-200 Hz

Images attached to this comment

keith.riles@LIGO.ORG - 09:01, Thursday 20 April 2017 (35682)DetChar

Link

Attached are post-change spectra using another 15 hours of FScan SFTs since yesterday. Things continue to look good.

Fig 1: 20-50 Hz 
Fig 2: 50-100 Hz
Fig 3: 100-200 Hz

Images attached to this comment

evan.goetz@LIGO.ORG - 11:54, Friday 21 April 2017 (35706)

Link

Correction: the date is 18/04/2017 UTC.

keith.riles@LIGO.ORG - 19:28, Thursday 27 April 2017 (35826)DetChar

Link

Another follow-up with more statistics. The mitigation from turning off
the ethernet adapter continues to be confirmed with greater certainty.
Figures 1-3 show spectra from pre-March 14 (1210 hours), a sample of 
post-March 14 data (242 hours) and post-April 18 (157 hours) 
for 20-50 Hz, 50-100 Hz and 100-200 Hz.

With enough post-April 18 statistics, one can also look more closely at
the difference between pre-March 14 and and post-April 18. Figures 4-6
and 7-9 show such comparisons with different orderings and threrefore
different overlays of the curves. It appears there are lines in the post-April 18
data that are stronger than in the pre-March 14 data and lines in the earlier 
data that are not present in the recent data. Most notably, 1-Hz combs
with +0.25-Hz and 0.50-Hz offsets from integers have disappeared.

Narrow low-frequency lines that are distinctly stronger in recent data include these frequencies:

21.4286 Hz
22.7882 Hz - splitting of 0.0468 Hz
27.4170 Hz 
28.214 Hz 
28.6100 Hz - PEM in O1
31.4127 Hz and 2nd harmonic at  62.8254 Hz
34.1840 Hz
34.909 Hz (absent in earlier data)
41.8833 Hz 
43.409 Hz (absent in earlier data)
43.919 Hz
45.579 Hz
46.9496 Hz
47.6833 Hz
56.9730 Hz
57.5889 Hz
66.7502 Hz (part of 1 Hz comb in O1)
68.3677 Hz
79.763 Hz
83.315 Hz
83.335 Hz
85.7139 Hz
85.8298 Hz
88.8895 Hz
91.158 Hz
93.8995 Hz
95.995 Hz (absent in earlier data)
107.1182 Hz 
114.000 Hz (absent in earlier data)

Narrow low-frequency lines in the earlier data that no longer appear include these frequencies:

20.25 Hz - 50.25 Hz (1-Hz comb wiped out!)
24.50 Hz - 62.50 Hz (1-Hz comb wiped out!)
29.1957 Hz 
29.969 Hz


Note that I'm not claiming change points occurred  for the above lines on March 14 (as I did for the
original set of lines flagged) or on April 18. I'm merely noting a difference in average line strengths 
before March 14 vs after April 18. Change points could have occurred between March 14 and April 18,
shortly before March 14 or shortly after April 18.

Images attached to this comment

keith.riles@LIGO.ORG - 06:11, Friday 28 April 2017 (35858)DetChar

Link

To pin down better when the two 1-Hz combs disappeared from DARM,
I checked Ansel's handy-dandy comb tracker and found the answer immediately.

The two attached figures (screen grabs) show the summed power in the teeth of those combs.
The 0.5-Hz offset comb is elevated before March 14, jumps up after March 14
and drops down to normal after April 18. The 0.25-Hz offset comb is highly
elevated before March 14, jumps way up after March 14 and drops down to normal
after April 18.

These plots raise the interesting question of what was done on April 18 that
went beyond the mitigation of the problems triggered on March 14. 

Figure 1 - Strength of 1-Hz comb (0.5-Hz offset) vs time (March 14 is day 547 after 9/15/2014, April 18 is day 582)

Figure 2 - Strength of 1-Hz comb (0.25-Hz offset) vs time

Images attached to this comment