Upon my arrival (at 7:45 am PT) in the control room I found the IFO locked and in "Observation" mode with the intent bit set. The Lock Clock showed 8:50 hours and counting. The input power was 24.3 W. Camera images looked stable and as expected. (Because the video5 computer had frozen! argh). DARM ASD looked terrible, but not unreasonable for the operating condition of the IFO. ISC_Lock Guardian was in Nominal Low Noise.
This is a better start of the day then we have had for awhile. This should allow the ER9 testing of the hwinj, cal, and data pipelines and, with the intent bit set, allow the analysis streams to test their readiness.
15:13 UTC (8:13 PT) - just lost lock. Morning activities?
Patrick, Sheila, Carl and Ross damping PIs
After Keita's ISS fixes we had a few hours of bad weather and earthquakes that contributed to dificulty locking. Once things calmed down we locked and sat at several stages to see if the lock was stable, DC readout, 25 Watts, ISS 2nd loop engaged, and now we have made it to some kind of "nominal low noise" state. Patrick hit the intent bit.
There are several things wrong with this spectrum, some of which could be solved by going back to 40 Watts in the morning:
The last attached screenshot is just to show how many locking attempts everyone made in the last 14 hours. The range displayed by the DMT viewer in the control room seems to have a problem, it should be stable at some low range but on the DMT viewer it looks like we lost lock in the last half hour.
Sheila's comment
There were several things which had to be done today because of the change to run ER9 at 25 Watts.
We should remember to undo these things as soon as we go back to 40 Watts. According to Ross and Carl, there isn't a reason to think our PI situation has gotten worse than it was for the last 2 weeks where we had stable locks more than 4 hours long, but the PI damping would need to be babysat at 40 Watts. Maybe an engineering run is a good time for detector engineers and operators to learn how to do this.
There are several things that should be done if we want to continue the ER at 25 Watts with a reasonable sensitivity:
none of these are all that hard to do, but to me it seems like it would be more productive to try continuing the ER at 40 Watts, to see what we can learn about the IFO in a state that is more similar to the configuration we would like to run at for O2.
DARM-ISS 2nd loop coherence doesn't change by increasing the ISS 2nd loop gain by 6dB. Jitter or whatever, ISS is imprinting noise onto intensity.
Keita, this is something I think we saw in the past too:
https://alog.ligo-wa.caltech.edu/aLOG/index.php?callRep=20394
CO2 related comment:
The phase of the 18040Hz mode flipped during this lock stretch. The damping has been switched to the arm transmission QPDs gain 10,000 phase + 60 deg.
The 18056Hz mode also became unstable and was succesffully damped with settings in SUS_PI guardian.
The ETMX 15541.8Hz mode and ETMY 15542.6Hz modes that have been difficult to damp were damped well tonight with the arm tranmssion QPD signals. The rational was that the arm transmission QPD's differentiate the arm the mode is in to some extent reducing the beat effect. These settings are in the SUS_PI guardian. The revert comment out the ETM QPD gain settings and uncomment the OMC gain settings for these modes.
Carl - I'm wondering if the width and apparent double nature of each mode could be due to the same mode being excited in the coating-bearing mass and in the reaction mass - could the reaction mass body modes be getting rung up as well? Just a thought. Might be nonsense.
There has been something wrong with the ETMY oplev for quite a while now, it says that the optic is moving a lot more than the other test masses, swinging around by more than half a micro radian. This must be false, and needs some investigation.
Looking at the SUM out of the oplev at 2 different times, it seems like there may be an issue with the laser. The first attachment shows the ETMy SUM signal from a period in April while the second shows the same signal at the start of July (y axis is counts, and the scale is the same between the two pictures). As can be seen the SUM signal is noisier now that it was back in April. This could be what is causing the issue Sheila reports above. We can swap the laser at the next available opportunity to see if this fixes the problem.
after 8 hours of writing a complete set of framed data with no errors, h1fw0 went unstable again at 10pm PDT. I have reconfigured it to stop writing commissioning frames, and it has been running for 20 mins. We will leave it in this configuration overnight.
The lockloss from the state REFL_POP_WFS which happened at 23:56:29 UTC July 07 2016 is very similar to what is described in alog 26840 and comments. I've attached the guardian logs for both ISC_DRMI and ISC_LOCK, and a plot of the lockloss.
The first thing that happens is that the ISC_LOCK gaurdian starts to transition our front end triggering for the LSC from POP18 to POPLF, by lowering the thresholds, sleeping 0.1 seconds, and then setting the trigger matrix elements to 0.
Yes, the expected behavior is that the target state of a jump transition of a managed node will be executed normally after the transition. The stall just prevents the system from following any standard transtions after the state returns true. Nothing prevents jump transitions, though.
If you don't want the system to do anything after a jump I would suggest inserting a do-nothing state in between.
The ext_alert process on h1fescript0 which alerts the control room of Gamma Ray Burst and Supernovae events has stopped running because its robot certificate (needed to query GraceDB) expired on 27 June 2016. We are in the process of renewing this certificate.
Yesterday at 13:40 and at 22:40 PDT the CW stopped exciting the EX PCAL because its testpoint was cleared from awgtpman. The excitation slot in the awg remained even though the excitation was not active, so the CW sender (psinject process on h1hwinj1) was unaware that its excitation was no longer operational and did not try to restart. When this happened the CDS overview screen did show a red excitation error for h1calex (meaning that an excitation should be present but is missing) and the operator medm overview should have shown a missing CW error as well.
The problem was found to be a "feature" in diag which allows the user to clear every test point on every model with one command. The feature was accidentally acitvated yesterday when the dcu-id and the testpoint were interchanged in the tp clear command.
Jim is working on removing this feature from the current diag code to prevent this accidentally happening again during ER9.
Actually the fix is very intrusive (will result in new binaries for the complete GDS suite of tools; diaggui, foton, awgtpman etc) so we will delay the change until next Tuesday.
In the mean time, within a 'diag -l' session, please do not type the following commands:
tp clear *
tp clear * <dcuid>
tp clear * *
tpclear *
tpclear * <dcuid>
tpclear * *
Take it from us, they will clear all test points on all models and stop the CW hardware injection.
The reason why the IFO second loop didn't like the ISS 2nd loop is because the ISS 2nd loop offset is now hard coded (28076) while the power and diffraction both changed because of the diode swap and subsequent tune-up.
I remeasured it at 40W (H1:PSL-ISS_SECONDLOOP_REF_SIGNAL_ANA=-0.027689945 for 16-17% diffraction) but Vern told me that it was decided that we'll run IFO at 25W during ER9, so I measured it yet again at 25W (-0.027651985).
I'm puzzled that the number changed this much (previously it was -0.9826934814453125 at 40W).
It's also odd that there's not much difference between 40W and 25W. The diodes are definitely connected, the readback of analog sum (H1:PSL-ISS_SECONDLOOP_PD_14_SUM_OUT) is almost 100% coherent with the digital sum of individual PDs (H1:PSL-ISS_SECONDLOOP_SUM14_AC_OUT and H1:PSL-ISS_SECONDLOOP_SUM58_AC_OUTPUT).
Anyway IMC_LOCK guardian was modified for 25W (H1:PSL-ISS_SECONDLOOP_REF_SIGNAL_ANA, iss_diffracted_power_target=16.5).
Update: I was likely tricked by MEDM screen (graphics of switch states sometimes don't agree with reality) when I was doing the above measurement with only MC locked.
With full IFO the above offset was found totally off, and at 25W H1:PSL-ISS_SECONDLOOP_REF_SIGNAL_ANA=-0.5885 or so for 15%-ish diffraction. Didn't have time to remeasure at 40W.
Summary:
The 2nd loop engagement logic is bad as it wastes too much time waiting for a luck, but waiting for a luck doesn't do anything good.
Details:
2nd loop offset servo can take the 2nd loop board output or the diffracted power as the error signal.
At 40W or 25W, without engaging the 2nd loop, the output of the 2nd loop board always goes rail to rail even if the offset is correctly set just because the error signal is big.
Despite this, the offset adjustment servo is engaged anyway using the 2nd loop board output. The board bang-bangs forever, but eventually the guardian grabs a lucky moment when the board output happens to be small enough of a number, and thinks (incorrectly) that the offset servo converges. And then it engages the second loop. But this is as good as nothing IF you know that your static offset is reasonable.
Until a better criteria to engage the 2nd loop is found, I think the best strategy is to
I changed the guardian sans step 3. above:
It works.
2nd loop sudden death problem:
Jenne found that ISS 2nd loop is suddenly disengaged because the 2nd loop board output exceeds the OFF trigger threshold of 5 (first attachment), killing IFO.
The second board output goes close to 5 kind of often now, it seems. Since we don't have time to do a good investigation, for the moment I set the threshold to 10 (which sounds too large) and see how it goes.
In the last lock the IFO survived with ISS 2nd loop on for 10 minutes. The lock loss didn't seem to be due to ISS (2nd attachment).
We were showing Dan the h1fw0 problems (after turning on all frame writing again) and after less than an hour h1fw0 became stable again and has been running for 2+ hours. We will leave it in this state for now, but if it becomes very unstable again we have several options:
Turn off LDAS frame comparison jobs (reads every frame on both writers)
Remove all LDAS archiving from h1fw0 over to h1fw1
Reduce h1fw0 back to only writing science frames
Fingers crossed we don't have to do any of these.
Dan confirmed how the frame files written by CDS are currently accessed by LDAS for archival and checking, summarized below.
h1fw0 writes to the ldas-h1-frames file system (located on SATABOYs in the warehouse)
h1fw1 writes to the cds-h1-frames file system (located on SATABOYs in the MSR)
h1fw0/ldas-h1-frames
science frames are copied over to LDAS every 64 secconds
science frames are read every 64 seconds to compare them with h1fw1's frame
commissioning frames are read every 64 seconds to compare them with h1fw1's frame
second trend files are copied over to LDAS archive area every 10 minutes
second trend files are read every 10 minutes to compare them with h1fw1's frame
minute trend files are copied over to LDAS archive area every 10 minutes
minute trend files are read every 10 minutes to compare them with h1fw1's frame
science frames are read every 64 seconds to compare them with h1fw0's frame
commissioning frames are copied over to LDAS every 64 secconds
commissioning frames are read every 64 seconds to compare them with h1fw0's frame
second trend files are copied over to LDAS scratch area every 10 minutes
second trend files are read every 10 minutes to compare them with h1fw0's frame
minute trend files are copied over to LDAS scratch area every 10 minutes
minute trend files are read every 10 minutes to compare them with h1fw0's frame
Started and ran the purge-air skid, Turbo, QDP80 and newly added scroll pump. Tested the functionality of the Turbo's Safety Valve with the control cable connected to the scroll pump's relay box - demonstrating that the Safety Valve closes upon the loss of scroll pump motor AC and thus mimicking the "QDP80 Running" signal. Note: The purge-air skid has developed some problems from lack of use over these past few years, namely the radiator fan never came on while running the compressors, the drying tower never cycled and the low pressure alarm never sounded. These features worked fine when last this unit was run. Now they don't. Also, the Turbo spun-up normally even with the locally mounted scroll pump running (new vibration source) but tripped into EMERGENCY OPERATION upon BRAKING and when at ~1/3 NORMAL rpm. This behavior is seen on other Turbos (XBM for example) and might be an age related issue? I am leaving the Turbo energized overnight to ensure that the rotor is completely stopped. I will de-energize it tomorrow.
Friday, July 8th ~1345 hrs. local -> De-energized MTP controller
State of H1: locked at 25W, but no ISS 2nd loop
Activities:
Glitches:
Special Instructions:
I did turn the AC off when I left the enclosure this morning. It took a few tries but the facility control unit said they were off. Also when I went to go in this morning, all the HEPA fan speeds were set to 20% instead of the 100% (they are off when no one is inside).
After one lock loss FSS started oscillating (PZT signal H1:PSL-FSS_FAST_MON_OUTPUT was going rail to rail).
Bringing down the fast gain by 1dB (from 22 to 21) quenched it, and though it didn't start oscillation when the fast gain was brought back up 1dB, I just leave it at 21.
Update:
FSS oscillated multiple times after the above was written.
The temporary cure seemed to be to lower the FSS common gain and then bring it back up. No need to change FAST gain.
[Everyone]
We have determined (a few hours ago) that flipping the ESD bias sign on ETMX does not allow us to lock ALS DIFF. Jeff has looked through the settings, and everything is flipped to match the bias sign as appropriate, but we're still not able to lock. It looks like a crossover is unstable, or something like that. For now, we have reverted the ETMX ESD bias to its pre-Tuesday state, which has facilitated locking. Team SUS will look into this later.
The ISS is much better behaved now that the alignment work was done. However the FSS gain change was making the IMC loop very nearly unstable, and we lost lock during the power-up twice due to this instability. Sheila and Keita will post their measurements that discovered and fixed this issue.
We have so far been able to power up to 40W once, but it looks like a PI rang up pretty quickly. Carl will post details, but this is a new mode that hasn't been a problem previously, so it does not yet have damping settings.
At this time, it looks like there is no problem in getting to 40W, and other than PI damping we should be able to get all the way to low noise.
If Carl is unable to find damping settings relatively quickly, we may choose to instead only go to 20W for tonight, so that we can get to low noise and set the intent bit. Stay tuned for more updates...
Locklosses around 0230 and 0345 were associated with the 18040Hz instability first two images. In other locks this mode's amplitude did not reach these elevated levels. The lock ending at 0800 appears to have just scraped through the transient, in the last image the mode amplitude can be seen to grow, then saturate the sensing (I assume), before damping.