Yesterday at 13:40 and at 22:40 PDT the CW stopped exciting the EX PCAL because its testpoint was cleared from awgtpman. The excitation slot in the awg remained even though the excitation was not active, so the CW sender (psinject process on h1hwinj1) was unaware that its excitation was no longer operational and did not try to restart. When this happened the CDS overview screen did show a red excitation error for h1calex (meaning that an excitation should be present but is missing) and the operator medm overview should have shown a missing CW error as well.
The problem was found to be a "feature" in diag which allows the user to clear every test point on every model with one command. The feature was accidentally acitvated yesterday when the dcu-id and the testpoint were interchanged in the tp clear command.
Jim is working on removing this feature from the current diag code to prevent this accidentally happening again during ER9.
The reason why the IFO second loop didn't like the ISS 2nd loop is because the ISS 2nd loop offset is now hard coded (28076) while the power and diffraction both changed because of the diode swap and subsequent tune-up.
I remeasured it at 40W (H1:PSL-ISS_SECONDLOOP_REF_SIGNAL_ANA=-0.027689945 for 16-17% diffraction) but Vern told me that it was decided that we'll run IFO at 25W during ER9, so I measured it yet again at 25W (-0.027651985).
I'm puzzled that the number changed this much (previously it was -0.9826934814453125 at 40W).
It's also odd that there's not much difference between 40W and 25W. The diodes are definitely connected, the readback of analog sum (H1:PSL-ISS_SECONDLOOP_PD_14_SUM_OUT) is almost 100% coherent with the digital sum of individual PDs (H1:PSL-ISS_SECONDLOOP_SUM14_AC_OUT and H1:PSL-ISS_SECONDLOOP_SUM58_AC_OUTPUT).
Anyway IMC_LOCK guardian was modified for 25W (H1:PSL-ISS_SECONDLOOP_REF_SIGNAL_ANA, iss_diffracted_power_target=16.5).
Update: I was likely tricked by MEDM screen (graphics of switch states sometimes don't agree with reality) when I was doing the above measurement with only MC locked.
With full IFO the above offset was found totally off, and at 25W H1:PSL-ISS_SECONDLOOP_REF_SIGNAL_ANA=-0.5885 or so for 15%-ish diffraction. Didn't have time to remeasure at 40W.
Summary:
The 2nd loop engagement logic is bad as it wastes too much time waiting for a luck, but waiting for a luck doesn't do anything good.
Details:
2nd loop offset servo can take the 2nd loop board output or the diffracted power as the error signal.
At 40W or 25W, without engaging the 2nd loop, the output of the 2nd loop board always goes rail to rail even if the offset is correctly set just because the error signal is big.
Despite this, the offset adjustment servo is engaged anyway using the 2nd loop board output. The board bang-bangs forever, but eventually the guardian grabs a lucky moment when the board output happens to be small enough of a number, and thinks (incorrectly) that the offset servo converges. And then it engages the second loop. But this is as good as nothing IF you know that your static offset is reasonable.
Until a better criteria to engage the 2nd loop is found, I think the best strategy is to
I changed the guardian sans step 3. above:
It works.
2nd loop sudden death problem:
Jenne found that ISS 2nd loop is suddenly disengaged because the 2nd loop board output exceeds the OFF trigger threshold of 5 (first attachment), killing IFO.
The second board output goes close to 5 kind of often now, it seems. Since we don't have time to do a good investigation, for the moment I set the threshold to 10 (which sounds too large) and see how it goes.
In the last lock the IFO survived with ISS 2nd loop on for 10 minutes. The lock loss didn't seem to be due to ISS (2nd attachment).
We were showing Dan the h1fw0 problems (after turning on all frame writing again) and after less than an hour h1fw0 became stable again and has been running for 2+ hours. We will leave it in this state for now, but if it becomes very unstable again we have several options:
Turn off LDAS frame comparison jobs (reads every frame on both writers)
Remove all LDAS archiving from h1fw0 over to h1fw1
Reduce h1fw0 back to only writing science frames
Fingers crossed we don't have to do any of these.
Dan confirmed how the frame files written by CDS are currently accessed by LDAS for archival and checking, summarized below.
h1fw0 writes to the ldas-h1-frames file system (located on SATABOYs in the warehouse)
h1fw1 writes to the cds-h1-frames file system (located on SATABOYs in the MSR)
h1fw0/ldas-h1-frames
science frames are copied over to LDAS every 64 secconds
science frames are read every 64 seconds to compare them with h1fw1's frame
commissioning frames are read every 64 seconds to compare them with h1fw1's frame
second trend files are copied over to LDAS archive area every 10 minutes
second trend files are read every 10 minutes to compare them with h1fw1's frame
minute trend files are copied over to LDAS archive area every 10 minutes
minute trend files are read every 10 minutes to compare them with h1fw1's frame
science frames are read every 64 seconds to compare them with h1fw0's frame
commissioning frames are copied over to LDAS every 64 secconds
commissioning frames are read every 64 seconds to compare them with h1fw0's frame
second trend files are copied over to LDAS scratch area every 10 minutes
second trend files are read every 10 minutes to compare them with h1fw0's frame
minute trend files are copied over to LDAS scratch area every 10 minutes
minute trend files are read every 10 minutes to compare them with h1fw0's frame
Started and ran the purge-air skid, Turbo, QDP80 and newly added scroll pump. Tested the functionality of the Turbo's Safety Valve with the control cable connected to the scroll pump's relay box - demonstrating that the Safety Valve closes upon the loss of scroll pump motor AC and thus mimicking the "QDP80 Running" signal. Note: The purge-air skid has developed some problems from lack of use over these past few years, namely the radiator fan never came on while running the compressors, the drying tower never cycled and the low pressure alarm never sounded. These features worked fine when last this unit was run. Now they don't. Also, the Turbo spun-up normally even with the locally mounted scroll pump running (new vibration source) but tripped into EMERGENCY OPERATION upon BRAKING and when at ~1/3 NORMAL rpm. This behavior is seen on other Turbos (XBM for example) and might be an age related issue? I am leaving the Turbo energized overnight to ensure that the rotor is completely stopped. I will de-energize it tomorrow.
Friday, July 8th ~1345 hrs. local -> De-energized MTP controller
State of H1: locked at 25W, but no ISS 2nd loop
Activities:
Glitches:
Special Instructions:
I did turn the AC off when I left the enclosure this morning. It took a few tries but the
facility control unit said they were off.
Also when I went to go in this morning, all the HEPA fan speeds were set to 20% instead
of the 100% (they are off when no one is inside).
After one lock loss FSS started oscillating (PZT signal H1:PSL-FSS_FAST_MON_OUTPUT was going rail to rail).
Bringing down the fast gain by 1dB (from 22 to 21) quenched it, and though it didn't start oscillation when the fast gain was brought back up 1dB, I just leave it at 21.
Update:
FSS oscillated multiple times after the above was written.
The temporary cure seemed to be to lower the FSS common gain and then bring it back up. No need to change FAST gain.
Files ending with .001 are Lo Power; files ending with .002 are HI power (with the exception of the msc file .003)
DBB: all scans look as good as can be expected given the recent tribulations in PSL land. Higher order modes (TEM 01, 02 & 20) are a bit high in the HI power.
ISS_RPN: AOM diffracted power running higher than usaual these days. No surprise there that's it's much higher than reference. PDB is the Diode du jour and it's running above reference as well to the tune of 1 order of magnitude at 10Hz and 2 OOM at lower freq.
For better in-depth analysis, please refer to the experts; Peter K. and Jason O.
Due to the inability so far to damp the 18040Hz mode I have switch the PI damping to the DCPD HF path.
I changed up a few things such as:
If there is any problem with this version, please don't hesitate to go back one revision.
Attached are plots of the transfer functions measured this morning. The input modecleaner
was disabled whilst these measurements were obtained.
Note the frequency servo schematic is D040105
C20F22-1.tif: common gain = 20 dB, fast gain = 22 dB
C20F16-1.tif: common gain = 20 dB, fast gain = 16 dB
C20F22FastMon.jpg: transfer function measured at TP10
C20F22PCMon.jpg: transfer function measured at TP9
C20F22TP8.jpg: transfer function measured at TP8 (should be 10X greater than TP9 measurement)
common gain = 20 dB, fast gain = 22 dB
C16F22TP8.jpg: transfer function measured at TP8, common gain = 20 dB, fast gain = 22 dB
C16F22TP9.jpg: transfer function measured at TP9, common gain = 20 dB, fast gain = 22 dB
C20F16TP9.jpg: transfer function measured at TP9, common gain = 20 dB, fast gain = 16 dB
C20F16TP8.jpg: transfer function measured at TP8, common gain = 20 dB, fast gain = 16 dB
C20F16TP3Spectrum.jpg: noise measured at TP3 (the mixer monitor, or error point)
common gain = 20 dB, fast gain = 16 dB
C20F22TP3Spectrum.jpg: noise measured at TP3, common gain = 20 dB, fast gain = 22 dB
CrossoverSpectrum.jpg: has the two noise spectra overlapped for a visual comparison.
Peter was in the PSL enclosure and looking at the FSS - his alog will give details.
He's done so I'm starting to lock.
DRMI locked at 16:36UTC
POP X PZT railed, but POP signal is still centered on the QPD, so this is on the "to be fixed" list, but does not prevent locking. POP X is accessable from the ASC overview screen.
in DC READOUT TRANSITION, OMC guardian was stuck in DOWN, and was unstuck by running it through INIT.
I unintentionally killed the lock by requesting the INCREASE POWER state in ISC_LOCK.
I thought this would just stop the ISC_LOCK guardian in this state, keeping the input power at 38W.
Instead it reran the code for INCREASE POWER from the beginning and lowered the input power to 10W, killing the lock.
Broken:
So I thought the power was going to increase to 50W, based on what the LASER_POWER guardian main screen was showing, however TJ explained that the guardian can only request so many states, so the power was going to stop at 40W but the guardian could not show that because 40W was not a requestable state.
Fixed:
TJ rewrote the LASER_POWER guardian to make 40W a requestable state, so when we're going to 40W it will show this.
TJ also set the ISC_LOCK to stop at 25W input power on out next lock.
Currently, we have the IMC locked and at 40W input power.
Keita is looking at the ISS Second Loop and measuring the correct offset for 40W.
18:32UTC - adjustments made for running at 25W are complete - now relocking H1
DRMI locked at 18:43UTC
19:12UTC - H1 in DC readout and heading to ENGAGE ISS 2nd loop
19:58UTC -------------
H1 made it to ENGAGE ISS 2ND LOOP which is good, because getting to the point where we could engage the 2nd loop is progress, however engaging the loop revealed an issue which broke the lock.
Jenne and Keita are looking into it.
******** my estimate of having H1 in Observer was a bit optomistic - 4PM today (23:00UTC) is more likely.
21:04UTC - locked and going to ENGAGE_ISS2ND_LOOP without engaging the loop, to do measurements
lockloss
last entry for this log:
21:52UTC - H1 is back in ENGAGE ISS 2ND LOOP with the ISS 2nd loop disabled.
Keita and Jenne are looking for a fix, so H1 can go to Observe.
[Everyone]
We have determined (a few hours ago) that flipping the ESD bias sign on ETMX does not allow us to lock ALS DIFF. Jeff has looked through the settings, and everything is flipped to match the bias sign as appropriate, but we're still not able to lock. It looks like a crossover is unstable, or something like that. For now, we have reverted the ETMX ESD bias to its pre-Tuesday state, which has facilitated locking. Team SUS will look into this later.
The ISS is much better behaved now that the alignment work was done. However the FSS gain change was making the IMC loop very nearly unstable, and we lost lock during the power-up twice due to this instability. Sheila and Keita will post their measurements that discovered and fixed this issue.
We have so far been able to power up to 40W once, but it looks like a PI rang up pretty quickly. Carl will post details, but this is a new mode that hasn't been a problem previously, so it does not yet have damping settings.
At this time, it looks like there is no problem in getting to 40W, and other than PI damping we should be able to get all the way to low noise.
If Carl is unable to find damping settings relatively quickly, we may choose to instead only go to 20W for tonight, so that we can get to low noise and set the intent bit. Stay tuned for more updates...
Locklosses around 0230 and 0345 were associated with the 18040Hz instability first two images. In other locks this mode's amplitude did not reach these elevated levels. The lock ending at 0800 appears to have just scraped through the transient, in the last image the mode amplitude can be seen to grow, then saturate the sensing (I assume), before damping.
Actually the fix is very intrusive (will result in new binaries for the complete GDS suite of tools; diaggui, foton, awgtpman etc) so we will delay the change until next Tuesday.
In the mean time, within a 'diag -l' session, please do not type the following commands:
tp clear *
tp clear * <dcuid>
tp clear * *
tpclear *
tpclear * <dcuid>
tpclear * *
Take it from us, they will clear all test points on all models and stop the CW hardware injection.