T. Jarrett, IPAC
(971021)
Bright stars create havoc within coadd images and UNLESS handled aggressively they are the primary source of contaminents to the extended source database. They are the progenitors of "halos" (low surface brightness emission extending well beyond the PSF), diffraction spikes, horizontal stripes, glints and persistence residual ghosts -- a real horror show. For further sunny information on these deamons, see GALWORKS Bright Star Cleansing .
After months of 3-channel data reduction and analysis (not to mention a year or two of work with protocam data), I have come to the conclusion that using the R1 mags to generate 'blanking parameter values' (e.g., confusion radii, diffraction spike lengths, etc) via lookup tables or mathematical functions is NOT dependible (re: robust) under a wide variety of conditions -- stellar number density and sky background. The reasons are many (one of which is that the R1 mags are not reliable, particularly near saturation) of which I will not discuss here. Instead, I maintain that lookup table values suffice as "first" guesses to the ultimate final solution we desire. The masking parameters (confusion radii, diffraction spike lengths, horizontal stripe(s) and persistence ghost blanking) are found by measuring and scoring the corresponding features for each object (per band). We take a cpu hit since we require a fair number of complicated computations, but we gain in the end with uniform performance regardless of the quality of the R1 mags or other input (first guess) parametric values. This memo describes the algorithm and performance on a number of fields, including high glat fields, low glat (3 to 7 degrees above the plane) and a field directly in the plane (our old friend, MSX, taken way back in April).
Algorithm
It would be fair say that the set of algorithms can be described as the "bright star processor' given the relative complexity compared to lookup tables or characteristic functions. In brief:
Step 1: derive sky noise (per band)
Step 2: Derive Confusion Radii
determine confusion radii:
Step 3: Horizontal Stripe Terminator
'predict' stripe SNRs via fit to lookup tables;
modify first-guess stripe SNRs by the "confusion noise" factor;
using modified first-guess SNRs, decide whether further computation is
required (default threshold: 1-sigma).
determine SNR of primary and secondary stripes:
To measure the relative (or local) 'background' value, a similar computation is made on an adjacent stripe region. The SNR is then computed by normalizing the median value (minus the background value) by the confusion noise. The SNR is then compared to some threshold. The threshold is 0.4 dn, modified by the confusion noise factor as follows:
For the secondary stripes, the procedure is the same as that for the primary stripe, except that we use both secondary stripes in the median value determination (since they have roughly the same surface brightness we improve our statistics by root 2).
Step 4: Spike Hammer
'predict' the spike lengths via fit to lookup tables;
modify first-guess lengths by the "confusion noise" factor;
using modified first-guess values, derive boundary limits to be used
in the fits described below.
determine diffractin spike lengths:
The threshold is 0.75 dn, modified by the confusion noise factor as follows:
By actually measuring the halo, stripe and spike features of bright stars, we can reliably judge if the regions comprising the features require masking and thus avoid the often misleading information that look-up tables and characteristic functions give (the axium, GIGO, applies to this process).
Persistence ghosts have not been addressed thus far in this discussion. Given that these objects are (or should be) well characterized early in the pipeline (well before galworks), it is possible to eliminate them by simply looking for the appropriate flag generated by (?MAPCOR?). If a detection is a highly probable persistence ghost, then GALWORKS will mask an appropriate region comprising the ghost. This operation will occur before bright star processing.
Look-Up Tables
See the following link for a discussion of generating the look-up tables. Bright Star Parameter Tuning .
Additional fields have been added to the databank since the previous memo was whipped up, including MSX -- the galactic plane. Soon, I will add the scan containing Maffei 1. The latest greatest plot of the confusion radii for bright stars is given below, as well as he spike lengths plot and the horizontal stripe plots.
The confusion radii, spike lengths and horizontal stripe ratings are summarized in the following plots. The horizontal stripe analysis is summarized by a "rating". By this we mean the number of stars (per mag bin) in which a horizontal stripe is observed, normalized by the total number of sources. Thus, for the brightest stars, these stripes are always observed, so they have a appearance rating of 1.0. For the faintest stars, these stripes begin to dissappear and the rating approaches zero. We also compute the SNR of the stripe itself -- this is the parameter used in the algorithm described above.
Note the large scatter in the plots. This is due to the uncertainty in the R1 mag (due to saturation, clipped images and other effects) and the variable confusion noise (and changing background level). It is this scatter that renders look-up tables unreliable as the only source for masking information.
Performance
Several bright stars were examined in detail with the new automated bright star masking algorithm. The stars are scattered widely across the sky, including the high glat fields of of Hercules, Coma, Hercules supercluster and a couple other anonymous fields, and low glat fields including scans located between 3 and 7 degrees glat, and finally one MSX scan located around 1 degree glat. Thus, we have a wide range in confusion noise and sky background variation -- the two most troublesome components to robust bright star masking. We will focus on a couple test cases from a high glat field to the low glat fields, and include a gaggle of additional gif images showing other bright stars.
Note that some of the R1 mags appear to be far from
realistic -- this demonstrates one of the problems with the
photometry.
High GLAT -- confusion noise minimal
This star is clearly saturated and thus the R1 mags are probably not very reliable. Given that, the predicted confusion radii are 79.1, 83.2, 55.6, for JHK respectively -- much MUCH to small. The surface brightness of this very bright star does not fall below the 1-sigma threshold for 160, 150 and 103 radii, respectively. The JHK images showing the confusion radii (in blue, see above gif) clearly are a much better boundary for masking.
The predicted stripe SNR for the primary/secondary horizontal stripes are well above the threshold limit. The measured SNR are in accordance with the prediction.
The predicted spike lengths are 115.3, 124.3, 89.5, respectively.
Again, these are much too small due to poor R1 mags (saturation). The actual
spike lengths measured are 159.5, 159.6 and 177.8 pixels, respectively.
The source density is very high in this field. The confusion noise quantifies what the eye sees: 1.0, 1.8 and 1.5 dn, JHK respectively. Compare this with the nominal high glat noise values of 0.6, 1.1 and 1.2, respectively. The confusion noise factor is then 0.59, 0.60 and 0.87, respectively. This factor will be used to modify the thresholds for spike length convergence and stripe score.
Predicted confusion radii: 126.7, 32.4, 31.4, respectively. The J value is clearly out of whack -- due to a poor R1 mag. The actual confusion radii measured turns out to be 67.5, 54.6, 41.6, respectively.
Only the J and K primary stripes have measured SNR greater than the modified thresholds (H just barely fails the thresh). For the secondary stripes, only J has a large enough SNR for blanking.
The predicted spike lengths are 173, 48 and 51, respectively. The J value is clearly bogus. The actual spike lengths are 82, 65 and 62, respectively.
The source density is extremely high in this field. The confusion noise for this MSX coadd is 1.9, 4.0 and 3.1 dn, JHK respectively. Compare this with the nominal high glat noise values of 0.6, 1.1 and 1.2, respectively. The confusion noise factor is then 0.31, 0.28 and 0.40, respectively. This factor will be used to modify the thresholds for spike length convergence and stripe score.
Predicted confusion radii: 50, 41 & 27, respectively. The actual confusion radii measured turns out to be 76, 39, 31, respectively (close to the predicted values).
Both the primary and secondary stripes do not have a measured SNR greater than the modified threshold.
The predicted spike lengths are 130, 117 and 68, respectively. The actual spike lengths are 76, 49 and 66, respectively.