IOP PUBLISHING MEASUREMENT SCIENCE AND TECHNOLOGY Meas. Sci. Technol. 19 (2008) 055104 (15pp) doi:10.1088/0957-0233/19/5/055104 An advanced approach to finding magnetometer zero levels in the interplanetary magnetic field
H K Leinweber1, C T Russell1,KTorkar2, T L Zhang2 and V Angelopoulos1
1 Institute of Geophysics and Planetary Physics, University of California, 405 Hilgard Ave., Los Angeles, CA 90095-1567, USA 2 Space Research Institute, Austrian Academy of Sciences, Schmiedlstrasse 6, 8042 Graz, Austria E-mail: [email protected]
Received 10 November 2007, in final form 12 March 2008 Published 21 April 2008 Online at stacks.iop.org/MST/19/055104
Abstract For a magnetometer that measures weak interplanetary fields, the in-flight determination of zero levels is a crucial step of the overall calibration procedure. This task is more difficult when a time-varying magnetic field of the spacecraft interferes with the surrounding natural magnetic field or when the spacecraft spends only short periods of time in the interplanetary magnetic field. Thus it is important to examine the algorithms by which these zero levels are determined, and optimize them. We find that the method presented by Davis and Smith (1968 EOS Trans. AGU 49 257) has significant mathematical advantages over that published by Belcher (1973 J. Geophys. Res. 71 5509) as well as over the correlation technique published by Hedgecock (1975 Space Sci. Instrum. 1 83–90). We present an alternative derivation of the Davis–Smith method which illustrates that it is also a correlation technique. It also works with first differences as well as filtered data as input. In contrast to the postulate by Hedgecock (1975 Space Sci. Instrum. 1 83–90), we find that using first differences in general provides no advantage in determining the zero levels. Our new algorithm obtains zero levels by searching for pure rotations of the interplanetary magnetic field, with a set of sophisticated selection criteria. With our algorithm, we require shorter periods (of the order of a few hours, depending on solar wind conditions) of interplanetary data for accurate zero level determination than previously published algorithms.
Keywords: magnetometer calibration, interplanetary magnetic field, solar wind, Alfven waves
1. Introduction levels of the component axes that lie in the spin plane can be estimated by averaging in the spinning spacecraft frame Post-launch spacecraft magnetometer zero levels can differ over many spin periods when gains and angles are known from their pre-launch values for many reasons. Some of to high enough precision and the magnetic field is constant the most common issues are temperature changes of the during the averaging period. However, spacecraft-generated sensor and the electronics, varying magnetic fields of the ac magnetic fields at the spin frequency in the spinning frame spacecraft due to electric currents or magnetic permeability, will cause erroneous zero levels in the despun spacecraft aging of electronic parts, exposure to strong radiation and other frame that cannot be found by averaging. For a discussion causes. See Acuna˜ (2002) for a general review on space-based of the calibration of magnetometers on a spinning spacecraft magnetometers. A historical review on measurements of the such as determining relative gains and orientations, see Kepko interplanetary magnetic field can be found in Ness and Burlaga et al (1996). Farell et al (1995) is another reference on the (2001). Also see Snare (1998) for a historical review on vector calibration of magnetometers on a spinning spacecraft, but it magnetometry in space. For a spinning spacecraft, the zero omits relative gains. A commonly used term for the zero level
0957-0233/08/055104+15$30.00 1 © 2008 IOP Publishing Ltd Printed in the UK Meas. Sci. Technol. 19 (2008) 055104 H K Leinweber et al is offset. We will use both terms interchangeably throughout it is particularly useful when the spacecraft spends most of the this paper. time inside a magnetosphere and only occasionally enters the In the solar wind, a simple method that can be used to interplanetary magnetic field. The magnetic field is usually determine constant or extremely slowly varying zero levels observed close to the bow shock and in many cases highly is averaging of the magnetic field values over a few solar disturbed by non-Alfvenic´ upstream waves, or the spacecraft rotations. The averages of all three component axes should be might observe the interplanetary field only because of very zero since the divergence of B is zero, and the configuration unusual solar wind conditions that have higher than average of the solar magnetic field should not be correlated with dynamic pressure values. Our revised method can also be the spacecraft location. However, we note that there are used for missions that require the adjustment of zero levels field configurations that can be symmetric about the Sun’s at more frequent intervals than a few days as required for 0 0 0 rotation axis such as those associated with the g1,g2,g3,... the published selection criteria of Belcher and of Hedgecock. coefficients of the Legendre polynomial expansion (see, for Such missions may not have had an appropriate magnetic example, Altschuler et al (1977) and references therein); so cleanliness program and therefore the magnetic field at the this technique is not foolproof. Furthermore, this technique location of the magnetometer is heavily disturbed by the requires that the spacecraft stays in the interplanetary magnetic spacecraft’s own time-varying magnetic field, or they may field continually and returns an average zero level at most experience diurnal variations due to a changing spacecraft or only monthly. It is desirable to determine offsets much more sensor fields as the solar illumination or spacecraft temperature often and to not rely on the Sun having a favorable magnetic varies with orientation or the orbital position. configuration. Fluctuations of the interplanetary field are For convenience of the reader, below we review all primarily changes in the direction rather than in the magnitude three methods. This review gives us further insight into so that the field magnitude is more constant than any of its three the differences between the three techniques and serves to component axes (Ness et al 1964). See also power spectra illuminate the improvements we have made. We first discuss and histograms in Coleman (1966). Fortunately, the Alfvenic´ the Belcher and the Hedgecock methods. nature (Belcher et al 1969) of the solar wind fluctuations allows us to determine offsets comparatively rapidly, much faster than 1.1. Belcher’s method once per month. This method is based on the assumption that the fluctuations of There are three documented methods for finding zero the solar wind are predominantly transverse to the background levels in the interplanetary magnetic field, based on the above- field (1). It optimizes the zero levels so that the maximum described property of the interplanetary magnetic field to variance vector (Sonnerup and Cahill 1967) is orthogonal to determine slowly changing zero levels. The first method the background field (2). Angular brackets denote averages: developed is called the Davis–Smith method (Davis and Smith 1968). This method optimizes the zero levels so that the = · BA ≈ δB δB 0, (1) variance of the squared magnitude of the magnetic field is | BA | minimized. Rosenberg (1971) provides a detailed derivation where δB is the unit vector in the maximum variance direction, of this technique. The Davis–Smith equation (22) was not δB is the part of δB in the direction of the background field published until Belcher (1973) developed his own method and compared it with the Davis–Smith method. Below we give an and BA is the actual background field vector: alternative derivation of the Davis–Smith method and discuss δB ≡ 0. (2) the variant of Belcher (1973), which gives greater insight into Assuming that the measured field B consists of the actual the methodology. The Belcher method optimizes the zero M field B plus a constant offset vector O ,weobtain levels so that the maximum variance vector is orthogonal to A the background field. The third method (Hedgecock 1975)is δB · BM =δB · O + δB · BA , (3) based on the assumption that averaged over a suitable time writing the sum over n data windows (angular brackets denote interval, there should be no correlation between changes in the averages over single data windows) field magnitude and changes in the inclination of the field to n n any one of the three component axes. = i · i 2 = i · i − 2 D δB BA δB BM O , (4) All three methods must be applied with some caution i=1 i=1 because the assumptions, on which each particular method and using dD = 0, we obtain Belcher’s matrix equation is based, might not always be true. Their results stand and dO fall with the appropriate selection criteria for data intervals. n n i · = i i The Davis–Smith method has no published selection criteria. T O T BM , (5) Davis and Smith selected data intervals by eye (Belcher 1973). i=1 i=1 Belcher and Hedgecock published rather loose selection where i is the index of the data window, T i is the outer product criteria that require long intervals (a few days to weeks) of of δB i (T is a matrix) and n is the number of data windows. input data. The matrix on the left-hand side of (5) is singular for We have modified the Davis–Smith method to determine n = 1, but (5) can still be used by selecting several data magnetometer zero levels on much shorter time scales than windows and solving the system of equations by using a least previously possible. While this can be useful for all missions, squares approach. Belcher’s method has two selection criteria
2 Meas. Sci. Technol. 19 (2008) 055104 H K Leinweber et al
where BtM is the difference between two consecutive Z (Ok )j measured field values at the kth iteration, θ(Ok )j is the difference between two consecutive inclination angles at the O Y Bt M kth iteration, C(Ok ) is the covariance for the kth value of O and N is the length of the data window. The method requires that C is calculated for a range θ (Ok ) of different offsets (e.g. using 0.1 nT steps for the offsets). B t A The offset at the zero crossing of the covariances is the desired offset. Hedgecock’s selection criteria are such that pairs of X measurements are rejected if θ(Ok ) lies outside of the range 30 <θ < 150 or B > 1 nT. For more details, see (Ok ) tM(O ) Figure 1. Relationship between the actual field magnitude BtA ,the k measured field magnitude BtM and the offset O. Hedgecock (1975). that need to be applied to each data window. The method 1.3. Davis–Smith method requires the precise knowledge of the maximum variance The original derivation of the Davis–Smith method was never direction, and therefore one of his selection criteria is λ3/λ1 published in a refereed journal. It is based on minimization of 0.1, where λ1, λ2 and λ3 are the eigenvalues (in descending the variance of the squared magnitude of the magnetic field of magnitude) of the same covariance matrix that was used to several (n) data windows. The initially observed uncorrected find the maximum variance direction (Sonnerup and Cahill variance is 1967). His other selection criterion requires the sum of the V = B · B − B · B 2 . (10a) eigenvalues to be several times above the noise level. For i Mi Mi Mi Mi more detailed information, see Belcher (1973). Expanding, we obtain = · 2 − · · Vi BMi BMi 2 BMi BMi BMi BMi 1.2. Hedgecock method · 2 + BMi BMi , (10b) This method is based on the assumption that averaged over a which reduces to suitable time interval, there should be no correlation between V = B · B 2 − B · B 2. (10c) changes in the field magnitude and changes in the inclination i Mi Mi Mi Mi of the field to any one of the three coordinate axes. The word If we correct the magnetic field measurement by subtracting ‘changes’ refers to first differences (differences of consecutive an offset vector O , the variance of the correction becomes i measurements, a simple approximation to the time derivative). c 2 V = BM − Oi · BM − Oi Figure 1 shows the relationship between the actual field i i i − − · − 2 magnitude BtA , the measured field magnitude BtM and the BMi Oi BMi Oi . (11a) offset O. Using the law of cosines, we write Expanding, we obtain 2 = 2 2 − c = 4 − 2 · − 2 · Bt Bt + O 2BtM O cos(θ), (6) V B 4B Oi BM 4O Oi BM A M i Mi Mi i i i 4 2 2 where θ is the elevation angle. +4Oi BM BM Oi + O +2B O i i i Mi i Taking BtA and O as constants, differentiating and 2 2 2 2 − B +4 Oi · BM B +4 O Oi · BM collecting terms, we obtain Mi i Mi i i − 4 O · B O · B − O2 2 − 2O2 B2 , (11b) O sin θ dθ i Mi i Mi i i Mi dBt = . (7) M 1 − O cos θ which reduces to Bt M c 4 2 2 V = B − B +4Oi BM · BM Oi Assuming that the term O cos θ is small (the offset is small i Mi M i i i BtM 2 − 4 Oi · BM Oi · BM − 4 B Oi · BM or the measured field is approximately normal to the Z-axis), i i Mi i we write +4 B2 O · B , (11c) Mi i Mi
= where Vi is the variance of the squared field magnitude of the BtM O sin θθ. (8) c ◦ data window i, V is the variance after offset adjustment, B Since sin θ is positive for 0 <θ<180 , the sign of B i Mi tM are the measured magnetic field vectors of the data window i relative to θ depends on the sign of O. The covariance C (O) and O is the offset vector of the data window i. between B and θ changes sign as O crosses through zero i tM Let C =−C − C = ( (O) ( O) and (0) 0). Writing the equation 2 N Q = B4 − B2 scalar (12) i Mi Mi j=1 BtM θ(Ok )j = (Ok )j C(Ok ) = 2 − 2 N − 1 Wi B BMi B BMi vector (13) Mi Mi N N = BtM = θ(Ok )j = − j 1 (O ) j 1 Di BM BM BM BM dyadic (covariance matrix). − k j , (9) i i i i N(N − 1) (14)
3 Meas. Sci. Technol. 19 (2008) 055104 H K Leinweber et al We note that these values are all constants depending only on From (18a) it is easy to see that since O2, O2 and O2 do not 1 2 3 the measured values BM . change in time, they do not change the covariance and for that Then, reason can be omitted from (21). c = − After bringing all offsets to one side, one can write the Vi Qi +4Oi (Di )Oi 4Oi W i . (15) c equation as Minimization of Vi by differentiating with respect to Oi gives ⎛ ⎞ B2 − B 2 B B − B B B B − B B ⎛ ⎞ W ⎜ 1 1 1 2 1 2 1 3 1 3 ⎟ O = i ⎜ ⎟ 1 Oi (Di ) , (16) ⎜ B B − B B B2 − B 2 B B − B B ⎟ ⎝O ⎠ 2 ⎝ 2 1 2 1 2 2 2 3 2 3 ⎠ 2 − − 2 − 2 O3 which is the Davis–Smith equation. For a mathematically less B3B1 B3 B1 B3B2 B3 B2 B3 B3 abstract version of this equation, see (22). A combination of ⎛ ⎞ B |B |2 − B |B |2 1 1 M 1 M all data windows leads to = ⎝ | |2 − | |2 ⎠ n n B2 BM B2 BM . (22) 1 2 | |2 − | |2 D ·O = W , (17) B3 BM B3 BM i 2 i i=1 i=1 Equation (22) is the same as (16). This demonstrates that where n is the number of data windows and O is the offset for the Davis–Smith method is also a correlation technique. The all combined data windows. matrix on the left-hand side of (22) is the covariance matrix that is independent of the offset. 2. Mathematical insights into the Davis–Smith equation 2.2. Davis–Smith equation for first differences 2.1. Derivation of the Davis–Smith equation using It is similarly instructive to derive the Davis–Smith method for correlations first differences. Equation (23) is the time derivative of (20):