[← Version history] [↑ SPD on the web] [Directional statistics →]
Statistic: i and nmax
The index i is used to denote the ith temperature step of the paleointensity experiment. i is used to index Arai plot data (e.g., xi, or yi) and ranges from i=1 to nmax, where nmax is the total number of steps on the Arai plot.
Statistic: start and end
start and end denote the i indices of the selected steps used for analyzing the paleointensity results. i=start denotes the first selected data point and i=end denotes the last.
Statistic: Tmin and Tmax
The minimum and maximum temperatures used for the best-fit linear segment on the Arai plot, where Tmin≡Ti=start and Tmax≡Ti=end.
Statistic: n
The number of points on an Arai diagram used to estimate the best-fit linear segment and the paleointensity (n=end−start+1).
Statistic: b
Report to 3 d.p.
The slope of the best-fit line of the selected TRM and NRM points on the Arai plot. Determination of the slope uses the standardized major axis form of least squares linear fitting (York, 1966; Coe et al., 1978). b=sign{end∑i=start(xi−ˉx)(yi−ˉy)}(end∑i=start(yi−ˉy)2end∑i=start(xi−ˉx)2)12, where ˉx and ˉy are the mean TRM and NRM values of the data selected for the best-fit, that is, ˉx=end∑i=startxin, and ˉy=end∑i=startyin.
Statistic: σb
Report to 3 d.p.
The standard error on the slope is given by: σb=(2end∑i=start(yi−ˉy)2−2bend∑i=start(xi−ˉx)(yi−ˉy)(n−2)end∑i=start(xi−ˉx)2)12
Useful Note...
It should be noted that the standard line-fitting routines available in most analysis software (e.g., Excel) do not use the standardized major axis fitting routine,
but instead use linear regression (sometimes known as ordinary least-squares), whereby only the y-axis residuals are minimized. Given that accurate estimation of the
slope is the objective of paleointensity analysis, standardized major axis, as outlined above, is the most appropriate method
(e.g., Warton et al., 2006).
Statistic: BAnc and σB, the paleointensity estimate and its error
Report to 1 d.p.
A paleointensity estimate is obtained from BAnc=|b|×BLab, where BLab is the strength of the laboratory field. The associated standard error of the estimate is given by σB=σb×BLab.
Statistic: YInt.
The y-axis (NRM) intercept of the best-fit line on the Arai plot. YInt.=ˉy−bˉx
Statistic: XInt.
The x-axis (TRM) intercept of the best-fit line on the Arai plot. XInt.=−YInt.b
Statistic: Vector difference sum, VDS
The vector difference sum of the entire NRM vector (NRM). VDS=|NRMnmax|+nmax−1∑i=1|NRMi+1−NRMi|,
where |NRMi| denotes the length of the NRM vector at the ith step.
Statistic: x′ and y′
x′ and y′ the x and y points on the Arai plot projected on to the best-fit line. These are used to calculate the NRM fraction and the length of the best-fit line among other parameters. There are multiple ways of calculating x′ and y′, below is one example. x′i=12(xi+yi−YInt.b) y′i=12(yi+bx+YInt)
Statistic: Δx′ and Δy′
Δx′ and Δy′ are TRM and NRM lengths of the best-fit line on the Arai plot, respectively (Figure 1). Δx′=|[max{x′i}−min{x′i}]i=start,…,end| Δy′=|[max{y′i}−min{y′i}]i=start,…,end|
Statistic: f
Report to 3 d.p.
NRM fraction used for the best-fit on an Arai diagram (Coe et al., 1978). f=Δy′|YInt.|
Statistic: fVDS
Report to 3 d.p.
NRM fraction used for the best-fit on an Arai diagram calculated as a vector difference sum (Tauxe and Staudigel, 2004). fVDS=Δy′VDS
Statistic: FRAC
Report to 3 d.p.
NRM fraction used for the best-fit on an Arai diagram determined entirely by vector difference sum calculation (Shaar and Tauxe, 2013). FRAC=end−1∑i=start|NRMi+1−NRMi|VDS
Statistic: β
Report to 3 d.p.
β is a measure of the relative data scatter around the best-fit line and is the ratio of the standard error of the slope to the absolute value of the slope (Coe et al., 1978) β=σb|b|
Statistic: g
Report to 3 d.p.
The gap factor (g) is a measure of the average NRM lost between successive temperature steps of the segment chosen for the best-fit line on the Arai plot. The gap reflects the average spacing of the selected Arai plot points along the best-fit line. g=1−end−1∑i=start(y′i+1−y′i)2Δy′2.
The upper limit of g is dependent on n and occurs when the points on the Arai plot are evenly spaced. glim=n−2n−1.
Statistic: GAP-MAX
Report to 3 d.p.
The gap factor defined above is measure of the average Arai plot point spacing and may not represent extremes of spacing. To account for this Shaar and Tauxe (2013)) proposed GAP-MAX, which is the maximum gap between two points determined by vector arithmetic. GAP-MAX=max{|NRMi+1−NRMi|}i=start,…,end−1end−1∑i=start|NRMi+1−NRMi|
Statistic: q
Report to 1 d.p.
The quality factor (q) is a measure of the overall quality of the paleointensity estimate and combines the relative scatter of the best-fit line, the NRM fraction and the gap factor (Coe et al., 1978). q=|b|fgσb=fgβ
Statistic: w
Report to 1 d.p.
Weighting factor of Prévot et al. (1985). w=fgs, where s2 is given by: s2=2+2end∑i=start(xi−ˉx)(yi−ˉy)(end∑i=start(xi−ˉx)12end∑i=start(yi−ˉy)2)2. It can be noted, however, that w can be more readily calculated as: w=q√n−2.
Statistic: |→k|
Report to 3 d.p.
The curvature of the Arai plot as determined by the best-fit circle to all of the data (Paterson, 2011). To determine the Arai curvature, a best-fit circle of the form (x−a)2+(y−b)2=r2 is fitted to all of the data using a least-squares approach. For the fitting process, each axis is normalized by the maximum value of the data on that axis such that 0 ≤ TRM ≤ 1, and 0 ≤ NRM ≤ 1, which ensures a consistent comparison between data measured with different BLab. |→k| is defined as the reciprocal of the radius (r) of the best-fit circle: |→k|=1r.
Curvature can be given a sense of direction by considering the position of the circle center (a,b) with respect to the centroid of all of the data (Cx,Cy). →k={1r if (Cx<a) and (Cy<b)−1r if (a<Cx) and (b<Cy)0 if (a=Cx) and (b=Cy).
Numerical Tip...
Standard non-linear line fitting routines can be used for the calculation of the best-fit circle to the Arai plot data, however, the convergence of these algorithms can
be poor and they are often numerically inaccurate when the data form a small arc of a much larger circle. This latter point is particularly important for near linear Arai
plots as the data represent an increasingly smaller arc of the circle as the linearity increases. Chernov and Lesort (2005)
developed an algorithm for fitting circles to data that is less affected by both of these issues and should be the preferred method of circle fitting
(Paterson, 2011). Code for this algorithm in C++ and MATLAB are
available from the downloads page.
Statistic: |→k′|
Report to 3 d.p.
The curvature of the Arai plot as determined by the best-fit circle to the selected best-fit Arai plot segment (Paterson, 2011). |→k′| is calculated in the same fashion as |→k|, but the data are normalized by the respective maximums of the segment NRM and TRM.
Statistic: SSE
Report to 3 d.p.The quality of the best-fit circle used to determine |→k| (Paterson, 2011). SSE=nmax∑i=1(√(xi−a)2+(yi−b)2−r)2 Where xi and yi denote the normalized TRM and NRM, respectively.
Statistic: SCAT
SCAT is a parameter proposed by Shaar and Tauxe (2013) in an effort to reduce the number of parameters used to quantify a paleointensity estimate. SCAT is a Boolean operator, which uses the error on the best-fit Arai plot slope to indicate whether the data over the selected range are too scattered or not. This parameter provides a test for the scatter of the points on the Arai plot, pTRM checks, and pTRM tail checks. A schematic illustration of $SCAT$ and some examples are shown in Figure 2.
First, from the chosen the best-fit segment on the Arai plot, the slope (b), the standard error of the slope (σb), and
β(=σb|b|) are obtained. For a given sample, the threshold value for β that is used to select data (βthreshold)
is used to determine an equivalent threshold for σb (σthreshold=|b|βthreshold).
b and σthreshold are then used to determine two lines that pass through the center of mass of the selected Arai plot segment (ˉx and ˉy),
one with a slope of b+2σthreshold, the other with a slope of b−2σthreshold Figure 2a). The so called SCAT box,
is the box that is defined by the four intercepts that the above two lines make with the x- and y-axes (Figure 2a).
If all the data points associated with the chosen Arai plot segment, which includes both pTRM and tail checks, fall within the SCAT box then SCAT is TRUE.
If one or more points fall outside the SCAT box then SCAT is FALSE. Samples are accepted only if SCAT is TRUE. pTRM checks and pTRM tail checks
are included in the calculation of SCAT if the temperature of the check falls within temperature range of the selected Arai plot segment and the peak temperature
before the check was performed is less than or equal to maximum temperature of the selected Arai plot segment. For example, if the temperature range of the selected Arai plot
segment was 100-500°C, a pTRM check to 200°C performed after the 400°C step would be included in the calculation of SCAT. However, a pTRM check to
200°C performed after the 540°C would be not included in the calculation of SCAT. Examples of samples pass and fail SCAT are shown in
Figure 2b and c, respectively.
Statistic: R2corr
Report to 3 d.p.The correlation coefficient to estimate the strength of the linear relationship between the NRM and TRM over the best-fit Arai plot segment (the square of the Pearson correlation). R2corr=(end∑i=start(xi−ˉx)(yi−ˉy))2end∑i=start(xi−ˉx)2end∑i=start(yi−ˉy)2
Statistic: R2det
Report to 3 d.p.Coefficient of determination to estimate variance accounted for by the linear model fit. R2det=1−end∑i=start(yi−y′i)2end∑i=start(yi−ˉy)2
Useful Note...
It should be noted that this is similar to, but strictly not the same as the square of the Pearson correlation coefficient (R2corr). For least squares fitting that
minimizes the y residuals only, R2corr is the same as the coefficient of determination for the model fit. Since Arai plot analysis uses the standardized major axis
least-squares variant, this is not the case. For most practical purposes, however, the difference is small, particularly when the chosen Arai plot segment is highly linear
with low noise.
Statistic: Z and Z∗
Report to 1 d.p.Z is an Arai plot zigzag parameter defined by Yu and Tauxe (2005). Z=end∑i=startxi|˜bi−|b|||XInt.|, where ˜bi is the instantaneous slope on the Arai plot determined from the ratio of the NRM lost to the TRM gained at the ith step: ˜bi=NRMtotal−NRMiTRMi=YInt.−yixi. Since no TRM is gained during the first step ˜b1=0. ˜bi−|b| is a measure of the scatter around the best-fit slope on the Arai plot.
Yu (2012) proposed a modified version, Z∗. Z∗=1n−1end∑i=start100×xi|˜bi−|b|||YInt.|.
Statistic: IZZI_MD
Report to 3 d.p.IZZI_MD is a parameter to quantify the zigzagging on an Arai plot (Shaar et al., 2011), which is most pronounced for multidomain
grains measured with the IZZI protocol (e.g., Yu et al., 2004).
IZZI_MD is a measure of the area mapped out on the Arai plot and is determined using all the points on an Arai plot with the exception of the first step, where no
TRM is imparted. The calculation is performed after the points have been normalized by the initial NRM, such that
x(n)i=xiy1 and y(n)i=yiy1.
If we consider the three consecutive Arai plot points illustrated in Figure 3a. The lengths of each side of the triangle formed by these points are given by: L1=√(x(n)i−x(n)i+1)2+(y(n)i−y(n)i+1))2; L2=√(x(n)i+1−x(n)i+2)2+(y(n)i+1−y(n)i+2)2; and L3=√(x(n)i+2−x(n)i)2+(y(n)i+2−y(n)i)2.
Following the cosine rule, the angle ϕ is ϕ=arccos(L22+L23−L212L2L3). The height of the triangle can be expressed as H=L3sin(ϕ), and hence the area of the triangle is given by Ai=L2L3sin(ϕ)2.
Each Ai is given a sign (±) based on whether or not the ZI steps lie above the IZ steps, or vice versa. For ZI above IZ, Ai is positive, for IZ above ZI,
Ai is negative. For the example in Figure 3, ZI is above IZ and the area is given a positive sign.
To determine the sign, we must determine the relative position of the mid-point of the three consecutive points. First, we calculate the best-fit line through the first and
last points and obtain the intercept of the line (a1; Figure 3b). Using the slope of this best-fit line we determine the intercept
of the line (a2) when the line passes through the mid-point. If a1 is less than a2 the mid-point lies above the end points, but if a2 is less than a1 the
mid-point lies below the end points. In the case where a1=a2 all three points lie on a perfect straight line and both the area and the sign are identically zero. The
pseudo-code for this is as follows, where Si denotes the sign of the ith area.
for i=2→(nmax−2) do
if a1=a2 then
Si=0
else
if Mid-point is a ZI then
if a1<a2 then
Si=1
←{Figure 3b falls here}
else
Si=−1
end if
else if Mid-point is a IZ then
if a1<a2 then
Si=−1
else
Si=1
end if
end if
end if
end for
IZZI_MD is the sum of the signed areas, normalized by length of the line connecting all of the ZI steps.
IZZI_MD=nmax−2∑i=2SiAiLZI.
where LZI is given by:
LZI=∑i∈ ZI points√(x(n)i+2−x(n)i)2+(y(n)i+2−y(n)i)2 .
N.B. The i+2 increment assumes alternating ZI and IZ step, whereby if i is a ZI step, i+1 is an IZ step, and i+2 is a ZI step.
Numerical Tip...
Calculating the area bounded by a series of points is a common geometric problem. As a consequence most programming languages have functions to perform the
calculation either as inbuilt features or as freely available routines. For example, MATLAB has the inbuilt function
A=polyarea(p) to return the area, A , bounded by the points given by the two-dimensional
matrix p :
p=(xiyixi+1yi+1xi+2yi+2)
↑ TOP