Processing math: 100%

    Home          Downloads          SPD on the web          Links

[← Version history]     [↑ SPD on the web]      [Directional statistics →]

3 Arai plot statistics

3.1 A note on data indexing

Statistic: i and nmax

The index i is used to denote the ith temperature step of the paleointensity experiment. i is used to index Arai plot data (e.g., xi, or yi) and ranges from i=1 to nmax, where nmax is the total number of steps on the Arai plot.

 

Statistic: start and end

start and end denote the i indices of the selected steps used for analyzing the paleointensity results. i=start denotes the first selected data point and i=end denotes the last.

 

Statistic: Tmin and Tmax

The minimum and maximum temperatures used for the best-fit linear segment on the Arai plot, where TminTi=start and TmaxTi=end.

 

Statistic: n

The number of points on an Arai diagram used to estimate the best-fit linear segment and the paleointensity (n=endstart+1).

 

3.2 The paleointensity estimate

Statistic: b
Report to 3 d.p.

The slope of the best-fit line of the selected TRM and NRM points on the Arai plot. Determination of the slope uses the standardized major axis form of least squares linear fitting (York, 1966; Coe et al., 1978). b=sign{endi=start(xiˉx)(yiˉy)}(endi=start(yiˉy)2endi=start(xiˉx)2)12, where ˉx and ˉy are the mean TRM and NRM values of the data selected for the best-fit, that is, ˉx=endi=startxin, and ˉy=endi=startyin.

 

Statistic: σb
Report to 3 d.p.

The standard error on the slope is given by: σb=(2endi=start(yiˉy)22bendi=start(xiˉx)(yiˉy)(n2)endi=start(xiˉx)2)12

Useful Note...
It should be noted that the standard line-fitting routines available in most analysis software (e.g., Excel) do not use the standardized major axis fitting routine, but instead use linear regression (sometimes known as ordinary least-squares), whereby only the y-axis residuals are minimized. Given that accurate estimation of the slope is the objective of paleointensity analysis, standardized major axis, as outlined above, is the most appropriate method (e.g., Warton et al., 2006).

 

Statistic: BAnc and σB, the paleointensity estimate and its error
Report to 1 d.p.

A paleointensity estimate is obtained from BAnc=|b|×BLab, where BLab is the strength of the laboratory field. The associated standard error of the estimate is given by σB=σb×BLab.

3.3 Arai plot statistics

Figure 1. Schematic illustration of an Arai plot and some quantities used in the calculation of paleointensity statistics.

 

Statistic: YInt.

The y-axis (NRM) intercept of the best-fit line on the Arai plot. YInt.=ˉybˉx

 

Statistic: XInt.

The x-axis (TRM) intercept of the best-fit line on the Arai plot. XInt.=YInt.b

 

Statistic: Vector difference sum, VDS

The vector difference sum of the entire NRM vector (NRM). VDS=|NRMnmax|+nmax1i=1|NRMi+1NRMi|,

where |NRMi| denotes the length of the NRM vector at the ith step.

 

Statistic: x and y

x and y the x and y points on the Arai plot projected on to the best-fit line. These are used to calculate the NRM fraction and the length of the best-fit line among other parameters. There are multiple ways of calculating x and y, below is one example. xi=12(xi+yiYInt.b) yi=12(yi+bx+YInt)

Statistic: Δx and Δy

Δx and Δy are TRM and NRM lengths of the best-fit line on the Arai plot, respectively (Figure 1). Δx=|[max{xi}min{xi}]i=start,,end| Δy=|[max{yi}min{yi}]i=start,,end|

 

Statistic: f
Report to 3 d.p.

NRM fraction used for the best-fit on an Arai diagram (Coe et al., 1978). f=Δy|YInt.|

 

Statistic: fVDS
Report to 3 d.p.

NRM fraction used for the best-fit on an Arai diagram calculated as a vector difference sum (Tauxe and Staudigel, 2004). fVDS=ΔyVDS

 

Statistic: FRAC
Report to 3 d.p.

NRM fraction used for the best-fit on an Arai diagram determined entirely by vector difference sum calculation (Shaar and Tauxe, 2013). FRAC=end1i=start|NRMi+1NRMi|VDS

 

Statistic: β
Report to 3 d.p.

β is a measure of the relative data scatter around the best-fit line and is the ratio of the standard error of the slope to the absolute value of the slope (Coe et al., 1978) β=σb|b|

 

Statistic: g
Report to 3 d.p.

The gap factor (g) is a measure of the average NRM lost between successive temperature steps of the segment chosen for the best-fit line on the Arai plot. The gap reflects the average spacing of the selected Arai plot points along the best-fit line. g=1end1i=start(yi+1yi)2Δy2.

The upper limit of g is dependent on n and occurs when the points on the Arai plot are evenly spaced. glim=n2n1.

 

Statistic: GAP-MAX
Report to 3 d.p.

The gap factor defined above is measure of the average Arai plot point spacing and may not represent extremes of spacing. To account for this Shaar and Tauxe (2013)) proposed GAP-MAX, which is the maximum gap between two points determined by vector arithmetic. GAP-MAX=max{|NRMi+1NRMi|}i=start,,end1end1i=start|NRMi+1NRMi|

 

Statistic: q
Report to 1 d.p.

The quality factor (q) is a measure of the overall quality of the paleointensity estimate and combines the relative scatter of the best-fit line, the NRM fraction and the gap factor (Coe et al., 1978). q=|b|fgσb=fgβ

 

Statistic: w
Report to 1 d.p.

Weighting factor of Prévot et al. (1985). w=fgs, where s2 is given by: s2=2+2endi=start(xiˉx)(yiˉy)(endi=start(xiˉx)12endi=start(yiˉy)2)2. It can be noted, however, that w can be more readily calculated as: w=qn2.

 

Statistic: |k|
Report to 3 d.p.

3}

The curvature of the Arai plot as determined by the best-fit circle to all of the data (Paterson, 2011). To determine the Arai curvature, a best-fit circle of the form (xa)2+(yb)2=r2 is fitted to all of the data using a least-squares approach. For the fitting process, each axis is normalized by the maximum value of the data on that axis such that 0 TRM 1, and 0 NRM 1, which ensures a consistent comparison between data measured with different BLab. |k| is defined as the reciprocal of the radius (r) of the best-fit circle: |k|=1r.

Curvature can be given a sense of direction by considering the position of the circle center (a,b) with respect to the centroid of all of the data (Cx,Cy). k={1r if (Cx<a) and (Cy<b)1r if (a<Cx) and (b<Cy)0 if (a=Cx) and (b=Cy).

Numerical Tip...
Standard non-linear line fitting routines can be used for the calculation of the best-fit circle to the Arai plot data, however, the convergence of these algorithms can be poor and they are often numerically inaccurate when the data form a small arc of a much larger circle. This latter point is particularly important for near linear Arai plots as the data represent an increasingly smaller arc of the circle as the linearity increases. Chernov and Lesort (2005) developed an algorithm for fitting circles to data that is less affected by both of these issues and should be the preferred method of circle fitting (Paterson, 2011). Code for this algorithm in C++ and MATLAB are available from the downloads page.

 

Statistic: |k|
Report to 3 d.p.

The curvature of the Arai plot as determined by the best-fit circle to the selected best-fit Arai plot segment (Paterson, 2011). |k| is calculated in the same fashion as |k|, but the data are normalized by the respective maximums of the segment NRM and TRM.

 

Statistic: SSE

Report to 3 d.p.

The quality of the best-fit circle used to determine |k| (Paterson, 2011). SSE=nmaxi=1((xia)2+(yib)2r)2 Where xi and yi denote the normalized TRM and NRM, respectively.

 

Statistic: SCAT

SCAT is a parameter proposed by Shaar and Tauxe (2013) in an effort to reduce the number of parameters used to quantify a paleointensity estimate. SCAT is a Boolean operator, which uses the error on the best-fit Arai plot slope to indicate whether the data over the selected range are too scattered or not. This parameter provides a test for the scatter of the points on the Arai plot, pTRM checks, and pTRM tail checks. A schematic illustration of $SCAT$ and some examples are shown in Figure 2.

Figure 2. (a) Schematic illustration of the calculation of the SCAT box. Examples of data that (b) pass and (c) fail SCAT. In all examples the gray shaded area is the SCAT box. In parts (b) and (c) open (closed) circles denote the selected (unselected) points. The triangles and squares represent pTRM checks and pTRM tail checks, respectively.

First, from the chosen the best-fit segment on the Arai plot, the slope (b), the standard error of the slope (σb), and β(=σb|b|) are obtained. For a given sample, the threshold value for β that is used to select data (βthreshold) is used to determine an equivalent threshold for σb (σthreshold=|b|βthreshold).
b and σthreshold are then used to determine two lines that pass through the center of mass of the selected Arai plot segment (ˉx and ˉy), one with a slope of b+2σthreshold, the other with a slope of b2σthreshold Figure 2a). The so called SCAT box, is the box that is defined by the four intercepts that the above two lines make with the x- and y-axes (Figure 2a).
If all the data points associated with the chosen Arai plot segment, which includes both pTRM and tail checks, fall within the SCAT box then SCAT is TRUE. If one or more points fall outside the SCAT box then SCAT is FALSE. Samples are accepted only if SCAT is TRUE. pTRM checks and pTRM tail checks are included in the calculation of SCAT if the temperature of the check falls within temperature range of the selected Arai plot segment and the peak temperature before the check was performed is less than or equal to maximum temperature of the selected Arai plot segment. For example, if the temperature range of the selected Arai plot segment was 100-500°C, a pTRM check to 200°C performed after the 400°C step would be included in the calculation of SCAT. However, a pTRM check to 200°C performed after the 540°C would be not included in the calculation of SCAT. Examples of samples pass and fail SCAT are shown in Figure 2b and c, respectively.

 

Statistic: R2corr

Report to 3 d.p.

The correlation coefficient to estimate the strength of the linear relationship between the NRM and TRM over the best-fit Arai plot segment (the square of the Pearson correlation). R2corr=(endi=start(xiˉx)(yiˉy))2endi=start(xiˉx)2endi=start(yiˉy)2

 

Statistic: R2det

Report to 3 d.p.

Coefficient of determination to estimate variance accounted for by the linear model fit. R2det=1endi=start(yiyi)2endi=start(yiˉy)2

Useful Note...
It should be noted that this is similar to, but strictly not the same as the square of the Pearson correlation coefficient (R2corr). For least squares fitting that minimizes the y residuals only, R2corr is the same as the coefficient of determination for the model fit. Since Arai plot analysis uses the standardized major axis least-squares variant, this is not the case. For most practical purposes, however, the difference is small, particularly when the chosen Arai plot segment is highly linear with low noise.

 

Statistic: Z and Z

Report to 1 d.p.

Z is an Arai plot zigzag parameter defined by Yu and Tauxe (2005). Z=endi=startxi|˜bi|b|||XInt.|, where ˜bi is the instantaneous slope on the Arai plot determined from the ratio of the NRM lost to the TRM gained at the ith step: ˜bi=NRMtotalNRMiTRMi=YInt.yixi. Since no TRM is gained during the first step ˜b1=0. ˜bi|b| is a measure of the scatter around the best-fit slope on the Arai plot.

Yu (2012) proposed a modified version, Z. Z=1n1endi=start100×xi|˜bi|b|||YInt.|.

 

Statistic: IZZI_MD

Report to 3 d.p.

IZZI_MD is a parameter to quantify the zigzagging on an Arai plot (Shaar et al., 2011), which is most pronounced for multidomain grains measured with the IZZI protocol (e.g., Yu et al., 2004).
IZZI_MD is a measure of the area mapped out on the Arai plot and is determined using all the points on an Arai plot with the exception of the first step, where no TRM is imparted. The calculation is performed after the points have been normalized by the initial NRM, such that x(n)i=xiy1  and  y(n)i=yiy1.

If we consider the three consecutive Arai plot points illustrated in Figure 3a. The lengths of each side of the triangle formed by these points are given by: L1=(x(n)ix(n)i+1)2+(y(n)iy(n)i+1))2; L2=(x(n)i+1x(n)i+2)2+(y(n)i+1y(n)i+2)2; and L3=(x(n)i+2x(n)i)2+(y(n)i+2y(n)i)2.

Following the cosine rule, the angle ϕ is ϕ=arccos(L22+L23L212L2L3). The height of the triangle can be expressed as H=L3sin(ϕ), and hence the area of the triangle is given by Ai=L2L3sin(ϕ)2.

 

Figure 3.Illustration of the calculation of IZZI_MD. (a) The calculation of the area bounded by three consecutive Arai plot points. (b) The determination of the relative position of the mid-point.

Each Ai is given a sign (±) based on whether or not the ZI steps lie above the IZ steps, or vice versa. For ZI above IZ, Ai is positive, for IZ above ZI, Ai is negative. For the example in Figure 3, ZI is above IZ and the area is given a positive sign.
To determine the sign, we must determine the relative position of the mid-point of the three consecutive points. First, we calculate the best-fit line through the first and last points and obtain the intercept of the line (a1; Figure 3b). Using the slope of this best-fit line we determine the intercept of the line (a2) when the line passes through the mid-point. If a1 is less than a2 the mid-point lies above the end points, but if a2 is less than a1 the mid-point lies below the end points. In the case where a1=a2 all three points lie on a perfect straight line and both the area and the sign are identically zero. The pseudo-code for this is as follows, where Si denotes the sign of the ith area.

for i=2(nmax2) do
    if a1=a2 then
        Si=0
    else
        if Mid-point is a ZI then
            if a1<a2 then
                Si=1           ←{Figure 3b falls here}
            else
                Si=1
            end if
        else if Mid-point is a IZ then
            if a1<a2 then
                Si=1
            else
                Si=1
            end if
        end if
    end if
end for

IZZI_MD is the sum of the signed areas, normalized by length of the line connecting all of the ZI steps. IZZI_MD=nmax2i=2SiAiLZI. where LZI is given by: LZI=i ZI points(x(n)i+2x(n)i)2+(y(n)i+2y(n)i)2 .
N.B. The i+2 increment assumes alternating ZI and IZ step, whereby if i is a ZI step, i+1 is an IZ step, and i+2 is a ZI step.

Numerical Tip...
Calculating the area bounded by a series of points is a common geometric problem. As a consequence most programming languages have functions to perform the calculation either as inbuilt features or as freely available routines. For example, MATLAB has the inbuilt function A=polyarea(p) to return the area, A , bounded by the points given by the two-dimensional matrix p : p=(xiyixi+1yi+1xi+2yi+2)

 

↑ TOP