CUSUM, or cumulative sum is used for detecting and monitoring change detection. Let us introduce a measurable space (Ω, F), where Ω = R∞, F = ∪nFn and Fn = σ{Yi, i ∈ {0, 1, …, n}}. The law of the sequence Yi, i = 1, …., is defined by the family of probability measures {Pv}, v ∈ N*. In other words, the probability measure Pv for a given v > 0, playing the role of the change point, is the measure generated on Ω by the sequence Yi, i = 1, … , when the distribution of the Yi’s changes at time v. The probability measures P0 and P∞ are the measures generated on Ω by the random variables Yi when they have an identical distribution. In other words, the system defined by the sequence Yi undergoes a “regime change” from the distribution P0 to the distribution P∞ at the change point time v.
The CUSUM (cumulative sum control chart) statistic is defined as the maximum of the log-likelihood ratio of the measure Pv to the measure P∞ on the σ-algebra Fn. That is,
Cn := max0≤v≤n log dPv/dP∞|Fn —– (1)
is the CUSUM statistic on the σ-algebra Fn. The CUSUM statistic process is then the collection of the CUSUM statistics {Cn} of (1) for n = 1, ….
The CUSUM stopping rule is then
T(h) := inf {n ≥ 0: max0≤v≤n log dPv/dP∞|Fn ≥ h} —– (2)
for some threshold h > 0. In the CUSUM stopping rule (2), the CUSUM statistic process of (1) is initialized at
C0 = 0 —– (3)
The CUSUM statistic process was first introduced by E. S. Page in the form that it takes when the sequence of random variables Yi is independent and Gaussian; that is, Yi ∼ N(μ, 1), i = 1, 2,…, with μ = μ0 for i < 𝜈 and μ = μ1 for i ≥ 𝜈. Since its introduction by Page, the CUSUM statistic process of (1) and its associated CUSUM stopping time of (2) have been used in a plethora of applications where it is of interest to perform detection of abrupt changes in the statistical behavior of observations in real time. Examples of such applications are signal processing, monitoring the outbreak of an epidemic, financial surveillance, and computer vision. The popularity of the CUSUM stopping time (2) is mainly due to its low complexity and optimality properties in both discrete and continuous time models.
Let Yi ∼ N(μ0, σ2) that change to Yi ∼ N(μ1, σ2) at the change point time v. We now proceed to derive the form of the CUSUM statistic process (1) and its associated CUSUM stopping time (2). Let us denote by φ(x) = 1/√2π e-x2/2 the Gaussian kernel. For the sequence of random variables Yi given earlier,
Cn := max0≤v≤n log dPv/dP∞|Fn
= max0≤v≤n log (∏i=1v-1φ(Yi-μ0)/σ ∏i=vnφ(Yi-μ1)/σ)/∏i=1nφ(Yi-μ0)/σ
= 1/σ2max0≤v≤n (μ1 – μ0)∑i=vn[Yi – (μ1 + μ0)/2] —– (4)
In view of (3), let us initialize the sequence (4) at Y0 = (μ1 + μ0)/2 and distinguish two cases.
a) μ1 > μ0: divide out (μ1 – μ0), multiply by the constant σ2 in (4) and use (2) to obtain CUSUM stopping T+:
T+(h+) = inf {n ≥ 0: max0≤v≤n ∑i=vn[Yi – (μ1 + μ0)/2] ≥ h+} —– (5)
for an appropriately scaled threshold h+ > 0.
b) μ1 < μ0: divide out (μ1 – μ0), multiply by the constant σ2 in (4) and use (2) to obtain CUSUM stopping T–:
T–(h–) = inf {n ≥ 0: max0≤v≤n ∑i=vn[(μ1 + μ0)/2 – Yi] ≥ h–} —– (6)
for an appropriately scaled threshold h– > 0.
The sequences form a CUSUM according to the deviation of the monitored sequential observations from the average of their pre- and postchange means. Although the stopping times (5) and (6) can be derived by formal CUSUM regime change considerations, they may also be used as general nonparametric stopping rules directly applied to sequential observations.