Windowed Count-Change Correct Classification Rate

Motivation: A natural metric for evaluating count-estimation performance is the Mean Absolute Error (MAE). However, using MAE to assess the count-estimation performance of an algorithm that produces counts by accumulating (summing) count changes is problematic because just a single isolated error in the count-change estimate early in time could persist forever and contribute an MAE of 1 (the so-called error-accumulation problem).

An alternative method to assess performance is to focus on count changes instead of counts. Count changes are quantitative and also discrete (integer-valued). One could focus on the quantitative aspect of count changes and evaluate performance via the MAE of count changes. An alternative approach is to ignore the numerical values of count changes and treat the discrete values, that they could possibly take, as different classes. One could then use the Correct Classification Rate (CCR) as a performance metric. However, a straightforward computation of CCR does not take into account the following important issues which arise in practice:

Temporal mis-synchronization: the time at which the ground-truth count is deemed to change may not be matched exactly by the estimated count-change time due to subjectivity of annotators and algorithm parameters. This could substantially decrease the CCR. In practice, however, this slight temporal misalignment does not affect the occupancy estimate on a longer time scale and should be ignored.
Sporadic and sparse count changes: ground-truth counts typically remain constant and change only occasionally. This could allow the trivial algorithm, which estimates a constant zero-valued count change, to have a very high CCR.

To address these practical problems, we propose a new CCR-like performance metric named Windowed Count-Change CCR (CCR_WCC) defined below.

Definition: The new performance metric CCR_WCC is described by the following equations:

In words, the numerator of CCR_WCC represents the number of count changes in the ground truth which have exactly the same count changes in the estimates within a temporal window. The denominator counts the number of time instants in which the count change in the ground truth is non-zero or the count change in the ground truth is zero, but there is no zero count change in the estimates within the temporal window. In addition, the denominator includes the number of time instants in which the count change in the estimate in non-zero, but it is not paired with any time instant in the ground truth.

Explanation: To overcome the mis-synchronization problem, for the ground-truth count change at time n , i.e., (y_n+1 − y_n), we find the estimated count change that best matches it within ±w time instants of time n, i.e., (ŷ_{n+1+δ_n} − ŷ_{n+δ_n}), where δ_n ∈ [-w, w] and n+δ_n is the time instant at which the estimate that best matches the ground-truth count change at time n is found. To prevent ambiguity, we adopt the convention that if δ = 0 is one of the best-matching time offsets in the range [-w, w], then δ_n=0. Otherwise, we set δ_n to be the smallest value of δ in the range [-w, w] where a best match is found. The value of w can be determined in practice via either physical constraints or by tuning it on an annotated validation dataset that is representative of the real-world environment pertinent to the application.

To prevent the small number of time instances where the count change is non-zero from being outnumbered by the much larger number of time instants where count remains constant, in CCR calculation we focus on those time instants where the ground-truth count changes, i.e., (y_n+1≠y_n). Additionally, we also take into account time instants where an estimated non-zero count-change is not matched to any ground-truth count change. This corresponds to the condition n ∉ N̂ in the defining equations for CCR_WCC.

We now illustrate the calculation of CCR_WCCthrough the following numerical example.

Example:

The figure shows temporal sequences of: (i) ground-truth count changes (y_n+1− y_n), (ii) estimated count changes (ŷ_n+1 − ŷ_n), and (iii) temporal offsets δ_n ∈ [-w, w], with w = 1, for which the value of (ŷ_{n+1+δ_n} − ŷ_{n+δ_n}) is closest to (y_n+1 − y_n). The black arrows point from time instant n in the ground-truth count-change sequence to (the best-matching) time instant n+δ_n in the estimate count-change sequence.

In this example, the time instants that will be contributing to the numerator are n=2 and n=6, which are the blue cells in the figure. At these two time instants, there is a non-zero count change in the ground truth which exactly equals a count change in the estimate within ±1 time instants.

In the denominator, the time instants , n = 2, 6, 10, 14 (blue and red cells) will contribute to the first term in the denominator because the count change in the ground truth at these time instants is non-zero. The time instant n = 17 (orange cell) corresponds to the situation where the count change in the ground truth is zero but there is no zero count-change in the estimate within ±1 time instants. Finally, the time instants n = 8, 12, 16, 18 (green cells) contribute to the term “M” in the defining equations, because these are the time instants which are not paired with any time instant in the ground truth during the matching process.