10. Extension: Zero and One Inflation#
Score distributions in end-to-end self-driving carry a sharp spike at zero, the footprint of routes where the driving policy failed catastrophically, and a smaller spike at one for clean passes. Continuous flavors of IRT cannot put finite mass on a single point, so the fix is to peel each spike off into its own branch and let a vanilla IRT describe the interior. This section covers the GRM-style zero-and-one-inflated mixture, after Molenaar (2022).
10.1. Why a Plain Beta Cannot Absorb Boundary Spikes#
NavHard scores live in \([0, 1]\) and are not smoothly distributed across that range. About one route attempt in ten ends with a hard zero (collision, off-road, scene incompletion), and a small share lands at exactly one.
Recall the Beta-IRT shape from the previous chapter: each cell is \(\mathrm{Beta}(a_{s,n}, b_{s,n})\) on \((0, 1)\) with
where \(\theta_s\) is the subject’s ability, \(\beta_n\) is the route’s difficulty, \(\alpha_n\) is the route’s discrimination, and \(o_n\) is the route’s log-precision (see Beta-IRT in the previous chapter). With both shapes at least one, the density vanishes at the endpoints; push a shape below one and it blows up there as a continuous U-shape, never a point mass.
So a Beta cannot place finite mass on \(\{0\}\) or \(\{1\}\). The only continuous knob for chasing a boundary spike is to shrink the precision \(o_n\), which drives the per-cell Beta into a U-shape and ruins the interior fit. The fix is to peel each spike off into its own branch, leaving the continuous part free to shape the interior [3, 4].
10.2. Mixture Likelihood#
We use the GRM-style inflation from Molenaar [1]. Two per-route thresholds, \(g_n^0 < g_n^1\), split the ability axis into three regions, and the inflation rates depend on the subject’s standing \(\alpha_n \theta_s\) on that axis through the same discrimination \(\alpha_n\) as the continuous branch:
Weak subjects (\(\alpha_n \theta_s \ll g_n^0\)) pile up at zero; strong subjects (\(\alpha_n \theta_s \gg g_n^1\)) pile up at one; subjects in between split their mass into the interior Beta. We enforce \(g_n^1 > g_n^0\) through a softplus link, \(g_n^1 = g_n^0 + \mathrm{softplus}(g_n^{1,\mathrm{raw}})\), instead of a truncated prior, which keeps \(g_n^0\) and \(g_n^{1,\mathrm{raw}}\) unconstrained for the SVI guide.
The likelihood for a single cell is the three-branch mixture
with the three weights summing to one for every \(\theta_s\) because \(g_n^1 > g_n^0\). The continuous mean is the Molenaar Beta mean \(\sigma(\eta_{s,n})\), so weighting each branch by its probability gives the expected score:
The interior branch ranks subjects through \(\theta_s\) exactly as in plain Beta-IRT. The mixture only pulls the boundary mass off; the inflation rates also depend on \(\theta_s\), but through the same \(\alpha_n \theta_s\) that drives the interior, so they sharpen rather than override the ranking signal [2].
10.3. References#
D. Molenaar, M. Cúri, and J. L. Bazán. Zero and one inflated item response theory models for bounded continuous data. Journal of Educational and Behavioral Statistics, 47(6):693–735, 2022.
C. Zopluoglu and J. R. Lockwood. A comparative study of item response theory models for mixed discrete-continuous responses. Journal of Intelligence, 12(3):26, 2024.
D. Lambert. Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics, 34(1):1–14, 1992.
R. Ospina and S. L. P. Ferrari. Inflated beta distributions. Statistical Papers, 51(1):111–126, 2010.