-
A theoretical lower bound on factorization error in MDMs.
We identify a fundamental lower bound on the factorization error that standard Masked Diffusion Models cannot escape, originating from their use of a single deterministic mask state. This bound explains why MDMs degrade sharply in the few-step regime.
-
Infinite Mask Diffusion Model (IMDM) with a stochastic infinite-state mask.
IMDM replaces the deterministic single-state mask with a stochastic, infinite-state mask, mitigating the theoretical error bound while preserving the simplicity and conditional-generation benefits of MDMs — including direct compatibility with pre-trained MDM weights.