During training, a diffusion model only ever sees inputs whose noise level matches the timestep exactly: the forward process ties each step t to a fixed signal-to-noise ratio (SNR). During inference that tie breaks, because the model's own prediction errors leave extra residual noise in the sample, so a given step carries SNR values the model never encountered in training. The mismatch compounds from step to step and degrades output quality.
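To make the mismatch concrete, here is a minimal numerical sketch. The schedule, step indices, and error variance below are illustrative assumptions, not values from any particular model; the point is only that a nonzero epsilon-prediction error strictly lowers the SNR the model sees at the next step relative to what the schedule promises.

```python
import numpy as np

# Illustrative linear beta schedule (values assumed, not from any specific model).
T = 1000
betas = np.linspace(1e-4, 0.02, T)
abar = np.cumprod(1.0 - betas)  # alpha-bar: cumulative signal retention

def scheduled_snr(t):
    """SNR the schedule pairs with step t: abar_t / (1 - abar_t)."""
    return abar[t] / (1.0 - abar[t])

def effective_snr(t, s, err_var):
    """SNR actually present at step s after one x0-prediction / re-noise step
    from t, when the epsilon prediction carries error variance err_var.

    With eps_hat = eps + e and Var(e) = err_var:
      x0_hat = (x_t - sqrt(1-abar_t) * eps_hat) / sqrt(abar_t)
             = x0 - sqrt((1-abar_t)/abar_t) * e
      x_s    = sqrt(abar_s) * x0_hat + sqrt(1-abar_s) * eps'
    so the noise variance at s is (1 - abar_s) plus an extra
    abar_s * (1-abar_t)/abar_t * err_var term the schedule knows nothing about.
    """
    extra = abar[s] * (1.0 - abar[t]) / abar[t] * err_var
    return abar[s] / (1.0 - abar[s] + extra)

t, s = 600, 599
print(scheduled_snr(s))           # what the model was trained to expect at s
print(effective_snr(t, s, 0.05))  # what it actually gets: strictly lower
```

A perfect prediction (`err_var = 0`) recovers the scheduled value exactly; any imperfection pushes the effective SNR below it, and repeating the step inflates the gap.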
Diffusion models naturally reconstruct images coarse to fine. In early denoising steps, when noise is high, they recover low-frequency content such as overall composition and color; as noise falls in later steps, they add high-frequency detail such as textures and sharp edges. The hierarchy follows from the noise statistics: white noise buries weak high-frequency image content long before it masks strong low-frequency content, and this frequency ordering is key to understanding the models' behavior.
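A quick way to see this ordering is to measure per-frequency-band SNR under additive white noise. The sketch below uses a synthetic 1-D signal with a 1/f amplitude spectrum as a stand-in for natural-image statistics; the signal, band edges, and noise level are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4096

# Toy 1-D "image": random phases under a 1/f amplitude spectrum, a common
# stand-in for natural-image statistics (strong low, weak high frequencies).
freqs = np.fft.rfftfreq(n)
amp = np.zeros_like(freqs)
amp[1:] = 1.0 / freqs[1:]
x0 = np.fft.irfft(amp * np.exp(2j * np.pi * rng.random(amp.shape)), n)
x0 /= x0.std()

def band_snr(sigma, lo, hi):
    """Empirical SNR of x0 + sigma * white noise, restricted to [lo, hi)."""
    band = (freqs >= lo) & (freqs < hi)
    signal_power = (np.abs(np.fft.rfft(x0)) ** 2)[band].sum()
    # White noise of std sigma has expected power ~ n * sigma**2 per FFT bin.
    noise_power = sigma ** 2 * n * band.sum()
    return signal_power / noise_power

sigma = 2.0  # heavy noise, as in an early denoising step
print(band_snr(sigma, 0.0, 0.02))  # low band: coarse structure still visible
print(band_snr(sigma, 0.25, 0.5))  # high band: fine detail is buried
```

At heavy noise the low band's SNR is orders of magnitude above the high band's, so coarse structure is recoverable first and fine detail only once the noise has dropped.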
The SNR-T bias can be corrected efficiently at inference time, with no retraining. At each denoising step, the intermediate image is decomposed into frequency bands with a wavelet transform; each band receives a small correction sized to its own noise mismatch, and the bands are recombined. Because the correction is a cheap per-band adjustment, it adds little compute and applies across models.
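The paragraph above describes a pipeline, not a specific algorithm, so the sketch below shows only the decompose/scale/recombine skeleton: a hand-rolled one-level 2-D Haar transform with per-band gains. The gains are hypothetical placeholders; deriving the actual correction from the measured per-band noise mismatch is the part this sketch leaves out.

```python
import numpy as np

def haar_dwt2(x):
    """One-level 2-D Haar transform: split x into (LL, LH, HL, HH) sub-bands."""
    lo = (x[0::2] + x[1::2]) / np.sqrt(2.0)   # low-pass along rows
    hi = (x[0::2] - x[1::2]) / np.sqrt(2.0)   # high-pass along rows
    ll = (lo[:, 0::2] + lo[:, 1::2]) / np.sqrt(2.0)
    lh = (lo[:, 0::2] - lo[:, 1::2]) / np.sqrt(2.0)
    hl = (hi[:, 0::2] + hi[:, 1::2]) / np.sqrt(2.0)
    hh = (hi[:, 0::2] - hi[:, 1::2]) / np.sqrt(2.0)
    return ll, lh, hl, hh

def haar_idwt2(ll, lh, hl, hh):
    """Exact inverse of haar_dwt2."""
    lo = np.empty((ll.shape[0], ll.shape[1] * 2))
    lo[:, 0::2] = (ll + lh) / np.sqrt(2.0)
    lo[:, 1::2] = (ll - lh) / np.sqrt(2.0)
    hi = np.empty_like(lo)
    hi[:, 0::2] = (hl + hh) / np.sqrt(2.0)
    hi[:, 1::2] = (hl - hh) / np.sqrt(2.0)
    x = np.empty((lo.shape[0] * 2, lo.shape[1]))
    x[0::2] = (lo + hi) / np.sqrt(2.0)
    x[1::2] = (lo - hi) / np.sqrt(2.0)
    return x

def correct_bands(x, gains):
    """Scale each sub-band by its gain (placeholder correction) and recombine."""
    return haar_idwt2(*(g * b for g, b in zip(gains, haar_dwt2(x))))

# Unit gains reproduce the input exactly (perfect reconstruction).
img = np.arange(64, dtype=float).reshape(8, 8)
assert np.allclose(correct_bands(img, (1.0, 1.0, 1.0, 1.0)), img)
```

Because the Haar transform is orthogonal with perfect reconstruction, scaling one band leaves the others untouched, which is what makes a per-band "surgical" correction possible; a real implementation would typically use a wavelet library such as PyWavelets rather than this hand-rolled version.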
