DDPM - Diffusion Models Beat GANs On Image Synthesis (Machine Learning Research Paper Explained)

#ddpm #diffusionmodels #openai

GANs have dominated the image generation space for the majority of the last decade. This paper shows for the first time, how a non-GAN model, a DDPM, can be improved to overtake GANs at standard evaluation metrics for image generation. The produced samples look amazing and other than GANs, the new model has a formal probabilistic foundation. Is there a future for GANs or are Diffusion Models going to overtake them for good?

OUTLINE:
0:00 – Intro & Overview
4:10 – Denoising Diffusion Probabilistic Models
11:30 – Formal derivation of the training loss
23:00 – Training in practice
27:55 – Learning the covariance
31:25 – Improving the noise schedule
33:35 – Reducing the loss gradient noise
40:35 – Classifier guidance
52:50 – Experimental Results

Paper (this):
Paper (previous):
Code:

Abstract:
We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for sample quality using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128×128, 4.59 on ImageNet 256×256, and 7.72 on ImageNet 512×512, and we match BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. Finally, we find that classifier guidance combines well with upsampling diffusion models, further improving FID to 3.85 on ImageNet 512×512. We release our code at this https URL

Authors: Alex Nichol, Prafulla Dhariwal

Links:
TabNine Code Completion (Referral):
YouTube:
Twitter:
Discord:
BitChute:
Minds:
Parler:
LinkedIn:
BiliBili:

If you want to support me, the best thing to do is to share out the content 🙂

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar:
Patreon:
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

source

What's Hot

Go with Your Gut: Scaling Confidence for Autoregressive Image Generation – Takara TLDR

OpenAI declares ‘huge focus’ on enterprise growth with array of partnerships

From Silicon Valley to Nairobi: What the Global South’s AI leapfrogging teaches tech leaders

DDPM – Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained)

AGI is not coming!

Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)

Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review)

Tomb of Amenhotep III Reopens After Two-Decade Renovation

Limited Edition Print of Ozzy Osbourne Art Sold To Benefit Charities

Odili Donald Odita Sues Jack Shainman Gallery over ‘Withheld’ Artworks

Mohamed Hamidi, Moroccan Modernist Painter, Has Died at 84

Go with Your Gut: Scaling Confidence for Autoregressive Image Generation – Takara TLDR

OpenAI declares ‘huge focus’ on enterprise growth with array of partnerships

From Silicon Valley to Nairobi: What the Global South’s AI leapfrogging teaches tech leaders

What's Hot

DDPM – Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained)

Related Posts

Subscribe to Updates