Skip to content

References: Diffusion models II, training and sampling

  • Berkeley CS294-158 Sp24, Deep Unsupervised Learning, Lecture 6: Diffusion Models (Pieter Abbeel, Wilson Yan, Kevin Frans, Philipp Wu). The primary anchor for this lesson’s sampler-design material (DDIM, classifier-free guidance, the latency-quality trade-off). Course page: sites.google.com/view/berkeley-cs294-158-sp24/.
  • Stanford CS236, Deep Generative Models, Lecture 16 (and the diffusion sections of the syllabus) (Stefano Ermon). Secondary framing for the diffusion paradigm overall, with the score-based view that lesson 14 will return to. Course page: deepgenerativemodels.github.io.

Foundational papers (the math this lesson is built on)

Section titled “Foundational papers (the math this lesson is built on)”

Production-grade systems built on the L13 stack

Section titled “Production-grade systems built on the L13 stack”

Further reading (faster samplers, distillation)

Section titled “Further reading (faster samplers, distillation)”
  • FID (Fréchet Inception Distance) is the standard image-quality metric for diffusion systems. Read across step counts to characterize the latency-quality Pareto frontier.
  • CLIP score measures text-image alignment for text-conditioned diffusion. Useful alongside FID for prompt-fidelity evaluation.
  • Sample-quality vs step-count Pareto frontier is the right reporting frame for any new sampler claim. A single number (FID at one step count) is not the right comparison; the full curve is.
  • Memorization probes detect when a diffusion model reproduces training-image-like content (a relevant question for the §6 watch territory’s IP-and-licensing forum).
Source curriculum (structural mirror, cited as further study):
• UC Berkeley CS294-158: Deep Unsupervised Learning (Spring 2024)
Course page: https://sites.google.com/view/berkeley-cs294-158-sp24/
Lecture videos: YouTube (link-out only)
Clawdemy's lessons are original prose that follows the pedagogical arc of this
source. We do not reproduce or transcribe it; we cite it as a recommended
companion. All rights to the original material remain with its authors.