References: The shape of data: distributions and histograms
Source material
Source curriculum (structural mirror, cited as further study):
• Khan Academy, "Displaying and comparing quantitative data" and "Modeling data distributions" (Statistics & Probability)
  Author: Sal Khan and the Khan Academy team
  Unit page: https://www.khanacademy.org/math/statistics-probability/displaying-describing-data
  License: CC BY-NC-SA 4.0

Clawdemy's lessons are original prose that follows the pedagogical arc of these units. We do not embed, reproduce, or transcribe Khan's text or videos; we link out to the relevant units as recommended further study. The non-commercial clause aligns with Clawdemy's free, zero-revenue posture. All rights to the original materials remain with their authors.
Source-scope note: this lesson mirrors Khan's treatment of histograms and distribution shape and restates it in Clawdemy's voice with original hand-drawn examples. The machine-learning connections (skew transforms, hidden subpopulations, class imbalance as a visible base-rate problem) are Clawdemy framing. The bell shape is only introduced here; the normal distribution gets its own Track 9 lesson. Exact per-unit URLs are verified at promotion.
Read this next
- Khan Academy: Displaying and comparing quantitative data by Sal Khan and the Khan Academy team. The full unit this lesson mirrors, with videos and interactive practice on building and reading histograms, dot plots, and box plots, free and CC-licensed. The place to drill shape-reading until it is automatic.
Going deeper
A short, durable list. Both are free.
- Khan Academy, “Modeling data distributions” (within the course above). Where distribution shape meets the standard deviation: density curves, the empirical rule, and the bell shape that this lesson only names. The natural bridge to Track 9’s normal-distribution lesson.
- Khan Academy, “Summarizing quantitative data” (within the course above). The previous lesson’s source; revisit it to connect the numeric summaries (mean, median, standard deviation) to the shapes you now read in a histogram.
Adjacent topics
Where this sits inside this track.
- Summarizing data: center and spread. The previous lesson. Center and spread are the numbers; this lesson is the picture they summarize, and skew is where the two meet.
- The bell curve: the normal distribution. Later in the track (Phase 3). The bell shape introduced here becomes a precise, central tool, with the empirical rule and z-scores.
- Why AI runs on statistics. Lesson 1. Class imbalance, visible in a histogram of labels, is the base-rate problem from the opener made concrete.