Skip to content

Cheatsheet: Why AI runs on statistics

AI systems report degrees of belief (probabilities), not certainties, because they learn from a limited, noisy sample. Statistics is the discipline of reasoning about those degrees of belief without fooling yourself.

ProbabilityStatistics
DirectionForward: model to dataBackward: data to model
You knowThe rules of chanceThe observed data
You wantWhat data to expectWhat model produced it
ExampleFair coin to chance of 10 heads16 of 20 heads to is it fair?
In AIA model’s prediction + confidenceEvaluating / comparing models
ReasonWhat it means
Learning from a sampleA model saw some data, not all; generalizing is always a bet
The world is noisyIdentical-looking cases have different outcomes; no model removes that
PhaseIdeaShows up as
1 Describing dataCenter, spread, shape, correlationReading data before modeling; spotting redundant or degenerate features
2 The laws of chanceConditional probability, BayesSpam filters, fraud detection, medical triage; the base-rate trap
3 Random variablesDistributions, expected valueThe normal everywhere; loss functions and rewards as expected value
4 From sample to truthSampling, CLT, intervals, testsWhy a test set predicts the future; A/B tests; is model B really better?
Disease: 1 in 100 people. Test: 99% accurate both ways. Population: 10,000.
Sick = 100 Healthy = 9,900
True positives 99% of 100 = 99 False positives 1% of 9,900 = 99
Total positives = 99 + 99 = 198
P(sick | positive) = 99 / 198 = 50% (NOT 99%)
Rarer target -> more lopsided. The false positives from the large negative
group swamp the true positives from the small positive group.
  • Mistaking confidence for correctness (a confident wrong answer is still wrong).
  • Reading a single accuracy number without the base rate.
  • Hearing “correlation” and concluding “cause.”
  • Treating the training sample as the whole truth.
  • Thinking probability removes uncertainty (it measures it, it does not erase it).
  • Probability: a degree of belief about an uncertain event; runs forward, model to data.
  • Statistics: inferring the model behind observed data; runs backward, data to model.
  • Base rate: how common the thing being detected actually is; decides what an accuracy number means.
  • Calibration: whether a reported confidence matches the real frequency of being right.
  • Sample: the limited data a model actually saw, as opposed to the whole population.