Overfitting and the bias-variance tradeoff
What you’ll learn
Section titled “What you’ll learn”This is lesson 13 of Track 10, the opener of Phase 4 (Knowing whether your model is any good). By the end you will be able to diagnose underfitting versus overfitting by reading a model’s training error and test error together, the foundational skill in applied machine learning. The one capability to walk away with: given a pair of error numbers, tell which side of the bias-variance U-curve you are on and name the right next move.
The track structurally mirrors StatQuest’s intuition-first machine learning videos, with Microsoft’s “ML For Beginners” as the hands-on companion for readers who want to build the models in code. Full attribution is in this lesson’s references.
Where this fits
Section titled “Where this fits”The first three phases built a substantial toolbox: regression, classification, ensembles, clustering, compression. Phase 4 turns to the question hovering over every one of them: how do you know a model is actually any good? This lesson is the foundation of that phase, because bias and variance are the framework for understanding how a model can be bad. The next lesson, cross-validation, is about measuring it honestly, and the final lesson is about choosing the right metric to measure it with. Together the three lessons make the difference between guessing and engineering.
Before you start
Section titled “Before you start”Prerequisite: Lesson 1, What machine learning actually is. You need the core rule planted there, that a model is judged on data it has never seen, because this lesson formalizes exactly how a model can fail to generalize. The lesson also synthesizes patterns from across the track (linear regression, trees, forests, boosting, SVMs); you do not need to remember every detail of those lessons, but having seen them helps the bias-variance lens land.
By the end, you’ll be able to
Section titled “By the end, you’ll be able to”- Define bias (underfitting) and variance (overfitting)
- Explain the tradeoff and the U-shape of total error vs complexity
- Diagnose high bias, high variance, or a good fit from training and test error
- Place each method from the track on the bias-variance spectrum
- Explain regularization (ridge, lasso) as the standard low-variance dial
Time and difficulty
Section titled “Time and difficulty”- Read time: about 12 minutes
- Practice time: about 15 minutes (a diagnose-from-numbers exercise, a place-the-method question, and flashcards)
- Difficulty: standard