Skip to content

What machine learning actually is

This is lesson 1 of Track 10, the opener of Phase 1 (What learning from data means). By the end you will be able to look at almost any problem and classify it as supervised, unsupervised, or not a machine learning problem at all, and say in plain language what learning from data buys you over writing rules by hand. That single skill, naming what kind of problem you have, is the first decision in every machine learning project and the frame the rest of this track builds on.

The track structurally mirrors StatQuest’s intuition-first machine learning videos, with Microsoft’s “ML For Beginners” as the hands-on companion for readers who want to build the models in code. Full attribution is in this lesson’s references.

This is the track’s front door, so it assumes nothing and sets up everything. The map it draws (rules versus learning, supervised versus unsupervised, and the rule that a model is judged on unseen data) is the scaffolding for every lesson that follows. The very next lesson makes it concrete with the simplest supervised algorithm, linear regression, and Phase 4 returns to the unseen-data rule in full.

No prerequisites. This is the first lesson of the track. You need no prior machine learning, no calculus, and only the most basic comfort with the idea of data (a table of examples). If you can read a little code or follow a simple formula, that is more than enough.

  • Describe the flip from traditional programming to machine learning
  • Decide when learning from data beats writing a rule
  • Distinguish supervised (regression, classification) from unsupervised (clustering, dimensionality reduction) learning
  • Apply the two-question test to classify a problem as supervised, unsupervised, or neither
  • Explain why a model is judged on data it has never seen
  • Read time: about 10 minutes
  • Practice time: about 15 minutes (the classify-the-problem exercise plus flashcards)
  • Difficulty: intro