Summary: Why area equals slope
The last lesson handed you the fundamental theorem as a tool; this lesson explains why it is true. Area and slope look like unrelated ideas, one about how much region sits under a curve, the other about how steeply the curve rises, but they are secretly the same thing, and the proof is a single picture: extending an area by a thin sliver adds a rectangle whose height is the curve. From that one observation, the entire connection between integration and differentiation falls into place. This is the scan-it-in-five-minutes version.
Core ideas
Section titled “Core ideas”- The area function. Fix the left end at
aand let the right end slide:A(x) = ∫_a^x f(t) dtis the area accumulated up tox. The question that unlocks everything: how fast doesA(x)grow, i.e. what isA'(x)? - The one geometric move. Slide the right end by
dx. The new area is a thin sliver of widthdxand heightf(x), soA(x + dx) - A(x) ≈ f(x)·dx. Divide bydxand take the limit:A'(x) = f(x). The derivative of the area function is the original curve, because that height is what each new sliver is made of. - That single fact is the fundamental theorem.
A' = fmeansAis an antiderivative off; any antiderivativeFdiffers by a constant (which cancels), soF(b) - F(a) = ∫_a^b f(x) dx. Integration and differentiation are inverse operations because the slope of an accumulated area is the thing being accumulated. - Seen on familiar curves.
f = x²givesA = x³/3,A' = x²;f = e^xgivesA = e^x - 1,A' = e^x;f = sin xgivesA = 1 - cos x,A' = sin x; and the circle’sf = 2πrgivesA = πR²,A' = 2πR, the exact observation the first lesson made by hand. - The everyday version. A bucket under a tap: the rate the water level rises is the tap’s flow at that moment. Water is the integral, flow is the function, and fill-rate = flow is
A'(x) = f(x). A car’s odometer and speedometer are the same pair.
What changes for you
Section titled “What changes for you”The fundamental theorem stops being a lucky formula you memorize and becomes something you can reconstruct from one image: a sliver of area has height f(x), so the accumulated area grows at rate f(x). Area and slope are revealed as one idea seen twice, which is why the antiderivative trick works at all. In machine learning this is the exact relationship between a probability density and its cumulative distribution function: F(x) = ∫ f(t) dt and F'(x) = f(x), so every PDF/CDF pair is an instance of A'(x) = f(x). The same pairing shows up whenever you read a running total against the rate that feeds it, a cumulative loss whose slope is the current loss, a cumulative count whose slope is the instantaneous rate, which is why a practitioner can move between a rate and its running total without recomputing from scratch. With both halves of calculus now built and bound together, the final two lessons go deeper into rates: higher-order derivatives, then the Taylor series that approximates any function from them.