20 Taylor Approximation, Higher Derivative Test for Local Extrema, Riemann Integral, Darboux Sums and Integrals

Lecture from: 08.05.2024 | Video: Video ETHZ

Review: Power Series and Their Derivatives

Flashback: Power Series and Their Coefficients A key fact we’ve learned: if a function $f (x)$ can be represented by a power series $f (x) = \sum_{k = 0}^{\infty} c_{k} (x - x_{0})^{k}$ (that converges in some region), then:

$f$ is infinitely differentiable (smooth) in that region.

We can find all its derivatives by differentiating the series term by term.

Crucially, the coefficients $c_{k}$ of the series are directly determined by the function’s derivatives at the center point $x_{0}$ : $c_{j} = \frac{f ^{(j)} ( x _{0} )}{j !}$ . This is a HUGE link!

If $f (x) = \sum_{k = 0}^{\infty} c_{k} (x - x_{0})^{k}$ is a power series with radius of convergence $ρ > 0$, then $f$ is smooth (infinitely differentiable) on the open interval $(x_{0} - ρ, x_{0} + ρ)$. Furthermore, for any non-negative integer $j \in \mathbb{N}$ (where $f^{(0)} = f$), its $j$-th derivative is given by differentiating term by term: $f^{(j)} (x) = \sum_{k = j}^{\infty} c_{k} \cdot k (k - 1) \cdots (k - j + 1) (x - x_{0})^{k - j}$. A crucial consequence is obtained by evaluating this at $x = x_{0}$. All terms in the sum vanish except the first one (where $k = j$): $f^{(j)} (x_{0}) = c_{j} \cdot j (j - 1) \cdots 1 \cdot (x_{0} - x_{0})^{0} = c_{j} \cdot j!$.

This gives us the formula for the coefficients: $c_{j} = \frac{f ^{(j)} ( x _{0} )}{j !}$

Without Summation Notation (to see the pattern more clearly): If $f (x) = c_{0} + c_{1} (x - x_{0}) + c_{2} (x - x_{0})^{2} + c_{3} (x - x_{0})^{3} + \dots$

$f (x_{0}) = c_{0} ⟹ c_{0} = \frac{f ^{(0)} ( x _{0} )}{0 !}$
$f^{'} (x) = c_{1} + 2 c_{2} (x - x_{0}) + 3 c_{3} (x - x_{0})^{2} + \dots ⟹ f^{'} (x_{0}) = c_{1} ⟹ c_{1} = \frac{f ^{(1)} ( x _{0} )}{1 !}$
$f^{''} (x) = 2 c_{2} + 3 \cdot 2 c_{3} (x - x_{0}) + \dots ⟹ f^{''} (x_{0}) = 2 c_{2} ⟹ c_{2} = \frac{f ^{''} ( x _{0} )}{2 !}$
$f^{'''} (x) = 3 \cdot 2 \cdot 1 c_{3} + \dots ⟹ f^{'''} (x_{0}) = 6 c_{3} ⟹ c_{3} = \frac{f ^{'''} ( x _{0} )}{3 !}$
And in general, $f^{(n)} (x_{0}) = n! c_{n}$ , so $c_{n} = \frac{f ^{(n)} ( x _{0} )}{n !}$ .
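The pattern above can be checked mechanically. The following is a minimal sketch, assuming a power series with only finitely many non-zero coefficients (a polynomial) stored as a hypothetical list `coeffs` in the variable $t = x - x_{0}$; it differentiates term by term and recovers each $c_{n}$ as $f^{(n)} (x_{0}) / n!$.

```python
from math import factorial

def differentiate(coeffs):
    """Differentiate c0 + c1*t + c2*t^2 + ... term by term (t = x - x0)."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def recovered_coefficients(coeffs):
    """Return [f^(n)(x0) / n!] for n = 0, 1, ..., len(coeffs) - 1."""
    result = []
    current = list(coeffs)
    for n in range(len(coeffs)):
        # At t = 0 only the constant term survives, so f^(n)(x0) = current[0].
        result.append(current[0] / factorial(n))
        current = differentiate(current)
    return result

coeffs = [5, -3, 2, 7]  # hypothetical example: 5 - 3t + 2t^2 + 7t^3
print(recovered_coefficients(coeffs))
```

The recovered list agrees with the original coefficients, which is exactly the statement $c_{n} = f^{(n)} (x_{0}) / n!$.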

Taylor Approximation

Using Derivatives to Build Polynomial Approximations The beautiful relationship $c_{k} = \frac{f ^{(k)} ( x _{0} )}{k !}$ from power series gives us a brilliant idea. If a function $f$ is smooth (or at least has enough derivatives) near a point $x_{0}$ , we can use its derivatives at that single point $x_{0}$ to construct a polynomial that should approximate $f (x)$ well for $x$ values near $x_{0}$ . This is the essence of Taylor approximation.

The relationship $c_{k} = \frac{f ^{(k)} ( x _{0} )}{k !}$ for power series makes it plausible that if a function $f$ is smooth (or at least sufficiently differentiable) near a point $x_{0}$ , we can approximate it using a polynomial constructed from its derivatives at $x_{0}$ .

Indeed, if $f (x)$ can be represented as a power series $\sum_{k = 0}^{\infty} c_{k} (x - x_{0})^{k}$, then substituting $c_{k} = \frac{f ^{(k)} ( x _{0} )}{k !}$ gives $f (x) = \sum_{k = 0}^{\infty} \frac{f ^{(k)} ( x _{0} )}{k !} (x - x_{0})^{k}$. This means $f (x)$ is the limit of the partial sums: $f (x) = \lim_{n \to \infty} \sum_{k = 0}^{n} \frac{f ^{(k)} ( x _{0} )}{k !} (x - x_{0})^{k}$. This convergence is uniform on every closed interval $[x_{0} - r, x_{0} + r]$ with $0 < r < ρ$, where $ρ$ is the radius of convergence.

This motivates the following definition:

Definition: Taylor Polynomial

The Taylor Polynomial: A Function’s “Best Fit” Polynomial For a function $f$ that’s $n$ -times differentiable at a point $x_{0}$ , its Taylor polynomial of order $n$ centered at $x_{0}$ is a specific polynomial of degree at most $n$ . Its magic property? This polynomial $T_{n} (x)$ and its first $n$ derivatives perfectly match the values of $f$ and its first $n$ derivatives at the point $x_{0}$ . It’s designed to be the best possible polynomial approximation of degree $n$ to $f$ right around $x_{0}$ .

Let $I \subseteq \mathbb{R}$ be an interval with more than one point. Let $n \in \mathbb{N}$ (a non-negative integer), and let $f : I \to \mathbb{R}$ be $n$-times differentiable at $x_{0} \in I$.

The Taylor polynomial of order $n$ (or degree $n$ ) of $f$ at (or around, or centered at) $x_{0}$ is: $T_{n} (f; x; x_{0}) = \sum_{k = 0}^{n} \frac{f ^{(k)} ( x _{0} )}{k !} (x - x_{0})^{k}$ This polynomial is often denoted $T_{n, f, x_{0}} (x)$ or simply $T_{n} (x)$ if $f$ and $x_{0}$ are clear from context. It is the unique polynomial $P (x)$ of degree at most $n$ such that $P (x)$ and its first $n$ derivatives match $f (x)$ and its first $n$ derivatives at the point $x_{0}$ : $P (x_{0}) = f (x_{0})$ , $P^{'} (x_{0}) = f^{'} (x_{0})$ , $\dots$ , $P^{(n)} (x_{0}) = f^{(n)} (x_{0})$ .

Taylor Polynomials for $f (x) = \cos (x)$ at $x_{0} = 0$

Example: Approximating Cosine with Polynomials Let’s build some Taylor polynomials for $f (x) = \cos (x)$ centered at $x_{0} = 0$ (these are also called Maclaurin polynomials). We need the derivatives of $\cos (x)$ at $0$: $\cos (0) = 1$, $- \sin (0) = 0$, $- \cos (0) = - 1$, $\sin (0) = 0$, $\cos (0) = 1$, and so on. The polynomials will use these values to try and “hug” the cosine curve near $x = 0$.

The derivatives of $f (x) = \cos (x)$ evaluated at $x_{0} = 0$ are:

$f (x) = \cos (x) ⟹ f (0) = 1$
$f^{'} (x) = - \sin (x) ⟹ f^{'} (0) = 0$
$f^{''} (x) = - \cos (x) ⟹ f^{''} (0) = - 1$
$f^{'''} (x) = \sin (x) ⟹ f^{'''} (0) = 0$
$f^{(4)} (x) = \cos (x) ⟹ f^{(4)} (0) = 1$

(The pattern of derivatives at $0$ is $1, 0, - 1, 0, 1, 0, - 1, 0, \dots$)

The first few Taylor polynomials for $\cos (x)$ at $x_{0} = 0$ are:

$T_{0} (f; x; 0) = \frac{f ( 0 )}{0 !} x^{0} = \frac{1}{1} \cdot 1 = 1$
$T_{1} (f; x; 0) = f (0) + \frac{f ^{'} ( 0 )}{1 !} x = 1 + \frac{0}{1} x = 1$ (Since $f^{'} (0) = 0$, $T_{1}$ is the same as $T_{0}$.)
$T_{2} (f; x; 0) = f (0) + f^{'} (0) x + \frac{f ^{''} ( 0 )}{2 !} x^{2} = 1 + 0 x + \frac{- 1}{2} x^{2} = 1 - \frac{x ^{2}}{2}$
$T_{3} (f; x; 0) = T_{2} (f; x; 0) + \frac{f ^{'''} ( 0 )}{3 !} x^{3} = 1 - \frac{x ^{2}}{2} + \frac{0}{6} x^{3} = 1 - \frac{x ^{2}}{2}$ (Since $f^{'''} (0) = 0$, $T_{3}$ is the same as $T_{2}$.)
$T_{4} (f; x; 0) = T_{3} (f; x; 0) + \frac{f ^{(4)} ( 0 )}{4 !} x^{4} = (1 - \frac{x ^{2}}{2}) + \frac{1}{24} x^{4} = 1 - \frac{x ^{2}}{2 !} + \frac{x ^{4}}{4 !}$

Notice how these polynomials start to resemble the well-known Maclaurin series for $\cos (x)$: $\cos (x) = 1 - \frac{x ^{2}}{2 !} + \frac{x ^{4}}{4 !} - \frac{x ^{6}}{6 !} + \frac{x ^{8}}{8 !} - \dots$
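To see the polynomials approach the cosine numerically, here is a small sketch. The helper name `cos_taylor` and the sample point `x = 0.5` are illustrative choices, not part of the lecture; the code only uses the period-4 derivative pattern listed above.

```python
import math

def cos_taylor(n, x):
    """Evaluate T_n(cos; x; 0) using the derivative pattern 1, 0, -1, 0, ..."""
    pattern = [1, 0, -1, 0]  # cos^(k)(0) repeats with period 4
    return sum(pattern[k % 4] / math.factorial(k) * x**k for k in range(n + 1))

x = 0.5  # illustrative sample point near the center 0
for n in (0, 2, 4, 6):
    approx = cos_taylor(n, x)
    # Print the order, the polynomial value, and the absolute error.
    print(n, approx, abs(approx - math.cos(x)))
```

The printed errors shrink quickly as $n$ grows, and `cos_taylor(1, x)` equals `cos_taylor(0, x)` because the odd-order terms vanish.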

Satz: Taylor’s Theorem with Lagrange Remainder (Taylor Approximation)

How Good is the Taylor Approximation? Enter the Remainder! Taylor’s Theorem is a cornerstone result. It doesn’t just say that $T_{n} (x)$ approximates $f (x)$ ; it gives us an explicit formula for the error or remainder term, $R_{n} (x) = f (x) - T_{n} (x)$ . The Lagrange form of this remainder tells us the error is $\frac{f ^{(n + 1)} ( ξ )}{( n + 1 )!} (x - x_{0})^{n + 1}$ , where $ξ$ is some mysterious point between $x_{0}$ and $x$ . This means the error depends on the $(n + 1)$ -th derivative of $f$ and how far $x$ is from $x_{0}$ .

Let $a < b$ in $\mathbb{R}$. Let $f : [a, b] \to \mathbb{R}$ be a function that is $n$-times continuously differentiable on the closed interval $[a, b]$ and $(n + 1)$-times differentiable on the open interval $(a, b)$. Let $x_{0} \in [a, b]$. Then for every $x \in [a, b]$ with $x \neq x_{0}$, there exists a point $ξ$ strictly between $x_{0}$ and $x$ such that: $f (x) = T_{n} (f; x; x_{0}) + \frac{f ^{(n + 1)} ( ξ )}{( n + 1 )!} (x - x_{0})^{n + 1}$. Remark: The term $R_{n} (f; x; x_{0}) = f (x) - T_{n} (f; x; x_{0}) = \frac{f ^{(n + 1)} ( ξ )}{( n + 1 )!} (x - x_{0})^{n + 1}$ is called the remainder term. This specific formula is the Lagrange form of the remainder. It quantifies the error when approximating $f (x)$ by its $n$-th Taylor polynomial. The existence of such a $ξ$ is guaranteed, though its exact value is usually unknown.
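Even though $ξ$ is unknown, the Lagrange form yields a usable error bound whenever $f^{(n + 1)}$ can be bounded. The sketch below assumes $f = \cos$ and $x_{0} = 0$, where every derivative is bounded by $1$ in absolute value, so $|R_{n} (x)| \leq \frac{|x|^{n + 1}}{(n + 1)!}$. The helper names and the sample point are illustrative only.

```python
import math

def cos_taylor(n, x):
    """Evaluate T_n(cos; x; 0); the derivatives at 0 cycle through 1, 0, -1, 0."""
    pattern = [1, 0, -1, 0]
    return sum(pattern[k % 4] / math.factorial(k) * x**k for k in range(n + 1))

def lagrange_bound(n, x):
    """Upper bound for |R_n(x)|, using |cos^(n+1)(xi)| <= 1 for every xi."""
    return abs(x) ** (n + 1) / math.factorial(n + 1)

x = 1.0  # illustrative sample point
for n in range(9):
    error = abs(math.cos(x) - cos_taylor(n, x))
    # The actual error never exceeds the Lagrange bound.
    print(n, error, lagrange_bound(n, x), error <= lagrange_bound(n, x))
```

The last column stays `True` for each order shown, while both the error and the bound decay factorially.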

Korollar: Qualitative Version of Taylor Approximation (Peano Form of Remainder)

The Error Shrinks Super Fast! (Little-o Notation) This corollary gives us another way to think about the error. It says that the difference $f (x) - T_{n} (f; x; x_{0})$ goes to zero faster than $(x - x_{0})^{n}$ does as $x$ approaches $x_{0}$ . We use “little-o” notation for this: $f (x) - T_{n} (f; x; x_{0}) = o ((x - x_{0})^{n})$ . This essentially means that $T_{n}$ is an exceptionally good polynomial approximation of degree $n$ to $f$ very close to $x_{0}$ .

Let $a < b$ in $\mathbb{R}$, and let $f : [a, b] \to \mathbb{R}$ be $n$-times continuously differentiable on $[a, b]$ (this implies $f^{(n)}$ is continuous). Let $x_{0} \in [a, b]$. Then: $\lim_{x \to x_{0}} \frac{f ( x ) - T _{n} ( f ; x ; x _{0} )}{( x - x _{0} ) ^{n}} = 0$. This means that the error $f (x) - T_{n} (f; x; x_{0})$ diminishes more rapidly than $(x - x_{0})^{n}$ as $x \to x_{0}$. We write this using little-o notation as $f (x) - T_{n} (f; x; x_{0}) = o ((x - x_{0})^{n})$ as $x \to x_{0}$. The Taylor polynomial $T_{n}$ is the “best” polynomial approximation of degree at most $n$ to $f$ near $x_{0}$ in this specific sense.

Exercise: Find the corresponding statement for the tangent line approximation (which is $T_{1} (f; x; x_{0}) = f (x_{0}) + f^{'} (x_{0}) (x - x_{0})$) that we encountered at the beginning of our discussion of differentiability. (Hint: The definition of differentiability is $\lim_{x \to x_{0}} \frac{f ( x ) - f ( x _{0} )}{x - x _{0}} = f^{'} (x_{0})$. Subtracting $f^{'} (x_{0})$ from both sides gives $\lim_{x \to x_{0}} \frac{f ( x ) - ( f ( x _{0} ) + f ^{'} ( x _{0} ) ( x - x _{0} ))}{x - x _{0}} = 0$. This is exactly the statement for $n = 1$.)

Proof of the Corollary (using Taylor’s Theorem for order $n - 1$): We assume $n \geq 1$; for $n = 0$ the claim is just the continuity of $f$ at $x_{0}$. We need $f$ to be $n$-times differentiable for $T_{n}$ to be defined, and $f^{(n)}$ to be continuous at $x_{0}$ for the limit step.

Step 1. By Taylor’s Theorem applied with order $n - 1$ (the remainder then involves $f^{(n)} (ξ_{x})$, which exists because $f$ is $n$-times differentiable): $f (x) = T_{n - 1} (f; x; x_{0}) + \frac{f ^{(n)} ( ξ _{x} )}{n !} (x - x_{0})^{n}$ for some $ξ_{x}$ between $x_{0}$ and $x$.

Step 2. By the definition of the Taylor polynomial: $T_{n} (f; x; x_{0}) = T_{n - 1} (f; x; x_{0}) + \frac{f ^{(n)} ( x _{0} )}{n !} (x - x_{0})^{n}$.

Step 3. Subtracting the second equation from the first, the $T_{n - 1}$ terms cancel: $f (x) - T_{n} (f; x; x_{0}) = \frac{f ^{(n)} ( ξ _{x} )}{n !} (x - x_{0})^{n} - \frac{f ^{(n)} ( x _{0} )}{n !} (x - x_{0})^{n} = \frac{1}{n !} (f^{(n)} (ξ_{x}) - f^{(n)} (x_{0})) (x - x_{0})^{n}$. So, for $x \neq x_{0}$: $\frac{f ( x ) - T _{n} ( f ; x ; x _{0} )}{( x - x _{0} ) ^{n}} = \frac{1}{n !} (f^{(n)} (ξ_{x}) - f^{(n)} (x_{0}))$.

Step 4. As $x \to x_{0}$, we also have $ξ_{x} \to x_{0}$ (since $ξ_{x}$ is squeezed between $x$ and $x_{0}$). Since $f$ is $n$-times continuously differentiable, $f^{(n)}$ is continuous at $x_{0}$. Therefore $\lim_{x \to x_{0}} f^{(n)} (ξ_{x}) = f^{(n)} (x_{0})$, which means $\lim_{x \to x_{0}} (f^{(n)} (ξ_{x}) - f^{(n)} (x_{0})) = 0$. This implies: $\lim_{x \to x_{0}} \frac{f ( x ) - T _{n} ( f ; x ; x _{0} )}{( x - x _{0} ) ^{n}} = \frac{1}{n !} \cdot 0 = 0$. This completes the proof. $□$
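The little-o statement can be observed numerically. This sketch assumes the illustrative choice $f = \exp$, $x_{0} = 0$, $n = 2$, so that $T_{2} (x) = 1 + x + \frac{x^{2}}{2}$; none of these choices come from the lecture. It prints the quotient $\frac{f (x) - T_{2} (x)}{x^{2}}$ for shrinking $x$.

```python
import math

def t2_exp(x):
    """Second-order Taylor polynomial of exp at 0: 1 + x + x^2/2."""
    return 1 + x + x**2 / 2

def peano_quotient(x):
    """(f(x) - T_2(x)) / x^2, which should tend to 0 as x -> 0."""
    return (math.exp(x) - t2_exp(x)) / x**2

ratios = [peano_quotient(x) for x in (0.1, 0.01, 0.001)]
for x, r in zip((0.1, 0.01, 0.001), ratios):
    print(x, r)
```

The quotient shrinks roughly in proportion to $x$, consistent with the next Taylor term $\frac{x^{3}}{6}$ dominating the error.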

Consequences for Local Extrema: Higher Derivative Test

Taylor’s Idea Helps Find Maxima and Minima! Taylor approximations give us a powerful tool to analyze a function’s behavior near a critical point (where $f^{'} (x_{0}) = 0$ ). If the first few derivatives are zero, the first non-zero derivative at $x_{0}$ can tell us if we have a local maximum, minimum, or neither.

Korollar (Higher Order Derivative Test - slightly reformulated)

The Test: Using the First Non-Zero Derivative Suppose at a point $x_{0}$ , the function’s derivative is zero, and maybe even the next few are also zero: $f^{'} (x_{0}) = f^{''} (x_{0}) = \dots = f^{(n - 1)} (x_{0}) = 0$ . But the $n$ -th derivative, $f^{(n)} (x_{0})$ , is NOT zero.

If $n$ is odd: $f$ has no local extremum at $x_{0}$ . (It’s typically an inflection point with a horizontal tangent if $n \geq 3$ ). Intuition: The term $(x - x_{0})^{n}$ changes sign as $x$ passes $x_{0}$ , so $f (x) - f (x_{0})$ also changes sign locally.

If $n$ is even and $f^{(n)} (x_{0}) > 0$ : $f$ has a strict local minimum at $x_{0}$ . Intuition: $(x - x_{0})^{n}$ is positive, so $f (x) - f (x_{0})$ is positive locally; $f (x) > f (x_{0})$ .

If $n$ is even and $f^{(n)} (x_{0}) < 0$ : $f$ has a strict local maximum at $x_{0}$ . Intuition: $(x - x_{0})^{n}$ is positive, so $f (x) - f (x_{0})$ is negative locally; $f (x) < f (x_{0})$ .

Let $n \geq 1$ be an integer and $a < b$ in $\mathbb{R}$. Let $f : [a, b] \to \mathbb{R}$ be $n$-times continuously differentiable, and let $x_{0} \in (a, b)$ be an interior point of the interval. Assume: $f^{'} (x_{0}) = f^{''} (x_{0}) = \dots = f^{(n - 1)} (x_{0}) = 0$, but $f^{(n)} (x_{0}) \neq 0$. (This means $f^{(n)} (x_{0})$ is the first non-zero derivative at $x_{0}$.)

Then:

If $n$ is odd, $f$ has no local extremum at $x_{0}$. (If $n \geq 3$, $x_{0}$ is a saddle/inflection point with a horizontal tangent. If $n = 1$, the condition says $f^{'} (x_{0}) \neq 0$, so $x_{0}$ is not a critical point in the first place.)
If $n$ is even and $f^{(n)} (x_{0}) > 0$, then $f$ has a strict local minimum at $x_{0}$. (This means there exists some $δ > 0$ such that for all $x \in (x_{0} - δ, x_{0} + δ)$ with $x \neq x_{0}$, we have $f (x) > f (x_{0})$.)
If $n$ is even and $f^{(n)} (x_{0}) < 0$, then $f$ has a strict local maximum at $x_{0}$.

(Proof idea: Near $x_{0}$ , $f (x) - f (x_{0}) \approx \frac{f ^{(n)} ( x _{0} )}{n !} (x - x_{0})^{n}$ . The sign of this difference determines if $f (x)$ is above or below $f (x_{0})$ .)
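The case distinction of the test can be written as a short decision procedure. This is a sketch only: the function name `classify_critical_point` and its input convention (a list `[f'(x0), f''(x0), ...]` of derivative values supplied by the caller) are assumptions made for illustration.

```python
def classify_critical_point(derivatives):
    """Apply the higher derivative test.

    derivatives[k] holds f^(k+1)(x0), i.e. the list starts with f'(x0).
    The first non-zero entry decides the outcome.
    """
    for n, value in enumerate(derivatives, start=1):
        if value != 0:
            if n % 2 == 1:
                return "no local extremum"
            return "strict local minimum" if value > 0 else "strict local maximum"
    # All supplied derivatives vanish: the test gives no information.
    return "inconclusive"

print(classify_critical_point([0, 0, 0, 24]))  # f(x) = x^4 at x0 = 0
print(classify_critical_point([0, 0, 6]))      # f(x) = x^3 at x0 = 0
print(classify_critical_point([0, -2]))        # f(x) = -x^2 at x0 = 0
```

For exact inputs the comparison with `0` is safe; with floating-point derivative values a tolerance would be needed instead.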

Korollar (The Special Case $n = 2$ : Second Derivative Test)

The Familiar Second Derivative Test This is the higher order test when $n = 2$ , which we often use: If $f^{'} (x_{0}) = 0$ (critical point):

If $f^{''} (x_{0}) > 0 ⟹$ cup up $⟹$ strict local minimum.

If $f^{''} (x_{0}) < 0 ⟹$ cup down $⟹$ strict local maximum. (If $f^{''} (x_{0}) = 0$ , this test is inconclusive, and we’d need to check higher derivatives or other methods).

Let $a < b$ in $\mathbb{R}$, let $f : [a, b] \to \mathbb{R}$ be twice continuously differentiable ($C^{2}$), and let $x_{0} \in (a, b)$. Assume: $f^{'} (x_{0}) = 0$ (i.e., $x_{0}$ is a critical point) and $f^{''} (x_{0}) \neq 0$. Then:

If $f^{''} (x_{0}) > 0$ , then $f$ has a (strict) local minimum at $x_{0}$ .
If $f^{''} (x_{0}) < 0$ , then $f$ has a (strict) local maximum at $x_{0}$ .

Beispiel: Finding Local Extrema for $f (x) = x^{4} - x^{2} + 1$

Putting the Test to Work Let’s find local maxima/minima for $f (x) = x^{4} - x^{2} + 1$ .

Find critical points (where $f^{'} (x) = 0$ ).

Use the second derivative test $f^{''} (x)$ at these points.

What are the local extrema of $f (x) = x^{4} - x^{2} + 1$ ?

This function is smooth (it’s a polynomial). Candidates for local extrema occur where $f^{'} (x) = 0$ .

$f^{'} (x) = 4 x^{3} - 2 x$ . Set $f^{'} (x) = 0 ⟹ 4 x^{3} - 2 x = 0 ⟹ 2 x (2 x^{2} - 1) = 0$ .

The critical points (candidates) are: $x_{0} = 0$, or $2 x^{2} - 1 = 0 ⟹ x^{2} = 1/2 ⟹ x_{1, 2} = \pm \frac{1}{\sqrt{2}}$.

To determine the nature of these extrema, we use the second derivative: $f^{''} (x) = \frac{d}{d x} (4 x^{3} - 2 x) = 12 x^{2} - 2$ .

At $x_{0} = 0$ : $f^{'} (0) = 0$ . $f^{''} (0) = 12 (0)^{2} - 2 = - 2$ . Since $f^{''} (0) = - 2 < 0$ , by the Second Derivative Test, $f$ has a local maximum at $x_{0} = 0$ . The value is $f (0) = 0^{4} - 0^{2} + 1 = 1$ .
At $x_{1} = \frac{1}{\sqrt{2}}$ (and similarly for $x_{2} = - \frac{1}{\sqrt{2}}$ due to symmetry, since $x^{2} = 1/2$ in both cases): $f^{'} (\pm \frac{1}{\sqrt{2}}) = 0$. $f^{''} (\pm \frac{1}{\sqrt{2}}) = 12 (\pm \frac{1}{\sqrt{2}})^{2} - 2 = 12 \cdot \frac{1}{2} - 2 = 6 - 2 = 4$. Since $f^{''} (\pm \frac{1}{\sqrt{2}}) = 4 > 0$, by the Second Derivative Test, $f$ has local minima at $x = \frac{1}{\sqrt{2}}$ and $x = - \frac{1}{\sqrt{2}}$. Using $x^{2} = \frac{1}{2}$, the value is $f (\pm \frac{1}{\sqrt{2}}) = (\frac{1}{2})^{2} - \frac{1}{2} + 1 = \frac{1}{4} - \frac{1}{2} + 1 = \frac{3}{4}$.
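The computation can be double-checked numerically. The sketch below hard-codes $f$, $f^{'}$ and $f^{''}$ for $f (x) = x^{4} - x^{2} + 1$ and evaluates them at the three critical points $0$ and $\pm \frac{1}{\sqrt{2}}$; the function names are illustrative.

```python
import math

def f(x):
    return x**4 - x**2 + 1

def f_prime(x):
    return 4 * x**3 - 2 * x

def f_second(x):
    return 12 * x**2 - 2

critical_points = [0.0, 1 / math.sqrt(2), -1 / math.sqrt(2)]
for x0 in critical_points:
    # Second derivative test: the sign of f''(x0) decides the type.
    kind = "local minimum" if f_second(x0) > 0 else "local maximum"
    print(x0, f_prime(x0), f_second(x0), f(x0), kind)
```

Up to rounding, the output shows $f^{'} = 0$ at all three points, a local maximum with value $1$ at $0$, and local minima with value $\frac{3}{4}$ at $\pm \frac{1}{\sqrt{2}}$.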

Caveat: It’s possible for a smooth function to have a strict local extremum at a point even if all of its derivatives are zero at that point. The Higher Order Derivative Test (Korollar) only applies if some $f^{(n)} (x_{0})$ is non-zero.

Example: The function $f (x) = \begin{cases} e^{- 1/ x^{2}} & \text{if } x \neq 0 \\ 0 & \text{if } x = 0 \end{cases}$ (which is a classic example of a $C^{\infty}$ function that is not analytic at $x = 0$) has a strict local minimum at $x = 0$ (since $f (x) > 0$ for $x \neq 0$ and $f (0) = 0$).

However, it can be shown that $f^{(k)} (0) = 0$ for all $k \in \mathbb{N}$. The Taylor series for this function around $x = 0$ is identically zero and does not represent the function anywhere but at $x = 0$.
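To make the claim about vanishing derivatives more concrete, here is the first step worked out. It is a sketch that uses only the definition of the derivative and the substitution $t = 1/h$, which is not spelled out in the lecture.

$$
f^{'} (0) = \lim_{h \to 0} \frac{f (h) - f (0)}{h} = \lim_{h \to 0} \frac{e^{- 1/ h^{2}}}{h} = \lim_{t \to \pm \infty} t \, e^{- t^{2}} = 0 .
$$

The higher derivatives follow the same pattern: for $x \neq 0$, each $f^{(k)} (x)$ is a polynomial in $1/x$ multiplied by $e^{- 1/ x^{2}}$, and the exponential decay beats every power of $1/x$ as $x \to 0$.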

The Riemann Integral

New Adventure: From Slopes to Areas! We’re shifting gears from derivatives (which tell us about rates of change and slopes) to a new concept: the integral. The Riemann integral is a way to formally define and calculate the area under a curve.

Motivation: Calculating Areas

Motivation: Calculation of areas. For example:

What is the area of a circle of radius $r$ ? (Answer: $π r^{2}$ )
What is the area of the region bounded by the graph of $y = 1/ x$, the x-axis, and the vertical lines $x = 1$ and $x = 2$? This area will turn out to be $\ln (2)$.

Definition and Integrability Criteria

Laying the Groundwork: Slicing and Summing To find the area under a potentially complicated curve, the basic idea is to:

Chop the interval on the x-axis into many small pieces (a “partition”).

Over each small piece, approximate the area under the curve with a simple rectangle.

Sum the areas of these rectangles.

See what happens to this sum as we make the pieces smaller and smaller. This section introduces the formal machinery: partitions, Darboux sums, and finally, the definition of Riemann integrability.

Basic Setup for Integration Interval

In the following, $a < b$ in $\mathbb{R}$ and $I = [a, b]$ is a closed, bounded interval.

Definition: Partition of an Interval

Chopping Up the Interval: Partitions A “partition” of an interval $[a, b]$ is just a fancy way of saying we pick a finite set of points in the interval, starting with $a$ and ending with $b$ , that divide the interval into smaller subintervals.

Formal Definition of a Partition

A partition $P$ of an interval $I = [a, b]$ is a finite, ordered subset of $I$ that includes the endpoints $a$ and $b$ .

We usually write $P = \{ x_{0}, x_{1}, \dots, x_{n} \}$ where $a = x_{0} < x_{1} < \dots < x_{n} = b$.

Resulting Subintervals and Their Lengths

Remark: A partition $P = \{ x_{0}, x_{1}, \dots, x_{n} \}$ (consisting of $n + 1$ points) divides the interval $I = [a, b]$ into $n$ subintervals: $[x_{0}, x_{1}], [x_{1}, x_{2}], \dots, [x_{n - 1}, x_{n}]$. The $i$-th subinterval is $I_{i} = [x_{i - 1}, x_{i}]$, and its length is $Δ x_{i} = x_{i} - x_{i - 1}$. (The notation $δ_{i}$ is also commonly used for $Δ x_{i}$.)

Refinements of Partitions

Definition of a Refinement

If $P$ and $P^{'}$ are partitions of $I$, then $P^{'}$ is called a refinement of $P$ if $P \subseteq P^{'}$ (meaning $P^{'}$ contains all the points of $P$, and possibly more). Intuition: A refinement means you’re chopping the interval into even smaller (or at least, not larger) pieces.

Common Refinement

Remark: Any two partitions $P, P^{'}$ of $I$ always possess a common refinement, for example, their union $P \cup P^{'}$. This common refinement contains all the points from both $P$ and $P^{'}$.
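Partitions, subinterval lengths and common refinements translate directly into code. The following sketch represents a partition as a sorted Python list of floats; the helper names are hypothetical and chosen for illustration.

```python
def subinterval_lengths(partition):
    """Lengths x_i - x_{i-1} of the subintervals of a sorted partition."""
    return [right - left for left, right in zip(partition, partition[1:])]

def is_refinement(fine, coarse):
    """True if `fine` contains every point of `coarse`."""
    return set(coarse) <= set(fine)

def common_refinement(p, q):
    """Union of two partitions of the same interval, sorted again."""
    return sorted(set(p) | set(q))

P = [0.0, 0.5, 1.0]
Q = [0.0, 0.25, 1.0]
R = common_refinement(P, Q)

print(R)                                     # all points of P and Q together
print(is_refinement(R, P), is_refinement(R, Q))
print(subinterval_lengths(R), sum(subinterval_lengths(R)))
```

The lengths of the subintervals always add up to $b - a$, here $1$.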

Prerequisite: Boundedness of the Function

Now, let $f : [a, b] \to \mathbb{R}$ be a bounded function. This is a crucial prerequisite: $f$ must not shoot off to $\pm \infty$ within the interval. Formally, this means there exists a real number $C \geq 0$ such that $| f (x) | \leq C$ for all $x \in [a, b]$.

Notation for Subinterval Lengths in Darboux Sums

For a partition $P = \{ x_{0}, x_{1}, \dots, x_{n} \}$ of $I = [a, b]$, let $δ_{i} = Δ x_{i} = x_{i} - x_{i - 1}$ be the length of the $i$-th subinterval $[x_{i - 1}, x_{i}]$ (for $i = 1, \dots, n$).

Definition: Lower and Upper Darboux Sums

Approximating Area: Underestimates and Overestimates For each small slice (subinterval) created by our partition, we take the greatest lower bound ( $m_{i}$ ) and the least upper bound ( $M_{i}$ ) of the values of $f$ on that slice. (If $f$ is continuous, these are simply the lowest and highest values $f$ takes there.)

Lower Darboux Sum ( $s (f, P)$ ): Use $m_{i}$ as the height of a rectangle on the $i$ -th slice. The sum of these “under-rectangles” gives an underestimate of the total area.

Upper Darboux Sum ( $S (f, P)$ ): Use $M_{i}$ as the height of a rectangle on the $i$ -th slice. The sum of these “over-rectangles” gives an overestimate.

Local Infima ( $m_{i}$ ) and Suprema ( $M_{i}$ ) on Subintervals

For a bounded function $f$ and a partition $P$ :

Let $m_{i} = \inf \{ f (x) \mid x \in [x_{i - 1}, x_{i}] \}$ (the infimum, or greatest lower bound, of $f$ on the $i$-th subinterval).
Let $M_{i} = \sup \{ f (x) \mid x \in [x_{i - 1}, x_{i}] \}$ (the supremum, or least upper bound, of $f$ on the $i$-th subinterval).

Since $f$ is bounded on $[a, b]$ , $m_{i}$ and $M_{i}$ are guaranteed to be finite real numbers for each subinterval.

The Lower Darboux Sum ( $s (f, P)$ )

The lower Darboux sum of $f$ with respect to the partition $P$ is: $s (f, P) := \sum_{i = 1}^{n} m_{i} δ_{i} = \sum_{i = 1}^{n} m_{i} (x_{i} - x_{i - 1})$ This represents an approximation to the area under the graph of $f$ using rectangles that are inscribed under the graph.

The Upper Darboux Sum ( $S (f, P)$ )

The upper Darboux sum of $f$ with respect to the partition $P$ is: $S (f, P) := \sum_{i = 1}^{n} M_{i} δ_{i} = \sum_{i = 1}^{n} M_{i} (x_{i} - x_{i - 1})$ This represents an approximation using rectangles that are circumscribed over (or contain the part of) the graph of $f$ .
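For a monotone function the infimum and supremum on each subinterval sit at the endpoints, which makes Darboux sums easy to compute. The sketch below assumes a decreasing function and uses $f (x) = 1/x$ on $[1, 2]$ with uniform partitions; the helper names are hypothetical, and the shortcut does not apply to non-monotone functions.

```python
import math

def darboux_sums_decreasing(f, partition):
    """Lower and upper Darboux sums for a function that is decreasing on [a, b].

    For decreasing f, the infimum on [x_{i-1}, x_i] is f(x_i)
    and the supremum is f(x_{i-1}).
    """
    pairs = list(zip(partition, partition[1:]))
    lower = sum(f(right) * (right - left) for left, right in pairs)
    upper = sum(f(left) * (right - left) for left, right in pairs)
    return lower, upper

def uniform_partition(a, b, n):
    """Partition of [a, b] into n subintervals of equal length."""
    return [a + (b - a) * i / n for i in range(n + 1)]

def f(x):
    return 1 / x

for n in (1, 10, 100, 1000):
    lower, upper = darboux_sums_decreasing(f, uniform_partition(1.0, 2.0, n))
    print(n, lower, upper)
print(math.log(2))
```

The lower and upper sums bracket $\ln (2)$ and their gap shrinks like $\frac{1}{2 n}$ as the partition gets finer.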

Remark: Bounds on Darboux Sums

Relationship to Global Function Bounds

If $m_{\mathrm{glob}} = \inf \{ f (x) \mid x \in [a, b] \}$ and $M_{\mathrm{glob}} = \sup \{ f (x) \mid x \in [a, b] \}$ are the global infimum and supremum of $f$ on $I$, then for any subinterval $[x_{i - 1}, x_{i}]$: $m_{\mathrm{glob}} \leq m_{i} \leq M_{i} \leq M_{\mathrm{glob}}$.

Overall Inequality Chain

Then, for any partition $P$: $m_{\mathrm{glob}} (b - a) = \sum_{i = 1}^{n} m_{\mathrm{glob}} δ_{i} \leq \sum_{i = 1}^{n} m_{i} δ_{i} = s (f, P)$.

Similarly, $S (f, P) = \sum_{i = 1}^{n} M_{i} δ_{i} \leq \sum_{i = 1}^{n} M_{\mathrm{glob}} δ_{i} = M_{\mathrm{glob}} (b - a)$. So, $m_{\mathrm{glob}} (b - a) \leq s (f, P) \leq S (f, P) \leq M_{\mathrm{glob}} (b - a)$.

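In particular, the set of all lower sums and the set of all upper sums are both bounded, so their supremum and infimum exist.

Lemma: Properties of Darboux Sums under Refinement

Squeezing In on the True Area This lemma tells us two important things: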

Refining helps: If you add more points to your partition (refine it), your lower sum can only get bigger (or stay the same), and your upper sum can only get smaller (or stay the same). They get closer to each other and, intuitively, closer to the true area.

Lower sums are always below upper sums: Any lower sum (for any partition) is always less than or equal to any upper sum (for any, possibly different, partition). This is crucial for defining the integral.

Property 1: Effect of Refinement on Individual Sums

If $P^{'}$ is a refinement of a partition $P$ (i.e., $P \subseteq P^{'}$), then: $s (f, P) \leq s (f, P^{'})$ and $S (f, P^{'}) \leq S (f, P)$. Combining these, we always have $s (f, P) \leq s (f, P^{'}) \leq S (f, P^{'}) \leq S (f, P)$.

Intuition: Imagine you have a single subinterval in $P$ . If $P^{'}$ splits this subinterval into two smaller ones, the minimum values on the smaller intervals can only be greater than or equal to the minimum on the larger original interval (and similarly for maximums). This pushes the lower sum up (or keeps it level) and the upper sum down (or keeps it level).
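The effect of adding a single point can be watched on a small example. This sketch assumes an increasing function, $f (x) = x^{2}$ on $[0, 1]$, so that the infimum and supremum on each subinterval are attained at the left and right endpoints; the names are illustrative.

```python
def darboux_sums_increasing(f, partition):
    """Lower and upper Darboux sums for a function that is increasing on [a, b]."""
    pairs = list(zip(partition, partition[1:]))
    lower = sum(f(left) * (right - left) for left, right in pairs)   # inf at left end
    upper = sum(f(right) * (right - left) for left, right in pairs)  # sup at right end
    return lower, upper

def square(x):
    return x * x

P = [0.0, 0.5, 1.0]
P_refined = [0.0, 0.25, 0.5, 1.0]  # P plus the extra point 0.25

s_P, S_P = darboux_sums_increasing(square, P)
s_R, S_R = darboux_sums_increasing(square, P_refined)
print(s_P, s_R, S_R, S_P)  # 0.125 0.140625 0.578125 0.625
```

The four numbers appear in increasing order, matching $s (f, P) \leq s (f, P^{'}) \leq S (f, P^{'}) \leq S (f, P)$.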

Property 2: Comparison of Any Lower Sum with Any Upper Sum

For any two arbitrary partitions $P_{1}, P_{2}$ of the interval $I$ : $s (f, P_{1}) \leq S (f, P_{2})$ (The lower sum for any partition is less than or equal to the upper sum for any other partition, or even the same partition).

Proof Sketch for Property 2

Proof sketch for (2): Let $P^{'} = P_{1} \cup P_{2}$ be the common refinement of $P_{1}$ and $P_{2}$.

Using part (1): $s (f, P_{1}) \leq s (f, P^{'})$ (since $P^{'}$ refines $P_{1}$).

We also know that for any single partition $P^{'}$, $s (f, P^{'}) \leq S (f, P^{'})$ (since $m_{i} \leq M_{i}$ for all $i$).

And again using part (1): $S (f, P^{'}) \leq S (f, P_{2})$ (since $P^{'}$ refines $P_{2}$).

Combining these inequalities: $s (f, P_{1}) \leq s (f, P^{'}) \leq S (f, P^{'}) \leq S (f, P_{2})$. Thus, $s (f, P_{1}) \leq S (f, P_{2})$.
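The chain of inequalities in the proof sketch can be traced on two unrelated partitions. As before, this is a sketch assuming the increasing function $f (x) = x^{2}$ on $[0, 1]$ so that endpoint values give the exact infima and suprema; the partitions and names are illustrative.

```python
def darboux_sums_increasing(f, partition):
    """Lower and upper Darboux sums for a function that is increasing on [a, b]."""
    pairs = list(zip(partition, partition[1:]))
    lower = sum(f(left) * (right - left) for left, right in pairs)
    upper = sum(f(right) * (right - left) for left, right in pairs)
    return lower, upper

def square(x):
    return x * x

P1 = [0.0, 0.5, 1.0]
P2 = [0.0, 0.25, 1.0]               # neither partition refines the other
common = sorted(set(P1) | set(P2))  # common refinement P1 ∪ P2

s1, _ = darboux_sums_increasing(square, P1)
_, S2 = darboux_sums_increasing(square, P2)
s_c, S_c = darboux_sums_increasing(square, common)

# s(f, P1) <= s(f, P') <= S(f, P') <= S(f, P2)
print(s1, s_c, S_c, S2)
```

Passing through the common refinement is what links a lower sum of one partition to an upper sum of a different one.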

Definition: Lower and Upper Darboux Integrals

The Best Possible Underestimate and Overestimate Now we consider all possible partitions of our interval.

The lower Darboux integral ( $s (f)$ ) is the “highest” of all possible lower Darboux sums. It’s the supremum (least upper bound) of all the underestimates.

The upper Darboux integral ( $S (f)$ ) is the “lowest” of all possible upper Darboux sums. It’s the infimum (greatest lower bound) of all the overestimates.

The Lower Darboux Integral ( $s (f)$ )

Let $P (I)$ be the set of all possible partitions of the interval $I = [a, b]$ .

The lower Darboux integral of $f$ over $I$ is defined as: $s (f) := \sup \{ s (f, P) \mid P \in P (I) \}$ (also denoted $\underline{\int_{a}^{b}} f (x) \, d x$, or “Unterintegral”).

The Upper Darboux Integral ( $S (f)$ )

The upper Darboux integral of $f$ over $I$ is defined as: $S (f) := \inf \{ S (f, P) \mid P \in P (I) \}$ (also denoted $\overline{\int_{a}^{b}} f (x) \, d x$, or “Oberintegral”).

Fundamental Inequality: $s (f) \leq S (f)$

Remark: From Lemma 5.1.2 (part 2), we know that any $s (f, P_{1})$ is a lower bound for the set of all upper sums ${S (f, P_{2})}$ . Therefore, the supremum of lower sums, $s (f)$ , must be less than or equal to any particular upper sum $S (f, P_{2})$ . This makes $s (f)$ a lower bound for the set of all upper sums. Thus, $s (f)$ must be less than or equal to the infimum of the upper sums, $S (f)$ .

So, for any bounded function $f$ , we always have: $s (f) \leq S (f)$

Riemann Integrability

When Does the Area Make Sense? Riemann Integrability! A bounded function $f$ is said to be Riemann integrable on $[a, b]$ if its lower Darboux integral and its upper Darboux integral are equal. That is, if the best possible underestimate matches the best possible overestimate. If they are equal, this common value is defined to be the Riemann integral of $f$ from $a$ to $b$ , denoted $\int_{a}^{b} f (x) d x$ . This is what we intuitively think of as the “area under the curve.”

Condition for Riemann Integrability

A bounded function $f : [a, b] \to \mathbb{R}$ is Riemann integrable (or simply “integrable”) on $[a, b]$ if its lower Darboux integral equals its upper Darboux integral: $s (f) = S (f)$

The Riemann Integral $\int_{a}^{b} f (x) d x$

In this case, this common value is called the Riemann integral of $f$ over $[a, b]$ (or from $a$ to $b$ ) and is denoted by: $\int_{a}^{b} f (x) d x := s (f) = S (f)$

Consequence of Non-Integrability

If $s (f) < S (f)$ , the function is not Riemann integrable on $[a, b]$ . Intuitively, this means there’s an unresolvable “gap” between the total area of inscribed and circumscribed rectangles, no matter how fine the partition.
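As a worked illustration of the definition, consider the simple case $f (x) = x$ on $[0, 1]$ with the uniform partition $P_{n} = \{ 0, \frac{1}{n}, \frac{2}{n}, \dots, 1 \}$. This example is not taken from the lecture; it is a sketch showing how the two Darboux integrals can be pinned down. Since $f$ is increasing, $m_{i} = \frac{i - 1}{n}$, $M_{i} = \frac{i}{n}$ and $δ_{i} = \frac{1}{n}$.

$$
\begin{aligned}
s (f, P_{n}) &= \sum_{i = 1}^{n} \frac{i - 1}{n} \cdot \frac{1}{n} = \frac{1}{n^{2}} \cdot \frac{(n - 1) n}{2} = \frac{1}{2} - \frac{1}{2 n}, \\
S (f, P_{n}) &= \sum_{i = 1}^{n} \frac{i}{n} \cdot \frac{1}{n} = \frac{1}{n^{2}} \cdot \frac{n (n + 1)}{2} = \frac{1}{2} + \frac{1}{2 n}, \\
\frac{1}{2} - \frac{1}{2 n} &\leq s (f) \leq S (f) \leq \frac{1}{2} + \frac{1}{2 n} \quad \text{for every } n \geq 1 .
\end{aligned}
$$

Letting $n \to \infty$ forces $s (f) = S (f) = \frac{1}{2}$, so $f$ is Riemann integrable on $[0, 1]$ with $\int_{0}^{1} x \, d x = \frac{1}{2}$.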

Continue here: 21 Dirichlet’s Function, Integrability
