Lecture from: 08.05.2024 | Video: Video ETHZ
Review: Power Series and Their Derivatives
Flashback: Power Series and Their Coefficients A key fact we’ve learned: if a function can be represented by a power series (that converges in some region), then:
- is infinitely differentiable (smooth) in that region.
- We can find all its derivatives by differentiating the series term by term.
- Crucially, the coefficients of the series are directly determined by the function’s derivatives at the center point : . This is a HUGE link!
If is a power series with radius of convergence , then is smooth (infinitely differentiable) in the open interval . Furthermore, for any non-negative integer (where ), its -th derivative is given by differentiating term-by-term: A crucial consequence is obtained by evaluating this at . All terms in the sum become zero except for the first term (where ): .
This gives us the formula for the coefficients:
Without Summation Notation (to see the pattern more clearly): If
- And in general, , so .
Taylor Approximation
Using Derivatives to Build Polynomial Approximations The beautiful relationship from power series gives us a brilliant idea. If a function is smooth (or at least has enough derivatives) near a point , we can use its derivatives at that single point to construct a polynomial that should approximate well for values near . This is the essence of Taylor approximation.
The relationship for power series makes it plausible that if a function is smooth (or at least sufficiently differentiable) near a point , we can approximate it using a polynomial constructed from its derivatives at .
Indeed, if can be represented as a power series , then substituting gives This means is the limit of the partial sums: This convergence is uniform on any closed interval around that is strictly within the radius of convergence.
This motivates the following definition:
Definition: Taylor Polynomial
The Taylor Polynomial: A Function’s “Best Fit” Polynomial For a function that’s -times differentiable at a point , its Taylor polynomial of order centered at is a specific polynomial of degree at most . Its magic property? This polynomial and its first derivatives perfectly match the values of and its first derivatives at the point . It’s designed to be the best possible polynomial approximation of degree to right around .
Let be an interval with more than one point. Let (a non-negative integer), and let be -times differentiable at .
The Taylor polynomial of order (or degree ) of at (or around, or centered at) is: This polynomial is often denoted or simply if and are clear from context. It is the unique polynomial of degree at most such that and its first derivatives match and its first derivatives at the point : , , , .
Taylor Polynomials for at
Example: Approximating Cosine with Polynomials Let’s build some Taylor polynomials for centered at (these are also called Maclaurin polynomials). We need the derivatives of at : , , , , , and so on. The polynomials will use these values to try and “hug” the cosine curve near .
The derivatives of evaluated at are: (The pattern of derivatives at is )
The first few Taylor polynomials for at are: (Since , is the same as ).
(Since , is the same as ).
Notice how these polynomials start to resemble the well-known Maclaurin series for :
Satz: Taylor’s Theorem with Lagrange Remainder (Taylor Approximation)
How Good is the Taylor Approximation? Enter the Remainder! Taylor’s Theorem is a cornerstone result. It doesn’t just say that approximates ; it gives us an explicit formula for the error or remainder term, . The Lagrange form of this remainder tells us the error is , where is some mysterious point between and . This means the error depends on the -th derivative of and how far is from .
Let in . Let be a function that is -times continuously differentiable on the closed interval and -times differentiable on the open interval . Let . Then for every with , there exists a point strictly between and such that: Remark: The term is called the remainder term. This specific formula is the Lagrange form of the remainder. It quantifies the error when approximating by its -th Taylor polynomial. The existence of such a is guaranteed, though its exact value is usually unknown.
Korollar: Qualitative Version of Taylor Approximation (Peano Form of Remainder)
The Error Shrinks Super Fast! (Little-o Notation) This corollary gives us another way to think about the error. It says that the difference goes to zero faster than does as approaches . We use “little-o” notation for this: . This essentially means that is an exceptionally good polynomial approximation of degree to very close to .
Let in , and be -times continuously differentiable on (this implies is continuous). Let . Then: This means that the error diminishes more rapidly than as . We write this using little-o notation as as . The Taylor polynomial is the “best” polynomial approximation of degree at most to near in this specific sense.
Exercise: Find the corresponding statement for the tangent line approximation (which is ) that we encountered at the beginning of our discussion of differentiability. (Hint: The definition of differentiability is . Rearranging this gives . This is exactly the statement for .)
Proof of the Corollary (using Taylor’s Theorem for order ): We need to be -times differentiable for to be defined, and to be continuous at for the limit step. By Taylor’s Theorem applied with order (so we need to be -times differentiable for the remainder term to make sense): , for some between and . Recall the definition of : . Subtracting these two equations: . So, for : As , we also have (since is squeezed between and ). Since is -times continuously differentiable, its -th derivative is continuous at . Therefore, , which means . This implies: This completes the proof.
Consequences for Local Extrema: Higher Derivative Test
Taylor’s Idea Helps Find Maxima and Minima! Taylor approximations give us a powerful tool to analyze a function’s behavior near a critical point (where ). If the first few derivatives are zero, the first non-zero derivative at can tell us if we have a local maximum, minimum, or neither.
Korollar (Higher Order Derivative Test - slightly reformulated)
The Test: Using the First Non-Zero Derivative Suppose at a point , the function’s derivative is zero, and maybe even the next few are also zero: . But the -th derivative, , is NOT zero.
- If is odd: has no local extremum at . (It’s typically an inflection point with a horizontal tangent if ). Intuition: The term changes sign as passes , so also changes sign locally.
- If is even and : has a strict local minimum at . Intuition: is positive, so is positive locally; .
- If is even and : has a strict local maximum at . Intuition: is positive, so is negative locally; .
Let be an integer, in . Let be -times continuously differentiable, and let be an interior point of the interval. Assume: , but . (This means is the first non-zero derivative at ).
Then:
- If is odd, has no local extremum at . (If , is a saddle/inflection point with a horizontal tangent. If , this condition means , so wasn’t a critical point to begin with unless we are just classifying points).
- If is even and , then has a strict local minimum at . (This means there exists some such that for all with , we have ).
- If is even and , then has a strict local maximum at .
(Proof idea: Near , . The sign of this difference determines if is above or below .)
Korollar (The Special Case : Second Derivative Test)
The Familiar Second Derivative Test This is the higher order test when , which we often use: If (critical point):
- If cup up strict local minimum.
- If cup down strict local maximum. (If , this test is inconclusive, and we’d need to check higher derivatives or other methods).
Let in , be twice continuously differentiable (), and . Assume: (i.e., is a critical point) and . Then:
- If , then has a (strict) local minimum at .
- If , then has a (strict) local maximum at .
Beispiel: Finding Local Extrema for
Putting the Test to Work Let’s find local maxima/minima for .
- Find critical points (where ).
- Use the second derivative test at these points.
What are the local extrema of ?
This function is smooth (it’s a polynomial). Candidates for local extrema occur where .
. Set .
The critical points (candidates) are: , or .
To determine the nature of these extrema, we use the second derivative: .
-
At : . . Since , by the Second Derivative Test, has a local maximum at . The value is .
-
At (and similarly for due to symmetry, since in both cases): . . Since , by the Second Derivative Test, has local minima at and . The value is .
Caveat: It’s possible for a smooth function to have a strict local extremum at a point even if all higher derivatives are zero at that point. The Higher Order Derivative Test (Korollar) only applies if some is non-zero.
Example: The function (which is a classic example of a function that is not analytic at ) has a strict local minimum at (since for and ).
However, it can be shown that for all . The Taylor series for this function around is identically zero and does not represent the function anywhere but at .
The Riemann Integral
New Adventure: From Slopes to Areas! We’re shifting gears from derivatives (which tell us about rates of change and slopes) to a new concept: the integral. The Riemann integral is a way to formally define and calculate the area under a curve.
Motivation: Calculating Areas
Motivation: Calculation of areas. For example:
- What is the area of a circle of radius ? (Answer: )
- What is the area of the region bounded by the graph of , the x-axis, and the vertical lines and ?
This area will turn out to be .
Definition and Integrability Criteria
Laying the Groundwork: Slicing and Summing To find the area under a potentially complicated curve, the basic idea is to:
- Chop the interval on the x-axis into many small pieces (a “partition”).
- Over each small piece, approximate the area under the curve with a simple rectangle.
- Sum the areas of these rectangles.
- See what happens to this sum as we make the pieces smaller and smaller. This section introduces the formal machinery: partitions, Darboux sums, and finally, the definition of Riemann integrability.
Basic Setup for Integration Interval
In the following, in and is a closed, bounded interval.
Definition: Partition of an Interval
Chopping Up the Interval: Partitions A “partition” of an interval is just a fancy way of saying we pick a finite set of points in the interval, starting with and ending with , that divide the interval into smaller subintervals.
Formal Definition of a Partition
A partition of an interval is a finite, ordered subset of that includes the endpoints and .
We usually write where .
Resulting Subintervals and Their Lengths
Remark: A partition (consisting of points) divides the interval into subintervals: . The -th subinterval is , and its length is . (The in the original text is also commonly used for ).
Refinements of Partitions
Definition of a Refinement
If and are partitions of , then is called a refinement of if (meaning contains all the points of , and possibly more). Intuition: A refinement means you’re chopping the interval into even smaller (or at least, not larger) pieces.
Common Refinement
Remark: Any two partitions of always possess a common refinement, for example, their union . This common refinement contains all the points from both and .
Prerequisite: Boundedness of the Function
Now, let be a bounded function. This is a crucial prerequisite: must not shoot off to within the interval. Formally, this means there exists a real number such that for all .
Notation for Subinterval Lengths in Darboux Sums
For a partition of , let be the length of the -th subinterval (for ).
Definition: Lower and Upper Darboux Sums
Approximating Area: Underestimates and Overestimates For each small slice (subinterval) created by our partition, we can find the lowest value () and highest value () the function takes on that slice.
- Lower Darboux Sum (): Use as the height of a rectangle on the -th slice. The sum of these “under-rectangles” gives an underestimate of the total area.
- Upper Darboux Sum (): Use as the height of a rectangle on the -th slice. The sum of these “over-rectangles” gives an overestimate.
Local Infima () and Suprema () on Subintervals
For a bounded function and a partition :
- Let (the infimum, or greatest lower bound, of on the -th subinterval).
- Let (the supremum, or least upper bound, of on the -th subinterval).
Since is bounded on , and are guaranteed to be finite real numbers for each subinterval.
The Lower Darboux Sum ()
The lower Darboux sum of with respect to the partition is: This represents an approximation to the area under the graph of using rectangles that are inscribed under the graph.
The Upper Darboux Sum ()
The upper Darboux sum of with respect to the partition is: This represents an approximation using rectangles that are circumscribed over (or contain the part of) the graph of .
Remark: Bounds on Darboux Sums
Relationship to Global Function Bounds
If and are the global infimum and supremum of on , then for any subinterval : .
Overall Inequality Chain
Then, for any partition : .
Similarly, . So, .
The set of all lower sums and the set of all upper sums are bounded.
Lemma: Properties of Darboux Sums under Refinement
Squeezing In on the True Area This lemma tells us two important things:
- Refining helps: If you add more points to your partition (refine it), your lower sum can only get bigger (or stay the same), and your upper sum can only get smaller (or stay the same). They get closer to each other and, intuitively, closer to the true area.
- Lower sums are always below upper sums: Any lower sum (for any partition) is always less than or equal to any upper sum (for any, possibly different, partition). This is crucial for defining the integral.
Property 1: Effect of Refinement on Individual Sums
- If is a refinement of a partition (i.e., ), then:
Combining these, we always have .
Intuition: Imagine you have a single subinterval in . If splits this subinterval into two smaller ones, the minimum values on the smaller intervals can only be greater than or equal to the minimum on the larger original interval (and similarly for maximums). This pushes the lower sum up (or keeps it level) and the upper sum down (or keeps it level).
Property 2: Comparison of Any Lower Sum with Any Upper Sum
-
For any two arbitrary partitions of the interval : (The lower sum for any partition is less than or equal to the upper sum for any other partition, or even the same partition).
Proof Sketch for Property 2
Proof sketch for (2): Let be the common refinement of and .
Using part (1): (since refines ) We also know that for any single partition , (since for all ).
And again using part (1): (since refines ) Combining these inequalities: . Thus, .
Definition: Lower and Upper Darboux Integrals
The Best Possible Underestimate and Overestimate Now we consider all possible partitions of our interval.
- The lower Darboux integral () is the “highest” of all possible lower Darboux sums. It’s the supremum (least upper bound) of all the underestimates.
- The upper Darboux integral () is the “lowest” of all possible upper Darboux sums. It’s the infimum (greatest lower bound) of all the overestimates.
The Lower Darboux Integral ()
Let be the set of all possible partitions of the interval .
The lower Darboux integral of over is defined as:
The Upper Darboux Integral ()
The upper Darboux integral of over is defined as:
Fundamental Inequality:
Remark: From Lemma 5.1.2 (part 2), we know that any is a lower bound for the set of all upper sums . Therefore, the supremum of lower sums, , must be less than or equal to any particular upper sum . This makes a lower bound for the set of all upper sums. Thus, must be less than or equal to the infimum of the upper sums, .
So, for any bounded function , we always have:
Riemann Integrability
When Does the Area Make Sense? Riemann Integrability! A bounded function is said to be Riemann integrable on if its lower Darboux integral and its upper Darboux integral are equal. That is, if the best possible underestimate matches the best possible overestimate. If they are equal, this common value is defined to be the Riemann integral of from to , denoted . This is what we intuitively think of as the “area under the curve.”
Condition for Riemann Integrability
A bounded function is Riemann integrable (or simply “integrable”) on if its lower Darboux integral equals its upper Darboux integral:
The Riemann Integral
In this case, this common value is called the Riemann integral of over (or from to ) and is denoted by:
Consequence of Non-Integrability
If , the function is not Riemann integrable on . Intuitively, this means there’s an unresolvable “gap” between the total area of inscribed and circumscribed rectangles, no matter how fine the partition.
Continue here: 21 Dirichlet’s Function, Integrability