Lecture from: 29.04.2024 | Video: Video ETHZ

Review: Key Results from the Mean Value Theorem – What the Derivative Tells Us

Let’s quickly remember the main takeaways from the Mean Value Theorem (MVT): If we have a function that’s nice and smooth (continuous on and differentiable on ), its derivative gives us a lot of information about how behaves over the whole interval:

  • If is always zero inside the interval, then must be a constant function. It’s not going up or down.
  • If is always non-negative (), then is monotonically increasing (it might flatten out, but it never goes down).
  • If is always strictly positive (), then is strictly monotonically increasing (it’s always going up).
  • Similarly, if is always non-positive () or strictly negative (), then is monotonically decreasing or strictly monotonically decreasing, respectively.

And for inverse functions: If is well-behaved (continuous, strictly monotonic) and its derivative isn’t zero, then its inverse is also differentiable at the corresponding point , and its derivative is just the reciprocal: .

Example: Derivatives of Inverse Trigonometric Functions – Putting the Inverse Rule to Work

Let’s see how to find the derivatives of the “arc” functions, which are the inverses of our standard trigonometric functions (when restricted to suitable domains to make them invertible).

1. Arcsine Function ( or )

First, we need to make invertible. We do this by looking at it only on the interval .

On this interval, does a few nice things:

  • It’s strictly monotonically increasing. Why? Because its derivative, , is strictly positive for between and (think about the cosine graph in that region).
  • It hits all the values between (at ) and (at ).

Because it’s strictly increasing and covers this range, it’s bijective (one-to-one and onto) from to .

This means we can define an inverse function, which we call arcsine: .

Now, for its derivative: Since is not zero on (the interior of the domain where sine is invertible), arcsine will be differentiable on (the interior of its domain).

Let , where . Then .

Using our rule for the derivative of an inverse function: This is correct, but we usually want the derivative in terms of . We need to express using .

The fundamental identity comes to the rescue. So, . Since is in , is positive. Thus, .

Substituting , we get . Plugging this back into our derivative formula:

2. Arccosine Function ( or )

The story is very similar for arccosine. We restrict to the interval . On this interval, is strictly monotonically decreasing (because its derivative is negative for ). It’s also bijective.

The inverse function is .

It’s differentiable on . If for : Using , and noting that for , we have . So,

3. Arctangent Function ( or )

For the tangent function, , we look at the interval . Its derivative is . Since is positive for , is also positive. Thus, is strictly monotonically increasing.

As approaches from the left, . As approaches from the right, . This means maps bijectively onto the entire real line .

The inverse function is . It’s differentiable everywhere on . If for :

How to write in terms of ?

Recall the identity: .

So, .

Substituting , we get . Therefore,

L’Hôpital’s Rule: A Clever Trick for Tricky Limits

Sometimes we encounter limits of fractions where both the numerator and the denominator head towards (an ” form”) or both head towards (an ” form”). These are called “indeterminate forms” because we can’t tell the limit just by looking. L’Hôpital’s Rule gives us a way to tackle these.

Theorem: L’Hospital Rule

Let . Suppose and are differentiable functions from to , and importantly, for all in .

Now, assume one of these two scenarios as approaches from the left (similar statements hold for or for some ):

  1. The “0/0” case: AND .
  2. The "" case: AND .

IF the limit of the ratio of their derivatives exists: (where can be a finite number, or , or ).

THEN, L’Hôpital’s Rule says that the limit of the original ratio also exists and is the same:

Important Notes:

  • can be a real number, , or .
  • The interval can be infinite; for example, could be or could be .
  • The rule also works for limits from the right () or two-sided limits ().

A Quick Intuition for the “0/0” Case

Why should this rule work? Imagine we’re near a point where both and .

Near , functions are well-approximated by their tangent lines:

So, the ratio near should be approximately: (assuming and ).

This suggests that the limit of the ratio of the functions might be the ratio of their derivatives. L’Hôpital’s Rule formalizes this.

Examples: L’Hospital’s Rule in Action

  1. Logarithm vs. Power: For , what is ? As , and . This is an "" case. Let , so . Let , so .

    Now, look at the limit of the ratio of derivatives: Since , as , . So, . Because this limit (0) exists, L’Hôpital’s Rule tells us:

    The takeaway: The natural logarithm grows much slower than any positive power of .

  2. A form: What is ? As , and . This "" form is indeterminate. We need to rewrite it as a fraction to use L’Hôpital’s. . As , and . Now it’s a "" form. Let . Let . The limit of the ratio of derivatives: Since this limit exists, by L’Hôpital’s Rule:

  3. Clicker Question Revisited: . As , numerator , and denominator . It’s a "" case.

    First application of L’Hôpital: Derivative of numerator: . Derivative of denominator: . So we consider . This is still "".

    Second application of L’Hôpital: Derivative of new numerator: . Derivative of new denominator: . Now consider the limit: Since this limit exists, our original limit is also 4.

    Alternative for the Clicker (without a second L’Hôpital): After the first L’Hôpital step, we had . Using the double angle formula :

A Word of Caution: When L’Hospital’s Rule Fails or Misleads

Consider the limit . This is an "" form.

If we try L’Hôpital’s Rule:

The limit does not exist because and oscillate.

Crucial Point: If does not exist, L’Hôpital’s Rule tells us nothing about the original limit .

In this specific case, we can find the original limit directly: Since and , and . So the limit is .

L’Hôpital’s Rule was not helpful here because its condition (existence of the limit of derivative ratios) was not met.

Rule of Thumb: Only apply L’Hôpital’s Rule if the limit actually exists (or is ). Otherwise, you might be led astray or conclude nothing.

Convex Functions

For the remainder of this chapter, we’ll assume is an interval containing more than one point.

Definition: Convexity

A function is said to be:

  1. Convex on if for any two points and any (which parameterizes the segment between and ):
  2. Strictly convex on if for any two distinct points () and any (strictly between and ):

Geometric Intuition: The point is a point on the line segment connecting and on the x-axis. The value is the function’s value at this point.

The expression represents the y-value on the straight line segment (the “secant line”) connecting the points and on the graph of .

So, convexity means that the graph of the function always lies below or on any of its secant lines.

A Classic Example: defined on .

This function is convex. By the triangle inequality, for any and : .

Since and , this becomes . So, .

However, is not strictly convex (e.g., take ; then and . Here . But if , and . Equality can hold for non-distinct points on a straight segment of the graph). It’s not strictly convex because, for instance, if and are both positive, the secant line is the function itself between and .

Remark: Jensen’s Inequality (Generalized Convexity)

If is convex, then for any points and any non-negative weights that sum to (), Jensen’s inequality holds: The definition we gave is just the case.

Lemma: Convexity and Slopes of Secant Lines

A function is convex if and only if for any three points in , the slope of the secant line from to is less than or equal to the slope of the secant line from to : For strict convexity, the inequality in is strict () when are distinct.

Why this matters: This condition means that as you move from left to right, the slopes of the secant lines are non-decreasing. This is a key characteristic.

Why are convex functions important? They appear everywhere, especially in optimization. If you’re trying to minimize a convex function, any local minimum you find is guaranteed to be a global minimum. This makes finding the “best” solution much easier, and many efficient algorithms rely on this property.

Quick Exercise: If is convex, prove that any local minimum of is also a global minimum on .

How Can We Tell if a Function is Convex? (Using Derivatives)

If our function is differentiable, the derivative gives us a powerful way to check for convexity.

Theorem: Convexity and the First Derivative

Let be a differentiable function. Then:

  • is convex on (its derivative) is monotonically increasing on .
  • is strictly convex on is strictly monotonically increasing on .

Proof Sketch (for increasing convex)

We want to use the slope condition from the previous Lemma. Take any in . We need to show .

By the Mean Value Theorem, there’s an such that . And there’s a such that .

Since , we have . If is monotonically increasing, then . This directly gives us , which by the Lemma means is convex.

Corollary: Convexity and the Second Derivative

Now, if is twice differentiable on (meaning exists and is itself differentiable, giving ):

  • If for all , then is convex on .
  • If for all , then is strictly convex on .

Proof

If for all , then the derivative of (which is ) is non-negative. By the consequences of the MVT (Corollary 4.2.5, part 3), this means is monotonically increasing.

Then, by the theorem we just proved (Theorem 4.2.16), if is monotonically increasing, is convex. The argument for strict convexity is similar.

Example: Is convex on ?

Let’s check its second derivative.

Since , is always positive, so for all .

Therefore, is strictly convex on .

Continue here: 19 Higher Order Derivatives, Smooth Functions, Power Series and Taylor Approximation