0% found this document useful (0 votes)
112 views5 pages

Inverse and Implicit Function Theorems Explained

The document summarizes the Inverse Function Theorem and Implicit Function Theorem. The Inverse Function Theorem states that if a function f is continuously differentiable and its differential is invertible at a point, then f has a local inverse. The Implicit Function Theorem gives conditions for solving a system of equations locally for one set of variables in terms of the other set. It states that if f is continuously differentiable, f(a,b)=0, and the partial differential with respect to the second set of variables is invertible at (a,b), then there exists a local function that gives one set of variables in terms of the other. Both theorems guarantee the local existence of inverse or implicit functions if the differential/partial

Uploaded by

peter
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
112 views5 pages

Inverse and Implicit Function Theorems Explained

The document summarizes the Inverse Function Theorem and Implicit Function Theorem. The Inverse Function Theorem states that if a function f is continuously differentiable and its differential is invertible at a point, then f has a local inverse. The Implicit Function Theorem gives conditions for solving a system of equations locally for one set of variables in terms of the other set. It states that if f is continuously differentiable, f(a,b)=0, and the partial differential with respect to the second set of variables is invertible at (a,b), then there exists a local function that gives one set of variables in terms of the other. Both theorems guarantee the local existence of inverse or implicit functions if the differential/partial

Uploaded by

peter
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

18.950 Handout 4. Inverse and Implicit Function Theorems.

Theorem 1 (Inverse Function Theorem). Suppose U ⊂ Rn is open,


f : U → Rn is C 1 , x0 ∈ U and dfx0 is invertible. Then there exists a
neighborhood V of x0 in U and a neighborhood W of f (x0 ) in Rn such that
f has a C 1 inverse g = f −1 : W → V. (Thus f (g(y)) = y for all y ∈ W
and g(f (x)) = x for all x ∈ V .) Moreover,
dgy = (dfg(y) )−1 for all y ∈ W
and g is smooth whenever f is smooth.
Remark. The theorem says that a continuously differentiable function f
between regions in Rn is locally invertible near points where its differential
is invertible.

Proof. Without loss of generality, we may assume that x0 = 0, f (x0 ) = 0


and dfx0 = I. (Otherwise, replace f with f�(x) = dfx−1 0
(f (x + x0 ) − f (x0 )).
Note that if the theorem holds with f , 0, 0, I and a function g� in place of f

x0 , f (x0 ), dfx0 and g respectively, then it is easily verified that the theorem
as stated holds with g(y) = x0 + g�(dfx−1 0
(y − f (x0 ))).)

Since dfx is continuous in x at x0 (see Exercise 1), there exists a number


r > 0 such that
1
x ∈ B r (0) =⇒ �dfx − I� ≤ .
2
(Recall that for a linear transformation A : Rn → Rm we define the norm
of A by �A� = sup{|v|≤1} |A(v)|.) Fix y ∈ Br/2 (0). Define a function φ by

φ(x) = x − f (x) + y.
Note that dφx = I − dfx and hence

�dφx � ≤ 1/2 if x ∈ B r (0).


Thus
1�
d
|φ(x)| ≤ |φ(x) − y | + |y | = | φ(tx)dt| + |y |
0 dt
� 1 � 1
= | dφtx · xdt| + |y | ≤ �dφtx �|x|dt + |y |
0 0
≤ r/2 + r/2 = r (1)

1
whenever x ∈ B r (0). i.e. φ is a map from B r (0) into itself. For any x, z ∈
B r (0),

�� 1 �
� d �
|φ(z) − φ(x)| =

� φ(x + t(z − x))dt��

0 dt
� 1
≤ |dφx+t(z−x) · (z − x)|dt
0
� 1
≤ �dφx+t(z−x) �|z − x|dt
0
1
≤ |z − x|.
2
Thus φ : B r (0) → B r (0) is a contraction, and hence φ has a unique fixed
point xy ∈ B r (0). i.e. there is a unique point xy ∈ B r (0) with f (xy ) = y. In
fact xy ∈ Br (0) since 2r > |y | = |f (xy )| ≥ |xy | − |xy − f (xy )| ≥ |xy | − 21 |xy | =
1 −1 (W ) ∩ B (0). Note then that V is open.
2 |xy |. Set W = Br/2 (0) and V = f r
Define g : W → V by g(y) = xy . Then f (g(y)) = y for all y ∈ W and
g(f (x)) = x for all x ∈ V.

Next we show that g is differentiable, with dgy = (dfg(y) )−1 . First note
that with ψ : Br (0) → Rn defined by ψ(x) = x − f (x), we have that for
x1 , x2 ∈ Br (0),

|x1 − x2 | − |f (x1 ) − f (x2 )| ≤ |(x1 − x2 ) − (f (x1 ) − f (x2 ))|


≤ |ψ(x1 ) − ψ(x2 )|
1
≤ |x1 − x2 |
2
where the last inequality follows by estimating as in (1), using dψx = I −dfx .
Hence
1
|x1 − x2 | ≤ |f (x1 ) − f (x2 )|
2
for any x1 , x2 ∈ Br (0), which implies

|g(y1 ) − g(y2 )| ≤ 2|y1 − y2 | (2)


for any y1 , y2 ∈ W = Br/2 (0). In particular, g is continuous.

2
Now fix y ∈ W , and let A = dfg(y) . Since W is open, there exists δ > 0
such that y + k ∈ W if k ∈ Bδ (0). Let h = g(y + k) − g(y). Then k =
y + k − y = f (g(y + k)) − f (g(y)) = f (g(y) + h) − f (g(y)) and hence, for
k ∈ Bδ (0) \ {0},

|g(y + k) − g(y) − A−1 k | |A−1 (Ah − k)| |h|

=
|k| |h| |k |

−1
�A �|k − Ah| |h|

|h| |k|
−1
�A �|f (g(y) + h) − f (g(y)) − Ah|
≤ 2 (3)
|h|

where the last estimate follows from (2). Note that since g(y+k) = g(y) =⇒
f (g(y + k)) = f (g(y)) =⇒ y + k = y =⇒ k = 0, we have that h = � 0
if k �= 0. Sice A = dfg(y) , it follows from the definition of differentiability
of f that the right hand side of (3) tends to 0 as h → 0, and hence, since
|h| ≤ 2|k | by (2), it follows that

|g(y + k) − g(y) − A−1 k |


lim = 0.
k→0 |k |
i.e. g is differentiable at y and

dgy = (dfg(y) )−1 . (4)


Finally, note that the function y �→ dgy is the composition of the function
y �→ dfg(y) and matrix inversion A �→ A−1 . Matrix inversion is a smooth map
of the entries, and the function y �→ dfg(y) is continuous since g is continuous
and f is C 1 . Hence we conclude that y �→ dgy is continuous; i.e. that g is
C 1 . Repeatedly differentiating (4) shows that g is smooth if f is smooth.

Exercise 1. Let L(Rn ; Rn ) be the set of linear transformations from Rn


into itself with the metric d(A, B) = �A − B�. (cf. Exercise 10 of handout
1.) Let U ⊂ Rn be open and f : U → Rn be a C 1 function. Show that the
map x �→ dfx is continuous as a map from U into L(Rn ; Rn ).

Exercise 2. Suppose g : [a, b] → Rn is continuous. Show that


�� b � � b
� �
� �≤
g(t)dt
|g(t)|dt

a a

3
where
�� | · |� denotes the Euclidean norm. You may use without proof that
� b � �b
� a h(t)dt� ≤ a |h(t)|dt for a scalar valued function h.

Exercise 3. Define f : R → R by f (x) = x2 + x2 sin x1 if x = � 0 and


f (0) = 0. Compute f � (x) for all x ∈ R. Show that f � (0) > 0, yet f is not
one­to­one in any neighborhood of 0. This example shows that in the Inverse
Function Theorem, the hypothesis that f is C 1 cannot be weakened to the
hypothesis that f is differentiable.

Exercise 4. Define f : R2 → R2 by f (x, y) = (ex cos y, ex sin y). Show


that f is C 1 and that df(x,y) is invertible for all (x, y) ∈ R2 and yet f is
not a one­to­one function globally. Why doesn’t this contradict the Inverse
Function Theorem?

Next we prove the Implicit Function Theorem. This theorem gives con­
ditions under which one can solve, locally, a system of equations

fi (x, y) = 0, i = 1, 2, . . . n
where x ∈ Rm and y ∈ Rn , for y in terms of x. (Thus, y = (y1 , . . . , yn )
where y1 , . . . , yn are regarded as n unknowns, satisfying the n equations
fi (x, y) = 0, i = 1, . . . , n.) Geometrically, the set of solutions (x, y) to the
system of equations is the graph of a function y = g(x). Note that we have
from linear algebra that if for each i, the function fi is linear with constant
coefficients
� � in the variables yj , then whenever the (constant) n × n matrix
∂ fi
∂ yj is invertible, the system of equations is solvable for y in terms
1≤i,j≤n
of x. Implicit function theorem says that whenever fi are C 1 and this matrix
is invertible at a point (a, b), then the system is solvable for y in terms of x
locally in a neighborhood of (a, b).

We shall use the following notation: For an Rn valued function f (x, y) =


(f1 (x, y), f2 (x, y), . . . , fn (x, y)) in a domain U ⊂ Rm+n ≡ Rm × Rn , where
x ∈ Rm , y ∈ Rn , we shall � denote
� by dx f the partial differential represented
∂ fi
by the n × m matrix ∂ xj and by dy f the partial differential
1≤i≤n,1≤j≤m
� �
represented by the n × n matrix ∂∂ yfji .
1≤i,j≤n

Theorem 2 (Implicit Function Theorem). Let U ⊂ Rm+n ≡ Rm × Rn


be an open set, f : U → Rn a C 1 function, (a, b) ∈ U a point such that
f (a, b) = 0 and dy f |(a,b) invertible. Then there exists a neighborhood V of

4
(a, b) in U , a neighborhood W of a in Rm and a C 1 function g : W → Rn
such that

{(x, y) ∈ V : f (x, y) = 0} = {(x, g(x)) : x ∈ W } .


Moreover,

−1 �
dgx = − (dy f ) �
dx f |(x,g(x))
(x,g(x))

and g is smooth if f is smooth.

Proof. Define F : U → Rm+n by F (x, y) = (x, f (x, y)). Then F is C 1 in


U, F (a, b) = (a, 0) and det dF(a,b) = det dy f |(a,b) �= 0. Hence by the Inverse
Function Theorem, F has a C 1 inverse F −1 : W � → V for neighborhoods
V of (a, b) and
W of (a, 0) in R × R . Set W = {x ∈ Rm : (x, 0) ∈ W
� m n � }.
m
Then W is open in R . Note then that if x ∈ W , then (x, 0) ∈ W � so that
(x, 0) = F (x1 , y1 ) where (x1 , y1 ) ∈ V is uniquely determined by x. (In fact,
by the definition of F , x1 = x.) Define g : W → Rn by setting y1 = g(x).
Thus g(x) is defined by F −1 (x, 0) = (x, g(x)); i.e. by g(x) = π ◦ F −1 (x, 0)
where π : Rm × Rn → Rn is the projection map π(x, y) = y. Then {(x, y) ∈
V : f (x, y) = 0} = {(x, y) ∈ V : F (x, y) = (x, 0)} = {(x, g(x)) : x ∈ W }.
Since π is a smooth map and F −1 is C 1 , it follows that g is C 1 . The formula
for dgx follows by differentiating the identity

f (x, g(x)) ≡ 0 on W
using the chain rule. By repeatedly differentiating this identity, it follows
that g is smooth if f is smooth.

You might also like