
Linearization in Two Variables: Total Differential and Tangent Plane

Published 9/9/2025
Extension to Multiple Variables (Two-Variable Case)Let f:R2→R. We study the change of f when moving from (x0,y0) by (h,k).If f is differentiable at (x0,y0), there exist constants A,B and an error E(h,k) withf(x0+h,y0+k)−f(x0,y0)=Ah+Bk+E(h,k),(h,k)→(0,0)limh2+k2E(h,k)=0(equivalently E(h,k)=ε(h,k)h2+k2, ε→0).These constants are the partials: A=∂x∂f(x0,y0),B=∂y∂f(x0,y0).Total Differential & Linearizationdf=∂x∂f(x0,y0)dx+∂y∂f(x0,y0)dy,and the linear approximation (tangent plane) isf(x0+h,y0+k)≈f(x0,y0)+∂x∂f(x0,y0)h+∂y∂f(x0,y0)k.In vector form: f(x0+h,y0+k)≈f(x0,y0)+∇f(x0,y0)⋅[hk], with error o(h2+k2).For a chosen finite step (Δx,Δy), set dx=Δx, dy=Δy to get Δf≈df.Why A,B are the partials (Uniqueness Argument)Fix k=0: f(x0+h,y0)−f(x0,y0)=Ah+E(h,0), ∣h∣E(h,0)→0.Dividing by h and letting h→0 gives A=∂f/∂x(x0,y0). Similarly with h=0 for B.Differentiability vs. Partial DerivativesDifferentiability is stronger than mere existence of partials. If the partialsare continuous near (x0,y0) (the usual C1 condition), then f is differentiable at (x0,y0).Equivalently, differentiability means a single linear map L(h,k)=Ah+Bk fits all directions,with a remainder o(h2+k2).Directional Derivatives & GradientFor a unit vector u=(u1,u2), Duf(x0,y0)=t→0limtf((x0,y0)+tu)−f(x0,y0).If f is differentiable at (x0,y0), Duf(x0,y0)=∇f(x0,y0)⋅u.Thus ∇f(x0,y0) gives the direction of steepest increase, with magnitude ∥∇f∥.Why Duf=∇f⋅u and why ∥∇f∥ is the maximal rateDifferentiability at (x0,y0) gives one linear map L withf(x0+h,y0+k)=f(x0,y0)+L(h,k)+r(h,k), r(h,k)=o(h2+k2).For f:R2→R, L(h,k)=fx(x0,y0)h+fy(x0,y0)k=∇f(x0,y0)⋅(h,k).Along γ(t)=(x0,y0)+tu, ∥u∥=1: f(γ(t))−f(x0,y0)=t∇f(x0,y0)⋅u+r(tu).Divide by t and let t→0: Duf(x0,y0)=∇f(x0,y0)⋅u.By Cauchy–Schwarz, for unit u: Duf=∇f⋅u≤∥∇f∥,with equality when u=∇f/∥∇f∥ (∇f=0). Thus ∇f points in the direction of steepest increase and ∥∇f∥ is the maximal directional rate.Example. Let f(x,y)=x2+xy at (x0,y0)=(2,1).∂x∂f=2x+y,∂y∂f=x ⇒ ∂x∂f(2,1)=5, ∂y∂f(2,1)=2, f(2,1)=6.Linearization near (2,1): f(2+h,1+k)≈6+5h+2k.Check for h=k=0.1: f(2.1,1.1)=6.72, linear ≈6+5(0.1)+2(0.1)=6.7, error 0.02.Distance of step h2+k2=0.12+0.12≈0.141, and the error is o(h2+k2).