Formal Derivatives

Formal Derivatives, An Introduction

Introduction

Let R be any commutative ring, and let p(x) be a polynomial with coefficients in R.  The formal derivative of p(x) is produced by folding the exponents into the coefficients, and decreasing the exponents by 1.  Thus 4x3+5x+1 becomes 12x2+5.

If you've studied differential calculus, this definition will look familiar.  But it has nothing to do with calculus; hence it is called a formal derivative.  For instance, R might have characteristic n.  In other words, R is an extension of the integers mod n.  In such a ring, the derivative of xn-1 is 0, simply because nxn-1 is 0.

Verify that the derivative of p(x)+q(x) is the sum p′+q′, and the derivative of c×p(x) is c times p′.  This follows directly from the definition.

Next prove the product rule for differentiation.  Expand p*q using the distributive property, giving a sum of various terms c×xi×xj.  The derivative of such a term is c×(i+j)×xi+j-1.  This is the same as,

c × (i×xi-1×xj + j×xj-1×xi)

Regroup terms to show the derivative of p×q is p′×q + p×q′.

Using induction on n, prove that the derivative of p(x)n is n times p(x)n-1 times p′(x).  Spread this across an entire polynomial and we have the chain rule.  The derivative of q(p) is q′(p)×p′.