Numerical differentiation

Numerical differentiation#

Introduction#

Every elementary function can be differentiated analytically. The definition of the derivative is:

\[ f'(x)=\lim_{\Delta x \rightarrow 0}\frac{f(x+\Delta x)-f(x)}{\Delta x}. \]

Directly applying the equation above leads to subtracting very similar function values (\(f(x+\Delta x)\), \(f(x)\)) that are burdened by round-off error, and then dividing them by a small value \(\Delta x\); as a result, the derivative has substantially fewer significant digits than the function values. We avoid numerical differentiation whenever we can; in some cases, however (e.g., solving differential equations), it is an indispensable tool!

In numerical differentiation we have two fundamentally different approaches:

first perform interpolation/approximation, then compute the derivative based on the known interpolation/approximation functions (we already discussed this topic in interpolation and approximation), and
computing the derivative directly from the values in a table.

In this chapter we will learn how to numerically compute the derivative of a function \(f(x)\); here the values of the function \(f(x)\) are given in tabular form (pairs \(x_i\), \(y_i\)), as shown in the figure: We will first focus on equidistant values \(x_i\) spaced with a step \(h\); the function values will be \(y_i=f(x_i)\).

Based on the definition of the derivative above, the first derivative (at location \(i\)) could be written as:

\[ y_i'=\frac{y_{i+1}-y_{i}}{h}, \]

where \(h=x_{i+1}-x_{i}\). By rearranging the equation:

\[ y_i'=-\frac{y_{i}}{h}+\frac{y_{i+1}}{h}, \]

we can also say that for the first derivative of the function at location \(i\) we weight the function value at \(i\) by \(-1/h\) and the function value at \(i+1\) by \(+1/h\).

In what follows we will look at the theoretical background of how to determine the appropriate weights for various orders of derivatives, what options we have here, and how this affects the order of accuracy.

Approximating the first derivative using the finite-difference method#

As an introduction, let’s watch the video below:

from IPython.display import YouTubeVideo
YouTubeVideo('YYuGL-VP2BE', width=800, height=300)

The derivative \(f'(x)\) can be approximated based on a Taylor series expansion. We call this method the finite-difference method or the difference method.

Let’s expand the forward Taylor series (forward, because of the \(+h\) term):

\[ f{\left (x + h \right )} =\sum_{n=0}^{\infty}\frac{h^n}{n!}\frac{d^n}{dx^n}f(x)= f{\left (x \right )} + h\, f'\left (x \right ) + \underbrace{\frac{h^2}{2}\,f''(x)+\cdots}_{\mathcal{O}\left(h^{2}\right)} \]

The term \(\mathcal{O}\left(h^{2}\right)\) denotes a second-order error. If we express the first derivative from the equation:

\[ f'{\left (x \right )}=\frac{1}{h}\left(f{\left (x + h \right )} - f{\left (x \right )}\right) - \underbrace{\frac{h}{2}\,f''(x)+\cdots}_{\mathcal{O}\left(h^{1}\right)} \]

We find that we can estimate the first derivative at the point \(x_i\) (that is: \(f_o'(x_i)\)) based on two consecutive function values:

\[ f_o'(x_i)=\frac{1}{h}\left(y_{i+1}-y_i\right) \]

and in doing so we make a method error, which is of first order \(\mathcal{O}\left(h^{1}\right)\).

We used \(y_i=f(x_i)\) (see the figure above).

The error is:

\[e=-\frac{h}{2}\,f''(\xi),\]

where \(\xi\) is an unknown value on the interval \([x_i, x_{i+1}]\) and we neglected the higher-order terms.

The following expression therefore holds:

\[f'(x_i)=f_o'(x_i)+e\]

Now let’s see how we arrive at the same result with a machine-assisted derivation; first we import sympy:

import sympy as sym
sym.init_printing()

Let’s define the symbols:

f = sym.Function('f')
x, h = sym.symbols('x, h')

Then we continue with the forward Taylor series expansion:

f(x+h).series(h, n=2)

../_images/e7bdd43315b530ed78ee51ce8fb647cd82b3f141352f49fdab0682d91afb6dd6.png

The term \(\mathcal{O}\left(h^{2}\right)\) contains terms of second and higher order. The equation above uses a temporary variable for differentiation \(\xi_1\); let’s carry out the differentiation and substitute \(\xi_1=x\):

f(x+h).series(h, n=2).doit()

../_images/668177f3d890d56bd33aea45725e36b838a5d02e2b67cb4787fe5da8deffebec.png

We write the equation:

equation = sym.Eq(f(x+h), f(x+h).series(h, n=2).doit())
equation

../_images/cc1240802dcac7a6fe0df117ba9ad23b1f28db98a21453c0d61cb0b2bd59c787.png

We solve it for the first derivative \(f'(x)\):

f1_forward_exact = sym.solve(equation, f(x).diff(x))[0]
f1_forward_exact

../_images/d42fa1dbddffa76ccf7e4c77a0ef0b07915fdaf368a3a4943c25a82e9287d42d.png

If we do not account for the second- and higher-order derivatives, we have therefore made an error:

f1_forward_O = f1_forward_exact.expand().getO()
f1_forward_O

../_images/bfff2b1d7fb2740e6961450d5c43dfb746b739fa807d86c8d5631d0b4d5912de.png

The error \(\mathcal{O}\left(h^{1}\right)\) is therefore of first order, and if we neglect this term we make a method error and obtain an estimate of the derivative:

f1_forward_estimate = f1_forward_exact.expand().removeO()
f1_forward_estimate

../_images/4b9754b2843de0a2097b3f1b8cf801694bd1850f3704ce846f3f506fb2fe1fda.png

We find that this is the same expression we derived above, so it is:

\[y_i'=\frac{1}{h}\left(-y_i+y_{i+1}\right).\]

The weights are therefore:

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i}\)	\(y_{i+1}\)
\(y_i'=\frac{1}{h}\cdot\)	-1	1

Central difference scheme#

Derivative \(f'(x)\)#

First let’s look at the backward Taylor series expansion:

f(x-h).series(h, n=3).doit()

../_images/139a563d74c47041cc8ac8d5a10806ebf81b0db9e3db7ed2dd860a71a1f824dd.png

We find that in the difference between the forward and backward series the even-order terms cancel out; let’s define:

def difference(n=3):
    return f(x+h).series(h, n=n).doit()-f(x-h).series(h, n=n).doit()
difference(n=3)

../_images/40d219642293d014b6a34288723a9770945aa205d4d39bd89ff57dcbeaffaef0.png

We carry out the following steps:

subtract the backward Taylor series from the forward series, the even-order derivatives cancel out,
solve the equation for the first derivative,
determine the method error,
determine the estimate of the derivative.

Let’s carry out the steps above:

f1_cent_exact = sym.solve(
           sym.Eq(f(x+h) - f(x-h), difference(n=3)), # step 1
           f(x).diff(x))[0]                       # step 2
f1_cent_O = f1_cent_exact.expand().getO()         # step 3
f1_cent_estimate = f1_cent_exact.expand().removeO()  # step 4

The estimate of the 1st derivative is therefore:

f1_cent_estimate

../_images/8514e4a913ee2131b9f50696ac556ec194de812cdc228c5954e30877dac6f04b.png

Or:

\[y_i'=\frac{1}{2h}\left(-y_{i-1}+y_{i+1}\right)\]

The weights are therefore:

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-1}\)	\(y_{i}\)	\(y_{i+1}\)
\(y_i'=\frac{1}{2h}\cdot\)	-1	0	1

The method error is therefore of second order:

f1_cent_O

../_images/e5e097012a0adf699dc5b9be7d3b46dcdee991455f09939b5d54b37e6009f216.png

Example: \(\exp(-x)\)#

Let’s look at an example of the exponential function \(f(x)=\exp(-x)\) and, for the point \(x=1.0\), compute the first derivative \(f'(x)=-\exp(-x)\) at the steps \(h_0=1\) and \(h_1=0.1\).

First we prepare a table of numerical values and the exact result:

import numpy as np
x0 = np.array([0., 1., 2.]) # step h=1
y0 = np.exp(-x0)
h0 = x0[1]-x0[0]

x1 = np.array([0.9, 1.0, 1.1]) # step h=0.1
y1 = np.exp(-x1)
h1 = x1[1]-x1[0]

f1_exact0 = - np.exp(-1) # exact result
f1_exact1 = - np.exp(-1) # exact result

Then we use the forward scheme:

f1_forward_estimate # as a reminder

f1_forward0 = (y0[1:]-y0[:-1])/h0   # step h_0
f1_forward1 = (y1[1:]-y1[:-1])/h1   # step h_1

Let’s compute the error at \(x=1.0\):

f1_exact0 - f1_forward0[1]

../_images/f3bce32b05c514e0cd9fa595f43f2ed01b370c2c3d4c9e15944e24851c18de18.png

f1_exact1 - f1_forward1[1] # step h1

../_images/878e6bf55915207b54ca28cdb53dd846dce5d5e2787864a565c20d5c489edc8f.png

We can confirm that the error at step \(h/10\) is indeed approximately 1/10 of that at step \(h\).

Now let’s also look at the error for the central difference scheme, which is of second order:

f1_cent_estimate

f1_cent0 = (y0[2:]-y0[:-2])/(2*h0) # step h_0
f1_cent1 = (y1[2:]-y1[:-2])/(2*h1) # step h_1

Let’s analyze the error:

f1_exact0 - f1_cent0[0] # step h0

../_images/4cb4587d5ce740138e6107c00486f801c721064f86a05516a007adcfa4e50afc.png

f1_exact1 - f1_cent1[0] # step h1

../_images/9815f4939edc7b3236f11faa6a12ea09385d1e36dea6ef3975678f27ecd33a34.png

We can confirm that the error at step \(h/10\) is indeed approximately 1/100 of that at step \(h\).

Derivative \(f''(x)\)#

If we add the forward and backward Taylor series, the odd-order derivatives cancel out:

def total(n=3):
    return f(x+h).series(h, n=n).doit() + f(x-h).series(h, n=n).doit()
total(n=4)

../_images/9cb89c8a066d8bcc06edf7fe1fef124fa897a6485e1dd30602972ecffbacf604.png

Let’s determine the second derivative:

f2_cent_exact = sym.solve(
           sym.Eq(f(x+h) + f(x-h), total(n=4)),   # step 1
           f(x).diff(x,2))[0]                     # step 2
f2_cent_O = f2_cent_exact.expand().getO()         # step 3
f2_cent_estimate = f2_cent_exact.expand().removeO()  # step 4

The estimate of the second derivative is:

f2_cent_estimate

../_images/9eb46761984e85ddd6722bce00d061aacd65d9d23b353ea4eb38022f37e64d76.png

Or:

\[y_i''=\frac{1}{h^2}\left(y_{i-1}-2\,y_{i}+y_{i+1}\right)\]

The weights are therefore:

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-1}\)	\(y_{i}\)	\(y_{i+1}\)
\(y_i''=\frac{1}{h^2}\cdot\)	1	-2	1

The method error is again of second order:

f2_cent_O

Derivative \(f'''(x)\)#

If we want to determine the third derivative, we have to expand the Taylor series to order 5:

eq_h = sym.Eq(f(x+h)-f(x-h), difference(n=5))
eq_h

../_images/f6c9fad739c960445329e9ed7836351645d3b950a558f856d34e3029f0e8d297.png

Using the 1st derivative we derived above would not help us, since the order of the error is \(\mathcal{O}\left(h^{2}\right)\), which means that in the expression above, when dividing by \(h^3\), we would get \(\mathcal{O}\left(h^{-1}\right)\).

Let’s use a trick: we repeat the expansion, but based on additional points that are \(2h\) and \(-2h\) away from \(x\):

eq_2h = eq_h.subs(h, 2*h)
eq_2h

../_images/63b6bb2ef15adae5fc2b93c7a5fa1e12d55bc1c1debf34c8c907b2d02c380d6f.png

Now we have two equations and two unknowns; we will solve the system step by step:

solve equation eq_h for the first derivative,
solve equation eq_2h for the first derivative,
equate the results of the first two steps and solve for the third derivative,
determine the method error,
determine the estimate of the derivative.

Let’s carry out the stated steps:

f3_cent_exact = sym.solve(
        sym.Eq(sym.solve(eq_h, f(x).diff(x))[0],  # step 1
        sym.solve(eq_2h, f(x).diff(x))[0]),       # step 2
        f(x).diff(x,3))[0]                        # step 3
f3_cent_O = f3_cent_exact.expand().getO()         # step 4
f3_cent_estimate = f3_cent_exact.expand().removeO()  # step 5

The estimate of the 3rd derivative is:

f3_cent_estimate

../_images/fe9cf6e846fc9e7fa8b2990e39f8c6dfc9db155d2a7006031a0e4097fecda71c.png

Or:

\[y_i'''=\frac{1}{h^3}\left(-y_{i-2}/2+y_{i-1}-y_{i+1}+y_{i+2}/2\right)\]

The weights are therefore:

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-2}\)	\(y_{i-1}\)	\(y_{i}\)	\(y_{i+1}\)	\(y_{i+2}\)
\(y_i'''=\frac{1}{h^3}\cdot\)	-0.5	1	0	-1	0.5

Let’s confirm that the method error is of second order:

f3_cent_O

Derivative \(f^{(4)}(x)\)#

Let’s repeat a procedure similar to the one for the 3rd derivative, but for the 4th derivative we add the forward and backward Taylor series (up to order 6):

eq_h = sym.Eq(f(x+h)+f(x-h), total(n=6))
eq_h

../_images/818a8789d470fb7d1ad5c914ac130546771997730f3f709d3cb91a698b5406fa.png

Let’s prepare an additional equation based on the points that are \(2h\) and \(-2h\) away from \(x\):

eq_2h = eq_h.subs(h, 2*h)
eq_2h

../_images/5d23b4d07eabcb58a347a2c81a379b59f0273d282dc5ec1aae31afbe2f605359.png

From the two equations we determine the 4th derivative:

f4_cent_exact = sym.solve(
        sym.Eq(sym.solve(eq_h, f(x).diff(x,2))[0],  # step 1
        sym.solve(eq_2h, f(x).diff(x,2))[0]),       # step 2
        f(x).diff(x,4))[0]                          # step 3
f4_cent_O = f4_cent_exact.expand().getO()           # step 4
f4_cent_estimate = f4_cent_exact.expand().removeO()    # step 5

The estimate of the 4th derivative is:

f4_cent_estimate

../_images/d538f8edc55001b0a7c6775cf705a2bbdede65f49e5be6d0dfaf78b860cc295e.png

Or:

\[y_i^{(4)}=\frac{1}{h^4}\left(y_{i-2}-4\,y_{i-1}+6\,y_i-4\,y_{i+1}+y_{i+2}\right)\]

The weights are therefore:

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-2}\)	\(y_{i-1}\)	\(y_{i}\)	\(y_{i+1}\)	\(y_{i+2}\)
\(y_i^{(4)}=\frac{1}{h^4}\cdot\)	1	-4	6	-4	1

Let’s confirm that the method error is of second order:

f4_cent_O

Summary of the central difference scheme#

Above we derived the first four derivatives with a 2nd-order method error. The key point of the derivations above is that they give us the weights by which we must multiply the function values in order to compute the approximation of a given derivative. For this reason, we will collect these weights here. If you are impatient, you can jump to the table below. Continue reading if you want to learn how to organize the previously derived expressions by machine.

First let’s collect all the derivative estimates into a list:

derivatives = [f1_cent_estimate, f2_cent_estimate, f3_cent_estimate, f4_cent_estimate]
derivatives

../_images/5b534b10f74a3e4e113a9bd6d933afcf253ab88a7e760938885f1915ece7ca95.png

We have 5 function values available (at locations \(x-2h, x-h, x, x+h, x+2h\)), which we put into a list:

function_values = [f(x-2*h), f(x-h), f(x), f(x+h), f(x+2*h)]

We compute the weight of the first derivative for the function value \(f(x-h)\):

f1_cent_estimate.expand().coeff(function_values[1])

../_images/8d3c62a982d76556e16791b35c8668a30a5ca34dd5c2e205c28803be56c419fe.png

Now let’s generalize and compute the weights for all function values and for all derivative estimates:

central_diff_scheme = [[derivative.expand().coeff(fv) for fv in function_values] \
                        for derivative in derivatives]
central_diff_scheme

../_images/8695af3e331f3937c4b5d55e7b572a6d27c949ef9917da71113d38b4448f31af.png

The summaries above can also be written in tabular form:

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-2}\)	\(y_{i-1}\)	\(y_{i}\)	\(y_{i+1}\)	\(y_{i+2}\)
\(y_i'=\frac{1}{h}\cdot\)	0	-0.5	0	0.5	0
\(y_i''=\frac{1}{h^2}\cdot\)	0	1	-2	1	0
\(y_i'''=\frac{1}{h^3}\cdot\)	-0.5	1	0	-1	0.5
\(y_i^{(4)}=\frac{1}{h^4}\cdot\)	1	-4	6	-4	1

The central difference scheme shown has a 2nd-order error \(\mathcal{O}(h^{2})\).

Note: if you are interested in a generalization of the difference method, please see the findiff package.

Improved approximation - Richardson extrapolation#

If the exact derivative is computed as:

\[f'(x_i)=f_o'(x_i)+e,\]

where \(f_o'(x_i)\) is the numerically computed derivative at the point \(x_i\) and \(e\) is the error estimate.

For a method of accuracy order \(n\): \(\mathcal{O}(h^{n})\) at step \(h\) the following holds:

\[f'(x_i)=f_o'(x_i, h)+K\,h^n,\]

where \(K\) is an unknown constant.

If we halve the step and assume that \(K\) does not change, the following holds:

\[f'(x_i)=f_o'\left(x_i, \frac{h}{2}\right)+K\,\left(\frac{h}{2}\right)^n.\]

From both equations we eliminate the constant \(K\) and determine the improved approximation:

\[\overline{f}'(x_i)=\frac{2^n\,f_o'\left(x_i, \frac{h}{2}\right)-f_o'(x_i, h)}{2^n-1}\]

Example#

Let’s look at the example \(f(x)=\sin(x)\) (the analytical derivative is: \(f'(x)=\cos(x)\)):

x = np.linspace(0, 2*np.pi, 9)
y = np.sin(x)

At step \(h\) we have the function values defined at:

array([0.        , 0.78539816, 1.57079633, 2.35619449, 3.14159265,
       3.92699082, 4.71238898, 5.49778714, 6.28318531])

The numerical derivative at \(x=\pi\):

x[4]

../_images/25f9db0a7cd6f6ed0a3cd35b7119a298f04a9cd1070aa618d5835fc0f91b6e4b.png

and at step \(h\) is

h = x[1] - x[0]
f_estimate_h = (-0.5*y[3] + 0.5*y[5])/h
f_estimate_h

../_images/88a0c8016c2626e169df92ce92c0651e4178b8fec17d851e124ae538ce66075e.png

The computation at step \(2h\):

h2 = x[2] - x[0]
f_estimate_2h = (-0.5*y[2] + 0.5*y[6])/h2
f_estimate_2h

../_images/95a615b32c2dd721c0b50ec39dcae45ba1de95164358b3b342f6a3fe69fc1687.png

Let’s compute the improved estimate for \(x=\pi\):

f_estimate_improved = (2**2 * f_estimate_h - f_estimate_2h)/(2**2-1)
f_estimate_improved

../_images/49f3b8e77adaca32c410a9610624c16bbf016796471acfa10e7f5d9b0085e979.png

We see that the improved estimate is closest to the theoretical value \(\cos(\pi)=-1\).

Non-central difference scheme#

The central difference scheme we learned above is very useful and relatively accurate. However, since we cannot always use it (for example, at the beginning or end of the table), we have to rely on non-central difference schemes for computing derivatives.

We have:

the forward difference scheme, which approximates the derivative at a point with the function values at the following points, and
the backward difference scheme, which approximates the derivative at a point with the values at the preceding points.

The derivations are similar to what we showed for the central difference scheme, so we will not treat them here and will only show the final result.

Forward difference scheme#

Forward difference scheme with error order \(\mathcal{O}(h^{1})\):

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i}\)	\(y_{i+1}\)	\(y_{i+2}\)	\(y_{i+3}\)	\(y_{i+4}\)
\(y_i'=\frac{1}{h}\cdot\)	-1	1	0	0	0
\(y_i''=\frac{1}{h^2}\cdot\)	1	-2	1	0	0
\(y_i'''=\frac{1}{h^3}\cdot\)	-1	3	-3	1	0
\(y_i^{(4)}=\frac{1}{h^4}\cdot\)	1	-4	6	-4	1

Forward difference scheme with error order \(\mathcal{O}(h^{2})\):

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i}\)	\(y_{i+1}\)	\(y_{i+2}\)	\(y_{i+3}\)	\(y_{i+4}\)	\(y_{i+5}\)
\(y_i'=\frac{1}{2h}\cdot\)	-3	4	-1	0	0	0
\(y_i''=\frac{1}{h^2}\cdot\)	2	-5	4	-1	0	0
\(y_i'''=\frac{1}{2h^3}\cdot\)	-5	18	-24	14	-3	0
\(y_i^{(4)}=\frac{1}{h^4}\cdot\)	3	-14	26	-24	11	-2

Backward difference scheme#

Backward difference scheme with error order \(\mathcal{O}(h^{1})\):

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-4}\)	\(y_{i-3}\)	\(y_{i-2}\)	\(y_{i-1}\)	\(y_{i}\)
\(y_i'=\frac{1}{h}\cdot\)	0	0	0	-1	1
\(y_i''=\frac{1}{h^2}\cdot\)	0	0	1	-2	1
\(y_i'''=\frac{1}{h^3}\cdot\)	0	-1	3	-3	1
\(y_i^{(4)}=\frac{1}{h^4}\cdot\)	1	-4	6	-4	1

Backward difference scheme with error order \(\mathcal{O}(h^{2})\):

Derivative \(\downarrow\) \(\backslash\) Values \(\rightarrow\)	\(y_{i-5}\)	\(y_{i-4}\)	\(y_{i-3}\)	\(y_{i-2}\)	\(y_{i-1}\)	\(y_{i}\)
\(y_i'=\frac{1}{2h}\cdot\)	0	0	0	1	-4	3
\(y_i''=\frac{1}{h^2}\cdot\)	0	0	-1	4	-5	2
\(y_i'''=\frac{1}{2h^3}\cdot\)	0	3	-14	24	-18	5
\(y_i^{(4)}=\frac{1}{h^4}\cdot\)	-2	11	-24	26	-14	3

Using `numpy.gradient`#

To compute numerical derivatives (2nd-order central difference scheme) we can also use numpy.gradient() (documentation):

gradient(f, *varargs, **kwargs)

where f represents a table of values (in the form of a numerical array) of the function whose derivative we are looking for. f can be one or more dimensions. The positional parameters varargs define the spacing between the values of the argument of the function f; the default value is 1. This value can be a scalar, but it can also be a list of values of the independent variable (or even a combination of both). At the edges, the gradient method uses the forward or backward scheme; the parameter edge_order defines the order of the scheme used at the edges (we can choose between 1 or 2, the default value is 1).

The result of the gradient function is a numerical array (or an array of numerical arrays) with the computed derivatives.

For details see the documentation.

For differentiating a function (and not a table of values), see: scipy.differentiate.

Example#

We will look at an example of how to use the weights, the gradient function, and the special handling at the edges. First we prepare a table of data:

x, h = np.linspace(0, 1, 20, retstep=True)
y = np.sin(2*np.pi*x)

Weights of the difference schemes:

central = np.array([-0.5, 0, 0.5]) # could also be obtained via central_diff_weights(3,1)
forward = np.array([-3/2, 2, -1/2])
backward = np.array([1/2, -2, 3/2])

Now we compute the derivative of the interior points (the first way is with list comprehensions, the second is vectorized):

derivative_interior = np.array([y[i-1:i+2] @ central/h for i in range(1, len(x)-1)]) # list comprehension
derivative_interior = np.convolve(y, central[::-1], mode='valid') / h # vectorized

At the edges we use the forward or backward difference scheme:

derivative_first = y[:len(forward)] @ forward / h  # forward
derivative_last = y[-len(backward):] @ backward / h # backward

Let’s assemble the result:

derivative_full = np.hstack([derivative_first, derivative_interior, derivative_last])

We display the result together with the result of the np.gradient function:

import matplotlib.pyplot as plt
%matplotlib inline
plt.plot(x, derivative_full, 'ko', lw=3, label='own implementation')
plt.plot(x, np.gradient(y, h), 'g.', label='np.gradient, edge_order=1')
plt.plot(x, np.gradient(y, h, edge_order=2), 'r.', label='np.gradient, edge_order=2')
plt.legend();

../_images/99e1a3966b00d1fdac5c485df6b27ef45566b7987bcf71f811b422a7dce56f2a.png

Using `numpy.diff`#

Let’s also present here the frequently used function numpy.diff (documentation), which computes the n-th discrete difference along a given axis. The first difference is given as \(a[i+1] - a[i]\). This is the basic operation for numerical differentiation (it is a first-order accurate derivative), but we have to divide the result by the step \(h\).

When using np.diff, the length of the array is reduced by 1.

x_d = np.linspace(0, 2*np.pi, 20)
y_d = np.sin(x_d)
h_d = x_d[1] - x_d[0]

# First derivative with np.diff (forward difference)
dy = np.diff(y_d) / h_d
x_diff = x_d[:-1] # the x axis shrinks

plt.figure()
plt.plot(x_d, np.cos(x_d), label='Exact derivative')
plt.plot(x_diff, dy, 'o', label='np.diff approximation')
plt.legend()
plt.title('Using np.diff for differentiation')
plt.show()

../_images/d23767a485b0fe661624d1f6e8a25fd3e6bd6a27bdf197d81621f75a4cdd7b0f.png

Round-off error in numerical differentiation#

Above we focused on the method error. In numerical differentiation, however, we must also pay very close attention to the round-off (or representation) error! Let’s look at the first derivative (using the central difference scheme) written with the method error (\(k\,h^2\)) and the round-off error \(\varepsilon\):

\[y_i'=\frac{1}{2h}\left((-y_{i-1}\pm\varepsilon)+(y_{i+1}\pm\varepsilon)\right) + k\,h^2\]

In the worst case the round-off errors add up and the total error is:

\[n=\frac{\varepsilon}{h}+k\,h^2\]

When \(h\) is large, the method error \(k\,h^2\) dominates; when \(h\) is small, the round-off error dominates. The error has a minimum when:

\[n'=-\frac{\varepsilon}{h^2}+2\,k\,h=0.\]

It follows that:

\[h=\sqrt[3]{\frac{\varepsilon}{2\,k}}.\]

Example#

Below we will look at an example where we vary the precision in three steps:

float16 - 16-bit format: sign 1 bit, exponent 5 bits, mantissa 10 bits
float32 - 32-bit format: sign 1 bit, exponent 8 bits, mantissa 23 bits
float64 - 64-bit format: sign 1 bit, exponent 11 bits, mantissa 52 bits (this is the default precision).

For more about types in numpy see the documentation

Now let’s determine the basic round-off error for each type:

eps16 = np.finfo(np.float16).eps
eps32 = np.finfo(np.float32).eps
eps64 = np.finfo(np.float64).eps
print(f'The basic round-off error for the types \
`float16`, `float32`, and `float64` is:\n{[eps16, eps32, eps64]}')

The basic round-off error for the types `float16`, `float32`, and `float64` is:
[np.float16(0.000977), np.float32(1.1920929e-07), np.float64(2.220446049250313e-16)]

As an example let’s look at addition: we add half of the basic round-off error eps16 to the number 1. and convert to the type float16; we find that the new value is still equal to the value 1.:

(1.+eps16/2).astype('float16')

np.float16(1.0)

First let’s define the function \(\exp(x)\), which will give a result with the precision defined by the parameter dtype:

def fun(x, dtype=float): # function
    return np.exp(-dtype(x)).astype(dtype)

Let’s also define a function for the analytically determinable derivative (we will need this later to determine the relative error):

def f1_fun(x): # "exact" derivative of the function
    return -np.exp(-x)

Similarly to addition above, we can also see for the function value that a change in the value \(x\) smaller than \(\epsilon\) leads to the same result:

fun(1., dtype=np.float16)

np.float16(0.368)

fun(1+eps16/2, dtype=np.float16)

np.float16(0.368)

Now let’s use the central difference scheme for the first derivative. Here the numbers should be written with precision dtype, and with the help of the exact derivative the relative error is also computed:

def f1_CDS(fun, x, h, dtype=np.float64):
    f1_estimate = (fun(x+h, dtype=dtype)-fun(x-h,dtype=dtype))/(2*dtype(h))
    f1_exact = f1_fun(x)
    relative_error = (f1_exact - f1_estimate) / f1_exact
    return f1_estimate, relative_error

Let’s look at an example of the derivative at \(x=1.0\) (the Python function returns the value and the relative error):

f1_CDS(fun, x=1., h=.01, dtype=np.float16)

(np.float16(-0.3662), np.float64(0.004535463210798897))

f1_CDS(fun, x=1., h=.01, dtype=np.float64)

../_images/5982436ed22153fe09cc284cad66306477e77dd3ed78522cce56375b410118f2.png

Now let’s define the step:

h=0.25**np.arange(30)
h[:10]

array([1.00000000e+00, 2.50000000e-01, 6.25000000e-02, 1.56250000e-02,
       3.90625000e-03, 9.76562500e-04, 2.44140625e-04, 6.10351562e-05,
       1.52587891e-05, 3.81469727e-06])

We compute the derivative estimate for various precisions (because of division by 0 we get a warning):

f1_16 = f1_CDS(fun, x=1., h=h, dtype=np.float16)
f1_32 = f1_CDS(fun, x=1., h=h, dtype=np.float32)
f1_64 = f1_CDS(fun, x=1., h=h, dtype=np.float64)

/tmp/ipykernel_3650/3908051182.py:2: RuntimeWarning: invalid value encountered in divide
  f1_estimate = (fun(x+h, dtype=dtype)-fun(x-h,dtype=dtype))/(2*dtype(h))

We plot the various types as a function of the step size \(h\). First we import the required libraries:

import matplotlib.pyplot as plt
%matplotlib inline

Let’s define the figure:

def fig_estimate():
    plt.semilogx(h, f1_16[0], 'b', lw=3, alpha=0.5, label='float16')
    plt.semilogx(h, f1_32[0], 'r', lw=3, alpha=0.5, label='float32')
    plt.semilogx(h, f1_64[0], 'g', lw=3, alpha=0.5, label='float64=float')
    plt.title('Derivative estimate for various precision types')
    plt.xlabel('$h$')
    plt.ylabel('Derivative estimate: $-\\exp(-1)=−0.3678794$')
    plt.ylim(-0.37, -0.365)
    plt.legend();

Let’s display it:

fig_estimate()

../_images/9dd17d4739198810a8926883a74f52d567a833594d806a88789886968108d720.png

At a relatively large step \(h\) the method error dominates, while at a small step the round-off error dominates; the optimal step can be estimated based on:

\[h=\sqrt[3]{\frac{\varepsilon}{2\,k}}.\]

In this particular case:

\[k=-\frac{f'''(x)}{6}=-\frac{-e^{-x}}{6}=\frac{1}{6\,e}\qquad(x=1)\]

It follows that:

\[h=\sqrt[3]{3\,\varepsilon\,e}.\]

We compute a suitable step for 16-, 32-, and 64-bit format:

np.power(3*eps16*np.exp(1),1/3)

../_images/7e8025f7ecfe57378a30aca233e67449149e71fa5bacd90811ef06c4bcd12fa2.png

np.power(3*eps32*np.exp(1),1/3)

../_images/08d730afec98e5b412b2f420f13cb454b387573d9c0b2318679ab289a3211318.png

np.power(3*eps64*np.exp(1),1/3)

../_images/91f3ef81ade698e47b74c4d9aff497a24305b31c5a427a983040c42d77daafbf.png

The 64-bit format performs best, but even with it a step smaller than about 1e-5 is not recommended!

Complex-step derivative approximation#

The complex-step derivative approximation is useful when we have any analytic function defined but do not have an analytical expression for the first derivative; for details see (source). The idea stems from the Taylor series expansion of the function \(f(x)\), but the step is taken in the direction of the imaginary axis \(i\,h\) (\(i=\sqrt(-1)\)):

\[f{\left(x + i\,h \right)} = f{\left(x \right)} + i\,h\,\frac{d}{d x} f{\left(x \right)} - \frac{h^{2}}{2} \frac{d^{2}}{d x^{2}} f{\left(x \right)} + O\left(h^{3}\right)\]

If we now assume that the function \(f(x)\) maps real values to the real axis and that \(x\) and \(h\) are real values, then we can derive:

\[\mathbf{Im} \left( f{\left(x + i\,h \right)}\right)= h\,\frac{d}{d x} f{\left(x \right)}+ O\left(h^{3}\right)\]

\[\mathbf{Re} \left( f{\left(x + i\,h \right)}\right) = f{\left(x \right)} - \frac{h^{2}}{2} \frac{d^{2}}{d x^{2}} f{\left(x \right)} + O\left(h^{3}\right)\rightarrow \mathbf{Re} \left( f{\left(x + i\,h \right)}\right) = f{\left(x \right)} + O\left(h^{2}\right)\]

From the first equation (above) we then derive the expression for the first derivative:

\[f'{\left(x \right)}=\frac{\mathbf{Im} \left( f{\left(x + i\,h \right)}\right)}{h}+ O\left(h^{2}\right)\]

The result is in a way very surprising. Above, with a one-step forward scheme, we managed to obtain accuracy \(O\left(h^{1}\right)\), while here it is \(O\left(h^{2}\right)\)!

An important advantage of the complex-step method is that it has no problem with subtracting nearly equal numbers (cancellation error), which plagues classical difference methods at very small steps \(h\). Therefore we can choose an extremely small \(h\) (e.g., \(10^{-200}\)) and obtain a practically exact result (down to machine precision).

If we now look at the use of the method on the example from above, \(\exp(-x)\), let’s first recall the exact result at the value \(x=1\):

f1_exact0

../_images/74afa1bde453c628fb9a6a8caa4e5257f546d2e281451ae8238cdc5f58130d24.png

When called with a step \(h=0.1\) we get two (three) correct digits (similarly to the central difference scheme), and by reducing the step to one tenth we would roughly double the number of correct digits.

h1_complex = 0.1
y1_complex = np.exp(-(1.+1.j*h1_complex))
f1_complex = np.imag(y1_complex)/h1_complex
f1_complex

../_images/ba515d584ba0da9a3cff4f82ee9b4201bb1fddbe9cc3dd5c7e08e144b2a4e273.png

Since we have no problem with subtracting nearly equal numbers, we can reduce the step extremely and obtain an exact result (down to machine precision):

h1_complex = 1e-10
y1_complex = np.exp(-(1.+1.j*h1_complex))
f1_complex = np.imag(y1_complex)/h1_complex
f1_complex

np.sin(2*x0*1e-5j)

array([0.+0.e+00j, 0.+2.e-05j, 0.+4.e-05j])

np.sin(x0**2+2*x0*1e-5j)

array([ 0.        +0.00000000e+00j,  0.84147098+1.08060461e-05j,
       -0.7568025 -2.61457448e-05j])

Let’s also look at an example of differentiating the function sin(x**2). From the code and the figure below we can see a substantially faster convergence of the complex-step method. Why does the method work so well? If we substitute \(x+i\,h\) into \(x^2\), we get: \((x+i\,h)^2 = x^2 + 2i\,x\,h - h^2\)

Since \(h\) is very small, \(h^2\) is negligible in the real part, while the imaginary part \(2\,x\,h\) carries linearly through the sine. In this way we obtain an approximation of the theoretical derivative \(2\,x\,\cos(x^2)\). The method sees the derivative in the imaginary part without subtraction, so no round-off error occurs.

import matplotlib.pyplot as plt
import numpy as np

# Function: sin(x^2)
def f(x):
    return np.sin(x**2)

# Analytical derivative: 2x * cos(x^2)
def df_analytical(x):
    return 2*x * np.cos(x**2)

x0 = 1.5
exact = df_analytical(x0)

h_vals = np.logspace(-10, 0, 100)
err_fd = []
err_cs = []

for h in h_vals:
    # Finite difference (Forward)
    fd_approx = (f(x0 + h) - f(x0)) / h
    err_fd.append(abs(fd_approx - exact))

    # Complex step
    # f(x + ih) = sin((x+ih)^2) ≈ sin(x^2 + 2ixh)
    #           ≈ sin(x^2) + i * 2xh * cos(x^2)
    # Imag/h gives us exactly 2x*cos(x^2)
    cs_approx = f(x0 + 1j*h).imag / h
    err_cs.append(abs(cs_approx - exact))

plt.figure(figsize=(10, 6))
plt.loglog(h_vals, err_fd, 'r-o', label='Finite difference', markersize=4)
plt.loglog(h_vals, err_cs, 'b--', label='Complex step', linewidth=2)

plt.xlabel('Step $h$')
plt.ylabel('Absolute error')
plt.title(r'Error in differentiating $f(x)=\sin(x^2)$')
plt.grid(True, which="both", alpha=0.5)
plt.legend()
plt.show()

../_images/a8aa8aca539a62cafa984b8f0ec63190253df5021d8490d3123624580ed55516.png

Additional#

Lecture video on the topic of numerical differentiation.

Exercise questions#

Question 1: The measured values of a runner’s distance covered s are given at \(N\) points in time. Numerically compute the velocity and acceleration profiles (you may use any method).

Observe and comment on what happens as you increase the number of points \(N\).

# Data
N = 30
t = np.linspace(0, 12, N)
s_ = 50*(np.cos(2*np.pi*(t-12)/24) +1)
s = s_ + np.random.randn(len(t))
plt.plot(t, s, '-', label='with noise')
plt.plot(t, s_, 'o', label='without noise')
plt.grid()
plt.legend()

<matplotlib.legend.Legend at 0x7f6113d78090>

../_images/f8310e6c383eb9745d34c9a806ceefe498ac260ca0e5a1bcd0d701fd72e36df5.png

Question 2: A function \(f(x)\) and its derivative \(f'(x)\) are given symbolically.

\[x(t) = e^{-t} \Big( \sin(\omega t) - 0.5 \cos(2 \omega t) \Big)\]

\[\omega = 2\pi f \quad \text{and} \quad f=4~\text{Hz}\]

We observe the function and its derivative at 100 time points \(t\in [0, 1]\).

Prepare the functions x_fun, dx_fun for numerical computation. Plot the prepared functions on the interval \(t\in [0, 1]\) using the matplotlib package.

# Data
import sympy as sym

t_sym = sym.Symbol('t', real=True, positive=True)
w = 2*sym.pi*4
x_sym = sym.exp(-t_sym) * (sym.sin(w*t_sym) - 0.5*sym.cos(2*w*t_sym))
dx_sym = sym.diff(x_sym, t_sym)

Question 3: Divide the observed time interval \(t \in [0,1]\) into 100 equidistant segments.

Prepare a vector of values \(x(t)\) and compute the derivative \(x'(t)\) numerically as well, using the function numpy.gradient(x, h).

Question 4: Use the function np.gradient to also compute the second derivative \(x''(t)\). Observe the effect of the edge_order parameter on the accuracy of the derivatives at the edges of the interval.

Question 5: To the numerical array \(x(t)\) at 500 equidistant points on the interval \(t \in [0, 1]\), add random noise \(n(t)\) with amplitude \(A=0.1\) using the function np.random.randn.

\[x_{\text{noise}}(t) = x(t) + A\cdot n(t)\]

Using the function np.gradient, numerically determine the value of the first and second derivative of the noisy signal.

Weights of the central difference scheme in `scipy`#

Example 6: Compute the values of the second derivative \(x''(t)\). Use the central difference scheme for the second derivative over 5 points (error order 4).

# Data
#!pip install findiff
import findiff
findiff.coefficients(deriv=2, acc=4)['center']['coefficients']

../_images/75737e934a22596f8ef4ec0376691fe29cbc9e2017cc09796fe8bdc42919b58d.png

t, dt = np.linspace(0, 1, 100, retstep=True)
x = sym.lambdify(t_sym, x_sym)(t)
second_derivative = findiff.FinDiff(0, dt, 2, acc=4) # axis, dx, derivative, accuracy
plt.plot(t, second_derivative(x))

/tmp/ipykernel_3650/3530144622.py:3: DeprecationWarning: FinDiff is deprecated, use Diff instead. See https://findiff.readthedocs.io/en/latest/ for the new API. To suppress this warning: import warnings; warnings.filterwarnings('ignore', message='FinDiff is deprecated')
  second_derivative = findiff.FinDiff(0, dt, 2, acc=4) # axis, dx, derivative, accuracy

[<matplotlib.lines.Line2D at 0x7f61139a7110>]

../_images/c3376baa72c1314616c9684ffc09f1e5e4bfe695810b556f8191d852fe4c2054.png

Question 7: Based on the numerical data \(x(t)\), with the interval \(t \in [0,1]\) divided into 100 points, compute the value of the first derivative \(x'(t)\) using the central difference scheme equation over three points. Can you determine the value of the derivative at the points \(t=0\) and \(t=1\) this way?

# Data
w = np.array([-1/2, 0, 1/2])

Question 8: Using the forward or backward difference scheme, compute the first derivative \(x'(t)\) at the edge points as well (\(t=0\), \(t=1\)). Compare the obtained values with the result of the function np.gradient, where you set the argument edge_order=2.

\[\begin{split} \begin{align} \text{forward: } & \frac{d}{dx}f(x) = \frac{1}{2h}\Big( -3f(x) + 4f(x+h) -f(x+2h)\Big)\\ \text{backward: } & \frac{d}{dx}f(x) = \frac{1}{2h}\Big(f(x-2h) - 4f(x-h) + 3f(x)\Big) \end{align}\end{split}\]

(Hint: you may use the function np.dot)

# Data
forward = np.array([-3/2, 2, -1/2])
backward = np.array([1/2, -2, 3/2])

Improving the derivative approximation with Richardson extrapolation#

Question 9: Compute the improved approximation of the derivative of the function:

\[x(t) = e^{-t} \Big( \sin(8 \pi t) - 0.5 \cos(16 \pi t) \Big)\]

at the value \(t_i = 0.5\), by using the equation above for two different subdivisions, \(h=0.05\) and \(h/2 = 0.025\). Use the central difference scheme for the 1st derivative with error order \(n=2\).

Using `np.convolve`#

Question 10: Use the function np.convolve and compute the values of the second derivative \(x(t)\) with the interval \(t \in [0, 1]\) divided into 100 points. Use the central difference scheme for the second derivative over 5 points, computing the weights with the help of scipy.misc.central_diff_weights.

Display the result graphically (mind the number of edge points!).

Numerical differentiation

Contents

Numerical differentiation#

Introduction#

Approximating the first derivative using the finite-difference method#

Central difference scheme#

Derivative \(f'(x)\)#

Example: \(\exp(-x)\)#

Derivative \(f''(x)\)#

Derivative \(f'''(x)\)#

Derivative \(f^{(4)}(x)\)#

Summary of the central difference scheme#

Improved approximation - Richardson extrapolation#

Example#

Non-central difference scheme#

Forward difference scheme#

Backward difference scheme#

Using numpy.gradient#

Example#

Using numpy.diff#

Round-off error in numerical differentiation#

Example#

Complex-step derivative approximation#

Additional#

Exercise questions#

Weights of the central difference scheme in scipy#

Improving the derivative approximation with Richardson extrapolation#

Using np.convolve#

Using `numpy.gradient`#

Using `numpy.diff`#

Weights of the central difference scheme in `scipy`#

Using `np.convolve`#