Runge–Kutta methods

From Wikipedia, the free encyclopedia

In numerical analysis, the Runge–Kutta methods (German pronunciation: [ˌʁʊŋəˈkʊta]) are an important family of implicit and explicit iterative methods, which are used in temporal discretization for the approximation of solutions of ordinary differential equations. These techniques were developed around 1900 by the German mathematicians C. Runge and M. W. Kutta.

See the article on numerical methods for ordinary differential equations for more background and other methods. See also List of Runge–Kutta methods.

The Runge–Kutta method

One member of the family of Runge–Kutta methods is often referred to as "RK4", "classical Runge–Kutta method" or simply as "the Runge–Kutta method".

Let an initial value problem be specified as follows.

{\dot  {y}}=f(t,y),\quad y(t_{0})=y_{0}.

Here, y is an unknown function (scalar or vector) of time t which we would like to approximate; we are told that {\dot  {y}}, the rate at which y changes, is a function of t and of y itself. At the initial time t_{0} the corresponding y-value is y_{0}. The function f and the data t_{0}, y_{0} are given.

Now pick a step-size h>0 and define

y_{{n+1}}=y_{n}+{\tfrac  {1}{6}}h\left(k_{1}+2k_{2}+2k_{3}+k_{4}\right)
t_{{n+1}}=t_{n}+h

for n = 0, 1, 2, 3, . . . , using

k_{1}=f(t_{n},y_{n}),
k_{2}=f(t_{n}+{\tfrac  {1}{2}}h,y_{n}+{\tfrac  {h}{2}}k_{1}),
k_{3}=f(t_{n}+{\tfrac  {1}{2}}h,y_{n}+{\tfrac  {h}{2}}k_{2}),
k_{4}=f(t_{n}+h,y_{n}+hk_{3}). [1]
(Note: the above equations have different but equivalent definitions in different texts).[2]

Here y_{{n+1}} is the RK4 approximation of y(t_{{n+1}}), and the next value (y_{{n+1}}) is determined by the present value (y_{n}) plus the weighted average of four increments, where each increment is the product of the size of the interval, h, and an estimated slope specified by function f on the right-hand side of the differential equation.

  • k_{1} is the increment based on the slope at the beginning of the interval, using {\dot  {y}}, (Euler's method) ;
  • k_{2} is the increment based on the slope at the midpoint of the interval, using {\dot  {y}}+{\tfrac  12}hk_{1} ;
  • k_{3} is again the increment based on the slope at the midpoint, but now using {\dot  {y}}+{\tfrac  12}hk_{2} ;
  • k_{4} is the increment based on the slope at the end of the interval, using {\dot  {y}}+hk_{3} .

In averaging the four increments, greater weight is given to the increments at the midpoint. The weights are chosen such that if f is independent of y, so that the differential equation is equivalent to a simple integral, then RK4 is Simpson's rule.[3]

The RK4 method is a fourth-order method, meaning that the local truncation error is on the order of O(h^{5}), while the total accumulated error is order O(h^{4}).

Tan Delin and Chen Zheng have developed a general formula for Runge-Kutta method in fourth-order as the following.

y_{{n+1}}=y_{n}+{\tfrac  {1}{6}}h\left[k_{1}+\left(4-\lambda \right)k_{2}+\lambda k_{3}+k_{4}\right]
t_{{n+1}}=t_{n}+h

for n = 0, 1, 2, 3, . . . , using

k_{1}=f(t_{n},y_{n}),
k_{2}=f(t_{n}+{\tfrac  {1}{2}}h,y_{n}+{\tfrac  {1}{2}}k_{1}h),
k_{3}=f(t_{n}+{\tfrac  {1}{2}}h,y_{n}+\left({\tfrac  {1}{2}}-{\tfrac  {1}{\lambda }}\right)k_{1}h+{\tfrac  {1}{\lambda }}k_{2}h),
k_{4}=f(t_{n}+h,y_{n}+\left(1-{\tfrac  {\lambda }{2}}\right)k_{2}h+{\tfrac  {\lambda }{2}}k_{3}h).

Where \lambda is a free parameter. Choosing \lambda =2, this is the classical fourth-order Runge-Kutta method. With \lambda =1,3,4,5, this formula produces other fourth-order Runge-Kutta methods.

Explicit Runge–Kutta methods

The family of explicit Runge–Kutta methods is a generalization of the RK4 method mentioned above. It is given by

y_{{n+1}}=y_{n}+\sum _{{i=1}}^{s}b_{i}k_{i},

where

k_{1}=hf(t_{n},y_{n}),\,
k_{2}=hf(t_{n}+c_{2}h,y_{n}+a_{{21}}k_{1}),\,
k_{3}=hf(t_{n}+c_{3}h,y_{n}+a_{{31}}k_{1}+a_{{32}}k_{2}),\,
\vdots
k_{s}=hf(t_{n}+c_{s}h,y_{n}+a_{{s1}}k_{1}+a_{{s2}}k_{2}+\cdots +a_{{s,s-1}}k_{{s-1}}). [4]
(Note: the above equations have different but equivalent definitions in different texts).[2]

To specify a particular method, one needs to provide the integer s (the number of stages), and the coefficients aij (for 1 ≤ j < is), bi (for i = 1, 2, ..., s) and ci (for i = 2, 3, ..., s). The matrix [aij] is called the Runge–Kutta matrix, while the bi and ci are known as the weights and the nodes.[5] These data are usually arranged in a mnemonic device, known as a Butcher tableau (after John C. Butcher):

0
c_{2} a_{{21}}
c_{3} a_{{31}} a_{{32}}
\vdots \vdots \ddots
c_{s} a_{{s1}} a_{{s2}} \cdots a_{{s,s-1}}
b_{1} b_{2} \cdots b_{{s-1}} b_{s}

The Runge–Kutta method is consistent if

\sum _{{j=1}}^{{i-1}}a_{{ij}}=c_{i}\ {\mathrm  {for}}\ i=2,\ldots ,s.

There are also accompanying requirements if we require the method to have a certain order p, meaning that the local truncation error is O(hp+1). These can be derived from the definition of the truncation error itself. For example, a 2-stage method has order 2 if b1 + b2 = 1, b2c2 = 1/2, and a21 = c2.[6]

Examples

The RK4 method falls in this framework. Its tableau is:[7]

0
1/2 1/2
1/2 0 1/2
1 0 0 1
1/6 1/3 1/3 1/6

A slight variation of "the" Runge-Kutta method is also due to Kutta in 1901 and is called the 3/8-rule.[8] The primary advantage this method has is that almost all of the error coefficients are smaller than the popular method, but it requires slightly more FLOPs (floating point operations) per time step. Its Butcher tableau is given by:

0
1/3 1/3
2/3 −1/3 1
1 1 −1 1
1/8 3/8 3/8 1/8

However, the simplest Runge–Kutta method is the (forward) Euler method, given by the formula y_{{n+1}}=y_{n}+hf(t_{n},y_{n}). This is the only consistent explicit Runge–Kutta method with one stage. The corresponding tableau is:

0
1

Second-order methods with two stages

An example of a second-order method with two stages is provided by the midpoint method

y_{{n+1}}=y_{n}+hf\left(t_{n}+{\frac  {1}{2}}h,y_{n}+{\frac  {1}{2}}hf(t_{n},y_{n})\right).

The corresponding tableau is:

0
1/2 1/2
0 1

The midpoint method is not the only second-order Runge–Kutta method with two stages. In fact, there is a family of such methods, parameterized by α, and given by the formula

y_{{n+1}}=y_{n}+h{\bigl (}(1-{\tfrac  1{2\alpha }})f(t_{n},y_{n})+{\tfrac  1{2\alpha }}f(t_{n}+\alpha h,y_{n}+\alpha hf(t_{n},y_{n})){\bigr )}. [9]

Its Butcher tableau is

0
\alpha \alpha
(1-{\tfrac  1{2\alpha }}) {\tfrac  1{2\alpha }}

In this family, \alpha ={\tfrac  12} gives the midpoint method and \alpha =1 is Heun's method.[3]

Usage

As an example, consider the two-stage second-order Runge–Kutta method with α = 2/3. It is given by the tableau

0
2/3 2/3
1/4 3/4

with the corresponding equations

k_{1}=f(t_{n},y_{n}),
k_{2}=f(t_{n}+{\tfrac  {2}{3}}h,y_{n}+{\tfrac  {2}{3}}hk_{1}),
y_{{n+1}}=y_{n}+h\left({\tfrac  {1}{4}}k_{1}+{\tfrac  {3}{4}}k_{2}\right).

This method is used to solve the initial-value problem

y'=\tan(y)+1,\quad y_{0}=1,\ t\in [1,1.1]

with step size h = 0.025, so the method needs to take four steps.

The method proceeds as follows:

t_{0}=1\colon
y_{0}=1
t_{1}=1.025\colon
y_{0}=1 k_{1}=2.557407725 k_{2}=f(t_{0}+{\tfrac  23}h,y_{0}+{\tfrac  23}hk_{1})=2.7139
y_{1}=y_{0}+h({\tfrac  14}k_{1}+{\tfrac  34}k_{2})=\underline {1.066869388}
t_{2}=1.05\colon
y_{1}=1.066869388 k_{1}=2.813524695 k_{2}=f(t_{1}+{\tfrac  23}h,y_{1}+{\tfrac  23}hk_{1})
y_{2}=y_{1}+h({\tfrac  14}k_{1}+{\tfrac  34}k_{2})=\underline {1.141332181}
t_{3}=1.075\colon
y_{2}=1.141332181 k_{1}=3.183536647 k_{2}=f(t_{2}+{\tfrac  23}h,y_{2}+{\tfrac  23}hk_{1})
y_{3}=y_{2}+h({\tfrac  14}k_{1}+{\tfrac  34}k_{2})=\underline {1.227417567}
t_{4}=1.1\colon
y_{3}=1.227417567 k_{1}=3.796866512 k_{2}=f(t_{3}+{\tfrac  23}h,y_{3}+{\tfrac  23}hk_{1})
y_{4}=y_{3}+h({\tfrac  14}k_{1}+{\tfrac  34}k_{2})=\underline {1.335079087}.

The numerical solutions correspond to the underlined values.

Adaptive Runge–Kutta methods

The adaptive methods are designed to produce an estimate of the local truncation error of a single Runge–Kutta step. This is done by having two methods in the tableau, one with order p and one with order p-1.

The lower-order step is given by

y_{{n+1}}^{*}=y_{n}+\sum _{{i=1}}^{s}b_{i}^{*}k_{i},

where the k_{i} are the same as for the higher-order method. Then the error is

e_{{n+1}}=y_{{n+1}}-y_{{n+1}}^{*}=\sum _{{i=1}}^{s}(b_{i}-b_{i}^{*})k_{i},

which is O(h^{p}). The Butcher tableau for this kind of method is extended to give the values of b_{i}^{*}:

0
c_{2} a_{{21}}
c_{3} a_{{31}} a_{{32}}
\vdots \vdots \ddots
c_{s} a_{{s1}} a_{{s2}} \cdots a_{{s,s-1}}
b_{1} b_{2} \cdots b_{{s-1}} b_{s}
b_{1}^{*} b_{2}^{*} \cdots b_{{s-1}}^{*} b_{s}^{*}

The Runge–Kutta–Fehlberg method has two methods of orders 5 and 4. Its extended Butcher tableau is:

0
1/4 1/4
3/8 3/32 9/32
12/13 1932/2197 −7200/2197 7296/2197
1 439/216 −8 3680/513 -845/4104
1/2 −8/27 2 −3544/2565 1859/4104 −11/40
16/135 0 6656/12825 28561/56430 −9/50 2/55
25/216 0 1408/2565 2197/4104 −1/5 0

However, the simplest adaptive Runge–Kutta method involves combining the Heun method, which is order 2, with the Euler method, which is order 1. Its extended Butcher tableau is:

0
1 1
1/2 1/2
1 0

The error estimate is used to control the stepsize.

Other adaptive Runge–Kutta methods are the Bogacki–Shampine method (orders 3 and 2), the Cash–Karp method and the Dormand–Prince method (both with orders 5 and 4).

Nonconfluent Runge-Kutta methods

A Runge-Kutta method is said to be nonconfluent [10] if all the c_{i},\,i=1,2,\ldots ,s are distinct.

Implicit Runge–Kutta methods

All Runge–Kutta methods mentioned up to now are explicit methods. Unfortunately, explicit Runge–Kutta methods are generally unsuitable for the solution of stiff equations because their region of absolute stability is small; in particular, it is bounded.[11] This issue is especially important in the solution of partial differential equations.

The instability of explicit Runge–Kutta methods motivates the development of implicit methods. An implicit Runge–Kutta method has the form

y_{{n+1}}=y_{n}+\sum _{{i=1}}^{s}b_{i}k_{i},

where

k_{i}=hf{\bigl (}t_{n}+c_{i}h,y_{n}+\sum _{{j=1}}^{s}a_{{ij}}k_{j}{\bigr )},\quad i=1,\ldots ,s. [12]

The difference with an explicit method is that in an explicit method, the sum over j only goes up to i − 1. This also shows up in the Butcher tableau. For an implicit method, the coefficient matrix a_{{ij}} is not necessarily lower triangular:[7]

{\begin{array}{c|cccc}c_{1}&a_{{11}}&a_{{12}}&\dots &a_{{1s}}\\c_{2}&a_{{21}}&a_{{22}}&\dots &a_{{2s}}\\\vdots &\vdots &\vdots &\ddots &\vdots \\c_{s}&a_{{s1}}&a_{{s2}}&\dots &a_{{ss}}\\\hline &b_{1}&b_{2}&\dots &b_{s}\\\end{array}}={\begin{array}{c|c}{\mathbf  {c}}&A\\\hline &{\mathbf  {b^{T}}}\\\end{array}}

The consequence of this difference is that at every step, a system of algebraic equations has to be solved. This increases the computational cost considerably. If a method with s stages is used to solve a differential equation with m components, then the system of algebraic equations has ms components. In contrast, an implicit s-step linear multistep method needs to solve a system of algebraic equations with only s components.[13]

Examples

The simplest example of an implicit Runge–Kutta method is the backward Euler method:

y_{{n+1}}=y_{n}+hf(t_{n}+h,y_{{n+1}}).\,

The Butcher tableau for this is simply:

{\begin{array}{c|c}1&1\\\hline &1\\\end{array}}

This Butcher tableau corresponds to the formulae

k_{1}=f(t_{n}+h,y_{n}+hk_{1})\quad {\text{and}}\quad y_{{n+1}}=y_{n}+hk_{1},

which can be re-arranged to get the formula for the backward Euler method listed above.

Another example for an implicit Runge–Kutta method is the trapezoidal rule. Its Butcher tableau is:

{\begin{array}{c|cc}0&0&0\\1&{\frac  {1}{2}}&{\frac  {1}{2}}\\\hline &{\frac  {1}{2}}&{\frac  {1}{2}}\\\end{array}}

The trapezoidal rule is a collocation method (as discussed in that article). All collocation methods are implicit Runge–Kutta methods, but not all implicit Runge–Kutta methods are collocation methods.[14]

The Gauss–Legendre methods form a family of collocation methods based on Gauss quadrature. A Gauss–Legendre method with s stages has order 2s (thus, methods with arbitrarily high order can be constructed).[15] The method with two stages (and thus order four) has Butcher tableau:

{\begin{array}{c|cc}{\frac  12}-{\frac  16}{\sqrt  3}&{\frac  14}&{\frac  14}-{\frac  16}{\sqrt  3}\\{\frac  12}+{\frac  16}{\sqrt  3}&{\frac  14}+{\frac  16}{\sqrt  3}&{\frac  14}\\\hline &{\frac  12}&{\frac  12}\end{array}} [13]

Stability

The advantage of implicit Runge–Kutta methods above explicit ones is their greater stability, especially when applied to stiff equations. Consider the linear test equation y' = λy. A Runge–Kutta method applied to this equation reduces to the iteration y_{{n+1}}=r(h\lambda )\,y_{n}, with r given by

r(z)=1+zb^{T}(I-zA)^{{-1}}e={\frac  {\det(I-zA+zeb^{T})}{\det(I-zA)}}, [16]

where e stands for the vector of ones. The function r is called the stability function.[17] It follows from the formula that r is the quotient of two polynomials of degree s if the method has s stages. Explicit methods have a strictly lower triangular matrix A, which implies that det(IzA) = 1 and that the stability function is a polynomial.[18]

The numerical solution to the linear test equation decays to zero if | r(z) | < 1 with z = hλ. The set of such z is called the domain of absolute stability. In particular, the method is said to be A-stable if all z with Re(z) < 0 are in the domain of absolute stability. The stability function of an explicit Runge–Kutta method is a polynomial, so explicit Runge–Kutta methods can never be A-stable.[18]

If the method has order p, then the stability function satisfies r(z)={\textrm  {e}}^{z}+O(z^{{p+1}}) as z\to 0. Thus, it is of interest to study quotients of polynomial of given degrees that approximate the exponential function the best. These are known as Padé approximants. A Padé approximant with numerator of degree m and denominator of degree n is A-stable if and only if mnm + 2.[19]

The Gauss–Legendre method with s stages has order 2s, so its stability function is the Padé approximant with m = n = s. It follows that the method is A-stable.[20] This shows that A-stable Runge–Kutta can have arbitrarily high order. In contrast, the order of A-stable linear multistep methods cannot exceed two.[21]

B-stability. Algebraic stability

The A-stability concept for the solution of differential equations is related to the linear autonomous equation y'=\lambda y. Dahlquist proposed the investigation of stability of numerical schemes when applied to nonlinear systems which satisfies a monotonicity condition. The corresponding concepts were defined as G-stability for multistep methods (and the related one-leg methods) and B-stability (Butcher, 1975) for Runge-Kutta methods. A Runge-Kutta method applied to the non-linear system y'=f(y), which verifies \langle f(y)-f(z),y-z\rangle <0, is called B-stable, if this condition implies \|y_{{n+1}}-z_{{n+1}}\|\leq \|y_{{n}}-z_{{n}}\| for two numerical solutions.

Let B, M and Q three s\times s matrices defined by

B=\operatorname {diag}(b_{1},b_{2},\ldots ,b_{s}),\,M=BA+A^{T}B-bb^{T}\,Q=BA^{{-1}}+A^{{-T}}B-A^{{-T}}bb^{T}A^{{-1}}.

A Runge-Kutta method is said to be algebraically stable [22] if the matrices both B and M are both non-negative definite. A sufficient condition for B-stability [23] is: B and Q are non-negative definite.

Derivation of the Runge–Kutta fourth-order method

In general a Runge–Kutta method of order s can be written as:

y_{{t+h}}=y_{t}+h\cdot \sum _{{i=1}}^{s}a_{i}k_{i}+{\mathcal  {O}}(h^{{s+1}})

where:

k_{i}=f\left(y_{t}+h\cdot \sum _{{j=1}}^{s}\beta _{{ij}}k_{j},t_{n}+\alpha _{i}h\right)

are increments obtained evaluating the derivatives of y_{t} at the i-th order.

We develop the derivation[24] for the Runge–Kutta fourth-order method using the general formula with s=4 evaluated, as explained above, at the starting point, the midpoint and the end point of any interval (t,t+h), thus we choose:

{\begin{array}{|l|l|}\hline \alpha _{i}&\beta _{{ij}}\\[8pt]\hline \alpha _{1}=0&\beta _{{21}}={\frac  {1}{2}}\\[8pt]\alpha _{2}={\frac  {1}{2}}&\beta _{{32}}={\frac  {1}{2}}\\[8pt]\alpha _{3}={\frac  {1}{2}}&\beta _{{43}}=1\\[8pt]\alpha _{4}=1&\\[8pt]\hline \end{array}}

and \beta _{{ij}}=0 otherwise. We begin by defining the following quantities:

y_{{t+h}}^{1}=y_{t}+hf\left(y_{t},t\right):y_{{t+h}}^{2}=y_{t}+hf\left(y_{{t+h/2}}^{1},t+{\frac  {h}{2}}\right):y_{{t+h}}^{3}=y_{t}+hf\left(y_{{t+h/2}}^{2},t+{\frac  {h}{2}}\right)

where y_{{t+h/2}}^{1}={\dfrac  {y_{t}+y_{{t+h}}^{1}}{2}} and y_{{t+h/2}}^{2}={\dfrac  {y_{t}+y_{{t+h}}^{2}}{2}} If we define:

k_{1}=f(y_{t},t):k_{2}=f\left(y_{{t+h/2}}^{1},t+{\frac  {h}{2}}\right):k_{3}=f\left(y_{{t+h/2}}^{2},t+{\frac  {h}{2}}\right):k_{4}=f\left(y_{{t+h}}^{3},t+h\right)

and for the previous relations we can show that the following equalities holds up to {\mathcal  {O}}(h^{2}):

{\begin{array}{ll}k_{2}&=f\left(y_{{t+h/2}}^{1},t+{\frac  {h}{2}}\right)=f\left(y_{t}+{\frac  {h}{2}}k_{1},t+{\frac  {h}{2}}\right)\\&=f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}f\left(y_{t},t\right)\\k_{3}&=f\left(y_{{t+h/2}}^{2},t+{\frac  {h}{2}}\right)=f\left(y_{t}+{\frac  {h}{2}}f\left(y_{t}+{\frac  {h}{2}}k_{1},t+{\frac  {h}{2}}\right),t+{\frac  {h}{2}}\right)\\&=f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}\left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}f\left(y_{t},t\right)\right]\\k_{4}&=f\left(y_{{t+h}}^{3},t+h\right)=f\left(y_{t}+hf\left(y_{t}+{\frac  {h}{2}}k_{2},t+{\frac  {h}{2}}\right),t+h\right)\\&=f\left(y_{t}+hf\left(y_{t}+{\frac  {h}{2}}f\left(y_{t}+{\frac  {h}{2}}f\left(y_{t},t\right),t+{\frac  {h}{2}}\right),t+{\frac  {h}{2}}\right),t+h\right)\\&=f\left(y_{t},t\right)+h{\frac  {d}{dt}}\left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}\left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}f\left(y_{t},t\right)\right]\right]\end{array}}

where:

{\frac  {d}{dt}}f(y_{t},t)={\frac  {\partial }{\partial y}}f(y_{t},t){\dot  y}_{t}+{\frac  {\partial }{\partial t}}f(y_{t},t)=f_{y}(y_{t},t){\dot  y}+f_{t}(y_{t},t):={\ddot  y}_{t}

is the total derivative of f with respect to time.

If we now express the general formula using what we just derived we obtain:

{\begin{array}{ll}y_{{t+h}}&=y_{t}+h\left\lbrace a\cdot f(y_{t},t)+b\cdot \left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}f\left(y_{t},t\right)\right]\right.+\\&{}+c\cdot \left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}\left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}f\left(y_{t},t\right)\right]\right]+\\&{}+d\cdot \left[f\left(y_{t},t\right)+h{\frac  {d}{dt}}\left[f\left(y_{t},t\right)+{\frac  {h}{2}}{\frac  {d}{dt}}\left[f\left(y_{t},t\right)+\left.{\frac  {h}{2}}{\frac  {d}{dt}}f\left(y_{t},t\right)\right]\right]\right]\right\rbrace +{\mathcal  {O}}(h^{5})\\&=y_{t}+a\cdot hf_{t}+b\cdot hf_{t}+b\cdot {\frac  {h^{2}}{2}}{\frac  {df_{t}}{dt}}+c\cdot hf_{t}+c\cdot {\frac  {h^{2}}{2}}{\frac  {df_{t}}{dt}}+\\&{}+c\cdot {\frac  {h^{3}}{4}}{\frac  {d^{2}f_{t}}{dt^{2}}}+d\cdot hf_{t}+d\cdot h^{2}{\frac  {df_{t}}{dt}}+d\cdot {\frac  {h^{3}}{2}}{\frac  {d^{2}f_{t}}{dt^{2}}}+d\cdot {\frac  {h^{4}}{4}}{\frac  {d^{3}f_{t}}{dt^{3}}}+{\mathcal  {O}}(h^{5})\end{array}}

and comparing this with the Taylor series of y_{{t+h}} around y_{t}:

{\begin{array}{ll}y_{{t+h}}&=y_{t}+h{\dot  y}_{t}+{\frac  {h^{2}}{2}}{\ddot  y}_{t}+{\frac  {h^{3}}{6}}y_{t}^{{(3)}}+{\frac  {h^{4}}{24}}y_{t}^{{(4)}}+{\mathcal  {O}}(h^{5})=\\&=y_{t}+hf(y_{t},t)+{\frac  {h^{2}}{2}}{\frac  {d}{dt}}f(y_{t},t)+{\frac  {h^{3}}{6}}{\frac  {d^{2}}{dt^{2}}}f(y_{t},t)+{\frac  {h^{4}}{24}}{\frac  {d^{3}}{dt^{3}}}f(y_{t},t)\end{array}}

we obtain a system of constraints on the coefficients:

{\begin{cases}&a+b+c+d=1\\[6pt]&{\frac  {1}{2}}b+{\frac  {1}{2}}c+d={\frac  {1}{2}}\\[6pt]&{\frac  {1}{4}}c+{\frac  {1}{2}}d={\frac  {1}{6}}\\[6pt]&{\frac  {1}{4}}d={\frac  {1}{24}}\end{cases}}

which solved gives a={\frac  {1}{6}},b={\frac  {1}{3}},c={\frac  {1}{3}},d={\frac  {1}{6}} as stated above.

See also

Notes

  1. Press et al. 2007, p. 908; Süli & Mayers 2003, p. 328
  2. 2.0 2.1 Atkinson (1989, p. 423), Hairer, Nørsett & Wanner (1993, p. 134), Kaw & Kalu (2008, §8.4) and Stoer & Bulirsch (2002, p. 476) leave out the factor h in the definition of the stages. Ascher & Petzold (1998, p. 81), Butcher (2003, p. 93) and Iserles (1996, p. 38) use the y-values as stages.
  3. 3.0 3.1 Süli & Mayers 2003, p. 328
  4. Press et al. 2007, p. 907
  5. Iserles 1996, p. 38
  6. Iserles 1996, p. 39
  7. 7.0 7.1 Süli & Mayers 2003, p. 352
  8. Hairer, Nørsett & Wanner (1993, p. 138) refer to Kutta (1901)
  9. Süli & Mayers 2003, p. 327
  10. Lambert 1991, p. 278
  11. Süli & Mayers 2003, pp. 349–351
  12. Iserles 1996, p. 41; Süli & Mayers 2003, pp. 351–352
  13. 13.0 13.1 Süli & Mayers 2003, p. 353
  14. Iserles 1996, pp. 43–44
  15. Iserles 1996, p. 47
  16. Hairer & Wanner 1996, pp. 40–41
  17. Hairer & Wanner 1996, p. 40
  18. 18.0 18.1 Iserles 1996, p. 60
  19. Iserles 1996, pp. 62–63
  20. Iserles 1996, p. 63
  21. This result is due to Dahlquist (1963).
  22. Lambert 1991, p. 275
  23. Lambert 1991, p. 274
  24. PDF reporting this derivation

References

External links

This article is issued from Wikipedia. The text is available under the Creative Commons Attribution/Share Alike; additional terms may apply for the media files.