Polyharmonic spline

From Wikipedia, the free encyclopedia

In mathematics, polyharmonic splines are used for function approximation and data interpolation. They are very useful for interpolation of scattered data in many dimensions.

Polyharmonic splines are a special case of radial basis functions and are defined as a linear combination of non-linear basis functions φ(.) that depend only on distances together with a low order polynomial (for notational simplicity, in the sequel only linear polynomials are treated):


y(\mathbf{x}) \, = \, \sum_{i=1}^N w_i \, \phi(||\mathbf{x} - \mathbf{c}_i||) + 
                  \mathbf{v}^T \, \begin{bmatrix} 1 \\ \mathbf{x} \end{bmatrix}

where

Polyharmonic  basis functions
Polyharmonic basis functions
  • \mathbf{x} = [x_1, x_2, \cdots, x_{nx}]^T is a real-valued vector of nx independent variables,
  • \mathbf{c}_i = [c_{1,i}, c_{2,i}, \cdots, c_{nx,i}]^T are N vectors of the same size as \mathbf{x} (often called center).
  • \mathbf{w} = [w_1, w_2, \cdots, w_N]^T are the N weights of the basis functions.
  • \mathbf{v} = [v_1, v_2, \cdots, v_{nx+1}]^T are the nx+1 weights of the polynomial.
  • The linear polynomial with the weighting factors \mathbf{v} improves the interpolation close to the "boundary" and expecially the extrapolation "outside" of the centers \mathbf{c}_i. If this is not desired, this term can also be removed (see also figure) below.

The basis functions of polyharmonic splines are radial basis functions of the form:

 
\begin{matrix}
  \phi(r) = \begin{cases}
                r^k & \mbox{with } k=1,3,5,\dots, \\
                r^k \ln(r) & \mbox{with } k=2,4,6,\dots
            \end{cases} \\[5mm]
   r = ||\mathbf{x} - \mathbf{c}_i||_2 
    = \sqrt{ (\mathbf{x} - \mathbf{c}_i)^T \, (\mathbf{x} - \mathbf{c}_i) }
 \end{matrix}

Other values of exponent k are not useful (such as φ(r) = r2), because a solution of the interpolation problem might no longer exist. To avoid problems at r=0 (since ln(0) = -∞), the polyharmonic splines with the natural logarithm might be implemented as:


\phi(r) = \begin{cases}
             r^{k-1} \ln(r^r) & \mbox{for } r < 1 \\
             r^k \ln(r)       & \mbox{for } r \ge 1
          \end{cases}

The weights wi and vj are determined such that the function passes through N given points (\mathbf{c}_i, y_i) (i=1,2,...,N) and fulfill the nx + 1 orthogonality conditions:

 
  0 = \sum_{i=1}^N w_i, \;\; 0 = \sum_{i=1}^N w_i \, c_{j,i} \;\;\; (j=1,2,...,nx)

In order to compute the weights, a symmetric, linear system of equations has to be solved:

 
\begin{bmatrix}
  \mathbf{A} & \mathbf{V}^T \\
  \mathbf{V} & \mathbf{0} \end{bmatrix}
\;
\begin{bmatrix}
  \mathbf{w} \\
  \mathbf{v}
\end{bmatrix} \; = \; 
\begin{bmatrix}
  \mathbf{y} \\
  \mathbf{0}
\end{bmatrix}\;\;\;\;

where

 
  A_{i,j} =  \phi(||\mathbf{c}_i - \mathbf{c}_j||), \;\;\;
  \mathbf{V} =  
  \begin{bmatrix}
         1       &       1      & \cdots & 1 \\
    \mathbf{c}_1 & \mathbf{c}_2 & \cdots & \mathbf{c}_{nx}
  \end{bmatrix}, \;\;\;
  \mathbf{y}  = [y_1, y_2, \cdots, y_N]^T

Under very mild conditions (essentially, that at least nx+1 points are not in a subspace; e.g. for nx=2 that at least 3 points are not on a straight line), the system matrix of the linear system of equations is positiv definite and therefore a unique solution of the equation system exists.

Once the weights are determined, interpolation requires to just evaluate the top most formula for the provided \mathbf{x}.

The main advantage of polyharmonic spline interpolation is that usually very good interpolation results are obtained for scattered data without performing any "tuning", so automatic interpolation is feasible. This is not the case for other radial basis functions. For example, the Gaussian function (e^{-k\cdot r^2}) needs to be tuned, so that k is selected according to the underlying grid of the independent variables. If this grid is non-uniform, a proper selection of k is difficult or impossible, in order to achieve a good interpolation result.

The main disadvantages are

  • In order to determine the weights, a linear system of equations has to be solved, which is non-sparse. The solution of a non-sparse linear system becomes no longer practical if the dimension n is larger as about 1000 (since the storage requirements are O(n2) and the number of operations to solve the linear system is O(n3). For example n=10000 requires about 100 Mbyte of storage and 1000 Gflops of operations).
  • In order to perform the interpolation of M data points, operations in the order of O(M*N) are needed. In many applications, like image processing, M is much larger as N, and if both numbers are large, this is no longer practical.

Recently, methods have been developed to overcome the mentioned difficulties, see, e.g., (Beatson et.al. 2006)


Interpolation with different polyharmonic splines that shall pass the 4 predefined points marked by a circle (the interpolation with phi = r2 is not useful, since the linear equation  system of the interpolation problem has no solution; it is solved in a least squares sense, but then does not pass the centers)
Interpolation with different polyharmonic splines that shall pass the 4 predefined points marked by a circle (the interpolation with phi = r2 is not useful, since the linear equation system of the interpolation problem has no solution; it is solved in a least squares sense, but then does not pass the centers)

The figure shows the interpolation through four points (marked by "circles") using different types of polyharmonic splines. The "curvature" of the interpolated curves grows with the order of the spline and the extrapolation at the left boundary (x < 0) is reasonable. The figure also includes the radial basis functions phi = exp(-r2) which gives a good interpolation as well. Finally, the figure includes also the non-polyharmonic spline phi = r2 to demonstrate, that this radial basis function is not able to pass through the predefined points (the linear equation has no solution and is solved in a least squares sense).


The same interpolation as in the first figure, but the points to be interpolated are scaled by 100
The same interpolation as in the first figure, but the points to be interpolated are scaled by 100

The figure shows the same interpolation as in the first figure, with the only exception that the points to be interpolated are scaled by a factor of 100 (and the case phi = r2 is no longer included). Since phi = (scale*r)k = (scalek)*rk, the factor (scalek) can be extracted from matrix A of the linear equation system and therefore the solution is not influenced by the scaling. This is different for the logarithmic form of the spline, although the scaling has not much influence. This analysis is reflected in the figure, where the interpolation shows not much differences. Note, for other radial basis functions, such as phi = exp(-k*r2) with k=1, the interpolation is no longer reasonable and it would be necessary to adapt k.


The same interpolation as in the first figure, but without the polynomial term
The same interpolation as in the first figure, but without the polynomial term

The figure shows the same interpolation as in the first figure, with the only exception that the polynomial term of the function is not taken into account (and the case phi = r2 is no longer included). As can be seen from the figure, the extrapolation for x < 0 is no longer as "natural" as in the first figure for some of the basis functions. This indicates, that the polynomial term is useful if extrapolation occurs.


[edit] See also


[edit] References