Davidon–Fletcher–Powell formula

The Davidon–Fletcher–Powell formula (or DFP; named after William C. Davidon, Roger Fletcher, and Michael J. D. Powell) finds the solution to the secant equation that is closest to the current estimate and satisfies the curvature condition (see below). It was the first quasi-Newton method to generalize the secant method to a multidimensional problem. This update maintains the symmetry and positive definiteness of the Hessian matrix.

Given a function $f(x)$, its gradient $\nabla f$, and a positive definite Hessian matrix $B$, the Taylor series of $f$ about $x_k$ is

$$f(x_k+s_k)=f(x_k)+\nabla f(x_k)^T s_k+\frac{1}{2} s^T_k B s_k+\dots,$$

and the Taylor series of the gradient itself (the secant equation),

$$\nabla f(x_k+s_k)=\nabla f(x_k)+B s_k+\dots,$$

is used to update $B$. The DFP formula finds a solution that is symmetric, positive definite, and closest to the current approximate value $B_k$:

$$B_{k+1}=(I-\gamma_k y_k s_k^T) B_k (I-\gamma_k s_k y_k^T)+\gamma_k y_k y_k^T,$$

where

$$y_k=\nabla f(x_k+s_k)-\nabla f(x_k),$$

$$\gamma_k=\frac{1}{y_k^T s_k},$$

and $B_k$ is a symmetric and positive definite matrix. The corresponding update to the inverse Hessian approximation $H_k=B_k^{-1}$ is given by:

$$H_{k+1}=H_{k}-\frac{H_k y_k y_k^T H_k}{y_k^T H_k y_k}+\frac{s_k s_k^T}{y_k^{T} s_k}.$$
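The two updates can be checked numerically. The following is a minimal NumPy sketch, not a reference implementation: the function names `dfp_update_B` and `dfp_update_H`, the random test matrix, and the diagonal matrix `A` used to generate a pair $(s_k, y_k)$ with $y_k^T s_k>0$ are all assumptions made for illustration.

```python
import numpy as np


def dfp_update_B(B, s, y):
    """DFP update of the Hessian approximation B (symbols as in the text)."""
    gamma = 1.0 / (y @ s)                      # gamma_k = 1 / (y_k^T s_k)
    left = np.eye(len(s)) - gamma * np.outer(y, s)
    # left.T equals (I - gamma * s y^T), the right-hand factor in the formula.
    return left @ B @ left.T + gamma * np.outer(y, y)


def dfp_update_H(H, s, y):
    """DFP update of the inverse Hessian approximation H = B^{-1}."""
    Hy = H @ y                                 # H is symmetric, so y^T H = (H y)^T
    return H - np.outer(Hy, Hy) / (y @ Hy) + np.outer(s, s) / (y @ s)


# Illustrative data (assumed): a symmetric positive definite B_k and a
# pair (s, y) produced by an SPD matrix A, which guarantees y^T s > 0.
rng = np.random.default_rng(0)
M = rng.standard_normal((4, 4))
B = M @ M.T + 4.0 * np.eye(4)
A = np.diag([1.0, 2.0, 3.0, 4.0])
s = rng.standard_normal(4)
y = A @ s

B_new = dfp_update_B(B, s, y)
H_new = dfp_update_H(np.linalg.inv(B), s, y)

secant_ok = bool(np.allclose(B_new @ s, y))                   # B_{k+1} s_k = y_k
inverse_ok = bool(np.allclose(H_new, np.linalg.inv(B_new)))   # H_{k+1} = B_{k+1}^{-1}
spd_ok = bool(np.all(np.linalg.eigvalsh(B_new) > 0))          # positive definite
print(secant_ok, inverse_ok, spd_ok)
```

All three checks are expected to hold: the updated matrix satisfies the secant equation, the two formulas produce mutually inverse matrices, and positive definiteness is preserved.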

$B$ is assumed to be positive definite, and the vectors $s_k$ and $y_k$ must satisfy the curvature condition:

$$s_k^T y_k=s_k^T B s_k>0.$$
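To illustrate how the curvature condition enters an actual iteration, here is a hedged sketch of a DFP loop on a strictly convex quadratic $f(x)=\tfrac{1}{2}x^T A x-b^T x$. The function name `dfp_minimize_quadratic`, the exact line search (valid only for quadratics), the tolerances, and the test problem are assumptions for this example. For such a quadratic, $y_k=A s_k$, so $s_k^T y_k=s_k^T A s_k>0$ holds for every nonzero step; the guard in the loop shows where a general implementation would skip the update if the condition failed.

```python
import numpy as np


def dfp_minimize_quadratic(A, b, x0, max_iter=50, tol=1e-10):
    """Minimize 0.5 x^T A x - b^T x with DFP and an exact line search."""
    x = np.asarray(x0, dtype=float)
    H = np.eye(len(x))                   # initial inverse Hessian approximation
    g = A @ x - b                        # gradient of the quadratic
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        p = -H @ g                       # quasi-Newton search direction
        alpha = -(g @ p) / (p @ A @ p)   # exact minimizer along p (quadratic only)
        s = alpha * p
        g_new = A @ (x + s) - b
        y = g_new - g
        if y @ s > 1e-12:                # curvature condition s^T y > 0
            Hy = H @ y
            H = H - np.outer(Hy, Hy) / (y @ Hy) + np.outer(s, s) / (y @ s)
        x, g = x + s, g_new
    return x, H


# Assumed test problem: a small symmetric positive definite system.
A = np.array([[4.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 2.0]])
b = np.array([1.0, 2.0, 3.0])
x_min, H_final = dfp_minimize_quadratic(A, b, np.zeros(3))
print(np.allclose(A @ x_min, b))
```

The minimizer of this quadratic solves $Ax=b$, which is what the final line checks. For a general nonlinear $f$, the exact step would be replaced by a line search, for example one enforcing the Wolfe conditions, which is one common way to make $s_k^T y_k>0$ hold.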

The DFP formula is quite effective, but it was soon superseded by the BFGS formula, which is its dual (interchanging the roles of y and s).
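The duality can be made concrete with a short sketch. Assuming the two DFP formulas above are coded as `dfp_update_B` and `dfp_update_H` (names chosen for this example), calling them with $s$ and $y$ interchanged, and with the roles of $B$ and $H$ swapped, yields the standard BFGS updates. The test data are arbitrary.

```python
import numpy as np


def dfp_update_B(B, s, y):
    """DFP update of the Hessian approximation."""
    gamma = 1.0 / (y @ s)
    left = np.eye(len(s)) - gamma * np.outer(y, s)
    return left @ B @ left.T + gamma * np.outer(y, y)


def dfp_update_H(H, s, y):
    """DFP update of the inverse Hessian approximation."""
    Hy = H @ y
    return H - np.outer(Hy, Hy) / (y @ Hy) + np.outer(s, s) / (y @ s)


def bfgs_update_H(H, s, y):
    """BFGS inverse update: the DFP formula for B, applied to H with s, y swapped."""
    return dfp_update_B(H, y, s)


def bfgs_update_B(B, s, y):
    """BFGS direct update: the DFP formula for H, applied to B with s, y swapped."""
    return dfp_update_H(B, y, s)


# Arbitrary illustrative data with y^T s > 0.
rng = np.random.default_rng(1)
M = rng.standard_normal((3, 3))
B = M @ M.T + 3.0 * np.eye(3)
s = rng.standard_normal(3)
y = np.diag([1.0, 2.0, 5.0]) @ s

B_bfgs = bfgs_update_B(B, s, y)
H_bfgs = bfgs_update_H(np.linalg.inv(B), s, y)

secant_B = bool(np.allclose(B_bfgs @ s, y))              # B_{k+1} s_k = y_k
secant_H = bool(np.allclose(H_bfgs @ y, s))              # H_{k+1} y_k = s_k
inverse_pair = bool(np.allclose(H_bfgs, np.linalg.inv(B_bfgs)))
print(secant_B, secant_H, inverse_pair)
```

The swapped formulas again satisfy the secant equation and remain mutually inverse, which is the sense in which the two methods are dual.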

