Karush-Kuhn-Tucker conditions

From Wikipedia, the free encyclopedia

In mathematics, the Karush-Kuhn-Tucker conditions (also known as the Kuhn-Tucker or the KKT conditions) are necessary for a solution in nonlinear programming to be optimal. It is a generalization of method of Lagrange multipliers.

Let us consider the following nonlinear optimization problem:

\mbox{minimize } f(x) \mbox{ subject to } g_i(x) \le 0, h_j(x) = 0

where f(x) is the function to be minimized, g_i (x)\ (i = 1, \ldots,m) are the nonequality constraints and h_j (x)\ (j = 1,\ldots,l) are the equality constraints, and m and l are the number of nonequality and equality constraints, respectively.

The necessary conditions for inequality constrained problem were first published in the Masters thesis of W. Karush,[1] although they became renowned after a seminal conference paper by Harold W. Kuhn and Albert W. Tucker.[2]

Contents

[edit] Necessary conditions

Suppose that the objective function, i.e., the function to be minimized, is f : \mathbb{R}^n \rightarrow \mathbb{R} and the constraint functions are g_i : \,\!\mathbb{R}^n \rightarrow \mathbb{R} and h_j : \,\!\mathbb{R}^n \rightarrow \mathbb{R}. Further, suppose they are continuously differentiable at a point x * . If x * is a local minimum, then there exist constants \lambda \ge 0, \mu_i \ge 0\ (i = 1,\ldots,m) and \nu_j\ (j = 1,...,l) such that

\lambda + \sum_{i=1}^m \mu_i + \sum_{j=1}^l |\nu_j| > 0,
\lambda\nabla f(x^*) + \sum_{i=1}^m \mu_i \nabla g_i(x^*) + \sum_{j=1}^l \nu_j \nabla h_j(x^*) = 0,
\mu_i g_i (x^*) = 0\; \mbox{for all}\; i = 1,\ldots,m.

[edit] Regularity conditions (or constraint qualifications)

In the necessary condition above, the dual multiplier λ may be zero. Such cases are referred to as degenerate or abnormal. The necessary condition then does not take into account the properties of the function, but only the geometry of constraints.

A number of regularity conditions exist which ensure that the solution is non-degenerate (i.e. \lambda \ne 0). These include:

  • Linear independence constraint qualification (LICQ): the gradients of the active inequality constraints and the gradients of the equality constraints are linearly independent at x * .
  • Mangasarian-Fromowitz constraint qualification (MFCQ): the gradients of the active inequality constraints and the gradients of the equality constraints are positive-linearly independent at x * .
  • Constant rank constraint qualification (CRCQ): for each subset of the gradients of the active inequality constraints and the gradients of the equality constraints the rank at a vicinity of x * is constant.
  • Constant positive linear dependence constraint qualification (CPLD): for each subset of the gradients of the active inequality constraints and the gradients of the equality constraints, if it is positive-linear dependent at x * then it is positive-linear dependent at a vicinity of x * . (\{v_1,\ldots,v_n\} is positive-linear dependent if there exists a_1>=0,\ldots,a_n>=0 not all zero such that a_1v_1+\ldots+a_nv_n=0)
  • Slater condition: for a problem with inequality constraints only, there exists a point x such that gi(x) < 0 for all i = 1,\ldots,m

It can be shown that LICQ=>MFCQ=>CPLD, LICQ=>CRCQ=>CPLD, although MFCQ is not equivalent to CRCQ. In practice weaker constraint qualifications are preferred since they provide stronger optimality conditions.

[edit] Sufficient conditions

Let the objective function f: \mathbb{R}^n \rightarrow \mathbb{R} and the constraint functions g_i : \mathbb{R}^n \rightarrow \mathbb{R} be convex functions and h_j : \mathbb{R}^n \rightarrow \mathbb{R} be affine functions, and let there be a feasible point x * . If there exist constants \mu_i \ge 0\ (i = 1,\ldots,m) and \nu_j\ (j = 1,\ldots,l) such that

\nabla f(x^*) + \sum_{i=1}^m \mu_i \nabla g_i(x^*) + \sum_{j=1}^l \nu_j \nabla h_j(x^*) = 0
\mu_i g_i (x^*) = 0\; \mbox{for all}\; i = 1,\ldots,m,

then the point x * is a global minimum.

[edit] References

  1. ^ W. Karush (1939). "Minima of Functions of Several Variables with Inequalities as Side Constraints". M.Sc. Dissertation. Dept. of Mathematics, Univ. of Chicago, Chicago, Illinois. . Available from http://wwwlib.umi.com/dxweb/details?doc_no=7371591 (for a fee)
  2. ^ Kuhn, H. W.; Tucker, A. W. (1951). "Nonlinear programming". Proceedings of 2nd Berkeley Symposium: 481-492, Berkeley: University of California Press. 

[edit] See Also

  • Avriel, Mordecai (2003). Nonlinear Programming: Analysis and Methods. Dover Publishing. ISBN 0-486-43227-0.
  • R. Andreani, J. M. Martínez, M. L. Schuverdt, On the relation between constant positive linear dependence condition and quasinormality constraint qualification. Journal of optimization theory and applications, vol. 125, no2, pp. 473-485 (2005).
In other languages