LARS

From Wikipedia, the free encyclopedia

LARS, or Least Angle Regression is a regression algorithm in statistics for high-dimensional data, developed by Bradley Efron, Trevor Hastie, Iain Johnstone and Robert Tibshirani.[1]

Suppose we expect a response variable to be determined by a linear combination of a subset of potential covariates, then the LARS algorithm provides a means of producing an estimate of which variables to include, as well as their coefficients.

Instead of giving a vector result, the LARS solution consists of a curve denoting the solution for each value of the L1 norm of the parameter vector. The algorithm is similar to forward stepwise regression, but instead of including variables at each step, the estimated parameters are increased in a direction equiangular to each's correlations with the residual.

The advantages of the LARS method are:

  1. It is computationally just as fast as forward selection.
  2. It produces a full piecewise linear solution path, which is useful in cross validation or similar attempts to tune the model.
  3. If two variables are almost equally correlated with the response, then their coefficients should increase at approximately the same rate. The algorithm thus behaves as intuition would expect, and also is more stable.
  4. It is easily modified to produce solutions for other estimators, like the Lasso.
  5. It is effective in contexts where p > n.

[edit] References

  1. ^ Efron, Bradley; Hastie, Trevor; Johnstone, Iain and Tibshirani, Robert (2004). "Least Angle Regression". Annals of Statistics 32 (2): pp. 407–499.