Win probability

Win probability is a statistical tool that suggests a sports team's chances of winning at any given point in a game, based on the performance of historical teams in the same situation.^[1] The art of estimating win probability involves choosing which pieces of context matter. Baseball win probability estimates often include whether a team is home or away, inning, number of outs, which bases are occupied, and the score difference. Because baseball proceeds batter by batter, each new batter introduces a discrete state. There are a limited number of possible states, and so baseball win probability tools usually have enough data to make an informed estimate.

American football win probability estimates often include whether a team is home or away, the down and distance, score difference, time remaining, and field position. American football has many more possible states than baseball with far fewer games, so football estimates have a greater margin of error. The first win probability analysis was done in 1971 by Robert E. Machol and former NFL quarterback Virgil Carter.

As a brief example, guessing that each team playing at home will win is based on home advantage. This guess uses a single contextual factor and involves a very large number of games. But with only one factor, the accuracy of this guess is limited to home advantage itself (about 55-70% across sports) and does not change within the game based on in-game factors.

Win probability added is the change in win probability, often how a play or team member affected the probable outcome of the game.^[2]

Current research

Current research work involves measuring the accuracy of win probability estimates, as well as quantifying the uncertainty in individual estimates.^[3]^[4] That is, if a tool estimates a 24% win probability because 24% of previous teams in that situation won their games, do future teams win at the same 24% rate? Estimating from hidden data uses testing tools like cross-validation.

While many models involve frequency analysis of past events, other models use Bayesian processes.^[5]

Some models include a measure of teams' strength coming into the game, while others assume every team is average. Including strength estimates increases the number of possible states, and therefore decreases an estimate's power while possibly increasing its accuracy.^[6]

References

↑ FanGraphs: Win Expectancy at the Wayback Machine (archived November 9, 2014)
↑ Win Probability and Win Probability Added Explained at the Wayback Machine (archived December 15, 2014)
↑ Tango, Tom (October 2, 2006). "Misunderstanding Win Expectancy".
↑ Tango, Tom; Lichtman, Mitchel; Dolphin, Andrew (2007). The Book: Playing the Percentages in Baseball. Potomac Books, Inc. ISBN 978-1-59797-129-4.
↑ Football Commentary: Description of the Dynamic Programming Model at the Wayback Machine (archived November 21, 2014)
↑ Sabermetrics 101: The Game State, Run Expectancy, and Win Expectancy at the Wayback Machine (archived April 11, 2014)

External links

Stoll, Greg. "Baseball Win Probability Calculator".
Pettigrew, Stephen (2014). "Win probability graphs for all 2013/2014 NHL regular season games". Harvard Dataverse Network [Distributor] V2 [Version]. doi:10.7910/DVN/25502

Sports rating systems

Computer models	Advanced Football Analytics Albrecht Matrix Hybrid ARGH Power Ratings Dickinson System Pomeroy College Basketball Ratings Ratings Percentage Index (RPI) Sonny Moore Power Ratings TrueSkill

Methods and concepts	Elo rating system Home advantage Log5 Pythagorean expectation Sabermetrics Strength of schedule Win probability

Polls and opinion	AP Poll FWAA-NFF Grantland Rice Super 16 Poll Harris Interactive Poll Legends Poll NAIA Coaches' Poll USA Today Coaches' Poll

People	John Hollinger Bill James Kenneth Massey Ken Pomeroy Jeff Sagarin Nate Silver Peter Wolfe