Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01t722hc84q
Title: Win Probability Modelling in the National Basketball Association
Authors: Deokar, Viraj
Advisors: Ahmadi, Amir Ali
Department: Operations Research and Financial Engineering
Certificate Program: Applications of Computing Program
Class Year: 2020
Abstract: The modern National Basketball Association (NBA) has seen enormous growth in its statistical analysis and research in recent years. A metric which has risen in popularity as a result is live win probability which tracks the estimated chances of either team winning the current game. While there has been academic research on this topic, there are still many unexplored ideas. Specifically, few academic works exist which focus on the relationship between win probability and the box score of an NBA game. Among other issues is the lack of accessible research data for win probability modelling. In this thesis, we aim to fix both of these problems by building a robust play-by-play data set for the 2008-09 to 2018-19 NBA seasons which incorporates running box score information and will be publicly released. We then train various supervised learning methods both to understand the impact of box score statistics on win probability and to see if we can improve upon current win probability modelling implementations. We find that box score statistics have a negligible impact on win probability predictions as the current score difference is the dominant factor in determining win probability. Our best model performs comparably to ESPN's proprietary win probability model and to the market-implied probabilities from live sports betting.
URI: http://arks.princeton.edu/ark:/88435/dsp01t722hc84q
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Operations Research and Financial Engineering, 2000-2020

Files in This Item:
File Description SizeFormat 
DEOKAR-VIRAJ-THESIS.pdf4.36 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.