Journal of Data Science logo


Login Register

  1. Home
  2. Issues
  3. Volume 21, Issue 1 (2023)
  4. A Joint Analysis for Field Goal Attempts ...

Journal of Data Science

Submit your article Information
  • Article info
  • Related articles
  • More
    Article info Related articles

A Joint Analysis for Field Goal Attempts and Percentages of Professional Basketball Players: Bayesian Nonparametric Resource
Volume 21, Issue 1 (2023), pp. 68–86
Eliot Wong-Toi   Hou-Cheng Yang   Weining Shen     All authors (4)

Authors

 
Placeholder
https://doi.org/10.6339/22-JDS1062
Pub. online: 9 August 2022      Type: Data Science In Action      Open accessOpen Access

Received
9 January 2022
Accepted
17 July 2022
Published
9 August 2022

Abstract

Understanding shooting patterns among different players is a fundamental problem in basketball game analyses. In this paper, we quantify the shooting pattern via the field goal attempts and percentages over twelve non-overlapping regions around the front court. A joint Bayesian nonparametric mixture model is developed to find latent clusters of players based on their shooting patterns. We apply our proposed model to learn the heterogeneity among selected players from the National Basketball Association (NBA) games over the 2018–2019 regular season and 2019–2020 bubble season. Thirteen clusters are identified for 2018–2019 regular season and seven clusters are identified for 2019–2020 bubble season. We further examine the shooting patterns of players in these clusters and discuss their relation to players’ other available information. The results shed new insights on the effect of NBA COVID bubble and may provide useful guidance for player’s shot selection and team’s in-game and recruiting strategy planning.

Supplementary material

 Supplementary Material
The real data and R code used to reproduce the results in this paper can be found https://github.com/ewongtoi/nba_shot_charts. Additional tables are presented in the Online Supplementary Material.

References

 
Aldous DJ (1985). Exchangeability and related topics. In: École d’Été de Probabilités de Saint-Flour XIII—1983, 1–198. Springer.
 
Antoniak CE (1974). Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. The Annals of Statistics, 2(6): 1152–1174.
 
Blackwell D, MacQueen JB, et al. (1973). Ferguson distributions via Pólya urn schemes. The Annals of Statistics, 1(2): 353–355.
 
Bradley JR, Cressie N, Shi T (2011). Selection of rank and basis functions in the spatial random effects model. In: Proceedings of the 2011 Joint Statistical Meetings, 3393–3406. American Statistical Association, Alexandria, VA.
 
Bradley JR, Holan SH, Wikle CK (2015). Multivariate spatio-temporal models for high-dimensional areal data with application to longitudinal employer-household dynamics. The Annals of Applied Statistics, 9(4): 1761–1791.
 
Dahl DB, Vannucci M (2006). Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model. In: Bayesian Inference for Gene Expression and Proteomics (KA Do, P Muller, eds.), 201–218. Cambridge University Press, Cambridge.
 
de Valpine P, Paciorek C, Turek D, Michaud N, Anderson-Bergman C, Obermeyer F, et al. (2021a). NIMBLE: MCMC, Particle Filtering, and Programmable Hierarchical Modeling. R package version 0.12.1.
 
de Valpine P, Paciorek C, Turek D, Michaud N, Anderson-Bergman C, Obermeyer F, et al. (2021b). NIMBLE User Manual. R package manual version 0.12.1.
 
de Valpine P, Turek D, Paciorek C, Anderson-Bergman C, Temple Lang D, Bodik R (2017). Programming with models: writing statistical algorithms for general model structures with NIMBLE. Journal of Computational and Graphical Statistics, 26: 403–413.
 
Ferguson TS (1973). A Bayesian analysis of some nonparametric problems. The Annals of Statistics, 1(2): 209–230.
 
Franks A, Miller A, Bornn L, Goldsberry K (2015). Characterizing the spatial structure of defensive skill in professional basketball. The Annals of Applied Statistics, 9(1): 94–121.
 
Hu G, Yang HC, Xue Y (2021a). Bayesian group learning for shot selection of professional basketball players. Stat, 10(1): e324.
 
Hu G, Yang HC, Xue Y, Dey DK (2021b). Zero inflated poisson model with clustered regression coefficients: an application to heterogeneity learning of field goal attempts of professional basketball players. Canadian Journal of Statistics.
 
Ibrahim JG, Chu H, Chen LM (2010). Basic concepts and methods for joint models of longitudinal and survival data. Journal of Clinical Oncology, 28(16): 2796.
 
Jiao J, Hu G, Yan J (2021). A bayesian marked spatial point processes model for basketball shot chart. Journal of Quantitative Analysis in Sports, 17(2): 77–90.
 
Li H, Calder CA, Cressie N (2007). Beyond moran’s i: testing for spatial dependence based on the spatial autoregressive model. Geographical Analysis, 39(4): 357–375.
 
Lim J (2021). Incorporating Bayesian variable selection into the spatial mixed effects models with process augmentation: Bayesian wavelet neural network (bwnn), Ph.D. thesis, The Florida State University.
 
Long JD, Mills JA (2018). Joint modeling of multivariate longitudinal data and survival data in several observational studies of huntington’s disease. BMC medical research methodology, 18(1): 1–15.
 
Miller A, Bornn L, Adams R, Goldsberry K (2014). Factorized point process intensities: A spatial analysis of professional basketball. In: International conference on machine learning, 235–243. PMLR.
 
Miller JW, Harrison MT (2018). Mixture models with a prior on the number of components. Journal of the American Statistical Association, 113(521): 340–356.
 
Moran PA (1950). Notes on continuous stochastic phenomena. Biometrika, 37(1/2): 17–23.
 
Neal RM (2000). Markov chain sampling methods for Dirichlet process mixture models. Journal of Computational and Graphical Statistics, 9(2): 249–265.
 
Pitman J (1995). Exchangeable and partially exchangeable random partitions. Probability Theory and Related Fields, 102(2): 145–158.
 
Reich BJ, Hodges JS, Carlin BP, Reich AM (2006). A spatial analysis of basketball shot chart data. The American Statistician, 60(1): 3–12.
 
Xu Z, Bradley JR, Sinha D (2019). Latent multivariate log-gamma models for high-dimensional multi-type responses with application to daily fine particulate matter and mortality counts. arXiv preprint: https://arxiv.org/abs/1909.02528.
 
Yang HC, Bradley JR (2022). Bayesian inference for spatial count data that may be over-dispersed or under-dispersed with application to the 2016 us presidential election. Journal of Data Science, forthcoming: 1–13.
 
Yin F, Hu G, Shen W (2022a). Analysis of professional basketball field goal attempts via a bayesian matrix clustering approach. Journal of Computational and Graphical Statistics, forthcoming: 1–23.
 
Yin F, Jiao J, Hu G, Yan J (2022b). Bayesian nonparametric estimation for point processes with spatial homogeneity: A spatial analysis of nba shot locations. In: Proceedings of the 39th International Conference on Machine Learning (ICML).

Related articles PDF XML
Related articles PDF XML

Copyright
2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Keywords
Chinese restaurant process mixture model shot charts data spatial spline sport analytics

Metrics (since February 2021)
102

Article info
views

0

Full article
views

125

PDF
downloads

91

XML
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy