An S-Curve Method for Abrupt and Gradual Changepoint Analysis
Pub. online: 9 July 2024
Type: Statistical Data Science
Open Access
Received
18 April 2024
18 April 2024
Accepted
19 April 2024
19 April 2024
Published
9 July 2024
9 July 2024
Abstract
Changepoint analysis has had a striking variety of applications, and a rich methodology has been developed. Our contribution here is a new approach that uses nonlinear regression analysis as an intermediate computational device. The tool is quite versatile, covering a number of different changepoint scenarios. It is largely free of parametric model assumptions, and has the major advantage of providing standard errors for formal statistical inference. Both abrupt and gradual changes are covered.
Supplementary material
Supplementary MaterialThe ZIP file contains all code needed to reproduce the figures and results of the experiments.
References
Aggarwal R, Inclan C, Leal R (1999). Volatility in emerging stock markets. Journal of Financial and Quantitative Analysis, 34(1): 33–55. https://doi.org/10.2307/2676245
Aminikhanghahi S, Cook D (2017). A survey of methods for time series change point detection. Knowledge and Information Systems, 51: 339–367. https://doi.org/10.1007/s10115-016-0987-z
Bai J, Perron P (2003). Computation and analysis of multiple structural change models. Journal of Applied Econometrics, 18(1): 1–22. https://doi.org/10.1002/jae.659
Berk R, Brown L, Buja A, Zhang K, Zhao L (2013). Valid post-selection inference. The Annals of Statistics, 41(2): 802–837. https://doi.org/10.1214/12-AOS1077
Bhaduri R, Roy S, Pal S (2022). Rough-fuzzy cpd: a gradual change point detection algorithm. Journal of Data, Information and Management, 4: 1–24. https://doi.org/10.1007/s42488-022-00077-3
Boe LA, Lumley T, Shaw PA (2024). Practical considerations for sandwich variance estimation in two-stage regression settings. American Journal of Epidemiology, 193(5): 798–810. https://doi.org/10.1093/aje/kwad234
Chang ST, Lu KP, Yang MS (2015). Fuzzy change-point algorithms for regression models. IEEE Transactions on Fuzzy Systems, 23(6): 2343–2357. https://doi.org/10.1109/TFUZZ.2015.2421072
Chen C, Chan J, Gerlach R, Hsieh W (2011). A comparison of estimators for regression models with change points. Statistics and Computing, 21: 395–414. https://doi.org/10.1007/s11222-010-9177-0
Erdman C, Emerson JW (2007). bcp: An R package for performing a Bayesian analysis of change point problems. Journal of Statistical Software, 23(3): 1–13. https://doi.org/10.18637/jss.v023.i03
Fong Y (2019). Fast bootstrap confidence intervals for continuous threshold linear regression. Journal of Computational and Graphical Statistics, 28: 466–470. https://doi.org/10.1080/10618600.2018.1537927
Fong Y, Huang Y, Gilbert P, Permar S (2017). chngpt: Threshold regression model estimation and inference. BMC Bioinformatics, 18: 454. https://doi.org/10.1186/s12859-017-1863-x
Fryzlewicz P (2014). Wild binary segmentation for multiple change-point detection. The Annals of Statistics, 42(6): 2243–2281. https://doi.org/10.1214/14-AOS1245
Hušková M (1999). Gradual changes versus abrupt changes. Journal of Statistical Planning and Inference, 76(1): 109–125. https://doi.org/10.1016/S0378-3758(98)00173-6
Jennrich RI (1969). Asymptotic Properties of Non-Linear Least Squares Estimators. The Annals of Mathematical Statistics, 40(2): 633–643. https://doi.org/10.1214/aoms/1177697731
Killick R, Eckley IA (2014). changepoint: An R package for changepoint analysis. Journal of Statistical Software, 58(3): 1–19. https://doi.org/10.18637/jss.v058.i03
Killick R, Fearnhead P, Eckley I (2012a). Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association, 107: 1590–1598. https://doi.org/10.1080/01621459.2012.737745
Kuchibhotla AK, Kolassa JE, Kuffner TA (2022). Post-selection inference. Annual Review of Statistics and Its Application, 9(1): 505–527. https://doi.org/10.1146/annurev-statistics-100421-044639
Lindeløv JK (2023). An overview of change point packages in R. https://lindeloev.github.io/mcp/articles/packages.html.
Lu KP, Chang ST (2016). Detecting change-points for shifts in mean and variance using fuzzy classification maximum likelihood change-point algorithms. Journal of Computational and Applied Mathematics, 308: 447–463. https://doi.org/10.1016/j.cam.2016.06.006
Lu KP, Chang ST, Yang MS (2016). Change-point detection for shifts in control charts using fuzzy shift change-point algorithms. Computers & Industrial Engineering, 93: 12–27. https://doi.org/10.1016/j.cie.2015.12.002
Matloff N (1981). Use of regression functions for improved estimation of means. Biometrika, 68(3): 685–689. https://doi.org/10.1093/biomet/68.3.685
Muggeo VM (2017). Interval estimation for the breakpoint in segmented regression: A smoothed score-based approach. Australian & New Zealand Journal of Statistics, 59(3): 311–322. https://doi.org/10.1111/anzs.12200
Muggeo VMR (2003). Estimating regression models with unknown break-points. Statistics in Medicine, 22(19): 3055–3071. https://doi.org/10.1002/sim.1545
Muggeo VMR, Adelfio G (2010). Efficient change point detection for genomic sequences of continuous measurements. Bioinformatics, 27(2): 161–166. https://doi.org/10.1093/bioinformatics/btq647
Pollard D, Radchenko P (2006). Nonlinear least-squares estimation. Journal of Multivariate Analysis, 97(2): 548–562. https://doi.org/10.1016/j.jmva.2005.04.002
Román-Román P, Serrano-Pérez J, Torres-Ruiz F (2019). A note on estimation of multi-sigmoidal Gompertz functions with random noise. Mathematics, 7(6): 541. https://doi.org/10.3390/math7060541
Sablik T (2013). Recession of 1981–82. https://www.federalreservehistory.org/essays/recession-of-1981-82.
Sidik K, Jonkman JN (2016). A comparison of the variance estimation methods for heteroscedastic nonlinear models. Statistics in Medicine, 35(26): 4856–4874. https://doi.org/10.1002/sim.7024
Song H, Chen H (2021). Asymptotic distribution-free changepoint detection for data with repeated observations. Biometrika, 109(3): 783–798. https://doi.org/10.1093/biomet/asab048
Truong C, Oudre L, Vayatis N (2020). Selective review of offline change point detection methods. Signal Processing, 167: 107299. https://doi.org/10.1016/j.sigpro.2019.107299
Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. (2020). SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nature Methods, 17: 261–272. https://doi.org/10.1038/s41592-019-0686-2
Yau CY, Zhao Z (2015). Inference for multiple change points in time series via likelihood ratio scan statistics. Journal of the Royal Statistical Society, Series B, Statistical Methodology, 78(4): 895–916. https://doi.org/10.1111/rssb.12139