Bayesian Multiple Change-Point Estimation and Segmentation Kim, Jaehee; Cheon, Sooyoung;
This study presents a Bayesian multiple change-point detection approach to segment and classify the observations that no longer come from an initial population after a certain time. Inferences are based on the multiple change-points in a sequence of random variables where the probability distribution changes. Bayesian multiple change-point estimation is classifies each observation into a segment. We use a truncated Poisson distribution for the number of change-points and conjugate prior for the exponential family distributions. The Bayesian method can lead the unsupervised classification of discrete, continuous variables and multivariate vectors based on latent class models; therefore, the solution for change-points corresponds to the stochastic partitions of observed data. We demonstrate segmentation with real data.
BIC;multiple change-points;segmentation;stochastic approximation Monte Carlo;
Adams, R. and Mackay, D. (2007). Bayesian Online Changepoint Detection, University of Cambridge Technical Report.
Barry, D. and Hartigan, J. (1993). A Bayesian analysis for change-point problems, Journal of the American Statistical Association, 88, 309-319.
Bartels, R. (1982). The rank version of von Neumann's Ratio test for randomness, Journal of the American Statistical Association, 77, 40-46.
Belisle, P., Joseph, L., MacGibbon, B., Wolfson, D. and du Berger, R. (1998). Change-point analysis of neuron spike train data, Biometrics, 54, 113-123.
Carlin, B., Gelfand, A. and Smith, A. F. M. (1992). Hierarchical Bayesian analysis of change point problems, Applied Statistics, 41, 389-405.
Cheon, S. and Kim, J. (2010). Multiple change-point detection of multivariate mean vectors with Bayesian approach, Computational Statistics & Data Analysis, 54, 406-415.
Chernoff, H. and Zacks, S. (1964). Estimating the current mean of a normal distribution which is subject to changes in time, Annals of Mathematical Statistics, 35, 999-1018.
Chib, S. (1998). Estimation and comparison of multiple change-point models, Journal of Econometrics, 86, 221-241.
Corander, J., Gyllenberg, M. and Koski, T. (2009). Bayesian unsupervised classification framework based on stochastic partitions of data and a parallel search strategy, Advanced Data Analysis and Classification, 3, 3-24.
Cox, D. and Lewis, P. (1966). Methuen's Monographs on Applied Probability and Statistics, In The Statistical Analysis of Series of Events, John Wiley, London.
Fearnhead, P. (2006). Exact and efficient Bayesian inference for multiple changepoint problems, Statistics and Computing, 16, 203-213.
Fearnhead, P. and Clifford, P. (2003). Online inference for hidden Markov models, Journal of the Royal Statistical Society Series B, 65, 887-899.
Gelfand, A. and Smith, A. (1990). Sampling-based approaches to calculating marginal densities, Journal of the American Statistical Association, 85, 398-409.
Gelman, A., Carlin, J., Stern, H. and Rubin, D. (2003). Bayesian Data Analysis, Chapman and Hall, New York.
Green, P. (1995). Reversible jump Markov chain Monte Carlo computation and Bayesian model determination, Biometrika, 82, 711-732.
Hastings, W. K. (1970). Monte Carlo sampling methods using Markov Chains and their applications, Biometrika, 57, 97-109.
Hawkins, D. (2001). Finding multiple change-point models to data, Computational Statistics & Data Analysis, 37, 323-341.
Hinkley, D. (1970). Inference about the change-point in a sequence of random variables, Biometrika, 57, 1-17.
Hinkley, D. (1972). Time-ordered classification, Biometrika, 59, 509-523.
Holmes, D. and Mergen, A. (1993). Improving the performance of the $T^2$ control chart, Quality Engineering, 5, 619-625.
Hsu, D. (1979). Detecting shifts of parameter in gamma sequences with applications to stock price and air traffic flow analysis, Journal of the American Statistical Association, 74, 31-40.
Jarrett, R. (1979). A note on the intervals between coal-mining disasters, Biometrika, 66, 191-193.
Kass, R. and Raftery, A. (1995). Bayes factors, Journal of the American Statistical Association, 90, 773-795.
Kim, J. and Cheon, S. (2010). Bayesian multiple change-point estimation with annealing stochastic approximation Monte Carlo, Computational Statistics, 25, 215-239.
Kojadinovic, I. and Holmes, M. (2009). Tests of independence among continuous random vectors based on Cramer-von Mises functionals of the empirical copula process, Journal of Multivariate Analysis, 100, 1137-1154.
Lee, C.-B. (1998). Bayesian analysis of a change-point in exponential families with applications, Computational Statistics & Data Analysis, 27, 195-208.
Liang, F., Liu, R. and Carroll, R. (2007). Stochastic approximation in Monte Carlo computation, Journal of the American Statistical Association, 102, 305-320.
Maguire, B., Pearson, E. and Wynn, A. (1952). The time intervals between industrial accidents, Biometrika, 38, 168-180.
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H. and Teller, E. (1953). Equations of state calculations by fast computing machines, Journal of Chemical Physics, 21, 1087-1091.
O Ruanaidh, J. and Fitzgerald, W. (1996). Bayesian Online Changepoint Detection, Springer, New York.
Raftery, A. (1995). Bayesian model selection in social research, In Marsden V (Ed) Sociological methodology, 25, 111-163.
Raftery, A. and Akman, V. (1986). Bayesian analysis of a Poisson process with a change-point, Biometrika, 73, 85-89.
Robbins, H. and Monro, S. (1951). A stochastic approximation method, Annals of Mathematical Statistics, 22, 400-407.
Siegmund D. (1988). Confidence sets in change point problem, International Statistical Review, 56, 31-48.
Smith, A. F. M. (1975). A Bayesian approach to inference about a change-point in a sequence of random variables, Biometrika, 62, 407-416.