Advanced SearchSearch Tips
Recommendation of Personalized Surveillance Interval of Colonoscopy via Survival Analysis
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
 Title & Authors
Recommendation of Personalized Surveillance Interval of Colonoscopy via Survival Analysis
Gu, Jayeon; Kim, Eun Sun; Kim, Seoung Bum;
  PDF(new window)
A colonoscopy is important because it detects the presence of polyps in the colon that can lead to colon cancer. How often one needs to repeat a colonoscopy may depend on various factors. The main purpose of this study is to determine personalized surveillance interval of colonoscopy based on characteristics of patients including their clinical information. The clustering analysis using a partitioning around medoids algorithm was conducted on 625 patients who had a medical examination at Korea University Anam Hospital and found several subgroups of patients. For each cluster, we then performed survival analysis that provides the probability of having polyps according to the number of days until next visit. The results of survival analysis indicated that different survival distributions exist among different patients` groups. We believe that the procedure proposed in this study can provide the patients with personalized medical information about how often they need to repeat a colonoscopy.
Surveillance Interval;Survival Analysis;Patients Clustering;Kaplan-Meier Estimator;Log-Rank Test;Decision Tree;Colonoscopy;
 Cited by
Bhambhri, A. (2011), Smarter Analytics for Big Data, IBM.

Banez, L. L., Prasanna, P., Sun, L., Ali, A., Zou, Z., Adam, B. L., and Srivastava, S. (2003), Diagnostic potential of serum proteomic patterns in prostate cancer, The Journal of urology, 170(2), 442-446. crossref(new window)

Bender, M., Klein, R., Disch, A., and Ebert, A. (2000), A functional framework for web-based information visualization systems, Visualization and Computer Graphics, IEEE Transactions, 6(1), 8-23. crossref(new window)

Berry, M. J. and Linoff, G. (1997), Data mining techniques : for marketing, sales, and customer support, John Wiley and Sons, Inc.

Borg, I. and Groenen, P. J. (2005), Modern multidimensional scaling : Theory and applications, Springer Science and Business Media.

Breiman, L., Friedman, J., Stone, C. J., and Olshen, R. A. (1984), Classification and regression trees, CRC press.

Burroni, M., Corona, R., Dell'Eva, G., Sera, F., Bono, R., Puddu, P., and Rubegni, P. (2004), Melanoma computer-aided diagnosis reliability and feasibility study, Clinical cancer research, 10(6), 1881-1886. crossref(new window)

Chen, M.Y. (2002), Survival duration of plants : evidence from the US petroleum refining industry, International Journal of Industrial Organization, 20(4), 517-555. crossref(new window)

Cho, I. S. and Chung, E. (2011), Predictive bayesian network model using electronic patient records for prevention of hospital-acquired pressure ulcers, Journal of Korean Academy of Nursing, 41(3), 423-431. crossref(new window)

Choi, J., Han, S., Kang, H., and Kim, E. (1998), Data mining decision tree analysis using answer tree, SPSS Academy, 17-23.

Christodoulou, C. and Pattichis, C. S. (1999), Unsupervised pattern recognition for the classification of EMG signals, Biomedical Engineering, IEEE Transactions, 46(2), 169-178. crossref(new window)

Curram, S. P. and Mingers, J. (1994), Neural networks, decision tree induction and discriminant analysis : An empirical comparison, Journal of the Operational Research Society, 45(4), 440-450. crossref(new window)

Goldman, L., Cook, E. F., Brand, D. A., Lee, T. H., Rouan, G. W., Weisberg, M. C., and Jakubowski, R. (1988), A computer protocol to predict myocardial infarction in emergency department patients with chest pain, New England Journal of Medicine, 318(13), 797-803. crossref(new window)

Gorden, A. D. (1999), Classification, Chapman and Hall/CRC.

Gower, J. C. (1971), A general coefficient of similarity and some of its properties, Biometrics, 27(4), 857-871. crossref(new window)

Hastie, T., Friedman, J., and Tibshirani, R. (2001), The elements of statistical learning, Springer.

Han, P. and Baek, J. G. (2014), Prediction model on delivery time in display FAB using survival analysis, Journal of the Korea Institute of Institute of Industrial Engineers, 40(3), 283-290. crossref(new window)

Hartigan, J. A. (1975), Clustering algorithms, John Wiley and Sons, Inc.

Hong, S. N., Yang, D. H., Kim, Y. H., Hong, S. P., Shin, S. J., Kim, S. E., and Yang, S. K. (2012), Korean guidelines for post-polypectomycolonoscopic surveillance, The Korean Journal of Gastroenterology, 59(2), 99-117. crossref(new window)

Hosmer Jr, D. W., Lemeshow, S., and May, S. (2011), Applied survivalanalysis : regression modeling of time to event data,

Jo, I. and Kim, J. (2011), Trend research-based clinical decision support systems based on Electronic Health Records, Communications of the Korean Institute of Information Scientists and Engineers, 29(2), 92-100.

Joo, S., Yang, Y. S., Moon, W. K., and Kim, H. C. (2004), Computer-aided diagnosis of solid breast nodules : use of an artificial neural network based on multiple sonographic features, Medical Imaging, IEEE Transactions, 23(10), 1292-1300. crossref(new window)

Jung, K. W., Won, Y. J., Kong, H. J., Oh, C. M., Cho, H., Lee, D. H., and Lee, K. H. (2015), Cancer statistics in Korea : incidence, mortality, survival, and prevalence in 2012, Cancer research and treatment : official journal of Korean Cancer Association, 47(2), 127. crossref(new window)

Kalbfleisch, J. D. and Prentice, R. L. (2011), The statistical analysis of failure time data, John Wiley and Sons.

Kaplan, E. L. and Meier, P. (1958), Nonparametric estimation from incomplete observations, Journal of the American statistical association, 53(282), 457-481. crossref(new window)

Kaufman, L. and Rousseeuw, P. J. (2009), Finding groups in data : an introduction to cluster analysis, John Wiley and Sons.

Lee, B. and Jung, S. (2002) Korean National Guidelines on Screening and Surveillance for Early Detection of Colorectal Cancers (KSCP and NCC), Korean Society of Gastointestinal Endoscopy, 45(8), 981-891.

Lee, Y. (2010), Study on Prediction Model of insolvent companies using survival analysis techniques Guarantee, Korean Market Economy Research, 39(3), 1-24.

Lieberman, D. A., Rex, D. K., Winawer, S. J., Giardiello, F. M., Johnson, D. A., and Levin, T. R. (2012), Guidelines for colonoscopy surveillance after screening and polypectomy : a consensus update by the US Multi-Society Task Force on Colorectal Cancer, Gastroenterology, 143(3), 844-857. crossref(new window)

Mantel, N. (1966), Evaluation of survival data and two new rank order statistics arising in its consideration, Cancer chemotherapy reports, 50(3), 163-170.

National Cancer Center (2011), National Cancer Control Project, Available at :

Patil, N. N., Mottrie, A., Sundaram, B., and Patel, V. R. (2008), Robotic-assisted laparoscopic ureteral reimplantation with psoas hitch: a multi-institutional, multinational evaluation, Urology, 72(1), 47-50. crossref(new window)

Perou, C. M., Sorlie, T., Eisen, M. B., van de Rijn, M., Jeffrey, S. S., Rees, C. A., and Botstein, D. (2000), Molecular portraits of human breast tumours, Nature, 406(6797), 747-752. crossref(new window)

Ries, L. A. G., Melbert, D., and Krapcho, M. (2007), SEER Cancer Statistics Review, 1975-2004, Bethesda, MD: National Cancer Institute, based on November 2006 SEER data submission, posted to the SEER Web site.

Rousseeuw, P. J. (1987), Silhouettes : a graphical aid to the interpretation and validation of cluster analysis, Journal of computational and applied mathematics, 20, 53-65. crossref(new window)

South Korea Statistics (2013), Year mortality statistics.

Strober, M., Freeman, R., and Morrell, W. (1997), The long-term course of severe anorexia nervosa in adolescents : Survival analysis of recovery, relapse, and outcome predictors over 10-5 years in a prospective study, International Journal of Eating Disorders, 22(4), 339-360. crossref(new window)

Thiis-Evensen, E., Hoff, G. S., Sauar, J., Langmark, F., Majak, B. M., and Vatn, M. H. (1999), Population-based surveillance by colonoscopy : effect on the incidence of colorectal cancer : Telemark Polyp Study I, Scandinavian journal of gastroenterology, 34(4), 414-420. crossref(new window)

Winawer, S. J., Zauber, A. G., Ho, M. N., O'Brien, M. J., Gottlieb, L. S., Sternberg, S. S., and Stewart, E. T. (1993), Prevention of colorectal cancer by colonoscopic polypectomy, New England Journal of Medicine, 329(27), 1977-1981. crossref(new window)

Ziegel, E. R. (1997), Survival analysis using the SAS system, Technometrics, 39(3), 344.