DOI QR코드

DOI QR Code

Visualization analysis using R Shiny

R의 Shiny를 이용한 시각화 분석 활용 사례

  • Na, Jonghwa (Department of Information and Statistics, Chungbuk National University) ;
  • Hwang, Eunji (Korea Health Industry Development Institute)
  • Received : 2017.10.30
  • Accepted : 2017.11.22
  • Published : 2017.11.30

Abstract

R's {shiny} package provides an environment for creating web applications with only R scripts. Shiny does not require knowledge of a separate web programming language and its development is very easy and straightforward. In addition, Shiny has a variety of extensibility, and its functions are expanding day by day. Therefore, the presentation of high-quality results is an excellent tool for R-based analysts. In this paper, we present actual cases of large data analysis using Shiny. First, geological anomaly zone is extracted by analyzing topographical data expressed in the form of contour lines by analysis related to spatial data. Next, we will construct a model to predict major diseases by 16 cities and provinces nationwide using weather, environment, and social media information. In this process, we want to show that Shiny is very effective for data visualization and analysis.

R의 {shiny} 패키지는 R 스크립트만으로 웹 어플리케이션을 제작할 수 있는 환경을 제공한다. Shiny는 별도의 웹 프로그래밍 언어에 대한 지식을 요구하지 않으며 그 개발이 매우 쉽고 간명하다. 또한 Shiny는 다양한 확장성을 가지고 있으며, 그 기능이 날로 확대되고 있다. 따라서 완성도 높은 결과물의 제시가 절실한 R 기반의 분석전문가들에게는 더 없이 훌륭한 도구이다. 본 논문에서는 Shiny를 활용하여 대용량 데이터를 분석한 실제 사례를 소개한다. 먼저, 공간 자료와 관계된 분석으로 등고선 등의 형태로 표현되는 지형자료를 분석하여 지질 이상대를 추출한다. 다음으로, 기상, 환경, 소셜미디어 정보를 이용하여 전국의 16개 시, 도별 주요 질환을 예측하는 모형을 구축한다. 이 과정에서 Shiny가 데이터의 시각화와 분석에 매우 효과적임을 보이고자 한다.

Keywords

References

  1. Bivand, R. S., Pebesma, E. and Gomez-Rubio, V. (2013). Applied spatial data analysis with R, Springer.
  2. Brown, W. J. and Murdoch, D. J. (2007). A first course in statistical programming with R, Cambridge.
  3. Hong, C. S., Park, J. and Park, Y. H. (2017). Multivariate empirical distribution functions and descriptive methods. Journal of the Korean Data & Information Science Society, 28, 87-98. https://doi.org/10.7465/jkdi.2017.28.1.87
  4. Hwang, C and Shim, J. (2017). Geographically weighted least square-support vector machine. Journal of the Korean Data & Information Science Society, 28, 227-235. https://doi.org/10.7465/jkdi.2017.28.1.227
  5. Hwang, E. J. and Na, J. H. (2015). Influenza prediction models by using meteorological and social media informations. Journal of the Korean Data & Information Science Society, 26, 1087-1095. https://doi.org/10.7465/jkdi.2015.26.5.1087
  6. Isaaks, E. H. and Srivastava, R. M. (1989). Applied geostatistics, Oxford University Press.
  7. Na, J. H. (2016a). R visualization and statistical data analysis I, Free Academy Press.
  8. Na, J. H. (2016b). R visualization and statistical data analysis II, Free Academy Press.
  9. Ripley, B. D. (1981). Spatial statistics, Wiley, New York.
  10. Rizzo, M. L. (2007). Statistical computing with R, Chapman & Hall.
  11. Torgo, L. (2011). Data mining with R, CRC Press.
  12. Tukey, J. (1977). Exploratory data analysis, Addison-Wesley.
  13. Verzani, J. (2005). Using R for introductory statistics, Chapman & Hall.