• Title/Summary/Keyword: Matplotlib

Search Result 6, Processing Time 0.025 seconds

Design and Implementation of a Data Visualization Assessment Module in Jupyter Notebook

  • HakNeung Go;Youngjun Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.9
    • /
    • pp.167-176
    • /
    • 2023
  • In this paper, we designed and implemented a graph assessment module that can evaluate graphs in an programming assessment system based on text and numbers. The assessment method of the graph assessment module is self-evaluation that outputs two graphs generated by codes submitted by learners and by answers, automatic-evaluation that converts each graph image into an array, and gives feedback if it is wrong. The data used to generate the graph can be inputted directly or used from external data, and the method of generatng graph that can be evaluated is MATLAB style in matplotlib, and the graph shape that can be evaluated is presented in mathematics and curriculum. Through expert review, it was confirmed that the content elements of the assessment module, the possibility of learning, and the validity of the learner's needs were met. The graph assessment module developed in this study has expanded the evaluation area of the programming automatic asssessment system and is expected to help students learn data visualization.

Using Python Programming Language for Teaching Industrial Engineering Subjects: A Case Study on Engineering Economy (산업공학 전공 교과목 강의를 위한 파이썬 프로그래밍 활용: 경제성공학 교육 사례 연구)

  • Cho, Yongkyu
    • Journal of Practical Engineering Education
    • /
    • v.14 no.2
    • /
    • pp.245-258
    • /
    • 2022
  • Computational thinking with programming skills has been widely emphasized for future industrial engineering researchers and practitioners in Industry 4.0. However, industrial engineering students still have limited opportunities to improve their computational thinking abilities during university coursework. In this regard, this research study proposes to use Python programming language for teaching classical Industrial Engineering subjects. For a specific case study, we designed and instructed an Engineering Economy lecture which cultivates the concept and techniques of economic analysis for engineering students. During the class, we introduced the usage of several Python libraries that include numpy-financial for basic financial functions, numpy and scipy for simple numerical computation and analysis, and matplotlib for data visualization. Anonymous class evaluation survey showed the effectiveness of the proposed teaching method in terms of both educational satisfaction and contents delivery. Finally, we found additional needs for providing lectures that adopt the similar teaching style to the proposed method.

The Correlation Of Weather And Hanhwa Eagles (날씨와 한화 이글스의 상관관계)

  • Heo, Tai-Sung;Kang, Ha-Ram
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.237-238
    • /
    • 2021
  • 야구는 데이터 스포츠라 불릴 만큼 경기마다 많은 데이터가 생성되며, 이를 바탕으로 경기를 진행한다. 본 연구는 한국 프로야구 구단인 한화 이글스의 승률 및 타자의 성적과 날씨 사이의 상관관계를 분석하였다. 이를 위하여 한화 이글스의 승률과 타자의 성적을 한국프로야구(KBO) 공식 홈페이지 및 야구 기록 통계사이트 스탯티즈(statiz)에서 수집하였으며, 날씨 데이터는 온도와 습도를 고려한 불쾌지수 데이터를 기상청으로 부터 수집하였다. 파이선의 pandas 라이브러리를 사용하여 데이터 전처리를 실행하였다. 이후 파이선의 matplotlib 라이브러리를 이용하여 데이터 분석 및 시각화를 진행하였다. 본 연구의 분석 결과로는 불쾌지수가 보통일 때 승률이 가장 크고 높음일 때 가장 낮음을 확인할 수 있었다. 또한, 타자들의 평균 성적을 분석한 결과 보통과 매우 높음은 전체적인 타격 지수가 비슷하나 높음일 때 부진한 것으로 나왔다.

  • PDF

PyOncoPrint: a python package for plotting OncoPrints

  • Jeongbin Park;Nagarajan Paramasivam
    • Genomics & Informatics
    • /
    • v.21 no.1
    • /
    • pp.14.1-14.4
    • /
    • 2023
  • OncoPrint, the plot to visualize an overview of genetic variants in sequencing data, has been widely used in the field of cancer genomics. However, still, there have been no Python libraries capable to generate OncoPrint yet, a big hassle to plot OncoPrints within Python-based genetic variants analysis pipelines. This paper introduces a new Python package PyOncoPrint, which can be easily used to plot OncoPrints in Python. The package is based on the existing widely used scientific plotting library Matplotlib, the resulting plots are easy to be adjusted for various needs.

A Study on the Use of Location Data for Exploring Infant's Peer Relationships in Free-Choice Play Activities (자유선택놀이 활동에서 유아 또래관계 탐색을 위한 위치데이터 활용 방안 연구)

  • Kim, Jeong Kyoum;Lee, Sang-Seon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.9
    • /
    • pp.466-472
    • /
    • 2020
  • The purpose of this study is to explore how to use location data for peer relations of infants in free-choice play activities. For this study, location data was collected using wearable devices for 14 students in one class at an early childhood education institution in Chungnam. For the pre-processing of the collected location data, a smoothing technique was applied to recover missing values during the collection process, and the data was visualized using Python's Matplotlib. Subsequently, the movement distance, distance between infants, and interaction types of infants were extracted from the location data using the formula. As a result of the study, it was possible to derive 1) change in moving distance, cumulative value, average value, 2) change in distance and average distance value between infants, and 3) change and trend in interaction type according to the passage of time. These results can provide valuable information on the process of forming peer groups for infants in situations where it is difficult for a teacher to closely observe all members, and can be used as meaningful information for the design and operation of educational programs.

Semantic Visualization of Dynamic Topic Modeling (다이내믹 토픽 모델링의 의미적 시각화 방법론)

  • Yeon, Jinwook;Boo, Hyunkyung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.131-154
    • /
    • 2022
  • Recently, researches on unstructured data analysis have been actively conducted with the development of information and communication technology. In particular, topic modeling is a representative technique for discovering core topics from massive text data. In the early stages of topic modeling, most studies focused only on topic discovery. As the topic modeling field matured, studies on the change of the topic according to the change of time began to be carried out. Accordingly, interest in dynamic topic modeling that handle changes in keywords constituting the topic is also increasing. Dynamic topic modeling identifies major topics from the data of the initial period and manages the change and flow of topics in a way that utilizes topic information of the previous period to derive further topics in subsequent periods. However, it is very difficult to understand and interpret the results of dynamic topic modeling. The results of traditional dynamic topic modeling simply reveal changes in keywords and their rankings. However, this information is insufficient to represent how the meaning of the topic has changed. Therefore, in this study, we propose a method to visualize topics by period by reflecting the meaning of keywords in each topic. In addition, we propose a method that can intuitively interpret changes in topics and relationships between or among topics. The detailed method of visualizing topics by period is as follows. In the first step, dynamic topic modeling is implemented to derive the top keywords of each period and their weight from text data. In the second step, we derive vectors of top keywords of each topic from the pre-trained word embedding model. Then, we perform dimension reduction for the extracted vectors. Then, we formulate a semantic vector of each topic by calculating weight sum of keywords in each vector using topic weight of each keyword. In the third step, we visualize the semantic vector of each topic using matplotlib, and analyze the relationship between or among the topics based on the visualized result. The change of topic can be interpreted in the following manners. From the result of dynamic topic modeling, we identify rising top 5 keywords and descending top 5 keywords for each period to show the change of the topic. Existing many topic visualization studies usually visualize keywords of each topic, but our approach proposed in this study differs from previous studies in that it attempts to visualize each topic itself. To evaluate the practical applicability of the proposed methodology, we performed an experiment on 1,847 abstracts of artificial intelligence-related papers. The experiment was performed by dividing abstracts of artificial intelligence-related papers into three periods (2016-2017, 2018-2019, 2020-2021). We selected seven topics based on the consistency score, and utilized the pre-trained word embedding model of Word2vec trained with 'Wikipedia', an Internet encyclopedia. Based on the proposed methodology, we generated a semantic vector for each topic. Through this, by reflecting the meaning of keywords, we visualized and interpreted the themes by period. Through these experiments, we confirmed that the rising and descending of the topic weight of a keyword can be usefully used to interpret the semantic change of the corresponding topic and to grasp the relationship among topics. In this study, to overcome the limitations of dynamic topic modeling results, we used word embedding and dimension reduction techniques to visualize topics by era. The results of this study are meaningful in that they broadened the scope of topic understanding through the visualization of dynamic topic modeling results. In addition, the academic contribution can be acknowledged in that it laid the foundation for follow-up studies using various word embeddings and dimensionality reduction techniques to improve the performance of the proposed methodology.