• 제목/요약/키워드: Machine Learning

Search Result 5,492, Processing Time 0.035 seconds

Trends in Artificial Intelligence Applications in Clinical Trials: An analysis of ClinicalTrials.gov (임상시험에서 인공지능의 활용에 대한 분석 및 고찰: ClinicalTrials.gov 분석)

  • Jeong Min Go;Ji Yeon Lee;Yun-Kyoung Song;Jae Hyun Kim
    • Korean Journal of Clinical Pharmacy
    • /
    • v.34 no.2
    • /
    • pp.134-139
    • /
    • 2024
  • Background: Increasing numbers of studies and research about artificial intelligence (AI) and machine learning (ML) have led to their application in clinical trials. The purpose of this study is to analyze computer-based new technologies (AI/ML) applied on clinical trials registered on ClinicalTrials.gov to elucidate current usage of these technologies. Methods: As of March 1st, 2023, protocols listed on ClinicalTrials.gov that claimed to use AI/ML and included at least one of the following interventions-Drug, Biological, Dietary Supplement, or Combination Product-were selected. The selected protocols were classified according to their context of use: 1) drug discovery; 2) toxicity prediction; 3) enrichment; 4) risk stratification/management; 5) dose selection/optimization; 6) adherence; 7) synthetic control; 8) endpoint assessment; 9) postmarketing surveillance; and 10) drug selection. Results: The applications of AI/ML were explored in 131 clinical trial protocols. The areas where AI/ML was most frequently utilized in clinical trials included endpoint assessment (n=80), followed by dose selection/optimization (n=15), risk stratification/management (n=13), drug discovery (n=4), adherence (n=4), drug selection (n=1) and enrichment (n=1). Conclusion: The most frequent application of AI/ML in clinical trials is in the fields of endpoint assessment, where the utilization is primarily focuses on the diagnosis of disease by imaging or video analyses. The number of clinical trials using artificial intelligence will increase as the technology continues to develop rapidly, making it necessary for regulatory associates to establish proper regulations for these clinical trials.

A Study on Impacts of De-identification on Machine Learning's Biased Knowledge (머신러닝 편향성 관점에서 비식별화의 영향분석에 대한 연구)

  • Soohyeon Ha;Jinsong Kim;Yeeun Son;Gaeun Won;Yujin Choi;Soyeon Park;Hyung-Jong Kim;Eunsung Kang
    • Journal of the Korea Society for Simulation
    • /
    • v.33 no.2
    • /
    • pp.27-35
    • /
    • 2024
  • We aimed to shed light on the issue of perpetuating societal disparities by analyzing the impact of inherent biases present in datasets used for training artificial intelligence models on the predictions generated by Artificial Intelligence(AI). Therefore, to examine the influence of data bias on AI models, we constructed an original dataset containing biases related to gender wage gaps and subsequently created a de-identified dataset. Additionally, by utilizing the decision tree algorithm, we compared the outputs of AI models trained on both the original and de-identified datasets, aiming to analyze how data de-identification affects the biases in the results produced by artificial intelligence models. Through this, our goal was to highlight the significant role of data de-identification not only in safeguarding individual privacy but also in addressing biases within the data.

ML-based prediction method for estimating vortex-induced vibration amplitude of steel tubes in tubular transmission towers

  • Jiahong Li;Tao Wang;Zhengliang Li
    • Structural Engineering and Mechanics
    • /
    • v.90 no.1
    • /
    • pp.27-40
    • /
    • 2024
  • The prediction of VIV amplitude is essential for the design and fatigue life estimation of steel tubes in tubular transmission towers. Limited to costly and time-consuming traditional experimental and computational fluid dynamics (CFD) methods, a machine learning (ML)-based method is proposed to efficiently predict the VIV amplitude of steel tubes in transmission towers. Firstly, by introducing the first-order mode shape to the two-dimensional CFD method, a simplified response analysis method (SRAM) is presented to calculate the VIV amplitude of steel tubes in transmission towers, which enables to build a dataset for training ML models. Then, by taking mass ratio M*, damping ratio ξ, and reduced velocity U* as the input variables, a Kriging-based prediction method (KPM) is further proposed to estimate the VIV amplitude of steel tubes in transmission towers by combining the SRAM with the Kriging-based ML model. Finally, the feasibility and effectiveness of the proposed methods are demonstrated by using three full-scale steel tubes with C-shaped, Cross-shaped, and Flange-plate joints, respectively. The results show that the SRAM can reasonably calculate the VIV amplitude, in which the relative errors of VIV maximum amplitude in three examples are less than 6%. Meanwhile, the KPM can well predict the VIV amplitude of steel tubes in transmission towers within the studied range of M*, ξ and U*. Particularly, the KPM presents an excellent capability in estimating the VIV maximum amplitude by using the reduced damping parameter SG.

A Study on Trend Using Time Series Data (시계열 데이터 활용에 관한 동향 연구)

  • Shin-Hyeong Choi
    • Advanced Industrial SCIence
    • /
    • v.3 no.1
    • /
    • pp.17-22
    • /
    • 2024
  • History, which began with the emergence of mankind, has a means of recording. Today, we can check the past through data. Generated data may only be generated and stored at a certain moment, but it is not only continuously generated over a certain time interval from the past to the present, but also occurs in the future, so making predictions using it is an important task. In order to find out trends in the use of time series data among numerous data, this paper analyzes the concept of time series data, analyzes Recurrent Neural Network and Long-Short Term Memory, which are mainly used for time series data analysis in the machine learning field, and analyzes the use of these models. Through case studies, it was confirmed that it is being used in various fields such as medical diagnosis, stock price analysis, and climate prediction, and is showing high predictive results. Based on this, we will explore ways to utilize it in the future.

Classification of Characteristics in Two-Wheeler Accidents Using Clustering Techniques (클러스터링 기법을 이용한 이륜차 사고의 특징 분류)

  • Heo, Won-Jin;Kang, Jin-ho;Lee, So-hyun
    • Knowledge Management Research
    • /
    • v.25 no.1
    • /
    • pp.217-233
    • /
    • 2024
  • The demand for two-wheelers has increased in recent years, driven by the growing delivery culture, which has also led to a rise in the number of two-wheelers. Although two-wheelers are economically efficient in congested traffic conditions, reckless driving and ambiguous traffic laws for two-wheelers have turned two-wheeler accidents into a significant social issue. Given the high fatality rate associated with two-wheelers, the severity and risk of two-wheeler accidents are considerable. It is, therefore, crucial to thoroughly understand the characteristics of two-wheeler accidents by analyzing their attributes. In this study, the characteristics of two-wheeled vehicle accidents were categorized using the K-prototypes algorithm, based on data from two-wheeled vehicle accidents. As a result, the accidents were divided into four clusters according to their characteristics. Each cluster showed distinct traits in terms of the roads where accidents occurred, the major laws violated, the types of accidents, and the times of accident occurrences. By tailoring enforcement methods and regulations to the specific characteristics of each type of accident, we can reduce the incidence of accidents involving two-wheelers in metropolitan areas, thereby enhancing road safety. Furthermore, by applying machine learning techniques to urban transportation and safety, this study adds to the body of related literature.

Forecasting the Business Performance of Restaurants on Social Commerce

  • Supamit BOONTA;Kanjana HINTHAW
    • Journal of Distribution Science
    • /
    • v.22 no.4
    • /
    • pp.11-22
    • /
    • 2024
  • Purpose: This research delves into the various factors that influence the performance of restaurant businesses on social commerce platforms in Bangkok, Thailand. The study considers both internal and external factors, including but not limited to business characteristics and location. Moreover, this research also analyzes the effects of employing multiple social commerce platforms on business efficiency and explores the underlying reasons for such effects. Research design, data, and methodology: Restaurants can be classified into different price ranges: low, medium, and high. To further investigate, we employed natural language processing AI to analyze online reviews and evaluate algorithm performance using machine learning techniques. We aimed to develop a model to gauge customer satisfaction with restaurants across different price categories effectively. Results: According to the research findings, several factors significantly impact restaurant groups in the low and mid-price ranges. Among these factors are population density and the number of seats at the restaurant. On the other hand, in the mid-and high-price ranges, the price levels of the food and drinks offered by the restaurant play a crucial role in determining customer satisfaction. Furthermore, the correlation between different social commerce platforms can significantly affect the business performance of high-price range restaurant groups. Finally, the level of online review sentiment has been found to influence customer decision-making across all restaurant types significantly. Conclusions: The study emphasizes that restaurants' characteristics based on their price level differ significantly, and social commerce platforms have the potential to affect one another. It is worth noting that the sentiment expressed in online reviews has a more significant impact on customer decision-making than any other factor, regardless of the type of restaurant in question.

Predicting restraining effects in CFS channels: A machine learning approach

  • Seyed Mohammad Mojtabaei;Rasoul Khandan;Iman Hajirasouliha
    • Steel and Composite Structures
    • /
    • v.51 no.4
    • /
    • pp.441-456
    • /
    • 2024
  • This paper aims to develop Machine Learning (ML) algorithms to predict the buckling resistance of cold-formed steel (CFS) channels with restrained flanges, widely used in typical CFS sheathed wall panels, and provide practical design tools for engineers. The effects of cross-sectional restraints were first evaluated on the elastic buckling behaviour of CFS channels subjected to pure axial compressive load or bending moment. Feedforward multi-layer Artificial Neural Networks (ANNs) were then trained on different datasets comprising CFS channels with various dimensions and properties, plate thicknesses, and restraining conditions on one or two flanges, while the elastic distortional buckling resistance of the elements were determined according to the Finite Strip Method (FSM). To develop less biased networks and ensure that every observation from the original dataset has the chance of appearing in the training and test set, a K-fold cross-validation technique was implemented. In addition, the hyperparameters of the ANNs were tuned using a grid search technique to provide ANNs with optimum performances. The results demonstrated that the trained ANNs were able to predict the elastic distortional buckling resistance of CFS flange-restrained elements with an average accuracy of 99% in terms of coefficient of determination. The developed models were then used to propose a simple ANN-based design formula for the prediction of the elastic distortional buckling stress of CFS flange-restrained elements. Finally, the proposed formula was further evaluated on a separate set of unseen data to ensure its accuracy for practical applications.

Defect Prediction and Variable Impact Analysis in CNC Machining Process (CNC 가공 공정 불량 예측 및 변수 영향력 분석)

  • Hong, Ji Soo;Jung, Young Jin;Kang, Sung Woo
    • Journal of Korean Society for Quality Management
    • /
    • v.52 no.2
    • /
    • pp.185-199
    • /
    • 2024
  • Purpose: The improvement of yield and quality in product manufacturing is crucial from the perspective of process management. Controlling key variables within the process is essential for enhancing the quality of the produced items. In this study, we aim to identify key variables influencing product defects and facilitate quality enhancement in CNC machining process using SHAP(SHapley Additive exPlanations) Methods: Firstly, we conduct model training using boosting algorithm-based models such as AdaBoost, GBM, XGBoost, LightGBM, and CatBoost. The CNC machining process data is divided into training data and test data at a ratio 9:1 for model training and test experiments. Subsequently, we select a model with excellent Accuracy and F1-score performance and apply SHAP to extract variables influencing defects in the CNC machining process. Results: By comparing the performances of different models, the selected CatBoost model demonstrated an Accuracy of 97% and an F1-score of 95%. Using Shapley Value, we extract key variables that positively of negatively impact the dependent variable(good/defective product). We identify variables with relatively low importance, suggesting variables that should be prioritized for management. Conclusion: The extraction of key variables using SHAP provides explanatory power distinct from traditional machine learning techniques. This study holds significance in identifying key variables that should be prioritized for management in CNC machining process. It is expected to contribute to enhancing the production quality of the CNC machining process.

Enhanced Machine Learning Preprocessing Techniques for Optimization of Semiconductor Process Data in Smart Factories (스마트 팩토리 반도체 공정 데이터 최적화를 위한 향상된 머신러닝 전처리 방법 연구)

  • Seung-Gyu Choi;Seung-Jae Lee;Choon-Sung Nam
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.4
    • /
    • pp.57-64
    • /
    • 2024
  • The introduction of Smart Factories has transformed manufacturing towards more objective and efficient line management. However, most companies are not effectively utilizing the vast amount of sensor data collected every second. This study aims to use this data to predict product quality and manage production processes efficiently. Due to security issues, specific sensor data could not be verified, so semiconductor process-related training data from the "SAMSUNG SDS Brightics AI" site was used. Data preprocessing, including removing missing values, outliers, scaling, and feature elimination, was crucial for optimal sensor data. Oversampling was used to balance the imbalanced training dataset. The SVM (rbf) model achieved high performance (Accuracy: 97.07%, GM: 96.61%), surpassing the MLP model implemented by "SAMSUNG SDS Brightics AI". This research can be applied to various topics, such as predicting component lifecycles and process conditions.

Development of a predictive model for hypoxia due to sedatives in gastrointestinal endoscopy: a prospective clinical study in Korea

  • Jung Wan Choe;Jong Jin Hyun;Seong-Jin Son;Seung-Hak Lee
    • Clinical Endoscopy
    • /
    • v.57 no.4
    • /
    • pp.476-485
    • /
    • 2024
  • Background/Aims: Sedation has become a standard practice for patients undergoing gastrointestinal (GI) endoscopy. However, considering the serious cardiopulmonary adverse events associated with sedatives, it is important to identify patients at high risk. Machine learning can generate reasonable prediction for a wide range of medical conditions. This study aimed to evaluate the risk factors associated with sedation during GI endoscopy and develop a predictive model for hypoxia during endoscopy under sedation. Methods: This prospective observational study enrolled 446 patients who underwent sedative endoscopy at the Korea University Ansan Hospital. Clinical data were used as predictor variables to construct predictive models using the random forest method that is a machine learning algorithm. Results: Seventy-two of the 446 patients (16.1%) experienced life-threatening hypoxia requiring immediate medical intervention. Patients who developed hypoxia had higher body weight, body mass index (BMI), neck circumference, and Mallampati scores. Propofol alone and higher initial and total dose of propofol were significantly associated with hypoxia during sedative endoscopy. Among these variables, high BMI, neck circumference, and Mallampati score were independent risk factors for hypoxia. The area under the receiver operating characteristic curve for the random forest-based predictive model for hypoxia during sedative endoscopy was 0.82 (95% confidence interval, 0.79-0.86) and displayed a moderate discriminatory power. Conclusions: High BMI, neck circumference, and Mallampati score were independently associated with hypoxia during sedative endoscopy. We constructed a model with acceptable performance for predicting hypoxia during sedative endoscopy.