Comparing Accuracy of Imputation Methods for Incomplete Categorical Data

Shin, Hyung-Won;Sohn, So-Young;

Proceedings of the Korean Statistical Society Conference (한국통계학회:학술대회논문집)

2003.05a
/
Pages.237-242
/
2003

The Korean Statistical Society (한국통계학회)

Comparing Accuracy of Imputation Methods for Incomplete Categorical Data

Shin, Hyung-Won (Dept. of Computer Science & Industrial Systems Engineering, Yonsei University) ;
Sohn, So-Young (Dept. of Computer Science & Industrial Systems Engineering, Yonsei University)

Published : 2003.05.23

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Various kinds of estimation methods have been developed for imputation of categorical missing data. They include modal category method, logistic regression, and association rule. In this study, we propose two imputation methods (neural network fusion and voting fusion) that combine the results of individual imputation methods. A Monte-Carlo simulation is used to compare the performance of these methods. Five factors used to simulate the missing data are (1) true model for the data, (2) data size, (3) noise size (4) percentage of missing data, and (5) missing pattern. Overall, neural network fusion performed the best while voting fusion is better than the individual imputation methods, although it was inferior to the neural network fusion. Result of an additional real data analysis confirms the simulation result.

Proceedings of the Korean Statistical Society Conference (한국통계학회:학술대회논문집)

Comparing Accuracy of Imputation Methods for Incomplete Categorical Data

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)