• Title/Summary/Keyword: Hot deck

Search Result 34, Processing Time 0.021 seconds

Imputation of Missing Data Based on Hot Deck Method Using K-nn (K-nn을 이용한 Hot Deck 기반의 결측치 대체)

  • Kwon, Soonchang
    • Journal of Information Technology Services
    • /
    • v.13 no.4
    • /
    • pp.359-375
    • /
    • 2014
  • Researchers cannot avoid missing data in collecting data, because some respondents arbitrarily or non-arbitrarily do not answer questions in studies and experiments. Missing data not only increase and distort standard deviations, but also impair the convenience of estimating parameters and the reliability of research results. Despite widespread use of hot deck, researchers have not been interested in it, since it handles missing data in ambiguous ways. Hot deck can be complemented using K-nn, a method of machine learning, which can organize donor groups closest to properties of missing data. Interested in the role of k-nn, this study was conducted to impute missing data based on the hot deck method using k-nn. After setting up imputation of missing data based on hot deck using k-nn as a study objective, deletion of listwise, mean, mode, linear regression, and svm imputation were compared and verified regarding nominal and ratio data types and then, data closest to original values were obtained reasonably. Simulations using different neighboring numbers and the distance measuring method were carried out and better performance of k-nn was accomplished. In this study, imputation of hot deck was re-discovered which has failed to attract the attention of researchers. As a result, this study shall be able to help select non-parametric methods which are less likely to be affected by the structure of missing data and its causes.

Imputation Methods for the Population and Housing Census 2000 in Korea

  • Kim, Young-Won;Ryu, Jeabok;Park, Jinwoo;Lee, Jaewon
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.575-583
    • /
    • 2003
  • We proposed imputation strategies for the Population and Housing Census 2000 in Korea. The total area of floor space and marital status which have relatively high non-response rates in the Census are considered to develope the effective missing value imputation procedures. The Classification and Regression Tree(CART) is employed to construct the imputation cells for hot-deck imputation, as well as to predict missing value by model-based approach. We compare three imputation methods which include CART model-based imputation, hot-deck imputation based on CART and logical hot-deck imputation proposed by The Korea National Statistical Office. The results suggest that the proposed hot-deck imputation based on CART is very efficient and strongly recommendable.

Missing Value Imputation Method Using CART : For Marital Status in the Population and Housing Census (CART를 활용한 결측값 대체방법 : 인구주택총조사 혼인상태 항목을 중심으로)

  • 김영원;이주원
    • Survey Research
    • /
    • v.4 no.2
    • /
    • pp.1-21
    • /
    • 2003
  • We proposed imputation strategies for marital status in the Population and Housing Census 2000 in Korea to illustrate the effective missing value imputation methods for social survey. The marital status which have relatively high non-response rates in the Census are considered to develope the effective missing value imputation procedures. The Classification and Regression Tree(CART)is employed to construct the imputation cells for hot-deck imputation, as well as to predict the missing value by model-based approach. We compare to imputation methods which include the CART model-based imputation and the sequential hot-deck imputation based on CART. Also we check whether different modeling for each region provides the more improved results. The results suggest that the proposed hot-deck imputation based on CART is very efficient and strongly recommendable. And the results show that different modeling for each region is not necessary.

  • PDF

Jackknife Variance Estimation under Imputation for Nonrandom Nonresponse with Follow-ups

  • Park, Jinwoo
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.4
    • /
    • pp.385-394
    • /
    • 2000
  • Jackknife variance estimation based on adjusted imputed values when nonresponse is nonrandom and follow-up data are available for a subsample of nonrespondents is provided. Both hot-deck and ratio imputation method are considered as imputation method. The performance of the proposed variance estimator under nonrandom response mechanism is investigated through numerical simulation.

  • PDF

REGRESSION FRACTIONAL HOT DECK IMPUTATION

  • Kim, Jae-Kwang
    • Journal of the Korean Statistical Society
    • /
    • v.36 no.3
    • /
    • pp.423-434
    • /
    • 2007
  • Imputation using a regression model is a method to preserve the correlation among variables and to provide imputed point estimators. We discuss the implementation of regression imputation using fractional imputation. By a suitable choice of fractional weights, the fractional regression imputation can take the form of hot deck fractional imputation, thus no artificial values are constructed after the imputation. A variance estimator, which extends the method of Kim and Fuller (2004), is also proposed. Results from a limited simulation study are presented.

Fatigue Strength and Root-Deck Crack Propagation for U-Rib to Deck Welded Joint in Steel Box Girder

  • Zhiyuan, YuanZhou;Bohai, Ji;Di, Li;Zhongqiu, Fu
    • International journal of steel structures
    • /
    • v.18 no.5
    • /
    • pp.1589-1597
    • /
    • 2018
  • Fatigue tests and numerical analysis were carried out to evaluate the fatigue performance at the U-rib to deck welded joint in steel box girder. Twenty specimens were tested corresponding to different penetration rates (80 and 100%) under fatigue bending load, and the fatigue strength was investigated based on hot spot stress (HSS) method. The detailed stress distribution at U-rib to deck welded joint was analyzed by the finite element method, as well as the stress intensity factor of weld root. The test results show that the specimens with fully penetration rate have longer crack propagation life due to the welding geometry, resulting in higher fatigue failure strength. The classification of FAT-90 is reasonable for evaluating fatigue strength by HSS method. The penetration rate has effect on crack propagation angle near the surface, and the 1-mm stress below weld toe and root approves to be more suitable for fatigue stress assessment, because of its high sensitivity to weld geometry than HSS.

Research on rib-to-diaphragm welded connection by means of hot spot stress approach

  • Wang, Binhua;Lu, Pengmin;Shao, Yuhong
    • Steel and Composite Structures
    • /
    • v.18 no.1
    • /
    • pp.135-148
    • /
    • 2015
  • The cutout hole locating at the place of rib-to-diaphragm welded connection is adopted to minimize the restraint, which is caused by the floor-beam web to rib rotation at the support due to the unsymmetrical loads in orthotropic deck. In practice, an inevitable problem is that there is a large number of welding joint's cracks formed at the edge of cutout hole. In this study, a comparative experiment is carried out with two types of cutout hole, the circular arc transition and the vertical transition. The fatigue life estimation of specimens is investigated with the application of the structural hot spot stress approach by finite element analyses. The results are compared with the ones of the fatigue tests which are carried out on these full-scale specimens. Factors affecting the stress range are also studied.

Weighted Hot-Deck Imputation in Farm and Fishery Household Economy Surveys (농어가경제조사에서 가중핫덱 무응답 대체법의 활용)

  • Kim Kyu-Seong;Lee Kee-Jae;Kim Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.311-328
    • /
    • 2005
  • This paper deals with a treatment of nonresponse in farm and fishery household economy surveys in Korea. Since the samples in two surveys were selected by stratified multi-stage sampling and weighted sample means has been used to estimate the population means, we choose a weighted hot-deck imputation method as an appropriate method for two surveys. We investigate the procedure of the weighted hot-deck as well as an adjusted jackknife method for variance estimation. Through an empirical study we found that the method worked very well in both mean and variance estimation in two surveys. In addition, we presented a procedure of forming imputation class and formed four imputation classes for each survey and then compared them with analysis. As a result, we presented two most efficient imputation classes for two surveys.

The Fundamental Study on the Behavior of Deck Slab Reinforced Basalt Fiber (Basalt 콘크리트 섬유보강 상판의 거동에 관한 기초적 연구)

  • Seo, Seung-Tag
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.14 no.1
    • /
    • pp.1-7
    • /
    • 2011
  • Basalt originates from volcanic magma and flood volcanoes, a very hot fluid or semifluid material under the earth's crust, solidified in the open air. Basalt is a common term used for a variety of volcanic rocks, which are gray, dark in colour, formed from the molten lava after solidification. Recently, attention has been devoted to continuous basalt fibers (CBF) whose primary advantage consists in their low cost, good resistance to acids and solvents, and good thermal stability. In order to investigate reinforcement effect, this paper did FEM analysis with shell element. The result were as follows; BCF deck plate did elastic behavior to 450 kN, reinforcement effect of basalt fiber (BF) was less. But BCF's perpendicular deflection occurred little about 23 mm comparing with RC deck plate in load 627 kN. Stiffness was very improved by basalt fiber reinforcement.