• Title/Summary/Keyword: Stratified Sampling

Search Result 602, Processing Time 0.027 seconds

Asymptotic Comparison of Latin Hypercube Sampling and Its Stratified Version

  • Lee, Jooho
    • Journal of the Korean Statistical Society
    • /
    • v.28 no.2
    • /
    • pp.135-150
    • /
    • 1999
  • Latin hypercube sampling(LHS) introduced by McKay et al. (1979) is a widely used method for Monte Carlo integration. Stratified Latin hypercube sampling(SLHS) proposed by Choi and Lee(1993) improves LHS by combining it with stratified sampling. In this article it is shown that SLHS yields an asymptotically more accurate than both stratified sampling and LHS.

  • PDF

Unbiased Balanced Half-Sample Variance Estimation in Stratified Two-stage Sampling

  • Kim, Kyu-Seong
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.4
    • /
    • pp.459-469
    • /
    • 1998
  • Balanced half sample method is a simple variance estimation method for complex sampling designs. Since it is simple and flexible, it has been widely used in large scale sample surveys. However, the usual BHS method overestimate the true variance in without replacement sampling and two-stage cluster sampling. Focusing on this point , we proposed an unbiased BHS variance estimator in a stratified two-stage cluster sampling and then described an implementation method of the proposed estimator. Finally, partially BHS design is explained as a tool of reducing the number of replications of the proposed estimator.

  • PDF

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.2
    • /
    • pp.443-449
    • /
    • 2017
  • Estimation of population proportion like the distribution rate of LED TV and the prevalence of a disease are often estimated based on survey sample data. Population proportion is generally considered as a special form of population mean. In complex sampling like stratified multistage sampling with unequal probability sampling, the denominator of mean may be random variable and it is estimated like ratio estimator. In this research, we examined the estimation of distribution rate based on stratified multistage sampling, and determined some numerical outcomes using stratified random sample data with about 25% of missing observations. In the data used for this research, the survey weight was determined by deterministic way. So, the weights are not random variable, and the population distribution rate and its variance estimator can be estimated like population mean estimation. When the weights are not random variable, if one estimates the variance of proportion estimator using ratio method, then the variances may be inflated. Therefore, in estimating variance for population proportion, we need to examine the structure of data and survey design before making any decision for estimation methods.

Support Vector Machine based on Stratified Sampling

  • Jun, Sung-Hae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.9 no.2
    • /
    • pp.141-146
    • /
    • 2009
  • Support vector machine is a classification algorithm based on statistical learning theory. It has shown many results with good performances in the data mining fields. But there are some problems in the algorithm. One of the problems is its heavy computing cost. So we have been difficult to use the support vector machine in the dynamic and online systems. To overcome this problem we propose to use stratified sampling of statistical sampling theory. The usage of stratified sampling supports to reduce the size of training data. In our paper, though the size of data is small, the performance accuracy is maintained. We verify our improved performance by experimental results using data sets from UCI machine learning repository.

Efficient Use of Auxiliary Information through the Stratified Sampling and Systematic Sampling Design (층화추출과 계통추출을 이용한 효율적인 보조정보 사용)

  • Kim, Gwan-Su;Park, Min-Gue
    • Survey Research
    • /
    • v.10 no.1
    • /
    • pp.155-168
    • /
    • 2009
  • As an efficient sampling design, stratified random sampling is often used when auxiliary information is available at the designing stage. Although one - per - stratum design is an efficient design that can be used when many auxiliary variables are available, it does not provide any unbiased variance estimator. With a two - per - stratum sample in which two elements are selected from each stratum, it is possible to obtain an unbiased variance estimator. However the loss of efficiency could be significant if any important stratification variable is missed. In this study, we investigated a sampling design that uses the all given auxiliary information and also permits an unbiased variance estimator suggested by Park and Fuller(2008). Through a simulation study, we compared several stratified random sampling and systematic sampling design. We also applied the proposed stratified sampling designs to 2007 youth panel data.

  • PDF

A Study on economically optimal Determination of the Parameters of the Stratified Random Sampling (확률추출에 의한 층별 샘플링의 경제성에 관한 연구)

  • 황의철;이영식
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.13 no.21
    • /
    • pp.81-90
    • /
    • 1990
  • In stratified random sampling a simple random sample must be taken in each stratum to reduce the maximum gain in precision given the minimum cost. The purpose of this paper is to deal with the propertics of the estimates and variances and obtain the economic design of stratified random sampling through the optimum allocation of the sample sizes. In addition, the between stratum variation and the within stratum variation is stratifying the population are described.

  • PDF

A Stratified Unknown Repeated Trials in Randomized Response Sampling

  • Singh, Housila P.;Tarray, Tanveer Ahmad
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.751-759
    • /
    • 2012
  • This paper proposes an alternative stratified randomized response model based on the model of Singh and Joarder (1997). It is shown numerically that the proposed stratified randomized response model is more efficient than Hong et al. (1994) (under proportional allocation) and Kim and Warde (2004) (under optimum allocation).

A Post-stratified Estimation in Multivariate Stratified Sampling Surveys

  • Park, Jinwoo
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.755-760
    • /
    • 1999
  • In multivariate stratified sampling surveys it is general to use a few stratification variables which are highly correlated with the important variables at design stage. But there might be some secondary study variables which are not so highly correlated with those stratification variables. In that case it is not efficient to use the same type of estimator due to the secondary variables as the one base on the important variables. A post-stratified estimation is proposed to increase the efficiency of the estimator with existence of secondary variables. The proposed method is illustrated with a set of fishery household population survey data.

  • PDF

A Optimal Cluster Size in Stratified Two-Stage Cluster Sampling (층화 2-단 표본 추출시 최적 집락의 크기 결정)

  • 신민웅;신기일
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.207-224
    • /
    • 2000
  • Generally cluster size is predetermined when we use the stratified two-stage cluster sampling But in case that the sizes of clusters vary greatly one may want to make the sizes to be about equal. In this paper we study the optimal cluster size in stratified twostage cluster sampling. Also we find the optimal primary sampling unit sizes and optimal secondary sampling unit sizes under the given cost restriction.

  • PDF

A Stratified Multi-proportions Randomized Response Model (층화 다지 확률화응답모형)

  • Lee, Gi-Sung;Park, Kyung-Soon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1113-1120
    • /
    • 2015
  • We propose a multi-proportions randomized response model by stratified simple random sampling for surveys of sensitive issues of a polychotomous population composed of several stratum. We also systemize a theoretical validity to apply multi-proportions randomized response model (Abul-Ela et al.' model, Eriksson's model) to stratified simple random sampling and derive the estimate and its dispersion matrix of the proportion of sensitive characteristic of population using the suggested model. Two types of sample allocations (proportional allocation and optimum allocation) are considered under the fixed cost. In efficiency, the Eriksson's model by stratified sampling are compared to the Abul-Ela et al.' model.