• 제목/요약/키워드: multi-regression statistics

검색결과 113건 처리시간 0.025초

Fused inverse regression with multi-dimensional responses

  • Cho, Youyoung;Han, Hyoseon;Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제28권3호
    • /
    • pp.267-279
    • /
    • 2021
  • A regression with multi-dimensional responses is quite common nowadays in the so-called big data era. In such regression, to relieve the curse of dimension due to high-dimension of responses, the dimension reduction of predictors is essential in analysis. Sufficient dimension reduction provides effective tools for the reduction, but there are few sufficient dimension reduction methodologies for multivariate regression. To fill this gap, we newly propose two fused slice-based inverse regression methods. The proposed approaches are robust to the numbers of clusters or slices and improve the estimation results over existing methods by fusing many kernel matrices. Numerical studies are presented and are compared with existing methods. Real data analysis confirms practical usefulness of the proposed methods.

다반응 반응표면분석에서 특이값의 영향을 평가하기 위한 불꽃그림 (Firework plot for evaluating the impact of influential observations in multi-response surface methodology)

  • 김상익;장대흥
    • 응용통계연구
    • /
    • 제31권1호
    • /
    • pp.97-108
    • /
    • 2018
  • 회귀모형을 이용하여 자료를 분석하는 경우 이상점이나 영향점의 유무를 검정하는 회귀진단기법은 모형의 적합성을 체크하기 위한 필수적인 도구이다. 이러한 이상점이나 영향점이 존재하는 경우 회귀분석의 결과가 왜곡되어 해석이 된다. Jang과 Anderson-Cook (Quality and Reliability Engineering International, 30, 1409-1425, 2014)은 불꽃그림이란 이름을 붙인 그래픽 방법를 제시하였는데 관측값에 부여된 가중치를 1에서 0으로 변화함에 따라 이상점이나 영향점이 회귀계수 및 잔차제곱합에 어떠한 영향을 미치는지 살펴 보았다. 본 연구에서는 다반응 반응표면분석에서 이러한 불꽃그림을 적용하여 보고자 한다.

MP-Lasso chart: a multi-level polar chart for visualizing group Lasso analysis of genomic data

  • Min Song;Minhyuk Lee;Taesung Park;Mira Park
    • Genomics & Informatics
    • /
    • 제20권4호
    • /
    • pp.48.1-48.7
    • /
    • 2022
  • Penalized regression has been widely used in genome-wide association studies for joint analyses to find genetic associations. Among penalized regression models, the least absolute shrinkage and selection operator (Lasso) method effectively removes some coefficients from the model by shrinking them to zero. To handle group structures, such as genes and pathways, several modified Lasso penalties have been proposed, including group Lasso and sparse group Lasso. Group Lasso ensures sparsity at the level of pre-defined groups, eliminating unimportant groups. Sparse group Lasso performs group selection as in group Lasso, but also performs individual selection as in Lasso. While these sparse methods are useful in high-dimensional genetic studies, interpreting the results with many groups and coefficients is not straightforward. Lasso's results are often expressed as trace plots of regression coefficients. However, few studies have explored the systematic visualization of group information. In this study, we propose a multi-level polar Lasso (MP-Lasso) chart, which can effectively represent the results from group Lasso and sparse group Lasso analyses. An R package to draw MP-Lasso charts was developed. Through a real-world genetic data application, we demonstrated that our MP-Lasso chart package effectively visualizes the results of Lasso, group Lasso, and sparse group Lasso.

Multi-variate Fuzzy Polynomial Regression using Shape Preserving Operations

  • Hong, Dug-Hun;Do, Hae-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권1호
    • /
    • pp.131-141
    • /
    • 2003
  • In this paper, we prove that multi-variate fuzzy polynomials are universal approximators for multi-variate fuzzy functions which are the extension principle of continuous real-valued function under $T_W-based$ fuzzy arithmetic operations for a distance measure that Buckley et al.(1999) used. We also consider a class of fuzzy polynomial regression model. A mixed non-linear programming approach is used to derive the satisfying solution.

  • PDF

Variable Selection with Regression Trees

  • Chang, Young-Jae
    • 응용통계연구
    • /
    • 제23권2호
    • /
    • pp.357-366
    • /
    • 2010
  • Many tree algorithms have been developed for regression problems. Although they are regarded as good algorithms, most of them suffer from loss of prediction accuracy when there are many noise variables. To handle this problem, we propose the multi-step GUIDE, which is a regression tree algorithm with a variable selection process. The multi-step GUIDE performs better than some of the well-known algorithms such as Random Forest and MARS. The results based on simulation study shows that the multi-step GUIDE outperforms other algorithms in terms of variable selection and prediction accuracy. It generally selects the important variables correctly with relatively few noise variables and eventually gives good prediction accuracy.

Applications of response dimension reduction in large p-small n problems

  • Minjee Kim;Jae Keun Yoo
    • Communications for Statistical Applications and Methods
    • /
    • 제31권2호
    • /
    • pp.191-202
    • /
    • 2024
  • The goal of this paper is to show how multivariate regression analysis with high-dimensional responses is facilitated by the response dimension reduction. Multivariate regression, characterized by multi-dimensional response variables, is increasingly prevalent across diverse fields such as repeated measures, longitudinal studies, and functional data analysis. One of the key challenges in analyzing such data is managing the response dimensions, which can complicate the analysis due to an exponential increase in the number of parameters. Although response dimension reduction methods are developed, there is no practically useful illustration for various types of data such as so-called large p-small n data. This paper aims to fill this gap by showcasing how response dimension reduction can enhance the analysis of high-dimensional response data, thereby providing significant assistance to statistical practitioners and contributing to advancements in multiple scientific domains.

Scaling MDS for Preference Data Using Target Configuration

  • Hwang, S.Y.;Park, S.K.
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권2호
    • /
    • pp.237-245
    • /
    • 2003
  • MDS(multi-dimensional scaling) for preference data is a graphical tool which usually figures out how consumers recognize, evaluate certain products. This article is mainly concerned with an optimal scaling for MDS when target configuration is available. Rotation of axis and SUR(seemingly unrelated regression) methods are employed to get a new configuration which is obtained as close to the target as we can. Methodologies developed here are also illustrated via a real data set.

  • PDF

중간 사건이 결측되었거나 구간 중도절단된 준 경쟁 위험 자료에 대한 회귀모형 (Regression models for interval-censored semi-competing risks data with missing intermediate transition status)

  • 김진흠;김자연
    • 응용통계연구
    • /
    • 제29권7호
    • /
    • pp.1311-1327
    • /
    • 2016
  • 본 논문에서는 종말 사건에 대한 정보는 주어져 있지만 중간 사건이 구간 중도절단되었거나 연구 기간 도중에 추적이 끊겨 중간 사건의 발생 유무를 모르는 준 경쟁 위험 자료에 다중상태모형을 적용하여 모수를 추정하는 방법을 제안하였다. 이를 위해 상태 간 전이 강도는 정규 프레일티를 랜덤효과로 가진 Cox 비례위험모형을 따른다고 가정하였다. 다섯 가지 상태를 가진 다중상태모형에서 가능한 여섯 가지 경로별로 조건부 우도를 정의하였고 주변 우도를 구하기 위해 조정 가우스 구적법을 적용하였으며 뉴튼-랩슨 방법으로 최적 해를 구하였다. 모수의 95% 신뢰구간 포함률을 통해 제안한 방법의 소표본 성질을 살펴보기 위해 모의실험을 수행하였으며, Persones $Ag{\acute{e}}es$ Quid(PAQUID) 자료 (Helmer 등, 2001)에 제안한 모형을 적용하고 그 결과를 해석하였다.

지역별 응급의료 접근성이 환자의 예후 및 응급의료비 지출에 미치는 영향 (Impact of Regional Emergency Medical Access on Patients' Prognosis and Emergency Medical Expenditure)

  • 김연진;이태진
    • 보건행정학회지
    • /
    • 제30권3호
    • /
    • pp.399-408
    • /
    • 2020
  • Background: The purpose of this study was to examine the impact of the regional characteristics on the accessibility of emergency care and the impact of emergency medical accessibility on the patients' prognosis and the emergency medical expenditure. Methods: This study used the 13th beta version 1.6 annual data of Korea Health Panel and the statistics from the Korean Statistical Information Service. The sample included 8,119 patients who visited the emergency centers between year 2013 and 2017. The arrival time, which indicated medical access, was used as dependent variable for multi-level analysis. For ordinal logistic regression and multiple regression, the arrival time was used as independent variable while patients' prognosis and emergency medical expenditure were used as dependent variables. Results: The results for the multi-level analysis in both the individual and regional variables showed that as the number of emergency medical institutions per 100 km2 area increased, the time required to reach emergency centers significantly decreased. Ordinal logistic regression and multiple regression results showed that as the arrival time increased, the patients' prognosis significantly worsened and the emergency medical expenses significantly increased. Conclusion: In conclusion, the access to emergency care was affected by regional characteristics and affected patient outcomes and emergency medical expenditure.

한국아동의 일상적 스트레스 척도의 개발 (Development of Daily Hassles Scale for Children in Korea)

  • 한미현
    • 대한가정학회지
    • /
    • 제33권4호
    • /
    • pp.49-64
    • /
    • 1995
  • The purpose of this study was to develop the Daily Hassles Scale for children in Korea. The subject were 444 children of 184 fourth graders and 260 sixth graders selected form five elementary schools in Seoul(217 male and 227 female). A questionnaire consisting of 90-item daily hassles scale, demographic questions, and some additional questions was used as a methodological instrument. statistics used for data analysis were X2, cramer's V, factor analysis, multi-regression, Pearson's r, Cronbach's α. The major findings of this study were as follows. 1) 87 items of the 90-item scale were acceptible through item discriminant method. The discriminant coefficients of the items(Cramer's V) ranged form .28 to .73. 2) 6 factors(parents, home environment, friends, studies, teachers & school, the surroundings) were extracted from factor analysis. Multi-regression analysis conducted to reduce the length of scale have drawed 42 items for 'the Daily Hassles Scale for Children in Korea'. The correlation between this scale and the Quality of Life Scale(Olson & Barnes, 1982) was conducted to test the criterion-related validity, and the coefficient was significant(r=-.52, p<.001).3) Finally, reliability coefficients(Cronbach'α) of this scale was. 85.

  • PDF