• Title/Summary/Keyword: positive skewness

Search Result 44, Processing Time 0.027 seconds

Retrieving Minority Product Reviews Using Positive/Negative Skewness (긍정/부정 비대칭도를 이용한 소수상품평의 검색)

  • Cho, Heeryon;Lee, Jong-Seok
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.3
    • /
    • pp.121-128
    • /
    • 2015
  • A given product's online product reviews build up to form largely positive or negative reviews or mixed reviews that include both the positive and negative reviews. While the homogeneously positive or negative reviews help readers identify the generally praised or criticized product, the mixed reviews with minority opinions potentially contain valuable information about the product. We present a method of retrieving minority opinions from the online product reviews using the skewness of positive/negative reviews. The proposed method first classifies the positive/negative product reviews using a sentiment dictionary and then calculates the skewness of the classified results to identify minority reviews. Minority review retrieval experiments were conducted on smartphone and movie reviews, and the F1-measures were 24.6% (smartphone) and 15.9% (movie) and the accuracies were 56.8% and 46.8% when the individual reviews' sentiment classification accuracies were 85.3% and 78.8%. The theoretical performance of minority review retrieval is also discussed.

Bivariate skewness, kurtosis and surface plot (이변량 왜도, 첨도 그리고 표면그림)

  • Hong, Chong Sun;Sung, Jae Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.959-970
    • /
    • 2017
  • In this study, we propose bivariate skewness and kurtosis statistics and suggest a surface plot that can visually implement bivariate data containing the correlation coefficient. The skewness statistic is expressed in the form of a paired real values because this represents the skewed directions and degrees of the bivariate random sample. The kurtosis has a positive value which can determine how thick the tail part of the data is compared to the bivariate normal distribution. Moreover, the surface plot implements bivariate data based on the quantile vectors. Skewness and kurtosis are obtained and surface plots are explored for various types of bivariate data. With these results, it has been found that the values of the skewness and kurtosis reflect the characteristics of the bivariate data implemented by the surface plots. Therefore, the skewness, kurtosis and surface plot proposed in this paper could be used as one of valuable descriptive statistical methods for analyzing bivariate distributions.

Defect Diagnosis of Cable Insulating Materials by Partial Discharge Statistical Analysis

  • Shin, Jong-Yeol;Park, Hee-Doo;Lee, Jong-Yong;Hong, Jin-Woong
    • Transactions on Electrical and Electronic Materials
    • /
    • v.11 no.1
    • /
    • pp.42-47
    • /
    • 2010
  • Polymer insulating materials such as cross linked polyethylene (XLPE) are employed in electric cables used for extra high voltage. These materials can degrade due to chemical, mechanical and electric stress, possibly caused by voids, the presence of extrinsic materials and protrusions. Therefore, this study measured discharge patterns, discharge phase angle, quantity and occurrence frequency as well as changes in XLPE under different temperatures and applied voltages. To quantitatively analyze the irregular partial discharge patterns measured, the discharge patterns were examined using a statistical program. A three layer sample was fabricated, wherein the upper and lower layers were composed of non-void XLPE, while the middle layer was composed of an air void and copper particles. After heating to room temperature and $50^{\circ}C$ and $80^{\circ}C$ in silicone oil, partial discharge characteristics were studied by increasing the voltage from the inception voltage to the breakdown voltage. Partial discharge statistical analysis showed that when the K-means clustering was carried out at 9 kV to determine the void discharge characteristics, the amount discharged at low temperatures was small but when the temperature was increased to $80^{\circ}C$, the discharge amount increased to be 5.7 times more than that at room temperature because electric charge injection became easier. An analysis of the kurtosis and the skewness confirmed that positive and negative polarity had counterclockwise and clockwise clustering distribution, respectively. When 5 kV was applied to copper particles, the K-means was conducted as the temperature changed from $50^{\circ}C$ to $80^{\circ}C$. The amount of charge at a positive polarity increased 20.3% and the amount of charge at a negative polarity increased 54.9%. The clustering distribution of a positive polarity and negative polarity showed a straight line in the kurtosis and skewness analyses.

Estimation of Predictive Value of a Positive Test from a Screening Test

  • Shin, Hyun Chul;Park, Sang Gue;Kim, Yong Hee
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.567-574
    • /
    • 2003
  • The estimation problem of predictive value of a positive test(PVP), which is assessing the accuracy of a screening test is considered. Score methods discussed by Gart and Nam(1988) are proposed for constructing confidence interval for PVP. The simulation studies are conducted in evaluating the proposed methods and existing approximate ones.

Property Analyses of Deposits and Landform in Tidal Flat using Satellite Image

  • Jo, Myung-Hee;Sugimori, Yasuhiro;Jo, Wha-Ryong
    • Proceedings of the KSRS Conference
    • /
    • 1998.09a
    • /
    • pp.110-115
    • /
    • 1998
  • Through the ISODATA method, the micro-landform of Julpo-Bay tidal flat was classified into mudflat, mixedflat, and sandflat using Landsat TM image. Each showed an apparent differences in its topographical characteristics and grain size composition. For example, mudflats are formed with flat faces and tidal channel of dissected gully. Its characteristics of grain size analysis that the grains have less than mean grain size 4 phi. Its sorting is bad (higher than 1 S.D.), and it showed strongly positive skewness. But sandflat is topographically flat without tidal channel. It has developed with ripple marks. According to the grain size analysis of deposits, the soil is coarse size with 90% of sand and its sorting is well(lower than 1 S.D.) Also, it showed strongly negative skewness. Mixed flat is in between mudflat and sandflat in its characteristics.

  • PDF

A Study on Exploring the Academic Dropout of College Students(Centering Around D College)

  • Lee, Jae-Do
    • 한국정보컨버전스학회:학술대회논문집
    • /
    • 2008.06a
    • /
    • pp.89-92
    • /
    • 2008
  • This study analyzed the status and causes for the dropouts of college based on the survey conducted among 14,210 freshmen attending D College, other than the supernumerary special selection, from 2001 through 2005. A significant difference was shown in all items of general characteristics. The dropout rate of women, generally selected and general high school graduated were higher than for men, specially selected and special high school graduated, respectively. The most dropouts were due to Not Return(40.16%), followed by Unenrolled(32.98%), Voluntary Leave(26.05%) and Expelled(0.81%) in order. In the distribution of the central tendency values measured from the entire subjects. the high school records and the days of absence showed a positive skewness. while the college records showed a negative skewness with the data mostly around a higher grade. The standard deviation indicating that the dropouts got the scores higher than those of the continuing students demonstrated that there was relatively insignificant difference in scores between two groups.

  • PDF

A Quantative Analysis of activation pattern of Elbow Flexor muscles during contraction (근육 수축시 주관절 굴근의 활성화 유형에 대한 정량적 분석)

  • Lee, D.H.;Lee, Y.S.;Kim, S.H.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1996 no.05
    • /
    • pp.6-9
    • /
    • 1996
  • In this paper, we attempted to analyze the contraction patterns of elbow flexor muscle during isometric, concentric and eccentric contraction. The analysis parameters are consisted of Sequency domain parameters (mean frequency, median frequency, skewness, kurtosis) and time domain parameters (zero crossing, positive maxima, integrated EMG). As a results, the analysis parameters have specific trends for muscles, muscle contraction patterns, muscle contraction angles. Especially, at the time domain analysis, IEMG is a dominant parameter for analysis of activation patterns, and the skewness, kurtosis are useful parameters for functional recognition.

  • PDF

Distribution fitting for the rate of return and value at risk (수익률 분포의 적합과 리스크값 추정)

  • Hong, Chong-Sun;Kwon, Tae-Wan
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.2
    • /
    • pp.219-229
    • /
    • 2010
  • There have been many researches on the risk management due to rapid increase of various risk factors for financial assets. Aa a method for comprehensive risk management, Value at Risk (VaR) is developed. For estimation of VaR, it is important task to solve the problem of asymmetric distribution of the return rate with heavy tail. Most real distributions of the return rate have high positive kurtosis and low negative skewness. In this paper, some alternative distributions are used to be fitted to real distributions of the return rate of financial asset. And estimates of VaR obtained by using these fitting distributions are compared with those obtained from real distribution. It is found that normal mixture distribution is the most fitted where its skewness and kurtosis of practical distribution are close to real ones, and the VaR estimation using normal mixture distribution is more accurate than any others using other distributions including normal distribution.

An Analysis of the Effects of WTI on Korean Stock Market Using HAR Model (국내 주식시장 변동성에 대한 국제유가의 영향: 이질적 자기회귀(HAR) 모형을 사용하여)

  • Kim, Hyung-Gun
    • Environmental and Resource Economics Review
    • /
    • v.30 no.4
    • /
    • pp.535-555
    • /
    • 2021
  • This study empirically analyzes the effects of international oil prices on domestic stock market volatility. The data used for the analysis are 10-minute high-frequency data of the KOSPI index and WTI futures price from January 2, 2015, to July 30, 2021. For using the high-frequency data, a heterogeneous autoregression (HAR) model is employed. The analysis model utilizes the advantages of high frequency data to observe the impact of international oil prices through realized volatility, realized skewness, and kurtosis as well as oil price return. In the estimation, the Box-Cox transformation is applied in consideration of the distribution of realized volatility with high skewness. As a result, it finds that the daily return fluctuation of the WTI price has a statistically significant positive (+) effect on the volatility of the KOSPI return. However, the volatility, skewness, and kurtosis of the WTI return do not appear to affect the volatility of the KOSPI return. This result is believed to be because the volatility of the KOSPI return reflects the daily change in the WTI return, but does not reflect the intraday trading behavior of investors.