Search | Korea Science

Designing a large recording script for open-domain English speech synthesis

Kim, Sunhee;Kim, Hojeong;Lee, Yooseop;Kim, Boryoung;Won, Yongkook;Kim, Bongwan
- Phonetics and Speech Sciences
- /
- v.13 no.3
- /
- pp.65-70
- /
- 2021
This paper proposes a method for designing a large recording script for open domain English speech synthesis. For read-aloud style text, 12 domains and 294 sub-domains were designed using text contained in five different news media publications. For conversational style text, 4 domains and 36 sub-domains were designed using movie subtitles. The final script consists of 43,013 sentences, 27,085 read-aloud style sentences, and 15,928 conversational style sentences, consisting of 549,683 tokens and 38,356 types. The completed script is analyzed using four criteria: word coverage (type coverage and token coverage), high-frequency vocabulary coverage, phonetic coverage (diphone coverage and triphone coverage), and readability. The type coverage of our script reaches 36.86% despite its low token coverage of 2.97%. The high-frequency vocabulary coverage of the script is 73.82%, and the diphone coverage and triphone coverage of the whole script is 86.70% and 38.92%, respectively. The average readability of whole sentences is 9.03. The results of analysis show that the proposed method is effective in producing a large recording script for English speech synthesis, demonstrating good coverage in terms of unique words, high-frequency vocabulary, phonetic units, and readability.
https://doi.org/10.13064/KSSS.2021.13.3.065 인용 PDF KSCI

Detecting Security Vulnerabilities in TypeScript Code with Static Taint Analysis (정적 오염 분석을 활용한 타입스크립트 코드의 보안 취약점 탐지)

Moon, Taegeun;Kim, Hyoungshick
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.31 no.2
- /
- pp.263-277
- /
- 2021
Taint analysis techniques are popularly used to detect web vulnerabilities originating from unverified user input data, such as Cross-Site Scripting (XSS) and SQL Injection, in web applications written in JavaScript. To detect such vulnerabilities, it would be necessary to trace variables affected by user-submitted inputs. However, because of the dynamic nature of JavaScript, it has been a challenging issue to identify those variables without running the web application code. Therefore, most existing taint analysis tools have been developed based on dynamic taint analysis, which requires the overhead of running the target application. In this paper, we propose a novel static taint analysis technique using symbol information obtained from the TypeScript (a superset of JavaScript) compiler to accurately track data flow and detect security vulnerabilities in TypeScript code. Our proposed technique allows developers to annotate variables that can contain unverified user input data, and uses the annotation information to trace variables and data affected by user input data. Since our proposed technique can seamlessly be incorporated into the TypeScript compiler, developers can find vulnerabilities during the development process, unlike existing analysis tools performed as a separate tool. To show the feasibility of the proposed method, we implemented a prototype and evaluated its performance with 8 web applications with known security vulnerabilities. We found that our prototype implementation could detect all known security vulnerabilities correctly.
https://doi.org/10.13089/JKIISC.2021.31.2.263 인용 PDF KSCI HTML

Detection of Malicious PDF based on Document Structure Features and Stream Objects

Kang, Ah Reum;Jeong, Young-Seob;Kim, Se Lyeong;Kim, Jonghyun;Woo, Jiyoung;Choi, Sunoh
- Journal of the Korea Society of Computer and Information
- /
- v.23 no.11
- /
- pp.85-93
- /
- 2018
In recent years, there has been an increasing number of ways to distribute document-based malicious code using vulnerabilities in document files. Because document type malware is not an executable file itself, it is easy to bypass existing security programs, so research on a model to detect it is necessary. In this study, we extract main features from the document structure and the JavaScript contained in the stream object In addition, when JavaScript is inserted, keywords with high occurrence frequency in malicious code such as function name, reserved word and the readable string in the script are extracted. Then, we generate a machine learning model that can distinguish between normal and malicious. In order to make it difficult to bypass, we try to achieve good performance in a black box type algorithm. For an experiment, a large amount of documents compared to previous studies is analyzed. Experimental results show 98.9% detection rate from three different type algorithms. SVM, which is a black box type algorithm and makes obfuscation difficult, shows much higher performance than in previous studies.
https://doi.org/10.9708/jksci.2018.23.11.085 인용 PDF KSCI HTML

Development of Collaborative Script Analysis Platform Based on Web for Information Retrieval Related to Story (스토리 정보의 검색을 위한 웹 기반의 협업적 스크립트 분석 플랫폼 개발)

Park, Seung-Bo;Kim, Hyun-Sik;Baek, Yeong-Tae;You, Eun-Soon
- Journal of the Korea Society of Computer and Information
- /
- v.19 no.9
- /
- pp.93-101
- /
- 2014
Movie stories can be retrieved efficiently by analyzing a script, which is a blueprint of the movie. Although the movie script is described in the formatted structure of Final Draft, it is hard to restore the type without analyzing the story of the sentences since the scripts open on the website are mostly broken. For this purpose, it is necessary to develop and provide the web-based script analysis software so that users collaboratively and freely check and correct the errors in the results after automatically parsing the script. Hence, in this paper we suggest the structure of the web-based collaborative script analysis platform that enables users to modify and filter the type error of the script for high level of film data accumulation and performance evaluation for the implementation results is conducted. Through the experiment, accuracy of automatically parsing appears to be 64.95% and performance of modification by collaboration showed 99.58% of accuracy of parsing with errors mostly corrected after passing through 5 steps of modification.
https://doi.org/10.9708/jksci.2014.19.9.093 인용 PDF KSCI

Knowledge Representation Characteristics of Categories and Scripts: An Investigation on Hierarchy and Typicality Effects (개념지식의 유형에 따른 표상차이: 범주와 각본의 위계성과 전형성 비교1))

이재호;이정모
- Korean Journal of Cognitive Science
- /
- v.11 no.3_4
- /
- pp.73-81
- /
- 2000
This study was conducted to investigate some characteristics of representation of category knowledge and script knowledge. Using primed lexical decision task with higher level primers in the representation structure, Experiment 1 examined the interaction effects between knowledge type and concept typicality. It was found that the concept typicality has some effects in category representation, while it has no significant effect in script representation. In Experiment 2, primers of the lower hierarchy in the representation structure were employed. The results showed that the main effect of knowledge type was significant: the response time for category knowledge was faster than that for script knowledge. Typicality effect did not show in this experiment. The results of t the two experiments suggest that category knowledge is represented in hierarchy and typicality. while script knowledge may lack in that characteristics. Other aspects of the differences in characteristics of category- and script- knowledge representation were discussed,
PDF

The Functional Extension of the Underwater Vehicle Modeling and Simulation Tactics Manager using the Script Embedding Method (스크립트 임베딩을 활용한 수중운동체 M&S 전술처리기의 기능 확장)

Son, Myeong-Jo;Kim, Tae-Wan;Nah, Young-In
- Journal of the Korea Institute of Military Science and Technology
- /
- v.12 no.5
- /
- pp.590-600
- /
- 2009
In the simulation of underwater vehicles such as a submarine or a torpedo, various type of simulations like an engineering level simulation for predicting the performance precisely and an engagement level simulation for examining the effectiveness of a certain tactic is required. For this reason, a tactics manager which can change the behavior of a simulation model according to external tactics is needed. In this study the tactics manager supporting a script language and engine which can represent various tactics and can help users define external input tactics for the tactic manager easily is suggested. Python and Lua which are representative among script languages have been compared and analyzed from the viewpoint of a tactic manage, and the tactic manger using the script engines of those script languages was implemented. To demonstrate the effectiveness of the tactic manager, a target motion analysis simulation of the warfare between a submarine and a surface ship.
PDF KSCI

Study of Cursive Calligraphy of wu zhen(吳鎮)'s Ink bambooo Collection

Deng, Zhuoren;Lee, Jaewoo
- International Journal of Advanced Culture Technology
- /
- v.10 no.2
- /
- pp.69-78
- /
- 2022
The purpose of this paper is to summarize the cursive script of traditional calligraphy and develop further possibilities based on the study of the painting and postscript of Ink bambooo, which was painted by wu zhen(吳鎮) during the Yuan Dynasty. The second section in this paper provides a summary of wu zhen(吳鎮)'s life, in addition to "Ink bambooo" and its painting postscript. The third and fourth sections are focused on analyzing the cursive script in the painting postscript of Ink bambooo, including the left-and-right structure, head prefix symbols, and bottom prefix symbols. The aim of this paper is the study of cursive script, and the theories and methods of the characters proposed by Dr. Cai Yonggui (from Fujian Normal University) and Dr. Liu Dongqin (from Southeast University) will be used to provide a summary. The presentation of the research results of this paper is designed to develop further possibilities for this type of traditional calligraphy.
https://doi.org/10.17703/IJACT.2022.10.2.69 인용 PDF KSCI

The Role of Script Type in Janpanese Word Recognition:A Connectionist Model (일본어의 단어인지과정에서 표기형태의 역할:연결주의 모형)

;阿部純
- Korean Journal of Cognitive Science
- /
- v.2 no.2
- /
- pp.487-513
- /
- 1990
The present paper reviews experimental finding such as kanji stroop effect, kana superiority effect in naming task, kanji superiority effect in lexical devision task, and the different pattern of facilitatory priming effect in repetition priming task. Most of the experimental findings indicate that kana script and kanji script are processed independently and modularly. These indications are also consistent with the basic observations on Japanese dyslexics. A connectionist model named JIA(Japanese Interactive Activation)is proposed which is a revision of interactive activation model proposed by McClelland & Rumelhart(1981). The differences between the two models are as follows. Firstly, JIA has a kana module and kanji module at letter level. Secondly, JIA adopts script-specific interconnections between letter-level nodes and word-level nodes:word nodes receive larger activation from the script consistent letter-level nodes. JIA successfully explains all the experimental findings and many cases of Japanese dyslexia. A computer program which simulates JIA model was written and run.

The implementation of Korean adult's optimal formant setting by Praat scripting (성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여)

Park, Jiyeon;Seong, Cheoljae
- Phonetics and Speech Sciences
- /
- v.11 no.4
- /
- pp.97-108
- /
- 2019
An automated Praat script was implemented to measure optimal formant frequencies for adults. Optimal formant analysis could be interpreted to show that the deviation of formant frequency that resulted from the two variously combined setting parameters (maximum formant and number of formants) was minimal. To increase the reliability of formant analysis, LPC order should be set differently, based on the gender or vowel type. Praat recommends 5,000 Hz and 5,500 Hz as maximum formant settings and, at the same time, recommends 5 as the number of formants for males and females. However, verification is needed to determine whether these recommended settings are valid for Korean vowels. Statistical analysis showed that formant frequencies significantly varied across the adapted scripts, especially with respect to the data on females. Formant plots and statistical results showed that linear_script and qtone_script are much more reliable in formant measurements. Among four kinds of scripts, the linear and qtone_scripts proved to be more stable and reliable. While the linear_script was designed to have a linearly increased formant step in for-loop, the increment of formant step in the qtone_script was arranged by quarter tone scale (base frequency×common ratio ($\sqrt[24]{2}$)). When looking at the tendency of the formant setting drawn by the two referred algorithms in the context of front vowel [i, e], the maximum formant was set higher; and the number of formants set at a lower value than recommended by Praat. The back vowel [o, u], on the contrary, has a lower maximum formant and a higher number of formants than the standard setting.
https://doi.org/10.13064/KSSS.2019.11.4.097 인용 PDF KSCI

JavaScript-to-c++ Type Inferencing Transcompiler Using Cartesian Product Algorithm (Cartesian Product Algorithm을 사용한 JavaScript-to-C++ 타입 추론 컴파일러)

Kim, Jaeju;Han, Hwansoo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2015.10a
- /
- pp.910-913
- /
- 2015
자바스크립트는 웹 페이지를 제어하기 위한 표준적인 스크립트 언어로 오랫동안 사용되어 왔다. 최근 웹 앱이나 서버사이드 응용 프로그램을 자바스크립트로 작성하게 되면서, 자바스크립트 프로그램을 더욱 빠르게 동작하도록 만드는 것이 중요한 이슈가 되었다. 본 논문에서는 암시적인 동적 타입 시스템을 사용하는 자바스크립트 언어에 Cartesian Product Algorithm을 적용하여 타입을 추론하고, 이 정보를 바탕으로 정적 타입 시스템인 C++ 코드로 변환하는 컴파일러의 구조와 알고리즘을 제시한다.
https://doi.org/10.3745/PKIPS.y2015m10a.910 인용 PDF

Search Result 70, Processing Time 0.04 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)