• 제목/요약/키워드: speech management

검색결과 256건 처리시간 0.027초

CSL Computerized Speech Lab - Model 4300B Software version 5.X

  • Ahn, Cheol-Min
    • 대한음성언어의학회:학술대회논문집
    • /
    • 대한음성언어의학회 1995년도 제4회 학술대회 심포지움 및 워크샵
    • /
    • pp.154-164
    • /
    • 1995
  • CSL, Model 4300B is a highly flexible audio processing package designed to provide a wide variety of speech analysis operations for both new and sophisticated users. Operations include 1) Data acquisition 2) File management 3) Graphics 4) Numerical display 5) Audio output 6) Signal editing 7) A variety of analysis functions, External module include 1) Input control B) Output control 3) Jacks, Software include 1) Wide range of speech display manipulation 2) Editing 3) Analysis (omitted)

  • PDF

말소리지각에 대한 종설: 음성공학과의 융복합을 위한 첫 단계 (A review of speech perception: The first step for convergence on speech engineering)

  • 이영림
    • 디지털융복합연구
    • /
    • 제15권12호
    • /
    • pp.509-516
    • /
    • 2017
  • 사람들은 항상 사건들과 접하고 말소리 지각과 같은 사건을 지각하는데 별 어려움이 없다. 생물학적 운동의 지각과 마찬가지로, 말소리 지각에 대한 두 이론이 논쟁해 왔다. 이 논문의 목적은 말소리 지각에 대해 설명하고 말소리 지각에 대한 운동이론과 직접지각 이론을 비교하는 것이다. 운동이론학자들은 인간은 운동신경의 명령에 의해 말소리를 지각하고 생성해 내기 때문에 인간은 말소리 지각에 있어서 특별한 감각을 가지고 있다고 주장해 왔다. 하지만, 직접지각 이론학자들은 말소리 지각은 여느 다른 소리를 지각하는 것과 다르지 않다고 제안했다. 왜냐하면, 말소리를 지각하는 것은 다른 모든 사건을 지각하는 것과 마찬가지로 필요한 정보를 직접 탐지하면 되기 때문이다. 음성공학과의 융합에 있어서 이러한 인간의 기본적인 말소리 지각 능력을 먼저 이해하는 것이 중요하다. 따라서 이러한 말소리 지각에 대한 기본적인 이해는 인공 지능, 음성 인식 기술, 음성 인식 시스템 등에 사용될 수 있을 것으로 기대된다.

TTS를 이용한 매장 음악 방송 서비스 시스템 구현 (Implementation of Music Broadcasting Service System in the Shopping Center Using Text-To-Speech Technology)

  • 장문수;강선미
    • 음성과학
    • /
    • 제14권4호
    • /
    • pp.169-178
    • /
    • 2007
  • This thesis describes the development of a service system for small-sized shops which support not only music broadcasting, but editing and generating voice announcement using the TTS(Text-To-Speech) technology. The system has been developed based on web environments with an easy access whenever and wherever it is needed. The system is able to control the sound using silverlight media player based on the ASP .NET 2.0 technology without any additional application software. Use of the Ajax control allows for multiple users to get the maximum load when needed. TTS is built in the server side so that the service can be provided without user's computer. Due to convenience and usefulness of the system, the business sector can provide better service to many shops. Further additional functions such as statistical analysis will undoubtedly help shop management provide desirable services.

  • PDF

퇴행성질환과 말언어장애 재활 (Neurodegenerative Disease and Speech Rehabilitation)

  • 윤지혜
    • 대한후두음성언어의학회지
    • /
    • 제28권2호
    • /
    • pp.79-83
    • /
    • 2017
  • Neurodegenerative diseases such as Parkinson's disease and amyotrophic lateral sclerosis may induce impairment of speech motor system. This review discusses the characteristics of dysarthria and symptom management for these conditions. Given the progressive nature of the neurodegenerative diseases, speech-language pathologists must be aware of appropriate augmentative and alternative communication equipment at the early stage of the disease course. Patients with neurodegenerative diseases can maintain functional communication with augmentative and alternative communication supports.

  • PDF

학령기 말더듬 아동의 첫음연장기법을 이용한 치료프로그램 효과 연구 (The Effectiveness of a Prolonged-speech Treatment Program for School-age Children with Stuttering)

  • 오승아
    • 가정과삶의질연구
    • /
    • 제22권6호
    • /
    • pp.143-152
    • /
    • 2004
  • The purpose of this study was to know the effectiveness of prolonged-speech treatment program on school-age children with stuttering. Two male and One female subjects participated in this study. The speech of 3 subjects in the treatment was assessed on frequency of stuttering, stuttering Pattern, degree of severity in stuttering. This Program was taken from Ryan's the step of traditional therapy Program and prolonged-speech technique program. and then, modified in accordance with the purpose of this study. The treatment program were consisted of Four stages. The results of this study were as follows: First, 3 subjects can speak with greatly reduced stuttering frequency after treatment Second, in the stuttering pattern, all subjects were changed from part-word repetition in stuttering into a prolongation in stuttering. And also, all subjects showed similar effect in the maintenance.

운동실조형 마비성구음장애에 적용되는 지각적, 음향학적, 생리학적 도구에 관하여 - 환자사례를 중심으로 - (Perceptual, Acoustical, and Physiological Tools in Ataxic Dysarthria Management: A Case Report)

  • 김향희
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 1996년도 2월 학술대회지
    • /
    • pp.9-22
    • /
    • 1996
  • Among the various dysarthric subtypes, diagnosis of ataxic dysarthria is rendered when the speech characteristics include imprecise and irregular articulatory breakdowns, marked degree of speech rate impairment, overall monopitch and monoloudness, and respiratory-articulatory incoordination. Traditionally, speech pathologists have relied only upon their ‘ears’ to describe and evaluate the dysarthric speech. A statement of percentage of correct words identified by a listener do not provide so much more than an index of severity. Within the same perceptual dimension, a carefully constructed speech intelligibility test can specify patterns of errors. The patterns can contain a diagnostic value as well as Provide strategies for remediation. The phonetically transcribed texts on single words and a standard passage, 'kail' produced by an ataxic dysarthria are presented in this report, with an emphasis of the articulatory error analysis. Furthermore,, acoustic tools [e.g., spectrography to measure formant transitions, segment durations, consonant spectra, etc.] are utilized to serve as basic measures that objectively document patients' speech intelligibility, Finally, the treatment methods [e.g., spectrography as a visual feedback, gestural reorganization using pacing method, DAF (Delayed Auditory Feedback)] to modify the dysarthric behaviors are presented.

  • PDF

MFCC와 LPC 특징 추출 방법을 이용한 음성 인식 오류 보정 (Speech Recognition Error Compensation using MFCC and LPC Feature Extraction Method)

  • 오상엽
    • 디지털융복합연구
    • /
    • 제11권6호
    • /
    • pp.137-142
    • /
    • 2013
  • 음성 인식 시스템은 부정확한 음성 신호의 입력으로 특징을 추출하여 인식할 경우 오인식의 결과가 나타나거나 유사한 음소로 인식된다. 따라서 본 논문에서는 음소가 갖는 특징을 기반으로 음소 유사율과 신뢰도 측정을 이용한 음성 인식 오류 보정 방법을 제안하였다. 음소 유사율은 학습 모델의 음소에 MFCC와 LPC 특징 추출 방법을 이용하여 구하였으며 신뢰도로 측정하였다. 음소 유사율과 신뢰도를 측정하여 오인식되는 오류를 최소화하였으며 음성 인식 과정에서 오류로 판명된 음성에 대하여 오류 보정을 수행하였다. 본 논문에서 제안한 시스템을 적용한 결과 98.3%의 인식률과 95.5%의 오류 보정율을 나타내었다.

초등학생의 자기수용, 사회적 지지, 내적통제성이 발표불안에 미치는 영향 (The Effects of Self-Acceptance, Social Support and Internal Locus of Control on Speech Anxiety in Elementary School Students)

  • 김윤전;박부진
    • 가정과삶의질연구
    • /
    • 제30권1호
    • /
    • pp.41-53
    • /
    • 2012
  • The purpose of this study was to determine how elementary school students' self-acceptance, social support and internal locus of control affect their speech anxiety. A questionnaire survey was distributed to 570 fifth and sixth graders attending 4 elementary schools located in Seoul. A total of 534 surveys were completed and were analyzed with SPSS WIN 12.0 including frequency test, t-test, Pearson's correlations analysis, simultaneous multiple regression and hierarchical multiple regression analysis. The findings of this study are summarized as follows. First, among self-acceptance, social support, internal locus of control and speech anxiety, gender affected speech anxiety. Second, speech anxiety was most affected by self-acceptance, followed by social support, internal locus of control and gender in the order of mention. Third, social support had moderating effects on the relationship between self-acceptance and speech anxiety.

히어 캠 임베디드 플랫폼 설계 (HearCAM Embedded Platform Design)

  • 홍선학;조경순
    • 디지털산업정보학회논문지
    • /
    • 제10권4호
    • /
    • pp.79-87
    • /
    • 2014
  • In this paper, we implemented the HearCAM platform with Raspberry PI B+ model which is an open source platform. Raspberry PI B+ model consists of dual step-down (buck) power supply with polarity protection circuit and hot-swap protection, Broadcom SoC BCM2835 running at 700MHz, 512MB RAM solered on top of the Broadcom chip, and PI camera serial connector. In this paper, we used the Google speech recognition engine for recognizing the voice characteristics, and implemented the pattern matching with OpenCV software, and extended the functionality of speech ability with SVOX TTS(Text-to-speech) as the matching result talking to the microphone of users. And therefore we implemented the functions of the HearCAM for identifying the voice and pattern characteristics of target image scanning with PI camera with gathering the temperature sensor data under IoT environment. we implemented the speech recognition, pattern matching, and temperature sensor data logging with Wi-Fi wireless communication. And then we directly designed and made the shape of HearCAM with 3D printing technology.