• Title/Summary/Keyword: speech management

CSL Computerized Speech Lab - Model 4300B Software version 5.X

  • Ahn, Cheol-Min
    • Proceedings of the KSLP Conference / 1995.11a / pp.154-164 / 1995
  • CSL Model 4300B is a highly flexible audio processing package designed to provide a wide variety of speech analysis operations for both new and sophisticated users. Operations include 1) data acquisition, 2) file management, 3) graphics, 4) numerical display, 5) audio output, 6) signal editing, and 7) a variety of analysis functions. The external module includes 1) input control, 2) output control, and 3) jacks. The software includes 1) a wide range of speech display manipulation, 2) editing, and 3) analysis (omitted)

A review of speech perception: The first step for convergence on speech engineering (말소리지각에 대한 종설: 음성공학과의 융복합을 위한 첫 단계)

  • Lee, Young-lim
    • Journal of Digital Convergence / v.15 no.12 / pp.509-516 / 2017
  • People observe many events in their environment and have no difficulty perceiving them, speech included. As with the perception of biological motion, two main groups of theorists have debated speech perception. The purpose of this review article is to briefly describe speech perception and compare these two theories of speech perception. Motor theorists claim that speech perception is special to humans because we both produce and perceive articulatory events that are processed by innate neuromotor commands. However, direct perception theorists claim that speech perception is no different from nonspeech perception because we only need to detect information directly, as with all other kinds of events. It is important to grasp the fundamental idea of how humans perceive articulatory events for convergence with speech engineering. Thus, this basic review of speech perception is expected to be useful for AI, voice recognition technology, speech recognition systems, etc.

Implementation of Music Broadcasting Service System in the Shopping Center Using Text-To-Speech Technology (TTS를 이용한 매장 음악 방송 서비스 시스템 구현)

  • Chang, Moon-Soo;Kang, Sun-Mee
    • Speech Sciences / v.14 no.4 / pp.169-178 / 2007
  • This thesis describes the development of a service system for small shops that supports not only music broadcasting but also editing and generating voice announcements using TTS (Text-To-Speech) technology. The system has been developed for web environments so that it is easily accessible whenever and wherever it is needed. The system is able to control the sound using a Silverlight media player based on ASP.NET 2.0 technology without any additional application software. Use of the Ajax control allows multiple users to be accommodated at maximum load when needed. TTS is built into the server side so that the service can be provided without depending on the user's computer. Due to the convenience and usefulness of the system, the business sector can provide better service to many shops. Additional functions such as statistical analysis will undoubtedly help shop management provide desirable services.

Neurodegenerative Disease and Speech Rehabilitation (퇴행성질환과 말언어장애 재활)

  • Yoon, Ji Hye
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.28 no.2 / pp.79-83 / 2017
  • Neurodegenerative diseases such as Parkinson's disease and amyotrophic lateral sclerosis may impair the speech motor system. This review discusses the characteristics of dysarthria and symptom management for these conditions. Given the progressive nature of neurodegenerative diseases, speech-language pathologists must be aware of appropriate augmentative and alternative communication equipment at an early stage of the disease course. Patients with neurodegenerative diseases can maintain functional communication with augmentative and alternative communication supports.

The Effectiveness of a Prolonged-speech Treatment Program for School-age Children with Stuttering (학령기 말더듬 아동의 첫음연장기법을 이용한 치료프로그램 효과 연구)

  • Oh Seung Ah
    • Journal of Families and Better Life / v.22 no.6 s.72 / pp.143-152 / 2004
  • The purpose of this study was to examine the effectiveness of a prolonged-speech treatment program for school-age children who stutter. Two male subjects and one female subject participated in this study. The speech of the 3 subjects in treatment was assessed on stuttering frequency, stuttering pattern, and degree of stuttering severity. The program was drawn from the steps of Ryan's traditional therapy program and a prolonged-speech technique program, and then modified in accordance with the purpose of this study. The treatment program consisted of four stages. The results of this study were as follows: First, the 3 subjects could speak with greatly reduced stuttering frequency after treatment. Second, in stuttering pattern, all subjects changed from part-word repetition to prolongation. In addition, all subjects showed similar effects during maintenance.

Perceptual, Acoustical, and Physiological Tools in Ataxic Dysarthria Management: A Case Report (운동실조형 마비성구음장애에 적용되는 지각적, 음향학적, 생리학적 도구에 관하여 - 환자사례를 중심으로 -)

  • Kim Hyang Hui
    • Proceedings of the KSPS conference / 1996.02a / pp.9-22 / 1996
  • Among the various dysarthric subtypes, a diagnosis of ataxic dysarthria is rendered when the speech characteristics include imprecise and irregular articulatory breakdowns, a marked degree of speech rate impairment, overall monopitch and monoloudness, and respiratory-articulatory incoordination. Traditionally, speech pathologists have relied only upon their ‘ears’ to describe and evaluate dysarthric speech. A statement of the percentage of words correctly identified by a listener does not provide much more than an index of severity. Within the same perceptual dimension, a carefully constructed speech intelligibility test can specify patterns of errors. The patterns can have diagnostic value as well as provide strategies for remediation. Phonetically transcribed texts of single words and a standard passage, 'kail', produced by a patient with ataxic dysarthria are presented in this report, with an emphasis on articulatory error analysis. Furthermore, acoustic tools [e.g., spectrography to measure formant transitions, segment durations, consonant spectra, etc.] are utilized to serve as basic measures that objectively document patients' speech intelligibility. Finally, the treatment methods [e.g., spectrography as visual feedback, gestural reorganization using a pacing method, DAF (Delayed Auditory Feedback)] to modify the dysarthric behaviors are presented.

Speech Recognition Error Compensation using MFCC and LPC Feature Extraction Method (MFCC와 LPC 특징 추출 방법을 이용한 음성 인식 오류 보정)

  • Oh, Sang-Yeob
    • Journal of Digital Convergence / v.11 no.6 / pp.137-142 / 2013
  • When a speech recognition system receives inaccurate vocabulary input, feature extraction can produce an unrecognized result or the recognition of a similar phoneme. Therefore, in this paper, we propose a speech recognition error correction method using phoneme similarity rate and reliability measures based on the characteristics of the phonemes. The phoneme similarity rate was obtained from the phonemes of the learning model using the MFCC and LPC feature extraction methods and was measured together with a reliability rate. Unrecognized errors are minimized by measuring the similarity rate of phonemes and their reliability. Speech found to be erroneous during the recognition process was then subjected to error compensation. Applying the proposed system yielded a recognition rate of 98.3% and an error compensation rate of 95.5% in speech recognition.

The Effects of Self-Acceptance, Social Support and Internal Locus of Control on Speech Anxiety in Elementary School Students (초등학생의 자기수용, 사회적 지지, 내적통제성이 발표불안에 미치는 영향)

  • Kim, Yun-Jeon;Park, Boo-Jin
    • Journal of Families and Better Life / v.30 no.1 / pp.41-53 / 2012
  • The purpose of this study was to determine how elementary school students' self-acceptance, social support and internal locus of control affect their speech anxiety. A questionnaire survey was distributed to 570 fifth and sixth graders attending 4 elementary schools located in Seoul. A total of 534 surveys were completed and analyzed with SPSS WIN 12.0, including frequency test, t-test, Pearson's correlation analysis, simultaneous multiple regression and hierarchical multiple regression analysis. The findings of this study are summarized as follows. First, among self-acceptance, social support, internal locus of control and speech anxiety, gender affected speech anxiety. Second, speech anxiety was most affected by self-acceptance, followed by social support, internal locus of control and gender, in that order. Third, social support had moderating effects on the relationship between self-acceptance and speech anxiety.

HearCAM Embedded Platform Design (히어 캠 임베디드 플랫폼 설계)

  • Hong, Seon Hack;Cho, Kyung Soon
    • Journal of Korea Society of Digital Industry and Information Management / v.10 no.4 / pp.79-87 / 2014
  • In this paper, we implemented the HearCAM platform with the Raspberry Pi Model B+, which is an open source platform. The Raspberry Pi Model B+ consists of a dual step-down (buck) power supply with a polarity protection circuit and hot-swap protection, a Broadcom BCM2835 SoC running at 700MHz, 512MB of RAM soldered on top of the Broadcom chip, and a Pi camera serial connector. We used the Google speech recognition engine to recognize voice characteristics, implemented pattern matching with OpenCV software, and extended the speech capability with SVOX TTS (Text-to-Speech) so that the matching result is spoken to the user. We thereby implemented the HearCAM functions for identifying the voice and pattern characteristics of a target image scanned with the Pi camera while gathering temperature sensor data in an IoT environment. We implemented speech recognition, pattern matching, and temperature sensor data logging over Wi-Fi wireless communication. Finally, we designed and made the shape of the HearCAM ourselves with 3D printing technology.