ATE531031T1 - Segmental tonal modeling for tonal languages

Segmental tonal modeling for tonal languages (German title: Segmentbasierte tonale Modellierung für tonale Sprachen)

Info

Publication number
ATE531031T1
Authority
AT
Austria
Prior art keywords
tonal
phones
segment
modeling
languages
Prior art date
Application number
AT05100347T
Other languages
English (en)
Inventor
Chao Huang
Min Chu
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2004-01-21
Filing date
2005-01-20
Publication date
2011-11-15
Application filed by Microsoft Corp
Application granted
Publication of ATE531031T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/06 Elementary speech units used in speech synthesisers; Concatenation rules
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/027 Syllables being the recognition units
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Telephonic Communication Services (AREA)
AT05100347T 2004-01-21 2005-01-20 Segmentbasierte tonale modellierung für tonale sprachen ATE531031T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/762,060 US7684987B2 (en) 2004-01-21 2004-01-21 Segmental tonal modeling for tonal languages

Publications (1)

Publication Number Publication Date
ATE531031T1 (de) 2011-11-15

Family

ID=34634585

Family Applications (1)

Application Number Title Priority Date Filing Date
AT05100347T ATE531031T1 (de) 2004-01-21 2005-01-20 Segmentbasierte tonale modellierung für tonale sprachen

Country Status (6)

Country Link
US (1) US7684987B2 (de)
EP (1) EP1557821B1 (de)
JP (1) JP5208352B2 (de)
KR (1) KR101169074B1 (de)
CN (1) CN1645478B (de)
AT (1) ATE531031T1 (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI258087B (en) * 2004-12-31 2006-07-11 Delta Electronics Inc Voice input method and system for portable device
US8249873B2 (en) * 2005-08-12 2012-08-21 Avaya Inc. Tonal correction of speech
US20070050188A1 (en) * 2005-08-26 2007-03-01 Avaya Technology Corp. Tone contour transformation of speech
CN101154379B (zh) 2006-09-27 2011-11-23 夏普株式会社 定位语音中的关键词的方法和设备以及语音识别系统
US20080120108A1 (en) * 2006-11-16 2008-05-22 Frank Kao-Ping Soong Multi-space distribution for pattern recognition based on mixed continuous and discrete observations
US20090048837A1 (en) * 2007-08-14 2009-02-19 Ling Ju Su Phonetic tone mark system and method thereof
US8244534B2 (en) 2007-08-20 2012-08-14 Microsoft Corporation HMM-based bilingual (Mandarin-English) TTS techniques
JP4962962B2 (ja) * 2007-09-11 2012-06-27 独立行政法人情報通信研究機構 音声認識装置、自動翻訳装置、音声認識方法、プログラム、及びデータ構造
US8583438B2 (en) * 2007-09-20 2013-11-12 Microsoft Corporation Unnatural prosody detection in speech synthesis
JP5178109B2 (ja) * 2007-09-25 2013-04-10 株式会社東芝 検索装置、方法及びプログラム
CN101383149B (zh) * 2008-10-27 2011-02-02 哈尔滨工业大学 弦乐音乐颤音自动检测方法
US8510103B2 (en) * 2009-10-15 2013-08-13 Paul Angott System and method for voice recognition
US9058751B2 (en) * 2011-11-21 2015-06-16 Age Of Learning, Inc. Language phoneme practice engine
US9824695B2 (en) * 2012-06-18 2017-11-21 International Business Machines Corporation Enhancing comprehension in voice communications
TW201403354A (zh) * 2012-07-03 2014-01-16 Univ Nat Taiwan Normal 以資料降維法及非線性算則建構中文文本可讀性數學模型之系統及其方法
CN103971677B (zh) * 2013-02-01 2015-08-12 腾讯科技(深圳)有限公司 一种声学语言模型训练方法和装置
US9396723B2 (en) 2013-02-01 2016-07-19 Tencent Technology (Shenzhen) Company Limited Method and device for acoustic language model training
CN103839546A (zh) * 2014-03-26 2014-06-04 合肥新涛信息科技有限公司 一种基于江淮语系的语音识别系统
CN103943109A (zh) * 2014-04-28 2014-07-23 深圳如果技术有限公司 一种将语音转换为文字的方法及装置
US10199034B2 (en) 2014-08-18 2019-02-05 At&T Intellectual Property I, L.P. System and method for unified normalization in text-to-speech and automatic speech recognition
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
CN110189744A (zh) * 2019-04-09 2019-08-30 阿里巴巴集团控股有限公司 文本处理的方法、装置和电子设备
US11393455B2 (en) * 2020-02-28 2022-07-19 Rovi Guides, Inc. Methods for natural language model training in natural language understanding (NLU) systems
US11574127B2 (en) * 2020-02-28 2023-02-07 Rovi Guides, Inc. Methods for natural language model training in natural language understanding (NLU) systems
US11392771B2 (en) * 2020-02-28 2022-07-19 Rovi Guides, Inc. Methods for natural language model training in natural language understanding (NLU) systems
US11626103B2 (en) 2020-02-28 2023-04-11 Rovi Guides, Inc. Methods for natural language model training in natural language understanding (NLU) systems

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5220639A (en) * 1989-12-01 1993-06-15 National Science Council Mandarin speech input method for Chinese computers and a mandarin speech recognition machine
JPH04238396A (ja) * 1991-01-23 1992-08-26 Matsushita Electric Ind Co Ltd 音声合成用音声持続期間処理装置
US5623609A (en) * 1993-06-14 1997-04-22 Hal Trust, L.L.C. Computer system and computer-implemented process for phonology-based automatic speech recognition
JP3234371B2 (ja) * 1993-11-12 2001-12-04 松下電器産業株式会社 音声合成用音声持続時間処理方法及びその装置
US5680510A (en) 1995-01-26 1997-10-21 Apple Computer, Inc. System and method for generating and using context dependent sub-syllable models to recognize a tonal language
US5751905A (en) * 1995-03-15 1998-05-12 International Business Machines Corporation Statistical acoustic processing method and apparatus for speech recognition using a toned phoneme system
US6038533A (en) * 1995-07-07 2000-03-14 Lucent Technologies Inc. System and method for selecting training text
US6006175A (en) * 1996-02-06 1999-12-21 The Regents Of The University Of California Methods and apparatus for non-acoustic speech characterization and recognition
JP2001166789A (ja) * 1999-12-10 2001-06-22 Matsushita Electric Ind Co Ltd 初頭/末尾の音素類似度ベクトルによる中国語の音声認識方法及びその装置
US6553342B1 (en) * 2000-02-02 2003-04-22 Motorola, Inc. Tone based speech recognition
US6510410B1 (en) * 2000-07-28 2003-01-21 International Business Machines Corporation Method and apparatus for recognizing tone languages using pitch information
JP2002229590A (ja) * 2001-02-01 2002-08-16 Atr Onsei Gengo Tsushin Kenkyusho:Kk 音声認識システム
JP2002268672A (ja) * 2001-03-13 2002-09-20 Atr Onsei Gengo Tsushin Kenkyusho:Kk 音声データベース用文セットの選択方法

Also Published As

Publication number Publication date
CN1645478A (zh) 2005-07-27
JP2005208652A (ja) 2005-08-04
EP1557821A2 (de) 2005-07-27
KR101169074B1 (ko) 2012-07-26
CN1645478B (zh) 2012-03-21
EP1557821A3 (de) 2008-04-02
US20050159954A1 (en) 2005-07-21
EP1557821B1 (de) 2011-10-26
JP5208352B2 (ja) 2013-06-12
KR20050076712A (ko) 2005-07-26
US7684987B2 (en) 2010-03-23

Similar Documents

Publication Publication Date Title
ATE531031T1 (de) Segmentbasierte tonale modellierung für tonale sprachen
CN106531185B (zh) 基于语音相似度的语音评测方法及系统
CN102231278B (zh) 实现语音识别中自动添加标点符号的方法及系统
Coulston et al. Amplitude convergence in children’s conversational speech with animated personas
ATE508453T1 (de) Generierung von grossen graphonem-einheiten mit kriterium gegenseitiger information für die sprachsynthese
DE602005009091D1 (de) Erzeugen einer Spracherkennungsgrammatik für alphanumerische Ausdrücke
Michaud et al. Tone and intonation: Introductory notes and practical recommendations
Prieto et al. Early intonational development in Catalan
Ido Bukharan Tajik
Cosi et al. Baldini: Baldi speaks Italian!
Morales et al. Speech-based human and service robot interaction: An application for Mexican dysarthric people
Madureira Intonation and variation: the multiplicity of forms and senses
Caron Tone and intonation
Sun et al. A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model
KR20200056835A (ko) 새로운 소리 분류방법에 따른 한국어 발음표기방법 및 이를 이용한 음성변환 및 음성인식 시스템
Qiang Paralanguage
Watson et al. Resources created for building New Zealand English voices
KR20090109501A (ko) 언어학습용 리듬훈련 시스템 및 방법
Maurice Your voice speaks volumes: It’s not what you say but how you say it (a review)
WO2006034152A3 (en) Discriminative training of document transcription system
Amin et al. Nine voices, one artist: Linguistic and acoustic analysis
Bairagi et al. Implementing Concatenative Text-To-Speech Synthesis System for Marathi Language using Python
Chlébowski et al. “Nasal grunts” in the NECTE corpus, Meaningful interactional sounds
Kaan et al. Applicability of the theory of phonology to the sound system of Tiv language
Hoffmann Speech, text and braille conversion technology

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties