ATE531031T1 - Segmentbasierte tonale modellierung für tonale sprachen - Google Patents
Segmentbasierte tonale modellierung für tonale sprachenInfo
- Publication number
- ATE531031T1 ATE531031T1 AT05100347T AT05100347T ATE531031T1 AT E531031 T1 ATE531031 T1 AT E531031T1 AT 05100347 T AT05100347 T AT 05100347T AT 05100347 T AT05100347 T AT 05100347T AT E531031 T1 ATE531031 T1 AT E531031T1
- Authority
- AT
- Austria
- Prior art keywords
- tonal
- phones
- segment
- modeling
- languages
- Prior art date
Links
- 238000006243 chemical reaction Methods 0.000 abstract 1
- 230000001419 dependent effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/027—Syllables being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/762,060 US7684987B2 (en) | 2004-01-21 | 2004-01-21 | Segmental tonal modeling for tonal languages |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE531031T1 true ATE531031T1 (de) | 2011-11-15 |
Family
ID=34634585
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT05100347T ATE531031T1 (de) | 2004-01-21 | 2005-01-20 | Segmentbasierte tonale modellierung für tonale sprachen |
Country Status (6)
Country | Link |
---|---|
US (1) | US7684987B2 (de) |
EP (1) | EP1557821B1 (de) |
JP (1) | JP5208352B2 (de) |
KR (1) | KR101169074B1 (de) |
CN (1) | CN1645478B (de) |
AT (1) | ATE531031T1 (de) |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI258087B (en) * | 2004-12-31 | 2006-07-11 | Delta Electronics Inc | Voice input method and system for portable device |
US8249873B2 (en) * | 2005-08-12 | 2012-08-21 | Avaya Inc. | Tonal correction of speech |
US20070050188A1 (en) * | 2005-08-26 | 2007-03-01 | Avaya Technology Corp. | Tone contour transformation of speech |
CN101154379B (zh) | 2006-09-27 | 2011-11-23 | 夏普株式会社 | 定位语音中的关键词的方法和设备以及语音识别*** |
US20080120108A1 (en) * | 2006-11-16 | 2008-05-22 | Frank Kao-Ping Soong | Multi-space distribution for pattern recognition based on mixed continuous and discrete observations |
US20090048837A1 (en) * | 2007-08-14 | 2009-02-19 | Ling Ju Su | Phonetic tone mark system and method thereof |
US8244534B2 (en) | 2007-08-20 | 2012-08-14 | Microsoft Corporation | HMM-based bilingual (Mandarin-English) TTS techniques |
JP4962962B2 (ja) * | 2007-09-11 | 2012-06-27 | 独立行政法人情報通信研究機構 | 音声認識装置、自動翻訳装置、音声認識方法、プログラム、及びデータ構造 |
US8583438B2 (en) * | 2007-09-20 | 2013-11-12 | Microsoft Corporation | Unnatural prosody detection in speech synthesis |
JP5178109B2 (ja) * | 2007-09-25 | 2013-04-10 | 株式会社東芝 | 検索装置、方法及びプログラム |
CN101383149B (zh) * | 2008-10-27 | 2011-02-02 | 哈尔滨工业大学 | 弦乐音乐颤音自动检测方法 |
US8510103B2 (en) * | 2009-10-15 | 2013-08-13 | Paul Angott | System and method for voice recognition |
US9058751B2 (en) * | 2011-11-21 | 2015-06-16 | Age Of Learning, Inc. | Language phoneme practice engine |
US9824695B2 (en) * | 2012-06-18 | 2017-11-21 | International Business Machines Corporation | Enhancing comprehension in voice communications |
TW201403354A (zh) * | 2012-07-03 | 2014-01-16 | Univ Nat Taiwan Normal | 以資料降維法及非線性算則建構中文文本可讀性數學模型之系統及其方法 |
CN103971677B (zh) * | 2013-02-01 | 2015-08-12 | 腾讯科技(深圳)有限公司 | 一种声学语言模型训练方法和装置 |
US9396723B2 (en) | 2013-02-01 | 2016-07-19 | Tencent Technology (Shenzhen) Company Limited | Method and device for acoustic language model training |
CN103839546A (zh) * | 2014-03-26 | 2014-06-04 | 合肥新涛信息科技有限公司 | 一种基于江淮语系的语音识别*** |
CN103943109A (zh) * | 2014-04-28 | 2014-07-23 | 深圳如果技术有限公司 | 一种将语音转换为文字的方法及装置 |
US10199034B2 (en) | 2014-08-18 | 2019-02-05 | At&T Intellectual Property I, L.P. | System and method for unified normalization in text-to-speech and automatic speech recognition |
US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
CN110189744A (zh) * | 2019-04-09 | 2019-08-30 | 阿里巴巴集团控股有限公司 | 文本处理的方法、装置和电子设备 |
US11393455B2 (en) * | 2020-02-28 | 2022-07-19 | Rovi Guides, Inc. | Methods for natural language model training in natural language understanding (NLU) systems |
US11574127B2 (en) * | 2020-02-28 | 2023-02-07 | Rovi Guides, Inc. | Methods for natural language model training in natural language understanding (NLU) systems |
US11392771B2 (en) * | 2020-02-28 | 2022-07-19 | Rovi Guides, Inc. | Methods for natural language model training in natural language understanding (NLU) systems |
US11626103B2 (en) | 2020-02-28 | 2023-04-11 | Rovi Guides, Inc. | Methods for natural language model training in natural language understanding (NLU) systems |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5220639A (en) * | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
JPH04238396A (ja) * | 1991-01-23 | 1992-08-26 | Matsushita Electric Ind Co Ltd | 音声合成用音声持続期間処理装置 |
US5623609A (en) * | 1993-06-14 | 1997-04-22 | Hal Trust, L.L.C. | Computer system and computer-implemented process for phonology-based automatic speech recognition |
JP3234371B2 (ja) * | 1993-11-12 | 2001-12-04 | 松下電器産業株式会社 | 音声合成用音声持続時間処理方法及びその装置 |
US5680510A (en) | 1995-01-26 | 1997-10-21 | Apple Computer, Inc. | System and method for generating and using context dependent sub-syllable models to recognize a tonal language |
US5751905A (en) * | 1995-03-15 | 1998-05-12 | International Business Machines Corporation | Statistical acoustic processing method and apparatus for speech recognition using a toned phoneme system |
US6038533A (en) * | 1995-07-07 | 2000-03-14 | Lucent Technologies Inc. | System and method for selecting training text |
US6006175A (en) * | 1996-02-06 | 1999-12-21 | The Regents Of The University Of California | Methods and apparatus for non-acoustic speech characterization and recognition |
JP2001166789A (ja) * | 1999-12-10 | 2001-06-22 | Matsushita Electric Ind Co Ltd | 初頭/末尾の音素類似度ベクトルによる中国語の音声認識方法及びその装置 |
US6553342B1 (en) * | 2000-02-02 | 2003-04-22 | Motorola, Inc. | Tone based speech recognition |
US6510410B1 (en) * | 2000-07-28 | 2003-01-21 | International Business Machines Corporation | Method and apparatus for recognizing tone languages using pitch information |
JP2002229590A (ja) * | 2001-02-01 | 2002-08-16 | Atr Onsei Gengo Tsushin Kenkyusho:Kk | 音声認識システム |
JP2002268672A (ja) * | 2001-03-13 | 2002-09-20 | Atr Onsei Gengo Tsushin Kenkyusho:Kk | 音声データベース用文セットの選択方法 |
-
2004
- 2004-01-21 US US10/762,060 patent/US7684987B2/en not_active Expired - Fee Related
-
2005
- 2005-01-20 EP EP05100347A patent/EP1557821B1/de active Active
- 2005-01-20 AT AT05100347T patent/ATE531031T1/de not_active IP Right Cessation
- 2005-01-20 JP JP2005013319A patent/JP5208352B2/ja active Active
- 2005-01-21 KR KR1020050005828A patent/KR101169074B1/ko active IP Right Grant
- 2005-01-21 CN CN2005100094029A patent/CN1645478B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1645478A (zh) | 2005-07-27 |
JP2005208652A (ja) | 2005-08-04 |
EP1557821A2 (de) | 2005-07-27 |
KR101169074B1 (ko) | 2012-07-26 |
CN1645478B (zh) | 2012-03-21 |
EP1557821A3 (de) | 2008-04-02 |
US20050159954A1 (en) | 2005-07-21 |
EP1557821B1 (de) | 2011-10-26 |
JP5208352B2 (ja) | 2013-06-12 |
KR20050076712A (ko) | 2005-07-26 |
US7684987B2 (en) | 2010-03-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE531031T1 (de) | Segmentbasierte tonale modellierung für tonale sprachen | |
CN106531185B (zh) | 基于语音相似度的语音评测方法及*** | |
CN102231278B (zh) | 实现语音识别中自动添加标点符号的方法及*** | |
Coulston et al. | Amplitude convergence in children’s conversational speech with animated personas | |
ATE508453T1 (de) | Generierung von grossen graphonem-einheiten mit kriterium gegenseitiger information für die sprachsynthese | |
DE602005009091D1 (de) | Erzeugen einer Spracherkennungsgrammatik für alphanumerische Ausdrücke | |
Michaud et al. | Tone and intonation: Introductory notes and practical recommendations | |
Prieto et al. | Early intonational development in Catalan | |
Ido | Bukharan Tajik | |
Cosi et al. | Baldini: baldi speaks italian! | |
Morales et al. | Speech-based human and service robot interaction: An application for Mexican dysarthric people | |
Madureira | Intonation and variation: the multiplicity of forms and senses | |
Caron | Tone and intonation | |
Sun et al. | A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model | |
KR20200056835A (ko) | 새로운 소리 분류방법에 따른 한국어 발음표기방법 및 이를 이용한 음성변환 및 음성인식 시스템 | |
Qiang | Paralanguage | |
Watson et al. | Resources created for building New Zealand English voices | |
KR20090109501A (ko) | 언어학습용 리듬훈련 시스템 및 방법 | |
Maurice | Your voice speaks volumes: It’s not what you say but how you say it (a review) | |
WO2006034152A3 (en) | Discriminative training of document transcription system | |
Amin et al. | Nine voices, one artist: Linguistic and acoustic analysis | |
Bairagi et al. | Implementing Concatenative Text-To-Speech Synthesis System for Marathi Language using Python | |
Chlébowski et al. | Nasal grunts” in the NECTE corpus, Meaningful interactional sounds | |
Kaan et al. | Applicability of the theory of phonology to the sound system of Tiv language | |
Hoffmann | Speech, text and braille conversion technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |