ATE531031T1 - Segmentbasierte tonale modellierung für tonale sprachen - Google Patents

Segmentbasierte tonale modellierung für tonale sprachen

Info

Publication number: ATE531031T1
Authority: AT; Austria
Prior art keywords: tonal; phones; segment; modeling; languages
Prior art date: 2004-01-21

Application number

AT05100347T

Other languages

English (en)

Inventor

Chao Huang

Min Chu

Original Assignee

Microsoft Corp

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2004-01-21

Filing date

2005-01-20

Publication date

2011-11-15

2005-01-20 Application filed by Microsoft Corp filed Critical Microsoft Corp

2011-11-15 Application granted granted Critical

2011-11-15 Publication of ATE531031T1 publication Critical patent/ATE531031T1/de

Links

238000006243 chemical reaction Methods 0.000 abstract 1
230000001419 dependent effect Effects 0.000 abstract 1

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/027—Syllables being the recognition units
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information

Landscapes

Engineering & Computer Science (AREA)
Computational Linguistics (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Human Computer Interaction (AREA)
Physics & Mathematics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Computer Vision & Pattern Recognition (AREA)
Machine Translation (AREA)
Document Processing Apparatus (AREA)
Telephonic Communication Services (AREA)

AT05100347T 2004-01-21 2005-01-20 Segmentbasierte tonale modellierung für tonale sprachen ATE531031T1 (de)

Applications Claiming Priority (1)

Application Number	Priority Date	Filing Date	Title
US10/762,060 US7684987B2 (en)	2004-01-21	2004-01-21	Segmental tonal modeling for tonal languages

Publications (1)

Publication Number	Publication Date
ATE531031T1 true ATE531031T1 (de)	2011-11-15

Family

ID=34634585

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
AT05100347T ATE531031T1 (de)	2004-01-21	2005-01-20	Segmentbasierte tonale modellierung für tonale sprachen

Country Status (6)

Country	Link
US (1)	US7684987B2 (de)
EP (1)	EP1557821B1 (de)
JP (1)	JP5208352B2 (de)
KR (1)	KR101169074B1 (de)
CN (1)	CN1645478B (de)
AT (1)	ATE531031T1 (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
TWI258087B (en) *	2004-12-31	2006-07-11	Delta Electronics Inc	Voice input method and system for portable device
US8249873B2 (en) *	2005-08-12	2012-08-21	Avaya Inc.	Tonal correction of speech
US20070050188A1 (en) *	2005-08-26	2007-03-01	Avaya Technology Corp.	Tone contour transformation of speech
CN101154379B (zh)	2006-09-27	2011-11-23	夏普株式会社	定位语音中的关键词的方法和设备以及语音识别***
US20080120108A1 (en) *	2006-11-16	2008-05-22	Frank Kao-Ping Soong	Multi-space distribution for pattern recognition based on mixed continuous and discrete observations
US20090048837A1 (en) *	2007-08-14	2009-02-19	Ling Ju Su	Phonetic tone mark system and method thereof
US8244534B2 (en)	2007-08-20	2012-08-14	Microsoft Corporation	HMM-based bilingual (Mandarin-English) TTS techniques
JP4962962B2 (ja) *	2007-09-11	2012-06-27	独立行政法人情報通信研究機構	音声認識装置、自動翻訳装置、音声認識方法、プログラム、及びデータ構造
US8583438B2 (en) *	2007-09-20	2013-11-12	Microsoft Corporation	Unnatural prosody detection in speech synthesis
JP5178109B2 (ja) *	2007-09-25	2013-04-10	株式会社東芝	検索装置、方法及びプログラム
CN101383149B (zh) *	2008-10-27	2011-02-02	哈尔滨工业大学	弦乐音乐颤音自动检测方法
US8510103B2 (en) *	2009-10-15	2013-08-13	Paul Angott	System and method for voice recognition
US9058751B2 (en) *	2011-11-21	2015-06-16	Age Of Learning, Inc.	Language phoneme practice engine
US9824695B2 (en) *	2012-06-18	2017-11-21	International Business Machines Corporation	Enhancing comprehension in voice communications
TW201403354A (zh) *	2012-07-03	2014-01-16	Univ Nat Taiwan Normal	以資料降維法及非線性算則建構中文文本可讀性數學模型之系統及其方法
CN103971677B (zh) *	2013-02-01	2015-08-12	腾讯科技（深圳）有限公司	一种声学语言模型训练方法和装置
US9396723B2 (en)	2013-02-01	2016-07-19	Tencent Technology (Shenzhen) Company Limited	Method and device for acoustic language model training
CN103839546A (zh) *	2014-03-26	2014-06-04	合肥新涛信息科技有限公司	一种基于江淮语系的语音识别***
CN103943109A (zh) *	2014-04-28	2014-07-23	深圳如果技术有限公司	一种将语音转换为文字的方法及装置
US10199034B2 (en)	2014-08-18	2019-02-05	At&T Intellectual Property I, L.P.	System and method for unified normalization in text-to-speech and automatic speech recognition
US9953646B2 (en)	2014-09-02	2018-04-24	Belleau Technologies	Method and system for dynamic speech recognition and tracking of prewritten script
CN110189744A (zh) *	2019-04-09	2019-08-30	阿里巴巴集团控股有限公司	文本处理的方法、装置和电子设备
US11393455B2 (en) *	2020-02-28	2022-07-19	Rovi Guides, Inc.	Methods for natural language model training in natural language understanding (NLU) systems
US11574127B2 (en) *	2020-02-28	2023-02-07	Rovi Guides, Inc.	Methods for natural language model training in natural language understanding (NLU) systems
US11392771B2 (en) *	2020-02-28	2022-07-19	Rovi Guides, Inc.	Methods for natural language model training in natural language understanding (NLU) systems
US11626103B2 (en)	2020-02-28	2023-04-11	Rovi Guides, Inc.	Methods for natural language model training in natural language understanding (NLU) systems

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5220639A (en) *	1989-12-01	1993-06-15	National Science Council	Mandarin speech input method for Chinese computers and a mandarin speech recognition machine
JPH04238396A (ja) *	1991-01-23	1992-08-26	Matsushita Electric Ind Co Ltd	音声合成用音声持続期間処理装置
US5623609A (en) *	1993-06-14	1997-04-22	Hal Trust, L.L.C.	Computer system and computer-implemented process for phonology-based automatic speech recognition
JP3234371B2 (ja) *	1993-11-12	2001-12-04	松下電器産業株式会社	音声合成用音声持続時間処理方法及びその装置
US5680510A (en)	1995-01-26	1997-10-21	Apple Computer, Inc.	System and method for generating and using context dependent sub-syllable models to recognize a tonal language
US5751905A (en) *	1995-03-15	1998-05-12	International Business Machines Corporation	Statistical acoustic processing method and apparatus for speech recognition using a toned phoneme system
US6038533A (en) *	1995-07-07	2000-03-14	Lucent Technologies Inc.	System and method for selecting training text
US6006175A (en) *	1996-02-06	1999-12-21	The Regents Of The University Of California	Methods and apparatus for non-acoustic speech characterization and recognition
JP2001166789A (ja) *	1999-12-10	2001-06-22	Matsushita Electric Ind Co Ltd	初頭／末尾の音素類似度ベクトルによる中国語の音声認識方法及びその装置
US6553342B1 (en) *	2000-02-02	2003-04-22	Motorola, Inc.	Tone based speech recognition
US6510410B1 (en) *	2000-07-28	2003-01-21	International Business Machines Corporation	Method and apparatus for recognizing tone languages using pitch information
JP2002229590A (ja) *	2001-02-01	2002-08-16	Atr Onsei Gengo Tsushin Kenkyusho:Kk	音声認識システム
JP2002268672A (ja) *	2001-03-13	2002-09-20	Atr Onsei Gengo Tsushin Kenkyusho:Kk	音声データベース用文セットの選択方法

2004
- 2004-01-21 US US10/762,060 patent/US7684987B2/en not_active Expired - Fee Related
2005
- 2005-01-20 EP EP05100347A patent/EP1557821B1/de active Active
- 2005-01-20 AT AT05100347T patent/ATE531031T1/de not_active IP Right Cessation
- 2005-01-20 JP JP2005013319A patent/JP5208352B2/ja active Active
- 2005-01-21 KR KR1020050005828A patent/KR101169074B1/ko active IP Right Grant
- 2005-01-21 CN CN2005100094029A patent/CN1645478B/zh not_active Expired - Fee Related

Also Published As

Publication number	Publication date
CN1645478A (zh)	2005-07-27
JP2005208652A (ja)	2005-08-04
EP1557821A2 (de)	2005-07-27
KR101169074B1 (ko)	2012-07-26
CN1645478B (zh)	2012-03-21
EP1557821A3 (de)	2008-04-02
US20050159954A1 (en)	2005-07-21
EP1557821B1 (de)	2011-10-26
JP5208352B2 (ja)	2013-06-12
KR20050076712A (ko)	2005-07-26
US7684987B2 (en)	2010-03-23

Legal Events

Date	Code	Title	Description
2012-04-15	RER	Ceased as to paragraph 5 lit. 3 law introducing patent treaties

Publication	Publication Date	Title
ATE531031T1 (de)	2011-11-15	Segmentbasierte tonale modellierung für tonale sprachen
CN106531185B (zh)	2019-12-13	基于语音相似度的语音评测方法及***
CN102231278B (zh)	2013-08-21	实现语音识别中自动添加标点符号的方法及***
Coulston et al.	2002	Amplitude convergence in children’s conversational speech with animated personas
ATE508453T1 (de)	2011-05-15	Generierung von grossen graphonem-einheiten mit kriterium gegenseitiger information für die sprachsynthese
DE602005009091D1 (de)	2008-10-02	Erzeugen einer Spracherkennungsgrammatik für alphanumerische Ausdrücke
Michaud et al.	2015	Tone and intonation: Introductory notes and practical recommendations
Prieto et al.	2007	Early intonational development in Catalan
Ido	2014	Bukharan Tajik
Cosi et al.	2002	Baldini: baldi speaks italian!
Morales et al.	2013	Speech-based human and service robot interaction: An application for Mexican dysarthric people
Madureira	2016	Intonation and variation: the multiplicity of forms and senses
Caron	2015	Tone and intonation
Sun et al.	2012	A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model
KR20200056835A (ko)	2020-05-25	새로운 소리 분류방법에 따른 한국어 발음표기방법 및 이를 이용한 음성변환 및 음성인식 시스템
Qiang	2013	Paralanguage
Watson et al.	2014	Resources created for building New Zealand English voices
KR20090109501A (ko)	2009-10-20	언어학습용 리듬훈련 시스템 및 방법
Maurice	2020	Your voice speaks volumes: It’s not what you say but how you say it (a review)
WO2006034152A3 (en)	2007-03-01	Discriminative training of document transcription system
Amin et al.	2012	Nine voices, one artist: Linguistic and acoustic analysis
Bairagi et al.	2022	Implementing Concatenative Text-To-Speech Synthesis System for Marathi Language using Python
Chlébowski et al.	2015	Nasal grunts” in the NECTE corpus, Meaningful interactional sounds
Kaan et al.	2014	Applicability of the theory of phonology to the sound system of Tiv language
Hoffmann	2008	Speech, text and braille conversion technology