CN1604185A - Voice synthesizing system and method by utilizing length variable sub-words - Google Patents
Voice synthesizing system and method by utilizing length variable sub-words Download PDFInfo
- Publication number
- CN1604185A CN1604185A CN 03164848 CN03164848A CN1604185A CN 1604185 A CN1604185 A CN 1604185A CN 03164848 CN03164848 CN 03164848 CN 03164848 A CN03164848 A CN 03164848A CN 1604185 A CN1604185 A CN 1604185A
- Authority
- CN
- China
- Prior art keywords
- waveform
- input text
- sound inventory
- sound
- sub
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 230000002194 synthesizing effect Effects 0.000 title claims abstract description 7
- 230000015572 biosynthetic process Effects 0.000 claims description 15
- 238000003786 synthesis reaction Methods 0.000 claims description 14
- 230000011218 segmentation Effects 0.000 claims description 11
- 230000008569 process Effects 0.000 claims description 7
- 230000003203 everyday effect Effects 0.000 claims description 5
- 238000010586 diagram Methods 0.000 description 6
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 230000000712 assembly Effects 0.000 description 2
- 238000000429 assembly Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010189 synthetic method Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
Images
Landscapes
- Telephone Function (AREA)
Abstract
Description
?Word | ?CV-like?unit |
?Battery | ?b’ae(Wo)+tax(Wo)+riy(Wf) |
?Level | ?l’eh(Wo)+vaxl(Wf) |
?Is | ?’Ih(Ws)+s |
?Low | ?l’ow(Ws) |
Type cases | Example | The duration result | ||
Left side text | The right text | |||
1+1 in the syllable<=2 | ?1 | Vowel | Semivowel/nasal sound ending | 1+1<2 |
?2 | Vowel | Vowel | 1+1<2 | |
?3 | Vowel | The consonant ending | 1+1=2 | |
?4 | The consonant initial | Semivowel/nasal sound | 1+1~=2 | |
?5 | The consonant initial | Vowel | 1+1~=2 | |
?6 | Semivowel/nasal sound initial | Vowel | 1+1=2 | |
Inter-syllable 1+1>=2 | ?7 | Vowel | Semivowel/nasal sound | 1+1>=2 |
?8 | Semivowel/nasal sound | Semivowel/nasal sound | 1+1>=2 | |
?9 | Vowel | Consonant | 1+1=2 | |
?10 | The consonant ending | The consonant initial | 1+1=2 |
Claims (12)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 03164848 CN1604185B (en) | 2003-09-29 | 2003-09-29 | Voice synthesizing system and method by utilizing length variable sub-words |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 03164848 CN1604185B (en) | 2003-09-29 | 2003-09-29 | Voice synthesizing system and method by utilizing length variable sub-words |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1604185A true CN1604185A (en) | 2005-04-06 |
CN1604185B CN1604185B (en) | 2010-05-26 |
Family
ID=34660846
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 03164848 Expired - Fee Related CN1604185B (en) | 2003-09-29 | 2003-09-29 | Voice synthesizing system and method by utilizing length variable sub-words |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1604185B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102334119A (en) * | 2009-02-26 | 2012-01-25 | 国立大学法人丰桥技术科学大学 | Speech search device and speech search method |
CN109313249A (en) * | 2016-06-28 | 2019-02-05 | 微软技术许可有限责任公司 | Audio augmented reality system |
CN112562637A (en) * | 2019-09-25 | 2021-03-26 | 北京中关村科金技术有限公司 | Method, device and storage medium for splicing voice and audio |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2290684A (en) * | 1994-06-22 | 1996-01-03 | Ibm | Speech synthesis using hidden Markov model to determine speech unit durations |
US6064960A (en) * | 1997-12-18 | 2000-05-16 | Apple Computer, Inc. | Method and apparatus for improved duration modeling of phonemes |
GB0113587D0 (en) * | 2001-06-04 | 2001-07-25 | Hewlett Packard Co | Speech synthesis apparatus |
-
2003
- 2003-09-29 CN CN 03164848 patent/CN1604185B/en not_active Expired - Fee Related
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102334119A (en) * | 2009-02-26 | 2012-01-25 | 国立大学法人丰桥技术科学大学 | Speech search device and speech search method |
US8626508B2 (en) | 2009-02-26 | 2014-01-07 | National University Corporation Toyohashi University Of Technology | Speech search device and speech search method |
CN102334119B (en) * | 2009-02-26 | 2014-05-21 | 国立大学法人丰桥技术科学大学 | Speech search device and speech search method |
CN109313249A (en) * | 2016-06-28 | 2019-02-05 | 微软技术许可有限责任公司 | Audio augmented reality system |
CN109313249B (en) * | 2016-06-28 | 2023-06-27 | 微软技术许可有限责任公司 | Audio augmented reality system |
CN112562637A (en) * | 2019-09-25 | 2021-03-26 | 北京中关村科金技术有限公司 | Method, device and storage medium for splicing voice and audio |
CN112562637B (en) * | 2019-09-25 | 2024-02-06 | 北京中关村科金技术有限公司 | Method, device and storage medium for splicing voice audios |
Also Published As
Publication number | Publication date |
---|---|
CN1604185B (en) | 2010-05-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Abushariah et al. | Arabic speaker-independent continuous automatic speech recognition based on a phonetically rich and balanced speech corpus. | |
EP2259252B1 (en) | Speech recognition method for selecting a combination of list elements via a speech input | |
CN1112669C (en) | Method and system for speech recognition using continuous density hidden Markov models | |
US6684187B1 (en) | Method and system for preselection of suitable units for concatenative speech | |
US5680510A (en) | System and method for generating and using context dependent sub-syllable models to recognize a tonal language | |
Lee | Voice dictation of mandarin chinese | |
JP3481497B2 (en) | Method and apparatus using a decision tree to generate and evaluate multiple pronunciations for spelled words | |
WO2010018796A1 (en) | Exception dictionary creating device, exception dictionary creating method and program therefor, and voice recognition device and voice recognition method | |
WO2018067547A1 (en) | Speech synthesis | |
US7454343B2 (en) | Speech synthesizer, speech synthesizing method, and program | |
SG185300A1 (en) | System and method for distributed text-to-speech synthesis and intelligibility | |
US8942983B2 (en) | Method of speech synthesis | |
Liu et al. | Syllable language models for Mandarin speech recognition: Exploiting character language models | |
Chen et al. | The ustc system for blizzard challenge 2011 | |
Chen et al. | A new prosody-assisted mandarin ASR system | |
CN1604185B (en) | Voice synthesizing system and method by utilizing length variable sub-words | |
Pellegrini et al. | Automatic word decompounding for asr in a morphologically rich language: Application to amharic | |
Hu et al. | Investigating the Use of Mixed-Units Based Modeling for Improving Uyghur Speech Recognition. | |
Lei et al. | Development of the 2008 SRI Mandarin speech-to-text system for broadcast news and conversation. | |
Kominek | Tts from zero: Building synthetic voices for new languages | |
US8175865B2 (en) | Method and apparatus of generating text script for a corpus-based text-to speech system | |
JP5137588B2 (en) | Language model generation apparatus and speech recognition apparatus | |
JP2001100776A (en) | Vocie synthesizer | |
Lo et al. | Multi-scale spoken document retrieval for Cantonese broadcast news | |
KR100451919B1 (en) | Decomposition and synthesis method of english phonetic symbols |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NUANCE COMMUNICATIONS CO., LTD. Free format text: FORMER OWNER: MOTOROLA INC. Effective date: 20100908 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: ILLINOIS, UNITED STATES TO: MASSACHUSETTS, UNITED STATES |
|
TR01 | Transfer of patent right |
Effective date of registration: 20100908 Address after: Massachusetts, USA Patentee after: Nuance Communications, Inc. Address before: Illinois Instrunment Patentee before: Motorola, Inc. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200923 Address after: Massachusetts, USA Patentee after: Serenes operations Address before: Massachusetts, USA Patentee before: Nuance Communications, Inc. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20100526 |
|
CF01 | Termination of patent right due to non-payment of annual fee |