CN105654942A - Speech synthesis method of interrogative sentence and exclamatory sentence based on statistical parameter - Google Patents
Speech synthesis method of interrogative sentence and exclamatory sentence based on statistical parameter Download PDFInfo
- Publication number
- CN105654942A CN105654942A CN201610000676.XA CN201610000676A CN105654942A CN 105654942 A CN105654942 A CN 105654942A CN 201610000676 A CN201610000676 A CN 201610000676A CN 105654942 A CN105654942 A CN 105654942A
- Authority
- CN
- China
- Prior art keywords
- sentence
- acoustic model
- neural network
- deep neural
- interrogative
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000001308 synthesis method Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 claims abstract description 62
- 238000012549 training Methods 0.000 claims abstract description 47
- 230000003044 adaptive effect Effects 0.000 claims abstract description 23
- 238000004519 manufacturing process Methods 0.000 claims abstract description 5
- 238000013528 artificial neural network Methods 0.000 claims description 52
- 230000008569 process Effects 0.000 claims description 24
- 239000000463 material Substances 0.000 claims description 21
- 238000001228 spectrum Methods 0.000 claims description 16
- 230000002194 synthesizing effect Effects 0.000 claims description 14
- 230000009466 transformation Effects 0.000 claims description 12
- 238000007476 Maximum Likelihood Methods 0.000 claims description 9
- 238000003062 neural network model Methods 0.000 claims description 5
- 230000005284 excitation Effects 0.000 claims description 4
- 238000000605 extraction Methods 0.000 claims description 4
- 238000013459 approach Methods 0.000 claims description 3
- 230000004927 fusion Effects 0.000 claims description 3
- 238000013507 mapping Methods 0.000 claims description 3
- 239000000203 mixture Substances 0.000 claims description 3
- 210000002569 neuron Anatomy 0.000 claims description 3
- 230000015572 biosynthetic process Effects 0.000 abstract description 18
- 238000003786 synthesis reaction Methods 0.000 abstract description 17
- 238000009826 distribution Methods 0.000 description 15
- 239000011159 matrix material Substances 0.000 description 6
- 230000006870 function Effects 0.000 description 5
- 230000006978 adaptation Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000003066 decision tree Methods 0.000 description 3
- 230000033764 rhythmic process Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 206010068319 Oropharyngeal pain Diseases 0.000 description 1
- 201000007100 Pharyngitis Diseases 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000036651 mood Effects 0.000 description 1
- 238000007634 remodeling Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 235000015170 shellfish Nutrition 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610000676.XA CN105654942A (en) | 2016-01-04 | 2016-01-04 | Speech synthesis method of interrogative sentence and exclamatory sentence based on statistical parameter |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610000676.XA CN105654942A (en) | 2016-01-04 | 2016-01-04 | Speech synthesis method of interrogative sentence and exclamatory sentence based on statistical parameter |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105654942A true CN105654942A (en) | 2016-06-08 |
Family
ID=56491319
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610000676.XA Pending CN105654942A (en) | 2016-01-04 | 2016-01-04 | Speech synthesis method of interrogative sentence and exclamatory sentence based on statistical parameter |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105654942A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103900A (en) * | 2017-06-06 | 2017-08-29 | 西北师范大学 | A kind of across language emotional speech synthesizing method and system |
CN108364631A (en) * | 2017-01-26 | 2018-08-03 | 北京搜狗科技发展有限公司 | A kind of phoneme synthesizing method and device |
CN108447474A (en) * | 2018-03-12 | 2018-08-24 | 北京灵伴未来科技有限公司 | A kind of modeling and the control method of virtual portrait voice and Hp-synchronization |
CN110942763A (en) * | 2018-09-20 | 2020-03-31 | 阿里巴巴集团控股有限公司 | Voice recognition method and device |
CN111710326A (en) * | 2020-06-12 | 2020-09-25 | 携程计算机技术(上海)有限公司 | English voice synthesis method and system, electronic equipment and storage medium |
CN111950545A (en) * | 2020-07-23 | 2020-11-17 | 南京大学 | Scene text detection method based on MSNDET and space division |
WO2022134833A1 (en) * | 2020-12-23 | 2022-06-30 | 深圳壹账通智能科技有限公司 | Speech signal processing method, apparatus and device, and storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1835074A (en) * | 2006-04-07 | 2006-09-20 | 安徽中科大讯飞信息科技有限公司 | Speaking person conversion method combined high layer discription information and model self adaption |
CN103035247A (en) * | 2012-12-05 | 2013-04-10 | 北京三星通信技术研究有限公司 | Method and device of operation on audio/video file based on voiceprint information |
CN103345656A (en) * | 2013-07-17 | 2013-10-09 | 中国科学院自动化研究所 | Method and device for data identification based on multitask deep neural network |
US20140257809A1 (en) * | 2011-10-28 | 2014-09-11 | Vaibhava Goel | Sparse maximum a posteriori (map) adaption |
CN105118498A (en) * | 2015-09-06 | 2015-12-02 | 百度在线网络技术(北京)有限公司 | Training method and apparatus of speech synthesis model |
CN105184303A (en) * | 2015-04-23 | 2015-12-23 | 南京邮电大学 | Image marking method based on multi-mode deep learning |
CN105206258B (en) * | 2015-10-19 | 2018-05-04 | 百度在线网络技术(北京)有限公司 | The generation method and device and phoneme synthesizing method and device of acoustic model |
-
2016
- 2016-01-04 CN CN201610000676.XA patent/CN105654942A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1835074A (en) * | 2006-04-07 | 2006-09-20 | 安徽中科大讯飞信息科技有限公司 | Speaking person conversion method combined high layer discription information and model self adaption |
US20140257809A1 (en) * | 2011-10-28 | 2014-09-11 | Vaibhava Goel | Sparse maximum a posteriori (map) adaption |
CN103035247A (en) * | 2012-12-05 | 2013-04-10 | 北京三星通信技术研究有限公司 | Method and device of operation on audio/video file based on voiceprint information |
CN103345656A (en) * | 2013-07-17 | 2013-10-09 | 中国科学院自动化研究所 | Method and device for data identification based on multitask deep neural network |
CN105184303A (en) * | 2015-04-23 | 2015-12-23 | 南京邮电大学 | Image marking method based on multi-mode deep learning |
CN105118498A (en) * | 2015-09-06 | 2015-12-02 | 百度在线网络技术(北京)有限公司 | Training method and apparatus of speech synthesis model |
CN105206258B (en) * | 2015-10-19 | 2018-05-04 | 百度在线网络技术(北京)有限公司 | The generation method and device and phoneme synthesizing method and device of acoustic model |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108364631A (en) * | 2017-01-26 | 2018-08-03 | 北京搜狗科技发展有限公司 | A kind of phoneme synthesizing method and device |
CN108364631B (en) * | 2017-01-26 | 2021-01-22 | 北京搜狗科技发展有限公司 | Speech synthesis method and device |
CN107103900A (en) * | 2017-06-06 | 2017-08-29 | 西北师范大学 | A kind of across language emotional speech synthesizing method and system |
CN108447474A (en) * | 2018-03-12 | 2018-08-24 | 北京灵伴未来科技有限公司 | A kind of modeling and the control method of virtual portrait voice and Hp-synchronization |
CN110942763A (en) * | 2018-09-20 | 2020-03-31 | 阿里巴巴集团控股有限公司 | Voice recognition method and device |
CN110942763B (en) * | 2018-09-20 | 2023-09-12 | 阿里巴巴集团控股有限公司 | Speech recognition method and device |
CN111710326A (en) * | 2020-06-12 | 2020-09-25 | 携程计算机技术(上海)有限公司 | English voice synthesis method and system, electronic equipment and storage medium |
CN111710326B (en) * | 2020-06-12 | 2024-01-23 | 携程计算机技术(上海)有限公司 | English voice synthesis method and system, electronic equipment and storage medium |
CN111950545A (en) * | 2020-07-23 | 2020-11-17 | 南京大学 | Scene text detection method based on MSNDET and space division |
CN111950545B (en) * | 2020-07-23 | 2024-02-09 | 南京大学 | Scene text detection method based on MSDNet and space division |
WO2022134833A1 (en) * | 2020-12-23 | 2022-06-30 | 深圳壹账通智能科技有限公司 | Speech signal processing method, apparatus and device, and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105654942A (en) | Speech synthesis method of interrogative sentence and exclamatory sentence based on statistical parameter | |
CN101000765B (en) | Speech synthetic method based on rhythm character | |
CN101178896B (en) | Unit selection voice synthetic method based on acoustics statistical model | |
CN106531150B (en) | Emotion synthesis method based on deep neural network model | |
CN106971703A (en) | A kind of song synthetic method and device based on HMM | |
CN1835074B (en) | Speaking person conversion method combined high layer discription information and model self adaption | |
CN101777347B (en) | Model complementary Chinese accent identification method and system | |
CN108364639A (en) | Speech processing system and method | |
CN102254554B (en) | Method for carrying out hierarchical modeling and predicating on mandarin accent | |
CN102568476B (en) | Voice conversion method based on self-organizing feature map network cluster and radial basis network | |
KR102311922B1 (en) | Apparatus and method for controlling outputting target information to voice using characteristic of user voice | |
CN1835075B (en) | Speech synthetizing method combined natural sample selection and acaustic parameter to build mould | |
CN105654939A (en) | Voice synthesis method based on voice vector textual characteristics | |
Liu et al. | Mongolian text-to-speech system based on deep neural network | |
WO2012164835A1 (en) | Prosody generator, speech synthesizer, prosody generating method and prosody generating program | |
Khanam et al. | Text to speech synthesis: a systematic review, deep learning based architecture and future research direction | |
Dongmei | Design of English text-to-speech conversion algorithm based on machine learning | |
CN103226946B (en) | Voice synthesis method based on limited Boltzmann machine | |
Shahid et al. | Generative emotional ai for speech emotion recognition: The case for synthetic emotional speech augmentation | |
Savargiv et al. | Study on unit-selection and statistical parametric speech synthesis techniques | |
Sakai | Additive modeling of English F0 contour for speech synthesis | |
Wen et al. | Improving deep neural network based speech synthesis through contextual feature parametrization and multi-task learning | |
CN116092471A (en) | Multi-style personalized Tibetan language speech synthesis model oriented to low-resource condition | |
TWI402824B (en) | A pronunciation variation generation method for spontaneous speech synthesis | |
Coto-Jiménez et al. | LSTM deep neural networks postfiltering for improving the quality of synthetic voices |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: 310000 Room 1105, 11/F, Building 4, No. 9, Jiuhuan Road, Jianggan District, Hangzhou City, Zhejiang Province Applicant after: Limit element (Hangzhou) intelligent Polytron Technologies Inc. Address before: 100089 Floor 1-312-316, No. 1 Building, 35 Shangdi East Road, Haidian District, Beijing Applicant before: Limit element (Beijing) smart Polytron Technologies Inc. Address after: 100089 Floor 1-312-316, No. 1 Building, 35 Shangdi East Road, Haidian District, Beijing Applicant after: Limit element (Beijing) smart Polytron Technologies Inc. Address before: 100089 Floor 1-312-316, No. 1 Building, 35 Shangdi East Road, Haidian District, Beijing Applicant before: Limit Yuan (Beijing) Intelligent Technology Co.,Ltd. Address after: 100089 Floor 1-312-316, No. 1 Building, 35 Shangdi East Road, Haidian District, Beijing Applicant after: Limit Yuan (Beijing) Intelligent Technology Co.,Ltd. Address before: 100085 Block 318, Yiquanhui Office Building, 35 Shangdi East Road, Haidian District, Beijing Applicant before: BEIJING TIMES RUILANG TECHNOLOGY Co.,Ltd. |
|
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160608 |