CN102496363A - 一种用于汉语语音合成的音调修正方法 - Google Patents
一种用于汉语语音合成的音调修正方法 Download PDFInfo
- Publication number
- CN102496363A CN102496363A CN2011103562596A CN201110356259A CN102496363A CN 102496363 A CN102496363 A CN 102496363A CN 2011103562596 A CN2011103562596 A CN 2011103562596A CN 201110356259 A CN201110356259 A CN 201110356259A CN 102496363 A CN102496363 A CN 102496363A
- Authority
- CN
- China
- Prior art keywords
- module
- fundamental frequency
- responsible
- model
- synthetic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 27
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 27
- 238000012937 correction Methods 0.000 title claims abstract description 9
- 238000012549 training Methods 0.000 claims abstract description 68
- 238000004458 analytical method Methods 0.000 claims abstract description 23
- 239000011295 pitch Substances 0.000 claims description 26
- 238000010606 normalization Methods 0.000 claims description 24
- 238000012986 modification Methods 0.000 claims description 19
- 230000004048 modification Effects 0.000 claims description 19
- 238000001228 spectrum Methods 0.000 claims description 17
- 238000000605 extraction Methods 0.000 claims description 14
- 239000011318 synthetic pitch Substances 0.000 claims description 14
- 230000011218 segmentation Effects 0.000 claims description 10
- 238000003066 decision tree Methods 0.000 claims description 9
- 238000009825 accumulation Methods 0.000 claims description 8
- 238000012821 model calculation Methods 0.000 claims description 6
- 230000003595 spectral effect Effects 0.000 claims description 4
- 230000033764 rhythmic process Effects 0.000 abstract description 7
- 238000010586 diagram Methods 0.000 description 10
- 238000004364 calculation method Methods 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 5
- 238000013461 design Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000013179 statistical model Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000002715 modification method Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
Images
Landscapes
- Machine Translation (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011103562596A CN102496363B (zh) | 2011-11-11 | 2011-11-11 | 一种用于汉语语音合成的音调修正方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011103562596A CN102496363B (zh) | 2011-11-11 | 2011-11-11 | 一种用于汉语语音合成的音调修正方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102496363A true CN102496363A (zh) | 2012-06-13 |
CN102496363B CN102496363B (zh) | 2013-07-17 |
Family
ID=46188180
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011103562596A Active CN102496363B (zh) | 2011-11-11 | 2011-11-11 | 一种用于汉语语音合成的音调修正方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102496363B (zh) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103531196A (zh) * | 2013-10-15 | 2014-01-22 | 中国科学院自动化研究所 | 一种波形拼接语音合成的选音方法 |
CN104282300A (zh) * | 2013-07-05 | 2015-01-14 | ***通信集团公司 | 一种非周期成分音节模型建立、及语音合成的方法和设备 |
CN104361896A (zh) * | 2014-12-04 | 2015-02-18 | 上海流利说信息技术有限公司 | 语音质量评价设备、方法和*** |
CN104916282A (zh) * | 2015-03-27 | 2015-09-16 | 北京捷通华声语音技术有限公司 | 一种语音合成的方法和装置 |
CN105529023A (zh) * | 2016-01-25 | 2016-04-27 | 百度在线网络技术(北京)有限公司 | 语音合成方法和装置 |
CN105654939A (zh) * | 2016-01-04 | 2016-06-08 | 北京时代瑞朗科技有限公司 | 一种基于音向量文本特征的语音合成方法 |
CN107039033A (zh) * | 2017-04-17 | 2017-08-11 | 海南职业技术学院 | 一种语音合成装置 |
CN107886938A (zh) * | 2016-09-29 | 2018-04-06 | 中国科学院深圳先进技术研究院 | 虚拟现实引导催眠语音处理方法及装置 |
CN107924677A (zh) * | 2015-06-11 | 2018-04-17 | 交互智能集团有限公司 | 用于异常值识别以移除语音合成中的不良对准的***和方法 |
CN108288464A (zh) * | 2018-01-25 | 2018-07-17 | 苏州奇梦者网络科技有限公司 | 一种修正合成音中错误声调的方法 |
CN108346424A (zh) * | 2017-01-23 | 2018-07-31 | 北京搜狗科技发展有限公司 | 语音合成方法和装置、用于语音合成的装置 |
CN109087627A (zh) * | 2018-10-16 | 2018-12-25 | 百度在线网络技术(北京)有限公司 | 用于生成信息的方法和装置 |
CN109300468A (zh) * | 2018-09-12 | 2019-02-01 | 科大讯飞股份有限公司 | 一种语音标注方法及装置 |
CN112289298A (zh) * | 2020-09-30 | 2021-01-29 | 北京大米科技有限公司 | 合成语音的处理方法、装置、存储介质以及电子设备 |
CN112786027A (zh) * | 2021-01-06 | 2021-05-11 | 浙江大学 | 一种语音输入矫正处理方法、装置、电子设备及存储介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101452699A (zh) * | 2007-12-04 | 2009-06-10 | 株式会社东芝 | 韵律自适应及语音合成的方法和装置 |
EP2337006A1 (en) * | 2009-11-24 | 2011-06-22 | Kai Yu | Speech processing and learning |
CN102201234A (zh) * | 2011-06-24 | 2011-09-28 | 北京宇音天下科技有限公司 | 一种基于音调自动标注及预测的语音合成方法 |
-
2011
- 2011-11-11 CN CN2011103562596A patent/CN102496363B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101452699A (zh) * | 2007-12-04 | 2009-06-10 | 株式会社东芝 | 韵律自适应及语音合成的方法和装置 |
EP2337006A1 (en) * | 2009-11-24 | 2011-06-22 | Kai Yu | Speech processing and learning |
CN102201234A (zh) * | 2011-06-24 | 2011-09-28 | 北京宇音天下科技有限公司 | 一种基于音调自动标注及预测的语音合成方法 |
Non-Patent Citations (2)
Title |
---|
CHENG-CHENG WANG ET AL: "Multi-Layer F0 Modeling for HMM-Based Speech Synthesis", 《CHINESE SPOKEN LANGUAGE PROCESSING, 2008. ISCSLP "08. 6TH INTERNATIONAL SYMPOSIUM ON》 * |
JIANHUA TAO ET AL: "Prosody Conversion From Neutral Speech to Emotional Speech", 《IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING》 * |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104282300A (zh) * | 2013-07-05 | 2015-01-14 | ***通信集团公司 | 一种非周期成分音节模型建立、及语音合成的方法和设备 |
CN103531196A (zh) * | 2013-10-15 | 2014-01-22 | 中国科学院自动化研究所 | 一种波形拼接语音合成的选音方法 |
CN103531196B (zh) * | 2013-10-15 | 2016-04-13 | 中国科学院自动化研究所 | 一种波形拼接语音合成的选音方法 |
CN104361896A (zh) * | 2014-12-04 | 2015-02-18 | 上海流利说信息技术有限公司 | 语音质量评价设备、方法和*** |
CN104361896B (zh) * | 2014-12-04 | 2018-04-13 | 上海流利说信息技术有限公司 | 语音质量评价设备、方法和*** |
CN104916282A (zh) * | 2015-03-27 | 2015-09-16 | 北京捷通华声语音技术有限公司 | 一种语音合成的方法和装置 |
CN104916282B (zh) * | 2015-03-27 | 2018-11-06 | 北京捷通华声科技股份有限公司 | 一种语音合成的方法和装置 |
CN107924677B (zh) * | 2015-06-11 | 2022-01-25 | 交互智能集团有限公司 | 用于异常值识别以移除语音合成中的不良对准的***和方法 |
CN107924677A (zh) * | 2015-06-11 | 2018-04-17 | 交互智能集团有限公司 | 用于异常值识别以移除语音合成中的不良对准的***和方法 |
CN105654939A (zh) * | 2016-01-04 | 2016-06-08 | 北京时代瑞朗科技有限公司 | 一种基于音向量文本特征的语音合成方法 |
CN105654939B (zh) * | 2016-01-04 | 2019-09-13 | 极限元(杭州)智能科技股份有限公司 | 一种基于音向量文本特征的语音合成方法 |
CN105529023A (zh) * | 2016-01-25 | 2016-04-27 | 百度在线网络技术(北京)有限公司 | 语音合成方法和装置 |
CN105529023B (zh) * | 2016-01-25 | 2019-09-03 | 百度在线网络技术(北京)有限公司 | 语音合成方法和装置 |
CN107886938B (zh) * | 2016-09-29 | 2020-11-17 | 中国科学院深圳先进技术研究院 | 虚拟现实引导催眠语音处理方法及装置 |
CN107886938A (zh) * | 2016-09-29 | 2018-04-06 | 中国科学院深圳先进技术研究院 | 虚拟现实引导催眠语音处理方法及装置 |
CN108346424A (zh) * | 2017-01-23 | 2018-07-31 | 北京搜狗科技发展有限公司 | 语音合成方法和装置、用于语音合成的装置 |
CN108346424B (zh) * | 2017-01-23 | 2021-11-19 | 北京搜狗科技发展有限公司 | 语音合成方法和装置、用于语音合成的装置 |
CN107039033A (zh) * | 2017-04-17 | 2017-08-11 | 海南职业技术学院 | 一种语音合成装置 |
CN108288464A (zh) * | 2018-01-25 | 2018-07-17 | 苏州奇梦者网络科技有限公司 | 一种修正合成音中错误声调的方法 |
CN109300468A (zh) * | 2018-09-12 | 2019-02-01 | 科大讯飞股份有限公司 | 一种语音标注方法及装置 |
CN109300468B (zh) * | 2018-09-12 | 2022-09-06 | 科大讯飞股份有限公司 | 一种语音标注方法及装置 |
CN109087627A (zh) * | 2018-10-16 | 2018-12-25 | 百度在线网络技术(北京)有限公司 | 用于生成信息的方法和装置 |
CN112289298A (zh) * | 2020-09-30 | 2021-01-29 | 北京大米科技有限公司 | 合成语音的处理方法、装置、存储介质以及电子设备 |
CN112786027A (zh) * | 2021-01-06 | 2021-05-11 | 浙江大学 | 一种语音输入矫正处理方法、装置、电子设备及存储介质 |
CN112786027B (zh) * | 2021-01-06 | 2022-02-22 | 浙江大学 | 一种语音输入矫正处理方法、装置、电子设备及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN102496363B (zh) | 2013-07-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102496363B (zh) | 一种用于汉语语音合成的音调修正方法 | |
US11222620B2 (en) | Speech recognition using unspoken text and speech synthesis | |
US20220101826A1 (en) | Variational Embedding Capacity in Expressive End-to-End Speech Synthesis | |
CN101944359B (zh) | 一种面向特定人群的语音识别方法 | |
CN102800316B (zh) | 基于神经网络的声纹识别***的最优码本设计方法 | |
CN109036371B (zh) | 用于语音合成的音频数据生成方法及*** | |
CN102385859B (zh) | 参数语音合成方法和*** | |
CN103531205B (zh) | 基于深层神经网络特征映射的非对称语音转换方法 | |
US10621969B2 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
CN102201234B (zh) | 一种基于音调自动标注及预测的语音合成方法 | |
CN101599271A (zh) | 一种数字音乐情感的识别方法 | |
CN1815552B (zh) | 基于线谱频率及其阶间差分参数的频谱建模与语音增强方法 | |
CN114203147A (zh) | 用于文本到语音的跨说话者样式传递以及用于训练数据生成的***和方法 | |
Ai et al. | A neural vocoder with hierarchical generation of amplitude and phase spectra for statistical parametric speech synthesis | |
CN110767210A (zh) | 一种生成个性化语音的方法及装置 | |
CN106653056A (zh) | 基于lstm循环神经网络的基频提取模型及训练方法 | |
CN105654939A (zh) | 一种基于音向量文本特征的语音合成方法 | |
CN110648684B (zh) | 一种基于WaveNet的骨导语音增强波形生成方法 | |
CN116457870A (zh) | 并行化Tacotron:非自回归且可控的TTS | |
CN102237083A (zh) | 一种基于WinCE平台的便携式口语翻译***及其语言识别方法 | |
CN110930975B (zh) | 用于输出信息的方法和装置 | |
CA3195582A1 (en) | Audio generator and methods for generating an audio signal and training an audio generator | |
US20240127832A1 (en) | Decoder | |
CN103886859B (zh) | 基于一对多码书映射的语音转换方法 | |
US10446133B2 (en) | Multi-stream spectral representation for statistical parametric speech synthesis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: ZHUHAI YUYIN TIANXIA TECHNOLOGY CO., LTD. Free format text: FORMER OWNER: BEIJING YUYIN TIANXIA TECHNOLOGY CO., LTD. Effective date: 20140707 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 100085 HAIDIAN, BEIJING TO: 519000 ZHUHAI, GUANGDONG PROVINCE |
|
TR01 | Transfer of patent right |
Effective date of registration: 20140707 Address after: 519000 Guangdong city of Zhuhai province high tech Zone Tangjiawan Town Road No. 101, University of Tsinghua Science Park (Zhuhai) business building A A1013 Patentee after: Zhuhai Yu World Technology Co.,Ltd. Address before: 100085, room 15, 915 information road, Beijing, Haidian District Patentee before: BEIJING YUYIN TIANXIA TECHNOLOGY Co.,Ltd. |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20170106 Address after: 518057 Guangdong city of Shenzhen province Nanshan District science and Technology Park North Yuanxing Technology Building 406 North Block Patentee after: SHENZHEN AVSNEST TECHNOLOGY CO.,LTD. Address before: The financial trade No. 15 building, 100085 Beijing city Haidian District information Road Room 915 Patentee before: BEIJING YUYIN TIANXIA TECHNOLOGY Co.,Ltd. Effective date of registration: 20170106 Address after: The financial trade No. 15 building, 100085 Beijing city Haidian District information Road Room 915 Patentee after: BEIJING YUYIN TIANXIA TECHNOLOGY Co.,Ltd. Address before: 519000 Guangdong city of Zhuhai province high tech Zone Tangjiawan Town Road No. 101, University of Tsinghua Science Park (Zhuhai) business building A A1013 Patentee before: Zhuhai Yu World Technology Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20181023 Address after: 519000 Tsinghua Science and Technology Park (Zhuhai) Pioneering Building A Block A1013, 101 University Road, Tangjiawan Town, Zhuhai High-tech Zone, Guangdong Province Patentee after: Zhuhai Yu World Technology Co.,Ltd. Address before: 518057 Guangdong North Shenzhen science and Technology Park, north of Nanshan District science and technology tower, 406 Patentee before: SHENZHEN AVSNEST TECHNOLOGY CO.,LTD. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20190104 Address after: 100085 room 915, finance and trade building, 15 Information Road, Haidian District, Beijing. Co-patentee after: Zhuhai Hi-tech Angel Venture Capital Co.,Ltd. Patentee after: BEIJING YUYIN TIANXIA TECHNOLOGY Co.,Ltd. Address before: 519000 Tsinghua Science and Technology Park (Zhuhai) Pioneering Building A Block A1013, 101 University Road, Tangjiawan Town, Zhuhai High-tech Zone, Guangdong Province Patentee before: Zhuhai Yu World Technology Co.,Ltd. |