CN106710585B - Method and system for broadcasting polyphonic characters during voice interaction - Google Patents
- Publication number: CN106710585B (application CN201611199610.4A)
- Authority: CN (China)
- Prior art keywords: information, polyphone, module, voice, feedback information
- Prior art date
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611199610.4A (CN106710585B, zh) | 2016-12-22 | 2016-12-22 | Method and system for broadcasting polyphonic characters during voice interaction |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611199610.4A (CN106710585B, zh) | 2016-12-22 | 2016-12-22 | Method and system for broadcasting polyphonic characters during voice interaction |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106710585A (zh) | 2017-05-24 |
CN106710585B (zh) | 2019-11-08 |
Family
ID=58902972
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611199610.4A (Active; CN106710585B, zh) | Method and system for broadcasting polyphonic characters during voice interaction | 2016-12-22 | 2016-12-22 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106710585B (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
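CN108364652A (zh) * | 2018-01-16 | 2018-08-03 | 成都易讯呼科技有限公司 | Intelligent voice question-and-answer interaction control system for artificial-intelligence telephony |
CN109616111B (zh) * | 2018-12-24 | 2023-03-14 | 北京恒泰实达科技股份有限公司 | Scene interaction control method based on speech recognition |
CN110032626B (zh) * | 2019-04-19 | 2022-04-12 | 百度在线网络技术(北京)有限公司 | Voice broadcast method and device |
CN110277085B (zh) * | 2019-06-25 | 2021-08-24 | 腾讯科技(深圳)有限公司 | Method and device for determining the pronunciation of polyphonic characters |
CN110264994B (zh) * | 2019-07-02 | 2021-08-20 | 珠海格力电器股份有限公司 | Speech synthesis method, electronic device, and smart home system |
CN111128186B (zh) * | 2019-12-30 | 2022-06-17 | 云知声智能科技股份有限公司 | Method and device for phonetic annotation of polyphonic characters |
CN112259092B (zh) * | 2020-10-15 | 2023-09-01 | 深圳市同行者科技有限公司 | Voice broadcast method, device, and voice interaction equipment |
CN113658586B (zh) * | 2021-08-13 | 2024-04-09 | 北京百度网讯科技有限公司 | Training method for a speech recognition model, voice interaction method, and device |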
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
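CN1612209A (zh) * | 2003-10-29 | 2005-05-04 | 何佩娟 | Method and device for entering telephone number entries by voice |
CN1697019A (zh) * | 2004-05-13 | 2005-11-16 | 深圳市移动核软件有限公司 | Method for automatically pronouncing Chinese characters and method for making a mobile phone read short messages aloud |
CN101033977A (zh) * | 2007-04-18 | 2007-09-12 | 江苏新科数字技术有限公司 | Voice navigation method for a navigator |
CN101324884A (zh) * | 2008-07-29 | 2008-12-17 | 无敌科技(西安)有限公司 | Method for pronouncing polyphonic characters |
CN103456297A (zh) * | 2012-05-29 | 2013-12-18 | ***通信集团公司 | Method and device for speech recognition matching |
CN105336322A (zh) * | 2015-09-30 | 2016-02-17 | 百度在线网络技术(北京)有限公司 | Polyphone model training method, speech synthesis method, and device |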
Also Published As
Publication number | Publication date |
---|---|
CN106710585A (zh) | 2017-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106710585B (zh) | Method and system for broadcasting polyphonic characters during voice interaction | |
US11496582B2 (en) | Generation of automated message responses | |
US20220165268A1 (en) | Indicator for voice-based communications | |
US10140973B1 (en) | Text-to-speech processing using previously speech processed data | |
US10074363B2 (en) | Method and apparatus for keyword speech recognition | |
US10074369B2 (en) | Voice-based communications | |
US10917758B1 (en) | Voice-based messaging | |
EP2595143B1 (en) | Text to speech synthesis for texts with foreign language inclusions | |
US10163436B1 (en) | Training a speech processing system using spoken utterances | |
Ramani et al. | A common attribute based unified HTS framework for speech synthesis in Indian languages | |
US20080177543A1 (en) | Stochastic Syllable Accent Recognition | |
US20170169811A1 (en) | Text-to-speech processing systems and methods | |
Prahallad et al. | Sub-phonetic modeling for capturing pronunciation variations for conversational speech synthesis | |
CN105654943A (zh) | Voice wake-up method, device, and system | |
US8015008B2 (en) | System and method of using acoustic models for automatic speech recognition which distinguish pre- and post-vocalic consonants | |
Lileikytė et al. | Conversational telephone speech recognition for Lithuanian | |
Chen et al. | Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics | |
EP3507796A1 (en) | Voice-based communications | |
Banerjee et al. | Application of triphone clustering in acoustic modeling for continuous speech recognition in Bengali | |
JP2019056791A (ja) | 音声認識装置、音声認識方法およびプログラム | |
CN112397053B (zh) | Speech recognition method, device, electronic equipment, and readable storage medium | |
Wang et al. | Content-based language models for spoken document retrieval | |
US11328713B1 (en) | On-device contextual understanding | |
Barnard et al. | Phone recognition for spoken web search | |
Kiruthiga et al. | Annotating Speech Corpus for Prosody Modeling in Indian Language Text to Speech Systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| TA01 | Transfer of patent application right | Effective date of registration: 20170929; Address after: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03; Applicant after: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Address before: 200233 Shanghai, Qinzhou, North Road, No. 82, building 2, layer 1198; Applicant before: SHANGHAI YUZHIYI INFORMATION TECHNOLOGY Co.,Ltd. |
| GR01 | Patent grant | |
| PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Method and system of polyphone broadcasting in speech interaction; Effective date of registration: 20201201; Granted publication date: 20191108; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY Co.,Ltd.; Registration number: Y2020310000047 |
| PC01 | Cancellation of the registration of the contract for pledge of patent right | Date of cancellation: 20220307; Granted publication date: 20191108; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2020310000047 |
| PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: The method and system of polyphonic broadcasting in the process of voice interaction; Effective date of registration: 20230210; Granted publication date: 20191108; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2023310000028 |
| PC01 | Cancellation of the registration of the contract for pledge of patent right | Granted publication date: 20191108; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2023310000028 |
| PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: The method and system for broadcasting polyphonic characters in the process of voice interaction; Granted publication date: 20191108; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2024310000165 |