GB2591245B - An expressive text-to-speech system - Google Patents
An expressive text-to-speech system Download PDFInfo
- Publication number
- GB2591245B GB2591245B GB2000883.5A GB202000883A GB2591245B GB 2591245 B GB2591245 B GB 2591245B GB 202000883 A GB202000883 A GB 202000883A GB 2591245 B GB2591245 B GB 2591245B
- Authority
- GB
- United Kingdom
- Prior art keywords
- speech system
- expressive text
- expressive
- text
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Signal Processing (AREA)
- Machine Translation (AREA)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2000883.5A GB2591245B (en) | 2020-01-21 | 2020-01-21 | An expressive text-to-speech system |
KR1020200062637A KR20210095010A (en) | 2020-01-21 | 2020-05-25 | Expressive text-to-speech system and method |
US17/037,023 US11830473B2 (en) | 2020-01-21 | 2020-09-29 | Expressive text-to-speech system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2000883.5A GB2591245B (en) | 2020-01-21 | 2020-01-21 | An expressive text-to-speech system |
Publications (3)
Publication Number | Publication Date |
---|---|
GB202000883D0 GB202000883D0 (en) | 2020-03-04 |
GB2591245A GB2591245A (en) | 2021-07-28 |
GB2591245B true GB2591245B (en) | 2022-06-15 |
Family
ID=69636811
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB2000883.5A Active GB2591245B (en) | 2020-01-21 | 2020-01-21 | An expressive text-to-speech system |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR20210095010A (en) |
GB (1) | GB2591245B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112951202B (en) * | 2021-03-11 | 2022-11-08 | 北京嘀嘀无限科技发展有限公司 | Speech synthesis method, apparatus, electronic device and program product |
EP4293660A1 (en) * | 2021-06-22 | 2023-12-20 | Samsung Electronics Co., Ltd. | Electronic device and method for controlling same |
CN113611309B (en) * | 2021-07-13 | 2024-05-10 | 北京捷通华声科技股份有限公司 | Tone conversion method and device, electronic equipment and readable storage medium |
CN113838452B (en) * | 2021-08-17 | 2022-08-23 | 北京百度网讯科技有限公司 | Speech synthesis method, apparatus, device and computer storage medium |
US11978475B1 (en) * | 2021-09-03 | 2024-05-07 | Wells Fargo Bank, N.A. | Systems and methods for determining a next action based on a predicted emotion by weighting each portion of the action's reply |
CN115985282A (en) * | 2021-10-14 | 2023-04-18 | 北京字跳网络技术有限公司 | Method and device for adjusting speech rate, electronic equipment and readable storage medium |
CN114255737B (en) * | 2022-02-28 | 2022-05-17 | 北京世纪好未来教育科技有限公司 | Voice generation method and device and electronic equipment |
CN115116431B (en) * | 2022-08-29 | 2022-11-18 | 深圳市星范儿文化科技有限公司 | Audio generation method, device, equipment and storage medium based on intelligent reading kiosk |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150186359A1 (en) * | 2013-12-30 | 2015-07-02 | Google Inc. | Multilingual prosody generation |
US20190172443A1 (en) * | 2017-12-06 | 2019-06-06 | International Business Machines Corporation | System and method for generating expressive prosody for speech synthesis |
WO2019139428A1 (en) * | 2018-01-11 | 2019-07-18 | 네오사피엔스 주식회사 | Multilingual text-to-speech synthesis method |
-
2020
- 2020-01-21 GB GB2000883.5A patent/GB2591245B/en active Active
- 2020-05-25 KR KR1020200062637A patent/KR20210095010A/en active Search and Examination
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150186359A1 (en) * | 2013-12-30 | 2015-07-02 | Google Inc. | Multilingual prosody generation |
US20190172443A1 (en) * | 2017-12-06 | 2019-06-06 | International Business Machines Corporation | System and method for generating expressive prosody for speech synthesis |
WO2019139428A1 (en) * | 2018-01-11 | 2019-07-18 | 네오사피엔스 주식회사 | Multilingual text-to-speech synthesis method |
Also Published As
Publication number | Publication date |
---|---|
GB2591245A (en) | 2021-07-28 |
GB202000883D0 (en) | 2020-03-04 |
KR20210095010A (en) | 2021-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2591245B (en) | An expressive text-to-speech system | |
EP3739476A4 (en) | Multilingual text-to-speech synthesis method | |
EP3895159A4 (en) | Multi-speaker neural text-to-speech synthesis | |
EP3754650C0 (en) | Location-based voice recognition system through voice command | |
SG11202009556XA (en) | Text-to-speech synthesis system and method | |
EP3665916A4 (en) | Emergency voice service support indications | |
IL254317A0 (en) | System and method for generating accurate speech transcription from natural speech audio signals | |
EP3709249A4 (en) | System for providing user-customized last and method therefor | |
EP3641345C0 (en) | A method for operating a hearing instrument and a hearing system comprising a hearing instrument | |
EP3690878A4 (en) | Voice command system and voice command method | |
IL285994A (en) | An aerosol provision system | |
ZA201907037B (en) | Hydraulic support voice control system and method based on vocal cord vibration measurement | |
EP3641344C0 (en) | A method for operating a hearing instrument and a hearing system comprising a hearing instrument | |
DK3833043T3 (en) | HEARING SYSTEM INCLUDING A PERSONAL BEAM SHAPER | |
SG11202009311RA (en) | Speech analysis system | |
EP3602539A4 (en) | System providing expressive and emotive text-to-speech | |
GB201811458D0 (en) | An ambisonic microphone apparatus | |
EP3614696A4 (en) | Beam former, beam forming method and hearing aid system | |
GB2607903B (en) | Text-to-speech system | |
GB202010620D0 (en) | System | |
GB202105780D0 (en) | Emotion recognition for artificially-intelligent system | |
CA208966S (en) | Siren | |
GB2591790B (en) | Speaker system | |
GB2611336B (en) | Net-launching system | |
GB202102114D0 (en) | Laser system |