PL401371A1 - Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę - Google Patents

Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę

Info

Publication number
PL401371A1
PL401371A1 PL401371A PL40137112A PL401371A1 PL 401371 A1 PL401371 A1 PL 401371A1 PL 401371 A PL401371 A PL 401371A PL 40137112 A PL40137112 A PL 40137112A PL 401371 A1 PL401371 A1 PL 401371A1
Authority
PL
Poland
Prior art keywords
voice
text
development
synthesized speech
conversion system
Prior art date
Application number
PL401371A
Other languages
English (en)
Inventor
Łukasz M. Osowski
Michał T. Kaszczuk
Original Assignee
Ivona Software Spółka Z Ograniczoną Odpowiedzialnością
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ivona Software Spółka Z Ograniczoną Odpowiedzialnością filed Critical Ivona Software Spółka Z Ograniczoną Odpowiedzialnością
Priority to PL401371A priority Critical patent/PL401371A1/pl
Priority to US13/720,925 priority patent/US9196240B2/en
Publication of PL401371A1 publication Critical patent/PL401371A1/pl

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

Wynalazek dotyczy opracowania głosu dla zautomatyzowanej zamiany tekstu na mowę. Grupie użytkowników przedstawia się tekst oraz nagrania mowy syntetyzowanej dla danego tekstu. Użytkownicy wysłuchują nagrania mowy syntetyzowanej i przekazują informacje zwrotne na temat błędów nagrania oraz innych kwestii dotyczących syntetyzowanej mowy. System obejmujący co najmniej jedno urządzenie obliczeniowe przeprowadza analizę informacji zwrotnych, dokonuje modyfikacji głosu lub reguł przekształcania i cyklicznie testuje zmodyfikowane nagrania. Modyfikacje są określane przy pomocy algorytmów uczenia maszynowego oraz innych zautomatyzowanych procesów.
PL401371A 2012-10-26 2012-10-26 Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę PL401371A1 (pl)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PL401371A PL401371A1 (pl) 2012-10-26 2012-10-26 Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę
US13/720,925 US9196240B2 (en) 2012-10-26 2012-12-19 Automated text to speech voice development

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PL401371A PL401371A1 (pl) 2012-10-26 2012-10-26 Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę

Publications (1)

Publication Number Publication Date
PL401371A1 true PL401371A1 (pl) 2014-04-28

Family

ID=50515001

Family Applications (1)

Application Number Title Priority Date Filing Date
PL401371A PL401371A1 (pl) 2012-10-26 2012-10-26 Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę

Country Status (2)

Country Link
US (1) US9196240B2 (pl)
PL (1) PL401371A1 (pl)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109634872A (zh) * 2019-02-25 2019-04-16 北京达佳互联信息技术有限公司 应用测试方法、装置、终端及存储介质

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9275633B2 (en) * 2012-01-09 2016-03-01 Microsoft Technology Licensing, Llc Crowd-sourcing pronunciation corrections in text-to-speech engines
US9311913B2 (en) * 2013-02-05 2016-04-12 Nuance Communications, Inc. Accuracy of text-to-speech synthesis
US9524717B2 (en) * 2013-10-15 2016-12-20 Trevo Solutions Group LLC System, method, and computer program for integrating voice-to-text capability into call systems
US20150149178A1 (en) * 2013-11-22 2015-05-28 At&T Intellectual Property I, L.P. System and method for data-driven intonation generation
US9911408B2 (en) * 2014-03-03 2018-03-06 General Motors Llc Dynamic speech system tuning
US9384728B2 (en) 2014-09-30 2016-07-05 International Business Machines Corporation Synthesizing an aggregate voice
US10360716B1 (en) * 2015-09-18 2019-07-23 Amazon Technologies, Inc. Enhanced avatar animation
KR20170044849A (ko) * 2015-10-16 2017-04-26 삼성전자주식회사 전자 장치 및 다국어/다화자의 공통 음향 데이터 셋을 활용하는 tts 변환 방법
US10074359B2 (en) 2016-11-01 2018-09-11 Google Llc Dynamic text-to-speech provisioning
DE212016000292U1 (de) * 2016-11-03 2019-07-03 Bayerische Motoren Werke Aktiengesellschaft System zur Text-zu-Sprache-Leistungsbewertung
US9741337B1 (en) * 2017-04-03 2017-08-22 Green Key Technologies Llc Adaptive self-trained computer engines with associated databases and methods of use thereof
EP3625791A4 (en) 2017-05-18 2021-03-03 Telepathy Labs, Inc. TEXT-SPEECH SYSTEM AND PROCESS BASED ON ARTIFICIAL INTELLIGENCE
US10614826B2 (en) * 2017-05-24 2020-04-07 Modulate, Inc. System and method for voice-to-voice conversion
US10565981B2 (en) * 2017-09-26 2020-02-18 Microsoft Technology Licensing, Llc Computer-assisted conversation using addressible conversation segments
US11416801B2 (en) * 2017-11-20 2022-08-16 Accenture Global Solutions Limited Analyzing value-related data to identify an error in the value-related data and/or a source of the error
US10521946B1 (en) 2017-11-21 2019-12-31 Amazon Technologies, Inc. Processing speech to drive animations on avatars
US11232645B1 (en) 2017-11-21 2022-01-25 Amazon Technologies, Inc. Virtual spaces as a platform
US10732708B1 (en) * 2017-11-21 2020-08-04 Amazon Technologies, Inc. Disambiguation of virtual reality information using multi-modal data including speech
US10755725B2 (en) 2018-06-04 2020-08-25 Motorola Solutions, Inc. Determining and remedying audio quality issues in a voice communication
CN110032626B (zh) * 2019-04-19 2022-04-12 百度在线网络技术(北京)有限公司 语音播报方法和装置
WO2021030759A1 (en) 2019-08-14 2021-02-18 Modulate, Inc. Generation and detection of watermark for real-time voice conversion
JP2023546989A (ja) 2020-10-08 2023-11-08 モジュレイト インク. コンテンツモデレーションのためのマルチステージ適応型システム

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5920840A (en) * 1995-02-28 1999-07-06 Motorola, Inc. Communication system and method using a speaker dependent time-scaling technique
JP4132109B2 (ja) * 1995-10-26 2008-08-13 ソニー株式会社 音声信号の再生方法及び装置、並びに音声復号化方法及び装置、並びに音声合成方法及び装置
DE19610019C2 (de) * 1996-03-14 1999-10-28 Data Software Gmbh G Digitales Sprachsyntheseverfahren
EP1187100A1 (en) * 2000-09-06 2002-03-13 Koninklijke KPN N.V. A method and a device for objective speech quality assessment without reference signal
US20020087224A1 (en) * 2000-12-29 2002-07-04 Barile Steven E. Concatenated audio title
US6487494B2 (en) * 2001-03-29 2002-11-26 Wingcast, Llc System and method for reducing the amount of repetitive data sent by a server to a client for vehicle navigation
US6658383B2 (en) * 2001-06-26 2003-12-02 Microsoft Corporation Method for coding speech and music signals
US6999066B2 (en) * 2002-06-24 2006-02-14 Xerox Corporation System for audible feedback for touch screen displays
WO2005059895A1 (en) * 2003-12-16 2005-06-30 Loquendo S.P.A. Text-to-speech method and system, computer program product therefor
US7454348B1 (en) * 2004-01-08 2008-11-18 At&T Intellectual Property Ii, L.P. System and method for blending synthetic voices
WO2005071663A2 (en) * 2004-01-16 2005-08-04 Scansoft, Inc. Corpus-based speech synthesis based on segment recombination
EP1805753A1 (en) * 2004-10-18 2007-07-11 Koninklijke Philips Electronics N.V. Data-processing device and method for informing a user about a category of a media content item
US7735012B2 (en) * 2004-11-04 2010-06-08 Apple Inc. Audio user interface for computing devices
US20070124142A1 (en) * 2005-11-25 2007-05-31 Mukherjee Santosh K Voice enabled knowledge system
US7684991B2 (en) * 2006-01-05 2010-03-23 Alpine Electronics, Inc. Digital audio file search method and apparatus using text-to-speech processing
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US20080129520A1 (en) * 2006-12-01 2008-06-05 Apple Computer, Inc. Electronic device with enhanced audio feedback
US8321222B2 (en) * 2007-08-14 2012-11-27 Nuance Communications, Inc. Synthesis by generation and concatenation of multi-form segments
US8996376B2 (en) * 2008-04-05 2015-03-31 Apple Inc. Intelligent text-to-speech conversion
US8352268B2 (en) * 2008-09-29 2013-01-08 Apple Inc. Systems and methods for selective rate of speech and speech preferences for text to speech synthesis
US20100082328A1 (en) * 2008-09-29 2010-04-01 Apple Inc. Systems and methods for speech preprocessing in text to speech synthesis
KR101617461B1 (ko) * 2009-11-17 2016-05-02 엘지전자 주식회사 이동 통신 단말기에서의 티티에스 음성 데이터 출력 방법 및 이를 적용한 이동 통신 단말기
US20110161085A1 (en) * 2009-12-31 2011-06-30 Nokia Corporation Method and apparatus for audio summary of activity for user

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109634872A (zh) * 2019-02-25 2019-04-16 北京达佳互联信息技术有限公司 应用测试方法、装置、终端及存储介质

Also Published As

Publication number Publication date
US20140122081A1 (en) 2014-05-01
US9196240B2 (en) 2015-11-24

Similar Documents

Publication Publication Date Title
PL401371A1 (pl) Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę
BR112021018532A2 (pt) Sistemas e métodos para fidedignidade de modelo
WO2015009586A3 (en) Performing an operation relative to tabular data based upon voice input
WO2018038385A3 (ko) 음성 인식 방법 및 이를 수행하는 전자 장치
BR112017010222A2 (pt) discriminando expressões ambíguas para aprimorar experiência do usuário
MX2016012195A (es) Esquema flexible de adaptacion de modelo de lenguaje.
WO2013134641A3 (en) Recognizing speech in multiple languages
CR20150552A (es) Entorno de aprendizaje de idiomas
EA201391245A1 (ru) Интеррогативные клеточные анализы и их применение
IN2014CN03209A (pl)
WO2014140816A3 (en) Apparatus and method for performing actions based on captured image data
WO2014085776A3 (en) Web search ranking
DOP2014000045A (es) Sistema y método para el aprendizaje de idiomas
PL401372A1 (pl) Hybrydowa kompresja danych głosowych w systemach zamiany tekstu na mowę
MX2014006124A (es) Sistema de enseñanza de idiomas que facilita la implicación del mentor.
BR112015020015A8 (pt) método, meio de armazenamento legível por computador e aparelho para premiar conteúdo gerado por usuário
WO2012057588A3 (ko) 학습능력 진단 장치 및 방법
WO2015050321A8 (ko) 자율학습 정렬 기반의 정렬 코퍼스 생성 장치 및 그 방법과, 정렬 코퍼스를 사용한 파괴 표현 형태소 분석 장치 및 그 형태소 분석 방법
MY194297A (en) A method and device for providing search engine label
BR112014032998A2 (pt) método para formar etileno a partir de metano
Shin et al. Assessing the relative roles of vocabulary and syntactic knowledge in reading comprehension
Tronnier et al. Tendencies of Swedish word accent production by L2-learners with tonal and non-tonal L1
WO2016029045A3 (en) Lexical dialect analysis system
WO2012064997A3 (en) Language training system
NO341675B1 (en) A method and system for removing a core sample from a borehole