SG10201912562SA - A training method, a readable storage medium and a voice cloning method for a voice cloning model - Google Patents

A training method, a readable storage medium and a voice cloning method for a voice cloning model

Info

Publication number
SG10201912562SA
SG10201912562SA SG10201912562SA SG10201912562SA SG10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA SG 10201912562S A SG10201912562S A SG 10201912562SA
Authority
SG
Singapore
Prior art keywords
voice cloning
storage medium
readable storage
voice
model
Prior art date
Application number
SG10201912562SA
Other languages
English (en)
Inventor
Zining Zhang
Xiaoyan Yang
Zhenjie Zhang
Original Assignee
Yitu Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yitu Pte Ltd filed Critical Yitu Pte Ltd
Priority to SG10201912562SA priority Critical patent/SG10201912562SA/en
Priority to CN202010476440.XA priority patent/CN111696521B/zh
Publication of SG10201912562SA publication Critical patent/SG10201912562SA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G10L13/047Architecture of speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
SG10201912562SA 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model SG10201912562SA (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
SG10201912562SA SG10201912562SA (en) 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model
CN202010476440.XA CN111696521B (zh) 2019-12-18 2020-05-29 语音克隆模型的训练方法、可读存储介质和语音克隆方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SG10201912562SA SG10201912562SA (en) 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model

Publications (1)

Publication Number Publication Date
SG10201912562SA true SG10201912562SA (en) 2021-07-29

Family

ID=72478905

Family Applications (1)

Application Number Title Priority Date Filing Date
SG10201912562SA SG10201912562SA (en) 2019-12-18 2019-12-18 A training method, a readable storage medium and a voice cloning method for a voice cloning model

Country Status (2)

Country Link
CN (1) CN111696521B (zh)
SG (1) SG10201912562SA (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112233646B (zh) * 2020-10-20 2024-05-31 携程计算机技术(上海)有限公司 基于神经网络的语音克隆方法、***、设备及存储介质
CN112185340B (zh) * 2020-10-30 2024-03-15 网易(杭州)网络有限公司 语音合成方法、语音合成装置、存储介质与电子设备
CN112652291B (zh) * 2020-12-15 2024-04-05 携程旅游网络技术(上海)有限公司 基于神经网络的语音合成方法、***、设备及存储介质
CN112992117B (zh) * 2021-02-26 2023-05-26 平安科技(深圳)有限公司 多语言语音模型生成方法、装置、计算机设备及存储介质
CN113488057B (zh) * 2021-08-18 2023-11-14 山东新一代信息产业技术研究院有限公司 面向康养的对话实现方法及***

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11238843B2 (en) * 2018-02-09 2022-02-01 Baidu Usa Llc Systems and methods for neural voice cloning with a few samples
CN108630190B (zh) * 2018-05-18 2019-12-10 百度在线网络技术(北京)有限公司 用于生成语音合成模型的方法和装置
CN110136687B (zh) * 2019-05-20 2021-06-15 深圳市数字星河科技有限公司 一种基于语音训练克隆口音及声韵方法
CN110288973B (zh) * 2019-05-20 2024-03-29 平安科技(深圳)有限公司 语音合成方法、装置、设备及计算机可读存储介质

Also Published As

Publication number Publication date
CN111696521B (zh) 2023-08-08
CN111696521A (zh) 2020-09-22

Similar Documents

Publication Publication Date Title
SG10201912562SA (en) A training method, a readable storage medium and a voice cloning method for a voice cloning model
EP3971772A4 (en) METHOD AND APPARATUS FOR PATTERN TRAINING, AND TERMINAL AND STORAGE MEDIA
EP3862893A4 (en) RECOMMENDATION MODEL LEARNING PROCESS, RECOMMENDATION PROCESS, DEVICE, AND COMPUTER READABLE MEDIA
EP3709226A4 (en) MODEL LEARNING SYSTEM AND PROCESS AND INFORMATION SUPPORT
EP3739572A4 (en) METHOD AND DEVICE FOR TEXT-TO-LANGUAGE SYNTHESIS USING MACHINE LEARNING AND COMPUTER-READABLE STORAGE MEDIUM
EP3937165A4 (en) SPEECH SYNTHESIS METHOD AND APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM
EP3805988A4 (en) TRAINING PROCESS FOR MODEL, STORAGE MEDIA AND COMPUTER DEVICE
EP3792789A4 (en) MODEL TRANSLATION LEARNING PROCESS, SENTENCE TRANSLATION PROCESS AND APPARATUS, AND INFORMATION MEDIA
EP3933754A4 (en) IMAGE FUSION METHOD, MODEL TRAINING METHOD AND RELATED DEVICE
EP3690768A4 (en) USER BEHAVIOR PREDICTION METHOD AND APPARATUS, AND BEHAVIOR PREDICTION MODEL TRAINING METHOD AND APPARATUS
SG11202105466QA (en) Method and device for generating neural network model, and computer-readable storage medium
EP3968243A4 (en) Method and apparatus for realizing model training, and computer storage medium
ZA202206486B (en) Method and apparatus for detecting fault, method and apparatus for training model, and device and storage medium
EP3992975A4 (en) METHOD AND DEVICE FOR ANALYZING COMPOUND PROPERTIES, METHOD FOR ANALYZING COMPOUND PROPERTIES AND STORAGE MEDIA
EP3989109A4 (en) IMAGE IDENTIFICATION METHOD AND DEVICE, IDENTIFICATION PATTERN TRAINING METHOD AND DEVICE, AND STORAGE MEDIA
EP4024261A4 (en) PATTERN LEARNING METHOD, APPARATUS AND SYSTEM
EP3937073A4 (en) VIDEO CLASSIFICATION METHOD, MODEL FORMING METHOD AND DEVICE AND STORAGE MEDIA
EP3989104A4 (en) FACIAL FEATURE EXTRACTION MODEL TRAINING METHOD AND APPARATUS, FACIAL FEATURE EXTRACTION METHOD AND APPARATUS, DEVICE AND INFORMATION MEDIA
EP3270239A4 (en) Device characteristic model learning device, device characteristic model learning method, and storage medium
EP3951702A4 (en) IMAGE PROCESSING MODEL LEARNING METHOD, IMAGE PROCESSING METHOD, NETWORK DEVICE AND STORAGE MEDIA
EP4044175A4 (en) VOICE RECOGNITION METHOD AND APPARATUS AND COMPUTER READABLE STORAGE MEDIUM
SG11202104492QA (en) Model training methods, apparatuses, and systems
EP4181026A4 (en) RECOMMENDATION MODEL FORMING METHOD AND APPARATUS, RECOMMENDATION METHOD AND APPARATUS, AND COMPUTER READABLE MEDIUM
EP3866068A4 (en) METHOD AND DEVICE FOR FORMING IMAGE DESCRIPTION MODEL AND INFORMATION HOLDER
EP3594940A4 (en) TRAINING PROCEDURE FOR VOICE DATA SET, COMPUTER DEVICE AND COMPUTER READABLE STORAGE MEDIUM