SE520065C2 - Anordning och metod för prosodigenerering vid visuell talsyntes - Google Patents

Anordning och metod för prosodigenerering vid visuell talsyntes

Info

Publication number
SE520065C2
SE520065C2 SE9701101A SE9701101A SE520065C2 SE 520065 C2 SE520065 C2 SE 520065C2 SE 9701101 A SE9701101 A SE 9701101A SE 9701101 A SE9701101 A SE 9701101A SE 520065 C2 SE520065 C2 SE 520065C2
Authority
SE
Sweden
Prior art keywords
face
movement
speech
recorded
words
Prior art date
Application number
SE9701101A
Other languages
English (en)
Swedish (sv)
Other versions
SE9701101D0 (sv
SE9701101L (sv
Inventor
Bertil Lyberg
Original Assignee
Telia Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telia Ab filed Critical Telia Ab
Priority to SE9701101A priority Critical patent/SE520065C2/sv
Publication of SE9701101D0 publication Critical patent/SE9701101D0/xx
Priority to EEP199900419A priority patent/EE03883B1/xx
Priority to DK98911338T priority patent/DK0970465T3/da
Priority to PCT/SE1998/000506 priority patent/WO1998043235A2/en
Priority to DE69816049T priority patent/DE69816049T2/de
Priority to JP54446198A priority patent/JP2001517326A/ja
Priority to US09/381,632 priority patent/US6389396B1/en
Priority to EP98911338A priority patent/EP0970465B1/en
Publication of SE9701101L publication Critical patent/SE9701101L/xx
Priority to NO19994599A priority patent/NO318698B1/no
Publication of SE520065C2 publication Critical patent/SE520065C2/sv

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/2053D [Three Dimensional] animation driven by audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/403D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • G10L13/07Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Machine Translation (AREA)
  • Processing Or Creating Images (AREA)
  • Steroid Compounds (AREA)
SE9701101A 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes SE520065C2 (sv)

Priority Applications (9)

Application Number Priority Date Filing Date Title
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes
EP98911338A EP0970465B1 (en) 1997-03-25 1998-03-20 Device and method for prosody generation for visual synthesis
DE69816049T DE69816049T2 (de) 1997-03-25 1998-03-20 Vorrichtung und verfahren zur prosodie-erzeugung bei der visuellen synthese
DK98911338T DK0970465T3 (da) 1997-03-25 1998-03-20 Indretning og fremgangsmåde til prosodigenerering til visuel syntese
PCT/SE1998/000506 WO1998043235A2 (en) 1997-03-25 1998-03-20 Device and method for prosody generation at visual synthesis
EEP199900419A EE03883B1 (et) 1997-03-25 1998-03-20 Seade ja meetod prosoodia genereerimiseks visuaalsünteesil
JP54446198A JP2001517326A (ja) 1997-03-25 1998-03-20 視覚的合成における韻律生成のための装置および方法
US09/381,632 US6389396B1 (en) 1997-03-25 1998-03-20 Device and method for prosody generation at visual synthesis
NO19994599A NO318698B1 (no) 1997-03-25 1999-09-22 Anordning og fremgangsmate for prosodigenering av visuell syntese

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes

Publications (3)

Publication Number Publication Date
SE9701101D0 SE9701101D0 (sv) 1997-03-25
SE9701101L SE9701101L (sv) 1998-09-26
SE520065C2 true SE520065C2 (sv) 2003-05-20

Family

ID=20406308

Family Applications (1)

Application Number Title Priority Date Filing Date
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Anordning och metod för prosodigenerering vid visuell talsyntes

Country Status (9)

Country Link
US (1) US6389396B1 (no)
EP (1) EP0970465B1 (no)
JP (1) JP2001517326A (no)
DE (1) DE69816049T2 (no)
DK (1) DK0970465T3 (no)
EE (1) EE03883B1 (no)
NO (1) NO318698B1 (no)
SE (1) SE520065C2 (no)
WO (1) WO1998043235A2 (no)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6947044B1 (en) * 1999-05-21 2005-09-20 Kulas Charles J Creation and playback of computer-generated productions using script-controlled rendering engines
US20020194006A1 (en) * 2001-03-29 2002-12-19 Koninklijke Philips Electronics N.V. Text to visual speech system and method incorporating facial emotions
CN1159702C (zh) 2001-04-11 2004-07-28 国际商业机器公司 具有情感的语音-语音翻译***和方法
US7076430B1 (en) * 2002-05-16 2006-07-11 At&T Corp. System and method of providing conversational visual prosody for talking heads
US20060009978A1 (en) * 2004-07-02 2006-01-12 The Regents Of The University Of Colorado Methods and systems for synthesis of accurate visible speech via transformation of motion capture data
JP4985714B2 (ja) * 2009-06-12 2012-07-25 カシオ計算機株式会社 音声表示出力制御装置、および音声表示出力制御処理プログラム
US8571870B2 (en) * 2010-02-12 2013-10-29 Nuance Communications, Inc. Method and apparatus for generating synthetic speech with contrastive stress
US8949128B2 (en) * 2010-02-12 2015-02-03 Nuance Communications, Inc. Method and apparatus for providing speech output for speech-enabled applications
US8447610B2 (en) * 2010-02-12 2013-05-21 Nuance Communications, Inc. Method and apparatus for generating synthetic speech with contrastive stress
AU2012100262B4 (en) * 2011-12-15 2012-05-24 Nguyen, Phan Thi My Ngoc Ms Speech visualisation tool
JP2012098753A (ja) * 2012-01-27 2012-05-24 Casio Comput Co Ltd 音声表示出力制御装置、画像表示制御装置、および音声表示出力制御処理プログラム、画像表示制御処理プログラム
CN112100352A (zh) * 2020-09-14 2020-12-18 北京百度网讯科技有限公司 与虚拟对象的对话方法、装置、客户端及存储介质

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2518683B2 (ja) 1989-03-08 1996-07-24 国際電信電話株式会社 画像合成方法及びその装置
GB9019829D0 (en) * 1990-09-11 1990-10-24 British Telecomm Speech analysis and image synthesis
US6122616A (en) * 1993-01-21 2000-09-19 Apple Computer, Inc. Method and apparatus for diphone aliasing
US5878396A (en) * 1993-01-21 1999-03-02 Apple Computer, Inc. Method and apparatus for synthetic speech in facial animation
SE500277C2 (sv) 1993-05-10 1994-05-24 Televerket Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk
SE516526C2 (sv) * 1993-11-03 2002-01-22 Telia Ab Metod och anordning vid automatisk extrahering av prosodisk information
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
AU3668095A (en) 1994-11-07 1996-05-16 At & T Corporation Acoustic-assisted image processing
SE519244C2 (sv) * 1995-12-06 2003-02-04 Telia Ab Anordning och metod vid talsyntes
SE9600959L (sv) * 1996-03-13 1997-09-14 Telia Ab Metod och anordning vid tal-till-talöversättning

Also Published As

Publication number Publication date
JP2001517326A (ja) 2001-10-02
EP0970465B1 (en) 2003-07-02
NO994599L (no) 1999-12-14
SE9701101D0 (sv) 1997-03-25
NO318698B1 (no) 2005-04-25
SE9701101L (sv) 1998-09-26
EE03883B1 (et) 2002-10-15
DK0970465T3 (da) 2003-10-27
NO994599D0 (no) 1999-09-22
US6389396B1 (en) 2002-05-14
EE9900419A (et) 2000-04-17
WO1998043235A2 (en) 1998-10-01
EP0970465A2 (en) 2000-01-12
DE69816049D1 (de) 2003-08-07
WO1998043235A3 (en) 1998-12-23
DE69816049T2 (de) 2004-04-22

Similar Documents

Publication Publication Date Title
Roelofs Phonological segments and features as planning units in speech production
SE519244C2 (sv) Anordning och metod vid talsyntes
Moberg Contributions to Multilingual Low-Footprint TTS System for Hand-Held Devices
Theune et al. From data to speech: a general approach
Eide et al. A corpus-based approach to< ahem/> expressive speech synthesis
EP0831460B1 (en) Speech synthesis method utilizing auxiliary information
Granström et al. Prosodic cues in multimodal speech perception
US5878396A (en) Method and apparatus for synthetic speech in facial animation
Benoı̂t et al. Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP
SE520065C2 (sv) Anordning och metod för prosodigenerering vid visuell talsyntes
KR20060051951A (ko) 대화형 음성 응답 시스템들에 의해 스피치 이해를 방지하기 위한 방법 및 장치
El Haddad et al. An HMM approach for synthesizing amused speech with a controllable intensity of smile
Schröder Can emotions be synthesized without controlling voice quality
Nordstrand et al. Measurements of articulatory variation in expressive speech for a set of Swedish vowels
US6385580B1 (en) Method of speech synthesis
Brooke et al. Two-and three-dimensional audio-visual speech synthesis
Minnis et al. Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis
Aaron et al. Conversational computers
Granström Towards a virtual language tutor
Roehling et al. Towards expressive speech synthesis in english on a robotic platform
JPH03273280A (ja) 発声練習用音声合成方式
Ouni et al. Acoustic-visual synthesis technique using bimodal unit-selection
Granström et al. Eyebrow movements as a cue to prominence
Azzopardi-Alexander The phonetic study of speakers along the Maltese-English continuum
Gambino et al. Virtual conversation with a real talking head

Legal Events

Date Code Title Description
NUG Patent has lapsed