PT2494546E - Método, servidor e sistema para a transcrição de língua falada - Google Patents

Método, servidor e sistema para a transcrição de língua falada Download PDF

Info

Publication number
PT2494546E
PT2494546E PT107717308T PT10771730T PT2494546E PT 2494546 E PT2494546 E PT 2494546E PT 107717308 T PT107717308 T PT 107717308T PT 10771730 T PT10771730 T PT 10771730T PT 2494546 E PT2494546 E PT 2494546E
Authority
PT
Portugal
Prior art keywords
transcription
server
spoken language
spoken
language
Prior art date
Application number
PT107717308T
Other languages
English (en)
Inventor
Michaela Nachtrab
Robin Nachtrab-Ribback
Original Assignee
Verbavoice Gmbh
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Verbavoice Gmbh filed Critical Verbavoice Gmbh
Publication of PT2494546E publication Critical patent/PT2494546E/pt

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42391Systems providing special services or facilities to subscribers where the subscribers are hearing-impaired persons, e.g. telephone devices for the deaf
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/42Graphical user interfaces
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/60Medium conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/20Aspects of automatic or semi-automatic exchanges related to features of supplementary services
    • H04M2203/2061Language aspects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234336Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by media transcoding, e.g. video is transformed into a slideshow of still pictures or audio is converted into text
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8106Monomedia components thereof involving special audio data, e.g. different tracks for different languages

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
PT107717308T 2009-10-27 2010-10-27 Método, servidor e sistema para a transcrição de língua falada PT2494546E (pt)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP09174254A EP2325838A1 (en) 2009-10-27 2009-10-27 A method and system for transcription of spoken language

Publications (1)

Publication Number Publication Date
PT2494546E true PT2494546E (pt) 2014-03-31

Family

ID=41728119

Family Applications (1)

Application Number Title Priority Date Filing Date
PT107717308T PT2494546E (pt) 2009-10-27 2010-10-27 Método, servidor e sistema para a transcrição de língua falada

Country Status (5)

Country Link
US (1) US9544430B2 (pt)
EP (3) EP2325838A1 (pt)
ES (1) ES2453891T3 (pt)
PT (1) PT2494546E (pt)
WO (1) WO2011051325A1 (pt)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110347834A (zh) * 2010-02-18 2019-10-18 株式会社尼康 信息处理装置、便携式装置以及信息处理***
EP2541436A1 (en) * 2011-07-01 2013-01-02 Alcatel Lucent System and method for providing translations in a telecommunication-signal test device
US8788257B1 (en) * 2011-10-25 2014-07-22 Google Inc. Unified cross platform input method framework
US9007448B2 (en) 2012-02-03 2015-04-14 Bank Of America Corporation Video-assisted customer experience
GB2507797A (en) * 2012-11-12 2014-05-14 Prognosis Uk Ltd Translation application allowing bi-directional speech to speech translation and text translation in real time
EP2966601A4 (en) * 2013-03-07 2016-06-29 Nec Solution Innovators Ltd UNDERSTANDING ASSISTANCE SYSTEM, UNDERHENSION ASSISTANT SERVER, UNDERSTANDING ASSISTING METHOD, AND COMPUTER READABLE RECORDING MEDIUM
AU2014227586C1 (en) * 2013-03-15 2020-01-30 Apple Inc. User training by intelligent digital assistant
US20190312973A1 (en) * 2014-02-28 2019-10-10 Ultratec, Inc. Semiautomated relay method and apparatus
US20180270350A1 (en) 2014-02-28 2018-09-20 Ultratec, Inc. Semiautomated relay method and apparatus
US10423760B2 (en) * 2014-04-29 2019-09-24 Vik Moharir Methods, system and apparatus for transcribing information using wearable technology
US10424405B2 (en) * 2014-04-29 2019-09-24 Vik Moharir Method, system and apparatus for transcribing information using wearable technology
US9854139B2 (en) 2014-06-24 2017-12-26 Sony Mobile Communications Inc. Lifelog camera and method of controlling same using voice triggers
US9497315B1 (en) * 2016-07-27 2016-11-15 Captioncall, Llc Transcribing audio communication sessions
DK201770428A1 (en) 2017-05-12 2019-02-18 Apple Inc. LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT
US11373654B2 (en) * 2017-08-07 2022-06-28 Sonova Ag Online automatic audio transcription for hearing aid users
US10192554B1 (en) 2018-02-26 2019-01-29 Sorenson Ip Holdings, Llc Transcription of communications using multiple speech recognition systems
US10834455B2 (en) 2018-06-27 2020-11-10 At&T Intellectual Property I, L.P. Integrating real-time text with video services
US10573312B1 (en) * 2018-12-04 2020-02-25 Sorenson Ip Holdings, Llc Transcription generation from multiple speech recognition systems
US11017778B1 (en) 2018-12-04 2021-05-25 Sorenson Ip Holdings, Llc Switching between speech recognition systems
CN110289016A (zh) * 2019-06-20 2019-09-27 深圳追一科技有限公司 一种基于实时对话的语音质检方法、装置及电子设备
US11539900B2 (en) 2020-02-21 2022-12-27 Ultratec, Inc. Caption modification and augmentation systems and methods for use by hearing assisted user
CN111833848B (zh) * 2020-05-11 2024-05-28 北京嘀嘀无限科技发展有限公司 用于识别语音的方法、装置、电子设备和存储介质
CN111836062A (zh) * 2020-06-30 2020-10-27 北京小米松果电子有限公司 视频播放方法、装置及计算机可读存储介质
US11488604B2 (en) 2020-08-19 2022-11-01 Sorenson Ip Holdings, Llc Transcription of audio

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6385586B1 (en) * 1999-01-28 2002-05-07 International Business Machines Corporation Speech recognition text-based language conversion and text-to-speech in a client-server configuration to enable language translation devices
US6230138B1 (en) * 2000-06-28 2001-05-08 Visteon Global Technologies, Inc. Method and apparatus for controlling multiple speech engines in an in-vehicle speech recognition system
US6510206B2 (en) * 2000-09-19 2003-01-21 Ultratec, Inc. Relay for personal interpreter
US7035804B2 (en) 2001-04-26 2006-04-25 Stenograph, L.L.C. Systems and methods for automated audio transcription, translation, and transfer
US8416925B2 (en) 2005-06-29 2013-04-09 Ultratec, Inc. Device independent text captioned telephone service
US7054804B2 (en) * 2002-05-20 2006-05-30 International Buisness Machines Corporation Method and apparatus for performing real-time subtitles translation
US7328155B2 (en) * 2002-09-25 2008-02-05 Toyota Infotechnology Center Co., Ltd. Method and system for speech recognition using grammar weighted based upon location information
US8027438B2 (en) * 2003-02-10 2011-09-27 At&T Intellectual Property I, L.P. Electronic message translations accompanied by indications of translation
US20060074660A1 (en) * 2004-09-29 2006-04-06 France Telecom Method and apparatus for enhancing speech recognition accuracy by using geographic data to filter a set of words
US8041568B2 (en) * 2006-10-13 2011-10-18 Google Inc. Business listing search
US10056077B2 (en) * 2007-03-07 2018-08-21 Nuance Communications, Inc. Using speech recognition results based on an unstructured language model with a music system
US20080221862A1 (en) * 2007-03-09 2008-09-11 Yahoo! Inc. Mobile language interpreter with localization
US8447285B1 (en) * 2007-03-26 2013-05-21 Callwave Communications, Llc Methods and systems for managing telecommunications and for translating voice messages to text messages
US8515728B2 (en) * 2007-03-29 2013-08-20 Microsoft Corporation Language translation of visual and audio input
CN101309390B (zh) * 2007-05-17 2012-05-23 华为技术有限公司 视讯通信***、装置及其字幕显示方法
US8041555B2 (en) * 2007-08-15 2011-10-18 International Business Machines Corporation Language translation based on a location of a wireless device
US8290779B2 (en) * 2007-09-18 2012-10-16 Verizon Patent And Licensing Inc. System and method for providing a managed language translation service
JP2009205579A (ja) * 2008-02-29 2009-09-10 Toshiba Corp 音声翻訳装置およびプログラム
EP2106121A1 (en) * 2008-03-27 2009-09-30 Mundovision MGI 2000, S.A. Subtitle generation methods for live programming
US8949124B1 (en) * 2008-09-11 2015-02-03 Next It Corporation Automated learning for speech-based applications
US8279861B2 (en) * 2009-12-08 2012-10-02 International Business Machines Corporation Real-time VoIP communications using n-Way selective language processing
US9560206B2 (en) * 2010-04-30 2017-01-31 American Teleconferencing Services, Ltd. Real-time speech-to-text conversion in an audio conference session
US8775156B2 (en) * 2010-08-05 2014-07-08 Google Inc. Translating languages in response to device motion
US8538742B2 (en) * 2011-05-20 2013-09-17 Google Inc. Feed translation for a social network
US20150011251A1 (en) * 2013-07-08 2015-01-08 Raketu Communications, Inc. Method For Transmitting Voice Audio Captions Transcribed Into Text Over SMS Texting
KR20150031896A (ko) * 2013-09-17 2015-03-25 한국전자통신연구원 음성인식장치 및 그 동작방법
US9524717B2 (en) * 2013-10-15 2016-12-20 Trevo Solutions Group LLC System, method, and computer program for integrating voice-to-text capability into call systems

Also Published As

Publication number Publication date
ES2453891T3 (es) 2014-04-08
US20120265529A1 (en) 2012-10-18
EP2725816A1 (en) 2014-04-30
WO2011051325A1 (en) 2011-05-05
EP2494546A1 (en) 2012-09-05
EP2494546B1 (en) 2013-12-25
EP2325838A1 (en) 2011-05-25
EP2494546B8 (en) 2014-03-05
US9544430B2 (en) 2017-01-10

Similar Documents

Publication Publication Date Title
PT2494546E (pt) Método, servidor e sistema para a transcrição de língua falada
EP2515286A4 (en) DEVICE AND METHOD FOR STUDYING FOREIGN LANGUAGES
EP2492910A4 (en) SPEECH TRANSLATION SYSTEM, CONTROL APPARATUS AND CONTROL METHOD
SG10201602571PA (en) System and method for distributed text-to-speech synthesis and intelligibility
EP2438590A4 (en) NAVIGATION SYSTEM WITH SPEECH PROCESSING MECHANISM AND METHOD OF OPERATION THEREOF
IL220669A0 (en) Method and system for tissue recognition
EP2406731A4 (en) SYSTEM AND METHOD FOR THE AUTOMATIC SEMANTIC MARKING OF NATURAL LANGUAGE TEXTS
GB201401220D0 (en) System and method for language extraction and encoding
EP2487591A4 (en) COMPUTER SYSTEM AND METHOD FOR MAINTENANCE OF COMPUTER SYSTEM
EP2399243A4 (en) METHOD AND SYSTEM FOR RECOGNIZING GESTURE
EP2395464A4 (en) METHOD, SYSTEM AND EQUIPMENT FOR IMPLEMENTING INTERNET BANKING SERVICE
HK1153000A1 (en) Checking method, system and server for sql sentence sql
EP2430575A4 (en) SEARCH, DEVICE AND SYSTEM
EP2389672A4 (en) METHOD, DEVICE AND COMPUTER PROGRAM PRODUCT FOR PROVIDING COMPOUND MODELS FOR LANGUAGE RECOGNITION ADAPTATION
GB2471875B (en) A speech recognition system and method
EP2445594A4 (en) METHOD AND DEVICE FOR IMPROVING THE COMMUNICATION OF LANGUAGES
EP2455936A4 (en) Speech translation system, dictionary server device, and program
ZA201005812B (en) Method and system for situational language interpretation
GB201012361D0 (en) System and method for generating search terms
EP2772906A4 (en) MULTILINGUAL LANGUAGE SYSTEM AND CHARACTER PROCESS
EP2431888A4 (en) SYSTEM AND METHOD FOR A SEMANTIC SERVICE
EP2297650A4 (en) PROCEDURE FOR CONTINUING THE LEARNING OF LANGUAGES
TWI372371B (en) Sign language recognition system and method
EP2479683A4 (en) METHOD, DEVICE AND SYSTEM FOR GENERATING BOOKMARKS
EP2552137A4 (en) METHOD AND TERMINAL FOR SENDING AN INSTRUCTION LANGUAGE