CN1407795A - 以选定的语言提供电视语音的装置和方法 - Google Patents

以选定的语言提供电视语音的装置和方法 Download PDF

Info

Publication number
CN1407795A
CN1407795A CN02141460A CN02141460A CN1407795A CN 1407795 A CN1407795 A CN 1407795A CN 02141460 A CN02141460 A CN 02141460A CN 02141460 A CN02141460 A CN 02141460A CN 1407795 A CN1407795 A CN 1407795A
Authority
CN
China
Prior art keywords
language
implicit
caption data
voice
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN02141460A
Other languages
English (en)
Inventor
C·J·斯通
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Arris Technology Inc
Original Assignee
General Instrument Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by General Instrument Corp filed Critical General Instrument Corp
Publication of CN1407795A publication Critical patent/CN1407795A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/60Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4396Processing of audio elementary streams by muting the audio signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440236Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4856End-user interface for client configuration for language selection, e.g. for the menu or subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8106Monomedia components thereof involving special audio data, e.g. different tracks for different languages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8166Monomedia components thereof involving executable data, e.g. software
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/08Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division
    • H04N7/087Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only
    • H04N7/088Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital
    • H04N7/0884Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital for the transmission of additional display-information, e.g. menu for programme or channel selection
    • H04N7/0885Systems for the simultaneous or sequential transmission of more than one television signal, e.g. additional information signals, the signals occupying wholly or partially the same frequency band, e.g. by time division with signal insertion during the vertical blanking interval only the inserted signal being digital for the transmission of additional display-information, e.g. menu for programme or channel selection for the transmission of subtitles

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Acoustics & Sound (AREA)
  • Machine Translation (AREA)
  • Television Systems (AREA)

Abstract

根据收到的电视信号中的隐含字幕数据,以所需语言提供电视语音。代表文字的隐含字幕数据从电视信号中被提取,然后,隐含字幕数据经过一个语音合成器的处理,生成所述文字的所需语言的语音。隐含字幕数据在转换成语音的同时或者以前由第一语言被翻译成第二语言。或者,电视信号携带的隐含字幕数据可以有多种语言,所需语言的数据可以从电视信号中被选择和提取出来并转换成语音。

Description

以选定的语言提供电视语音的装置和方法
技术领域
本发明涉及电视***,尤其涉及让电视节目提供随节目录制的语言以外的另一种语言的装置和方法。
背景技术
电视节目包括音频部分和视频部分,音频部分以节目播放地的语言录制,然而,同一个地方并非所有的居民都说同一种语言,因此,应当提供对语言的选择,这样观众就可以更好的欣赏电视节目。
以前,解决语言问题的技术方法主要立足于提供一个以上的附加音频信号,每路附加音频信号携带电视节目不同的语言的音频部分。例如,在数字电视传输的许多建议中,有的主张提供第二音频节目(SAP),可以用来以第二语言提供电视音频。这一解决方案存在一个问题,每路单独的音频信号需要占用额外的传输带宽。这种额外带宽的使用是不希望的,因为这些带宽本来可以用来提供如额外节目的服务。
以前,人们提供隐含字幕数据(closed caption data),让听力障碍者可以以文字的形式欣赏到电视节目的音频部分。根据实用电视标准,这种数据以模拟和数字电视信号传输,例如,美国的国际电视***委员会的模拟电视标准,动画专家组的数字电视标准。过去,隐含字幕数据仅仅用于文字显示。
希望有一个***,它能够让观众可以在多种语言中选择电视节目音频部分使用的语言,而且这个***提供多种语言但每种语言又不占用额外的带宽。
本发明提供的一种电视音频***,除具有以上的优点外,还具有其它优点。
发明内容
本发明让电视观众可以选择电视语音的语言,为了达到这种功能,把隐含字幕数据从电视信号中提取出来。隐含字幕数据主要是文字,提取的隐含字幕数据经语音合成器处理生成所需语言的语音。
一个用户接口可以让用户从语音合成器提供的多种语言中选择一种,用户接口可以包括电视屏幕显示等。在一个实施例中,用户通过电视遥控所述屏幕显示进行交互。
由于电视信号已经包含第一种语言的音频,当选择另外一种语言时,该音频会被置于无声状态,这样,电视节目携带的音频就不会干扰语音合成器的音频输出。
在一个实施例中,隐含字幕数据首先被转换成文本,然后文本再转换成语音。隐含字幕数据可以是所需语言的文字,也可能不是所需语言的文字,这种情况下,在合成语音之前,要将其翻译成所需语言的文字。
实现本发明的实施例的设备包括:一个隐含字幕处理器,用以从已经有第一语言音频的电视节目中把隐含字幕数据提取出来,隐含字幕数据代表文字。一个语音合成器,用来把隐含字幕数据代表的文字转化成第二种语言的语音。
用户接口,用以让用户选择第二种语言。它可以包括一个可以让用户操控电视屏幕显示的遥控器,一个哑音电路,当语音合成器输出替换的语音时,将电视信号的音频置于无声状态。
本发明至少有一部分可以由软件程序实现,用来以所需语言提供电视语音。该软件包括,一个隐含字幕处理模块,用以从已经有第一语言音频的电视节目中把隐含字幕数据提取出来,所述隐含字幕数据代表文字,该软件可进一步包括一个语音合成模块,用来将所述隐含字幕数据代表的文字转换成第二语言的语音。
该软件还可进一步包括一个用户接口模块,让用户从多个不同的语言中选择一种作为第二语言。例如,用户接口模块可以包括一段软件代码,用以产生一个屏幕显示让用户通过遥控器选择想要的第二语言。还可以有一个哑音模块,当语音合成模块输出替换的语音时,启动哑音电路将电视信号的音频置于无声状态。
软件程序中的隐含字幕模块可以设计成能够把隐含字幕数据转化成文本,由语音合成模块处理成语音,文本可能是所需语言,也可能不是所需语言的文字,这种情况下,语音合成模块可以先将其翻译成第二语言再处理成语音,软件程序可以以机读媒体提供。
还有一种方法,在电视信号中提供多种语言其中一种的音频。电视信号中包含其中一种语言的音频,用户从中选择一种语言,如果所需语言并不是电视信号中包含的语言,电视信号中包含的语言就会被转换成所需语言的音频表示,一种情况,语言由隐含字幕信号提供的文本转换,另一种情况,语言由电视信号的音频转换。
附图说明
图1表示本发明***的主要部件的框图;
图2表示应用于本发明的软件举例的框图。
具体实施方式
本发明利用隐含字幕数据的文字,以及一个语音合成器,使电视音频以所需语言输出。这样,看电视时,观众就可以选择与节目相关联的主语言以外的另一种语言,作为听节目的语言。以前,观众要想听到节目随带的语言以外的语言,节目提供者就得在节目上提供另一种语言。这种需求限制了语言的数目,而且让节目提供者承担提供额外语言的重负。本发明解决了这一问题,它利用隐含字幕数据和文本至语音转换器(也就是一个语音合成器),把隐含字幕文本转换成用户选择的语言,提供给用户的是所选择的语言而不是节目随带的语言。
图1表示本发明的相关硬件部件,一个隐含字幕处理器10从收到的电视节目中将隐含字幕数据(例如以文本的形式)提取出来,隐含字幕数据被传给文本至语音处理器12,它包含文本识别转换软件,用来将隐含字幕数据转换成所需语言。尽管图1表示处理器12可以把隐含字幕文本从英语转换成西班牙语、德语、法语和俄语,应当指出的是,只要有适当的软件,任何语言都可以作为起始语言,也可以提供任何目标语言。
文本至语音处理器技术已广为人知,任何适当的设备都可用以实施本发明,例如,日本东京的Oki Electric Industry Co.,Ltd.(Oki电子工业有限公司)销售的MSM7630型多路语音控制处理器能够对包括美式英语、欧洲英语、法语、德语、西班牙语和日语的六种语言进行文本至语音合成,该产品利用具有12位数模转换器的一个大型集成电路芯片,通过时域音调同步叠加技术(time domain-pitch synchronousoverlap-add technology)来提供人声音中的声波,从而提供自然发音,根据不同的应用,可以使用串口和并口,对用户词典进行编程以扩大词汇量,也可使用闪存(只读存储器)以便轻松升级。
本发明的文本至语音处理器12被编程以能够输出任何所需语言,语言还可以更换和扩充。例如,通过下载到设备上的软件模块,或者在设备的插口***一个永久存储卡(例如闪存)。为了进行语言选择,还可以为用户提供一个电动开关,或者图形化用户接口GUI。在一个实施例中,一个图形化用户接口(例如利用标准屏幕显示软硬件)出现在用户的电视屏幕上,上面列出该设备能“说”的语言,用户可以利用电视遥控器14选择一种语言,例如,按下对应于所需语言的按钮(比如数字按钮),用户接口检测到遥控感应(比如通过红外线接收),启动文本至语音处理器把收到的隐含字幕文本转换成所需的语言。
如果选择了节目随带的主语言以外的一种语言,文本至语音处理器12就向开关20发出一个切换信号,使文本至语音处理器的输出与电视音频放大器22和扬声器24连接。当开关20与文本至语音处理器连接时,原节目音频因为与音频电路22、24断开,所以处于无声状态。要想听节目原来的语言,就切换开关20,使原来的电视音频输出与放大器22和扬声器24连接。
图2给出了一个处理流程图和用于实现本发明的软件组件。特别指出,用户输入30传递给一个处理器32,处理器32可以是一个已经安装在电视机顶盒里的微处理器。微处理器控制的机顶盒例如美国宾夕法尼亚州摩托罗拉公司宽带通信部生产的DCT5000。处理器还接收包含主语言音频部分和隐含字幕数据的数字电视信号。需要指出,尽管图2说明了数字电视信号的处理过程,但是,隐含字幕数据也可以由模拟电视信号携带,再被提取出来以数字形式输入到处理器32。
处理器32以传统方式为用户电视提供视频34和音频36,根据本发明,所包括的软件38用以提供可以选择替换语言的电视音频36。软件38可以安装在机顶盒的永久存储部分(例如ROM),可以在工厂或商店里安装,或者通过有线电视网、电话线以及无线通信途径下载到机顶盒。软件还可以存储在与机顶盒连接的个人多功能存储器、个人电脑等的硬盘和其他存储部分。
如图2所示,软件38包括一个使隐含字幕处理能把隐含字幕数据从电视信号中提取出来的隐含字幕处理模块,该隐含字幕处理模块把隐含字幕数据以文本形式提供给一个语音合成模块,把文本转换成所要的语言,并把由文本转化成的语音提供给用户电视或其他视频设备(比如磁带录像机、PVR等)的音频电路。
软件38还包括一个用户接口模块,它提供一个屏幕显示让用户可以选择他们想听的语言,该用户接口模块还负责电视(或者机顶盒,VCR,PVR等)遥控输入的信号的解码。还有一个哑音模块,用来将主要节目音频输出置于无声状态,从而可以通过电视音频***听到所选择的替换语言。需要指出的是,图2所示的实例只是用来说明本发明的目的,根据本发明还可以提供其他的实例。
这里应该指出,本发明给出了隐含字幕数据的一种新用途。这些数据用来让能听到语音的观众可以听到不同语言的语音,而不是为听力障碍者提供字幕文本。隐含字幕数据也可以以不同的语言由电视信号携带,可以直接输入到语音处理器,转换成语音而无需翻译。
尽管通过一个具体实例说明了本发明,但是应当理解,可以进行各种改动和变型而不脱离本发明的权利要求所述的范围。

Claims (27)

1、一种以选定的语言提供电视语音的方法,该方法包括:
把隐含字幕数据从电视信号中提取出来,所述隐含字幕数据代表文字;以及
用一个语音合成器对提取出来的隐含字幕数据进行处理,提供所需语言的所述文字的语音。
2、如权利要求1所述的方法,包括提供一个用户接口,让用户从语音合成器能够提供的多种语言中选择一种语言。
3、如权利要求2所述的方法,其中所述用户接口包括一个电视屏幕显示。
4、如权利要求3所述的方法,其中所述用户通过一个电视遥控器所述屏幕显示进行交互。
5、如权利要求1所述的方法,其中所述电视信号包括一音频部分和一视频部分,所述方法包括进一步将所述音频部分置于无声状态。
6、如权利要求1所述的方法,其中所述处理步骤把所述隐含字幕数据转换成文本,然后将所述文本转换成语音。
7、如权利要求1所述的方法,其中所述隐含字幕数据代表所述所需语言的文字。
8、如权利要求1所述的方法,其中所述隐含字幕数据代表不同于所述所需语言的另一种语言的文字,所述处理步骤把所述文字翻译成所需语言。
9、一种以选定的语言提供电视语音的装置,该装置包括:
一隐含字幕处理器,用以把隐含字幕数据从带有第一语言音频部分的电视信号中提取出来,所述隐含字幕数据代表文字;以及
一个语音合成器,用来把所述隐含字幕数据代表的文字转换成第二种语言的语音。
10、如权利要求9所述的装置,进一步包括:
一个与所述语音合成器可操作地相联系的用户接口,让用户可以从多种不同的语言中选择出一种作为所述第二种语言。
11、如权利要求10所述的装置,其中所述用户接口包括一个电视屏幕显示。
12、如权利要求11所述的装置,其中所述用户接口进一步包括所述用户用来与所述屏幕显示进行交互的遥控器。
13、如权利要求9所述的装置,进一步包括一个哑音电路,用于在所述语音合成器提供替换的语音时,将所述电视信号的音频部分置于无声状态。
14、如权利要求9所述的装置,其中所述隐含字幕处理器将所述隐含字幕数据转换成文本以由所述语音合成器处理成语音。
15、如权利要求14所述的装置,其中所述文本是所述的第二语言文本。
16、如权利要求14所述的装置,其中所述文本是所述第二语言以外的一种语言的文本,所述语音合成器能够将所述文本翻译成所述第二语言以处理成语音。
17、一种以选定的语言提供电视语音的软件程序,该程序包括:
一个隐含字幕处理模块,用于把隐含字幕数据从具有第一语言音频部分的电视信号中提取出来,所述隐含字幕数据代表文字;以及
一个语音合成模块,用于将所述隐含字幕数据代表的文字转换成第二种语言的语音。
18、如权利要求17所述的软件程序,进一步包括一个用户接口模块,让用户可以从多种不同的语言中选择出一种作为所述的第二语言。
19、如权利要求18所述的软件程序,其中所述用户接口模块包括能产生一个屏幕显示让所述用户可以使用遥控器选择第二语言的软件代码。
20、如权利要求17所述的软件程序,进一步包括一个哑音模块,用以在所述语音合成模块输出替换的语音时,启动一哑音电路将所述电视信号的音频部分置于无声状态。
21、如权利要求17所述的软件程序,其中所述隐含字幕模块将所述隐含字幕数据转换成文本以由所述语音合成模块处理成语音。
22、如权利要求21所述的软件程序,其中所述文本是所述第二语言文本。
23、如权利要求21所述的软件程序,其中所述文本是所述第二语言以外的另一语言文本,所述语音合成模块用以将所述文本翻译成所述第二语言以用来处理成语音。
24、一个含有权利要求17所述软件程序的机读媒体。
25、一种根据电视信号以多种语言中的一种语言提供音频的方法,所述电视信号包含所述语言之一的所述音频,该方法包括:
允许用户从所述语言中选择一种;以及
如果所选择的语言并不包含在所述电视信号中,就将包含在所述电视信号中的语言转换成所选择的语言,以音频提供给所述用户。
26、如权利要求25所述的方法,其中所述语言是由隐含字幕信号提供的文本转换来的。
27、如权利要求25所述的方法,其中所述语言是由所述电视信号的音频部分转换来的。
CN02141460A 2001-08-30 2002-08-30 以选定的语言提供电视语音的装置和方法 Pending CN1407795A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/943,142 2001-08-30
US09/943,142 US20030046075A1 (en) 2001-08-30 2001-08-30 Apparatus and methods for providing television speech in a selected language

Publications (1)

Publication Number Publication Date
CN1407795A true CN1407795A (zh) 2003-04-02

Family

ID=25479163

Family Applications (1)

Application Number Title Priority Date Filing Date
CN02141460A Pending CN1407795A (zh) 2001-08-30 2002-08-30 以选定的语言提供电视语音的装置和方法

Country Status (3)

Country Link
US (1) US20030046075A1 (zh)
CN (1) CN1407795A (zh)
CA (1) CA2398875A1 (zh)

Cited By (56)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101437149B (zh) * 2007-11-12 2010-10-20 华为技术有限公司 一种提供多语种节目的方法、***及装置
CN1801321B (zh) * 2005-01-06 2010-11-10 台达电子工业股份有限公司 文字转语音的***与方法
CN101924863A (zh) * 2010-05-21 2010-12-22 中山大学 一种数字电视设备
CN102014256A (zh) * 2010-12-24 2011-04-13 深圳Tcl新技术有限公司 播放音视频文件时伴音或者字幕智能切换的方法
CN103188564A (zh) * 2011-12-28 2013-07-03 联想(北京)有限公司 电子设备及其信息处理方法
CN103853704A (zh) * 2012-11-28 2014-06-11 上海能感物联网有限公司 计算机外语有声影像资料自动加注中外文字幕的方法
CN104244081A (zh) * 2014-09-26 2014-12-24 可牛网络技术(北京)有限公司 视频的提供方法及装置
CN104380284A (zh) * 2012-03-06 2015-02-25 苹果公司 针对多种语言处理内容的语音合成
US9668024B2 (en) 2014-06-30 2017-05-30 Apple Inc. Intelligent automated assistant for TV user interactions
US9865248B2 (en) 2008-04-05 2018-01-09 Apple Inc. Intelligent text-to-speech conversion
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9966060B2 (en) 2013-06-07 2018-05-08 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10049675B2 (en) 2010-02-25 2018-08-14 Apple Inc. User profiling for voice input processing
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US10089072B2 (en) 2016-06-11 2018-10-02 Apple Inc. Intelligent device arbitration and control
US10169329B2 (en) 2014-05-30 2019-01-01 Apple Inc. Exemplar-based natural language processing
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US10283110B2 (en) 2009-07-02 2019-05-07 Apple Inc. Methods and apparatuses for automatic speech recognition
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
CN110073437A (zh) * 2016-07-21 2019-07-30 欧斯拉布斯私人有限公司 一种用于将文本数据转换为多种语音数据的***和方法
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
CN110647267A (zh) * 2019-09-20 2020-01-03 深圳思远创新科技有限公司 多语言语音的经文播放方法、装置和计算机可读存储介质
CN110659387A (zh) * 2019-09-20 2020-01-07 上海掌门科技有限公司 用于提供视频的方法和设备
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10706841B2 (en) 2010-01-18 2020-07-07 Apple Inc. Task flow identification based on user intent
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10755703B2 (en) 2017-05-11 2020-08-25 Apple Inc. Offline personal assistant
US10791176B2 (en) 2017-05-12 2020-09-29 Apple Inc. Synchronization and task delegation of a digital assistant
US10795541B2 (en) 2009-06-05 2020-10-06 Apple Inc. Intelligent organization of tasks items
US10810274B2 (en) 2017-05-15 2020-10-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US11080012B2 (en) 2009-06-05 2021-08-03 Apple Inc. Interface for a virtual digital assistant
US11217255B2 (en) 2017-05-16 2022-01-04 Apple Inc. Far-field extension for digital assistant services

Families Citing this family (87)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8645137B2 (en) 2000-03-16 2014-02-04 Apple Inc. Fast, language-independent method for user authentication by voice
KR20040098020A (ko) * 2002-03-21 2004-11-18 코닌클리케 필립스 일렉트로닉스 엔.브이. 다중-언어 클로즈드-캡셔닝
JP3953886B2 (ja) * 2002-05-16 2007-08-08 セイコーエプソン株式会社 字幕抽出装置
WO2005002431A1 (en) * 2003-06-24 2005-01-13 Johnson & Johnson Consumer Companies Inc. Method and system for rehabilitating a medical condition across multiple dimensions
US20070276285A1 (en) * 2003-06-24 2007-11-29 Mark Burrows System and Method for Customized Training to Understand Human Speech Correctly with a Hearing Aid Device
WO2005003902A2 (en) * 2003-06-24 2005-01-13 Johnson & Johnson Consumer Companies, Inc. Method and system for using a database containing rehabilitation plans indexed across multiple dimensions
US20050261890A1 (en) * 2004-05-21 2005-11-24 Sterling Robinson Method and apparatus for providing language translation
EP1767058A4 (en) * 2004-06-14 2009-11-25 Johnson & Johnson Consumer HEARING DEVICE SOUND SIMULATION SYSTEM AND METHOD OF USE OF THE SYSTEM
US20080298614A1 (en) * 2004-06-14 2008-12-04 Johnson & Johnson Consumer Companies, Inc. System for and Method of Offering an Optimized Sound Service to Individuals within a Place of Business
EP1767060A4 (en) * 2004-06-14 2009-07-29 Johnson & Johnson Consumer HEARING AID SYSTEM AND METHOD AT HOME
US20080056518A1 (en) * 2004-06-14 2008-03-06 Mark Burrows System for and Method of Optimizing an Individual's Hearing Aid
WO2005125282A2 (en) * 2004-06-14 2005-12-29 Johnson & Johnson Consumer Companies, Inc. System for and method of increasing convenience to users to drive the purchase process for hearing health that results in purchase of a hearing aid
US20080269636A1 (en) * 2004-06-14 2008-10-30 Johnson & Johnson Consumer Companies, Inc. System for and Method of Conveniently and Automatically Testing the Hearing of a Person
EP1767055A4 (en) * 2004-06-14 2009-07-08 Johnson & Johnson Consumer HOME CLEANING AND TEST SYSTEM OF HEARING PROSTHESIS
EP1767057A4 (en) * 2004-06-15 2009-08-19 Johnson & Johnson Consumer SYSTEM AND METHOD FOR ENHANCED INTELLIGIBILITY OF SOUND ISSUED BY TELEVISION FOR THE DISABLED
EP1767061A4 (en) * 2004-06-15 2009-11-18 Johnson & Johnson Consumer HEART-RESISTANT, LIMIT-TIME, PROGRAMMABLE, LOW-COST PROSTHESES APPARATUS, METHOD OF USE, AND PROGRAMMING SYSTEM FOR SAME
JP4517746B2 (ja) * 2004-06-25 2010-08-04 船井電機株式会社 デジタル放送受信装置
US20060178865A1 (en) * 2004-10-29 2006-08-10 Edwards D Craig Multilingual user interface for a medical device
RU2007146365A (ru) * 2005-05-31 2009-07-20 Конинклейке Филипс Электроникс Н.В. (De) Способ и устройство для выполнения автоматического дублирования мультимедийного сигнала
US7711543B2 (en) * 2006-04-14 2010-05-04 At&T Intellectual Property Ii, Lp On-demand language translation for television programs
US7809549B1 (en) 2006-06-15 2010-10-05 At&T Intellectual Property Ii, L.P. On-demand language translation for television programs
US8924194B2 (en) * 2006-06-20 2014-12-30 At&T Intellectual Property Ii, L.P. Automatic translation of advertisements
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
US8239767B2 (en) * 2007-06-25 2012-08-07 Microsoft Corporation Audio stream management for television content
US20090150951A1 (en) * 2007-12-06 2009-06-11 At&T Knowledge Ventures, L.P. Enhanced captioning data for use with multimedia content
DE102007063086B4 (de) * 2007-12-28 2010-08-12 Loewe Opta Gmbh Fernsehempfangsvorrichtung mit Untertiteldecoder und Sprachsynthesizer
US9330720B2 (en) 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US20100030549A1 (en) 2008-07-31 2010-02-04 Lee Michael M Mobile device having human language translation capability with positional feedback
US20100106482A1 (en) * 2008-10-23 2010-04-29 Sony Corporation Additional language support for televisions
US8330864B2 (en) * 2008-11-02 2012-12-11 Xorbit, Inc. Multi-lingual transmission and delay of closed caption content through a delivery system
US20100265397A1 (en) * 2009-04-20 2010-10-21 Tandberg Television, Inc. Systems and methods for providing dynamically determined closed caption translations for vod content
US10706373B2 (en) 2011-06-03 2020-07-07 Apple Inc. Performing actions associated with task items that represent tasks to perform
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US20110020774A1 (en) * 2009-07-24 2011-01-27 Echostar Technologies L.L.C. Systems and methods for facilitating foreign language instruction
JP5551186B2 (ja) * 2009-12-25 2014-07-16 パナソニック株式会社 放送受信装置及び放送受信装置における番組情報音声出力方法
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10276170B2 (en) 2010-01-18 2019-04-30 Apple Inc. Intelligent automated assistant
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
WO2011158010A1 (en) * 2010-06-15 2011-12-22 Jonathan Edward Bishop Assisting human interaction
US9262612B2 (en) 2011-03-21 2016-02-16 Apple Inc. Device access using voice authentication
US10057736B2 (en) 2011-06-03 2018-08-21 Apple Inc. Active transport based notifications
US8994660B2 (en) 2011-08-29 2015-03-31 Apple Inc. Text correction processing
US9280610B2 (en) 2012-05-14 2016-03-08 Apple Inc. Crowd sourcing information to fulfill user requests
CN103458321B (zh) * 2012-06-04 2016-08-17 联想(北京)有限公司 一种字幕加载方法及装置
US9672209B2 (en) * 2012-06-21 2017-06-06 International Business Machines Corporation Dynamic translation substitution
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
JP2014011676A (ja) * 2012-06-29 2014-01-20 Casio Comput Co Ltd コンテンツ再生制御装置、コンテンツ再生制御方法及びプログラム
WO2014141054A1 (en) * 2013-03-11 2014-09-18 Video Dubber Ltd. Method, apparatus and system for regenerating voice intonation in automatically dubbed videos
WO2014197336A1 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
WO2014197335A1 (en) 2013-06-08 2014-12-11 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
KR101922663B1 (ko) 2013-06-09 2018-11-28 애플 인크. 디지털 어시스턴트의 둘 이상의 인스턴스들에 걸친 대화 지속성을 가능하게 하기 위한 디바이스, 방법 및 그래픽 사용자 인터페이스
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
CN104301771A (zh) * 2013-07-15 2015-01-21 中兴通讯股份有限公司 视频文件播放进度的调整方法及装置
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
WO2015184186A1 (en) 2014-05-30 2015-12-03 Apple Inc. Multi-command single utterance input method
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9785630B2 (en) 2014-05-30 2017-10-10 Apple Inc. Text prediction using combined word N-gram and unigram language models
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US10659851B2 (en) 2014-06-30 2020-05-19 Apple Inc. Real-time digital assistant knowledge updates
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
US9818400B2 (en) 2014-09-11 2017-11-14 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en) 2014-09-12 2020-09-29 Apple Inc. Dynamic thresholds for always listening speech trigger
US10074360B2 (en) 2014-09-30 2018-09-11 Apple Inc. Providing an indication of the suitability of speech recognition
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US10127911B2 (en) 2014-09-30 2018-11-13 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US10552013B2 (en) 2014-12-02 2020-02-04 Apple Inc. Data detection
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US9721566B2 (en) 2015-03-08 2017-08-01 Apple Inc. Competing devices responding to voice triggers
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
US10083688B2 (en) 2015-05-27 2018-09-25 Apple Inc. Device voice control for selecting a displayed affordance
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US11025565B2 (en) 2015-06-07 2021-06-01 Apple Inc. Personalized prediction of responses for instant messaging
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US9697820B2 (en) 2015-09-24 2017-07-04 Apple Inc. Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
JP6398945B2 (ja) * 2015-10-29 2018-10-03 コニカミノルタ株式会社 情報付加文書生成装置、プログラム
US9916127B1 (en) * 2016-09-14 2018-03-13 International Business Machines Corporation Audio input replay enhancement with closed captioning display
US10291964B2 (en) * 2016-12-06 2019-05-14 At&T Intellectual Property I, L.P. Multimedia broadcast system

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4627101A (en) * 1985-02-25 1986-12-02 Rca Corporation Muting circuit
US5428404A (en) * 1993-01-29 1995-06-27 Scientific-Atlanta, Inc. Apparatus for method for selectively demodulating and remodulating alternate channels of a television broadcast
US5615301A (en) * 1994-09-28 1997-03-25 Rivers; W. L. Automated language translation system
US5677739A (en) * 1995-03-02 1997-10-14 National Captioning Institute System and method for providing described television services
JP3018966B2 (ja) * 1995-12-01 2000-03-13 松下電器産業株式会社 記録再生装置
US5737725A (en) * 1996-01-09 1998-04-07 U S West Marketing Resources Group, Inc. Method and system for automatically generating new voice files corresponding to new text from a script
US5894320A (en) * 1996-05-29 1999-04-13 General Instrument Corporation Multi-channel television system with viewer-selectable video and audio
JP3363712B2 (ja) * 1996-08-06 2003-01-08 株式会社リコー 光ディスク装置
US6430357B1 (en) * 1998-09-22 2002-08-06 Ati International Srl Text data extraction system for interleaved video data streams

Cited By (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1801321B (zh) * 2005-01-06 2010-11-10 台达电子工业股份有限公司 文字转语音的***与方法
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
CN101437149B (zh) * 2007-11-12 2010-10-20 华为技术有限公司 一种提供多语种节目的方法、***及装置
US9865248B2 (en) 2008-04-05 2018-01-09 Apple Inc. Intelligent text-to-speech conversion
US10795541B2 (en) 2009-06-05 2020-10-06 Apple Inc. Intelligent organization of tasks items
US11080012B2 (en) 2009-06-05 2021-08-03 Apple Inc. Interface for a virtual digital assistant
US10283110B2 (en) 2009-07-02 2019-05-07 Apple Inc. Methods and apparatuses for automatic speech recognition
US10706841B2 (en) 2010-01-18 2020-07-07 Apple Inc. Task flow identification based on user intent
US11423886B2 (en) 2010-01-18 2022-08-23 Apple Inc. Task flow identification based on user intent
US10049675B2 (en) 2010-02-25 2018-08-14 Apple Inc. User profiling for voice input processing
CN101924863A (zh) * 2010-05-21 2010-12-22 中山大学 一种数字电视设备
CN102014256A (zh) * 2010-12-24 2011-04-13 深圳Tcl新技术有限公司 播放音视频文件时伴音或者字幕智能切换的方法
CN103188564B (zh) * 2011-12-28 2016-08-17 联想(北京)有限公司 电子设备及其信息处理方法
CN103188564A (zh) * 2011-12-28 2013-07-03 联想(北京)有限公司 电子设备及其信息处理方法
CN104380284A (zh) * 2012-03-06 2015-02-25 苹果公司 针对多种语言处理内容的语音合成
CN104380284B (zh) * 2012-03-06 2018-01-30 苹果公司 针对多种语言处理内容的语音合成
US9483461B2 (en) 2012-03-06 2016-11-01 Apple Inc. Handling speech synthesis of content for multiple languages
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
CN103853704A (zh) * 2012-11-28 2014-06-11 上海能感物联网有限公司 计算机外语有声影像资料自动加注中外文字幕的方法
US9966060B2 (en) 2013-06-07 2018-05-08 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US10169329B2 (en) 2014-05-30 2019-01-01 Apple Inc. Exemplar-based natural language processing
US10904611B2 (en) 2014-06-30 2021-01-26 Apple Inc. Intelligent automated assistant for TV user interactions
US9668024B2 (en) 2014-06-30 2017-05-30 Apple Inc. Intelligent automated assistant for TV user interactions
CN104244081A (zh) * 2014-09-26 2014-12-24 可牛网络技术(北京)有限公司 视频的提供方法及装置
CN104244081B (zh) * 2014-09-26 2018-10-16 可牛网络技术(北京)有限公司 视频的提供方法及装置
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US11500672B2 (en) 2015-09-08 2022-11-15 Apple Inc. Distributed personal assistant
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US11526368B2 (en) 2015-11-06 2022-12-13 Apple Inc. Intelligent automated assistant in a messaging environment
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US11069347B2 (en) 2016-06-08 2021-07-20 Apple Inc. Intelligent automated assistant for media exploration
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US11037565B2 (en) 2016-06-10 2021-06-15 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10089072B2 (en) 2016-06-11 2018-10-02 Apple Inc. Intelligent device arbitration and control
US11152002B2 (en) 2016-06-11 2021-10-19 Apple Inc. Application integration with a digital assistant
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
CN110073437A (zh) * 2016-07-21 2019-07-30 欧斯拉布斯私人有限公司 一种用于将文本数据转换为多种语音数据的***和方法
US10553215B2 (en) 2016-09-23 2020-02-04 Apple Inc. Intelligent automated assistant
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US10755703B2 (en) 2017-05-11 2020-08-25 Apple Inc. Offline personal assistant
US11405466B2 (en) 2017-05-12 2022-08-02 Apple Inc. Synchronization and task delegation of a digital assistant
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10791176B2 (en) 2017-05-12 2020-09-29 Apple Inc. Synchronization and task delegation of a digital assistant
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10810274B2 (en) 2017-05-15 2020-10-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US11217255B2 (en) 2017-05-16 2022-01-04 Apple Inc. Far-field extension for digital assistant services
CN110647267A (zh) * 2019-09-20 2020-01-03 深圳思远创新科技有限公司 多语言语音的经文播放方法、装置和计算机可读存储介质
CN110659387A (zh) * 2019-09-20 2020-01-07 上海掌门科技有限公司 用于提供视频的方法和设备

Also Published As

Publication number Publication date
US20030046075A1 (en) 2003-03-06
CA2398875A1 (en) 2003-02-28

Similar Documents

Publication Publication Date Title
CN1407795A (zh) 以选定的语言提供电视语音的装置和方法
US7013273B2 (en) Speech recognition based captioning system
CN1894965B (zh) 视频信号中编码文本的翻译
US5677739A (en) System and method for providing described television services
US5900908A (en) System and method for providing described television services
CN1774715A (zh) 用于对音频-视频流执行自动配音的***和方法
CN1559042A (zh) 多语言转录***
US20050080631A1 (en) Information processing apparatus and method therefor
CN103561217A (zh) 一种生成字幕的方法及终端
CN103260071B (zh) 一种自动选择菜单语言和伴音语言的机顶盒及实现方法
CN101453589A (zh) 支持多语言应用环境的装置与方法
JP2001022374A (ja) 電子番組ガイドの操作装置および電子番組ガイドの送信装置
JP2005210196A (ja) 情報処理装置、情報処理方法
US20100154004A1 (en) Television and method for operating the same
CN101764970B (zh) 电视机及其操作方法
CN105120324B (zh) 一种分布式播放器实现方法及***
CN1267863A (zh) 具有学习功能的图象装置及其控制方法
KR100499032B1 (ko) 텔레비젼 수신기를 기반으로 하는 오디오 및 비디오 합성편집장치
JP2009260685A (ja) 放送受信装置
JP4167346B2 (ja) ディジタル放送用聴覚補償方法およびそれに用いる受信装置
KR100777275B1 (ko) 비트 스트림 분석 기능을 갖는 디지털 방송 수신기
US20090232478A1 (en) Audio service playback method and apparatus thereof
KR20010067826A (ko) 디지털 tv 방송신호에 한글자막을 삽입하는 장치 및 방법
JPH10149193A (ja) 情報処理装置および方法
JP4167347B2 (ja) ディジタル放送用音韻情報送受信方法およびそれに用いる受信装置

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication