CN1849579A - 语音信息*** - Google Patents

语音信息*** Download PDF

Info

Publication number
CN1849579A
CN1849579A CNA2004800262085A CN200480026208A CN1849579A CN 1849579 A CN1849579 A CN 1849579A CN A2004800262085 A CNA2004800262085 A CN A2004800262085A CN 200480026208 A CN200480026208 A CN 200480026208A CN 1849579 A CN1849579 A CN 1849579A
Authority
CN
China
Prior art keywords
audio file
menu
text string
media
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2004800262085A
Other languages
English (en)
Inventor
A·B·比曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Apple Inc
Original Assignee
Apple Computer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Apple Computer Inc filed Critical Apple Computer Inc
Publication of CN1849579A publication Critical patent/CN1849579A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0482Interaction with lists of selectable items, e.g. menus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Circuits Of Receivers In General (AREA)

Abstract

公开了一种语音信息***。本发明一般地适用于可更新的音频信息(例如、菜单)。虽然装置可能有一些预装的菜单组件,但是也可从服务器接收其他的菜单组件。每个菜单组件,不管它是原有的或接收自服务器的,均具有相关的语音名称。当用户突出显示菜单选项时,语音名称可被播放。于是用户拥有选择该菜单选项或翻滚到新菜单选项的选择权。这样,用户无须实际看着菜单的可视显示屏就可以对菜单导航,这可能对于不能看到可视显示屏的用户或有视力障碍的用户特别有用。

Description

语音信息***
技术领域
本发明涉及媒体播放器,更具体地说,涉及在媒体播放器上提供语音信息。
背景技术
在信息时代,计算机能够共享信息的能力是非常重要的。网络是计算机借以能彼此进行通信的的机构。一般,提供资源的装置称为服务器,而利用这些资源的装置称为客户机。根据网络类型,装置可能专用于一种类型的任务或者可能既作为客户机又作为服务器,这取决于装置是给出资源还是请求资源。
人们想共享的资源类型通常与娱乐有关,这种情况日益增多。具体地说,音乐、电影、图片和印刷物是用户可能想通过网络访问的娱乐相关媒体的全部类型。例如,尽管音乐库可以驻留在台式计算机上,但媒体拥有者可能想在携带式媒体播放器上听音乐。
为了实现便携性,许多便携式媒体播放器使用让用户经由简单图形用户界面访问音乐的最低限(minimalist)显示屏。显示屏并不总是被良好照明,在黑暗中也许不可导航。而且,用户可能在某些场合(例如,开车时)不便于或不适合看显示屏,或者用户可能残疾,这使得不可能对菜单进行可视导航。另外,许多人可轻易发现显示屏太小且不便于以常规方式使用。
虽然所描述的技术在很多应用中效果不错,但仍需继续努力以进一步提高用户感受。
发明内容
本发明提供用于提供音频信息的方法。在一个实施例中,音频信息属于音频菜单。首先,在服务器上提供正文串,每个正文串能够表示一个菜单选项。其次,产生音频文件,每个音频文件表示正文串之一的语音名称,并且将每个音频文件与其正文串相关联。然后服务器将音频文件及其关联传送到客户机。
包括由正文串代表的菜单选项的菜单随后呈现在客户机上,该菜单选项能够被突出显示即选择。当与音频文件关联的菜单选项被突出显示时,在客户机上播放该音频文件。
在本发明的另一方面,设有包含处理器、存储器和网络接口的服务器。该服务器的处理器可用来执行指令,包括提供正文串这样的指令。该服务器的处理器也可用于执行其它指令,例如产生正文串的音频表达的音频文件并将音频文件传送到客户机装置。在一个实施例中,正文串代表菜单组件。菜单组件可为能从客户机装置的菜单中选择的若干选项之一。在一个实施例中,客户机装置是媒体播放器,例如手持媒体播放器。
在本发明的又一方面,提供包括处理器、存储器和网络接口的客户机装置。客户机的处理器可用来执行包括允许其从服务器接收菜单组件的音频表达的音频文件的指令,由此菜单组件是可从菜单中选择的若干选项之一。客户机的处理器也可用于执行包括关于允许它更新菜单以包括菜单组件并且在突出显示菜单组件时播放音频文件的指令。
在本发明的又一方面,提供媒体管理***。该媒体管理***包括媒体数据库、媒体集合记录、媒体记录、语音名称数据库和字符串关联记录。媒体数据库存储媒体文件。媒体集合记录包括与媒体文件分组有关的数据。媒体记录包括与媒体文件有关的元数据。语音名称数据库存储音频文件。字符串关联记录将音频文件与媒体集合记录中的数据以及媒体记录中的元数据关联起来。
附图说明
通过参照以下结合附图的描述可很好地理解本发明,附图中:
图1是说明可实现本发明的示例性环境的方框图;
图2是说明本发明一实施例的媒体管理***的组织机构的方框图;
图3是说明可与本发明一实施例结合使用的一般步骤的流程图;
图4是说明一种按照图3所示的本发明一实施例产生语音名称的可能方法的流程图;
图5是说明本发明一实施例的在客户机装置中激活可闻菜单选项时执行的步骤的流程图。
图6是说明本发明一实施例的可在菜单导航期间执行的步骤的流程图;以及
图7是说明可实现本发明不同实施例的示例性计算装置的图。
应理解,附图中相同的数字指示相同的构成要素。同样应理解,图中的描绘未必按比例。
具体实施方式
在下面的描述中,阐述许多具体细节以提供对本发明的深入理解。然而,本领域技术人员显见,无需若干或全部这些具体细节也可实现本发明。在其他情况,为了避免不必要地使本发明的阐述变得不清晰,未对众所周知的处理步骤作详细描述。
本发明提供用于提供音频信息的方法。在一个实施例中,该音频信息属于音频菜单。
本发明通常考虑到可更新的声音菜单。虽然装置可能有一些预装的菜单组件,但其他的菜单组件接收自服务器。例如,可以与音乐播放器一起提供一些预装的菜单组件(例如,“播放列表”、“歌曲”、“艺术家”、“设置”和“关于”的顶层菜单级),但也允许其他菜单组件添加到各种菜单选项(例如,用户添加的顶级菜单“风格”或可用播放列表、歌曲和艺术家的二级菜单列表)。每个菜单组件,无论是原有的还是接收自服务器,均有相关的语音名称。在用户将菜单选项突出显示时,播放其语音名称。然后用户可选择该菜单选项或翻到新菜单选项。这样,用户无须观看显示屏就可对菜单导航。
图1是说明可实现本发明的示例性环境的方框图。网络105将服务器110连接到各客户机115、120、125和130。网络105通常为数据网络,例如LAN、WAN或因特网。服务器110可以是专用装置或者不是专用装置。在图1所示的例中,服务器110是通用计算机。各种客户机115、120、125和130可以是具有不同级别处理能力的肥或瘦客户机。客户机可包括便携式计算机115、台式计算机120、专用装置例如可从加利福尼亚库珀蒂诺的苹果计算机公司买到的iPodsTM125、甚至设计用来跨网络105工作的网络感知的音频/视频部件130。某些装置例如iPod 125可以经由FireWire、USB或一些其它的允许客户机125和服务器110更直接联网在一起的外部总线直接连接到服务器110。
图2是说明本发明一实施例的媒体管理***200的组织机构的方框图。媒体管理***200是允许用户组织和访问数字媒体的计算机程序。为简单起见,下面讨论将假设数字媒体限于音乐。然而,应了解,对“歌曲”或“音乐”的任何引用可以推广到任何形式的数字媒体,这包括声音文件、图片数据、电影、文本文件或任何其他类型的可采用数字方式存储在计算机上的媒体。类似的,对“播放列表”的任何引用可以推广到媒体集合,包括混合数字媒体集合。
虽然服务器110和客户机115、120、125、130均可以具有特别适合那些装置所需的特定功能性的媒体管理***200的不同版本,但是媒体管理***200的基本组件是相似的。具体而言,媒体管理***200可包括媒体管理器205、音乐数据库210和语音名称数据库215。媒体管理器205管理数据库210和215。
音乐数据库210有许多歌曲记录220和用于分类、识别和/或描述音乐数据库210中的媒体(即,媒体项)的播放列表记录225。歌曲记录220包含关于在数据库210中可得的每个媒体项的元数据。元数据可包括例如歌曲名称、艺术家、专辑、歌曲大小、歌曲格式和任何其他适当的信息。当然,信息类型可能取决于媒体类型。视频文件可能还有导演和制片人字段,但可不使用专辑字段。
播放列表记录225包含关于在音乐数据库210中可得的每个播放列表的信息。而且,关于给定播放列表的信息可包括该播放列表内的每首歌曲的识别信息。播放列表可以是采用任何特定顺序或者不采用任何特定顺序的媒体的集合。用户可以选择按流派、基调、艺术家、听众或任何其他有意义的安排来组合媒体。
一些包含在各种记录220、225和230中的信息用作菜单组件。例如,顶级的菜单组件可允许用户通过“歌曲”、“艺术家”或“播放列表”导航。这些分类可能与媒体管理***200预装在一起,或者在媒体管理***200允许修改时由用户修改过。然后用户将能够通过若干不同的路径导航到特定媒体。
例如,如果用户想通过“歌曲”菜单组件访问歌曲“Little Angel ofMine”,则用户将翻滚顶级选项,直到“歌曲”菜单组件被突出显示。一旦突出显示,用户将选择“歌曲”并用菜单组件的二级列表来呈现。该二级列表可能只是用户可得的所有歌曲的按字母顺序的列表,每首歌曲作为二级菜单组件。一般,这些二级菜单组件中没有一个是预装的,并且它们完全取决于用户的特殊音乐偏好。该用户将翻滚歌曲直到“Little Angel of Mine”被突出显示,然后选择该菜单组件来播放该歌曲。
或者,如果用户想通过“艺术家”访问歌曲,则用户将翻滚到菜单组件的顶级,直到“艺术家”被突出显示,然后选择“艺术家”以用菜单组件的第二级来呈现。用户将翻滚艺术家的按字母顺序的列表,直到组合“No Secrets”被突出显示。若选择“No Secrets”二级菜单组件则将用户导引到列出由组合“No Secrets”演奏的全部歌曲的菜单组件的第三级。然后歌曲“Little Angel of Mine”就会在第三级菜单组件当中。
导航到声音的另一备选方法是通过用户定义的播放列表访问歌曲。选择顶级菜单组件“播放列表”将用户带到用户已经创建的所有播放列表的二级列表。歌曲“Little Angel of Mine”可能列出于若干不同的播放列表下。例如“Stuart Little 2 Soundtrack”或者“SongsWritten by Orrin Hatch”播放列表可能包含该歌曲。选择这些二级菜单组件中的任一个将都将用户带到播放列表中的歌曲的三级列表。
所描述菜单组件中的每一个均直接从记录220和225得到。与各菜单组件关联的是菜单组件的音频表达。在前例中,“歌曲”、“艺术家”、“播放列表”、“No Secrets”、“Stuart Little 2 Soundtrack”、“Songs Written by Orrin Hatch”和″Little Angle of Mine″都需要相关联的发音,以让用户无须任何视觉的提示对菜单导航。
一种保存发音的机构是语音名称数据库215。语音名称数据库215包含每个发音的音频文件以及保存音频文件和其对应菜单组件之间的关联的多个记录230。虽然也能采用另一些机构(例如,在歌曲记录220和播放列表记录225中嵌入发音,从而不需要语音名称数据库215),但是使用分离的语音名称数据库215允许与用户如何导航到特定菜单组件无关地使用单个发音。
图3是说明可与本发明一实施例结合而执行的一般步骤的流程图。在步骤305,将表示新菜单组件的正文串引入服务器110。这种引入可能发生在用户手工输入例如新播放列表的新条目时,或者引入可自动发生,例如在购买与歌曲记录215装在一起的新歌曲文件时。
在步骤310,必要时产生菜单组件的语音名称的音频文件。如果购买的歌曲包括语音名称或如果语音名称已经存在于语音名称数据库215,则不必产生语音名称。例如,如果用户已有″The Beatles″的语音名称,则每当将新的Beatles歌曲增加到音乐数据库210时,就不需要创建完全相同的语音名称。
图4是说明本发明一实施例的产生语音名称涉及的详细步骤的流程图。在步骤405,媒体管理***200接收触发信号以创建语音名称。一般,该触发信号通过引入新歌曲记录220或新播放列表记录225创建一个新菜单组件而产生。然而,如果语音名称选项先前已关闭,则第一次开启该选项将产生一个触发信号,通知媒体管理***200需要语音名称。
一旦产生了触发信号,媒体管理***200就在步骤410确定是否已经存在特定字符串的语音名称。如果不存在语音名称,则服务器110在415能使用标准的文本/话音转换工具来产生音频文件。最好,还对这些文件进行压缩以节省空间。一种普遍采用的编码并压缩话音的编解码器是Qualcomm PureVoice,加利福尼亚圣迭戈的Qualcomm公司有售。
一旦创建了一个音频文件,服务器110在步骤420视情况可为用户重放语音名称,使得用户能听到该音频文件。在步骤425,用户可作出许可或拒绝发音的选择。如果用户许可发音,则媒体管理***200在步骤430将创建适当的字符串关联记录230,使得音频文件与适当的菜单组件相关联。
如果用户在步骤425不认可发音,则在步骤435用户可选择修改文本/语音转换工具用来创建语音名称的文本。能以选择方式让用户输入的文本独立于菜单组件,从而允许用户试听菜单组件而无需改变用于记录220和225的实际正文,从而使得菜单组件在拼写和发音上都正确。在步骤420,向用户播放新发音,给用户认可新发音的选择机会。
或者,如果用户在435不选择改变文本,则媒体管理***200可允许用户在440记录他或她自己的发音或者可提供其他音频文件。于是,用户自己的语音能用于稍后对菜单的导航。
再参考图3,在步骤3 10创建语音名称的音频文件之后,服务器110在步骤315将所有新文件传送到客户机装置115、120、125或130。一般,当用户从服务器110将音乐数据库210和它们相关的记录220和225下载到客户机装置115、120、125或130时,将传送语音名称数据库215和字符串关联记录230的内容。但是,并不存在语音名称数据库215和关联记录230不能独立于音乐数据库210及其记录220和225而传送的理由。
在步骤320,客户机装置115、120、125或130接收音频文件以及所有适当的新菜单组件。一旦接收,客户机的媒体管理***200上的菜单就在步骤325被更新,以反映任何变化。然后,在步骤330,只要用户突出显示任一菜单组件,向用户重放适当的音频文件,让用户通过声音提示来对菜单导航。
一般,媒体管理***200让用户选择是打开或关闭可听菜单。图5是说明本发明一实施例中在设置可听菜单选项时可执行的步骤的流程图。在步骤505,用户可视情况选择语言选项。语言选项允许以其它语言呈现预装的菜单组件。例如,“歌曲”菜单组件将以其他语言呈现。例如,  “歌曲”菜单组件以西班牙语“Canciones”、以法语“Chansons”和意大利语“Canzoni”呈现给用户。另外,英语版本的语音名称将不再是适当的,并可以用适当的外语发音替换。外语发音可以预装在媒体管理***200中,或者可能需要从服务器110处下载。一般,语言选项一旦设定,它们将不被改变。
在步骤510,用户激活可听菜单特征。虽然这可能导致客户机装置115、120、125、或130使用预定义的设置,但是也能向用户呈现各种定制选项。例如,在步骤515,用户能选择在浏览菜单时播放音乐。一旦用户选择要播放的歌曲,用户可能想在听他或她的第一选择时将另一歌曲排队等候。因此,用户可被给予在第一首选定歌曲播放时允许呈现语音名称的选项。如果用户不想在菜单导航期间播放音乐,则可在520将***设置为暂停或静音。
如果用户想在对菜单导航时听音乐,则在步骤525可允许用户将音乐与语音名称混合。通过在当前播放的歌曲中播放音频文件简单地实现混合。如果希望混合,则在步骤530设置混合选项。如果不希望混合,但用户仍想在对菜单导航时播放音乐,则媒体管理***200在步骤535可以允许在一个声道(左边或右边的扬声器)中播放音乐,并通过设置单声道选项在另一声道中播放语音名称。因此,当用户戴耳机时,语音名称将在一个耳朵中呈现而不需要中断在另一耳朵播放音乐。另外,即使用户在步骤530选择了混合选项或在步骤520选择了暂停音乐选项,用户仍有理由在步骤540还选择在单声道中输出语音名称。
一旦设置了所有可听菜单特征,在菜单导航期间客户机装置115、120、125或130就随时可使用语音名称。图6是说明本发明一实施例中在菜单导航期间可执行的步骤的流程图。
在步骤605将菜单激活。如果菜单总是活动的,则可能不需要激活,在但经过一段非激活时间之后一些客户机装置115、120、125或130会使菜单休眠。一般,通过按压导航控制件使菜单停止休眠。导航控制件可包括拨号盘、按钮、触摸屏或任何其他便利的输入机构。导航控制件可呈现在客户机装置115、120、125或130上,或通过远程控制来实现。应知,许多远程控制件没有任何可视显示,如果在客户机装置115、120、125或130上必须使用可视显示,则菜单导航会变得不方便。
一旦激活,媒体管理***200在步骤610选择确定菜单组件是否已突出显示了充分的时间。用户翻滚菜单组件并听到各菜单组件开始的语音名称,只是被下一菜单组件的语音名称打断,然后又被下一菜单组件的语音名称打断,这可能很令人烦扰。最好是,媒体管理***200具有较短的延迟,使得用户没有这种烦扰就可以快速地翻滚各种选项。在615,媒体管理***200等待直到用户停止翻滚菜单组件,并在单个菜单组件上暂停足够的时间以允许在620播放语音名称。这段时间不需要太长,一般不超过几秒,甚至可以是几分之一秒。
在625,用户则具有导航到新菜单组件并重新开始处理的选择权。可通过滚动,或者如果当前突出显示的菜单组件导向另一级菜单,则通过选择当前菜单组件来实现导航。或者,如果用户简单地停止对菜单导航,或进行没有导向更多菜单选项(例如,播放歌曲)的菜单组件选择,该处理可结束。
一般,本发明的方法可以在软件和/或硬件中实现。例如,它们可以在操作***、在单独的用户处理、在绑定到应用程序中的库程序包或在特别构造的设备中实现。在本发明特定实施例中,本发明的方法采用软件(例如操作***和/或运行在操作***上的应用程序)实现。
本发明技术的软件或软件/硬件混合实现可以实现在由存储在存储器中的计算机程序选择性激活或重新配置的通用可编程设备上。在备选实施例中,本发明的方法可实现在通用网络主机例如个人计算机、工作站或服务器上。而且,本发明可至少部分实现在通用计算装置上。
现在参考图7,适于实现本发明技术的计算装置700包括主中央处理器(CPU)705、接口710、存储器715和总线720。当在适当的软件或固件的控制下工作时,CPU 705可以负责实现与期望的计算装置的功能相关联的特定功能。优选是,CPU 705在包括操作***(例如,Mac OSX)和任何适合的应用软件(例如,iTunes)的软件的控制下完成所有这些功能。
CPU 705可包括一个或多个处理器,例如来自摩托罗拉微处理器族或MIPS微处理器族的那些处理器。在备选实施例中,特别设计处理器作为控制计算装置700的操作的硬件。
通常提供接口710作为接口卡。一般来说,它们控制通过网络发送和接收数据包并且有时支持与计算装置700一起使用的其他***设备。可提供的接口包括以太网接口、帧中继接口、电缆接口、DSL接口、令牌环接口等等。另外,可以提供各种超高速度接口,例如高速以太网接口、十亿比特以太网接口、ATM接口、HSSI接口、POS接口、FDDI接口、ASI接口、DHEI接口、Firewire接口、USB接口等等。一般来说,这些接口可包括适于与适当的媒体通信的端口。在某些情况下,它们还可包括独立处理器以及,在一些情况下,易失性RAM。
不管计算装置的配置,可使用一个或多个配置用于储存数据、程序指令和/或与本文描述的技术的功能性有关的其他信息的存储器或存储模块(例如,存储器715)。例如,程序指令可控制操作***和/或一个或多个应用程序的操作。
因为可使用这种信息和程序指令来实现本文描述的***/方法,所以本发明涉及包括程序指令、状态信息等用于执行本文描述的各种操作的可读媒体的设备(例如,计算机)。机器可读媒体的例子包括但不限于例如硬盘、软盘和磁带的磁性媒体;例如CD-ROM光盘的光学媒体;例如光磁软盘的磁光媒体;以及特别配置以存储程序指令的硬件装置,例如只读存储器装置(ROM)和随机存取存储器(RAM)。本发明还可嵌入在通过适当的媒体例如电波、光缆、电线等传播的载波中。程序指令的例子包括机器代码、例如由编译器产生的机器代码以及可由计算机(例如,使用解释器)执行的较高级代码。
虽然本文示出并描述本发明的说明性实施例和应用,但是许多变化和修改是可能的,它们保持在本发明的概念、范围和精神之内,在熟读本应用之后,这些变化对本领域技术人员而言是显见的。例如,术语“滚动”和“突出显示”用于菜单的上下文时,并不局限于它们的字面解释。可以用一个菜单组件替换上一菜单组件在单线上“滚动”菜单选项。同样地,即使菜单选项是斜体、粗体或以着重号列出,也可“突出显示”该菜单选项。因此,所呈现的实施例认为是说明性的而非限制性的,并且本发明不局限于本文所给出的细节,而是可在所附权利要求的范围和等效物内修改。

Claims (24)

1.一种用于提供可听菜单的方法,包括:
在服务器上设置正文串,每个正文串能代表一个菜单选项;
生成音频文件,每个音频文件代表所述正文串之一的语音名称;
将各所述音频文件和与其对应的正文串相关联;
将所述音频文件从服务器传送到客户机;
在包括由所述正文串代表的菜单选项的所述客户机上呈现菜单,所述菜单选项能被突出显示或选择;
当关联的菜单选项被突出显示时,在所述客户机上播放所述音频文件。
2.如权利要求1所述的方法,还包括:
提供可通过所述客户机上的所述菜单来导航的远程控制。
3.如权利要求1所述的方法,其中:
所述语音名称采用非英语的语言。
4.如权利要求1所述的方法,其中:
所述客户机能够播放音乐;以及
在播放音乐时播放所述音频文件并不停止所述音乐的播放。
5.如权利要求4所述的方法,其中:
所述客户机至少在两个声道中生成音频输出;以及
仅通过一个声道输出所述音频文件。
6.如权利要求5所述的方法,其中:
恰好有两个声道用于所述客户机的音频输出,所述两个声道是左声道和右声道。
7.如权利要求4所述的方法,其中:
在播放音乐时所述音频文件与所述音乐混合。
8.一种在服务器计算机上创建音频表达而用于客户机装置的方法,包括:
提供正文串;
生成作为所述正文串的音频表达的音频文件;
将所述音频文件传送到客户机装置。
9.如权利要求8所述的方法,其中:所述正文串属于菜单组件,因此所述菜单组件是可从所述客户机装置上显示的菜单中选择的若干选项之一。
10.如权利要求8所述的方法,其中:所述客户机装置是媒体播放器,且所述正文串属于媒体项。
11.如权利要求8至10中任一权利要求所述的方法,还包括:
播放所述音频文件;以及
在将所述音频文件传送到客户机装置之前,请求认可所播放的音频文件。
12.如权利要求11所述的方法,其中:
通过一个文本/话音转换算法来实现所述音频文件的生成。
13.如权利要求12所述的方法,其中:
如果未得到认可,则提供修改所述正文串的机会;以及
如果修改了所述正文串,则用根据所修改的正文串生成的新音频文件替换所述音频文件;
播放音频文件;以及
请求认可所播放的音频文件。
14.如权利要求13所述的方法,其中:
如果所述正文串未被修改,则提供用从录音生成的新音频文件替换所述音频文件的机会。
15.如权利要求8至10中任一权利要求所述的方法,其中:
所述音频文件的生成至少包括所述音频文件的压缩。
16.如权利要求8至10中任一权利要求所述的方法,其中:
所述音频文件的传送包括在元数据中嵌入所述音频文件。
17.如权利要求8至10中任一权利要求所述的方法,还包含:
确定所述音频文件是否呈现在所述客户机装置上;
其中,仅当所述音频文件未呈现在所述客户机装置上时才执行所述音频文件的传送。
18.一种服务器,包括:
处理器;以及
在操作上与所述处理器连接的存储器;
其中,所述处理器可用来执行指令,所述指令包括
提供代表菜单组件的正文串,从而所述菜单组件是可从客户机装置上的菜单中选择的若干选项之一;
生成作为所述菜单组件的音频表达的音频文件;
将所述音频文件传送到客户机装置。
19.一种在菜单中使用音频文件的方法,包括:
从服务器接收作为菜单组件的音频表达的音频文件,从而所述菜单组件是可选自所述菜单的若干选项之一;
更新所述菜单以包括所述菜单组件;以及
当所述菜单组件被突出显示时,播放所述音频文件。
20.如权利要求19所述的方法,其中:
所述菜单包括还未被所述服务器接收的菜单组件;以及
预装音频文件与还未被所述服务器接收的所述菜单组件相关联。
21.如权利要求19所述的方法,其中:
仅在所述菜单组件已被突出显示一段预定时间之后播放所述音频文件。
22.一种客户机装置,包括:
处理器;以及
在操作上与所述处理器连接的存储器;
其中,所述处理器可用来执行包括以下操作的指令:
从服务器接收作为正文串的音频表达的音频文件;
在所述存储器中存储与相应的正文串相关联的所述音频文件;以及
在所述相应的正文串被显示时播放所述音频文件。
23.一种媒体管理***,包括:
存储媒体文件的媒体数据库;
包含与媒体文件分组有关的数据的媒体集合记录;
包含与所述媒体文件有关的元数据的媒体记录;
存储音频文件的语音名称数据库;以及
将所述音频文件与所述媒体集合记录中的数据和所述媒体记录中的元数据相关联的字符串关联记录。
24.如权利要求23所述的媒体管理***,其中:
所述媒体管理***在便携式数字音乐播放器上运行。
CNA2004800262085A 2003-07-18 2004-05-25 语音信息*** Pending CN1849579A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/623,339 2003-07-18
US10/623,339 US7757173B2 (en) 2003-07-18 2003-07-18 Voice menu system

Publications (1)

Publication Number Publication Date
CN1849579A true CN1849579A (zh) 2006-10-18

Family

ID=34063359

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2004800262085A Pending CN1849579A (zh) 2003-07-18 2004-05-25 语音信息***

Country Status (4)

Country Link
US (1) US7757173B2 (zh)
EP (1) EP1646936A2 (zh)
CN (1) CN1849579A (zh)
WO (1) WO2005015382A2 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101458093B (zh) * 2007-12-12 2011-12-07 株式会社查纳位资讯情报 导航设备
CN101419528B (zh) * 2007-10-24 2012-08-29 兄弟工业株式会社 数据处理装置
CN113766414A (zh) * 2013-04-03 2021-12-07 杜比实验室特许公司 用于基于对象的音频的交互式渲染的方法和***

Families Citing this family (270)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8645137B2 (en) 2000-03-16 2014-02-04 Apple Inc. Fast, language-independent method for user authentication by voice
US8046689B2 (en) * 2004-11-04 2011-10-25 Apple Inc. Media presentation with supplementary media
US6934812B1 (en) * 2001-10-22 2005-08-23 Apple Computer, Inc. Media player with instant play capability
US7433546B2 (en) * 2004-10-25 2008-10-07 Apple Inc. Image scaling arrangement
US8151259B2 (en) 2006-01-03 2012-04-03 Apple Inc. Remote content updates for portable media devices
US8372112B2 (en) * 2003-04-11 2013-02-12 St. Jude Medical, Cardiology Division, Inc. Closure devices, related delivery methods, and related methods of use
US7724716B2 (en) 2006-06-20 2010-05-25 Apple Inc. Wireless communication system
US7831199B2 (en) * 2006-01-03 2010-11-09 Apple Inc. Media data exchange, transfer or delivery for portable electronic devices
US7653542B2 (en) * 2004-05-26 2010-01-26 Verizon Business Global Llc Method and system for providing synthesized speech
TWI254576B (en) * 2004-10-22 2006-05-01 Lite On It Corp Auxiliary function-switching method for digital video player
US7706637B2 (en) 2004-10-25 2010-04-27 Apple Inc. Host configured for interoperation with coupled portable media player device
US7593782B2 (en) 2005-01-07 2009-09-22 Apple Inc. Highly portable media device
US8300841B2 (en) * 2005-06-03 2012-10-30 Apple Inc. Techniques for presenting sound effects on a portable media player
US7424431B2 (en) * 2005-07-11 2008-09-09 Stragent, Llc System, method and computer program product for adding voice activation and voice control to a media player
US8977636B2 (en) 2005-08-19 2015-03-10 International Business Machines Corporation Synthesizing aggregate data of disparate data types into data of a uniform data type
US7590772B2 (en) * 2005-08-22 2009-09-15 Apple Inc. Audio status information for a portable electronic device
US7439465B2 (en) * 2005-09-02 2008-10-21 White Electronics Designs Corporation Switch arrays and systems employing the same to enhance system reliability
US7417202B2 (en) * 2005-09-02 2008-08-26 White Electronic Designs Corporation Switches and systems employing the same to enhance switch reliability and control
US8677377B2 (en) 2005-09-08 2014-03-18 Apple Inc. Method and apparatus for building an intelligent automated assistant
US8266220B2 (en) * 2005-09-14 2012-09-11 International Business Machines Corporation Email management and rendering
US7930369B2 (en) 2005-10-19 2011-04-19 Apple Inc. Remotely configured media device
US8694319B2 (en) 2005-11-03 2014-04-08 International Business Machines Corporation Dynamic prosody adjustment for voice-rendering synthesized data
US8654993B2 (en) 2005-12-07 2014-02-18 Apple Inc. Portable audio device providing automated control of audio volume parameters for hearing protection
US8255640B2 (en) 2006-01-03 2012-08-28 Apple Inc. Media device with intelligent cache utilization
US7673238B2 (en) * 2006-01-05 2010-03-02 Apple Inc. Portable media device with video acceleration capabilities
US8271107B2 (en) 2006-01-13 2012-09-18 International Business Machines Corporation Controlling audio operation for data management and data rendering
US20070192674A1 (en) * 2006-02-13 2007-08-16 Bodin William K Publishing content through RSS feeds
US20070192683A1 (en) * 2006-02-13 2007-08-16 Bodin William K Synthesizing the content of disparate data types
US9135339B2 (en) 2006-02-13 2015-09-15 International Business Machines Corporation Invoking an audio hyperlink
US7996754B2 (en) * 2006-02-13 2011-08-09 International Business Machines Corporation Consolidated content management
US7505978B2 (en) * 2006-02-13 2009-03-17 International Business Machines Corporation Aggregating content of disparate data types from disparate data sources for single point access
US7848527B2 (en) * 2006-02-27 2010-12-07 Apple Inc. Dynamic power management in a portable media delivery system
US9361299B2 (en) * 2006-03-09 2016-06-07 International Business Machines Corporation RSS content administration for rendering RSS content on a digital audio player
US8849895B2 (en) * 2006-03-09 2014-09-30 International Business Machines Corporation Associating user selected content management directives with user selected ratings
US9092542B2 (en) * 2006-03-09 2015-07-28 International Business Machines Corporation Podcasting content associated with a user account
US8607149B2 (en) * 2006-03-23 2013-12-10 International Business Machines Corporation Highlighting related user interface controls
US8073984B2 (en) * 2006-05-22 2011-12-06 Apple Inc. Communication protocol for use with portable electronic devices
US7643895B2 (en) * 2006-05-22 2010-01-05 Apple Inc. Portable media device with workout support
US20070271116A1 (en) 2006-05-22 2007-11-22 Apple Computer, Inc. Integrated media jukebox and physiologic data handling application
US9137309B2 (en) * 2006-05-22 2015-09-15 Apple Inc. Calibration techniques for activity sensing devices
US20070270663A1 (en) * 2006-05-22 2007-11-22 Apple Computer, Inc. System including portable media player and physiologic data gathering device
US7596765B2 (en) 2006-05-23 2009-09-29 Sony Ericsson Mobile Communications Ab Sound feedback on menu navigation
US8358273B2 (en) 2006-05-23 2013-01-22 Apple Inc. Portable media device with power-managed display
US8286229B2 (en) * 2006-05-24 2012-10-09 International Business Machines Corporation Token-based content subscription
US20070277088A1 (en) * 2006-05-24 2007-11-29 Bodin William K Enhancing an existing web page
US7778980B2 (en) * 2006-05-24 2010-08-17 International Business Machines Corporation Providing disparate content as a playlist of media files
WO2008027919A2 (en) * 2006-08-28 2008-03-06 Shaul Shalev Audio-marking of information items for identifying and activating links to information
US7913297B2 (en) * 2006-08-30 2011-03-22 Apple Inc. Pairing of wireless devices using a wired medium
US7813715B2 (en) * 2006-08-30 2010-10-12 Apple Inc. Automated pairing of wireless accessories with host devices
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US8341524B2 (en) * 2006-09-11 2012-12-25 Apple Inc. Portable electronic device with local search capabilities
US8090130B2 (en) 2006-09-11 2012-01-03 Apple Inc. Highly portable media devices
US8036766B2 (en) * 2006-09-11 2011-10-11 Apple Inc. Intelligent audio mixing among media playback and at least one other non-playback application
US7729791B2 (en) * 2006-09-11 2010-06-01 Apple Inc. Portable media playback device including user interface event passthrough to non-media-playback processing
US9196241B2 (en) * 2006-09-29 2015-11-24 International Business Machines Corporation Asynchronous communications using messages recorded on handheld devices
US7831432B2 (en) * 2006-09-29 2010-11-09 International Business Machines Corporation Audio menus describing media contents of media players
US8001400B2 (en) * 2006-12-01 2011-08-16 Apple Inc. Power consumption management for functional preservation in a battery-powered electronic device
US8219402B2 (en) 2007-01-03 2012-07-10 International Business Machines Corporation Asynchronous receipt of information from a user
US9318100B2 (en) * 2007-01-03 2016-04-19 International Business Machines Corporation Supplementing audio recorded in a media file
US8132104B2 (en) * 2007-01-24 2012-03-06 Cerner Innovation, Inc. Multi-modal entry for electronic clinical documentation
KR20080073868A (ko) * 2007-02-07 2008-08-12 엘지전자 주식회사 단말기 및 메뉴표시방법
KR20080073869A (ko) * 2007-02-07 2008-08-12 엘지전자 주식회사 단말기 및 메뉴표시방법
US20080194175A1 (en) * 2007-02-09 2008-08-14 Intellitoys Llc Interactive toy providing, dynamic, navigable media content
CN101247247B (zh) * 2007-02-15 2012-06-27 华为技术有限公司 一种利用呈现信息传播广告的方法、***和服务器
US7589629B2 (en) * 2007-02-28 2009-09-15 Apple Inc. Event recorder for portable media device
US7698101B2 (en) * 2007-03-07 2010-04-13 Apple Inc. Smart garment
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
US8010345B2 (en) * 2007-12-18 2011-08-30 International Business Machines Corporation Providing speech recognition data to a speech enabled device when providing a new entry that is selectable via a speech recognition interface of the device
US10002189B2 (en) 2007-12-20 2018-06-19 Apple Inc. Method and apparatus for searching using an active ontology
US9330720B2 (en) 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US8996376B2 (en) 2008-04-05 2015-03-31 Apple Inc. Intelligent text-to-speech conversion
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
JP2011521383A (ja) 2008-05-20 2011-07-21 ザ・フィードルーム, インコーポレイテッド 身体障害のあるユーザに対応するビデオプレーヤの実時間作成および変更のためのシステムおよび方法
US20100030549A1 (en) 2008-07-31 2010-02-04 Lee Michael M Mobile device having human language translation capability with positional feedback
US8078397B1 (en) 2008-08-22 2011-12-13 Boadin Technology, LLC System, method, and computer program product for social networking utilizing a vehicular assembly
US8131458B1 (en) 2008-08-22 2012-03-06 Boadin Technology, LLC System, method, and computer program product for instant messaging utilizing a vehicular assembly
US8265862B1 (en) 2008-08-22 2012-09-11 Boadin Technology, LLC System, method, and computer program product for communicating location-related information
US8073590B1 (en) 2008-08-22 2011-12-06 Boadin Technology, LLC System, method, and computer program product for utilizing a communication channel of a mobile device by a vehicular assembly
US8768702B2 (en) * 2008-09-05 2014-07-01 Apple Inc. Multi-tiered voice feedback in an electronic device
US8898568B2 (en) * 2008-09-09 2014-11-25 Apple Inc. Audio user interface
US8676904B2 (en) 2008-10-02 2014-03-18 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US20100131845A1 (en) * 2008-11-26 2010-05-27 Toyota Motor Engineering & Manufacturing North America, Inc. Human interface of a media playing device
US9959870B2 (en) 2008-12-11 2018-05-01 Apple Inc. Speech recognition involving a mobile device
US8862252B2 (en) * 2009-01-30 2014-10-14 Apple Inc. Audio user interface for displayless electronic device
CN201408397Y (zh) * 2009-05-12 2010-02-17 李厚敦 带声音提示菜单选择功能的单旋转按钮装置
US20120309363A1 (en) 2011-06-03 2012-12-06 Apple Inc. Triggering notifications associated with tasks items that represent tasks to perform
US10241644B2 (en) 2011-06-03 2019-03-26 Apple Inc. Actionable reminder entries
US10241752B2 (en) 2011-09-30 2019-03-26 Apple Inc. Interface for a virtual digital assistant
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US9431006B2 (en) 2009-07-02 2016-08-30 Apple Inc. Methods and apparatuses for automatic speech recognition
US10276170B2 (en) 2010-01-18 2019-04-30 Apple Inc. Intelligent automated assistant
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
DE202011111062U1 (de) 2010-01-25 2019-02-19 Newvaluexchange Ltd. Vorrichtung und System für eine Digitalkonversationsmanagementplattform
US8903073B2 (en) 2011-07-20 2014-12-02 Zvi Or-Bach Systems and methods for visual presentation and selection of IVR menu
US9001819B1 (en) 2010-02-18 2015-04-07 Zvi Or-Bach Systems and methods for visual presentation and selection of IVR menu
US8548135B1 (en) 2010-02-03 2013-10-01 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US8625756B1 (en) 2010-02-03 2014-01-07 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US8594280B1 (en) 2010-02-03 2013-11-26 Zvi Or-Bach Systems and methods for visual presentation and selection of IVR menu
US8687777B1 (en) 2010-02-03 2014-04-01 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US8681951B1 (en) 2010-02-03 2014-03-25 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US8572303B2 (en) 2010-02-03 2013-10-29 Tal Lavian Portable universal communication device
US8537989B1 (en) 2010-02-03 2013-09-17 Tal Lavian Device and method for providing enhanced telephony
US8406388B2 (en) 2011-07-18 2013-03-26 Zvi Or-Bach Systems and methods for visual presentation and selection of IVR menu
US8548131B1 (en) 2010-02-03 2013-10-01 Tal Lavian Systems and methods for communicating with an interactive voice response system
US8553859B1 (en) 2010-02-03 2013-10-08 Tal Lavian Device and method for providing enhanced telephony
US8879698B1 (en) 2010-02-03 2014-11-04 Tal Lavian Device and method for providing enhanced telephony
US8682667B2 (en) 2010-02-25 2014-03-25 Apple Inc. User profiling for selecting user specific voice input processing information
WO2011105981A1 (en) * 2010-02-26 2011-09-01 Echostar Ukraine, L.L.C. System and methods for enhancing operation of a graphical user interface
US10762293B2 (en) 2010-12-22 2020-09-01 Apple Inc. Using parts-of-speech tagging and named entity recognition for spelling correction
US9262612B2 (en) 2011-03-21 2016-02-16 Apple Inc. Device access using voice authentication
US10057736B2 (en) 2011-06-03 2018-08-21 Apple Inc. Active transport based notifications
US8994660B2 (en) 2011-08-29 2015-03-31 Apple Inc. Text correction processing
US10134385B2 (en) 2012-03-02 2018-11-20 Apple Inc. Systems and methods for name pronunciation
US8867708B1 (en) 2012-03-02 2014-10-21 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US8731148B1 (en) 2012-03-02 2014-05-20 Tal Lavian Systems and methods for visual presentation and selection of IVR menu
US9483461B2 (en) 2012-03-06 2016-11-01 Apple Inc. Handling speech synthesis of content for multiple languages
US9280610B2 (en) 2012-05-14 2016-03-08 Apple Inc. Crowd sourcing information to fulfill user requests
US10417037B2 (en) 2012-05-15 2019-09-17 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US9721563B2 (en) 2012-06-08 2017-08-01 Apple Inc. Name recognition system
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
US9576574B2 (en) 2012-09-10 2017-02-21 Apple Inc. Context-sensitive handling of interruptions by intelligent digital assistant
US9547647B2 (en) 2012-09-19 2017-01-17 Apple Inc. Voice-based media searching
US9164954B2 (en) 2012-10-08 2015-10-20 The Coca-Cola Company Vending accommodation and accessibility
DE112014000709B4 (de) 2013-02-07 2021-12-30 Apple Inc. Verfahren und vorrichtung zum betrieb eines sprachtriggers für einen digitalen assistenten
US9368114B2 (en) 2013-03-14 2016-06-14 Apple Inc. Context-sensitive handling of interruptions
US10652394B2 (en) 2013-03-14 2020-05-12 Apple Inc. System and method for processing voicemail
WO2014144579A1 (en) 2013-03-15 2014-09-18 Apple Inc. System and method for updating an adaptive speech recognition model
US9922642B2 (en) 2013-03-15 2018-03-20 Apple Inc. Training an at least partial voice command system
US9507561B2 (en) * 2013-03-15 2016-11-29 Verizon Patent And Licensing Inc. Method and apparatus for facilitating use of touchscreen devices
US10748529B1 (en) 2013-03-15 2020-08-18 Apple Inc. Voice activated device for use with a voice-based digital assistant
WO2014197334A2 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
WO2014197336A1 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
WO2014197335A1 (en) 2013-06-08 2014-12-11 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
EP3008641A1 (en) 2013-06-09 2016-04-20 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
WO2014200731A1 (en) 2013-06-13 2014-12-18 Apple Inc. System and method for emergency calls initiated by voice command
KR101749009B1 (ko) 2013-08-06 2017-06-19 애플 인크. 원격 디바이스로부터의 활동에 기초한 스마트 응답의 자동 활성화
US10296160B2 (en) 2013-12-06 2019-05-21 Apple Inc. Method for extracting salient dialog usage from live data
US9620105B2 (en) 2014-05-15 2017-04-11 Apple Inc. Analyzing audio input for efficient speech and music recognition
US10592095B2 (en) 2014-05-23 2020-03-17 Apple Inc. Instantaneous speaking of content on touch devices
US9502031B2 (en) 2014-05-27 2016-11-22 Apple Inc. Method for supporting dynamic grammars in WFST-based ASR
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US9633004B2 (en) 2014-05-30 2017-04-25 Apple Inc. Better resolution when referencing to concepts
AU2015266863B2 (en) 2014-05-30 2018-03-15 Apple Inc. Multi-command single utterance input method
US9785630B2 (en) 2014-05-30 2017-10-10 Apple Inc. Text prediction using combined word N-gram and unigram language models
US10170123B2 (en) 2014-05-30 2019-01-01 Apple Inc. Intelligent assistant for home automation
US10289433B2 (en) 2014-05-30 2019-05-14 Apple Inc. Domain specific language for encoding assistant dialog
US9430463B2 (en) 2014-05-30 2016-08-30 Apple Inc. Exemplar-based natural language processing
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
US9734193B2 (en) 2014-05-30 2017-08-15 Apple Inc. Determining domain salience ranking from ambiguous words in natural speech
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US10659851B2 (en) 2014-06-30 2020-05-19 Apple Inc. Real-time digital assistant knowledge updates
CN114115461B (zh) 2014-08-06 2024-04-26 苹果公司 用于电池管理的减小尺寸的用户界面
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
EP4027227A1 (en) 2014-09-02 2022-07-13 Apple Inc. Reduced-size interfaces for managing alerts
US9818400B2 (en) 2014-09-11 2017-11-14 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en) 2014-09-12 2020-09-29 Apple Inc. Dynamic thresholds for always listening speech trigger
US10127911B2 (en) 2014-09-30 2018-11-13 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US10074360B2 (en) 2014-09-30 2018-09-11 Apple Inc. Providing an indication of the suitability of speech recognition
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US10552013B2 (en) 2014-12-02 2020-02-04 Apple Inc. Data detection
US9711141B2 (en) 2014-12-09 2017-07-18 Apple Inc. Disambiguating heteronyms in speech synthesis
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US10152299B2 (en) 2015-03-06 2018-12-11 Apple Inc. Reducing response latency of intelligent automated assistants
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US9721566B2 (en) 2015-03-08 2017-08-01 Apple Inc. Competing devices responding to voice triggers
CN104679415A (zh) * 2015-03-18 2015-06-03 吴爱好 一种智能菜谱推荐播报设备及实现方法
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
US10460227B2 (en) 2015-05-15 2019-10-29 Apple Inc. Virtual assistant in a communication session
US10083688B2 (en) 2015-05-27 2018-09-25 Apple Inc. Device voice control for selecting a displayed affordance
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US9578173B2 (en) 2015-06-05 2017-02-21 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US11025565B2 (en) 2015-06-07 2021-06-01 Apple Inc. Personalized prediction of responses for instant messaging
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US20160378747A1 (en) 2015-06-29 2016-12-29 Apple Inc. Virtual assistant for media playback
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US9697820B2 (en) 2015-09-24 2017-07-04 Apple Inc. Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US11227589B2 (en) 2016-06-06 2022-01-18 Apple Inc. Intelligent list reading
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
DK179309B1 (en) 2016-06-09 2018-04-23 Apple Inc Intelligent automated assistant in a home environment
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10586535B2 (en) 2016-06-10 2020-03-10 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
DK179049B1 (en) 2016-06-11 2017-09-18 Apple Inc Data driven natural language event detection and classification
DK201670540A1 (en) 2016-06-11 2018-01-08 Apple Inc Application integration with a digital assistant
DK179415B1 (en) 2016-06-11 2018-06-14 Apple Inc Intelligent device arbitration and control
DK179343B1 (en) 2016-06-11 2018-05-14 Apple Inc Intelligent task discovery
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US11281993B2 (en) 2016-12-05 2022-03-22 Apple Inc. Model and ensemble compression for metric learning
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US11204787B2 (en) 2017-01-09 2021-12-21 Apple Inc. Application integration with a digital assistant
DK201770383A1 (en) 2017-05-09 2018-12-14 Apple Inc. USER INTERFACE FOR CORRECTING RECOGNITION ERRORS
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
DK201770439A1 (en) 2017-05-11 2018-12-13 Apple Inc. Offline personal assistant
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
DK201770427A1 (en) 2017-05-12 2018-12-20 Apple Inc. LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
DK179745B1 (en) 2017-05-12 2019-05-01 Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
US11301477B2 (en) 2017-05-12 2022-04-12 Apple Inc. Feedback analysis of a digital assistant
DK201770431A1 (en) 2017-05-15 2018-12-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
DK201770432A1 (en) 2017-05-15 2018-12-21 Apple Inc. Hierarchical belief states for digital assistants
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
DK179560B1 (en) 2017-05-16 2019-02-18 Apple Inc. FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US20180336892A1 (en) 2017-05-16 2018-11-22 Apple Inc. Detecting a trigger of a digital assistant
US10657328B2 (en) 2017-06-02 2020-05-19 Apple Inc. Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10755051B2 (en) 2017-09-29 2020-08-25 Apple Inc. Rule-based natural language processing
US10636424B2 (en) 2017-11-30 2020-04-28 Apple Inc. Multi-turn canned dialog
US10733982B2 (en) 2018-01-08 2020-08-04 Apple Inc. Multi-directional dialog
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US10909331B2 (en) 2018-03-30 2021-02-02 Apple Inc. Implicit identification of translation payload with neural machine translation
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US10984780B2 (en) 2018-05-21 2021-04-20 Apple Inc. Global semantic word embeddings using bi-directional recurrent neural networks
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
DK201870355A1 (en) 2018-06-01 2019-12-16 Apple Inc. VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS
DK180639B1 (en) 2018-06-01 2021-11-04 Apple Inc DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT
DK179822B1 (da) 2018-06-01 2019-07-12 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US11386266B2 (en) 2018-06-01 2022-07-12 Apple Inc. Text correction
US11076039B2 (en) 2018-06-03 2021-07-27 Apple Inc. Accelerated task performance
US11010561B2 (en) 2018-09-27 2021-05-18 Apple Inc. Sentiment prediction from textual data
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US11170166B2 (en) 2018-09-28 2021-11-09 Apple Inc. Neural typographical error modeling via generative adversarial networks
US10839159B2 (en) 2018-09-28 2020-11-17 Apple Inc. Named entity normalization in a spoken dialog system
US11475898B2 (en) 2018-10-26 2022-10-18 Apple Inc. Low-latency multi-speaker speech recognition
US11638059B2 (en) 2019-01-04 2023-04-25 Apple Inc. Content playback on multiple devices
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
DK201970509A1 (en) 2019-05-06 2021-01-15 Apple Inc Spoken notifications
US11423908B2 (en) 2019-05-06 2022-08-23 Apple Inc. Interpreting spoken requests
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11475884B2 (en) 2019-05-06 2022-10-18 Apple Inc. Reducing digital assistant latency when a language is incorrectly determined
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
US11496600B2 (en) 2019-05-31 2022-11-08 Apple Inc. Remote execution of machine-learned models
DK180129B1 (en) 2019-05-31 2020-06-02 Apple Inc. USER ACTIVITY SHORTCUT SUGGESTIONS
DK201970511A1 (en) 2019-05-31 2021-02-15 Apple Inc Voice identification in digital assistant systems
US11289073B2 (en) 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11360641B2 (en) 2019-06-01 2022-06-14 Apple Inc. Increasing the relevance of new available information
US11488406B2 (en) 2019-09-25 2022-11-01 Apple Inc. Text detection using global geometry estimators
US11810578B2 (en) 2020-05-11 2023-11-07 Apple Inc. Device arbitration for digital assistant-based intercom systems

Family Cites Families (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05108065A (ja) * 1991-10-15 1993-04-30 Kawai Musical Instr Mfg Co Ltd 自動演奏装置
US5890122A (en) * 1993-02-08 1999-03-30 Microsoft Corporation Voice-controlled computer simulateously displaying application menu and list of available commands
US5661787A (en) * 1994-10-27 1997-08-26 Pocock; Michael H. System for on-demand remote access to a self-generating audio recording, storage, indexing and transaction system
US5999895A (en) * 1995-07-24 1999-12-07 Forest; Donald K. Sound operated menu method and apparatus
US20030051136A1 (en) * 1995-11-06 2003-03-13 Pavel Curtis Multimedia coordination system
US5802526A (en) * 1995-11-15 1998-09-01 Microsoft Corporation System and method for graphically displaying and navigating through an interactive voice response menu
US5912952A (en) * 1996-06-27 1999-06-15 At&T Corp Voice response unit with a visual menu interface
US5950123A (en) * 1996-08-26 1999-09-07 Telefonaktiebolaget L M Cellular telephone network support of audible information delivery to visually impaired subscribers
US5721827A (en) 1996-10-02 1998-02-24 James Logan System for electrically distributing personalized information
US20070026852A1 (en) * 1996-10-02 2007-02-01 James Logan Multimedia telephone system
US20020002039A1 (en) * 1998-06-12 2002-01-03 Safi Qureshey Network-enabled audio device
US6563769B1 (en) * 1998-06-11 2003-05-13 Koninklijke Philips Electronics N.V. Virtual jukebox
US6493428B1 (en) * 1998-08-18 2002-12-10 Siemens Information & Communication Networks, Inc Text-enhanced voice menu system
US6360237B1 (en) * 1998-10-05 2002-03-19 Lernout & Hauspie Speech Products N.V. Method and system for performing text edits during audio recording playback
US6983251B1 (en) * 1999-02-15 2006-01-03 Sharp Kabushiki Kaisha Information selection apparatus selecting desired information from plurality of audio information by mainly using audio
US20020013852A1 (en) * 2000-03-03 2002-01-31 Craig Janik System for providing content, management, and interactivity for thin client devices
WO2001030046A2 (en) * 1999-10-22 2001-04-26 Tellme Networks, Inc. Streaming content over a telephone interface
US6978127B1 (en) * 1999-12-16 2005-12-20 Koninklijke Philips Electronics N.V. Hand-ear user interface for hand-held device
US6519566B1 (en) * 2000-03-01 2003-02-11 International Business Machines Corporation Method for hands-free operation of a pointer
NL1014847C1 (nl) 2000-04-05 2001-10-08 Minos B V I O Gegevensoverdracht.
NZ523065A (en) * 2000-05-11 2004-11-26 Nes Stewart Irvine A graphical user interface where a procedure is activated by movement of a pointer over a predetermined path
KR100867760B1 (ko) * 2000-05-15 2008-11-10 소니 가부시끼 가이샤 재생장치, 재생방법 및 기록매체
US6754504B1 (en) * 2000-06-10 2004-06-22 Motorola, Inc. Method and apparatus for controlling environmental conditions using a personal area network
US20020013784A1 (en) * 2000-07-31 2002-01-31 Swanson Raymond H. Audio data transmission system and method of operation thereof
US6529586B1 (en) * 2000-08-31 2003-03-04 Oracle Cable, Inc. System and method for gathering, personalized rendering, and secure telephonic transmission of audio data
US6556971B1 (en) 2000-09-01 2003-04-29 Snap-On Technologies, Inc. Computer-implemented speech recognition system training
US20020046315A1 (en) * 2000-10-13 2002-04-18 Interactive Objects, Inc. System and method for mapping interface functionality to codec functionality in a portable audio device
US6947728B2 (en) * 2000-10-13 2005-09-20 Matsushita Electric Industrial Co., Ltd. Mobile phone with music reproduction function, music data reproduction method by mobile phone with music reproduction function, and the program thereof
US6731312B2 (en) 2001-01-08 2004-05-04 Apple Computer, Inc. Media player interface
WO2001030127A2 (de) * 2001-01-23 2001-05-03 Phonak Ag Verfahren zur kommunikation und hörhilfegerätsystem
US6448485B1 (en) * 2001-03-16 2002-09-10 Intel Corporation Method and system for embedding audio titles
US6834264B2 (en) * 2001-03-29 2004-12-21 Provox Technologies Corporation Method and apparatus for voice dictation and document production
US6892083B2 (en) * 2001-09-05 2005-05-10 Vocera Communications Inc. Voice-controlled wireless communications system and method
US7010581B2 (en) * 2001-09-24 2006-03-07 International Business Machines Corporation Method and system for providing browser functions on a web page for client-specific accessibility
US7027990B2 (en) 2001-10-12 2006-04-11 Lester Sussman System and method for integrating the visual display of text menus for interactive voice response systems
JP4204977B2 (ja) 2001-10-22 2009-01-07 アップル インコーポレイテッド メディアプレーヤーのためのインテリジェントなシンクロ操作
US20030167318A1 (en) 2001-10-22 2003-09-04 Apple Computer, Inc. Intelligent synchronization of media player with host computer
ATE365413T1 (de) * 2001-10-30 2007-07-15 Hewlett Packard Co Kommunikationssystem und -verfahren
EP1311102A1 (en) 2001-11-08 2003-05-14 Hewlett-Packard Company Streaming audio under voice control
US6996777B2 (en) * 2001-11-29 2006-02-07 Nokia Corporation Method and apparatus for presenting auditory icons in a mobile terminal
US20030158737A1 (en) * 2002-02-15 2003-08-21 Csicsatka Tibor George Method and apparatus for incorporating additional audio information into audio data file identifying information
US6999066B2 (en) * 2002-06-24 2006-02-14 Xerox Corporation System for audible feedback for touch screen displays
US7166791B2 (en) * 2002-07-30 2007-01-23 Apple Computer, Inc. Graphical user interface and methods of use thereof in a multimedia player
US7136874B2 (en) * 2002-10-16 2006-11-14 Microsoft Corporation Adaptive menu system for media players
US7054888B2 (en) * 2002-10-16 2006-05-30 Microsoft Corporation Optimizing media player memory during rendering
US20040218451A1 (en) * 2002-11-05 2004-11-04 Said Joe P. Accessible user interface and navigation system and method
US20060235550A1 (en) * 2003-04-24 2006-10-19 Csicsatka Tibor G Creation of playlists using audio identification
US6728729B1 (en) 2003-04-25 2004-04-27 Apple Computer, Inc. Accessing media across networks
US20050045373A1 (en) * 2003-05-27 2005-03-03 Joseph Born Portable media device with audio prompt menu
KR20050072256A (ko) * 2004-01-06 2005-07-11 엘지전자 주식회사 고밀도 광디스크의 메뉴 사운드 구성방법 및 재생방법과기록재생장치

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101419528B (zh) * 2007-10-24 2012-08-29 兄弟工业株式会社 数据处理装置
CN101458093B (zh) * 2007-12-12 2011-12-07 株式会社查纳位资讯情报 导航设备
CN113766414A (zh) * 2013-04-03 2021-12-07 杜比实验室特许公司 用于基于对象的音频的交互式渲染的方法和***
CN113766414B (zh) * 2013-04-03 2024-03-01 杜比实验室特许公司 用于基于对象的音频的交互式渲染的方法和***

Also Published As

Publication number Publication date
US20050015254A1 (en) 2005-01-20
US7757173B2 (en) 2010-07-13
WO2005015382A3 (en) 2006-01-05
EP1646936A2 (en) 2006-04-19
WO2005015382A2 (en) 2005-02-17

Similar Documents

Publication Publication Date Title
CN1849579A (zh) 语音信息***
US7779357B2 (en) Audio user interface for computing devices
US11080474B2 (en) Calculations on sound associated with cells in spreadsheets
EP2324416B1 (en) Audio user interface
US8108462B2 (en) Information processing apparatus, information processing method, information processing program and recording medium for storing the program
US8438485B2 (en) System, method, and apparatus for generating, customizing, distributing, and presenting an interactive audio publication
KR101242040B1 (ko) 포터블 기기의 재생 목록 자동 생성 방법 및 장치
US20110153330A1 (en) System and method for rendering text synchronized audio
US20090307199A1 (en) Method and apparatus for generating voice annotations for playlists of digital media
US20110016079A1 (en) Summarizing a Body of Media
KR20050060753A (ko) Tts탐색기능을 지원하는 방법 및 이를 이용한멀티미디어 장치
US20240126500A1 (en) Device and method for creating a sharable clip of a podcast
CN1818899A (zh) Mpeg播放器的数据检索方法
WO2008080775A2 (en) Templates and style sheets for audio broadcasts
CN2679758Y (zh) 具有乐曲检索功能的音乐播放器
TW201340693A (zh) 智慧電視股票看盤個人化語音播報裝置與方法
KR20080052525A (ko) 메타데이터를 동반한 직접 인코딩 시스템
Mazzoni et al. Podcasting with Audacity: Creating a Podcast With Free Audio Software (Digital Short Cut)
Proctor Microware Review
KR20070016620A (ko) 오디오 데이터의 메타 데이터를 음성으로 제공하는 장치 및그 방법

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20061018