WO2018202073A1 - Method and apparatus for voice control of a smart device, and smart device

Method and apparatus for voice control of a smart device, and smart device

Info

Publication number
WO2018202073A1
Authority
WO
WIPO (PCT)
Prior art keywords
prompt information
voice operation
voice
operation prompt
smart device
Prior art date
Application number
PCT/CN2018/085442
Other languages
English (en)
French (fr)
Inventor
李良
葛均辉
王熙
刘义平
于鸿洋
Original Assignee
北京奇虎科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京奇虎科技有限公司
Publication of WO2018202073A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/72: Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724: User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448: User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/72: Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724: User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403: User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243: User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433: User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/72: Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724: User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403: User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243: User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72436: User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for text messaging, e.g. short messaging services [SMS] or e-mails
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/72: Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724: User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448: User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454: User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M1/00: Substation equipment, e.g. for use by subscribers
    • H04M1/72: Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724: User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72469: User interfaces specially adapted for cordless or mobile telephones for operating the device by selecting functions from two or more displayed items, e.g. menus or icons
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M2250/00: Details of telephonic subscriber devices
    • H04M2250/74: Details of telephonic subscriber devices with voice recognition means

Definitions

  • The present invention relates to the field of smart devices, and in particular to a method and an apparatus for voice control of a smart device, and to a smart device.
  • Because smart devices are more convenient to operate by voice command in normal use, most of them support voice operation. Even so, there are many scene modes in which voice operation is not supported. For the user, leafing through the manual to find the voice keywords supported by each scene mode is also tedious. How to give the user a well-timed voice operation prompt is therefore a problem to be solved.
  • In view of the above problems, the present invention is proposed to provide a method and an apparatus for voice control of a smart device, and a smart device, that overcome the above problems or at least partially solve them.
  • A first aspect of the present invention provides a method for voice control of a smart device, including: when a screen of the smart device loads or switches a user interface, acquiring the current scene mode of the smart device; determining whether the current scene mode supports voice operation; and, if it does, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.
  • According to a second aspect of the present invention, an apparatus for voice control of a smart device is provided, comprising: at least one processor; and at least one memory communicably coupled to the at least one processor. The at least one memory includes processor-executable instructions which, when executed by the at least one processor, cause the apparatus to perform at least the following operations: when a screen of the smart device loads or switches a user interface, acquiring the current scene mode of the smart device; determining whether the current scene mode supports voice operation; and, when the current scene mode supports voice operation, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.
  • According to a third aspect of the present invention, a smart device is provided, comprising: one or more processors; a memory; and one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, and are configured to perform the following operations: collecting voice data and sending the collected voice data to the application that is to receive it; converting the collected voice data into text data and sending the converted text data to the application that is to receive it; and the operations performed by the apparatus of the second aspect.
  • According to a fourth aspect of the present invention, a computer program is provided, comprising computer readable code which, when run by a smart device, causes the method of the first aspect to be performed.
  • According to a fifth aspect of the present invention, a computer readable medium is provided, in which the computer program of the fourth aspect is stored.
  • As can be seen from the above, the technical solution of the present invention acquires the current scene mode of the smart device when the screen of the smart device loads or switches a user interface, and then makes a single determination: if the scene mode supports voice operation, at least one piece of voice operation prompt information that the current scene mode can support is selected, and the selected at least one piece of voice operation prompt information is displayed at a specified position on the screen of the smart device.
  • The technical solution can determine fairly reasonably, at a low resource cost, whether the user needs a voice operation prompt, so that the user can clearly see, at a glance from the screen display, whether voice operation is available at that moment and how to perform it.
  • FIG. 1 is a schematic flowchart of a method for voice control of a smart device according to an embodiment of the present invention.
  • FIG. 2-a is a schematic diagram of an interface in which multiple pieces of voice operation prompt information are displayed in a floating window on the screen of a smart rear view mirror.
  • FIG. 2-b is a schematic diagram of an interface in which multiple pieces of voice operation prompt information are displayed in a floating window on the screen of another smart device.
  • FIG. 3 is a schematic structural diagram of an apparatus for voice control of a smart device according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of a smart device according to an embodiment of the present invention.
  • FIG. 5 is a block diagram of a smart device for performing the method according to the invention.
  • FIG. 6 is a schematic diagram of a storage unit for holding or carrying program code that implements the method according to the invention.
  • FIG. 1 is a schematic flowchart diagram of a method for controlling a smart device according to an embodiment of the present invention. As shown in FIG. 1, the method includes:
  • Step S110 Acquire a current scene mode of the smart device when the screen of the smart device loads or switches the user interface.
  • The scene mode may include: the interface of the application running in the foreground, and/or the functions of applications that can currently be invoked.
  • For example, when the user opens a new application, the application starts running in the foreground and its interface is activated; the interface of that application is then acquired, and it is determined whether the interface supports voice operation.
  • As another example, while a music playing application is running, the user opens a new application; two scene modes are then acquired: the interface of the new application running in the foreground, and the music playing function that can be invoked in the background.
  • Although the interface of the new application running in the foreground does not support voice operation, the music playing function does (for example, pausing or skipping to another song), so the current scene mode is still determined to support voice operation. This is step S120: determining whether the current scene mode supports voice operation.
  • Step S130: if it does, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.
  • Specifically, the smart device may be an in-vehicle smart device, a mobile terminal or a computer device, for example an emerging in-vehicle smart device such as a smart rear view mirror, or a smart device on which voice control is already widely used, such as a mobile phone.
  • For example, FIG. 2-a is a schematic diagram of an interface in which multiple pieces of voice operation prompt information are displayed in a floating window on the screen of a smart rear view mirror.
  • As shown in FIG. 2-a, the current scene mode is the main interface of Kuwo Music (酷我音乐), together with the call function and the navigation function running in the background (not shown, because they are not running in the foreground).
  • The user sees a floating window on the right side of the smart rear view mirror screen showing four pieces of voice operation prompt information, and can issue voice commands that follow the format shown.
  • FIG. 2-b is a schematic diagram of an interface in which multiple pieces of voice operation prompt information are displayed in a floating window on the screen of another smart device. As shown in FIG. 2-b, so many pieces of voice operation prompt information were selected that they cannot all be shown on the screen; the user can swipe manually to view the voice operation prompt information that is not yet visible.
  • It can be seen that the method shown in FIG. 1 acquires the current scene mode of the smart device when the screen of the smart device loads or switches a user interface, and then makes a single determination: if the scene mode supports voice operation, at least one piece of voice operation prompt information that the current scene mode can support is selected, and the selected at least one piece of voice operation prompt information is displayed at a specified position on the screen of the smart device.
  • The technical solution can determine fairly reasonably, at a low resource cost, whether the user needs a voice operation prompt, so that the user can clearly see, at a glance from the screen display, whether voice operation is available at that moment and how to perform it.
  • In an embodiment of the present invention, the method shown in FIG. 1 further includes: setting, for each scene mode of the smart device, the voice operation prompt information that the scene mode can support, and saving it in a specified configuration file. Determining whether the current scene mode supports voice operation then includes: searching the configuration file for voice operation prompt information that the scene mode can support; if any is found, determining that the current scene mode supports voice operation; if none is found, determining that the current scene mode does not support voice operation.
  • Take the interface of a map navigation application running in the foreground as an example. The application may have multiple levels of interfaces.
  • The voice operations supported by each interface are not exactly the same, so each interface can also be treated as a separate scene mode.
  • The corresponding voice operation prompt information is saved in the configuration file.
  • In an embodiment of the present invention, selecting at least one piece of voice operation prompt information that the current scene mode can support includes: determining the display quantity of voice operation prompt information to be displayed; and, from the found voice operation prompt information that the current scene mode can support, randomly selecting a number of pieces equal to the determined display quantity, and/or performing a weighted random selection of a number of pieces equal to the determined display quantity according to the weight set in the configuration file for each piece of voice operation prompt information.
  • If the determined display quantity of voice operation prompt information to be shown in the floating window is greater than the number of pieces of voice operation prompt information found for the current scene mode,
  • all the voice operation prompt information found for the current scene mode is displayed.
  • As for random selection, all voice operation prompt information may either be treated alike or be weighted.
  • For example, an application adds a new function after an update, and this function also supports voice operation. The user may not yet be able to use it proficiently and needs a prompt, so the weight of the voice operation prompt information corresponding to that function can be set higher.
  • In an embodiment of the present invention, the method further includes: recording the voice operation prompt information used by the user; and adjusting the weight preset for each piece of voice operation prompt information in the configuration file according to the usage record of the voice operation prompt information.
  • Specifically, the voice operation prompt information is prompt information that contains a voice keyword, and recording the voice operation prompt information used by the user includes: when the user speaks a voice keyword, increasing the usage count of the corresponding voice operation prompt information by one.
  • For example, "call", "close", "navigate to" and "open music" shown in FIG. 2-a are four voice keywords, and a record can be made each time the user speaks one of them.
  • Adjusting the weight preset for each piece of voice operation prompt information in the configuration file according to the usage record includes: when the number of times the user has used a piece of voice operation prompt information reaches a preset value, raising or lowering the weight of that voice operation prompt information to the weight value corresponding to the preset value.
  • the configuration file can be loaded when the smart device is started, thereby ensuring the correct operation of the above functions.
  • the method further includes: displaying the voice operation prompt information through a floating window, and displaying a trigger switch for turning on/off the floating window in an operating system of the smart device.
  • a trigger switch for opening/closing the floating window is displayed.
  • The number of times the user has used each piece of voice operation prompt information may also be counted and reported to a server, so that the usage of each user serves as a big-data sample for calculating the preset value used as the trigger condition for adjusting the weight of the voice operation prompt information.
  • FIG. 3 is a schematic structural diagram of an apparatus for a voice control smart device according to an embodiment of the present invention. As shown in FIG. 3, the apparatus 300 for a voice control smart device includes:
  • the scene mode obtaining unit 310 is configured to acquire a current scene mode of the smart device when the screen of the smart device loads or switches the user interface.
  • The scene mode may include: the interface of the application running in the foreground, and/or the functions of applications that can currently be invoked.
  • For example, when the user opens a new application, the application starts running in the foreground and its interface is activated; the interface of that application is then acquired, and it is determined whether the interface supports voice operation.
  • As another example, while a music playing application is running, the user opens a new application; two scene modes are then acquired: the interface of the new application running in the foreground, and the music playing function that can be invoked in the background.
  • Although the interface of the new application running in the foreground does not support voice operation, the music playing function does (for example, pausing or skipping to another song), so the current scene mode is still determined to support voice operation. Accordingly, the determining unit 320 is adapted to determine whether the current scene mode supports voice operation.
  • The display unit 330 is adapted to select, when the current scene mode supports voice operation, at least one piece of voice operation prompt information that the current scene mode can support, and to display the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.
  • It can be seen that, through the cooperation of its units, the apparatus shown in FIG. 3 acquires the current scene mode of the smart device when the screen of the smart device loads or switches a user interface, and then makes a single determination: if the scene mode supports voice operation, at least one piece of voice operation prompt information that the current scene mode can support is selected, and the selected at least one piece of voice operation prompt information is displayed at a specified position on the screen of the smart device.
  • The technical solution can determine fairly reasonably, at a low resource cost, whether the user needs a voice operation prompt, so that the user can clearly see, at a glance from the screen display, whether voice operation is available at that moment and how to perform it.
  • In an embodiment of the present invention, the apparatus further includes a configuration unit 340, adapted to set, for each scene mode of the smart device, the voice operation prompt information that the scene mode can support, and to save it in a specified configuration file.
  • The determining unit 320 is adapted to search the configuration file for voice operation prompt information that the scene mode can support; if any is found, to determine that the current scene mode supports voice operation; and if none is found, to determine that the current scene mode does not support voice operation.
  • Take the interface of a map navigation application running in the foreground as an example. The application may have multiple levels of interfaces.
  • The voice operations supported by each interface are not exactly the same, so each interface can also be treated as a separate scene mode.
  • The corresponding voice operation prompt information is saved in the configuration file.
  • In an embodiment of the present invention, the display unit 330 is adapted to determine the display quantity of voice operation prompt information to be displayed and, from the found voice operation prompt information that the current scene mode can support, to randomly select a number of pieces equal to the determined display quantity, and/or to perform a weighted random selection of a number of pieces equal to the determined display quantity according to the weight set in the configuration file for each piece of voice operation prompt information.
  • If the determined display quantity is greater than the number of pieces of voice operation prompt information found for the current scene mode,
  • all the voice operation prompt information found is displayed.
  • As for random selection, all voice operation prompt information may either be treated alike or be weighted.
  • For example, an application adds a new function after an update, and this function also supports voice operation. The user may not yet be able to use it proficiently and needs a prompt, so the weight of the voice operation prompt information corresponding to that function can be set higher.
  • In an embodiment of the present invention, the apparatus further includes a recording unit 350, adapted to record the voice operation prompt information used by the user; the configuration unit 340 is further adapted to adjust the weight preset for each piece of voice operation prompt information in the configuration file according to the usage record of the voice operation prompt information.
  • Specifically, the voice operation prompt information is prompt information that contains a voice keyword.
  • The recording unit 350 is adapted, when the user speaks a voice keyword,
  • to increase the usage count of the corresponding voice operation prompt information by one.
  • For example, "call", "close", "navigate to" and "open music" shown in FIG. 2-a are four voice keywords, and a record can be made each time the user speaks one of them.
  • The configuration unit 340 is adapted, when the number of times the user has used a piece of voice operation prompt information reaches a preset value, to raise or lower the weight of that voice operation prompt information to the weight value corresponding to the preset value.
  • the configuration unit 340 is adapted to load a configuration file when the smart device is started, thereby ensuring correct operation of the foregoing functions.
  • the display unit 330 is further configured to display the voice operation prompt information through a floating window, and display a trigger switch for turning on/off the floating window in an operating system of the smart device.
  • a trigger switch for opening/closing the floating window is displayed.
  • The number of times the user has used each piece of voice operation prompt information may also be counted and reported to a server, so that the usage of each user serves as a big-data sample for calculating the preset value used as the trigger condition for adjusting the weight of the voice operation prompt information.
  • FIG. 4 is a schematic structural diagram of a smart device according to an embodiment of the present invention. As shown in FIG. 4, the smart device includes: a voice collection unit 410, a voice recognition unit 420, and the apparatus 300 for voice control of a smart device according to any of the above embodiments.
  • The voice collection unit 410 is adapted to collect voice data and to send the collected voice data to the application that is to receive it and/or to the voice recognition unit 420. Many applications (such as instant messaging applications) can use the collected voice data directly, so no recognition by the voice recognition unit is needed in that case.
  • The voice recognition unit 420 is adapted to convert the voice data collected by the voice collection unit 410 into text data, and to send the converted text data to the application that is to receive it. Specifically, this can be implemented with a speech recognition library.
  • the smart device may be an in-vehicle smart device, a mobile terminal, or a computer device.
  • a mobile phone, a driving recorder, a smart rear view mirror, etc. can be used as the smart device in the above embodiment.
  • In summary, the technical solution of the present invention acquires the current scene mode of the smart device when the screen of the smart device loads or switches a user interface, and then makes a single determination: if the scene mode supports voice operation, at least one piece of voice operation prompt information that the current scene mode can support is selected, and the selected at least one piece of voice operation prompt information is displayed at a specified position on the screen of the smart device.
  • The technical solution can determine fairly reasonably, at a low resource cost, whether the user needs a voice operation prompt, so that the user can clearly see, at a glance from the screen display, whether voice operation is available at that moment and how to perform it.
  • FIG. 5 shows a smart device (smart devices are hereinafter collectively referred to as the device) that can implement voice control according to the present invention.
  • the device conventionally includes a processor 510 and a computer program product or computer readable medium in the form of a memory 520.
  • the memory 520 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM.
  • Memory 520 has a memory space 530 for program code 531 for performing any of the method steps described above.
  • storage space 530 for program code may include various program code 531 for implementing various steps in the above methods, respectively.
  • the program code can be read from or written to one or more computer program products.
  • These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such computer program products are typically portable or fixed storage units as described with reference to FIG.
  • the storage unit may have a storage section or a storage space or the like arranged similarly to the storage 520 in FIG.
  • the program code can be compressed, for example, in an appropriate form.
  • The storage unit comprises program code 531' for performing the steps of the method according to the invention, that is, code readable by a processor such as 510 which, when run by the device, causes the device to perform each step of the method described above.
  • modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment.
  • the modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components.
  • All features disclosed in this specification (including the accompanying claims, the abstract and the drawings), and all processes or units of any method or device so disclosed, may be combined in any combination.
  • Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that provide the same, equivalent or similar purpose.
  • The various component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the apparatus for voice control of a smart device and of the smart device according to embodiments of the present invention.
  • the invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein.
  • Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Environmental & Geological Engineering (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

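The present invention discloses a method and an apparatus for voice control of a smart device, and a smart device. The method includes: when a screen of the smart device loads or switches a user interface, acquiring the current scene mode of the smart device; determining whether the current scene mode supports voice operation; and, if it does, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device. The technical solution can determine fairly reasonably, at a low resource cost, whether the user needs a voice operation prompt, so that the user can clearly see, at a glance from the screen display, whether voice operation is available at that moment and how to perform it.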

Description

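Method and apparatus for voice control of a smart device, and smart device
Technical Field
The present invention relates to the field of smart devices, and in particular to a method and an apparatus for voice control of a smart device, and to a smart device.
Background
Because smart devices are more convenient to operate by voice command in normal use, most of them support voice operation. Even so, there are many scene modes in which voice operation is not supported. For the user, leafing through the manual to find the voice keywords supported by each scene mode is also tedious. How to give the user a well-timed voice operation prompt is therefore a problem to be solved.
Summary of the Invention
In view of the above problems, the present invention is proposed to provide a method and an apparatus for voice control of a smart device, and a smart device, that overcome the above problems or at least partially solve them.
A first aspect of the present invention provides a method for voice control of a smart device, including: when a screen of the smart device loads or switches a user interface, acquiring the current scene mode of the smart device; determining whether the current scene mode supports voice operation; and, if it does, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.
According to a second aspect of the present invention, an apparatus for voice control of a smart device is provided, comprising: at least one processor; and at least one memory communicably coupled to the at least one processor. The at least one memory includes processor-executable instructions which, when executed by the at least one processor, cause the apparatus to perform at least the following operations: when a screen of the smart device loads or switches a user interface, acquiring the current scene mode of the smart device; determining whether the current scene mode supports voice operation; and, when the current scene mode supports voice operation, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.
According to a third aspect of the present invention, a smart device is provided, comprising: one or more processors; a memory; and one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, and are configured to perform the following operations: collecting voice data and sending the collected voice data to the application that is to receive it; converting the collected voice data into text data and sending the converted text data to the application that is to receive it; and the operations performed by the apparatus of the second aspect.
According to a fourth aspect of the present invention, a computer program is provided, comprising computer readable code which, when run by a smart device, causes the method of the first aspect to be performed.
According to a fifth aspect of the present invention, a computer readable medium is provided, in which the computer program of the fourth aspect is stored.
As can be seen from the above, the technical solution of the present invention acquires the current scene mode of the smart device when the screen of the smart device loads or switches a user interface, and then makes a single determination: if the scene mode supports voice operation, at least one piece of voice operation prompt information that the current scene mode can support is selected, and the selected at least one piece of voice operation prompt information is displayed at a specified position on the screen of the smart device. The technical solution can determine fairly reasonably, at a low resource cost, whether the user needs a voice operation prompt, so that the user can clearly see, at a glance from the screen display, whether voice operation is available at that moment and how to perform it.
The above is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented in accordance with the specification, and so that the above and other objects, features and advantages of the invention are more apparent, specific embodiments of the invention are set out below.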
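Brief Description of the Drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art from reading the detailed description of the preferred embodiments below. The drawings serve only to illustrate the preferred embodiments and are not to be regarded as limiting the invention. Throughout the drawings, the same reference symbols denote the same components. In the drawings:
FIG. 1 is a schematic flowchart of a method for voice control of a smart device according to an embodiment of the present invention;
FIG. 2-a is a schematic diagram of an interface in which multiple pieces of voice operation prompt information are displayed in a floating window on the screen of a smart rear view mirror;
FIG. 2-b is a schematic diagram of an interface in which multiple pieces of voice operation prompt information are displayed in a floating window on the screen of another smart device;
FIG. 3 is a schematic structural diagram of an apparatus for voice control of a smart device according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of a smart device according to an embodiment of the present invention;
FIG. 5 is a block diagram of a smart device for performing the method according to the invention; and
FIG. 6 is a schematic diagram of a storage unit for holding or carrying program code that implements the method according to the invention.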
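Detailed Description
Exemplary embodiments of the present disclosure are described in more detail below with reference to the drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure can be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure will be understood more thoroughly and its scope conveyed completely to those skilled in the art.
FIG. 1 is a schematic flowchart of a method for voice control of a smart device according to an embodiment of the present invention. As shown in FIG. 1, the method includes:
Step S110: when the screen of the smart device loads or switches a user interface, acquiring the current scene mode of the smart device.
The scene mode may include: the interface of the application running in the foreground, and/or the functions of applications that can currently be invoked.
For example, when the user opens a new application, the application starts running in the foreground and its interface is activated; the interface of that application is then acquired, and it is determined whether the interface supports voice operation. As another example, while a music playing application is running, the user opens a new application; two scene modes are then acquired: the interface of the new application running in the foreground, and the music playing function that can be invoked in the background. Although the interface of the new application running in the foreground does not support voice operation, the music playing function does (for example, pausing or skipping to another song), so the current scene mode is still determined to support voice operation. This is step S120: determining whether the current scene mode supports voice operation.
Step S130: if it does, selecting at least one piece of voice operation prompt information that the current scene mode can support, and displaying the selected at least one piece of voice operation prompt information at a specified position on the screen of the smart device.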
具体来说,智能设备可以为车载智能设备、移动终端或计算机设备,如智能后视镜等新兴的车载智能设备,又如手机这种已经广泛应用了语音控制的智能设备。例如,图2-a示出了在智能后视镜屏幕的悬浮窗中显示多条语音操作提示信息的界面示意图。如图2-a所示,当前的情景模式是酷我音乐的主界面,以及后台运行的通话功能和导航功能(由于未在前台运行未能示出)。此时用户看到在智能后视镜屏幕的右侧有一个悬浮窗,示出了四条语音操作提示信息,用户可以发出符合这样格式的语音指令。图2-b示出了在另一智能设备屏幕的悬浮窗中显示多条语音操作提示信息的界面示意图。如图2-b所示,由于选择的语音操作提示信息较多,在屏幕中未能全部显示,此时用户可以手动滑动来查看未能显示出的其他语言操作提示信息。
可见,图1所示的方法,在智能设备的屏幕加载或切换用户界面时,获取到智能设备当前的情景模式,然后进行一次判断,若该情景模式支持语音操作,那么选择所述当前的情景模式能够支持的至少一条语音操作提示信息,并在智能设备的所述屏幕上的指定位置显示选择的所述至少一条语音操作提示信息。该技术方案可以在较少地耗费资源的情况下,较为合理地判断出是否需要为用户进行语音操作提示,这样使得用户可以清晰地了解到此时能否使用语音操作,以及如何来实现语音操作,这些都通过在屏幕上的显示能够一目了然。
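Steps S110 to S130 can be sketched roughly as follows. This is a minimal illustration, not the implementation described in the application: the table `PROMPT_TABLE`, the scenario-mode names, the prompt texts "Pause" and "Next song", and the `show` callback are all hypothetical, and for brevity the sketch displays every supported prompt instead of selecting a subset.

```python
from typing import Callable, Dict, List

# Hypothetical lookup table: scenario mode -> voice prompts it supports.
PROMPT_TABLE: Dict[str, List[str]] = {
    "music_playback_function": ["Pause", "Next song"],
    "kuwo_music_main_ui": ["Open music", "Close"],
}


def get_scenario_modes(foreground_ui: str, invocable_functions: List[str]) -> List[str]:
    """Step S110: the foreground interface plus functions that can currently be invoked."""
    return [foreground_ui, *invocable_functions]


def supports_voice(modes: List[str], table: Dict[str, List[str]]) -> bool:
    """Step S120: voice operation is supported if any current mode has prompts."""
    return any(table.get(mode) for mode in modes)


def on_ui_loaded_or_switched(
    foreground_ui: str,
    invocable_functions: List[str],
    show: Callable[[List[str]], None],
    table: Dict[str, List[str]] = PROMPT_TABLE,
) -> List[str]:
    """Run S110-S130 once per UI load or switch; return the prompts shown."""
    modes = get_scenario_modes(foreground_ui, invocable_functions)
    if not supports_voice(modes, table):
        return []  # nothing is displayed when voice operation is unsupported
    # Step S130 (simplified): gather the prompts of all supported modes and show them.
    prompts = [p for mode in modes for p in table.get(mode, [])]
    show(prompts)
    return prompts
```

With a foreground interface that has no entry in the table and a music playback function that does, the call still yields prompts, which mirrors the example in the text where the background function alone makes the scenario mode voice-capable.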
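In an embodiment of the present invention, the method shown in Fig. 1 further comprises: setting, for each scenario mode of the smart device, the voice operation prompt information that the scenario mode supports, and saving it in a designated configuration file. Determining whether the current scenario mode supports voice operation then comprises: looking up, in the configuration file, the voice operation prompt information supported by the scenario mode; if it is found, determining that the current scenario mode supports voice operation; if it is not found, determining that the current scenario mode does not support voice operation.
Take the interface of a map navigation application running in the foreground as an example. The application may have multiple levels of interfaces, and the voice operations supported by each interface are not entirely the same, so each interface can also be treated as a separate scenario mode, with the corresponding voice operation prompt information saved in the configuration file.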
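The configuration-file lookup can be sketched as below. The application does not specify a file format, so the JSON layout, the keys such as `map_nav/home`, and the prompt texts "Start navigation" and "Cancel route" are assumptions made purely for illustration.

```python
import json
from typing import Dict, List

# Hypothetical configuration: one entry per interface of a map navigation app,
# each treated as its own scenario mode.
CONFIG_TEXT = """
{
  "map_nav/home": [{"text": "Navigate to", "weight": 1.0}],
  "map_nav/route_preview": [
    {"text": "Start navigation", "weight": 1.0},
    {"text": "Cancel route", "weight": 1.0}
  ]
}
"""


def load_config(text: str) -> Dict[str, List[dict]]:
    """Parse the configuration (in practice this might be read from a file at startup)."""
    return json.loads(text)


def find_prompts(config: Dict[str, List[dict]], scenario_mode: str) -> List[dict]:
    """Return the prompts configured for a scenario mode, or an empty list."""
    return config.get(scenario_mode, [])


def mode_supports_voice(config: Dict[str, List[dict]], scenario_mode: str) -> bool:
    """A mode supports voice operation exactly when prompts are found for it."""
    return bool(find_prompts(config, scenario_mode))
```

Keying the table by interface keeps the support check and the prompt lookup in a single step, which fits the text's point that the determination costs little.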
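In an embodiment of the present invention, in the above method, selecting at least one piece of voice operation prompt information supported by the current scenario mode comprises: determining the number of pieces of voice operation prompt information to be displayed; and, from the voice operation prompt information found to be supported by the current scenario mode, randomly selecting a number of pieces equal to the determined display number, and/or selecting that number of pieces by weighted random selection according to the weights set for each piece of voice operation prompt information in the configuration file.
If the determined number of pieces of voice operation prompt information to be displayed in the floating window is greater than the number of pieces found for the current scenario mode, all the voice operation prompt information found for the current scenario mode is displayed. As for random selection, all pieces of voice operation prompt information may be treated equally, or they may be weighted. For example, if an application gains a new function after an update and that function also supports voice operation, the user may not yet be able to use it fluently and needs to be prompted; the weight of the voice operation prompt information corresponding to that function can then be set higher.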
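The selection rule can be sketched as follows. The application only says "random" or "weighted random"; sampling without replacement, so that no prompt appears twice, is an assumption of this sketch, and the function name `select_prompts` is hypothetical.

```python
import random
from typing import List, Optional, Sequence


def select_prompts(
    prompts: Sequence[str],
    display_count: int,
    weights: Optional[Sequence[float]] = None,
    rng: Optional[random.Random] = None,
) -> List[str]:
    """Pick `display_count` distinct prompts, uniformly or by weight."""
    rng = rng or random.Random()
    # If more slots than prompts are available, show everything that was found.
    if display_count >= len(prompts):
        return list(prompts)
    if weights is None:
        return rng.sample(list(prompts), display_count)  # unweighted choice

    pool = list(zip(prompts, weights))
    chosen: List[str] = []
    for _round in range(display_count):
        total = sum(weight for _prompt, weight in pool)
        pick = rng.uniform(0, total)
        cumulative = 0.0
        for index, (prompt, weight) in enumerate(pool):
            cumulative += weight
            if pick < cumulative:
                chosen.append(prompt)
                pool.pop(index)  # remove so the same prompt is not drawn again
                break
        else:
            # Floating-point edge case (pick == total): fall back to the last item.
            chosen.append(pool.pop()[0])
    return chosen
```

For instance, `select_prompts(["Call", "Close", "Navigate to", "Open music"], 2, weights=[1, 1, 5, 1])` would tend to include "Navigate to" more often than the others, which is the effect the text describes for a newly added function.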
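As the user gains experience, many commonly used voice operations become familiar, and the user may no longer need prompts for them. Alternatively, a voice operation may be used frequently and yet remain hard for the user to remember even after many uses. The selection of voice operation prompt information therefore needs to be adjusted according to the user's usage habits. Accordingly, in an embodiment of the present invention, the above method further comprises: recording the voice operation prompt information used by the user; and adjusting, according to the usage record of the voice operation prompt information, the weights preset for each piece of voice operation prompt information in the configuration file.
Specifically, the voice operation prompt information is prompt information containing a voice keyword, and recording the voice operation prompt information used by the user comprises: when the user speaks a voice keyword, incrementing the usage count of the corresponding voice operation prompt information by one.
For example, "Call" (打电话给), "Close" (关闭), "Navigate to" (导航去) and "Open music" (打开音乐) shown in Fig. 2-a are four voice keywords, and each time the user speaks one of them the use can be recorded accordingly.
In an embodiment of the present invention, in the above method, adjusting the weights preset for each piece of voice operation prompt information in the configuration file according to the usage record comprises: when the user's usage count for a piece of voice operation prompt information reaches a preset value, raising or lowering the weight of that piece of voice operation prompt information to the weight value corresponding to that preset value.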
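Usage recording and threshold-based weight adjustment can be sketched as below. The class `VoicePrompt`, the table `THRESHOLD_WEIGHTS` and its values are hypothetical; the application states only that reaching a preset usage count moves the weight to a value associated with that count. Matching a keyword by substring is likewise a simplification.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class VoicePrompt:
    keyword: str
    weight: float = 1.0
    use_count: int = 0


# Hypothetical preset values: usage count -> weight applied when that count is reached.
THRESHOLD_WEIGHTS: Dict[int, float] = {5: 0.5, 20: 0.1}


def record_utterance(
    prompts: List[VoicePrompt],
    utterance: str,
    thresholds: Dict[int, float] = THRESHOLD_WEIGHTS,
) -> Optional[VoicePrompt]:
    """Count one use of the first prompt whose keyword occurs in the utterance."""
    for prompt in prompts:
        if prompt.keyword in utterance:
            prompt.use_count += 1
            # When the count reaches a preset value, move the weight to the
            # value associated with that preset value (up or down).
            if prompt.use_count in thresholds:
                prompt.weight = thresholds[prompt.use_count]
            return prompt
    return None
```

Lowering the weight at a threshold models a user who has become fluent with a keyword; a table that raises the weight instead would model the opposite case in the text, a frequently used operation that the user still struggles to remember.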
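In the above method, the configuration file may be loaded when the smart device starts, so as to ensure that the above functions operate correctly.
In an embodiment of the present invention, the above method further comprises: displaying the voice operation prompt information in a floating window, and displaying, in the operating system of the smart device, a toggle switch for turning the floating window on or off.
A toggle switch for turning the floating window on or off is displayed in a settings menu item of the operating system. When the user turns the switch off, the number of times the user has used each piece of voice operation prompt information can also be counted and reported to a server, where the usage data of many users serves as a big-data sample for calculating the preset value that triggers adjustment of the weights of the voice operation prompt information.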
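Fig. 3 is a schematic structural diagram of an apparatus for voice control of a smart device according to an embodiment of the present invention. As shown in Fig. 3, the apparatus 300 for voice control of a smart device comprises:
A scenario mode obtaining unit 310, adapted to obtain the current scenario mode of the smart device when the screen of the smart device loads or switches a user interface.
The scenario mode may include: the interface of an application running in the foreground, and/or a function of an application that can currently be invoked.
For example, when the user opens a new application, the application starts running in the foreground and its interface is activated; the interface of that application is then obtained, and it is determined whether the interface supports voice operation. As another example, while a music playback application is running, the user opens another new application; two scenario modes are then obtained: the interface of the new application running in the foreground, and the music playback function that can be invoked in the background. Although the interface of the new foreground application does not support voice operation, the music playback function does (for example, pausing or skipping to another song), so the current scenario mode is still determined to support voice operation. This determination is made by the determining unit 320, which is adapted to determine whether the current scenario mode supports voice operation.
A display unit 330, adapted to select, when the current scenario mode supports voice operation, at least one piece of voice operation prompt information supported by the current scenario mode, and to display the selected at least one piece of voice operation prompt information at a designated position on the screen of the smart device.
Figs. 2-a and 2-b likewise show the effect of displaying at least some of the selected voice operation prompt information in a floating window.
It can be seen that, in the apparatus shown in Fig. 3, through the cooperation of its units, when the screen of the smart device loads or switches a user interface, the current scenario mode of the smart device is obtained and a single determination is made: if the scenario mode supports voice operation, at least one piece of voice operation prompt information supported by the current scenario mode is selected and displayed at a designated position on the screen of the smart device. This technical solution can determine, reasonably and with little resource consumption, whether the user needs voice operation prompts, so that the user can see at a glance on the screen whether voice operation is available at that moment and how to use it.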
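In an embodiment of the present invention, the above apparatus further comprises: a configuration unit 340, adapted to set, for each scenario mode of the smart device, the voice operation prompt information that the scenario mode supports, and to save it in a designated configuration file. The determining unit 320 is adapted to look up, in the configuration file, the voice operation prompt information supported by the scenario mode; if it is found, to determine that the current scenario mode supports voice operation; and if it is not found, to determine that the current scenario mode does not support voice operation.
Take the interface of a map navigation application running in the foreground as an example. The application may have multiple levels of interfaces, and the voice operations supported by each interface are not entirely the same, so each interface can also be treated as a separate scenario mode, with the corresponding voice operation prompt information saved in the configuration file.
In an embodiment of the present invention, in the above apparatus, the display unit 330 is adapted to determine the number of pieces of voice operation prompt information to be displayed and, from the voice operation prompt information found to be supported by the current scenario mode, to randomly select a number of pieces equal to the determined display number, and/or to select that number of pieces by weighted random selection according to the weights set for each piece of voice operation prompt information in the configuration file.
If the determined number of pieces of voice operation prompt information to be displayed in the floating window is greater than the number of pieces found for the current scenario mode, all the voice operation prompt information found for the current scenario mode is displayed. As for random selection, all pieces of voice operation prompt information may be treated equally, or they may be weighted. For example, if an application gains a new function after an update and that function also supports voice operation, the user may not yet be able to use it fluently and needs to be prompted; the weight of the voice operation prompt information corresponding to that function can then be set higher.
As the user gains experience, many commonly used voice operations become familiar, and the user may no longer need prompts for them. Alternatively, a voice operation may be used frequently and yet remain hard for the user to remember even after many uses. The selection of voice operation prompt information therefore needs to be adjusted according to the user's usage habits. Accordingly, in an embodiment of the present invention, the above apparatus further comprises: a recording unit 350, adapted to record the voice operation prompt information used by the user. The configuration unit 340 is further adapted to adjust, according to the usage record of the voice operation prompt information, the weights preset for each piece of voice operation prompt information in the configuration file.
Specifically, in an embodiment of the present invention, in the above apparatus, the voice operation prompt information is prompt information containing a voice keyword, and the recording unit 350 is adapted to increment the usage count of the corresponding voice operation prompt information by one when the user speaks a voice keyword.
For example, "Call" (打电话给), "Close" (关闭), "Navigate to" (导航去) and "Open music" (打开音乐) shown in Fig. 2-a are four voice keywords, and each time the user speaks one of them the use can be recorded accordingly.
In an embodiment of the present invention, in the above apparatus, the configuration unit 340 is adapted to raise or lower the weight of a piece of voice operation prompt information to the weight value corresponding to a preset value when the user's usage count for that piece of voice operation prompt information reaches the preset value.
In an embodiment of the present invention, in the above apparatus, the configuration unit 340 is adapted to load the configuration file when the smart device starts, so as to ensure that the above functions operate correctly.
In an embodiment of the present invention, in the above apparatus, the display unit 330 is further adapted to display the voice operation prompt information in a floating window, and to display, in the operating system of the smart device, a toggle switch for turning the floating window on or off.
A toggle switch for turning the floating window on or off is displayed in a settings menu item of the operating system. When the user turns the switch off, the number of times the user has used each piece of voice operation prompt information can also be counted and reported to a server, where the usage data of many users serves as a big-data sample for calculating the preset value that triggers adjustment of the weights of the voice operation prompt information.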
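Fig. 4 is a schematic structural diagram of a smart device according to an embodiment of the present invention. As shown in Fig. 4, the smart device comprises: a voice collection unit 410, a voice recognition unit 420, and the apparatus 300 for voice control of a smart device according to any of the above embodiments.
The voice collection unit 410 is adapted to collect voice data and to send the collected voice data to the application that is to receive it and/or to the voice recognition unit 420. Many applications (such as instant messaging applications) can use the collected voice data directly, in which case recognition by the voice recognition unit is not needed.
The voice recognition unit 420 is adapted to convert the voice data collected by the voice collection unit 410 into text data and to send the converted text data to the application that is to receive it. Specifically, this can be implemented with a speech recognition library.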
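The division of work between the two units can be sketched as follows. The function `route_voice_data` and its parameters are hypothetical, and `recognize` stands in for whatever speech recognition library is used; no specific library is named in the application.

```python
from typing import Callable, Union


def route_voice_data(
    audio: bytes,
    app_wants_raw_audio: bool,
    recognize: Callable[[bytes], str],
    deliver: Callable[[Union[bytes, str]], None],
) -> None:
    """Send collected audio to the receiving application, as audio or as text."""
    if app_wants_raw_audio:
        # e.g. an instant messaging application that sends voice messages as-is
        deliver(audio)
    else:
        # Otherwise the recognition step converts the audio to text first.
        deliver(recognize(audio))
```

Passing the recognizer in as a callable keeps the sketch independent of any particular recognition engine.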
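In the above embodiments, the smart device may be an in-vehicle smart device, a mobile terminal or a computer device. For example, a mobile phone, a dashboard camera or a smart rearview mirror can each serve as the smart device of the above embodiments.
In summary, in the technical solution of the present invention, when the screen of the smart device loads or switches a user interface, the current scenario mode of the smart device is obtained and a single determination is made: if the scenario mode supports voice operation, at least one piece of voice operation prompt information supported by the current scenario mode is selected and displayed at a designated position on the screen of the smart device. This technical solution can determine, reasonably and with little resource consumption, whether the user needs voice operation prompts, so that the user can see at a glance on the screen whether voice operation is available at that moment and how to use it.
Fig. 5 shows a smart device (hereinafter collectively referred to as the device) that can implement voice control according to the present invention. The device conventionally comprises a processor 510 and a computer program product or computer-readable medium in the form of a memory 520. The memory 520 may be an electronic memory such as flash memory, EEPROM (electrically erasable programmable read-only memory), EPROM, a hard disk or ROM. The memory 520 has a storage space 530 for program code 531 for performing any of the method steps described above. For example, the storage space 530 for program code may comprise individual pieces of program code 531, each implementing one of the steps of the above method. The program code can be read from, or written to, one or more computer program products. These computer program products comprise program code carriers such as hard disks, compact discs (CDs), memory cards or floppy disks. Such a computer program product is typically a portable or fixed storage unit as described with reference to Fig. 6. The storage unit may have storage segments, storage space and the like arranged similarly to the memory 520 in Fig. 5. The program code may, for example, be compressed in an appropriate form. Typically, the storage unit comprises program code 531' for performing the method steps according to the present invention, that is, code that can be read by a processor such as 510 and that, when run by the device, causes the device to perform the steps of the method described above.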
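It should be noted that:
The algorithms and displays provided here are not inherently related to any particular computer, virtual apparatus or other device. Various general-purpose apparatuses may also be used with the teachings given here. From the above description, the structure required to construct such apparatuses is apparent. Moreover, the present invention is not directed to any particular programming language. It should be understood that the content of the invention described here can be implemented in a variety of programming languages, and that the above description of a specific language is given in order to disclose the best mode of the invention.
Numerous specific details are set out in the description provided here. It will be understood, however, that embodiments of the invention can be practiced without these specific details. In some instances, well-known methods, structures and techniques are not shown in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline the disclosure and aid understanding of one or more of the various inventive aspects, features of the invention are sometimes grouped together in a single embodiment, figure or description thereof in the above description of exemplary embodiments. This method of disclosure, however, should not be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in fewer than all features of a single embodiment disclosed above. The claims following the detailed description are therefore expressly incorporated into the detailed description, with each claim standing on its own as a separate embodiment of the invention.
Those skilled in the art will understand that the modules in the device of an embodiment can be adaptively changed and arranged in one or more devices different from that embodiment. The modules, units or components of the embodiments can be combined into one module, unit or component, and can furthermore be divided into multiple sub-modules, sub-units or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings), and all processes or units of any method or device so disclosed, may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by an alternative feature serving the same, an equivalent or a similar purpose.
Furthermore, those skilled in the art will understand that, although some embodiments described here include certain features that are included in other embodiments and not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments can be used in any combination.
The various component embodiments of the invention may be implemented in hardware, in software modules running on one or more processors, or in a combination of the two. Those skilled in the art will understand that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the apparatus for voice control of a smart device and of the smart device according to embodiments of the invention. The invention may also be implemented as a device or apparatus program (for example, a computer program and a computer program product) for performing part or all of the method described here. Such a program implementing the invention may be stored on a computer-readable medium, or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
It should be noted that the above embodiments illustrate the invention and do not limit it, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference symbol placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third and so on does not indicate any order; these words may be interpreted as names.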

Claims (24)

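  1. An apparatus for voice control of a smart device, wherein the apparatus comprises:
    at least one processor;
    and at least one memory communicatively connected to the at least one processor; the at least one memory comprising processor-executable instructions which, when executed by the at least one processor, cause the apparatus to perform at least the following operations:
    when the screen of the smart device loads or switches a user interface, obtaining the current scenario mode of the smart device;
    determining whether the current scenario mode supports voice operation;
    when the current scenario mode supports voice operation, selecting at least one piece of voice operation prompt information supported by the current scenario mode, and displaying the selected at least one piece of voice operation prompt information at a designated position on the screen of the smart device.
  2. The apparatus of claim 1, wherein the smart device is an in-vehicle smart device, a mobile terminal or a computer device.
  3. The apparatus of claim 1, wherein the current scenario mode comprises: the interface of an application running in the foreground, and/or a function of an application that can currently be invoked.
  4. The apparatus of claim 1, wherein the operations further comprise:
    setting, for each scenario mode of the smart device, the voice operation prompt information that the scenario mode supports, and saving it in a designated configuration file;
    the determining whether the current scenario mode supports voice operation comprises: looking up, in the configuration file, the voice operation prompt information supported by the scenario mode; if it is found, determining that the current scenario mode supports voice operation; if it is not found, determining that the current scenario mode does not support voice operation.
  5. The apparatus of claim 4, wherein the selecting at least one piece of voice operation prompt information supported by the current scenario mode comprises:
    determining the number of pieces of voice operation prompt information to be displayed;
    from the voice operation prompt information found to be supported by the current scenario mode, randomly selecting a number of pieces of voice operation prompt information equal to the determined display number, and/or selecting, by weighted random selection according to the weights set for each piece of voice operation prompt information in the configuration file, a number of pieces of voice operation prompt information equal to the determined display number.
  6. The apparatus of claim 5, wherein the operations further comprise:
    recording the voice operation prompt information used by the user;
    adjusting, according to the usage record of the voice operation prompt information, the weights set for each piece of voice operation prompt information in the configuration file.
  7. The apparatus of claim 6, wherein the voice operation prompt information is prompt information containing a voice keyword;
    the recording the voice operation prompt information used by the user comprises: when the user speaks a voice keyword, incrementing the usage count of the corresponding voice operation prompt information by one.
  8. The apparatus of claim 6, wherein the adjusting, according to the usage record of the voice operation prompt information, the weights set for each piece of voice operation prompt information in the configuration file comprises:
    when the user's usage count for a piece of voice operation prompt information reaches a preset value, raising or lowering the weight of that piece of voice operation prompt information to the weight corresponding to the preset value.
  9. The apparatus of any one of claims 4-8, wherein the operations further comprise:
    loading the configuration file when the smart device starts.
  10. The apparatus of claim 1, wherein the operations further comprise:
    displaying the voice operation prompt information in a floating window, and displaying, in the operating system of the smart device, a toggle switch for turning the floating window on or off.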
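  11. A method for voice control of a smart device, wherein the method comprises:
    when the screen of the smart device loads or switches a user interface, obtaining the current scenario mode of the smart device;
    determining whether the current scenario mode supports voice operation;
    if it does, selecting at least one piece of voice operation prompt information supported by the current scenario mode, and displaying the selected at least one piece of voice operation prompt information at a designated position on the screen of the smart device.
  12. The method of claim 11, wherein the smart device is an in-vehicle smart device, a mobile terminal or a computer device.
  13. The method of claim 11, wherein the current scenario mode comprises: the interface of an application running in the foreground, and/or a function of an application that can currently be invoked.
  14. The method of claim 11, wherein the method further comprises:
    setting, for each scenario mode of the smart device, the voice operation prompt information that the scenario mode supports, and saving it in a designated configuration file;
    the determining whether the current scenario mode supports voice operation comprises: looking up, in the configuration file, the voice operation prompt information supported by the scenario mode; if it is found, determining that the current scenario mode supports voice operation; if it is not found, determining that the current scenario mode does not support voice operation.
  15. The method of claim 14, wherein the selecting at least one piece of voice operation prompt information supported by the current scenario mode comprises:
    determining the number of pieces of voice operation prompt information to be displayed;
    from the voice operation prompt information found to be supported by the current scenario mode, randomly selecting a number of pieces of voice operation prompt information equal to the determined display number; and/or selecting, by weighted random selection according to the weights set for each piece of voice operation prompt information in the configuration file, a number of pieces of voice operation prompt information equal to the determined display number.
  16. The method of claim 15, wherein the method further comprises:
    recording the voice operation prompt information used by the user;
    adjusting, according to the usage record of the voice operation prompt information, the weights set for each piece of voice operation prompt information in the configuration file.
  17. The method of claim 16, wherein the voice operation prompt information is prompt information containing a voice keyword;
    the recording the voice operation prompt information used by the user comprises: when the user speaks a voice keyword, incrementing the usage count of the corresponding voice operation prompt information by one.
  18. The method of claim 16, wherein the adjusting, according to the usage record of the voice operation prompt information, the weights set for each piece of voice operation prompt information in the configuration file comprises:
    when the user's usage count for a piece of voice operation prompt information reaches a preset value, raising or lowering the weight of that piece of voice operation prompt information to the weight corresponding to the preset value.
  19. The method of any one of claims 14-18, wherein the method further comprises:
    loading the configuration file when the smart device starts.
  20. The method of claim 11, wherein the method further comprises:
    displaying the voice operation prompt information in a floating window, and displaying, in the operating system of the smart device, a toggle switch for turning the floating window on or off.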
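  21. A smart device, wherein the device comprises:
    one or more processors;
    a memory;
    one or more application programs, wherein the one or more application programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs being configured to perform the following operations:
    collecting voice data, and sending the collected voice data to the application that is to receive it;
    converting the collected voice data into text data, and sending the converted text data to the application that is to receive it; and
    the operations performed by the apparatus of any one of claims 1-10.
  22. The smart device of claim 21, wherein the smart device is an in-vehicle smart device, a mobile terminal or a computer device.
  23. A computer program comprising computer-readable code which, when run by a smart device, causes the method of any one of claims 11-20 to be performed.
  24. A computer-readable medium storing the computer program of claim 23.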
PCT/CN2018/085442 2017-05-04 2018-05-03 语音控制智能设备的方法、装置和智能设备 WO2018202073A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710309069.6A CN107277225B (zh) 2017-05-04 2017-05-04 语音控制智能设备的方法、装置和智能设备
CN201710309069.6 2017-05-04

Publications (1)

Publication Number Publication Date
WO2018202073A1 true WO2018202073A1 (zh) 2018-11-08

Family

ID=60074305

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/085442 WO2018202073A1 (zh) 2017-05-04 2018-05-03 语音控制智能设备的方法、装置和智能设备

Country Status (2)

Country Link
CN (1) CN107277225B (zh)
WO (1) WO2018202073A1 (zh)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277225B (zh) * 2017-05-04 2020-04-24 北京奇虎科技有限公司 语音控制智能设备的方法、装置和智能设备
CN108965968B (zh) * 2018-07-25 2021-04-30 聚好看科技股份有限公司 智能电视操作提示的展示方法、装置及计算机存储介质
CN109584879B (zh) 2018-11-23 2021-07-06 华为技术有限公司 一种语音控制方法及电子设备
US20210333869A1 (en) * 2018-11-30 2021-10-28 Lg Electronics Inc. Vehicle control device and vehicle control method
CN109346081A (zh) * 2018-12-20 2019-02-15 广州河东科技有限公司 一种语音控制方法、装置、设备和存储介质
CN111414145A (zh) * 2019-01-04 2020-07-14 上海擎感智能科技有限公司 语音功能使用提示方法及装置
CN111552794B (zh) * 2020-05-13 2023-09-19 海信电子科技(武汉)有限公司 提示语生成方法、装置、设备和存储介质
CN112887805B (zh) * 2021-01-12 2023-01-20 南京创维信息技术研究院有限公司 语音功能提示方法、装置、设备及介质
CN114115790A (zh) * 2021-11-12 2022-03-01 上汽通用五菱汽车股份有限公司 语音对话提示方法、装置、设备及计算机可读存储介质

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1764896A (zh) * 2003-04-07 2006-04-26 诺基亚有限公司 在具有用户接口的电子设备中提供允许语音的输入的方法和设备
US20090165145A1 (en) * 2007-12-21 2009-06-25 Nokia Corporation Changing modes in a device
CN103200329A (zh) * 2013-04-10 2013-07-10 威盛电子股份有限公司 语音操控方法、移动终端装置及语音操控系统
CN105975511A (zh) * 2016-04-27 2016-09-28 乐视控股(北京)有限公司 智能对话的方法及装置
CN106233246A (zh) * 2014-04-22 2016-12-14 三菱电机株式会社 用户界面系统、用户界面控制装置、用户界面控制方法和用户界面控制程序
CN106297791A (zh) * 2016-08-25 2017-01-04 Tcl集团股份有限公司 一种全程语音实现方法及系统
CN107277225A (zh) * 2017-05-04 2017-10-20 北京奇虎科技有限公司 语音控制智能设备的方法、装置和智能设备

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102883041A (zh) * 2012-08-02 2013-01-16 聚熵信息技术(上海)有限公司 移动终端的语音控制装置及方法
CN106601242A (zh) * 2015-10-16 2017-04-26 中兴通讯股份有限公司 操作事件的执行方法及装置、终端

Also Published As

Publication number Publication date
CN107277225B (zh) 2020-04-24
CN107277225A (zh) 2017-10-20

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18794546

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18794546

Country of ref document: EP

Kind code of ref document: A1