WO2016082344A1 - 一种语音控制的方法、装置及存储介质 - Google Patents

一种语音控制的方法、装置及存储介质 Download PDF

Info

Publication number
WO2016082344A1
WO2016082344A1 PCT/CN2015/072705 CN2015072705W WO2016082344A1 WO 2016082344 A1 WO2016082344 A1 WO 2016082344A1 CN 2015072705 W CN2015072705 W CN 2015072705W WO 2016082344 A1 WO2016082344 A1 WO 2016082344A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
input
user
preset
preset information
Prior art date
Application number
PCT/CN2015/072705
Other languages
English (en)
French (fr)
Inventor
魏占婷
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016082344A1 publication Critical patent/WO2016082344A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725Cordless telephones

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a voice control method, apparatus, and storage medium.
  • the terminal can Automatically dial 110; however, after the user makes a "dial 110" sound, the other person knows the user's intention, can immediately block it, cut off the terminal to dial 110, thereby affecting the user to implement self-help and so on.
  • the method in which the terminal operates according to the direct meaning of the user's voice is insecure, and other users can more easily acquire the user's intention, thereby affecting the user's operation.
  • embodiments of the present invention mainly provide a method, an apparatus, and a storage medium for voice control.
  • the embodiment of the invention provides a method for voice control, which is applied to a terminal side, and the method includes:
  • the terminal side starts the preset function, it is determined whether the terminal side has a signature voice consistent with the input voice.
  • the pre-set preset information that is not related to the meaning of the identifier voice is acquired according to the identifier voice;
  • the embodiment of the present invention further provides a device for voice control, which is applied to a terminal side, where the device includes: a voice acquiring module, a determining module, a preset information acquiring module, and a first executing module;
  • a voice acquisition module configured to acquire an input voice of the user
  • a determining module configured to determine whether the terminal side has a signature voice consistent with the input voice, if the preset function is enabled on the terminal side;
  • a preset information acquiring module configured to: if the identification voice exists, obtain preset information that is not related to the meaning of the identifier voice according to the identifier voice;
  • the first execution module is configured to perform an operation corresponding to the preset information.
  • the embodiment of the present invention further provides a terminal, where the terminal includes a processor, and the processor is configured to acquire an input voice of the user; if the terminal side starts the preset function, it is determined whether the terminal side stores the pre-installation And the preset voice that is not related to the meaning of the voice is obtained according to the voice, and the operation corresponding to the preset information is performed.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the voice control method.
  • the voice control method, device, and storage medium of the embodiment of the present invention after obtaining the input voice of the user, the identifier voice pre-stored by the terminal side is matched, and the matching is obtained after the matching is obtained.
  • the preset information that the meaning of the voice is not related, so that the terminal performs the operation corresponding to the preset information; in the embodiment of the present invention, the preset information that is not related to the meaning of the identified voice is preset, so that other users cannot directly obtain the user.
  • the real intention is to realize personalized voice control settings, which greatly improves the security and service of the terminal voice input; at the same time, it improves user satisfaction.
  • FIG. 1 is a flow chart showing the basic steps of a method for voice control according to an embodiment of the present invention
  • FIG. 2 is a flow chart showing the basic steps of a method for setting preset information in a method for voice control according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of an apparatus for voice control according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram showing a connection relationship of a specific structure of a device for voice control according to an embodiment of the present invention
  • FIG. 5 is a flowchart showing the execution of a specific embodiment 1 of the present invention.
  • Figure 6 is a flowchart showing the execution of a second embodiment of the present invention.
  • Figure 7 is a flowchart showing the execution of a third embodiment of the present invention.
  • Figure 8 is a flowchart showing the execution of a fourth embodiment of the present invention.
  • Figure 9 is a flow chart showing the execution of a fifth embodiment of the present invention.
  • the present invention is directed to the problem that the voice control mode of the terminal is not high in the prior art, and provides a voice control method and device, which is matched with the identifier voice pre-stored by the terminal side after acquiring the input voice of the user, and the matching is consistent.
  • the preset information that is not related to the meaning of the voice is obtained, so that the terminal performs the operation corresponding to the preset information.
  • the preset information that is not related to the meaning of the voice is preset, so that other users cannot directly Get user
  • the real intention is to realize personalized voice control settings, which greatly improves the security and service of the terminal voice input; at the same time, it improves user satisfaction.
  • an embodiment of the present invention provides a voice control method, which is applied to a terminal side, and includes:
  • Step 11 Acquire an input voice of the user
  • Step 12 If the terminal side starts the preset function, it is determined whether the terminal side has a signature voice that is consistent with the input voice.
  • Step 13 If the identifier voice exists, obtain preset information that is not related to the meaning of the identifier voice according to the identifier voice.
  • Step 14 Perform an operation corresponding to the preset information.
  • the input voice of the user is the voice sent by the user.
  • the terminal is provided with a human interface module, which is an interface for detecting the collected user voice, and is used for collecting the collected voice.
  • the sound is transmitted to the central processing unit of the terminal; the central processing unit of the terminal side performs step 12 and step 13, that is, parsing the input voice of the user, and calling the preset information corresponding to the identification voice consistent with the input voice of the user, wherein, The security of the input voice is guaranteed, and the preset information is not related to the meaning of the voice.
  • the specific setting steps of the preset preset information that is not related to the meaning of the identification voice include:
  • Step 21 Acquire preset information input by the user through a preset interface, where the preset information is used to instruct the terminal to perform a corresponding operation;
  • step 22 in response to the operation of the user inputting a voice through the voice interface, the input voice is set as the identifier voice set by the preset information; wherein the preset information and the content of the identifier voice are not related.
  • the preset information is an operation content that the user actually wants the terminal to perform, and the preset information needs to be customized by the user through a preset interface, where the preset interface is mainly packaged.
  • the voice control setting method provided by the embodiment of the present invention the user needs to set the identifier voice for the preset information through the voice interface, and the identifier voice and the preset information have a one-to-one correspondence; that is, the terminal detects the user voice, if If the user voice is one of the voices, the preset information corresponding to the voice is obtained, and the terminal performs the operation corresponding to the preset information.
  • the setting method provided by the embodiment of the present invention makes the terminal not directly perform operations according to the actual meaning of the user voice, thereby improving the security of the voice control method of the terminal.
  • step 11 when the preset interface is an input text interface, step 11 is specifically:
  • Step 211 Acquire text input by the user through an input text interface.
  • step 11 is specifically:
  • Step 212 Acquire a voice input by the user through an input voice interface.
  • step 11 is specifically:
  • Step 213 Acquire an instruction preset by a user
  • Step 214 Acquire an instruction that the user selects from the preset instructions by calling an instruction interface.
  • the interface for inputting text on the terminal side is a text input mode on the UI of the user interface;
  • the interface for inputting voice is a voice input mode on the UI of the user interface;
  • the interface for invoking the command is on the UI of the user interface.
  • Command input mode specifically, the terminal can customize the usage scene of “text, command, voice input”. For example, in all editing interfaces of the terminal, text input can be started; in the software chat tool dialog interface, text and voice input can be started; Browsing the web page can initiate command input such as "page turning, exiting".
  • the mobile phone detects the voice consistent with the defined voice "yes", and automatically enters the text "I am at home” in the edit box.
  • the mobile phone detects the voice consistent with the defined voice "yes” and automatically sends the voice "test success”.
  • the user selects a page turning command to define a "page turning” voice for "page turning”, and can record a user voice or other voice defined by the user;
  • the mobile phone detects the voice consistent with the defined voice "page turning", and the web page or document will automatically turn the page.
  • the method for providing voice control in the embodiment of the present invention further sets a configuration switch when the terminal is set, that is, the method can be effective only when the configuration switch is turned on, and if the configuration switch is turned off, the terminal can normally recognize the user voice, and The operation corresponding to the actual meaning of the user voice is performed, and the setting of the configuration switch is such that the original function of the terminal is not affected.
  • the configuration method implements a method of custom voice input, which greatly improves the security of the terminal.
  • the method further includes:
  • Step 31 Parse the input voice, and determine a meaning of the input voice
  • Step 32 Perform a corresponding operation according to the meaning of the input voice.
  • the user inputs the actual through the preset interface.
  • the content needs to be executed, and the voice is set for the actual content to be executed through the voice interface.
  • the terminal After the terminal detects the voice, the terminal needs to execute the actual content to be executed corresponding to the preset information, and implements personalized voice control.
  • the setting greatly improves the security and serviceability of the terminal voice input; at the same time, the user satisfaction is improved.
  • the embodiment of the present invention further provides a device for voice control, which is applied to the terminal side, and includes:
  • the voice acquiring module 301 is configured to acquire an input voice of the user.
  • the determining module 302 is configured to: if the terminal side starts the preset function, determine whether the terminal side stores the identification voice consistent with the input voice in advance;
  • the preset information obtaining module 303 is configured to: if the identification voice exists, obtain preset information that is not related to the meaning of the identifier voice according to the identifier voice;
  • the first execution module 304 is configured to perform an operation corresponding to the preset information.
  • the device further includes:
  • a parsing module configured to parse the input voice to determine a meaning of the input voice
  • the second execution module is configured to perform a corresponding operation according to the meaning of the input voice.
  • the device further includes:
  • An acquiring module configured to acquire preset information that is input by the user through a preset interface, where the preset information is used to instruct the terminal to perform a corresponding operation;
  • a setting module configured to respond to the operation of the user inputting a voice through a voice interface, and set the input voice as an identifier voice set by the preset information; where the preset information and the content of the identifier voice are not Related.
  • the acquiring module includes:
  • the first obtaining submodule is configured to acquire text input by the user through an input text interface.
  • the acquiring module includes:
  • the second obtaining submodule is configured to acquire the voice input by the user through the input voice interface.
  • the acquiring module includes:
  • a third obtaining submodule configured to obtain an instruction preset by the user
  • a fourth acquiring submodule configured to acquire an instruction that the user selects from the preset instructions by calling an instruction interface.
  • the function of the voice acquiring module 301 is actually implemented by a human interface module on the terminal, and the corresponding functions of the determining module 302, the preset information acquiring module 303, and the executing module 304 are a central processing unit on the terminal.
  • the terminal further includes a UI interface and a setting module; the specific connection relationship is as shown in FIG.
  • the setting module provides the user to customize the actual input content, and provides corresponding customized sound and storage functions;
  • the human-machine interface module Detecting the interface for collecting user's voice, which is connected to the setting module through the central processing unit to collect sound and transmit the information to the central processor;
  • the central processor is responsible for the human-machine interface module, the UI module, the setting module and other functional modules, and processes User voice, and call the custom voice input module custom corresponding input, and display the corresponding input in the UI interface;
  • UI interface according to the processing and calling of the central processor, the user-defined actual input content is displayed in the UI interface.
  • the user issues a voice “yes”, and the terminal determines whether it has a custom stored voice; if the voice is not stored, the terminal does not respond; if the voice is stored, the terminal reads The preset information corresponding to the sound, for example, enter "I am at home” in the text box, and then automatically enter "I am at home” in the message edit box.
  • the user issues a sound, such as “something to make a call”, and the terminal determines whether it has a custom stored sound; if the sound is not stored, the terminal does not respond; if the sound is stored, The terminal reads the preset information corresponding to the sound, for example, automatically transmitting the voice content “test successful”, and then the terminal automatically sends the voice information “test success” to the information receiver.
  • a sound such as “something to make a call”
  • the terminal determines whether it has a custom stored sound; if the sound is not stored, the terminal does not respond; if the sound is stored,
  • the terminal reads the preset information corresponding to the sound, for example, automatically transmitting the voice content “test successful”, and then the terminal automatically sends the voice information “test success” to the information receiver.
  • the terminal determines whether it has a custom stored sound; if the sound is not stored, the terminal does not respond; if there is stored the sound
  • the terminal reads the preset information corresponding to the sound, for example, automatically sends the voice content "I was caught by the police", and the terminal automatically sends the voice message "I was caught by the police".
  • Embodiment 5 is a diagrammatic representation of Embodiment 5:
  • the terminal determines whether it has stored the sound by itself; if the sound is not stored, the terminal does not respond; if the sound is stored, The terminal reads the preset information corresponding to the sound, for example, the webpage automatically scrolls down one page, and the webpage on the terminal automatically flips to the next page.
  • the embodiment of the present invention further provides a terminal, where the terminal includes a processor, and the processor is configured to acquire an input voice of the user; And determining, by the terminal side, whether the identifier voice is consistent with the input voice; if the identifier voice is present, acquiring, according to the identifier voice, preset information that is not related to the meaning of the identifier voice. Performing an operation corresponding to the preset information.
  • the voice control method according to the embodiment of the present invention may also be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a stand-alone product. in.
  • the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product stored in a storage medium, including a plurality of instructions.
  • a computer device (which may be a personal computer, server, or network device, etc.) is caused to perform all or part of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a removable hard disk, a read-only memory (ROM), a magnetic disk, or an optical disk, and the like, which can store program codes.
  • ROM read-only memory
  • magnetic disk or an optical disk, and the like, which can store program codes.
  • optical disk and the like, which can store program codes.
  • the embodiment of the present invention further provides a computer storage medium, wherein a computer program for executing the voice control method of the embodiment of the present invention is stored.
  • the apparatus for voice control provided by the embodiment of the present invention is a device that utilizes the method of voice control described above, and all embodiments of the foregoing methods are applicable to the device, and all of the same or similar beneficial effects can be achieved.
  • the preset information that is not related to the meaning of the identified voice is preset, so that other users cannot directly obtain the true intention of the user, and the personalized voice control setting is realized, thereby greatly improving the security of the voice input of the terminal. Serviceability; at the same time, improved user satisfaction.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

本发明提供一种语音控制的方法、装置及存储介质,应用于终端侧的方法包括:获取用户的输入语音;若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;执行所述预设信息对应的操作。

Description

一种语音控制的方法、装置及存储介质 技术领域
本发明涉及通信技术领域,特别涉及一种语音控制的方法、装置及存储介质。
背景技术
手机已成为人们日常生活中形影不离的工具,手机使用安全性显得越来越重要,语音输入使用的频率越来越多,目前市面上的语音输入是终端识别用户语音后,对用户实际的语音含义复述或显示。例如利用Siri(苹果公司推出的一项语音控制功能)用户可以通过手机读短信、介绍餐厅、询问天气、语音设置闹钟等;Siri可以支持自然语言输入,并且可以调用***自带的天气预报、日程安排、搜索资料等应用,还能够不断学习新的声音和语调,提供对话式的应答。但是,由于现有技术中的语音输入是控制终端执行语音输入的实际含义,该种方法容易被其他用户轻易获知其目的,例如发生危险情况时,用户须发出“拨打110”的声音,终端才能自动拨打110;但是此时用户发出“拨打110”的声音后,别人就知道了该用户的意图,可以立即对其进行阻断,切断终端拨打110的操作,从而影响用户实施自救等等。综上,终端根据用户发出声音的直接含义进行操作的方法缺乏安全性,其他用户能够较容易获取用户意图,从而影响用户操作。
发明内容
为解决现有存在的技术问题,本发明实施例主要期望提供一种语音控制的方法、装置及存储介质。
本发明实施例提供一种语音控制的方法,应用于终端侧,该方法包括:
获取用户的输入语音;
若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;
若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;
执行所述预设信息对应的操作。
本发明实施例还提供一种语音控制的装置,应用于终端侧,该装置包括:语音获取模块、确定模块、预设信息获取模块、第一执行模块;其中,
语音获取模块,配置为获取用户的输入语音;
确定模块,配置为若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;
预设信息获取模块,配置为若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;
第一执行模块,配置为执行所述预设信息对应的操作。
本发明实施例还提供一种终端,该终端包括处理器,所述处理器,配置为获取用户的输入语音;若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;执行所述预设信息对应的操作。
本发明实施例还提供一种计算机存储介质,所述计算机存储介质中存储有计算机可执行指令,所述计算机可执行指令用于执行上述的语音控制的方法。
本发明的上述技术方案至少具有如下有益效果:
本发明实施例的语音控制的方法、装置及存储介质中,通过获取用户的输入语音后与终端侧预先存储的标识语音相匹配,匹配一致后获取与标 识语音的含义不相关的预设信息,从而所述终端执行预设信息对应的操作;本发明实施例中通过预先设置的与标识语音的含义不相关的预设信息使得其他用户无法直接获取用户的真实意图,实现了个性化的语音控制设置,大大提高了终端语音输入的安全性和服务性;同时提高了用户满意度。
附图说明
图1表示本发明实施例的语音控制的方法的基本步骤流程图;
图2表示本发明实施例的语音控制的方法中设置预设信息的方法的基本步骤流程图;
图3表示本发明实施例的语音控制的装置的结构示意图;
图4表示本发明实施例的语音控制的装置的具体结构的连接关系示意图;
图5表示本发明的具体实施例一的执行流程图;
图6表示本发明的具体实施例二的执行流程图;
图7表示本发明的具体实施例三的执行流程图;
图8表示本发明的具体实施例四的执行流程图;
图9表示本发明的具体实施例五的执行流程图。
具体实施方式
为使本发明要解决的技术问题、技术方案和优点更加清楚,下面将结合附图及具体实施例进行详细描述。
本发明针对现有技术中终端的语音控制方式安全性不高的问题,提供一种语音控制的方法及装置,通过获取用户的输入语音后与终端侧预先存储的标识语音相匹配,匹配一致后获取与标识语音的含义不相关的预设信息,从而所述终端执行预设信息对应的操作;本发明实施例中通过预先设置的与标识语音的含义不相关的预设信息使得其他用户无法直接获取用户 的真实意图,实现了个性化的语音控制设置,大大提高了终端语音输入的安全性和服务性;同时提高了用户满意度。
如图1所示,本发明实施例提供一种语音控制的方法,应用于终端侧,包括:
步骤11,获取用户的输入语音;
步骤12,若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;
步骤13,若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;
步骤14,执行所述预设信息对应的操作。
本发明的上述实施例中,用户的输入语音即为用户发出的声音,具体的,终端上设置有一人机接口模块,该人机接口模块是检测收集用户声音的接口,并用于将收集到的声音传递至终端的中央处理器;由终端侧的中央处理器执行步骤12及步骤13,即解析用户的输入语音,并调用与用户的输入语音一致的标识语音对应的预设信息,其中,为了保障输入语音的安全性,该预设信息与标识语音的含义不相关。
较佳的,如图2所示,所述预先设置的与所述标识语音的含义不相关的预设信息的具体设置步骤包括:
步骤21,获取所述用户通过预设接口输入的预设信息,所述预设信息用于指示所述终端执行相应操作;
步骤22,响应所述用户通过语音接口输入语音的操作,将输入的所述语音设置为所述预设信息设置的标识语音;其中,所述预设信息和所述标识语音的内容不相关。
本发明的上述实施例中,预设信息即为用户实际想让终端执行的操作内容,该预设信息需要用户通过预设接口自定义,其中,预设接口主要包 括输入文本的接口、输入语音的接口以及调用指令的接口。同时,本发明实施例提供的语音控制的设置方法中用户需通过语音接口为所述预设信息设置标识语音,该标识语音与预设信息为一一对应的关系;即终端检测用户语音,若用户语音为标识语音中的一种,则获取标识语音对应的预设信息,终端则执行上述预设信息对应的操作。本发明实施例提供的设置方法使得终端不是直接根据用户语音的实际含义执行操作,提高了终端的语音控制方法的安全性。
具体的,本发明具体实施例中,当预设接口为输入文本接口时,步骤11具体为:
步骤211,获取所述用户通过输入文本接口输入的文本。
或者,当预设接口为输入语音接口时,步骤11具体为:
步骤212,获取所述用户通过输入语音接口输入的语音。
或者,当预设接口为调用指令接口时,步骤11具体为:
步骤213,获取用户预先设置的指令;
步骤214,获取所述用户通过调用指令接口从所述预先设置的指令中选择的指令。
本发明实施例的具体应用中,终端侧的输入文本的接口为用户界面UI上的文本输入模式;输入语音的接口为用户界面UI上的语音输入模式;调用指令的接口为用户界面UI上的指令输入模式;具体的,终端可自定义“文本、指令、语音输入”的使用场景,例如,在终端所有编辑界面,可以启动文本输入;在软件聊天工具对话界面可以启动文本及语音输入;在浏览网页可以启动“翻页、退出”等指令输入。
例如,若用户选择自定义“文本输入”:
1.提供用户输入文本的接口,比如用户可以输入“我在家呢”;
2.为用户提供定义语音的接口,为“我在家呢”定义“yes”等语音, 可以录制用户声音或用户定义的其他声音;
3.在终端的任何编辑界面,手机检测到与定义的语音“yes”一致的语音,自动在编辑框内输入文本“我在家呢”。
若用户选择自定义“语音输入”:
1.提供用户输入语音的接口,比如用户输入语音“试验成功”;
2.为用户提供自定义语音的接口,为“试验成功”定义“yes”等语音,可以录制用户声音或用户定义的其他声音;
3.在互动聊天界面,手机检测到与定义的语音“yes”一致的语音,自动发送语音“试验成功”。
若用户选择自定义“指令输入”:
1.首先自定义一些指令,并提供用户调用指令的接口,比如定义“网页翻页”指令;
2.用户选择网页翻页指令,为“网页翻页”定义“翻页”等语音,可以录制用户声音或用户定义的其他声音;
3.在浏览器界面或文档阅读界面,手机检测到与定义的语音“翻页”一致的语音,网页或文档会自动翻页。
需要说明的是,本发明实施例提供语音控制的方法在终端中设置时还设置一配置开关,即打开上述配置开关该方法才能生效,若该配置开关关闭,则终端能够正常识别用户语音,并执行与用户语音的实际含义对应的操作,该配置开关的设置使得终端原有功能不受影响。该配置方法实现了自定义语音输入的方法,大大提高终端的安全性。
具体的,若所述终端侧未开启所述预设功能,所述方法还包括:
步骤31,解析所述输入语音,确定所述输入语音的含义;
步骤32,根据所述输入语音的含义,执行对应操作。
本发明实施例的预设信息的设置方法中,用户通过预设接口输入实际 需执行内容(预设信息),并通过语音接口为实际需执行内容设置标识语音,则终端检测到标识语音后对应需执行上述预设信息对应的实际需执行内容,实现了个性化的语音控制设置,大大提高了终端语音输入的安全性和服务性;同时提高了用户满意度。
为了更好的实现上述方法,如图3所示,本发明实施例还提供一种语音控制的装置,应用于终端侧,包括:
语音获取模块301,配置为获取用户的输入语音;
确定模块302,配置为若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;
预设信息获取模块303,配置为若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;
第一执行模块304,配置为执行所述预设信息对应的操作。
具体的,本发明上述实施例中,若所述终端侧未开启所述预设功能,所述装置还包括:
解析模块,配置为解析所述输入语音,确定所述输入语音的含义;
第二执行模块,配置为根据所述输入语音的含义,执行对应操作。
具体的,本发明上述实施例中,所述装置还包括:
获取模块,配置为获取所述用户通过预设接口输入的预设信息,所述预设信息用于指示所述终端执行相应操作;
设置模块,配置为响应所述用户通过语音接口输入语音的操作,将输入的所述语音设置为所述预设信息设置的标识语音;其中,所述预设信息和所述标识语音的内容不相关。
具体的,本发明上述实施例中,所述获取模块包括:
第一获取子模块,配置为获取所述用户通过输入文本接口输入的文本。
具体的,本发明上述实施例中,所述获取模块包括:
第二获取子模块,配置为获取所述用户通过输入语音接口输入的语音。
具体的,本发明上述实施例中,所述获取模块包括:
第三获取子模块,配置为获取用户预先设置的指令;
第四获取子模块,配置为获取所述用户通过调用指令接口从所述预先设置的指令中选择的指令。
本发明的具体实施例中,语音获取模块301的功能在终端上实际为一人机接口模块实现,确定模块302、预设信息获取模块303以及执行模块304的相应功能在终端上为一中央处理器实现;终端还包括一UI界面和一设置模块;具体的连接关系如图4所示,设置模块,提供用户自定义实际输入的内容,提供对应的自定义声音及存储功能;人机接口模块,检测收集用户声音的接口,它通过中央处理器与设置模块连接,用于收集声音并将信息传递到中央处理器;中央处理器,负责人机接口模块、UI模块,设置模块等功能模块,处理用户声音,并调用自定义语音输入模块自定义的对应输入,并将对应的输入显示在UI界面;UI界面:根据中央处理器的处理和调用情况,将用户自定义实际输入的内容显示在UI界面。
具体说明如下:
具体实施例一:
如图5所示,首先在消息编辑界面,用户发出声音“yes”,终端判断其是否自定义存储有这个声音;如果没有存储该声音,终端无响应;如有存储有该声音,终端读取该声音对应的预设信息,比如:在文本框输入“我在家呢”,然后在消息编辑框内自动输入“我在家呢”。
具体实施例二:
如图6所示,首先用户发出声音,比如“啊啊啊”,终端判断其是否自定义存储有这个声音;如果没有存储该声音,终端无响应;如有存储有该声音,终端读取该声音对应的预设信息,比如自动呼叫110,然后终端则自 动呼叫110。
具体实施例三:
如图7所示,首先在信息编辑界面,用户发出声音,比如“有事打电话”,终端判断其是否自定义存储有这个声音;如果没有存储该声音,终端无响应;如有存储有该声音,终端读取该声音对应的预设信息,比如:自动发送语音内容“试验成功”,然后终端则自动向信息接收方发送语音信息“试验成功”。
具体实施例四:
如图8所示,首先用户在通话过程中发出声音,比如“我现在很好”,终端判断其是否自定义存储有这个声音;如果没有存储该声音,终端无响应;如有存储有该声音,终端读取该声音对应的预设信息,比如:自动发送语音内容“我被警察抓了”,则终端自动将“我被警察抓了”的语音信息发送出去。
具体实施例五:
如图9所示,首先用户在浏览网页过程中,发出声音,比如“翻页”,终端判断其是否自定义存储有这个声音;如果没有存储该声音,终端无响应;如有存储有该声音,终端读取该声音对应的预设信息,比如:网页自动往下翻一页,则终端上的网页自动翻到下一页。
为了更好的实现本发明实施例的方法,本发明实施例还提供一种终端,该终端包括处理器,所述处理器,配置为获取用户的输入语音;若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;执行所述预设信息对应的操作。
本发明实施例所述语音控制的方法如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质 中。基于这样的理解,本发明实施例的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机、服务器、或者网络设备等)执行本发明各个实施例所述方法的全部或部分。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。这样,本发明实施例不限制于任何特定的硬件和软件结合。
相应的,本发明实施例还提供一种计算机存储介质,其中存储有计算机程序,该计算机程序用于执行本发明实施例的语音控制的方法。
需要说明的是,本发明实施例提供的语音控制的装置是利用上述语音控制的方法的装置,则上述方法的所有实施例均适用于该装置,且均能达到相同或相似的有益效果。
以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。
工业实用性
本发明实施例中通过预先设置的与标识语音的含义不相关的预设信息使得其他用户无法直接获取用户的真实意图,实现了个性化的语音控制设置,大大提高了终端语音输入的安全性和服务性;同时提高了用户满意度。

Claims (14)

  1. 一种语音控制的方法,应用于终端侧,所述方法包括:
    获取用户的输入语音;
    若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;
    若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;
    执行所述预设信息对应的操作。
  2. 根据权利要求1所述的语音控制的方法,其中,若所述终端侧未开启所述预设功能,所述方法还包括:
    解析所述输入语音,确定所述输入语音的含义;
    根据所述输入语音的含义,执行对应操作。
  3. 根据权利要求1所述的语音控制的方法,其中,所述预先设置的与所述标识语音的含义不相关的预设信息的设置步骤包括:
    获取所述用户通过预设接口输入的预设信息,所述预设信息用于指示所述终端执行相应操作;
    响应所述用户通过语音接口输入语音的操作,将输入的所述语音设置为所述预设信息设置的标识语音;其中,所述预设信息和所述标识语音的内容不相关。
  4. 根据权利要求3所述的语音控制的方法,其中,所述获取所述用户通过预设接口输入的预设信息,包括:
    获取所述用户通过输入文本接口输入的文本。
  5. 根据权利要求3所述的语音控制的方法,其中,所述获取所述用户通过预设接口输入的预设信息,包括:
    获取所述用户通过输入语音接口输入的语音。
  6. 根据权利要求3所述的语音控制的方法,其中,所述获取所述用户通过预设接口输入的预设信息,包括:
    获取用户预先设置的指令;
    获取所述用户通过调用指令接口从所述预先设置的指令中选择的指令。
  7. 一种语音控制的装置,应用于终端侧,该装置包括:语音获取模块、确定模块、预设信息获取模块、第一执行模块;其中,
    语音获取模块,配置为获取用户的输入语音;
    确定模块,配置为若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;
    预设信息获取模块,配置为若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;
    第一执行模块,配置为执行所述预设信息对应的操作。
  8. 根据权利要求7所述的语音控制的装置,其中,若所述终端侧未开启所述预设功能,所述装置还包括:
    解析模块,配置为解析所述输入语音,确定所述输入语音的含义;
    第二执行模块,配置为根据所述输入语音的含义,执行对应操作。
  9. 根据权利要求7所述的语音控制的装置,其中,所述装置还包括:
    获取模块,配置为获取所述用户通过预设接口输入的预设信息,所述预设信息配置为指示所述终端执行相应操作;
    设置模块,配置为响应所述用户通过语音接口输入语音的操作,将输入的所述语音设置为所述预设信息设置的标识语音;其中,所述预设信息和所述标识语音的内容不相关。
  10. 根据权利要求9所述的语音控制的装置,其中,所述获取模块包括:
    第一获取子模块,配置为获取所述用户通过输入文本接口输入的文本。
  11. 根据权利要求9所述的语音控制的装置,其中,所述获取模块包括:
    第二获取子模块,配置为获取所述用户通过输入语音接口输入的语音。
  12. 根据权利要求9所述的语音控制的装置,其中,所述获取模块包括:
    第三获取子模块,配置为获取用户预先设置的指令;
    第四获取子模块,配置为获取所述用户通过调用指令接口从所述预先设置的指令中选择的指令。
  13. 一种终端,该终端包括处理器,所述处理器,配置为获取用户的输入语音;若所述终端侧开启预设功能,确定所述终端侧是否预先存储有与所述输入语音一致的标识语音;若存在所述标识语音,根据所述标识语音,获取预先设置的与所述标识语音的含义不相关的预设信息;执行所述预设信息对应的操作。
  14. 一种计算机存储介质,所述计算机存储介质中存储有计算机可执行指令,所述计算机可执行指令用于执行权利要求1-6任一项的方法。
PCT/CN2015/072705 2014-11-25 2015-02-10 一种语音控制的方法、装置及存储介质 WO2016082344A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410689720.3 2014-11-25
CN201410689720.3A CN105611033A (zh) 2014-11-25 2014-11-25 一种语音控制的方法及装置

Publications (1)

Publication Number Publication Date
WO2016082344A1 true WO2016082344A1 (zh) 2016-06-02

Family

ID=55990566

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/072705 WO2016082344A1 (zh) 2014-11-25 2015-02-10 一种语音控制的方法、装置及存储介质

Country Status (2)

Country Link
CN (1) CN105611033A (zh)
WO (1) WO2016082344A1 (zh)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105869643A (zh) * 2016-06-06 2016-08-17 青岛海信移动通信技术股份有限公司 基于语音的终端控制方法及语音控制装置
CN107547726A (zh) * 2016-06-24 2018-01-05 中兴通讯股份有限公司 一种移动终端语音指令处理方法及装置
CN107545892B (zh) * 2016-06-24 2021-07-30 中兴通讯股份有限公司 设备的控制方法、装置及***
CN109597657B (zh) * 2017-09-29 2022-04-29 阿里巴巴(中国)有限公司 针对目标应用的操作方法、装置及计算设备
CN108632463A (zh) * 2018-04-24 2018-10-09 维沃移动通信有限公司 一种语音控制方法及移动终端
CN109087640A (zh) * 2018-08-22 2018-12-25 蔚来汽车有限公司 信息交互方法、***以及用于信息交互的车机和服务器

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103458090A (zh) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 移动终端控制方法及装置
CN103448632A (zh) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 汽车控制方法及装置
US20140049697A1 (en) * 2012-08-14 2014-02-20 Kentec Inc. Television device and method for displaying virtual on-screen interactive moderator
CN103674012A (zh) * 2012-09-21 2014-03-26 高德软件有限公司 语音定制方法及其装置、语音识别方法及其装置
CN103793641A (zh) * 2014-02-27 2014-05-14 联想(北京)有限公司 一种信息处理方法、装置及电子设备

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646646B (zh) * 2013-11-27 2018-08-31 联想(北京)有限公司 一种语音控制方法及电子设备

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103458090A (zh) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 移动终端控制方法及装置
CN103448632A (zh) * 2012-05-28 2013-12-18 百度在线网络技术(北京)有限公司 汽车控制方法及装置
US20140049697A1 (en) * 2012-08-14 2014-02-20 Kentec Inc. Television device and method for displaying virtual on-screen interactive moderator
CN103674012A (zh) * 2012-09-21 2014-03-26 高德软件有限公司 语音定制方法及其装置、语音识别方法及其装置
CN103793641A (zh) * 2014-02-27 2014-05-14 联想(北京)有限公司 一种信息处理方法、装置及电子设备

Also Published As

Publication number Publication date
CN105611033A (zh) 2016-05-25

Similar Documents

Publication Publication Date Title
WO2016082344A1 (zh) 一种语音控制的方法、装置及存储介质
US10930277B2 (en) Configuration of voice controlled assistant
KR101726945B1 (ko) 수동 시작/종료 포인팅 및 트리거 구문들에 대한 필요성의 저감
US9811870B2 (en) Information processing method, apparatus and payment system
US9263029B2 (en) Instant communication voice recognition method and terminal
RU2530267C2 (ru) Способ коммуникации пользователя с информационной диалоговой системой
CN105072178B (zh) 手机号绑定信息获取方法及装置
RU2017124103A (ru) Совершение задачи без монитора в цифровом персональном помощнике
US20190199842A1 (en) Method and system for automatically saving unknown number in mobile phone
KR20140141916A (ko) 사용자 기기의 알림 기능 운용 방법 및 장치
CN109243443B (zh) 语音控制方法、装置及电子设备
WO2016201767A1 (zh) 一种语音控制方法、装置及计算机存储介质
CN105245729A (zh) 移动终端消息阅读方法和装置
CN107483736B (zh) 一种即时通信应用程序的消息处理方法及装置
WO2015188459A1 (zh) 一种终端控制方法、装置、语音控制装置及终端
CN104735238A (zh) 一种通话录音方法及装置
WO2015103842A1 (zh) 消息响应的方法及装置
WO2020063451A1 (zh) 通话留言方法、终端和具有存储功能的装置
CN103064828A (zh) 一种操作文本的方法及装置
US20170118586A1 (en) Voice data transmission processing method, terminal and computer storage medium
EP3687198B1 (en) Text message playback method, terminal and computer-readable storage medium
CN104572007A (zh) 一种终端的音量调节方法
KR101643808B1 (ko) 어플리케이션과 서버 간의 연동을 이용한 음성 서비스 제공 방법 및 그 시스템
CN105812535A (zh) 一种记录语音通信信息的方法及终端
CN104571856A (zh) 一种终端

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15864005

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15864005

Country of ref document: EP

Kind code of ref document: A1