CN111819533B - Method for triggering electronic equipment to execute function and electronic equipment - Google Patents

Method for triggering electronic equipment to execute function and electronic equipment Download PDF

Info

Publication number
CN111819533B
CN111819533B CN201880090844.6A CN201880090844A CN111819533B CN 111819533 B CN111819533 B CN 111819533B CN 201880090844 A CN201880090844 A CN 201880090844A CN 111819533 B CN111819533 B CN 111819533B
Authority
CN
China
Prior art keywords
voice data
electronic device
text
word
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201880090844.6A
Other languages
Chinese (zh)
Other versions
CN111819533A (en
Inventor
许军
李永德
梁兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN111819533A publication Critical patent/CN111819533A/en
Application granted granted Critical
Publication of CN111819533B publication Critical patent/CN111819533B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Telephone Function (AREA)

Abstract

A method for triggering an electronic device to execute a function and the electronic device relate to the field of electronic devices, and the electronic device can be triggered to execute a corresponding function without inputting voice data for many times by a user, so that the use efficiency of the electronic device is improved, and efficient interaction between the electronic device and the user is realized. The electronic equipment is provided with at least two first awakening words, each of the at least two first awakening words corresponds to one first instruction, and the electronic equipment responds to the first instructions corresponding to different first awakening words to execute different functions; the electronic device includes a main processor in a sleep state; the method comprises the following steps: the electronic device receives first voice data input by a user (S301); judging whether a wakeup word with a text matched with a text corresponding to the first voice data exists in the at least two first wakeup words (S302); if the first instruction exists, the main processor is waken up from the sleep state, a first instruction corresponding to the first voice data is determined, and a function corresponding to the first instruction is executed through the main processor (S303).

Description

Method for triggering electronic equipment to execute function and electronic equipment
Technical Field
The present embodiment relates to the field of electronic devices, and in particular, to a method for triggering an electronic device to execute a function and an electronic device.
Background
Generally, a user may perform a triggering operation on a physical key (e.g., a volume "+" key, a power key, etc.) of the mobile phone, or perform a touch operation on a display screen of the mobile phone to trigger the mobile phone to perform a corresponding function. When the user is inconvenient to operate the mobile phone by using fingers, the user often selects to control the mobile phone to execute the corresponding function through voice. Currently, a voice assistant can provide voice control service for a user to implement the purpose of executing a corresponding function through a voice control mobile phone. The voice assistant is an important application of artificial intelligence on the mobile phone. The voice assistant can recognize the voice command input by the user and trigger the mobile phone to execute the function corresponding to the voice command, so that the intelligent interaction between the user and the mobile phone is realized.
However, in general, the voice assistant is in a sleep state, and the user needs to wake up the voice assistant with voice before using the voice assistant. Only after the voice assistant is awakened can the voice command input by the user be received and recognized. The voice data used to wake up the voice assistant may be referred to as a wake up word (or called a wake up voice). The wakeup word may be pre-registered by the user in the handset. For example, the wakeup word registered in advance in the handset is "hello small E". If the user wants to use the voice assistant to trigger the mobile phone to turn down the volume of the mobile phone, the user needs to speak the 'hello small E' to wake up the voice assistant. After the voice assistant is awakened, the user says "turn down the volume of the mobile phone". At this time, the voice assistant can receive and recognize the voice command of the user to turn down the volume of the mobile phone, and trigger the mobile phone to turn down the volume.
It can be seen that the user needs to input voice data for multiple times to trigger the mobile phone to execute the corresponding function, which greatly reduces the usage efficiency of the mobile phone.
Disclosure of Invention
The embodiment provides a method for triggering an electronic device to execute a function and the electronic device, which can trigger the electronic device to execute a corresponding function without inputting voice data for many times by a user, improve the use efficiency of the electronic device, and realize efficient interaction between the electronic device and the user.
In order to achieve the above object, the present embodiment provides the following technical solutions:
a first aspect of this embodiment provides a method for triggering an electronic device to execute a function, where the electronic device is provided with at least two first wake-up words, each of the at least two first wake-up words corresponds to a first instruction, and the functions executed by the electronic device in response to the first instructions corresponding to different first wake-up words are different; the electronic device may include a primary processor, the primary processor being in a sleep state. The method can comprise the following steps: the electronic device receives first voice data input by a user. The electronic equipment judges whether a wake-up word of which the text is matched with the text corresponding to the first voice data exists in the at least two first wake-up words. If the electronic equipment determines that the awakening words with texts matched with the texts corresponding to the first voice data exist in the at least two first awakening words, the main processor can be awakened from the dormant state, a first instruction corresponding to the first voice data is determined, and a function corresponding to the first instruction is executed through the main processor.
According to the technical scheme provided by the embodiment, the electronic equipment can wake up the main processor of the electronic equipment from the dormant state after the received voice data passes the verification, and determine the instruction corresponding to the input voice data so as to trigger the electronic equipment to execute the function corresponding to the instruction through the main processor. It can be seen that as long as the electronic device does not have other software and hardware to use the microphone to collect voice data (even if the electronic device is in a black screen state and the AP is in a sleep state), the user does not need to input a wakeup word first, so that the electronic device starts the voice assistant and then inputs a voice command, but inputs a piece of voice data to wake up the main processor of the electronic device and trigger the electronic device to execute a corresponding function. Therefore, the use efficiency of the electronic equipment is improved, and the efficient interaction between the electronic equipment and a user is realized. Meanwhile, the use experience of the user is improved.
With reference to the first aspect, in a possible implementation manner, before the electronic device wakes up the main processor from the sleep state, the method may further include: the electronic device determines that the voiceprint features of the first voice data match the voiceprint features corresponding to the at least two first wake-up words. Therefore, only the person who registers the awakening word in the electronic equipment can trigger the electronic equipment to execute the corresponding function by inputting the voice data, and the use safety of the voice control service is improved.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the electronic device wakes up the main processor from a sleep state, determines a first instruction corresponding to the first voice data, and executes, by the main processor, a function corresponding to the first instruction specifically: the electronic equipment wakes up the main processor from the sleep state, starts the voice assistant through the main processor, determines a first instruction corresponding to the first voice data through the voice assistant, and executes a function corresponding to the first instruction through the main processor. And the analysis of the monitored first voice data is realized by the voice assistant.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the electronic device further includes a first coprocessor; the first voice data received by the electronic device and input by the user may specifically be: the electronic equipment monitors first voice data input by a user by using a first coprocessor; the electronic equipment judges whether an awakening word with a text matched with a text corresponding to the first voice data exists in the at least two first awakening words; if there is an awakening word in the at least two first awakening words, where the text of the awakening word matches the text corresponding to the first voice data, the awakening of the main processor from the sleep state by the electronic device may specifically be: the electronic equipment judges whether a wake-up word with a text matched with a text corresponding to the first voice data exists in the at least two first wake-up words or not by using the first coprocessor; if so, the first coprocessor wakes up the main processor from the sleep state. In some embodiments, the main processor is an AP, and the first coprocessor is a DSP.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the electronic device further includes a first coprocessor; the first voice data received by the electronic device and input by the user may specifically be: the electronic equipment monitors first voice data input by a user by using a first coprocessor; the electronic equipment judges whether an awakening word of which the text is matched with the text corresponding to the first voice data exists in the at least two first awakening words or not; if there is an awakening word in the at least two first awakening words, where the text of the awakening word matches the text corresponding to the first voice data, the awakening of the main processor from the sleep state by the electronic device may specifically be: the electronic equipment judges whether a wake-up word with the matching degree of the text corresponding to the first voice data and the text meeting first precision exists in the at least two first wake-up words by using the first coprocessor; if the matching degree of the text corresponding to the first voice data and the text in the at least two first awakening words meets the awakening words with the first precision, the first coprocessor awakens the main processor from the dormant state; determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction by the main processor may specifically be: the electronic equipment judges whether a wake-up word with the matching degree of the text corresponding to the first voice data and the text meeting second precision exists in the at least two first wake-up words by using the main processor; if the matching degree of the text corresponding to the first voice data and the wake-up word with the second precision exists in the at least two first wake-up words, determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction through the main processor; the first precision is less than the second precision. In some embodiments, the host processor is an AP and the first coprocessor is a DSP.
With reference to the first aspect or the foregoing possible implementation manners, in another possible implementation manner, before the electronic device receives first voice data input by a user, the method may further include: the electronic device enters a predetermined mode. Therefore, after the electronic equipment enters the preset mode, the user can trigger the electronic equipment to execute the corresponding function by directly inputting the first awakening word. The method and the device have the advantages that efficient interaction between the electronic equipment and a user is realized, and meanwhile, the power consumption of the electronic equipment is saved as far as possible.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the electronic device further includes a second coprocessor; before the electronic device enters the predetermined mode, the method may further include: the electronic device monitors the voice data using the second co-processor. Before the electronic equipment enters a preset mode, the second coprocessor with lower power consumption is used for monitoring voice data, so that the normal use of the voice assistant is ensured, and the power consumption of the electronic equipment is saved. In some embodiments, the second coprocessor is a DSP having a lower processing performance than the first coprocessor and a smaller memory than the first coprocessor.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, a second wakeup word is further set in the electronic device; the electronic device entering the predetermined mode may specifically be: the electronic equipment receives second voice data input by a user; the electronic equipment judges whether the second voice data is matched with the second awakening word; if the second voice data is matched with the second awakening word, the electronic equipment awakens the main processor from the dormant state, and the voice assistant is started through the main processor; the electronic equipment receives third voice data input by a user through the voice assistant, determines a second instruction corresponding to the third voice data, and executes a function corresponding to the second instruction through the main processor, wherein the second instruction is used for indicating the electronic equipment to enter a preset mode.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the determining, by the electronic device, whether the second voice data is matched with the second wakeup word specifically may be: the electronic equipment judges whether the text corresponding to the second voice data is matched with the text of the second awakening word, and if the text corresponding to the second voice data is matched with the text of the second awakening word, the second voice data is matched with the second awakening word.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the determining, by the electronic device, whether the second voice data is matched with the second wakeup word specifically may be: the electronic equipment judges whether the text corresponding to the second voice data is matched with the text of the second awakening word or not and judges whether the voiceprint characteristics of the second voice data are matched with the voiceprint characteristics corresponding to the second awakening word or not; and if the text corresponding to the second voice data is matched with the text of the second awakening word and the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word, matching the second voice data with the second awakening word. Therefore, after the voiceprint verification and the text verification of the second voice data are both passed, the second voice data are determined to be matched with the second awakening word so as to awaken the main processor from the dormant state, and then the voice assistant is started, so that the use safety of the voice control service is improved.
With reference to the first aspect or the foregoing possible implementation manner, in another possible implementation manner, the electronic device further includes a second coprocessor; the second voice data received by the electronic device from the user may specifically be: the electronic equipment monitors second voice data input by a user by using a second coprocessor; the electronic equipment judges whether the second voice data is matched with the second awakening word; if the second voice data is matched with the second awakening word, the electronic device awakens the main processor from the dormant state, and the starting of the voice assistant through the main processor specifically may be: the electronic equipment judges whether the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets third precision or not by using the second coprocessor; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the third precision, the second coprocessor awakens the main processor from the dormant state; the electronic equipment judges whether the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets fourth precision or not by using the main processor; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the fourth precision, the electronic equipment starts a voice assistant through the main processor; the third precision is less than the fourth precision.
A second aspect of this embodiment provides an electronic device, where at least two first wake-up words are set in the electronic device, each of the at least two first wake-up words corresponds to a first instruction, and functions executed by the electronic device in response to the first instructions corresponding to different first wake-up words are different; the electronic device may include: the input unit is used for receiving first voice data input by a user; the verification unit is used for judging whether an awakening word with text matching with the text corresponding to the first voice data exists in the at least two first awakening words; the awakening unit is used for awakening a main processor of the electronic equipment from a dormant state if an awakening word with a text matched with a text corresponding to the first voice data exists in the at least two first awakening words; and the determining and executing unit is used for determining a first instruction corresponding to the first voice data and executing a function corresponding to the first instruction through the main processor.
With reference to the second aspect, in a possible implementation manner, the verification unit may be further configured to determine that the voiceprint features of the first voice data match the voiceprint features corresponding to the at least two first wake-up words.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, the determining execution unit is specifically configured to: the voice assistant is started through the main processor, a first instruction corresponding to the first voice data is determined through the voice assistant, and a function corresponding to the first instruction is executed through the main processor.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, the verification unit is specifically configured to determine whether a wakeup word exists in the at least two first wakeup words, where a matching degree of a text corresponding to the first speech data to the text meets a first accuracy; the wake-up unit is specifically configured to wake up a main processor of the electronic device from a sleep state if a wake-up word with a first precision is present in the at least two first wake-up words, where a matching degree of a text corresponding to the first voice data to the text satisfies the first precision; the verification unit is further configured to determine whether a wakeup word with a matching degree between a text corresponding to the first voice data and the text in the at least two first wakeup words satisfies a second precision exists; the determining and executing unit is specifically configured to determine a first instruction corresponding to the first voice data if an awakening word with a second accuracy is present in the at least two first awakening words, and execute a function corresponding to the first instruction through the main processor; the first precision is less than the second precision.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, the electronic device may further include: and the triggering unit is used for triggering the electronic equipment to enter a preset mode.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, a second wake-up word is further set in the electronic device; the electronic device may further include: and a starting unit. The input unit is also used for receiving second voice data input by a user; the verification unit is further used for judging whether the second voice data are matched with the second awakening words or not; the awakening unit is also used for awakening the main processor from the dormant state if the second voice data is matched with the second awakening word; the starting unit is used for starting the voice assistant through the main processor; the input unit is also used for receiving third voice data input by the user through the voice assistant; and the determining and executing unit is further used for determining a second instruction corresponding to the third voice data, and executing a function corresponding to the second instruction through the main processor, wherein the second instruction is used for indicating the electronic equipment to enter a preset mode.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, the matching unit is specifically configured to determine whether a text corresponding to the second voice data matches a text of the second wakeup word, and if the text corresponding to the second voice data matches the text of the second wakeup word, match the second voice data with the second wakeup word.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, the matching unit is specifically configured to: judging whether the text corresponding to the second voice data is matched with the text of the second awakening word or not, and judging whether the voiceprint characteristics of the second voice data are matched with the voiceprint characteristics corresponding to the second awakening word or not; and if the text corresponding to the second voice data is matched with the text of the second awakening word and the voiceprint characteristics of the second voice data are matched with the voiceprint characteristics corresponding to the second awakening word, matching the second voice data with the second awakening word.
With reference to the second aspect or the foregoing possible implementation manner, in another possible implementation manner, the matching unit is specifically configured to determine whether a matching degree between a text of the second wakeup word and a text corresponding to the second voice data meets a third precision; the awakening unit is specifically used for awakening the main processor from the dormant state if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets third precision; the matching unit is specifically further configured to determine whether the matching degree between the text of the second wakeup word and the text corresponding to the second voice data meets a fourth precision; the starting unit is used for starting the voice assistant through the main processor if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the fourth precision; the third precision is less than the fourth precision.
In a third aspect of the present embodiment, an electronic device is provided, which may include: a processor, a memory, and a display; the memory, the display and the processor are coupled; the display is used for displaying the image generated by the processor; the memory is for storing computer program code; the processor may include a primary processor, the primary processor being in a sleep state; the electronic equipment is provided with at least two first awakening words, each of the at least two first awakening words corresponds to one first instruction, and the functions executed by the electronic equipment in response to the first instructions corresponding to different first awakening words are different; the computer program code includes computer instructions that, when executed by the processor, cause the processor to receive first speech data input by a user; judging whether an awakening word with a text matched with a text corresponding to the first voice data exists in the at least two first awakening words; and if the wake-up words with texts matched with the texts corresponding to the first voice data exist in the at least two first wake-up words, waking up the main processor from the dormant state, determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction through the main processor.
With reference to the third aspect, in a possible implementation manner, the processor is further configured to determine that voiceprint features of the first voice data match voiceprint features corresponding to the at least two first wake-up words.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the processor is configured to wake up the main processor from a sleep state, determine a first instruction corresponding to the first voice data, and execute, by the main processor, a function corresponding to the first instruction specifically: and the processor is used for awakening the main processor from the sleep state, starting the voice assistant through the main processor, determining a first instruction corresponding to the first voice data through the voice assistant, and executing a function corresponding to the first instruction through the main processor.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the processor further includes a first coprocessor; the processor is configured to receive first voice data input by a user, and specifically: the first coprocessor is used for monitoring first voice data input by a user. The processor is used for judging whether an awakening word with text matching with the text corresponding to the first voice data exists in the at least two first awakening words; if the wake-up word with the text matched with the text corresponding to the first voice data exists in the at least two first wake-up words, waking up the main processor from the sleep state specifically as follows: the first coprocessor is used for judging whether an awakening word with text matching with the text corresponding to the first voice data exists in the at least two first awakening words; if so, the main processor is awakened from the sleep state.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the processor further includes a first coprocessor; the processor is configured to receive first voice data input by a user, and specifically: the first coprocessor is used for monitoring first voice data input by a user. The processor is used for judging whether an awakening word with a text matched with the text corresponding to the first voice data exists in the at least two first awakening words; if the wake-up word with the text matched with the text corresponding to the first voice data exists in the at least two first wake-up words, waking up the main processor from the sleep state specifically as follows: the first coprocessor is used for judging whether a wake-up word with the matching degree of the text corresponding to the first voice data and the text corresponding to the first voice data in the at least two first wake-up words meets first precision or not; and if the matching degree of the text corresponding to the first voice data and the wake-up word with the first precision exists in the at least two first wake-up words, waking up the main processor from the dormant state. The processor is configured to determine a first instruction corresponding to the first voice data, and execute, by the main processor, a function corresponding to the first instruction specifically: the main processor is used for judging whether a wake-up word with the matching degree of the text corresponding to the first voice data and the text meeting second precision exists in the at least two first wake-up words; if the matching degree of the text corresponding to the first voice data and the wake-up word with the second precision is existed in the at least two first wake-up words, determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction; the first precision is less than the second precision.
With reference to the third aspect or the foregoing possible implementation manners, in another possible implementation manner, the processor is further configured to trigger the electronic device to enter a predetermined mode.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the processor further includes a second coprocessor; and the second coprocessor is used for monitoring the voice data before the electronic equipment enters the preset mode.
With reference to the third aspect or the foregoing possible implementation manners, in another possible implementation manner, a second wake-up word is further set in the electronic device; the processor is further configured to trigger the electronic device to enter a predetermined mode, specifically: the processor is also used for receiving second voice data input by a user; judging whether the second voice data is matched with the second awakening word; if the second voice data is matched with the second awakening word, awakening the main processor from the dormant state, and starting the voice assistant through the main processor; and receiving third voice data input by a user through the voice assistant, determining a second instruction corresponding to the third voice data, and executing a function corresponding to the second instruction through the main processor, wherein the second instruction is used for indicating the electronic equipment to enter a preset mode.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the determining whether the second voice data and the second wakeup word are matched specifically includes: and the processor is used for judging whether the text corresponding to the second voice data is matched with the text of the second awakening word or not, and if the text corresponding to the second voice data is matched with the text of the second awakening word, matching the second voice data with the second awakening word.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the determining whether the second voice data and the second wakeup word are matched specifically includes: the processor is used for judging whether the text corresponding to the second voice data is matched with the text of the second awakening word or not and judging whether the voiceprint characteristics of the second voice data are matched with the voiceprint characteristics corresponding to the second awakening word or not; and if the text corresponding to the second voice data is matched with the text of the second awakening word and the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word, matching the second voice data with the second awakening word.
With reference to the third aspect or the foregoing possible implementation manner, in another possible implementation manner, the processor further includes a second coprocessor; the processor is further configured to receive second voice data input by the user, specifically: and the second coprocessor is used for monitoring second voice data input by the user. The processor is also used for judging whether the second voice data is matched with the second awakening word; if the second voice data is matched with the second awakening word, awakening the main processor from the dormant state, and starting the voice assistant through the main processor specifically comprises the following steps: the second coprocessor is further used for judging whether the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets third precision; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the third precision, awakening the main processor from the dormant state; the main processor is further used for judging whether the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets fourth precision; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the fourth precision, starting a voice assistant; the third precision is less than the fourth precision.
A fourth aspect of the present embodiments provides a computer storage medium, which includes computer instructions that, when executed on an electronic device, cause the electronic device to perform the method for triggering the electronic device to execute a function as described in the first aspect or any of the possible implementations of the first aspect.
A fifth aspect of the present embodiments provides a computer program product, which, when running on a computer, causes the computer to execute the method for triggering an electronic device to execute a function as described in the first aspect or any of the possible implementations of the first aspect.
It should be appreciated that the description of technical features, solutions, benefits, or similar language in this embodiment does not imply that all of the features and advantages may be realized in any single embodiment. Rather, it is to be understood that the description of a feature or advantage is intended to include the specific features, aspects or advantages in at least one embodiment. Therefore, descriptions of technical features, technical solutions or advantages in this specification do not necessarily refer to the same embodiment. Furthermore, the technical features, technical solutions and advantages described in the present embodiments may also be combined in any suitable manner. One skilled in the relevant art will recognize that an embodiment may be practiced without one or more of the specific features, aspects, or advantages of a particular embodiment. In other instances, additional features and advantages may be recognized in certain embodiments that may not be present in all embodiments.
Drawings
Fig. 1 is a schematic structural diagram of an electronic device provided in this embodiment;
fig. 2 is a block diagram of a software structure of an electronic device according to this embodiment;
fig. 3 is a flowchart illustrating a method for triggering an electronic device to execute a function according to this embodiment;
FIG. 4 is a diagram illustrating graphical user interfaces displayed on the electronic device according to the present embodiment;
fig. 5 is a flowchart illustrating another method for triggering an electronic device to execute a function according to this embodiment;
FIG. 6 is a diagram illustrating some other graphical user interfaces displayed on the electronic device according to this embodiment;
FIG. 7 is a diagram illustrating additional graphical user interfaces displayed on the electronic device in accordance with the present embodiment;
FIG. 8 is a diagram illustrating additional graphical user interfaces displayed on the electronic device in accordance with the present embodiment;
fig. 9 is a schematic structural diagram of another electronic device provided in this embodiment;
fig. 10 is a schematic structural diagram of another electronic device provided in this embodiment.
Detailed Description
In the following, the terms "first", "second" are used for descriptive purposes only and are not to be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present embodiment, "a plurality" means two or more unless otherwise specified.
According to the method for triggering the electronic equipment to execute the function, the electronic equipment can provide the voice control service for the user without inputting voice data for many times by the user. That is, the electronic device can be triggered to execute the corresponding function without inputting voice data for many times by the user. The use efficiency of the electronic equipment is improved, and the efficient interaction between the electronic equipment and the user is realized. Meanwhile, the use experience of the user is improved.
It should be noted that the electronic device described in this embodiment may be a mobile phone, a tablet Computer, a desktop, a laptop, a handheld Computer, a notebook Computer, a Personal Computer (PC), a netbook, a cellular phone, a Personal Digital Assistant (PDA), a wearable device (such as a smart watch), an intelligent home device, an in-vehicle Computer, and the like, and the specific form of the electronic device is not particularly limited in this embodiment.
Referring to fig. 1, a schematic structural diagram of an electronic device 100 provided in the present embodiment is shown. The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a Universal Serial Bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, a sensor module 180, a button 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a Subscriber Identity Module (SIM) card interface 195, and the like. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.
It is to be understood that the illustrated structure of the present embodiment does not constitute a specific limitation to the electronic apparatus 100. In other embodiments, electronic device 100 may include more or fewer components than shown, or combine certain components, or split certain components, or a different arrangement of components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
Processor 110 may include one or more processing units, such as: the processor 110 may include an Application Processor (AP), a modem processor, a Graphics Processing Unit (GPU), an Image Signal Processor (ISP), a controller, a memory, a video codec, a Digital Signal Processor (DSP), a baseband processor, and/or a neural-Network Processing Unit (NPU), etc. Wherein, the different processing units may be independent devices or may be integrated in one or more processors.
In this embodiment, the processor 110 may include a DSP, such as referred to as a first DSP. One or more first wake-up words may be set in the electronic device 100. When a plurality of first wake-up words are set in the electronic device 100, each of the plurality of first wake-up words corresponds to one instruction, and the electronic device 100 executes different functions according to the instructions corresponding to different first wake-up words. Wherein the first DSP may monitor the voice data in real time through the microphone 170C of the electronic device 100. When the voice data monitored by the first DSP is detected, text verification with first precision can be carried out on the monitored voice data. If the text verification of the first precision passes, the first DSP can awaken the AP and inform the AP to perform text verification of a second precision on the received voice data. The first precision is less than the second precision. And when the text verification of the second precision passes, the AP determines that the voice data is matched with the first awakening word. At this time, the electronic device 100 may determine an instruction corresponding to the voice data, and trigger the electronic device 100 to execute a corresponding function according to the instruction through the AP. For example, the electronic device may determine the instruction corresponding to the voice data according to a predefined correspondence between the first wake-up word and the instruction. For another example, the electronic device 100 may wake up a voice assistant in the electronic device 100 and perform semantic analysis on the text of the voice data by the voice assistant to determine an instruction corresponding to the voice data, thereby triggering the electronic device 100 to execute a function corresponding to the instruction.
In some embodiments, the text verification operation of the first precision and the text verification operation of the second precision may be performed only one or both. The first wake word may be a predefined wake word. The first wake-up word may also be a user-defined wake-up word. If the first awakening word is the user-defined awakening word, optionally, after receiving the voice data, the AP may also perform voiceprint verification on the voice data. And when the text check and the voiceprint check both pass, the AP determines that the voice data is matched with the first awakening word.
The first DSP may be a DSP with a large memory and higher processing capabilities. For example, the first DSP may be a High Fidelity (HIFI) DSP disposed within a System On a Chip (SOC). The first DSP may also be a codec DSP (codec DSP) disposed outside the SOC.
In some embodiments, the processor 110 may also include another DSP, such as referred to as a second DSP. A second wake-up word may also be set in the electronic device. Wherein the second DSP may monitor the voice data in real time through the microphone 170C of the electronic device 100. When the voice data monitored by the second DSP is detected, text verification of a third accuracy can be performed on the monitored voice data. If the text verification of the third precision passes, the second DSP may transmit the voice data to the AP. After receiving the voice data, the AP may perform a fourth-precision text check and a voiceprint check on the voice data (where the voiceprint check is an optional check operation). The third precision is less than the fourth precision. The third accuracy may be the same as the first accuracy or may be different from the first accuracy. The fourth accuracy may be the same as the second accuracy or may be different from the second accuracy. And when the text check and the voiceprint check both pass, the AP determines that the voice data is matched with the second awakening word. At this point, the electronic device 100 may turn on the voice assistant. After the voice assistant is turned on, the electronic device 100 may receive a voice command input by the user through the voice assistant in order to trigger the electronic device 100 to execute a corresponding function.
The second DSP may be a DSP with less memory and lower processing performance.
It should be noted that, in the present embodiment, if the processor 110 includes two DSPs, such as the first DSP and the second DSP described above. The microphone 170C of the electronic device 100 establishes a path with only one of the DSPs at a time to transmit the received voice data to the corresponding DSP. For example, the microphone 170C of the electronic device 100 establishes a path only with the first DSP when the electronic device 100 is in the predetermined mode. If the microphone 170C collects voice data input by the user, the collected voice data is transmitted to the first DSP through the established path for subsequent processing by the first DSP. For another example, when the electronic device exits the predetermined mode, or is not in the predetermined mode, the electronic device 100 switches the path of the microphone 170C from the first DSP to the second DSP, i.e., the microphone 170C only establishes a path with the second DSP. If the microphone 170C collects voice data input by the user, the collected voice data is transmitted to the second DSP through the established path for subsequent processing by the second DSP.
The controller may be a neural center and a command center of the electronic device 100. The controller can generate an operation control signal according to the instruction operation code and the timing signal to complete the control of instruction fetching and instruction execution.
A memory may also be provided in processor 110 for storing instructions and data. In some embodiments, the memory in the processor 110 is a cache memory. The memory may hold instructions or data that have just been used or recycled by the processor 110. If the processor 110 needs to reuse the instruction or data, it can be called directly from the memory. Avoiding repeated accesses reduces the latency of the processor 110, thereby increasing the efficiency of the system.
In some embodiments, processor 110 may include one or more interfaces. The interface may include an integrated circuit (I2C) interface, an integrated circuit built-in audio (I2S) interface, a Pulse Code Modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a Mobile Industry Processor Interface (MIPI), a general-purpose input/output (GPIO) interface, a Subscriber Identity Module (SIM) interface, and/or a Universal Serial Bus (USB) interface, etc.
The I2C interface is a bi-directional synchronous serial bus that includes a serial data line (SDA) and a Serial Clock Line (SCL). In some embodiments, processor 110 may include multiple sets of I2C buses. The processor 110 may be coupled to the touch sensor 180K, the charger, the flash, the camera 193, etc. through different I2C bus interfaces, respectively. For example: the processor 110 may be coupled to the touch sensor 180K via an I2C interface, such that the processor 110 and the touch sensor 180K communicate via an I2C bus interface to implement the touch functionality of the electronic device 100.
The I2S interface may be used for audio communication. In some embodiments, processor 110 may include multiple sets of I2S buses. The processor 110 may be coupled to the audio module 170 via an I2S bus to enable communication between the processor 110 and the audio module 170. In some embodiments, the audio module 170 may communicate audio signals to the wireless communication module 160 via the I2S interface, enabling answering of calls via a bluetooth headset.
The PCM interface may also be used for audio communication, sampling, quantizing and encoding analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled by a PCM bus interface. In some embodiments, the audio module 170 may also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to implement a function of answering a call through a bluetooth headset. Both the I2S interface and the PCM interface may be used for audio communication.
The UART interface is a universal serial data bus used for asynchronous communications. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is generally used to connect the processor 110 with the wireless communication module 160. For example: the processor 110 communicates with a bluetooth module in the wireless communication module 160 through a UART interface to implement a bluetooth function. In some embodiments, the audio module 170 may transmit the audio signal to the wireless communication module 160 through a UART interface, so as to realize the function of playing music through a bluetooth headset.
MIPI interfaces may be used to connect processor 110 with peripheral devices such as display screen 194, camera 193, and the like. The MIPI interface includes a Camera Serial Interface (CSI), a Display Serial Interface (DSI), and the like. In some embodiments, processor 110 and camera 193 communicate through a CSI interface to implement the capture functionality of electronic device 100. The processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100.
The GPIO interface may be configured by software. The GPIO interface may be configured as a control signal and may also be configured as a data signal. In some embodiments, a GPIO interface may be used to connect the processor 110 with the camera 193, the display 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface may also be configured as an I2C interface, an I2S interface, a UART interface, a MIPI interface, and the like.
The USB interface 130 is an interface conforming to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, or the like. The USB interface 130 may be used to connect a charger to charge the electronic device 100, and may also be used to transmit data between the electronic device 100 and a peripheral device. And the earphone can also be used for connecting an earphone and playing audio through the earphone. The interface may also be used to connect other electronic devices, such as AR devices and the like.
It should be understood that the interface connection relationship between the modules illustrated in the present embodiment is only an exemplary illustration, and does not limit the structure of the electronic device 100. In other embodiments, the electronic device 100 may also adopt different interface connection manners or a combination of multiple interface connection manners in the above embodiments.
The charging management module 140 is configured to receive charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from a wired charger via the USB interface 130. In some wireless charging embodiments, the charging management module 140 may receive a wireless charging input through a wireless charging coil of the electronic device 100. The charging management module 140 may also supply power to the electronic device through the power management module 141 while charging the battery 142.
The power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110. The power management module 141 receives input from the battery 142 and/or the charge management module 140 and provides power to the processor 110, the internal memory 121, the external memory, the display 194, the camera 193, the wireless communication module 160, and the like. The power management module 141 may also be used to monitor parameters such as battery capacity, battery cycle count, battery state of health (leakage, impedance), etc. In some other embodiments, the power management module 141 may also be disposed in the processor 110. In other embodiments, the power management module 141 and the charging management module 140 may also be disposed in the same device.
The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, a modem processor, a baseband processor, and the like.
The antennas 1 and 2 are used for transmitting and receiving electromagnetic wave signals. Each antenna in the electronic device 100 may be used to cover a single or multiple communication bands. Different antennas can also be multiplexed to improve the utilization of the antennas. For example: the antenna 1 may be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
The mobile communication module 150 may provide a solution including 2G/3G/4G/5G wireless communication applied to the electronic device 100. The mobile communication module 150 may include at least one filter, a switch, a power amplifier, a Low Noise Amplifier (LNA), and the like. The mobile communication module 150 may receive the electromagnetic wave from the antenna 1, filter, amplify, etc. the received electromagnetic wave, and transmit the electromagnetic wave to the modem processor for demodulation. The mobile communication module 150 may also amplify the signal modulated by the modem processor, and convert the signal into electromagnetic wave through the antenna 1 to radiate the electromagnetic wave. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the processor 110. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the same device as at least some of the modules of the processor 110.
The modem processor may include a modulator and a demodulator. The modulator is used for modulating a low-frequency baseband signal to be transmitted into a medium-high frequency signal. The demodulator is used for demodulating the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then passes the demodulated low frequency baseband signal to a baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and then passed to the application processor. The application processor outputs a sound signal through an audio device (not limited to the speaker 170A, the receiver 170B, etc.) or displays an image or video through the display screen 194. In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be provided in the same device as the mobile communication module 150 or other functional modules, independent of the processor 110.
The wireless communication module 160 may provide a solution for wireless communication applied to the electronic device 100, including Wireless Local Area Networks (WLANs) (e.g., wireless fidelity (Wi-Fi) networks), bluetooth (bluetooth, BT), Global Navigation Satellite System (GNSS), Frequency Modulation (FM), Near Field Communication (NFC), Infrared (IR), and the like. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2, performs frequency modulation and filtering processing on electromagnetic wave signals, and transmits the processed signals to the processor 110. The wireless communication module 160 may also receive a signal to be transmitted from the processor 110, perform frequency modulation and amplification on the signal, and convert the signal into electromagnetic waves via the antenna 2 to radiate the electromagnetic waves.
In some embodiments, antenna 1 of electronic device 100 is coupled to mobile communication module 150 and antenna 2 is coupled to wireless communication module 160 so that electronic device 100 can communicate with networks and other devices through wireless communication techniques. The wireless communication technology may include global system for mobile communications (GSM), General Packet Radio Service (GPRS), code division multiple access (code division multiple access, CDMA), Wideband Code Division Multiple Access (WCDMA), time-division code division multiple access (time-division code division multiple access, TD-SCDMA), Long Term Evolution (LTE), LTE, BT, GNSS, WLAN, NFC, FM, and/or IR technologies, etc. The GNSS may include a Global Positioning System (GPS), a global navigation satellite system (GLONASS), a beidou navigation satellite system (BDS), a quasi-zenith satellite system (QZSS), and/or a Satellite Based Augmentation System (SBAS).
The electronic device 100 implements display functions via the GPU, the display screen 194, and the application processor. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and an application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
The display screen 194 is used to display images, video, and the like. The display screen 194 includes a display panel. The display panel may adopt a Liquid Crystal Display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (active-matrix organic light-emitting diode, AMOLED), a flexible light-emitting diode (FLED), a miniature, a Micro-oeld, a quantum dot light-emitting diode (QLED), and the like. In some embodiments, the electronic device 100 may include 1 or N display screens 194, with N being a positive integer greater than 1.
The electronic device 100 may implement a shooting function through the ISP, the camera 193, the video codec, the GPU, the display 194, the application processor, and the like.
The ISP is used to process the data fed back by the camera 193. For example, when a photo is taken, the shutter is opened, light is transmitted to the camera photosensitive element through the lens, the optical signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing and converting into an image visible to naked eyes. The ISP can also carry out algorithm optimization on the noise, brightness and skin color of the image. The ISP can also optimize parameters such as exposure, color temperature and the like of a shooting scene. In some embodiments, the ISP may be provided in camera 193.
The camera 193 is used to capture still images or video. The object generates an optical image through the lens and projects the optical image to the photosensitive element. The photosensitive element may be a Charge Coupled Device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The light sensing element converts the optical signal into an electrical signal, which is then passed to the ISP where it is converted into a digital image signal. And the ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into image signal in standard RGB, YUV and other formats. In some embodiments, the electronic device 100 may include 1 or N cameras 193, N being a positive integer greater than 1.
The digital signal processor is used for processing digital signals, and can process digital image signals and other digital signals. For example, when the electronic device 100 selects a frequency bin, the digital signal processor is used to perform fourier transform or the like on the frequency bin energy.
Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 may play or record video in a variety of encoding formats, such as: moving Picture Experts Group (MPEG) 1, MPEG2, MPEG3, MPEG4, and the like.
The NPU is a neural-network (NN) computing processor that processes input information quickly by using a biological neural network structure, for example, by using a transfer mode between neurons of a human brain, and can also learn by itself continuously. Applications such as intelligent recognition of the electronic device 100 can be realized through the NPU, for example: image recognition, face recognition, speech recognition, text understanding, and the like.
The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to extend the memory capability of the electronic device 100. The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function. For example, files such as music, video, etc. are saved in the external memory card.
The internal memory 121 may be used to store computer-executable program code, which includes instructions. The processor 110 executes various functional applications of the electronic device 100 and data processing by executing instructions stored in the internal memory 121. The internal memory 121 may include a program storage area and a data storage area. The storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required by at least one function, and the like. The storage data area may store data (such as audio data, phone book, etc.) created during use of the electronic device 100, and the like. In addition, the internal memory 121 may include a high-speed random access memory, and may further include a nonvolatile memory, such as at least one magnetic disk storage device, a flash memory device, a universal flash memory (UFS), and the like.
The electronic device 100 may implement audio functions via the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone interface 170D, and the application processor. Such as music playing, recording, etc.
The audio module 170 is used to convert digital audio information into an analog audio signal output and also to convert an analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be disposed in the processor 110, or some functional modules of the audio module 170 may be disposed in the processor 110.
The speaker 170A, also called a "horn", is used to convert the audio electrical signal into a sound signal. The electronic apparatus 100 can listen to music through the speaker 170A or listen to a handsfree call.
The receiver 170B, also called "earpiece", is used to convert the electrical audio signal into an acoustic signal. When the electronic apparatus 100 receives a call or voice information, it can receive voice by placing the receiver 170B close to the ear of the person.
The microphone 170C, also referred to as a "microphone," is used to convert sound signals into electrical signals. When a call is placed or a voice message is sent or it is desired to trigger the electronic device 100 to perform some function by the voice assistant, the user may speak via his/her mouth near the microphone 170C and input a voice signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C to achieve a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further include three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, perform directional recording, and so on.
The earphone interface 170D is used to connect a wired earphone. The headset interface 170D may be the USB interface 130, or may be a 3.5mm open mobile electronic device platform (OMTP) standard interface, a cellular telecommunications industry association (cellular telecommunications industry association) standard interface of the USA.
The pressure sensor 180A is used for sensing a pressure signal, and can convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 180A may be disposed on the display screen 194. The pressure sensor 180A can be of a variety of types, such as a resistive pressure sensor, an inductive pressure sensor, a capacitive pressure sensor, and the like. The capacitive pressure sensor may be a sensor comprising at least two parallel plates having an electrically conductive material. When a force acts on the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device 100 determines the strength of the pressure from the change in capacitance. When a touch operation is applied to the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A. The electronic apparatus 100 may also calculate the touched position from the detection signal of the pressure sensor 180A. In some embodiments, the touch operations that are applied to the same touch position but different touch operation intensities may correspond to different operation instructions. For example: and when the touch operation with the touch operation intensity smaller than the first pressure threshold value acts on the short message application icon, executing an instruction for viewing the short message. And when the touch operation with the touch operation intensity larger than or equal to the first pressure threshold value acts on the short message application icon, executing an instruction of newly building the short message.
The gyro sensor 180B may be used to determine the motion attitude of the electronic device 100. In some embodiments, the angular velocity of electronic device 100 about three axes (i.e., the x, y, and z axes) may be determined by gyroscope sensor 180B. The gyro sensor 180B may be used for photographing anti-shake. For example, when the shutter is pressed, the gyro sensor 180B detects a shake angle of the electronic device 100, calculates a distance to be compensated for by the lens module according to the shake angle, and allows the lens to counteract the shake of the electronic device 100 through a reverse movement, thereby achieving anti-shake. The gyroscope sensor 180B may also be used for navigation, somatosensory gaming scenes.
The air pressure sensor 180C is used to measure air pressure. In some embodiments, electronic device 100 calculates altitude, aiding in positioning and navigation, from barometric pressure values measured by barometric pressure sensor 180C.
The magnetic sensor 180D includes a hall sensor. The electronic device 100 may detect the opening and closing of the flip holster using the magnetic sensor 180D. In some embodiments, when the electronic device 100 is a flip phone, the electronic device 100 may detect the opening and closing of the flip according to the magnetic sensor 180D. And then according to the opening and closing state of the leather sheath or the opening and closing state of the flip cover, the automatic unlocking of the flip cover is set.
The acceleration sensor 180E may detect the magnitude of acceleration of the electronic device 100 in various directions (typically three axes). The magnitude and direction of gravity can be detected when the electronic device 100 is stationary. The method can also be used for recognizing the posture of the electronic equipment, and is applied to horizontal and vertical screen switching, pedometers and other applications.
A distance sensor 180F for measuring a distance. The electronic device 100 may measure the distance by infrared or laser. In some embodiments, taking a picture of a scene, electronic device 100 may utilize range sensor 180F to range for fast focus.
The proximity light sensor 180G may include, for example, a Light Emitting Diode (LED) and a light detector, such as a photodiode. The light emitting diode may be an infrared light emitting diode. The electronic device 100 emits infrared light to the outside through the light emitting diode. The electronic device 100 detects infrared reflected light from nearby objects using a photodiode. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100. When insufficient reflected light is detected, the electronic device 100 may determine that there are no objects near the electronic device 100. The electronic device 100 can utilize the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear for talking, so as to automatically turn off the screen to achieve the purpose of saving power. The proximity light sensor 180G may also be used in a holster mode, a pocket mode automatically unlocks and locks the screen.
The ambient light sensor 180L is used to sense ambient light brightness. Electronic device 100 may adaptively adjust the brightness of display screen 194 based on the perceived ambient light level. The ambient light sensor 180L may also be used to automatically adjust the white balance when taking a picture. The ambient light sensor 180L may also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket to prevent accidental touches.
The fingerprint sensor 180H is used to collect a fingerprint. The electronic device 100 can utilize the collected fingerprint characteristics to unlock the fingerprint, access the application lock, photograph the fingerprint, answer an incoming call with the fingerprint, and so on.
The temperature sensor 180J is used to detect temperature. In some embodiments, electronic device 100 implements a temperature processing strategy using the temperature detected by temperature sensor 180J. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device 100 performs a reduction in performance of a processor located near the temperature sensor 180J, so as to reduce power consumption and implement thermal protection. In other embodiments, the electronic device 100 heats the battery 142 when the temperature is below another threshold to avoid the low temperature causing the electronic device 100 to shut down abnormally. In other embodiments, when the temperature is lower than a further threshold, the electronic device 100 performs boosting on the output voltage of the battery 142 to avoid abnormal shutdown due to low temperature.
The touch sensor 180K is also referred to as a "touch panel". The touch sensor 180K may be disposed on the display screen 194, and the touch sensor 180K and the display screen 194 form a touch screen, which is also called a "touch screen". The touch sensor 180K is used to detect a touch operation applied thereto or nearby. The touch sensor can communicate the detected touch operation to the application processor to determine the touch event type. Visual output associated with the touch operation may be provided through the display screen 194. In other embodiments, the touch sensor 180K may be disposed on a surface of the electronic device 100, different from the position of the display screen 194.
The bone conduction sensor 180M can acquire a vibration signal. In some embodiments, the bone conduction sensor 180M may acquire a vibration signal of the human vocal part vibrating the bone mass. The bone conduction sensor 180M may also contact the human pulse to receive the blood pressure pulsation signal. In some embodiments, bone conduction sensor 180M may also be provided in a headset, integrated into a bone conduction headset. The audio module 170 may analyze a voice signal based on the vibration signal of the bone mass vibrated by the sound part acquired by the bone conduction sensor 180M, so as to implement a voice function. The application processor can analyze heart rate information based on the blood pressure beating signal acquired by the bone conduction sensor 180M, so as to realize the heart rate detection function.
The keys 190 include a power-on key, a volume key, and the like. The keys 190 may be mechanical keys. Or may be touch keys. The electronic apparatus 100 may receive a key input, and generate a key signal input related to user setting and function control of the electronic apparatus 100.
The motor 191 may generate a vibration cue. The motor 191 may be used for incoming call vibration cues, as well as for touch vibration feedback. For example, touch operations applied to different applications (e.g., photographing, audio playing, etc.) may correspond to different vibration feedback effects. The motor 191 may also respond to different vibration feedback effects for touch operations applied to different areas of the display screen 194. Different application scenes (such as time reminding, receiving information, alarm clock, game and the like) can also correspond to different vibration feedback effects. The touch vibration feedback effect may also support customization.
Indicator 192 may be an indicator light that may be used to indicate a state of charge, a change in charge, or a message, missed call, notification, etc.
The SIM card interface 195 is used to connect a SIM card. The SIM card can be brought into and out of contact with the electronic apparatus 100 by being inserted into the SIM card interface 195 or being pulled out of the SIM card interface 195. The electronic device 100 may support 1 or N SIM card interfaces, N being a positive integer greater than 1. The SIM card interface 195 may support a Nano SIM card, a Micro SIM card, a SIM card, etc. The same SIM card interface 195 can be inserted with multiple cards at the same time. The types of the plurality of cards may be the same or different. The SIM card interface 195 may also be compatible with different types of SIM cards. The SIM card interface 195 may also be compatible with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as communication and data communication. In some embodiments, the electronic device 100 employs esims, namely: an embedded SIM card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100.
The software system of the electronic device 100 may employ a layered architecture, an event-driven architecture, a micro-core architecture, a micro-service architecture, or a cloud architecture. In this embodiment, a software structure of the electronic device 100 is exemplarily illustrated by taking an Android system with a layered architecture as an example.
Please refer to fig. 2, which is a block diagram of a software structure of an electronic device 100 according to the present embodiment. Wherein, the layered architecture divides the software into a plurality of layers, and each layer has clear roles and division of labor. The layers communicate with each other through a software interface. In some embodiments, the Android system is divided into four layers, an application layer, an application framework layer, an Android runtime (Android runtime) and system library, and a kernel layer from top to bottom.
The application layer may include a series of application packages.
As shown in fig. 2, the application package may include voice assistant, gallery, calendar, call, map, navigation, WLAN, bluetooth, music, video, short message, etc. applications.
The application framework layer provides an Application Programming Interface (API) and a programming framework for the application program of the application layer. The application framework layer includes a number of predefined functions.
As shown in FIG. 2, the application framework layers may include a window manager, content provider, view system, phone manager, resource manager, notification manager, and the like.
The window manager is used for managing window programs. The window manager can obtain the size of the display screen, judge whether a status bar exists, lock the screen, intercept the screen and the like.
The content provider is used to store and retrieve data and make it accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phone books, etc.
The view system includes visual controls such as controls to display text, controls to display pictures, and the like. The view system may be used to build applications. The display interface may be composed of one or more views. For example, the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures.
The phone manager is used to provide communication functions of the electronic device 100. Such as management of call status (including on, off, etc.).
The resource manager provides various resources for the application, such as localized strings, icons, pictures, layout files, video files, and the like.
The notification manager enables the application to display notification information in the status bar, can be used to convey notification-type messages, can disappear automatically after a short dwell, and does not require user interaction. Such as a notification manager used to notify download completion, message alerts, etc. The notification manager may also be a notification that appears in the form of a chart or scroll bar text at the top status bar of the system, such as a notification of a background running application, or a notification that appears on the screen in the form of a dialog window. For example, prompting text information in the status bar, sounding a prompt tone, vibrating the electronic device, flashing an indicator light, etc.
The Android Runtime comprises a core library and a virtual machine. The Android runtime is responsible for scheduling and managing an Android system.
The core library comprises two parts: one part is a function which needs to be called by java language, and the other part is a core library of android.
The application layer and the application framework layer run in a virtual machine. And executing java files of the application program layer and the application program framework layer into a binary file by the virtual machine. The virtual machine is used for performing the functions of object life cycle management, stack management, thread management, safety and exception management, garbage collection and the like.
The system library may include a plurality of functional modules. For example: surface managers (surface managers), Media Libraries (Media Libraries), three-dimensional graphics processing Libraries (e.g., OpenGL ES), 2D graphics engines (e.g., SGL), and the like.
The surface manager is used to manage the display subsystem and provide fusion of 2D and 3D layers for multiple applications.
The media library supports a variety of commonly used audio, video format playback and recording, and still image files, among others. The media library may support a variety of audio-video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
The three-dimensional graphic processing library is used for realizing three-dimensional graphic drawing, image rendering, synthesis, layer processing and the like.
The 2D graphics engine is a drawing engine for 2D drawing.
The kernel layer is a layer between hardware and software. The inner core layer at least comprises a display driver, a camera driver, an audio driver and a sensor driver.
For example, the technical solutions involved in the following embodiments may be implemented in the electronic device 100 having the above hardware architecture and software architecture. The method for triggering the electronic device to execute the function provided by the embodiment is described in detail below with reference to the accompanying drawings and application scenarios.
Fig. 3 is a flowchart illustrating a method for triggering an electronic device to execute a function according to this embodiment. As shown in fig. 3, the method may include the following S301-S304.
One or more first awakening words are arranged in the electronic equipment. The first wake-up word may be predefined or user-defined. If the first wake-up word is user-defined, the process of registering the first wake-up word in the electronic device may refer to specific descriptions in the conventional technology, which is not described herein in detail in this embodiment. The first wake-up word may be a voice command that the user will frequently use, such as "mute", "answer call", etc. The first wake-up word set in the electronic device may correspond to a first instruction. In this embodiment, when a plurality of first wake-up words (or at least two first wake-up words) are set in the electronic device, functions executed by the electronic device in response to first instructions corresponding to different first wake-up words are different. In the following embodiments, the electronic device is provided with at least two first wake-up words as an example for detailed description.
S301, the electronic equipment receives first voice data input by a user.
The first DSP of the electronic equipment can monitor whether the user has voice data input in real time through the microphone under the condition that the electronic equipment does not have other software and hardware and uses the microphone to collect the voice data. Generally, when a user wants to trigger the electronic device to perform some functions by inputting voice data, the user can speak near a microphone of the mobile phone to input the spoken sounds into the microphone. At this time, if the electronic device does not have other software and hardware using the microphone to collect voice data, the first DSP of the electronic device may monitor corresponding voice data, such as the first voice data, through the microphone.
It should be noted that, in this embodiment, when the electronic device is in any state of a black screen, a lock screen, a bright screen, and the like, as long as the electronic device does not have other software and hardware that are using the microphone to collect voice data, the first DSP of the electronic device may monitor the voice data of the user. In the black screen state, the AP of the electronic device may be in a sleep state or a non-sleep state.
S302, the electronic equipment judges whether a wake-up word with a text matched with a text corresponding to the first voice data exists in the at least two first wake-up words.
After the electronic device receives the first voice data, text verification can be performed on the first voice data, that is, it is determined whether a wakeup word whose text matches a text corresponding to the first voice data exists in at least two first wakeup words set in the electronic device, so as to determine whether the received first voice data is the first wakeup word set in the electronic device.
If the text check is passed, it indicates that the received first voice data is the first wakeup word set in the electronic device, and at this time, the electronic device may perform S303. If the text check is not passed, it indicates that the received first voice data is not the first wakeup word set in the electronic device, and the electronic device may perform S304.
In some embodiments, the text verification of the first voice data by the electronic device may specifically include: and the first DSP of the electronic equipment performs text verification on the first voice data, and/or the AP of the electronic equipment performs text verification on the first voice data. Wherein, if the electronic device performs text verification on the first voice data specifically comprises: the first DSP performs text verification on the first voice data, and the AP performs text verification on the first voice data, so that the accuracy of text verification performed by the first DSP and the AP can be different. For example, the first DSP performs text verification of a first precision on the first voice data, and the AP performs text verification of a second precision on the first voice data, where the first precision is smaller than the second precision.
For example, the text verification of the first voice data by the electronic device specifically includes: the first DSP performs text verification of a first precision on the first voice data, and the AP performs text verification of a second precision on the first voice data, and the process of performing text verification on the first voice data is described in detail by taking an example that the AP of the electronic device is in a sleep state. After the first DSP of the electronic device monitors the first voice data, the first DSP may perform a text check of a first precision (or referred to as a low precision) on the first voice data. That is to say, the first DSP may determine whether there is a wakeup word whose matching degree of the text corresponding to the first voice data satisfies the first precision, in the at least two first wakeup words. If the first DSP determines that, among the at least two first wake-up words, there is a wake-up word whose matching degree of the text corresponding to the first voice data meets the first precision, that is, the first DSP passes the text verification of the first precision of the first voice data, the first DSP may wake up the AP of the electronic device and transmit the monitored first voice data to the AP. If the first DSP determines that there is no wakeup word whose matching degree of the text corresponding to the first voice data satisfies the first precision in the at least two first wakeup words, it indicates that the received first voice data is not the first wakeup word set in the electronic device, and the electronic device may execute S304.
After receiving the first voice data, the AP of the electronic device may perform text verification with a second precision (or referred to as high precision) on the first voice data. That is to say, the AP may determine whether there is an awaking word whose matching degree between the text and the text corresponding to the first voice data satisfies the second precision in the at least two first awaking words. If the AP determines that there is an awakening word of which the matching degree of the text corresponding to the first voice data and the text in the at least two first awakening words satisfies the second precision, that is, if the AP passes the text verification of the second precision of the first voice data, it indicates that the received first voice data is the first awakening word set in the electronic device, and the electronic device may execute S303. If the AP of the electronic device determines that there is no wakeup word with a matching degree of the text corresponding to the first voice data and the text meeting the second precision in the at least two first wakeup words, that is, the AP fails to check the text of the first voice data with the second precision, it indicates that the received first voice data is not the first wakeup word set in the electronic device, and the electronic device may execute S304.
S303, the electronic device determines a first instruction corresponding to the first voice data, and executes a function corresponding to the first instruction through a main processor of the electronic device.
If the electronic device determines that the awakening words with texts matched with the texts corresponding to the first voice data exist in the at least two first awakening words, the electronic device can determine the first instructions corresponding to the first voice data and execute the functions corresponding to the first instructions, so that the purpose of triggering the electronic device to execute the corresponding functions by inputting the voice data is achieved.
For example, the determining, by the electronic device, the first instruction corresponding to the first voice data may specifically be: the electronic equipment stores the corresponding relation between at least two first awakening words and the instruction, wherein the first awakening words and the instruction are arranged in the electronic equipment. After the text of the first voice data is verified by the electronic device, the electronic device may search the corresponding relationship to determine a first instruction corresponding to the first voice data.
For another example, the electronic device may perform semantic analysis on the text of the first voice data to determine a first instruction corresponding to the first voice data. The semantic analysis function of the electronic device can be implemented in the electronic device by a certain separate module, or can be implemented by a module integrated in a certain application program. For example, the voice analysis function is implemented by a module integrated in the voice assistant. Then, when the electronic device determines that an awakening word whose text matches the text corresponding to the first voice data exists in the at least two first awakening words, the electronic device may start the voice assistant, perform semantic analysis on the text corresponding to the first voice data by the voice assistant to determine a first instruction corresponding to the first voice data, and trigger the electronic device to execute a function corresponding to the first instruction by the main processor. It can be seen that in this implementation, the first voice data may not only wake up the voice assistant as a wake-up word, but also trigger the electronic device to execute a corresponding function as a voice command. The voice assistant may be an Application (APP) installed in the electronic device. In this embodiment, the voice assistant may be a system application or a third party application. System applications, which may also be referred to as embedded applications, are application programs provided as part of an implementation of an electronic device. The third-party application, which may also be referred to as a downloadable application, is an application program that may provide its own Internet Protocol Multimedia Subsystem (IMS) connection, and may be pre-installed in the electronic device or downloaded and installed in the electronic device by a user.
S304, the electronic equipment deletes the first voice data.
According to the method for triggering the electronic device to execute the function, the electronic device can determine the instruction corresponding to the input voice data after the received voice data passes the verification, so as to trigger the electronic device to execute the function corresponding to the instruction. It can be seen that as long as the electronic device does not have other software and hardware to use the microphone to collect voice data (even in the black screen state, and the AP is in the sleep state), the user does not need to input a wakeup word first, so that the electronic device starts the voice assistant and then inputs a voice command, but inputs a piece of voice data to trigger the electronic device to execute a corresponding function. Therefore, the use efficiency of the electronic equipment is improved, and the efficient interaction between the electronic equipment and a user is realized. Meanwhile, the use experience of the user is improved.
In some embodiments, if the first wake-up word is a user-defined wake-up word, the voiceprint check of the first voice data may be continued after the text check of the first voice data is passed. That is, after the text verification passes, the AP may determine whether the voiceprint feature of the first voice data matches the voiceprint feature corresponding to the at least two first wake-up words set in the electronic device. If the voiceprint feature of the first voice data matches the voiceprint feature corresponding to at least two first wake-up words set in the electronic device, the electronic device may execute S303 above if the voiceprint check on the first voice data passes. If the voiceprint feature of the first voice data does not match the voiceprint features corresponding to the at least two first wake-up words set in the electronic device, the voiceprint check on the first voice data fails, and the electronic device may perform S304 described above. That is, after the electronic device receives the first voice data, the text check and the voiceprint check of the first voice data may both be passed, and then the S303 may be executed. Therefore, only the person who registers the awakening word in the electronic equipment can trigger the electronic equipment to execute the corresponding function by inputting the voice data, and the use safety of the voice control service is improved.
For example, when the at least two first wake-up words are set in the electronic device, the electronic device may generate the first voiceprint model from voice data input by the user when the at least two first wake-up words are set. The first voiceprint model can be used to characterize voiceprint characteristics of the at least two first wake words. If the electronic device determines that an awakening word with a text matching the text corresponding to the first voice data exists in the at least two first awakening words, that is, the text passes verification, the AP may continue to perform voiceprint verification on the first voice data according to the first voiceprint model. Specifically, the method comprises the following steps: after the electronic device generates the first voiceprint model, the voice data input by the user when at least two first awakening words are set can be used as an input value, and the first voiceprint threshold is obtained after the voice data are input into the first voiceprint model. After the electronic device determines that the monitored first voice data passes the text check, the first voice data can be used as an input value and input into the first voiceprint model to obtain a voiceprint value, such as a second voiceprint value. The electronic device may determine whether a difference between the second voiceprint value and the first voiceprint threshold is less than a preset threshold. If the difference value between the second voiceprint value and the first voiceprint threshold is smaller than the preset threshold value, the voiceprint verification is passed. If the difference value between the second voiceprint value and the first voiceprint threshold is greater than or equal to the preset threshold value, the voiceprint verification is not passed.
Illustratively, a plurality of predefined first wake-up words are set in the electronic device. For example, as shown in table 1, the first wake-up word set in the electronic device includes the following common voice commands: system setting commands such as "mute", "unmute", "volume up" (or "louder"), "volume down" (or "louder"), "lock screen", etc. Navigation setting commands such as "exit navigation", "stop navigation", "switch routes" (or "change one route"), "navigate home", "navigate to company", and the like. A music setting command such as "previous", "next" (or "cut song"), "pause music" (or "pause play"), "start music" (or "start play"), "stop music" (or "stop play") and the like. Communication setting commands such as "hang up the phone", "answer the phone", "view the short message", "reply the short message", "read the WeChat", "reply the WeChat", etc.
TABLE 1
Figure GPA0000293431030000221
For example, the electronic device performs semantic analysis through a voice assistant to obtain an instruction corresponding to the voice data. No other software or hardware currently uses a microphone to collect voice data in an electronic device. And as shown in fig. 4 (a), the electronic device is in a black screen state, and the AP of the electronic device is also in a sleep state. The user wants to turn down the volume of the electronic device by inputting voice data to trigger the electronic device. The user may say "turn down" near the microphone of the handset. The first DSP of the electronic device may monitor the corresponding voice data "turn down volume" through the microphone. After the first DSP monitors that the volume of the voice data is turned down, text verification of first precision can be carried out on the voice data of the volume turned down. When the first DSP passes the text verification of the first accuracy of the "down volume" for the voice data, the first DSP may wake up the AP and transmit the "down volume" for the voice data to the AP. The AP may perform a second accuracy text check on the voice data "turn down volume". When the AP passes the text verification of the second precision of the volume turning-down of the voice data, the voice assistant can be started, and the voice assistant performs semantic analysis on the volume turning-down of the voice data so as to determine the instruction corresponding to the volume turning-down of the voice data. At this time, the AP triggers the electronic device to turn down the volume of the system according to the instruction.
After the user knows the input voice data conveniently, whether the electronic equipment has responded correspondingly or not is judged. The electronic device may light up the screen and display a prompt message after the corresponding response is made to prompt the user that the corresponding response has been made. For example, in connection with the above example, as shown in fig. 4 (b), after the voice assistant is started, the electronic device may light up the screen and display the voice assistant interface 401. The voice assistant interface 401 may include the text "turn down" 402 corresponding to the recognized voice data entered by the user. After the electronic device turns the volume of the system down, a prompt 403 may be displayed in the voice assistant interface 401. The prompt 403 is used to prompt the user that the volume of the system has been turned down. It can be seen that, under the condition that the electronic device is in the black screen state and the AP is in the sleep state, the user does not need to input the voice data "hello little E" first to wake up the voice assistant and then input the voice data "turn down the volume (or turn down the volume)" of the mobile phone ", but only needs to input one piece of voice data" turn down the volume ", so as to trigger the electronic device to turn down the volume, thereby achieving the intention. The efficiency of human-computer interaction is improved, and user experience is greatly improved.
It will be appreciated that since the first DSP described above has a relatively high processing power, its power consumption is also relatively high. In the embodiment, the power consumption of the electronic equipment is saved as much as possible while the efficient interaction between the electronic equipment and the user is realized. The first DSP can be used to process the voice data collected by the microphone only in a specific scenario. That is, the electronic device may be triggered to execute the corresponding function by inputting voice data only in a specific scenario by executing the above S301-S304. That is, as shown in fig. 5, before S301, the method for triggering the electronic device to execute the function may further include S501.
S501, the electronic equipment enters a preset mode.
The predetermined mode may be a driving mode, a home mode, etc., among others. In the predetermined mode, when the electronic device has no other software or hardware to use the microphone to collect voice data, the microphone of the electronic device may transmit the collected voice data to the first DSP of the electronic device, so that the first DSP processes the voice data.
In some embodiments, the electronic device may automatically enter a predetermined mode under certain circumstances. For example, the electronic device may automatically enter the predetermined mode at certain specific time periods, or at certain specific locations.
Generally, the electronic device may acquire the user's intention to use the electronic device at the current location or the current time using the historical usage record, such as whether to trigger the electronic device to execute a corresponding function by inputting voice data. The electronic device may automatically enter the predetermined mode when the intention to use the electronic device is to trigger the electronic device to perform the corresponding function by inputting voice data. That is, the electronic device may utilize the historical usage record to determine when and/or where the user frequently triggered the electronic device to perform the corresponding function by inputting voice data. Upon determining that the current time or current location matches the historical usage record, the electronic device may automatically enter a predetermined mode. For example, the electronic device obtains that the user is at 19: 00-20: the electronic device is often triggered to perform the corresponding function by inputting voice data during this period of time 30. Then, the electronic device may determine that the current system time is at 19: 00-20: and 30, automatically entering a preset mode. For another example, if the electronic device acquires that the user is within a certain geographic location range (e.g., the geographic location range is the user's home), the electronic device is often triggered to perform the corresponding function by inputting voice data, and then the electronic device may automatically enter a predetermined mode (e.g., a home mode) when it is determined that the geographic location of the current electronic device is within the geographic location range.
In other embodiments, the electronic device may automatically enter a predetermined mode (e.g., a driving mode) upon determining that the speed of movement of the electronic device is greater than a particular value. Generally, when a user drives a car, voice data may be input to trigger the electronic device to perform a corresponding function, such as navigation through a map application of the electronic device triggered by a voice assistant. Therefore, when the electronic device detects that the current moving speed of the electronic device is larger than a specific value, a predetermined mode (such as a driving mode) is automatically entered.
In still other embodiments, the electronic device may enter the predetermined mode in response to a specific input by the user. The specific input may be a user's trigger operation of a specific virtual button or physical key. For example, the specific input is a user's trigger operation on a specific virtual button (e.g., a switch button of a "predetermined mode" option). The electronic device comprises Settings (Settings). As shown in fig. 6, the electronic device may receive a user's click operation on a set icon. In response to a user's click operation on the set icon, the electronic device may display a setting interface 601 illustrated in fig. 6. The setting interface 601 may include an "airplane mode" setting option, a "WLAN" setting option, a "bluetooth" setting option, a "mobile network" setting option, a "predetermined mode" setting option (as shown in fig. 6 by way of example as a "driving mode" setting option), and the like. The specific functions of the "flight mode" option, the "WLAN" option, the "bluetooth" option, and the "mobile network" option may refer to specific descriptions in the conventional technology, and are not described herein again in this embodiment. When the user wants to trigger the electronic device to execute the corresponding function by using the first wake-up word, the user may click the switch button 602 of the "driving mode" option. The electronic device may enter a predetermined mode, such as a driving mode, in response to a user clicking on the switch button 602 of the "driving mode" option. When the user performs the clicking operation again on the switch button 602 of the "driving mode" option, the electronic device may exit the driving mode. In this case, the display effect of the switch button 602 of the "driving mode" option shown in fig. 6 is used to indicate that the driving mode is not turned on, and the user may perform a click operation on the switch button 602 at this time, so that the electronic device enters the driving mode.
The specific input may also be a voice command input by the user. The voice command may be entered by a voice assistant of the electronic device. For example, a second wake-up word may also be provided in the electronic device. The second wake word may be used to wake up a voice assistant in the electronic device. After the user wakes up the voice assistant by the second wake-up word, a voice command can be input by the voice assistant to trigger the electronic device to enter the predetermined mode. Before the electronic device enters the predetermined mode, the electronic device may monitor the voice data using another DSP, such as a second DSP, so that the voice assistant may be awakened by a second wake-up word. After the electronic device exits the predetermined mode or is not in the predetermined mode, the electronic device may use the second DSP to process the voice data collected by the microphone. Because the processing performance of the second DSP is lower than that of the first DSP, the memory is smaller than that of the first DSP. Therefore, the power consumption is low compared to the first DSP. Therefore, efficient interaction between the electronic equipment and the user when the user uses the voice assistant in a preset scene can be guaranteed, and meanwhile, the power consumption of the electronic equipment is saved. Moreover, when the electronic device is in a non-specific scene, the electronic device may still be triggered by the voice assistant to perform a corresponding function, such as triggering the electronic device to enter a predetermined mode.
For example, the voice assistant is in a dormant state. When the user wants to enter a voice command via the voice assistant to trigger the electronic device to enter a predetermined mode, the user may speak near the microphone of the handset to input the spoken sound into the microphone. At this time, the second DSP of the electronic device may monitor the voice data input by the user, such as the second voice data, through the microphone. After the second DSP of the electronic device monitors the second voice data, the electronic device may determine whether the second voice data matches a second wakeup word set in the electronic device. The voice assistant may be activated when the second voice data matches the second wake up word. At this time, the user may input voice data, such as third voice data, for triggering the electronic device to enter the predetermined mode through the voice assistant. The electronic equipment can receive third voice data input by a user through the voice assistant and determine a second instruction corresponding to the third voice data. The second instruction may be for instructing the electronic device to enter a predetermined mode. The electronic device may perform a function corresponding to the second instruction, i.e., enter the predetermined mode.
The specific way for the electronic device to determine whether the second voice data matches the second wakeup word set in the electronic device may be: and the electronic equipment performs text verification on the second voice data, if the text corresponding to the second voice data is matched with the text of the second awakening word, the text verification is passed, and the second voice data is matched with the second awakening word. And if the text corresponding to the second voice data is not matched with the text of the second awakening word, the text verification fails, and the second voice data is not matched with the second awakening word. Or, the specific way for the electronic device to determine whether the second voice data matches the second wake-up word set in the electronic device may be: and the electronic equipment performs text verification and voiceprint verification on the second voice data. And if the text corresponding to the second voice data is matched with the text of the second awakening word and the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word, the text verification and the voiceprint verification are passed, and the second voice data is matched with the second awakening word. If the text corresponding to the second voice data is not matched with the text of the second awakening word, or the voiceprint feature of the second voice data is not matched with the voiceprint feature corresponding to the second awakening word, the text check and the voiceprint check fail, and the second voice data is not matched with the second awakening word.
In some embodiments, the electronic device text-checking the second speech data may include: and the second DSP of the electronic equipment performs text verification of third precision on the second voice data, and/or the AP of the electronic equipment performs text verification of fourth precision on the second voice data. The third precision is less than the fourth precision. The third accuracy may be the same as or different from the first accuracy. The fourth accuracy may be the same as the second accuracy or may be different from the first accuracy.
For example, the determining, by the electronic device, whether the second voice data matches a second wakeup word set in the electronic device specifically includes: and the electronic equipment performs text verification and voiceprint verification on the second voice data. The electronic equipment performs text verification on the second voice data and comprises the following steps: the text verification of the second voice data with the third precision by the second DSP and the text verification of the second voice data with the fourth precision by the AP are taken as examples, and the process of determining whether the second voice data is matched with the second wakeup word by the electronic device will be described in detail. After monitoring the second voice data, the second DSP of the electronic device may determine whether a matching degree between a text corresponding to the second voice data and a text of the second wakeup word satisfies a third precision. And if the matching degree of the text corresponding to the second voice data and the text of the second awakening word meets the third precision, transmitting the second voice data to the AP. The AP may determine whether a matching degree of the text corresponding to the second voice data and the text of the second wakeup word satisfies a fourth accuracy. If the matching degree of the text corresponding to the second voice data and the text of the second awakening word meets the fourth precision, the AP of the electronic device can judge whether the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word. If the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word, the second voice data is the second awakening word arranged in the electronic equipment, and at this time, the electronic equipment can start the voice assistant.
After the electronic device enters the predetermined mode, if the user wants to trigger the electronic device to execute the corresponding function by inputting the voice data, the user only needs to input the first awakening word to realize the intention. That is to say, after the electronic device enters the predetermined mode, the user does not need to input the voice data for multiple times, but directly uses the first wake-up word to trigger the electronic device to execute the corresponding function. The power consumption of the electronic equipment is saved as far as possible while the efficient interaction between the electronic equipment and the user is realized.
The following describes a method for triggering the electronic device to execute a function according to the present embodiment with reference to fig. 7 and 8. For example, the electronic device is a mobile phone. The mobile phone comprises a voice assistant. The mobile phone performs semantic analysis on the text corresponding to the voice data input by the user through the voice assistant to obtain a corresponding instruction. The mobile phone comprises two DSPs, namely a DSP 1 and a DSP 2. The handset includes a microphone that establishes a path with only one of the two DSPs at a time. The microphone establishes a path with the DSP2 by default. And after the mobile phone enters a driving mode, switching the microphone path from the DSP2 to the DSP 1.
5 predefined first awakening words are arranged in the mobile phone, and the words are respectively as follows: quit navigation, stop navigation, switch routes, navigate home, and navigate to the company. The mobile phone is also provided with 1 second awakening word 'hello small E'.
As shown in fig. 7 (a), the cellular phone is in a black screen state. The AP is in a sleep state. The voice assistant is in a sleep state. When a user drives a car, a microphone close to the mobile phone says "hello little E". The microphone of the mobile phone acquires voice data 1 corresponding to the 'hello small E'. And the microphone of the mobile phone transmits the collected voice data 1 to the DSP 2. The DSP 2 judges whether the matching degree of the text corresponding to the voice data 1 and the text of the second wake-up word "hello small E" set in the mobile phone satisfies the accuracy 1, to determine whether the voice data 1 is suspected to be the second wake-up word "hello small E" set in the mobile phone. And when the DSP 2 determines that the received voice data 1 is suspected to be a second awakening word 'hello small E' arranged in the mobile phone, awakening the AP from the dormant state and transmitting the voice data 1 to the AP of the mobile phone. After receiving the voice data 1, the AP determines whether the matching degree between the text corresponding to the voice data 1 and the text of the second wake-up word "hello-small E" set in the mobile phone meets the precision 2. Precision 2 is greater than precision 1. When determining that the matching degree of the text corresponding to the voice data 1 and the text of the second awakening word "hello xiao E" set in the mobile phone meets the precision 2, the AP of the mobile phone judges whether the voiceprint feature of the voice data 1 is matched with the voiceprint feature corresponding to the second awakening word "hello xiao E", that is, performs voiceprint verification. If the voiceprint check passes. The AP of the handset may wake up the voice assistant. As shown in fig. 7 (b), the cell phone lights up the screen and displays the voice assistant interface 701. Prompt messages 702 may be included in the voice assistant interface 701. The prompt message 702 is used to prompt the user that a voice command can be input at this time to trigger the mobile phone to execute a corresponding function.
The user speaks "enter driving mode" near the microphone of the handset. The handset may receive the corresponding voice data 2 via the voice assistant. As shown in fig. 7 (c), the mobile phone may display the text "enter driving mode" 704 corresponding to the voice data 2 in the voice assistant interface 703. The mobile phone performs semantic analysis on the voice data 2 through the voice assistant to determine an instruction corresponding to the voice data 2. The mobile phone can be triggered to enter a driving mode according to the instruction. And the mobile phone switches the microphone path from DSP 2 to DSP 1. After the cell phone enters the driving mode, as shown in fig. 7 (c), the cell phone may display a prompt message 705 in the voice assistant interface 703. The prompt message 705 is used to prompt the user that the user has entered the driving mode, and then the user can directly speak the first wake-up word to trigger the mobile phone to execute the corresponding function.
Generally, after the mobile phone executes the corresponding function according to the voice data received by the voice assistant, the voice assistant enters an idle state. Or the mobile phone controls the voice assistant to enter the dormant state again when the mobile phone determines that the user operation is not received within the preset time. In addition, if the user operation is not received within a certain time, the AP may re-enter the sleep state. In the prior art, after the voice assistant is in an idle state or enters a sleep state (or the voice assistant is in the idle state or enters the sleep state, and the AP is also in the sleep state), if the user wants to use the voice assistant again to trigger the mobile phone to execute a corresponding function, the user needs to re-input a wakeup word and then input a voice command to achieve an intention. However, in this embodiment, after the mobile phone enters the driving mode, even if the voice assistant is in the idle state or enters the sleep state (or the voice assistant is in the idle state or enters the sleep state and the AP is in the sleep state), the user only needs to input the first wake-up word to achieve the intention.
For example, after the mobile phone enters the driving mode, the voice assistant enters the sleep state again, and as shown in fig. 8 (a), the mobile phone is in the black screen state again, and the AP is in the sleep state. The user speaks "navigate home" near the microphone of the handset. The microphone of the mobile phone acquires voice data 3 corresponding to 'navigate home'. And the microphone of the mobile phone transmits the collected voice data 3 to the DSP 1. The DSP 1 can perform text matching with lower precision on the voice data 3, that is, determine whether there is an awakening word whose matching degree between the text and the text corresponding to the voice data 3 satisfies precision 3 among 5 first awakening words set in the mobile phone. The DSP 1 wakes up the AP when the lower precision text match passes and transmits the voice data 3 to the AP. At this time, the AP of the mobile phone can perform higher-accuracy text matching on the voice data 3, that is, determine whether there is an awakening word whose matching degree between the text and the text corresponding to the voice data 3 satisfies accuracy 4 among 5 first awakening words set in the mobile phone. Precision 4 is greater than precision 3. When the text verification with higher precision passes, the mobile phone can wake up the voice assistant. The mobile phone can also perform semantic analysis on the voice data 3 through the voice assistant to determine the instruction corresponding to the voice data 3. The mobile phone can call a corresponding interface according to the instruction to trigger the map application to display the corresponding navigation route to the user (or the mobile phone can simulate the click operation of the user according to the instruction so as to display the corresponding navigation route to the user in the map application). The mobile phone can also broadcast the navigation route information through a loudspeaker. For example, as shown in fig. 8 (b), the cell phone may light up the screen and display the voice assistant interface 801. The voice assistant interface 801 may include the text "navigate home" 802 corresponding to the recognized user-entered voice data 3. The voice assistant interface 801 is a jump interface, that is, after the voice assistant interface 801 is displayed on the display screen of the mobile phone, the user immediately jumps to the navigation interface 803 shown in (c) of fig. 8. In some embodiments, instead of displaying the voice assistant interface 801 on the display screen of the mobile phone, the mobile phone may directly light up the screen to display the navigation interface 803 after the user inputs "navigate home". Thus, the user can achieve the intention without inputting voice data for many times. The man-machine interaction efficiency is improved, and the user experience is improved.
It is understood that the electronic device includes hardware structures and/or software modules for performing the functions in order to realize the functions. Those of skill in the art will readily appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as hardware or combinations of hardware and computer software. Whether a function is performed as hardware or computer software drives hardware depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the embodiments of the present application.
The embodiment also provides electronic equipment for realizing the method embodiments. Specifically, the electronic device may be divided into functional modules, for example, each functional module may be divided according to each function, or two or more functions may be integrated into one processing module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. It should be noted that, in the embodiment of the present application, the division of the module is schematic, and is only one logic function division, and there may be another division manner in actual implementation.
In some embodiments, in the case of dividing each functional module by corresponding functions, fig. 9 shows a schematic diagram of a possible structure of the electronic device 900 involved in the foregoing embodiments, where the electronic device 900 may include: an input unit 901, a verification unit 902, a wake-up unit 903, and a determination execution unit 904.
Wherein the input unit 901 is used to support the electronic device 900 to perform S301 in the above method embodiments and/or other processes for the techniques described herein. For example, the input unit 901 enables the electronic device 900 to perform the receiving of the second voice data, the third voice data, and the like input by the user in the above method embodiment.
A verification unit 902 for enabling the electronic device 900 to perform S302 in the above-described method embodiments and/or other processes for the techniques described herein. For example, the verification unit 902 enables the electronic device 900 to perform voiceprint verification in the above-described method embodiments.
A wake-up unit 903 to enable the electronic device 900 to perform operations of waking up a main processor (e.g., an AP) in the above-described method embodiments and/or other processes for the techniques described herein.
A determination execution unit 904 for enabling the electronic device 900 to execute S303 in the above-described method embodiments and/or other processes for the techniques described herein.
In this embodiment of the application, further, as shown in fig. 9, the electronic device 900 may further include: a trigger unit 905 and a start unit 906.
A trigger unit 905 for enabling the electronic device 900 to perform S501 in the above-described method embodiments and/or other processes for the techniques described herein.
An initiating unit 906 for enabling the electronic device 900 to perform the operations of initiating the voice assistant in the above-described method embodiments and/or other processes for the techniques described herein.
Further, the electronic device 900 may further include a deletion unit. The deletion unit may be used to support the electronic device to perform S304 in the above-described method embodiments and/or other processes for the techniques described herein.
All relevant contents of each step related to the above method embodiment may be referred to the functional description of the corresponding functional module, and are not described herein again.
Of course, the electronic device 900 includes, but is not limited to, the above listed unit modules, for example, the electronic device 900 may further include a receiving unit for receiving data or signals transmitted by other devices, a display unit for displaying content, and the like. Moreover, the functions that can be specifically realized by the above functional units also include, but are not limited to, the functions corresponding to the method steps described in the above examples, and for the detailed description of other units of the electronic device 900, reference may be made to the detailed description of the corresponding method steps, which is not described herein again in this embodiment of the present application.
In other embodiments, where an integrated unit is employed, the electronic device may include: the device comprises a processing module, a storage module and a display module. The processing module is used for controlling and managing the action of the electronic equipment. And the display module is used for displaying the content according to the instruction of the processing module. And the storage module is used for storing the program codes and the data of the electronic equipment. In some embodiments, the storage module may be further configured to store a text and/or corresponding voiceprint feature information corresponding to the first wake-up word, a text and/or corresponding voiceprint feature information corresponding to the second wake-up word, and the like in the foregoing embodiments. Furthermore, the electronic device may further include an input module and a communication module, where the communication module is used to support the electronic device to communicate with other network entities, so as to implement functions of communication, data interaction, Internet access, and the like of the electronic device.
The processing module may be a processor or a controller. The communication module may be a transceiver, an RF circuit or a communication interface, etc. The storage module may be a memory. The display module may be a screen or a display. The input module may be a touch screen, a voice input device, or a fingerprint sensor, etc.
When the processing module is a processor, the communication module is a circuit, the storage module is a memory, and the display module is a touch screen, the electronic device provided in this embodiment may be the electronic device shown in fig. 1. The communication module not only can comprise an RF circuit, but also can comprise a Wi-Fi module, an NFC module and a Bluetooth module. Communication modules such as RF circuitry, NFC modules, Wi-Fi modules, and bluetooth modules may be collectively referred to as a communication interface. Wherein the processor, RF circuitry, touch screen and memory may be coupled together by a bus.
As shown in fig. 10, other embodiments of the present application further provide an electronic device 1000, where the electronic device 1000 may include: a display 1001; one or more processors 1002; a memory 1003; and one or more computer program codes 1004, which may be coupled via one or more communication buses 1005. Wherein the one or more computer program codes 1004 are stored in the memory 1003 and configured to be executed by the one or more processors 1002. At least two first awakening words are arranged in the electronic device 1000, each of the at least two first awakening words corresponds to one first instruction, and the functions executed by the electronic device 1000 in response to the first instructions corresponding to different first awakening words are different. The one or more computer program codes 1004 include computer instructions which, in some embodiments of the present application, may be used to perform various steps as performed by the electronic device of fig. 3 or fig. 5 and the corresponding embodiments. Of course, the electronic device 1000 includes, but is not limited to, the above listed components, for example, the electronic device 1000 may further include a radio frequency circuit, a positioning device, a sensor, and the like, and when the electronic device 1000 includes other components, the electronic device 1000 may be the electronic device shown in fig. 1. The processor 1002 may include an AP 1006 and a first DSP 1007. Further, the processor 1002 may also include a second DSP 1008.
Still other embodiments of the present application provide a computer storage medium, where the computer storage medium includes computer instructions, and when the computer instructions are executed on an electronic device, the electronic device is caused to execute relevant method steps, such as S301, S302, S303, S304, and S501, in any one of fig. 3 or fig. 5, to implement the method for triggering the electronic device to execute a function in the above embodiments.
Still other embodiments of the present application provide a computer program product containing instructions, which when run on a computer, causes the computer to execute the relevant method steps, such as S301, S302, S303, S304, S501, in any one of fig. 3 or fig. 5, to implement the method for triggering an electronic device to execute a function in the above embodiments.
Still further embodiments of the present application provide a control device comprising a processor and a memory, the memory being configured to store computer program code, the computer program code comprising computer instructions, which when executed by the processor, cause the control device to perform the relevant method steps as in any of fig. 3 or fig. 5, e.g. S301, S302, S303, S304, S501, implementing the method of triggering an electronic device to perform a function in the above embodiments. The control device may be an integrated circuit IC or may be a system on chip SOC. The integrated circuit can be a general integrated circuit, a Field Programmable Gate Array (FPGA) or an Application Specific Integrated Circuit (ASIC).
Other embodiments of the present application further provide an apparatus for triggering an electronic device to execute a function, where the apparatus has a function of implementing a behavior of the electronic device in practice according to the method. The functions can be realized by hardware, and the functions can also be realized by executing corresponding software by hardware. The hardware or software includes one or more modules corresponding to the above-described functions.
In addition, the electronic device, the computer storage medium, the computer program product, or the control device provided in the embodiments of the present application are all configured to execute the corresponding method provided above, so that the beneficial effects achieved by the electronic device, the computer storage medium, the computer program product, or the control device can refer to the beneficial effects in the corresponding method provided above, and are not described herein again.
Through the above description of the embodiments, it is clear to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely used as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device may be divided into different functional modules to complete all or part of the above described functions. For the specific working processes of the system, the apparatus and the unit described above, reference may be made to the corresponding processes in the foregoing method embodiments, and details are not described here again.
In the several embodiments provided in this embodiment, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the modules or units is only one logical division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, each functional unit in the embodiments of the present embodiment may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present embodiment essentially or partially contributes to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium and including instructions for causing a computer device (which may be a personal computer, a server, or a network device) or a processor to execute all or part of the steps of the method described in the embodiments. And the aforementioned storage medium includes: flash memory, removable hard drive, read only memory, random access memory, magnetic or optical disk, and the like.
The above descriptions are only specific embodiments of the present embodiment, but the scope of the present embodiment is not limited thereto, and any changes or substitutions within the technical scope of the present embodiment should be covered by the scope of the present embodiment. Therefore, the protection scope of the present embodiment shall be subject to the protection scope of the claims.

Claims (21)

1. A method for triggering electronic equipment to execute functions is characterized in that at least two first awakening words are arranged in the electronic equipment, each of the at least two first awakening words corresponds to a first instruction, the electronic equipment stores the corresponding relation of the first instructions corresponding to the at least two first awakening words in advance, and the functions executed by the electronic equipment in response to the first instructions corresponding to different first awakening words are different; the electronic device comprises a main processor, wherein the main processor is in a dormant state; the method comprises the following steps:
the electronic equipment enters a preset mode; the first instructions corresponding to the at least two first awakening words are used for executing the functions provided by the preset mode;
the electronic equipment receives first voice data input by a user;
the electronic equipment judges whether an awakening word with text matched with the text corresponding to the first voice data exists in the at least two first awakening words or not;
if the wake-up word with the text matched with the text corresponding to the first voice data exists in the at least two first wake-up words, the electronic equipment wakes up the main processor from a dormant state, determines a first instruction corresponding to the first voice data, and executes a function corresponding to the first instruction through the main processor.
2. The method of claim 1, further comprising, prior to the electronic device waking the primary processor from a sleep state:
the electronic equipment determines that the voiceprint features of the first voice data are matched with the voiceprint features corresponding to the at least two first awakening words.
3. The method according to claim 2, wherein the electronic device wakes up the main processor from a sleep state, determines a first instruction corresponding to the first voice data, and executes a function corresponding to the first instruction by the main processor specifically:
the electronic equipment wakes up the main processor from a sleep state, starts a voice assistant through the main processor, determines a first instruction corresponding to the first voice data through the voice assistant, and executes a function corresponding to the first instruction through the main processor.
4. The method of any of claims 1-3, wherein the electronic device further comprises a first co-processor;
the method for receiving the first voice data input by the user by the electronic equipment specifically comprises the following steps:
the electronic device monitors the first voice data input by a user by using the first coprocessor;
The electronic equipment judges whether an awakening word with text matched with the text corresponding to the first voice data exists in the at least two first awakening words or not; if a wakeup word with a text matching the text corresponding to the first voice data exists in the at least two first wakeup words, the specifically waking up the main processor from the sleep state by the electronic device is as follows:
the electronic equipment judges whether a wake-up word of which the text is matched with the text corresponding to the first voice data exists in the at least two first wake-up words by using the first coprocessor; if so, the first coprocessor wakes up the main processor from a sleep state.
5. The method of any of claims 1-3, wherein the electronic device further comprises a first co-processor;
the method for receiving the first voice data input by the user by the electronic device specifically includes:
the electronic device monitors the first voice data input by a user by using the first coprocessor;
the electronic equipment judges whether an awakening word with text matched with the text corresponding to the first voice data exists in the at least two first awakening words or not; if the wake-up word whose text matches the text corresponding to the first voice data exists in the at least two first wake-up words, the waking up the main processor from the sleep state by the electronic device specifically includes:
The electronic equipment judges whether a wake-up word with a matching degree of a text corresponding to the first voice data and the text in the at least two first wake-up words meeting first precision exists by using the first coprocessor; if the matching degree of the text corresponding to the first voice data and the text in the at least two first awakening words meets the awakening words with the first precision, the first coprocessor awakens the main processor from a dormant state;
the determining of the first instruction corresponding to the first voice data and the executing of the function corresponding to the first instruction by the main processor specifically include:
the electronic equipment judges whether a wake-up word with a text matching degree meeting second precision exists in the at least two first wake-up words by using the main processor; if the matching degree of the text corresponding to the first voice data and the text in the at least two first awakening words meets the awakening words with the second precision, determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction through the main processor; the first precision is less than the second precision.
6. The method of claim 1, wherein the electronic device further comprises a second coprocessor;
before the electronic device enters the predetermined mode, the method further comprises:
the electronic device monitors voice data using the second coprocessor.
7. The method according to claim 1, wherein a second wake-up word is further provided in the electronic device;
the entering of the electronic device into the predetermined mode specifically includes:
the electronic equipment receives second voice data input by a user;
the electronic equipment judges whether the second voice data is matched with the second awakening word;
if the second voice data is matched with the second awakening word, the electronic equipment awakens the main processor from a dormant state, and a voice assistant is started through the main processor;
the electronic equipment receives third voice data input by a user through the voice assistant, determines a second instruction corresponding to the third voice data, and executes a function corresponding to the second instruction through the main processor, wherein the second instruction is used for indicating the electronic equipment to enter the preset mode.
8. The method according to claim 7, wherein the determining, by the electronic device, whether the second speech data matches the second wake-up word specifically includes:
The electronic equipment judges whether the text corresponding to the second voice data is matched with the text of the second awakening word, and if the text corresponding to the second voice data is matched with the text of the second awakening word, the second voice data is matched with the second awakening word.
9. The method according to claim 7, wherein the determining, by the electronic device, whether the second speech data matches the second wake-up word specifically includes:
the electronic equipment judges whether the text corresponding to the second voice data is matched with the text of the second awakening word or not, and judges whether the voiceprint characteristics of the second voice data are matched with the voiceprint characteristics corresponding to the second awakening word or not;
and if the text corresponding to the second voice data is matched with the text of the second awakening word and the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word, matching the second voice data with the second awakening word.
10. The method of claim 7, wherein the electronic device further comprises a second coprocessor;
the receiving, by the electronic device, the second voice data input by the user specifically includes:
The electronic device monitors the second voice data input by the user by using the second coprocessor;
the electronic equipment judges whether the second voice data is matched with the second awakening word or not; if the second voice data is matched with the second awakening word, the electronic device awakens the main processor from a dormant state, and starting a voice assistant through the main processor specifically comprises:
the electronic equipment judges whether the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets third precision or not by using the second coprocessor; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the third precision, the second coprocessor awakens the main processor from a dormant state; the electronic equipment judges whether the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets fourth precision or not by using the main processor; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the fourth precision, the electronic equipment starts the voice assistant through the main processor; the third precision is less than the fourth precision.
11. An electronic device, characterized in that the electronic device comprises: a processor, a memory, and a display; the memory, the display and the processor are coupled; the display is used for displaying the image generated by the processor; the memory for storing computer program code; the processor comprises a primary processor, wherein the primary processor is in a dormant state; the electronic equipment is provided with at least two first awakening words, each of the at least two first awakening words corresponds to one first instruction, the electronic equipment pre-stores the corresponding relation of the first instructions corresponding to the at least two first awakening words, and the electronic equipment responds to the first instructions corresponding to different first awakening words to execute different functions; the computer program code comprises computer instructions which, when executed by the processor,
the processor is further configured to trigger the electronic device to enter a predetermined mode, where first instructions corresponding to the at least two first wake-up words are used to execute functions provided by the predetermined mode;
the processor is used for receiving first voice data input by a user; judging whether an awakening word with a text matched with the text corresponding to the first voice data exists in the at least two first awakening words; if the wake-up word with the text matched with the text corresponding to the first voice data exists in the at least two first wake-up words, waking up the main processor from a dormant state, determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction through the main processor.
12. The electronic device of claim 11, wherein the processor is further configured to determine that a voiceprint feature of the first voice data matches a voiceprint feature corresponding to the at least two first wake words.
13. The electronic device according to claim 12, wherein the processor is configured to wake up the main processor from a sleep state, determine a first instruction corresponding to the first voice data, and execute a function corresponding to the first instruction through the main processor, specifically:
the processor is configured to wake up the main processor from a sleep state, start a voice assistant through the main processor, determine a first instruction corresponding to the first voice data through the voice assistant, and execute a function corresponding to the first instruction through the main processor.
14. The electronic device of any of claims 11-13, wherein the processor further comprises a first co-processor;
the processor is configured to receive first voice data input by a user, specifically:
the first coprocessor is used for monitoring the first voice data input by a user;
the processor is configured to determine whether an awakening word whose text matches the text corresponding to the first voice data exists in the at least two first awakening words; if the wake-up word whose text matches the text corresponding to the first voice data exists in the at least two first wake-up words, waking up the main processor from a sleep state specifically as follows:
The first coprocessor is used for judging whether an awakening word with a text matched with a text corresponding to the first voice data exists in the at least two first awakening words; and if so, waking up the main processor from the sleep state.
15. The electronic device of any of claims 11-13, wherein the processor further comprises a first co-processor;
the processor is configured to receive first voice data input by a user, specifically:
the first coprocessor is used for monitoring the first voice data input by a user;
the processor is configured to determine whether an awakening word whose text matches the text corresponding to the first voice data exists in the at least two first awakening words; if the wake-up word whose text matches the text corresponding to the first voice data exists in the at least two first wake-up words, waking up the main processor from a sleep state specifically as follows:
the first coprocessor is used for judging whether a wake-up word with a matching degree of a text corresponding to the first voice data and the text corresponding to the first voice data in the at least two first wake-up words meets a first precision; if the matching degree of the text corresponding to the first voice data and the text in the at least two first awakening words meets the awakening words with the first precision, awakening the main processor from a dormant state;
The processor is configured to determine a first instruction corresponding to the first voice data, and execute, by the main processor, a function corresponding to the first instruction specifically:
the main processor is used for judging whether a wake-up word with a text matching degree meeting second precision exists in the at least two first wake-up words or not, wherein the text matching degree of the wake-up word corresponds to the first voice data; if the matching degree of the text corresponding to the first voice data and the wake-up word with the second precision exists in the at least two first wake-up words, determining a first instruction corresponding to the first voice data, and executing a function corresponding to the first instruction; the first precision is less than the second precision.
16. The electronic device of claim 11, wherein the processor further comprises a second coprocessor;
the second co-processor is configured to monitor voice data before the electronic device enters the predetermined mode.
17. The electronic device according to claim 11, wherein a second wake-up word is further provided in the electronic device;
the processor is further configured to trigger the electronic device to enter a predetermined mode specifically as follows:
The processor is further used for receiving second voice data input by a user; judging whether the second voice data is matched with the second awakening word or not; if the second voice data is matched with the second awakening word, awakening the main processor from a dormant state, and starting a voice assistant through the main processor; and receiving third voice data input by a user through the voice assistant, determining a second instruction corresponding to the third voice data, and executing a function corresponding to the second instruction through the main processor, wherein the second instruction is used for indicating the electronic equipment to enter the preset mode.
18. The electronic device of claim 17, wherein the processor, configured to determine whether the second voice data matches the second wakeup word specifically is:
the processor is configured to determine whether the text corresponding to the second voice data matches the text of the second wakeup word, and if the text corresponding to the second voice data matches the text of the second wakeup word, match the second voice data with the second wakeup word.
19. The electronic device of claim 17, wherein the processor, configured to determine whether the second voice data matches the second wakeup word specifically is:
The processor is configured to determine whether the text corresponding to the second voice data matches the text of the second wakeup word, and determine whether the voiceprint feature of the second voice data matches the voiceprint feature corresponding to the second wakeup word; and if the text corresponding to the second voice data is matched with the text of the second awakening word and the voiceprint feature of the second voice data is matched with the voiceprint feature corresponding to the second awakening word, matching the second voice data with the second awakening word.
20. The electronic device of claim 17, wherein the processor further comprises a second co-processor;
the processor is further configured to receive second voice data input by a user, specifically:
the second coprocessor is used for monitoring the second voice data input by the user;
the processor is further configured to determine whether the second voice data matches the second wake-up word; if the second voice data is matched with the second awakening word, awakening the main processor from a dormant state, and starting a voice assistant through the main processor specifically comprises the following steps:
the second coprocessor is further configured to determine whether a matching degree between the text of the second wake-up word and the text corresponding to the second voice data meets a third precision; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the third precision, awakening the main processor from a dormant state; the main processor is further configured to determine whether a matching degree between the text of the second wakeup word and the text corresponding to the second voice data meets a fourth precision; if the matching degree of the text of the second awakening word and the text corresponding to the second voice data meets the fourth precision, starting the voice assistant; the third precision is less than the fourth precision.
21. A computer storage medium comprising computer instructions which, when run on an electronic device, cause the electronic device to perform the method of triggering the electronic device to perform a function of any of claims 1-10.
CN201880090844.6A 2018-10-11 2018-10-11 Method for triggering electronic equipment to execute function and electronic equipment Active CN111819533B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/109888 WO2020073288A1 (en) 2018-10-11 2018-10-11 Method for triggering electronic device to execute function and electronic device

Publications (2)

Publication Number Publication Date
CN111819533A CN111819533A (en) 2020-10-23
CN111819533B true CN111819533B (en) 2022-06-14

Family

ID=70164164

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880090844.6A Active CN111819533B (en) 2018-10-11 2018-10-11 Method for triggering electronic equipment to execute function and electronic equipment

Country Status (2)

Country Link
CN (1) CN111819533B (en)
WO (1) WO2020073288A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116456441B (en) * 2023-06-16 2023-10-31 荣耀终端有限公司 Sound processing device, sound processing method and electronic equipment

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114125143B (en) * 2020-08-31 2023-04-07 华为技术有限公司 Voice interaction method and electronic equipment
CN114258116A (en) * 2020-09-25 2022-03-29 华为技术有限公司 Power consumption optimization method and device
CN114697438B (en) * 2020-12-29 2023-06-27 华为技术有限公司 Method, device, equipment and storage medium for carrying out call by utilizing intelligent equipment
CN113066490B (en) * 2021-03-16 2022-10-14 海信视像科技股份有限公司 Prompting method of awakening response and display equipment
CN117334182A (en) * 2022-06-25 2024-01-02 华为技术有限公司 Voice interaction method and related device
CN115376524B (en) * 2022-07-15 2023-08-04 荣耀终端有限公司 Voice awakening method, electronic equipment and chip system
CN117690423A (en) * 2022-09-05 2024-03-12 华为技术有限公司 Man-machine interaction method and related device
CN116069818B (en) * 2023-01-05 2023-09-12 广州市华势信息科技有限公司 Application processing method and system based on zero code development

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102929390A (en) * 2012-10-16 2013-02-13 广东欧珀移动通信有限公司 Method and device for starting application program in stand-by state
CN102999161A (en) * 2012-11-13 2013-03-27 安徽科大讯飞信息科技股份有限公司 Implementation method and application of voice awakening module
CN103634480A (en) * 2013-12-17 2014-03-12 百度在线网络技术(北京)有限公司 Method and device for communication in mobile communication terminal
CN104866274A (en) * 2014-12-01 2015-08-26 联想(北京)有限公司 Information processing method and electronic equipment
CN105323386A (en) * 2015-12-03 2016-02-10 上海卓易科技股份有限公司 Method and system for automatically switching contextual model of mobile phone
CN107277904A (en) * 2017-07-03 2017-10-20 上海斐讯数据通信技术有限公司 A kind of terminal and voice awakening method

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103095911B (en) * 2012-12-18 2014-12-17 苏州思必驰信息科技有限公司 Method and system for finding mobile phone through voice awakening
CN103197571A (en) * 2013-03-15 2013-07-10 张春鹏 Control method, device and system
TWI557599B (en) * 2015-11-05 2016-11-11 宏碁股份有限公司 Voice control method and voice control system
CN107315561A (en) * 2017-06-30 2017-11-03 联想(北京)有限公司 A kind of data processing method and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102929390A (en) * 2012-10-16 2013-02-13 广东欧珀移动通信有限公司 Method and device for starting application program in stand-by state
CN102999161A (en) * 2012-11-13 2013-03-27 安徽科大讯飞信息科技股份有限公司 Implementation method and application of voice awakening module
CN103634480A (en) * 2013-12-17 2014-03-12 百度在线网络技术(北京)有限公司 Method and device for communication in mobile communication terminal
CN104866274A (en) * 2014-12-01 2015-08-26 联想(北京)有限公司 Information processing method and electronic equipment
CN105323386A (en) * 2015-12-03 2016-02-10 上海卓易科技股份有限公司 Method and system for automatically switching contextual model of mobile phone
CN107277904A (en) * 2017-07-03 2017-10-20 上海斐讯数据通信技术有限公司 A kind of terminal and voice awakening method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116456441B (en) * 2023-06-16 2023-10-31 荣耀终端有限公司 Sound processing device, sound processing method and electronic equipment

Also Published As

Publication number Publication date
CN111819533A (en) 2020-10-23
WO2020073288A1 (en) 2020-04-16

Similar Documents

Publication Publication Date Title
CN109584879B (en) Voice control method and electronic equipment
CN111819533B (en) Method for triggering electronic equipment to execute function and electronic equipment
WO2021052263A1 (en) Voice assistant display method and device
CN110134316B (en) Model training method, emotion recognition method, and related device and equipment
CN110138959B (en) Method for displaying prompt of human-computer interaction instruction and electronic equipment
CN110910872B (en) Voice interaction method and device
CN113704014B (en) Log acquisition system, method, electronic device and storage medium
CN111316199B (en) Information processing method and electronic equipment
CN111602108B (en) Application icon display method and terminal
CN111742361A (en) Method for updating voice wake-up of voice assistant by terminal and terminal
WO2021052139A1 (en) Gesture input method and electronic device
CN114095599B (en) Message display method and electronic equipment
CN115589051B (en) Charging method and terminal equipment
CN112150778A (en) Environmental sound processing method and related device
CN113641271A (en) Application window management method, terminal device and computer readable storage medium
CN110058729B (en) Method and electronic device for adjusting sensitivity of touch detection
CN111819830A (en) Information recording and displaying method and terminal in communication process
CN113380240B (en) Voice interaction method and electronic equipment
CN115206308A (en) Man-machine interaction method and electronic equipment
CN113918003A (en) Method and device for detecting time length of skin contacting screen and electronic equipment
CN114006976B (en) Interface display method and terminal equipment
CN117271170B (en) Activity event processing method and related equipment
CN118131891A (en) Man-machine interaction method and device
CN115513571A (en) Control method of battery temperature and terminal equipment
CN114237453A (en) Method for controlling equipment, electronic equipment and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant