CN112489650B - Wakeup control method, wakeup control device, storage medium and terminal - Google Patents


Info

Publication number
CN112489650B
Authority
CN
China
Prior art keywords
terminal
wake
audio data
historical
historical audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011349897.0A
Other languages
Chinese (zh)
Other versions
CN112489650A (en)
Inventor
程思
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Pinecone Electronic Co Ltd
Original Assignee
Beijing Xiaomi Pinecone Electronic Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xiaomi Pinecone Electronic Co Ltd
Priority to CN202011349897.0A
Publication of CN112489650A
Application granted
Publication of CN112489650B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electric Clocks (AREA)
  • Telephone Function (AREA)

Abstract

The disclosure relates to a wake-up control method, a device, a storage medium and a terminal, wherein the method is applied to a first terminal and comprises the following steps: collecting audio data in real time; under the condition that the first audio data collected in the target time period meets the preset wake-up condition, acquiring the first historical audio data in the stored historical time period, wherein the historical time period comprises a preset time period before the target time period; acquiring a first voiceprint feature corresponding to the first historical audio data, and transmitting the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data stored in the historical time period after receiving the first voiceprint feature, and transmits wake-up response information to the first terminal according to the second historical audio data; receiving the wake-up response information sent by the second terminal; determining a terminal to be awakened from the first terminal and the second terminal according to the awakening response information; and controlling the terminal to be awakened.

Description

Wakeup control method, wakeup control device, storage medium and terminal
Technical Field
The disclosure relates to the technical field of terminals, and in particular relates to a wake-up control method, a wake-up control device, a storage medium and a terminal.
Background
Along with the development of science and technology, intelligent voice products are becoming more and more popular, and common intelligent voice equipment comprises an intelligent sound box, a child accompanying robot, an intelligent alarm clock, intelligent household appliances and the like. In general, wake-up words preset in different intelligent devices may be the same, so, because a plurality of intelligent voice devices may exist in the same space at the same time, when a user speaks the same wake-up word, a plurality of intelligent devices may respond at the same time, which easily causes trouble to the user and seriously affects the interactive experience of the user.
In the related art, to prevent a plurality of intelligent voice devices from responding to the user's wake-up at the same time, a nearby wake-up judging method can select, from the plurality of intelligent voice devices, the device closest to the user to respond to the wake-up. However, when only some of the intelligent voice devices are woken up by the wake-up word spoken by the user, a woken device can respond to the user only after a timeout decision timer expires, so the wake-up response is slow and the user experience is affected.
Disclosure of Invention
In order to overcome the problems in the related art, the present disclosure provides a wake-up control method, device, storage medium and terminal.
According to a first aspect of embodiments of the present disclosure, there is provided a wake-up control method, applied to a first terminal, the method including: collecting audio data in real time; under the condition that first audio data acquired in a target time period meet a preset wake-up condition, acquiring first historical audio data in a stored historical time period, wherein the historical time period comprises a preset time period before the target time period; acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data stored in the historical time period after receiving the first voiceprint feature, and sends wake-up response information to the first terminal according to the second historical audio data; receiving the wake-up response information sent by the second terminal; determining a terminal to be awakened from the first terminal and the second terminal according to the awakening response information; and controlling the terminal to be awakened.
Optionally, the preset wake-up condition includes: the first audio data comprises a preset wake-up word; or the first audio data comprises target keywords in the preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer greater than 0 and smaller than the target number, and the target number is the number of keywords in the preset wake-up words.
Optionally, the wake-up response information includes wake-up rejection information or wake-up characteristics of the second historical audio data; and determining the terminal to be awakened from the first terminal and the second terminal according to the awakening response information, wherein the determining the terminal to be awakened comprises: determining a target terminal nearest to a user from the first terminal and the second terminal according to the wake-up characteristics of the first audio data and the wake-up characteristics of the second historical audio data under the condition that the wake-up response information comprises the wake-up characteristics of the second historical audio data, and taking the target terminal as the terminal to be waken, wherein the wake-up characteristics are used for representing the distance between the terminal and the user; or if the wake-up response information comprises the wake-up rejection information, the first terminal is used as the terminal to be waken up.
Optionally, the controlling the terminal to be awakened includes: controlling the first terminal to wake up under the condition that the terminal to be wake up is the first terminal; or under the condition that the terminal to be awakened is the second terminal, sending an awakening instruction to the second terminal so as to enable the second terminal to be awakened according to the awakening instruction.
According to a second aspect of embodiments of the present disclosure, there is provided a wake-up control method applied to a second terminal, the method including: collecting audio data in real time; under the condition that a first voiceprint feature corresponding to first historical audio data acquired in a historical time period sent by a first terminal is received, acquiring second historical audio data stored in the historical time period; and sending wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be waken according to the wake-up response information, and controls the terminal to be waken.
Optionally, the wake-up response information includes wake-up rejection information or wake-up characteristics of the second historical audio data, and before the sending of the wake-up response information to the first terminal according to the second historical audio data, the method further includes: determining whether to wake up the second terminal according to the second historical audio data; and sending wake-up response information to the first terminal according to the second historical audio data comprises the following steps: sending the wake-up characteristics of the second historical audio data to the first terminal under the condition that it is determined to wake up the second terminal; or sending the wake-up rejection information to the first terminal under the condition that it is determined not to wake up the second terminal.
Optionally, before determining whether to wake up the second terminal according to the second historical audio data, the method further includes: determining whether the second historical audio data includes voice data; and determining whether to wake up the second terminal according to the second historical audio data comprises: acquiring a second voiceprint feature corresponding to the second historical audio data under the condition that the second historical audio data comprises the voice data, and determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or determining not to wake up the second terminal if the second historical audio data does not include the voice data.
Optionally, the determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature includes: determining whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature; and determining to wake up the second terminal under the condition that the first historical audio data and the second historical audio data are determined to belong to the same user, or determining not to wake up the second terminal under the condition that the first historical audio data and the second historical audio data are determined not to belong to the same user.
Optionally, the method further comprises: receiving a wake-up instruction sent by the first terminal under the condition that the terminal to be wake-up is the second terminal; and waking up the second terminal according to the wake-up instruction.
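The decision made by the second terminal in the optional steps above can be sketched as follows. This is only an illustrative sketch: the disclosure does not specify how voiceprints are compared, so the cosine similarity, the 0.8 threshold, and the names WakeResponse and build_wake_response are assumptions introduced for this example.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class WakeResponse:
    """Hypothetical container for the wake-up response information."""
    rejected: bool                        # True means wake-up rejection information
    wake_feature: Optional[float] = None  # wake-up characteristic, set only when not rejected


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_wake_response(
    first_voiceprint: Sequence[float],
    second_voiceprint: Optional[Sequence[float]],
    wake_feature: float,
    threshold: float = 0.8,  # assumed value, not taken from the disclosure
) -> WakeResponse:
    # None stands for "the second historical audio data contains no voice data".
    if second_voiceprint is None:
        return WakeResponse(rejected=True)
    # Treat the two recordings as the same user only if the voiceprints are close enough.
    if cosine_similarity(first_voiceprint, second_voiceprint) < threshold:
        return WakeResponse(rejected=True)
    return WakeResponse(rejected=False, wake_feature=wake_feature)
```

In this sketch a rejection is returned immediately in both negative cases, which is what lets the first terminal decide without waiting for a timer.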
According to a third aspect of embodiments of the present disclosure, there is provided a wake-up control device applied to a first terminal, the device including: the first data collection module is configured to collect audio data in real time; the first data acquisition module is configured to acquire first historical audio data in a stored historical time period when the first audio data collected in a target time period meets a preset wake-up condition, wherein the historical time period comprises a preset time period before the target time period; the voiceprint feature acquisition module is configured to acquire a first voiceprint feature corresponding to the first historical audio data, and send the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data in the stored historical time period after receiving the first voiceprint feature, and sends wake-up response information to the first terminal according to the second historical audio data; the information receiving module is configured to receive the wake-up response information sent by the second terminal; the terminal determining module is configured to determine a terminal to be awakened from the first terminal and the second terminal according to the wake-up response information; the first wake-up module is configured to control the terminal to be awakened.
Optionally, the preset wake-up condition includes: the first audio data comprises a preset wake-up word; or the first audio data comprises target keywords in the preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer greater than 0 and smaller than the target number, and the target number is the number of keywords in the preset wake-up words.
Optionally, the wake-up response information includes wake-up rejection information or wake-up characteristics of the second historical audio data; the terminal determining module includes: a terminal determining submodule, configured to, in the case that the wake-up response information comprises the wake-up characteristics of the second historical audio data, determine a target terminal nearest to a user from the first terminal and the second terminal according to the wake-up characteristics of the first audio data and the wake-up characteristics of the second historical audio data, and take the target terminal as the terminal to be awakened, wherein the wake-up characteristics are used for representing the distance between the terminal and the user; or, in the case that the wake-up response information comprises the wake-up rejection information, take the first terminal as the terminal to be awakened.
Optionally, the first wake-up module includes: the wake-up sub-module is configured to control the first terminal to wake up under the condition that the terminal to be wake-up is the first terminal; or under the condition that the terminal to be awakened is the second terminal, sending an awakening instruction to the second terminal so as to enable the second terminal to be awakened according to the awakening instruction.
According to a fourth aspect of embodiments of the present disclosure, there is provided a wake-up control device applied to a second terminal, the device including: the second data collection module is configured to collect audio data in real time; the second data acquisition module is configured to acquire second historical audio data in a stored historical time period under the condition that a first voiceprint feature corresponding to first historical audio data collected in the historical time period sent by the first terminal is received; and the information sending module is configured to send wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be awakened according to the wake-up response information and controls the terminal to be awakened.
Optionally, the wake-up response information includes wake-up rejection information or wake-up characteristics of the second historical audio data, the apparatus further comprising: a wake-up determining module configured to determine whether to wake up the second terminal according to the second historical audio data; the information sending module is configured to: send the wake-up characteristics of the second historical audio data to the first terminal under the condition that it is determined to wake up the second terminal; or send the wake-up rejection information to the first terminal under the condition that it is determined not to wake up the second terminal.
Optionally, the apparatus further comprises: a data determination module configured to determine whether the second historical audio data includes voice data; the wake-up determination module is configured to: acquiring a second voiceprint feature corresponding to the second historical audio data under the condition that the second historical audio data comprises the voice data, and determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or determining not to wake up the second terminal if the second historical audio data does not include the voice data.
Optionally, the wake-up determining module includes: a user determination sub-module configured to determine, according to the first voiceprint feature and the second voiceprint feature, whether the first historical audio data and the second historical audio data belong to the same user; and the wake-up determination submodule is configured to determine to wake up the second terminal when the first historical audio data and the second historical audio data are determined to belong to the same user, or determine not to wake up the second terminal when the first historical audio data and the second historical audio data are determined to not belong to the same user.
Optionally, the apparatus further comprises: the instruction receiving module is configured to receive a wake-up instruction sent by the first terminal under the condition that the terminal to be wake-up is the second terminal; and the second wake-up module is configured to wake up the second terminal according to the wake-up instruction.
According to a fifth aspect of embodiments of the present disclosure, there is provided a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the steps of the wake-up control method provided in the first aspect of the present disclosure, or which, when executed by a processor, implement the steps of the wake-up control method provided in the second aspect of the present disclosure.
According to a sixth aspect of embodiments of the present disclosure, there is provided a terminal, comprising: a memory having a computer program stored thereon; and a processor, configured to execute the computer program in the memory, to implement the steps of the wake-up control method provided in the first aspect of the disclosure, or to implement the steps of the wake-up control method provided in the second aspect of the disclosure.
The technical scheme provided by the embodiment of the disclosure can comprise the following beneficial effects: collecting audio data in real time; under the condition that first audio data acquired in a target time period meet a preset wake-up condition, acquiring first historical audio data in a stored historical time period, wherein the historical time period comprises a preset time period before the target time period; acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data stored in the historical time period after receiving the first voiceprint feature, and sends wake-up response information to the first terminal according to the second historical audio data; receiving the wake-up response information sent by the second terminal; determining a terminal to be awakened from the first terminal and the second terminal according to the awakening response information; and controlling the terminal to be awakened. That is, after the first terminal meets the preset wake-up condition, the first terminal may send the first voiceprint feature to the second terminal, and after the second terminal receives the first voiceprint feature, the second terminal may send wake-up response information to the first terminal according to the second historical audio data, so that the first terminal may determine the terminal to be woken up after receiving the wake-up response information sent by the second terminal without waiting for the timeout of the timeout decision timer, and control the terminal to be woken up, thereby improving the speed of the wake-up response and improving the user experience.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a flow chart illustrating a wake-up control method according to an exemplary embodiment;
FIG. 2 is a flow chart illustrating a second wake-up control method according to an exemplary embodiment;
FIG. 3 is a flow chart illustrating a third wake-up control method according to an exemplary embodiment;
FIG. 4 is a flowchart illustrating a fourth wake-up control method according to an example embodiment;
FIG. 5 is a schematic diagram of a wake-up control device, according to an exemplary embodiment;
FIG. 6 is a schematic diagram illustrating the structure of a second wake-up control device, according to an exemplary embodiment;
FIG. 7 is a schematic diagram of a third wake-up control device according to an exemplary embodiment;
FIG. 8 is a schematic diagram of a fourth wake-up control device, according to an example embodiment;
FIG. 9 is a schematic structural view of a fifth wake-up control device according to an exemplary embodiment;
FIG. 10 is a block diagram of a terminal according to an exemplary embodiment.
Detailed Description
Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numbers in different drawings refer to the same or similar elements, unless otherwise indicated. The implementations described in the following exemplary examples are not representative of all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure as detailed in the accompanying claims.
In the following description, the words "first," "second," and the like are used merely for distinguishing between the descriptions and not for indicating or implying a relative importance or order.
First, an application scenario of the present disclosure will be described. Typically, the wake-up word is preset in the smart voice device by the user, or the wake-up word is set before the smart voice device leaves the factory, so the preset wake-up words in different smart voice devices may be the same. With the popularization of intelligent voice products, a plurality of intelligent voice devices may exist in the same space at the same time, so that after a user speaks the wake-up word, the plurality of intelligent voice devices in the same space may all respond and interact with the user at the same time, which easily troubles the user and affects the user's interactive experience.
In the related art, a multi-device linkage system can be established, namely, a plurality of intelligent voice devices in the same space form a linkage system through the same account. After a user speaks a wake-up word, each intelligent voice device in the linkage system can determine whether to wake up according to the wake-up word, and after determining to wake up, can start a timeout decision timer and send its wake-up result to the other intelligent voice devices. If wake-up results sent by other intelligent voice devices are received before the timeout decision timer expires, the intelligent voice device nearest to the user can be determined from the plurality of woken intelligent voice devices to respond to the user's wake-up. If no wake-up result sent by other intelligent voice devices is received before the timeout decision timer expires, that is to say, only one intelligent voice device in the linkage system has been woken up, the woken intelligent voice device has to wait until the timeout decision timer expires before it can respond to the user's wake-up. Therefore, the wake-up response speed of the whole linkage system is low, and the user experience is affected.
In order to solve the above problems, the present disclosure provides a wake-up control method, a device, a storage medium, and a terminal, where after a first terminal meets a preset wake-up condition, the first terminal may send a first voiceprint feature to a second terminal, and after receiving the first voiceprint feature, the second terminal may send wake-up response information to the first terminal according to second historical audio data collected by the second terminal in a historical time period, so that the first terminal may determine a terminal to be woken up after receiving the wake-up response information sent by the second terminal without waiting for a timeout of a timeout decision timer, and control the terminal to be woken up, thereby improving the speed of the wake-up response and improving user experience.
The first terminal and the second terminal of the present disclosure may belong to the same multi-device linkage system, and the types of the first terminal and the second terminal may be the same or different. The first terminal and the second terminal include, but are not limited to, a fixed device and a mobile device. For example, the fixed device includes, but is not limited to: personal computers (Personal Computer, PC), televisions, air conditioners, wall-hung boilers and the like; the mobile device includes, but is not limited to: cell phones, tablet computers, wearable devices, speakers, alarm clocks and the like, which is not limited in this disclosure. The multi-device linkage system may use a nearby wake-up function to determine, from a plurality of terminals, the terminal that responds to the user's wake-up. For example, both the first terminal and the second terminal may be devices in a distributed far-field device network. There may be a plurality of second terminals; the first terminal and the second terminals are in the same space and form the multi-device linkage system through the same account. In addition, in the present disclosure the wake-up response speed of the first terminal is faster than that of the second terminal, for example, the performance of the wake-up engine of the first terminal is better than the performance of the wake-up engine of the second terminal. That is, the first terminal may wake up faster after the user speaks the wake-up word.
The present disclosure is described below in connection with specific embodiments.
FIG. 1 is a flow chart illustrating a wake-up control method according to an exemplary embodiment. The method is applied to a first terminal and includes:
S101, collecting audio data in real time.
In this step, after the first terminal is started, the data acquisition module of the first terminal may acquire audio data in real time.
S102, under the condition that the first audio data collected in the target time period meets the preset wake-up condition, the first historical audio data in the stored historical time period is obtained.
The historical time period may include a preset time period before the target time period, where the preset time period may be the time required by the user to speak the wake-up word, so the durations of the historical time periods corresponding to different wake-up words may differ.
It should be noted that, in the process of collecting audio data in real time, the data collection module of the first terminal may store the first historical audio data collected in the historical time period. For example, the present disclosure may store the first historical audio data in a queue, where the audio data at the head of the queue is the audio data acquired earliest in the first historical audio data, and the audio data at the tail of the queue is the audio data acquired latest in the first historical audio data, and after each time new audio data is acquired, the audio data at the head of the queue may be deleted, and the new audio data is stored at the tail of the queue.
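The queue described above can be sketched with a fixed-length buffer. This is a minimal illustration, assuming audio arrives as discrete frames; the class name HistoryBuffer and the frame granularity are hypothetical and not part of the disclosure.

```python
from collections import deque
from typing import List


class HistoryBuffer:
    """Keeps only the most recent frames covering the historical time period (hypothetical helper)."""

    def __init__(self, max_frames: int):
        # A deque with maxlen drops the item at the head (the oldest frame)
        # whenever a new item is appended at the tail of a full queue.
        self._frames = deque(maxlen=max_frames)

    def push(self, frame: bytes) -> None:
        """Store a newly collected frame at the tail of the queue."""
        self._frames.append(frame)

    def snapshot(self) -> List[bytes]:
        """Return the stored historical audio, oldest frame first."""
        return list(self._frames)


# Usage: with room for 3 frames, pushing a 4th frame evicts the oldest one.
buf = HistoryBuffer(max_frames=3)
for frame in (b"f0", b"f1", b"f2", b"f3"):
    buf.push(frame)
history = buf.snapshot()
```

The value of max_frames would be chosen so that the buffer spans the historical time period, for example the time needed to speak the wake-up word divided by the frame length.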
In this step, after the first terminal collects the first audio data in the target time period, the first terminal may acquire a preset wake-up word, and determine, according to the preset wake-up word, whether the first audio data meets a preset wake-up condition. The preset wake-up condition may include: the first audio data comprises the preset wake-up word; or the first audio data comprises target keywords in the preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer greater than 0 and less than the target number, and the target number is the number of keywords in the preset wake-up words. Here, the value of N may be determined according to the type of the first terminal, for example, in the case where the preset wake-up word includes 5 keywords, a smaller value may be set for the first terminal having a relatively high wake-up instantaneity requirement, for example, N may be set to 2; for the first terminal with low wake-up real-time requirement, a larger value may be set, for example, N may be set to 4, which is not limited in this disclosure. In addition, considering that the smaller the value of N is, the higher the probability that the first terminal is awakened by mistake, so the value of N can be determined according to the requirement of the first terminal on the false awakening rate, for example, in the case that the preset awakening word includes 5 keywords, a larger value, for example, 4 can be set for the first terminal with higher requirement on the false awakening rate; for the first terminal with a low false wake up rate requirement, a smaller value, for example, set to 2, which is not limited in the disclosure.
If the preset wake-up condition is that the first audio data includes the preset wake-up word, the wake-up engine of the first terminal needs to detect each keyword in the preset wake-up word, and then determines that the first audio data meets the preset wake-up condition, for example, if the preset wake-up word is "ABCDE", if the first audio data includes "ABCDE", it indicates that the first audio data meets the preset wake-up condition. When the preset wake-up condition is that the first audio data includes a target keyword in the preset wake-up word, after detecting the target keyword, the wake-up engine of the first terminal can determine that the first audio data meets the preset wake-up condition, for example, if the preset wake-up word is "ABC", and the target keyword is "ABC", if the first audio data includes "ABC", it indicates that the first audio data meets the preset wake-up condition.
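The two forms of the preset wake-up condition can be sketched as below. This is an illustration only: it assumes the wake-up engine has already turned the audio into a sequence of detected keywords, and the function name meets_wake_condition and the use of n=None for the full wake-up word are conventions invented for this example.

```python
from typing import Optional, Sequence


def meets_wake_condition(
    detected: Sequence[str],
    wake_word: Sequence[str],
    n: Optional[int] = None,
) -> bool:
    """Check the preset wake-up condition on a sequence of detected keywords.

    n is None: the full wake-up word must be present.
    0 < n < len(wake_word): only the first n keywords must be present.
    """
    target_count = len(wake_word)
    if n is None:
        required = list(wake_word)
    elif 0 < n < target_count:
        required = list(wake_word[:n])
    else:
        raise ValueError("n must be greater than 0 and less than the number of keywords")
    m = len(required)
    # Look for the required keywords as a contiguous run in the detected sequence.
    return any(
        list(detected[i:i + m]) == required
        for i in range(len(detected) - m + 1)
    )
```

With the wake-up word "ABCDE" and n=3, the check already succeeds once "ABC" has been detected, which is the early trigger that lets voiceprint extraction start before the user finishes speaking.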
After determining that the first audio data meets the preset wake-up condition, the first terminal needs to acquire the first historical audio data and extract the first voiceprint feature from it, and extracting the first voiceprint feature takes a certain amount of time. When the preset wake-up condition is that the first audio data includes the target keywords of the preset wake-up word, the first terminal does not need to wait until the user has finished speaking the complete wake-up word; it can acquire the first historical audio data in advance and extract the first voiceprint feature from the first historical audio data, which shortens the wake-up response time and improves the wake-up response speed.
Further, in the case where it is determined that the first audio data satisfies the preset wake-up condition, the stored first historical audio data in the historical time period may be acquired.
S103, acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to a second terminal.
In this step, after the first historical audio data is obtained, a first voiceprint feature corresponding to the first historical audio data may be extracted by a related-art method, and the first voiceprint feature is sent to the second terminal. After receiving the first voiceprint feature sent by the first terminal, the second terminal can firstly acquire second historical audio data stored in the historical time period, and send wake-up response information to the first terminal according to the second historical audio data.
S104, receiving the wake-up response information sent by the second terminal.
S105, determining a terminal to be awakened from the first terminal and the second terminal according to the awakening response information.
The wake-up response information may include either the wake-up feature of the second historical audio data or wake-up refusal information. Where the wake-up response information includes the wake-up feature of the second historical audio data, this indicates that the second terminal may be woken up by the second audio data collected in the target time period; where the wake-up response information includes the wake-up refusal information, this indicates that the second terminal will not be woken up by the second audio data collected in the target time period.
In this step, after receiving the wake-up response information sent by the second terminal, the first terminal may determine, according to the wake-up response information, a terminal that may be woken up by the audio data collected in the target time period. If only the first terminal is awakened, the first terminal can be used as the terminal to be awakened, and if a plurality of terminals are awakened, a target terminal closest to a user can be selected from the plurality of awakened terminals, and the target terminal is used as the terminal to be awakened.
In a possible implementation manner, in a case that the wake-up response information includes the wake-up feature of the second historical audio data, the wake-up feature of the first audio data may be acquired, a target terminal closest to the user is determined from the first terminal and the second terminal according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and the target terminal is used as the terminal to be woken up, where the wake-up feature is used to represent a distance between the terminal and the user, and the wake-up feature may be, for example, audio energy of the audio data, where a larger audio energy indicates that the terminal is closer to the user and a smaller audio energy indicates that the terminal is farther from the user. Here, when the wake-up response information includes the wake-up feature of the second historical audio data, it indicates that the second terminal may be woken up, in which case the first terminal may determine a target terminal nearest to the user according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, for example, in a case where the wake-up feature is the audio energy of the audio data, the first terminal may compare the audio energy of the first audio data with the audio energy of the second historical audio data, determine a target terminal with the largest audio energy from the first terminal and the second terminal, and regard the target terminal as the terminal to be woken up.
In case the wake-up response information comprises a reject wake-up information, it means that the second terminal will not wake up, in which case the first terminal may be regarded as the terminal to be woken up.
It should be noted that, the wake-up feature of the first audio data may be obtained when the first terminal determines that the first audio data meets the preset wake-up condition, and in addition, considering that the first historical audio data also includes a wake-up word spoken by the user, if the wake-up response information includes the wake-up feature of the second historical audio data, the wake-up feature of the first historical audio data may also be obtained, and according to the wake-up feature of the first historical audio data and the wake-up feature of the second historical audio data, a target terminal closest to the user is determined from the first terminal and the second terminal, and the target terminal is taken as the terminal to be woken.
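The selection in step S105 can be sketched as follows, under stated assumptions: the wake-up feature is taken to be audio energy, computed here as the mean of squared samples, and a tie is resolved in favour of the first terminal. The names `WakeResponse`, `audio_energy` and `choose_terminal`, the energy formula and the tie-break rule are all hypothetical; the disclosure only states that larger audio energy indicates a terminal closer to the user.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class WakeResponse:
    """Wake-up response information returned by the second terminal."""
    reject: bool                          # True -> wake-up refusal information
    wake_feature: Optional[float] = None  # e.g. audio energy of the second historical audio data


def audio_energy(samples: Sequence[float]) -> float:
    """Mean squared amplitude, used here as an assumed wake-up feature."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def choose_terminal(first_energy: float, response: WakeResponse) -> str:
    """Return "first" or "second" as the terminal to be awakened."""
    if response.reject or response.wake_feature is None:
        # The second terminal will not be woken up, so the first terminal responds.
        return "first"
    # Larger energy is treated as closer to the user; ties go to the first terminal.
    return "first" if first_energy >= response.wake_feature else "second"
```

The first terminal would call `choose_terminal` once the wake-up response information arrives, rather than waiting for a timer to expire.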
S106, controlling the terminal to be awakened.
In this step, after determining the terminal to be awakened, the first terminal may control the terminal to be awakened to respond to the user's wake-up according to a preset response instruction. In a possible implementation, where the terminal to be awakened is the first terminal, the first terminal can be controlled to wake up; where the terminal to be awakened is the second terminal, a wake-up instruction can be sent to the second terminal so that the second terminal wakes up according to the wake-up instruction. For example, when the terminal to be awakened is the first terminal, the first terminal may directly output a response instruction, for example "I am here". When the terminal to be awakened is the second terminal, which indicates that the second terminal is closer to the user, the first terminal may send a wake-up instruction to the second terminal, and the second terminal may output the response instruction, for example "I am here", after receiving the wake-up instruction.
It should be noted that the above-mentioned manner of controlling the terminal to be awakened is merely illustrative, and the terminal to be awakened may be awakened by other manners of related art, which is not limited in this disclosure.
By adopting the method, after the first terminal meets the preset wake-up condition, the first voiceprint feature can be sent to the second terminal. After receiving the first voiceprint feature, the second terminal can acquire the stored second historical audio data in the historical time period and send the wake-up response information to the first terminal according to the second historical audio data. The first terminal can therefore determine the terminal to be awakened as soon as it receives the wake-up response information sent by the second terminal, without waiting for the timeout decision timer to expire, and control the terminal to be awakened to wake up, which improves the wake-up response speed and the user experience.
Fig. 2 is a flowchart illustrating a second wake-up control method applied to a second terminal according to an exemplary embodiment, as shown in fig. 2, the method including:
S201, collecting audio data in real time.
S202, under the condition that a first voiceprint feature corresponding to first historical audio data acquired in a historical time period sent by a first terminal is received, second historical audio data in the stored historical time period is acquired.
The historical time period may include a preset time period before the target time period, where the preset time period may be the time the user needs to speak the wake-up word, and the durations of the historical time periods corresponding to different wake-up words are different.
It should be noted that, in the process of collecting the audio data in real time, the data collection module of the second terminal may store the second historical audio data collected in the historical time period, where the manner of storing the second historical audio data may refer to the manner of storing the first historical audio data in step S102, which is not described herein again.
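One plausible way to keep the most recent historical audio available while collecting in real time is a fixed-length ring buffer. The sketch below is an assumption for illustration only: the storage manner of step S102 is not reproduced here, and the class name `HistoryBuffer` and the frame-based sizing are hypothetical.

```python
from collections import deque
from typing import Deque, List


class HistoryBuffer:
    """Keeps only the most recent `history_frames` audio frames."""

    def __init__(self, history_frames: int) -> None:
        # A deque with maxlen discards the oldest frame when a new one is appended.
        self._frames: Deque[bytes] = deque(maxlen=history_frames)

    def push(self, frame: bytes) -> None:
        """Store a newly collected audio frame."""
        self._frames.append(frame)

    def snapshot(self) -> List[bytes]:
        """Return the stored historical audio frames, oldest first."""
        return list(self._frames)
```

Sizing `history_frames` to cover the time needed to speak the wake-up word would match the description of the historical time period above.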
In this step, after the second terminal receives the first voiceprint feature corresponding to the first historical audio data sent by the first terminal, it indicates that the first terminal has been awakened, and the second terminal has not been awakened, where in this case, the second terminal may acquire the stored second historical audio data in the historical time period.
S203, according to the second historical audio data, wake-up response information is sent to the first terminal.
Wherein the wake-up response information may include a reject wake-up information or a wake-up feature of the second historical audio data.
In this step, after the second terminal acquires the second historical audio data, it may be determined whether the second terminal can be awakened according to the second historical audio data. Under the condition that the second terminal can be awakened, the awakening characteristic of the second historical audio data can be acquired, the awakening characteristic of the second historical audio data is sent to the first terminal, and under the condition that the second terminal is not awakened, the awakening rejection information can be sent to the first terminal.
To determine whether the second terminal can be awakened, in one possible implementation, it may be first determined whether the second historical audio data includes voice data, where the second historical audio data does not include voice data, indicating that the second terminal will not be awakened, the refusal to wake information may be sent to the first terminal.
Where the second historical audio data includes voice data, the second terminal may be awakened. However, the voice in the second historical audio data may not come from the same person as the voice in the first historical audio data, so to avoid waking up the second terminal by mistake, it may be further determined whether the second terminal can be awakened. In a possible implementation, where the second historical audio data includes voice data, a second voiceprint feature corresponding to the second historical audio data may be acquired, and whether to wake up the second terminal is determined according to the first voiceprint feature and the second voiceprint feature.
Since the voiceprint features of different speakers are different, whether the first historical audio data and the second historical audio data belong to the same user may be determined according to the first voiceprint feature and the second voiceprint feature. If they belong to the same user, it is determined to wake up the second terminal; if they do not belong to the same user, it is determined not to wake up the second terminal. Specifically, whether the first voiceprint feature and the second voiceprint feature belong to the same speaker may be determined by a related-art method. Where the two voiceprint features are determined to belong to the same speaker, it may be determined that the first historical audio data and the second historical audio data belong to the same user, so that it is determined to wake up the second terminal, the wake-up feature of the second historical audio data is acquired, and the wake-up feature of the second historical audio data is sent to the first terminal. Where the two voiceprint features are determined not to belong to the same speaker, it may be determined that the first historical audio data and the second historical audio data do not belong to the same user, so that it is determined not to wake up the second terminal, and the wake-up refusal information is sent to the first terminal.
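The second terminal's decision in step S203 can be sketched as below. The disclosure leaves the speaker comparison to the related art; cosine similarity against a fixed threshold is swapped in here purely as an assumed stand-in, and the names `cosine_similarity` and `build_wake_response`, the threshold value 0.8 and the dictionary layout of the response are hypothetical.

```python
import math
from typing import Dict, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two voiceprint vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_wake_response(
    first_voiceprint: Sequence[float],
    second_voiceprint: Optional[Sequence[float]],
    second_wake_feature: float,
    threshold: float = 0.8,  # assumed value, not from the disclosure
) -> Dict[str, object]:
    """Build the wake-up response information sent back to the first terminal.

    second_voiceprint is None when the second historical audio data
    contains no voice data.
    """
    if second_voiceprint is None:
        return {"reject": True}  # no voice data: do not wake the second terminal
    if cosine_similarity(first_voiceprint, second_voiceprint) < threshold:
        return {"reject": True}  # treated as a different speaker
    # Same speaker: report the wake-up feature of the second historical audio data.
    return {"reject": False, "wake_feature": second_wake_feature}
```

Any speaker-verification method could replace the similarity check without changing the shape of the response.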
It should be noted that, the manner in which the second terminal obtains the wake-up feature of the second historical audio data may refer to the manner in which the first terminal obtains the wake-up feature of the first historical audio data, which is not described herein.
After receiving the wake-up response information sent by the second terminal, the first terminal can determine the terminal to be waken up according to the wake-up response information. And when the wake-up response information comprises the wake-up rejection information, the first terminal can be used as the terminal to be waken, and when the wake-up response information comprises the wake-up feature of the second historical audio data, the terminal to be waken can be determined from the first terminal and the second terminal according to the wake-up feature of the first historical audio data and the wake-up feature of the second historical audio data.
After determining the terminal to be awakened, the first terminal can control the terminal to be awakened to respond to the user's wake-up according to a preset response instruction. Where the terminal to be awakened is the first terminal, the first terminal may directly respond to the user's wake-up, for example by outputting a response instruction such as "I am here". Where the terminal to be awakened is the second terminal, the second terminal may receive the wake-up instruction sent by the first terminal, wake up according to the wake-up instruction, and respond to the user's wake-up, for example by outputting the response instruction after receiving the wake-up instruction.
By adopting the method, after the second terminal receives the first voiceprint feature corresponding to the first historical audio data sent by the first terminal, the second terminal can acquire the second historical audio data in the stored historical time period and send the awakening response information to the first terminal according to the second historical audio data, so that the first terminal does not need to wait for the timeout of the timeout judgment timer, can directly determine the terminal to be awakened according to the awakening response information and control the terminal to be awakened, thereby improving the awakening response speed and user experience.
FIG. 3 is a flow chart illustrating a third wake-up control method, as shown in FIG. 3, according to an exemplary embodiment, comprising:
S301, the first terminal and the second terminal collect audio data in real time.
S302, under the condition that first audio data acquired in a target time period meet a preset wake-up condition, the first terminal acquires first historical audio data in a stored historical time period.
The historical time period may include a preset time period before the target time period, where the preset time period may be a duration of the user speaking the wake-up word, and durations of the historical time periods corresponding to different wake-up words are different.
S303, the first terminal acquires a first voiceprint feature corresponding to the first historical audio data and sends the first voiceprint feature to the second terminal.
S304, under the condition that the second terminal receives the first voiceprint feature corresponding to the first historical audio data acquired in the historical time period sent by the first terminal, acquiring the stored second historical audio data in the historical time period.
S305, the second terminal determines whether the second historical audio data comprises voice data, and if the second historical audio data comprises voice data, steps S306-S309 are executed, and if the second historical audio data does not comprise voice data, it is determined that the second terminal is not awakened, and step S309 is executed.
S306, the second terminal acquires a second voiceprint feature corresponding to the second historical audio data.
S307, the second terminal determines whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature, and executes step S308 if the second terminal is determined to wake up, and executes step S309 if the second terminal is determined not to wake up.
S308, the second terminal sends the wake-up feature of the second historical audio data to the first terminal.
S309, the second terminal sends the refusal awakening information to the first terminal.
S310, after receiving the wake-up response information sent by the second terminal, the first terminal determines a terminal to be waken up according to the wake-up response information.
In this step, under the condition that the wake-up response information includes the wake-up feature of the second historical audio data, the first terminal may acquire the wake-up feature of the first audio data, determine, from the first terminal and the second terminal, a target terminal closest to the user according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and use the target terminal as the terminal to be woken up; in case the wake-up response information comprises a reject wake-up information, it means that the second terminal will not wake up, in which case the first terminal may be regarded as the terminal to be woken up.
S311, the first terminal determines whether the terminal to be awakened is the first terminal, and if the terminal to be awakened is the first terminal, step S312 is executed, and if the terminal to be awakened is the second terminal, step S313 is executed.
S312, the first terminal controls the first terminal to wake up.
S313, the first terminal sends a wake-up instruction to the second terminal.
S314, after receiving the wake-up instruction sent by the first terminal, the second terminal wakes up the second terminal according to the wake-up instruction.
The specific implementation manner of the steps S301 to S314 may refer to the specific implementation manner of the embodiment shown in fig. 1 and the embodiment shown in fig. 2, which are not described herein again.
By adopting the method, after the first terminal meets the preset wake-up condition, the first voiceprint feature can be sent to the second terminal. After receiving the first voiceprint feature, the second terminal can acquire the stored second historical audio data in the historical time period, determine whether to wake up the second terminal according to the second historical audio data, and send wake-up refusal information to the first terminal where the second terminal is not to be awakened. Further, where the second terminal determines according to the second historical audio data that it may be awakened, it may acquire a second voiceprint feature corresponding to the second historical audio data and determine whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature. It sends the wake-up feature of the second historical audio data to the first terminal where it determines to wake up the second terminal, and sends the wake-up refusal information to the first terminal where it determines not to wake up the second terminal. The first terminal can therefore determine the terminal to be awakened as soon as it receives the wake-up refusal information or the wake-up feature of the second historical audio data, without waiting for the timeout decision timer to expire, and control the terminal to be awakened to wake up, which improves the wake-up response speed and the user experience.
It should be noted that, in the wake-up control method of the embodiment shown in fig. 3, the first terminal is first woken up, and the second terminal is not woken up when receiving the first voiceprint feature sent by the first terminal, but it is considered that the second terminal may be in a pre-wake-up state when receiving the first voiceprint feature sent by the first terminal, that is, the second terminal has been woken up by the wake-up word spoken by the user, but has not responded to the user yet, in this case, the terminal to be woken up in response to the user wake-up may also be determined from the first terminal and the second terminal through the second terminal. In one possible implementation, fig. 4 is a flowchart illustrating a fourth wake-up control method according to an exemplary embodiment, as shown in fig. 4, including:
S401, the first terminal and the second terminal collect audio data in real time.
S402, under the condition that first audio data acquired in a target time period meet a preset wake-up condition, the first terminal acquires first historical audio data in a stored historical time period.
The historical time period may include a preset time period before the target time period, where the preset time period may be a duration of the user speaking the wake-up word, and durations of the historical time periods corresponding to different wake-up words are different.
S403, the first terminal acquires a first voiceprint feature corresponding to the first historical audio data and a wake-up feature of the first audio data, and sends the first voiceprint feature and the wake-up feature of the first audio data to the second terminal.
S404, the second terminal determines whether the second terminal is in a pre-wake state or not under the condition that the second terminal receives the first voiceprint feature and the wake feature of the first audio data sent by the first terminal, and acquires second historical audio data in the stored historical time period under the condition that the second terminal is in the pre-wake state.
Note that, in the case where the second terminal is not in the pre-wake state, the subsequent processing manner may refer to the processing manners of S304 to S314 in the embodiment shown in fig. 3, which is not described herein.
S405, the second terminal acquires a second voiceprint feature corresponding to the second historical audio data.
S406, the second terminal determines whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature, and performs steps S407 to S412 when determining to wake up the second terminal, and performs steps S413 to S414 when determining not to wake up the second terminal.
S407, the second terminal acquires the wake-up characteristic of the second historical audio data.
S408, the second terminal determines a terminal to be awakened from the first terminal and the second terminal according to the awakening characteristics of the first audio data and the awakening characteristics of the second historical audio data.
S409, the second terminal determines whether the terminal to be awakened is the second terminal, and if the terminal to be awakened is the second terminal, step S410 is executed, and if the terminal to be awakened is the first terminal, step S411 is executed.
S410, the second terminal controls the second terminal to wake up.
S411, the second terminal sends a wake-up instruction to the first terminal.
S412, the first terminal wakes up the first terminal according to the wake-up instruction after receiving the wake-up instruction sent by the second terminal.
S413, the second terminal sends wake-refusing information to the first terminal.
S414, the first terminal controls the first terminal to wake up after receiving the wake-up response information sent by the second terminal.
The specific implementation manner of the steps S401 to S414 may refer to the specific implementation manner of the embodiment shown in fig. 1 and the embodiment shown in fig. 2, which are not described herein again.
By adopting the method, after the first terminal meets the preset wake-up condition, the first voiceprint feature and the wake-up feature of the first audio data can be sent to the second terminal. When the second terminal receives the first voiceprint feature and the wake-up feature of the first audio data while in the pre-wake state, it can acquire the second voiceprint feature corresponding to the second historical audio data and determine whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature. Where it determines to wake up the second terminal, it can determine the terminal to be awakened that responds to the user according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and control the terminal to be awakened to wake up.
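The branching that the second terminal performs in steps S404 to S413 can be summarised in a short sketch. It assumes the speaker comparison has already produced a boolean and that the wake-up features are audio energies where a larger value means closer to the user; the function name, the returned action labels and the tie-break towards the first terminal are hypothetical.

```python
def second_terminal_decision(
    pre_wake: bool,
    same_speaker: bool,
    first_wake_feature: float,
    second_wake_feature: float,
) -> str:
    """Return the action the second terminal takes after receiving the
    first voiceprint feature and the wake-up feature of the first audio data."""
    if not pre_wake:
        # Not in the pre-wake state: handled as in the embodiment of fig. 3.
        return "handle_as_fig3"
    if not same_speaker:
        # S413: the voiceprints differ, so refuse and let the first terminal wake up.
        return "send_refusal_to_first"
    if second_wake_feature > first_wake_feature:
        # S410: the second terminal is judged closer to the user.
        return "wake_second"
    # S411: the first terminal is judged closer (ties also go to the first terminal).
    return "send_wake_instruction_to_first"
```

In this variant the arbitration runs on the second terminal, so the first terminal only has to act on a wake-up instruction or on wake-up refusal information.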
Fig. 5 is a schematic structural diagram of a wake-up control device according to an exemplary embodiment, which is applied to a first terminal, as shown in fig. 5, and includes:
A first data acquisition module 501 configured to acquire audio data in real time;
A first data obtaining module 502, configured to obtain, in a case where first audio data collected in a target time period meets a preset wake-up condition, first historical audio data in a stored historical time period, where the historical time period includes a preset time period before the target time period;
The voiceprint feature acquiring module 503 is configured to acquire a first voiceprint feature corresponding to the first historical audio data, and send the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data in the stored historical time period after receiving the first voiceprint feature, and sends wake-up response information to the first terminal according to the second historical audio data;
An information receiving module 504 configured to receive the wake-up response information sent by the second terminal;
A terminal determining module 505 configured to determine a terminal to be awakened from the first terminal and the second terminal according to the awakening response information;
A first wake-up module 506 configured to control the terminal to be awakened to wake up.
Optionally, the preset wake-up condition includes:
the first audio data comprises a preset wake-up word; or alternatively
The first audio data comprises target keywords in the preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer greater than 0 and less than the target number, and the target number is the number of keywords in the preset wake-up words.
Optionally, the terminal determining module 505 includes:
A terminal determining submodule, configured to determine a target terminal nearest to a user from the first terminal and the second terminal according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and take the target terminal as the terminal to be woken up, wherein the wake-up feature is used for representing the distance between the terminal and the user; or alternatively
And under the condition that the wake-up response information comprises wake-up refusal information, taking the first terminal as the terminal to be waken up.
Optionally, the first wake-up module 506 includes:
A wake-up sub-module configured to control the first terminal to wake up under the condition that the terminal to be awakened is the first terminal; or alternatively
And under the condition that the terminal to be awakened is the second terminal, sending an awakening instruction to the second terminal so as to enable the second terminal to be awakened according to the awakening instruction.
Through the device, the first voiceprint feature can be sent to the second terminal after the first terminal meets the preset wake-up condition. After receiving the first voiceprint feature, the second terminal can acquire the stored second historical audio data in the historical time period and send the wake-up response information to the first terminal according to the second historical audio data. The first terminal can therefore determine the terminal to be awakened as soon as it receives the wake-up response information sent by the second terminal, without waiting for the timeout decision timer to expire, and control the terminal to be awakened to wake up, which improves the wake-up response speed and the user experience.
Fig. 6 is a schematic structural diagram of a second wake-up control device according to an exemplary embodiment, which is applied to a second terminal, as shown in fig. 6, and includes:
A second data acquisition module 601 configured to acquire audio data in real time;
A second data obtaining module 602, configured to obtain, when receiving a first voiceprint feature corresponding to first historical audio data collected in a historical time period sent by a first terminal, stored second historical audio data in the historical time period;
The information sending module 603 is configured to send wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be woken up according to the wake-up response information, and controls the terminal to be woken up.
Optionally, the wake-up response information includes wake-up refusal information or the wake-up feature of the second historical audio data. Fig. 7 is a schematic structural diagram of a third wake-up control device according to an exemplary embodiment, and as shown in fig. 7, the device further includes:
A wake-up determination module 604 configured to determine whether to wake up the second terminal based on the second historical audio data;
The information sending module 603 is configured to:
Transmitting a wake-up feature of the second historical audio data to the first terminal if it is determined to wake up the second terminal; or alternatively
And sending the refusal awakening information to the first terminal under the condition that the second terminal is not awakened.
Optionally, fig. 8 is a schematic structural diagram of a fourth wake-up control device according to an exemplary embodiment, and as shown in fig. 8, the device further includes:
A data determination module 605 configured to determine whether the second historical audio data includes voice data;
the wake-up determination module 604 is configured to:
Under the condition that the second historical audio data comprises the voice data, acquiring a second voiceprint feature corresponding to the second historical audio data, and determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or alternatively
In the event that the second historical audio data does not include the voice data, it is determined not to wake up the second terminal.
Optionally, the wake-up determination module 604 includes:
A user judging sub-module configured to determine whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature;
A wake-up determining sub-module configured to determine to wake up the second terminal if the first historical audio data and the second historical audio data are determined to belong to the same user, or to determine not to wake up the second terminal if they are determined not to belong to the same user.
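The decision made by the data determination module 605 and the wake-up determination module 604 can be sketched as follows. The disclosure does not specify how two voiceprint features are compared, so this sketch assumes that each voiceprint is a fixed-length vector and that "same user" means a cosine similarity at or above a threshold. The function names and the default threshold of 0.7 are hypothetical.

```python
import math
from typing import Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    if len(a) != len(b):
        raise ValueError("voiceprint vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def should_wake_second_terminal(
    first_voiceprint: Sequence[float],
    second_voiceprint: Optional[Sequence[float]],
    threshold: float = 0.7,  # assumed value, not taken from the disclosure
) -> bool:
    """Return True if the second terminal should be woken.

    second_voiceprint is None when the second historical audio data
    contains no voice data, in which case the terminal is not woken.
    """
    if second_voiceprint is None:
        return False
    # Same user (assumed: similarity at or above the threshold) -> wake up.
    return cosine_similarity(first_voiceprint, second_voiceprint) >= threshold
```

A real speaker-verification back end would likely replace the cosine comparison; only the two-branch structure is drawn from the text above.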
Optionally, Fig. 9 is a schematic structural diagram of a fifth wake-up control device according to an exemplary embodiment. As shown in Fig. 9, the device further includes:
An instruction receiving module 606 configured to receive a wake-up instruction sent by the first terminal if the terminal to be woken up is the second terminal;
A second wake-up module 607 configured to wake up the second terminal according to the wake-up instruction.
With this device, after the second terminal receives the first voiceprint feature corresponding to the first historical audio data sent by the first terminal, it can acquire the second historical audio data stored for the historical time period and send the wake-up response information to the first terminal according to that data. The first terminal therefore does not need to wait for a timeout timer to expire. It can determine the terminal to be woken up directly from the wake-up response information and control that terminal to wake up, which improves wake-up response speed and user experience.
The specific manner in which the various modules of the apparatus in the above embodiments perform their operations has been described in detail in connection with the method embodiments, and will not be described in detail herein.
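On the first-terminal side, determining the terminal to be woken up from the wake-up response information can be sketched as follows. This is an illustration under stated assumptions: the response is a dictionary with a `"type"` key, the wake-up feature is a single number, and a larger value means the terminal is closer to the user. The disclosure states only that the wake-up feature represents the distance between a terminal and the user, so the direction of the comparison, the tie-breaking rule, and all names here are hypothetical.

```python
def choose_terminal_to_wake(first_wake_feature: float, response: dict) -> str:
    """Return "first" or "second" based on the second terminal's response."""
    if response["type"] == "reject_wake":
        # The second terminal declined, so the first terminal wakes itself.
        return "first"
    if response["type"] == "wake_feature":
        # Assumption: the larger wake-up feature belongs to the nearer terminal.
        # Ties go to the first terminal, which avoids sending a wake-up instruction.
        return "second" if response["value"] > first_wake_feature else "first"
    raise ValueError("unknown response type: %r" % (response["type"],))
```

Because the decision is taken as soon as the response arrives, no timeout needs to elapse before one of the two terminals is woken.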
The present disclosure also provides a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the steps of the wake-up control method provided by the present disclosure.
Fig. 10 is a block diagram of a terminal 1000, according to an example embodiment. For example, terminal 1000 can be a mobile phone, computer, digital broadcast terminal, messaging device, game console, tablet device, medical device, exercise device, personal digital assistant, or the like.
Referring to fig. 10, terminal 1000 can include one or more of the following components: a processing component 1002, a memory 1004, a power component 1006, a multimedia component 1008, an audio component 1010, an input/output (I/O) interface 1012, a sensor component 1014, and a communication component 1016.
Processing component 1002 generally controls overall operation of terminal 1000, such as operations associated with displaying, telephone calling, data communication, camera operation, and recording operation. The processing component 1002 can include one or more processors 1020 to execute instructions to perform all or part of the steps of the wake-up control method described above. Further, the processing component 1002 can include one or more modules that facilitate interaction between the processing component 1002 and other components. For example, the processing component 1002 can include a multimedia module to facilitate interaction between the multimedia component 1008 and the processing component 1002.
Memory 1004 is configured to store various types of data to support operation at terminal 1000. Examples of such data include instructions for any application or method operating on terminal 1000, contact data, phonebook data, messages, pictures, video, and the like. The memory 1004 may be implemented by any type or combination of volatile or nonvolatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disk.
Power component 1006 provides power to the various components of terminal 1000. Power component 1006 can include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for terminal 1000.
Multimedia component 1008 includes a screen that provides an output interface between terminal 1000 and the user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensors may sense not only the boundary of a touch or swipe action, but also the duration and pressure associated with the touch or swipe operation. In some embodiments, the multimedia component 1008 includes a front camera and/or a rear camera. The front camera and/or the rear camera may receive external multimedia data when terminal 1000 is in an operational mode, such as a photographing mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have focal length and optical zoom capabilities.
The audio component 1010 is configured to output and/or input audio signals. For example, audio component 1010 includes a Microphone (MIC) configured to receive external audio signals when terminal 1000 is in an operational mode, such as a call mode, a recording mode, or a speech recognition mode. The received audio signals may be further stored in memory 1004 or transmitted via communication component 1016. In some embodiments, the audio component 1010 further comprises a speaker for outputting audio signals.
The I/O interface 1012 provides an interface between the processing component 1002 and peripheral interface modules, which may be a keyboard, a click wheel, buttons, and the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
Sensor component 1014 includes one or more sensors for providing status assessments of various aspects of terminal 1000. For example, sensor component 1014 can detect an open/closed state of terminal 1000 and the relative positioning of components, such as the display and keypad of terminal 1000. Sensor component 1014 can also detect a change in position of terminal 1000 or a component of terminal 1000, the presence or absence of user contact with terminal 1000, the orientation or acceleration/deceleration of terminal 1000, and a change in temperature of terminal 1000. The sensor component 1014 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact. The sensor component 1014 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 1014 can also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
Communication component 1016 is configured to facilitate wired or wireless communication between terminal 1000 and other devices. Terminal 1000 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof. In an exemplary embodiment, the communication component 1016 receives broadcast signals or broadcast-related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 1016 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, Infrared Data Association (IrDA) technology, Ultra-Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, terminal 1000 can be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic elements for performing the wake-up control method described above.
In an exemplary embodiment, a non-transitory computer readable storage medium is also provided, such as memory 1004, including instructions executable by processor 1020 of terminal 1000 to perform the wake-up control method described above. For example, the non-transitory computer readable storage medium may be ROM, Random Access Memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, etc.
In another exemplary embodiment, a computer program product is also provided, comprising a computer program executable by a programmable apparatus, the computer program having code portions for performing the above-mentioned wake-up control method when executed by the programmable apparatus.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It is to be understood that the present disclosure is not limited to the precise arrangements and instrumentalities shown in the drawings, and that various modifications and changes may be effected without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (18)

1. A wake-up control method, applied to a first terminal, the method comprising:
collecting audio data in real time;
Under the condition that first audio data acquired in a target time period meet a preset wake-up condition, acquiring first historical audio data in a stored historical time period, wherein the historical time period comprises a preset time period before the target time period;
Acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data stored in the historical time period after receiving the first voiceprint feature, and sends wake-up response information to the first terminal according to the second historical audio data;
receiving the wake-up response information sent by the second terminal;
determining a terminal to be woken up from the first terminal and the second terminal according to the wake-up response information;
controlling the terminal to be woken up to wake up;
The preset wake-up condition includes: the first audio data comprises target keywords in preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer which is greater than 0 and smaller than the target number, and the target number is the number of the keywords in the preset wake-up words.
2. The method of claim 1, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data; and the determining a terminal to be woken up from the first terminal and the second terminal according to the wake-up response information comprises:
in the case that the wake-up response information comprises the wake-up feature of the second historical audio data, determining a target terminal nearest to a user from the first terminal and the second terminal according to a wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and taking the target terminal as the terminal to be woken up, wherein the wake-up feature is used for representing the distance between a terminal and the user; or
in the case that the wake-up response information comprises the wake-up rejection information, taking the first terminal as the terminal to be woken up.
3. The method of claim 1, wherein the controlling the terminal to wake up comprises:
controlling the first terminal to wake up in the case that the terminal to be woken up is the first terminal; or
sending a wake-up instruction to the second terminal in the case that the terminal to be woken up is the second terminal, so that the second terminal wakes up according to the wake-up instruction.
4. A wake-up control method, applied to a second terminal, the method comprising:
collecting audio data in real time;
Under the condition that a first voiceprint feature corresponding to first historical audio data collected in a historical time period sent by a first terminal is received, second historical audio data in the stored historical time period is obtained, wherein the first historical audio data is audio data obtained when the first audio data collected in a target time period of the first terminal meets a preset wake-up condition, the historical time period comprises a preset time period before the target time period, and the preset wake-up condition comprises: the first audio data comprises target keywords in preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer which is more than 0 and less than the target number, and the target number is the number of the keywords in the preset wake-up words;
sending wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be woken up according to the wake-up response information and controls the terminal to be woken up to wake up.
5. The method of claim 4, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data, and before the sending wake-up response information to the first terminal according to the second historical audio data, the method further comprises:
determining whether to wake up the second terminal according to the second historical audio data;
and the sending wake-up response information to the first terminal according to the second historical audio data comprises:
sending the wake-up feature of the second historical audio data to the first terminal in the case that it is determined to wake up the second terminal; or
sending the wake-up rejection information to the first terminal in the case that it is determined not to wake up the second terminal.
6. The method of claim 5, wherein prior to said determining whether to wake up the second terminal based on the second historical audio data, the method further comprises:
Determining whether the second historical audio data includes voice data;
and determining whether to wake up the second terminal according to the second historical audio data comprises:
Acquiring a second voiceprint feature corresponding to the second historical audio data under the condition that the second historical audio data comprises the voice data, and determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or alternatively
In the event that the second historical audio data does not include the voice data, it is determined not to wake up the second terminal.
7. The method of claim 6, wherein the determining whether to wake the second terminal based on the first voiceprint feature and the second voiceprint feature comprises:
Determining whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature;
and determining to wake up the second terminal under the condition that the first historical audio data and the second historical audio data are determined to belong to the same user, or determining not to wake up the second terminal under the condition that the first historical audio data and the second historical audio data are determined not to belong to the same user.
8. The method according to claim 4, wherein the method further comprises:
receiving a wake-up instruction sent by the first terminal in the case that the terminal to be woken up is the second terminal;
waking up the second terminal according to the wake-up instruction.
9. A wake-up control device, applied to a first terminal, the device comprising:
a first data collection module configured to collect audio data in real time;
The first data acquisition module is configured to acquire first historical audio data in a stored historical time period when the first audio data acquired in a target time period meets a preset wake-up condition, wherein the historical time period comprises a preset time period before the target time period;
The voiceprint feature acquisition module is configured to acquire a first voiceprint feature corresponding to the first historical audio data, and send the first voiceprint feature to a second terminal, so that the second terminal acquires second historical audio data in the stored historical time period after receiving the first voiceprint feature, and sends wake-up response information to the first terminal according to the second historical audio data;
the information receiving module is configured to receive the wake-up response information sent by the second terminal;
a terminal determining module configured to determine a terminal to be woken up from the first terminal and the second terminal according to the wake-up response information;
a first wake-up module configured to control the terminal to be woken up to wake up;
The preset wake-up condition includes: the first audio data comprises target keywords in preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer which is greater than 0 and smaller than the target number, and the target number is the number of the keywords in the preset wake-up words.
10. The apparatus of claim 9, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data; the terminal determining module comprises:
a terminal determining sub-module configured to determine a target terminal nearest to a user from the first terminal and the second terminal according to a wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and take the target terminal as the terminal to be woken up, wherein the wake-up feature is used for representing the distance between a terminal and the user; or
in the case that the wake-up response information comprises the wake-up rejection information, take the first terminal as the terminal to be woken up.
11. The apparatus of claim 9, wherein the first wake-up module comprises:
a wake-up sub-module configured to control the first terminal to wake up in the case that the terminal to be woken up is the first terminal; or
in the case that the terminal to be woken up is the second terminal, send a wake-up instruction to the second terminal, so that the second terminal wakes up according to the wake-up instruction.
12. A wake-up control device, applied to a second terminal, the device comprising:
a second data collection module configured to collect audio data in real time;
The second data acquisition module is configured to acquire second historical audio data in a stored historical time period under the condition that a first voiceprint feature corresponding to first historical audio data acquired in the historical time period sent by the first terminal is received, wherein the first historical audio data is audio data acquired by the first terminal under the condition that the first audio data acquired in a target time period meets a preset wake-up condition, the historical time period comprises a preset time period before the target time period, and the preset wake-up condition comprises: the first audio data comprises target keywords in preset wake-up words, the target keywords comprise first N keywords in the preset wake-up words, N is an integer which is more than 0 and less than the target number, and the target number is the number of the keywords in the preset wake-up words;
an information sending module configured to send wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be woken up according to the wake-up response information and controls the terminal to be woken up to wake up.
13. The apparatus of claim 12, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data, the apparatus further comprising:
a wake-up determining module configured to determine whether to wake up the second terminal according to the second historical audio data;
the information sending module is configured to:
send the wake-up feature of the second historical audio data to the first terminal in the case that it is determined to wake up the second terminal; or
send the wake-up rejection information to the first terminal in the case that it is determined not to wake up the second terminal.
14. The apparatus of claim 13, wherein the apparatus further comprises:
a data determination module configured to determine whether the second historical audio data includes voice data;
The wake-up determination module is configured to:
Acquiring a second voiceprint feature corresponding to the second historical audio data under the condition that the second historical audio data comprises the voice data, and determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or alternatively
In the event that the second historical audio data does not include the voice data, it is determined not to wake up the second terminal.
15. The apparatus of claim 14, wherein the wake-up determination module comprises:
a user determination sub-module configured to determine, according to the first voiceprint feature and the second voiceprint feature, whether the first historical audio data and the second historical audio data belong to the same user;
And the wake-up determination submodule is configured to determine to wake up the second terminal when the first historical audio data and the second historical audio data are determined to belong to the same user, or determine not to wake up the second terminal when the first historical audio data and the second historical audio data are determined to not belong to the same user.
16. The apparatus of claim 12, wherein the apparatus further comprises:
an instruction receiving module configured to receive a wake-up instruction sent by the first terminal in the case that the terminal to be woken up is the second terminal;
and the second wake-up module is configured to wake up the second terminal according to the wake-up instruction.
17. A computer readable storage medium having stored thereon computer program instructions, which when executed by a processor perform the steps of the method according to any of claims 1-3 or which when executed by a processor perform the steps of the method according to any of claims 4-8.
18. A terminal, comprising:
a memory having a computer program stored thereon;
A processor for executing the computer program in the memory to carry out the steps of the method of any one of claims 1-3 or to carry out the steps of the method of any one of claims 4-8.
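For illustration only, the preset wake-up condition recited in claims 1, 4, 9 and 12 (the first audio data contains the first N keywords of the preset wake-up word, with 0 < N < the number of keywords) can be sketched as follows. The sketch assumes the audio has already been recognized into a sequence of keyword tokens and that the target keywords must appear as a contiguous run. Both assumptions, the function name, and the sample wake-up word are hypothetical and form no part of the claims.

```python
from typing import Sequence


def meets_preset_wake_condition(
    heard: Sequence[str], wake_word: Sequence[str], n: int
) -> bool:
    """Check whether `heard` contains the first `n` keywords of `wake_word`."""
    if not 0 < n < len(wake_word):
        raise ValueError("n must satisfy 0 < n < number of keywords in the wake-up word")
    target = list(wake_word[:n])
    heard_list = list(heard)
    # Assumption: the target keywords must occur as a contiguous run.
    return any(
        heard_list[i:i + n] == target
        for i in range(len(heard_list) - n + 1)
    )


# Hypothetical three-keyword wake-up word, used only for this example.
SAMPLE_WAKE_WORD = ["hi", "little", "helper"]
```

Because N is strictly smaller than the number of keywords, the condition can be met before the user has finished saying the whole wake-up word.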
CN202011349897.0A 2020-11-26 2020-11-26 Wakeup control method, wakeup control device, storage medium and terminal Active CN112489650B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011349897.0A CN112489650B (en) 2020-11-26 2020-11-26 Wakeup control method, wakeup control device, storage medium and terminal

Publications (2)

Publication Number Publication Date
CN112489650A CN112489650A (en) 2021-03-12
CN112489650B true CN112489650B (en) 2024-06-18

Family

ID=74935022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011349897.0A Active CN112489650B (en) 2020-11-26 2020-11-26 Wakeup control method, wakeup control device, storage medium and terminal

Country Status (1)

Country Link
CN (1) CN112489650B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115827075B (en) * 2022-11-21 2024-02-23 小米汽车科技有限公司 Equipment control method, device, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108259280A (en) * 2018-02-06 2018-07-06 北京语智科技有限公司 A kind of implementation method, the system of Inteldectualization Indoors control
CN109391528A (en) * 2018-08-31 2019-02-26 百度在线网络技术(北京)有限公司 Awakening method, device, equipment and the storage medium of speech-sound intelligent equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105575395A (en) * 2014-10-14 2016-05-11 中兴通讯股份有限公司 Voice wake-up method and apparatus, terminal, and processing method thereof
KR20170132622A (en) * 2016-05-24 2017-12-04 삼성전자주식회사 Electronic device having speech recognition function and operating method of Electronic device
KR20190084789A (en) * 2018-01-09 2019-07-17 엘지전자 주식회사 Electric terminal and method for controlling the same
CN110299137B (en) * 2018-03-22 2023-12-12 腾讯科技(深圳)有限公司 Voice interaction method and device
US10679629B2 (en) * 2018-04-09 2020-06-09 Amazon Technologies, Inc. Device arbitration by multiple speech processing systems
CN111742361B (en) * 2018-07-24 2023-08-22 华为技术有限公司 Method for updating wake-up voice of voice assistant by terminal and terminal
US10971160B2 (en) * 2018-11-13 2021-04-06 Comcast Cable Communications, Llc Methods and systems for determining a wake word
CN110364153A (en) * 2019-07-30 2019-10-22 恒大智慧科技有限公司 A kind of distributed sound control method, system, computer equipment and storage medium
CN111210829A (en) * 2020-02-19 2020-05-29 腾讯科技(深圳)有限公司 Speech recognition method, apparatus, system, device and computer readable storage medium

Also Published As

Publication number Publication date
CN112489650A (en) 2021-03-12

Similar Documents

Publication Publication Date Title
CN107919123B (en) Multi-voice assistant control method, device and computer readable storage medium
CN111722824B (en) Voice control method, device and computer storage medium
CN109920418B (en) Method and device for adjusting awakening sensitivity
EP3136793A1 (en) Method and apparatus for awakening electronic device
EP3933570A1 (en) Method and apparatus for controlling a voice assistant, and computer-readable storage medium
CN105224601B (en) A kind of method and apparatus of extracting time information
EP4184506A1 (en) Audio processing
US10325244B2 (en) Method and device for processing a communication message
CN106791921A (en) The processing method and processing device of net cast
CN111696553B (en) Voice processing method, device and readable medium
CN111063354B (en) Man-machine interaction method and device
CN112133302B (en) Method, device and storage medium for pre-waking up terminal
CN106331328B (en) Information prompting method and device
CN111580773B (en) Information processing method, device and storage medium
CN109032554A (en) A kind of audio-frequency processing method and electronic equipment
CN112489650B (en) Wakeup control method, wakeup control device, storage medium and terminal
CN112509596A (en) Wake-up control method and device, storage medium and terminal
CN111309392B (en) Equipment control method and electronic equipment
CN105786561B (en) Method and device for calling process
CN111127846A (en) Door-knocking reminding method, door-knocking reminding device and electronic equipment
CN112489653B (en) Speech recognition method, device and storage medium
CN111968680A (en) Voice processing method, device and storage medium
CN113035189A (en) Document demonstration control method, device and equipment
CN107979695B (en) Network message receiving method and device and storage medium
CN106210320A (en) Time prompting method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant