CN110459215A

CN110459215A - Voice control method, apparatus, computer device and computer-readable storage medium

Info

Publication number: CN110459215A
Application number: CN201910740319.0A
Authority: CN
Inventors: 邓衍
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2019-08-12
Filing date: 2019-08-12
Publication date: 2019-11-15
Anticipated expiration: 2039-08-12
Also published as: CN110459215B

Abstract

This application relates to a voice control method, apparatus, computer device and computer-readable storage medium. The method includes: a smart speaker acquires voice information; the smart speaker acquires a control instruction generated according to the voice information; the smart speaker sends the control instruction to a mobile terminal through a Bluetooth protocol, where the control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer. The scheme provided by this application allows the mobile terminal to be voice-controlled through the smart speaker, without manually adjusting the audio of the mobile terminal, which is more convenient for the user and meets more of the user's needs.

Description

Voice control method, device, computer equipment and computer readable storage medium

Technical Field

The present application relates to the field of computer technologies, and in particular, to a voice control method, apparatus, computer device, and computer-readable storage medium.

Background

With the development of computer technology, voice control technology has emerged. It is a technology in which a machine converts voice signals into corresponding text or commands through a process of recognition and understanding. A large number of voice recognition control products have entered the market and the service sector, for example adjusting the volume of an air conditioner or a smart speaker by voice.

When a conventional smart speaker is connected to a terminal via Bluetooth, the volume of the speaker is generally adjusted by voice. However, when the volume of the terminal is too low, the volume the user wants still cannot be reached even when the speaker volume is set to maximum. In that case the volume of the mobile phone can only be adjusted by hand, which is inconvenient.

Disclosure of Invention

In view of the above, it is necessary to provide a voice control method, apparatus, computer device and computer-readable storage medium that solve the above technical problem.

A method of voice control, the method comprising:

a smart speaker acquires voice information;

the smart speaker acquires a control instruction generated according to the voice information;

the smart speaker sends the control instruction to a mobile terminal through a Bluetooth protocol, wherein the control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer.

A voice control method, applied to a mobile terminal, the method comprising:

a mobile terminal receives a control instruction sent by a smart speaker through a Bluetooth protocol, wherein the control instruction is generated according to voice information, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer;

the mobile terminal adjusts its own audio parameters according to the control instruction.

A voice control apparatus, applied to a smart speaker, the apparatus comprising:

a voice information acquisition module, configured to acquire voice information;

a control instruction acquisition module, configured to acquire a control instruction generated according to the voice information;

a sending module, configured to send the control instruction to a mobile terminal through a Bluetooth protocol, wherein the control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer.

A voice control apparatus, applied to a mobile terminal, the apparatus comprising:

a receiving module, configured to receive a control instruction sent by a smart speaker, wherein the control instruction is generated according to voice information, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer;

an adjusting module, configured to adjust the audio parameters of the mobile terminal according to the control instruction.

A computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to perform the steps of any of the above voice control methods.

A computer device comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the steps of any of the above voice control methods.

In the above voice control method, apparatus, computer device and computer-readable storage medium, the smart speaker acquires voice information, acquires a control instruction generated according to the voice information, and sends the control instruction to the mobile terminal. The control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer. The mobile terminal can therefore be voice-controlled through the smart speaker without manually adjusting its audio, which is more convenient for the user and meets more of the user's needs.

Drawings

FIG. 1 is a diagram of an exemplary implementation of a voice control method;

FIG. 2 is a flow diagram illustrating a voice control method according to one embodiment;

FIG. 3 is a flow chart illustrating a voice control method according to another embodiment;

FIG. 4 is a flowchart illustrating a process of adjusting audio parameters by a mobile terminal according to a control command according to an embodiment;

FIG. 5 is a schematic diagram illustrating a process of adjusting audio parameters according to a control command by the mobile terminal in another embodiment;

FIG. 6 is a schematic diagram illustrating a process of adjusting audio parameters according to a control command by the mobile terminal in another embodiment;

FIG. 7 is a diagram illustrating a mobile terminal adjusting volume according to a control command according to an embodiment;

FIG. 8 is a block diagram showing the structure of a voice control apparatus according to an embodiment;

FIG. 9 is a block diagram showing the construction of a voice control apparatus according to another embodiment;

FIG. 10 is a block diagram of a mobile terminal in one embodiment.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

FIG. 1 is a diagram of an exemplary application environment of a voice control method. Referring to FIG. 1, the voice control method is applied to a voice control system. The voice control system includes a smart speaker 110 and a mobile terminal 120. The smart speaker 110 and the mobile terminal 120 are connected through a Bluetooth protocol. The smart speaker 110 may be a speaker with voice recognition and Bluetooth capability. The mobile terminal 120 may be at least one of a smartphone, a tablet computer, a notebook computer, and the like.

In this embodiment, smart sound box 110 and mobile terminal 120 may communicate via a bluetooth audio/video remote control protocol. The smart speaker 110 receives the voice information of the user, uploads the voice information to the cloud, and performs voice recognition through a voice recognition engine on the cloud to determine the intention of the user. After the voice recognition engine in the cloud performs voice recognition on the voice information, a recognition result is obtained and returned to the smart sound box 110. The smart speaker 110 receives the voice recognition result and generates a control command according to the voice recognition result. Then, the smart speaker 110 sends the control command to the mobile terminal 120. The mobile terminal 120 receives the control command, and adjusts its own audio parameter according to the control command.

In this embodiment, the smart sound box 110 collects the voice information of the user through the microphone array, sends the voice information to the voice recognition engine, and sends a recognition request to the cloud through the voice recognition engine to determine the intention of the user.

In this embodiment, after the voice recognition engine in the cloud obtains the recognition result, a control instruction is generated according to the voice recognition result, and the control instruction is returned to the smart sound box 110. The smart speaker 110 receives the control command and sends the control command to the mobile terminal 120 through the bluetooth protocol.

In one embodiment, after receiving the voice message of the user, the smart sound box 110 may recognize the voice message through its internal voice recognition engine. After the voice information is subjected to voice recognition, a recognition result is obtained, and the smart sound box 110 may generate a control instruction according to the recognition result of the voice information. Then, the smart speaker 110 sends the control command to the mobile terminal 120.

In one embodiment, as shown in FIG. 2, a method of voice control is provided. The present embodiment is mainly illustrated by applying the method to the smart sound box 110 in fig. 1. Referring to fig. 2, the voice control method specifically includes the following steps:

Step 202: the smart speaker acquires voice information.

Specifically, the smart speaker and the mobile terminal are communicatively connected through a Bluetooth protocol, and the smart speaker receives voice information uttered by a user.

Step 204: the smart speaker acquires a control instruction generated according to the voice information.

Specifically, the smart speaker performs voice recognition on the voice information through an internal voice recognition engine to recognize the user's intention, and generates a corresponding control instruction according to the recognition result.

In this embodiment, the smart speaker may instead have a third-party device perform voice recognition on the voice information, receive the recognition result returned by the third-party device, and generate a corresponding control instruction according to the recognition result.

Step 206: the smart speaker sends the control instruction to the mobile terminal through a Bluetooth protocol, where the control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer.

Specifically, the smart speaker sends the control instruction to the mobile terminal, so that the mobile terminal adjusts its own audio parameters according to the control instruction.

In this embodiment, when the control instruction involves at least two mobile terminals, the smart speaker may identify which parts of the control instruction correspond to the same mobile terminal, classify the control instruction by mobile terminal, and send each classified control instruction to its corresponding mobile terminal.
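The classification by mobile terminal described above can be sketched as a simple grouping step. This is a minimal illustration, not the patent's implementation: the `Instruction` record, its field names and the terminal identifiers are all assumptions introduced for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Instruction:
    terminal: str   # hypothetical terminal identifier, e.g. "smartphone"
    audio: str      # audio type, e.g. "music" or "alarm"
    parameter: str  # requested adjustment, kept as plain text here


def group_by_terminal(instructions):
    """Group instructions by target terminal, preserving their original order."""
    grouped = defaultdict(list)
    for instruction in instructions:
        grouped[instruction.terminal].append(instruction)
    return dict(grouped)


batch = [
    Instruction("notebook", "music", "stop"),
    Instruction("smartphone", "alarm", "off"),
    Instruction("smartphone", "video", "volume=70%"),
]
for terminal, items in group_by_terminal(batch).items():
    # Each group would then be sent to its own terminal over Bluetooth.
    print(terminal, [(i.audio, i.parameter) for i in items])
```

Grouping before sending means each terminal receives only the instructions addressed to it, in the order they were issued.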

In this embodiment, the method further includes receiving response information from the mobile terminal. The response information is formed by the mobile terminal in response to the control instruction after it receives the control instruction, and is returned to the smart speaker.

Specifically, after the smart speaker sends the control instruction to the mobile terminal, it waits for the mobile terminal to return the corresponding response information. When the smart speaker receives the response information returned by the mobile terminal, it converts the response information into speech and plays it to notify the user.
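The send, wait and announce sequence can be sketched as follows. The `send` and `speak` callables, the response fields `ok` and `detail`, and the wording of the prompts are hypothetical; the text does not specify the response format or what happens when no response arrives, so that branch is an assumption.

```python
from typing import Callable, Optional


def response_to_prompt(response: Optional[dict]) -> str:
    """Turn a terminal's response into a sentence the speaker can read out."""
    if response is None:
        # Behaviour on a missing response is not described in the text; assumed here.
        return "The mobile terminal did not respond."
    if response.get("ok"):
        return f"Done: {response.get('detail', 'audio adjusted')}."
    return f"Could not adjust: {response.get('detail', 'unknown reason')}."


def send_and_prompt(instruction: dict,
                    send: Callable[[dict], Optional[dict]],
                    speak: Callable[[str], None]) -> None:
    """Send the instruction, wait for the response, and play it back as speech."""
    response = send(instruction)          # hypothetical blocking Bluetooth send
    speak(response_to_prompt(response))   # hypothetical text-to-speech hook


spoken = []
send_and_prompt(
    {"audio": "media", "volume": 60},
    send=lambda instr: {"ok": True, "detail": "media volume set to 60"},
    speak=spoken.append,
)
print(spoken[0])
```

Passing the transport and the speech output in as callables keeps the sketch runnable without Bluetooth or text-to-speech hardware.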

In the above voice control method, the smart speaker acquires voice information, acquires a control instruction generated according to the voice information, and sends the control instruction to the mobile terminal. The control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer. The mobile terminal can therefore be controlled through the smart speaker without manually adjusting its audio, which is more convenient for the user and meets more of the user's needs.

In one embodiment, the smart speaker acquiring a control instruction generated according to the voice information includes: the smart speaker acquires a generated control instruction comprising audio and corresponding audio parameters, where the audio parameters corresponding to the audio are determined according to the voice information.

The control instruction comprises audio and corresponding audio parameters. A control instruction may contain several audio types, each with its own audio parameters. The audio types include, but are not limited to, media, alarm clock, system ringtone of the mobile terminal, and call.
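One way to picture such a control instruction is as a small record holding a list of audio types with their parameters. The sketch below is an assumption for illustration only: the text does not define a wire format, so the class names, the `volume` key and the JSON encoding are all hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class AudioAdjustment:
    audio: str        # one of the audio types named in the text, e.g. "media"
    parameters: dict  # e.g. {"volume": 60}; the keys are assumptions


@dataclass
class ControlInstruction:
    adjustments: list = field(default_factory=list)

    def to_json(self) -> str:
        """Serialize for transmission (JSON is an assumed encoding)."""
        return json.dumps(asdict(self), sort_keys=True)

    @classmethod
    def from_json(cls, payload: str) -> "ControlInstruction":
        raw = json.loads(payload)
        return cls([AudioAdjustment(**item) for item in raw["adjustments"]])


instruction = ControlInstruction([
    AudioAdjustment("media", {"volume": 60}),
    AudioAdjustment("alarm", {"volume": 100}),
])
payload = instruction.to_json()
restored = ControlInstruction.from_json(payload)
print(payload)
```

Keeping the adjustments in a list rather than a dictionary lets the same audio type appear more than once and preserves the order in which adjustments were requested.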

Specifically, the smart speaker can perform voice recognition on the voice information through an internal voice recognition engine to recognize the user's intention and obtain a recognition result. The smart speaker then converts the recognition result into a corresponding control instruction.

In this embodiment, the smart speaker may have a third-party device perform voice recognition on the voice information to obtain a recognition result. The smart speaker then receives the recognition result returned by the third-party device, which includes the audio the user wants to adjust and the specific audio parameters to be adjusted for that audio. The smart speaker then generates a corresponding control instruction from the audio the user wants to adjust and the audio parameters to be adjusted.

In this embodiment, when the recognition result involves at least two mobile terminals, the smart speaker determines, for each of the at least two mobile terminals, the audio that needs to be adjusted and the audio parameters that need to be adjusted. The smart speaker then generates, for each of the at least two mobile terminals, a control instruction from the audio and audio parameters to be adjusted on that terminal.

In the above voice control method, the smart speaker determines the audio and the corresponding audio parameters in the voice information by recognizing the voice information, so that the audio the user wants to adjust and the audio parameters to be adjusted can be determined. A control instruction comprising the audio and the corresponding audio parameters, generated either by the smart speaker or by a third-party device, is then acquired in order to instruct the mobile terminal to make the corresponding adjustment.

In one embodiment, the smart speaker acquiring the generated control instruction comprising audio and corresponding audio parameters includes: the smart speaker acquires a generated control instruction comprising the audio parameters corresponding to at least two audio types and an adjustment order, where the adjustment order is determined according to the order in which the at least two audio types occur in the voice information.

Specifically, the smart speaker can perform voice recognition on the voice information through an internal voice recognition engine to recognize the user's intention and obtain a recognition result. The smart speaker can then determine how many audio types are present in the recognition result, and when at least two are present, determine the audio parameters corresponding to each of them. Next, the smart speaker determines the adjustment order of the at least two audio types. To do so, it may determine the time at which each audio type occurs in the voice information and compare these times to obtain the order in which the audio types occur. That order of occurrence is used as the adjustment order. The smart speaker then generates a corresponding control instruction from the at least two audio types, the audio parameters corresponding to each, and the adjustment order.

In this embodiment, the smart speaker may have a third-party device perform voice recognition on the voice information to obtain a recognition result. Through the third-party device, the smart speaker can then determine how many audio types are present in the recognition result and, when at least two are present, determine the audio parameters corresponding to each of them. The third-party device is also used to determine the adjustment order: it determines the time at which each of the at least two audio types occurs in the voice information and compares these times to obtain their order of occurrence, which is used as the adjustment order. The third-party device then converts the at least two audio types, the audio parameters corresponding to each, and the adjustment order into a control instruction. Finally, the smart speaker acquires from the third-party device the control instruction comprising the at least two audio types, the audio parameters corresponding to each, and the adjustment order.

In this embodiment, after the voice information is recognized, when at least two mobile terminals are present in the recognition result, the smart speaker determines the audio and audio parameters corresponding to each of the at least two mobile terminals. When one mobile terminal corresponds to at least two audio types, the smart speaker determines the audio parameters and the adjustment order for those audio types. The audio, audio parameters and adjustment order corresponding to each mobile terminal are then converted into the control instruction for that mobile terminal.

For example, suppose the voice information received by the smart speaker is "turn off the music on the notebook computer, turn off the alarm clock on the smartphone, start video playback on the smartphone, and set the volume to 70%". Recognizing this voice information may yield the recognition results "notebook - music - stop playing", "smartphone - alarm clock - off" and "smartphone - video - set volume to 70%". The smart speaker can then determine that the audio types to be adjusted on the smartphone are the alarm clock and the video, and determine the adjustment parameters for each. The smart speaker may further determine the adjustment order of the alarm clock and the video according to the order in which they occur in the voice information.
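Deriving the adjustment order from the time of occurrence can be sketched as a filter followed by a sort. The `RecognizedItem` record and the `start_time` values are assumptions: the text says the occurrence time of each audio type is determined, but not how it is represented, and the timestamps below are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecognizedItem:
    terminal: str
    audio: str
    parameter: str
    start_time: float  # seconds from the start of the utterance (assumed available)


def adjustment_order(items, terminal):
    """Return (audio, parameter) pairs for one terminal, earliest mention first."""
    own = [item for item in items if item.terminal == terminal]
    return [(item.audio, item.parameter)
            for item in sorted(own, key=lambda item: item.start_time)]


# Recognition results from the example above, deliberately listed out of order.
results = [
    RecognizedItem("smartphone", "video", "set volume to 70%", 4.0),
    RecognizedItem("notebook", "music", "stop playing", 0.4),
    RecognizedItem("smartphone", "alarm clock", "off", 2.1),
]
print(adjustment_order(results, "smartphone"))
```

Because Python's sort is stable, two items with the same timestamp keep the order in which the recognizer returned them.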

In the above voice control method, based on recognition of the voice information, a control instruction comprising the audio parameters corresponding to at least two audio types and an adjustment order is generated either by the smart speaker itself or through a third-party device. The adjustment order is determined according to the order in which the at least two audio types occur in the voice information, so that the mobile terminal is instructed to adjust the audio types in that order.

In one embodiment, the audio parameters include volume;

the smart speaker acquiring a control instruction generated according to the voice information includes: the smart speaker acquires a generated control instruction comprising a volume adjustment direction and a volume adjustment amplitude, where the volume adjustment direction is determined according to the voice information and the volume adjustment amplitude is obtained according to the volume adjustment direction.

The volume adjustment amplitude refers to the amount by which the volume changes, the multiple by which it changes, and the like. For example, the volume adjustment amplitude may be 3, 5 or 6, or it may be 2 times or 3 times the current volume value.
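The two kinds of amplitude, a fixed change and a multiple of the current value, can be compared in a short sketch. The function name, the `kind` labels, the 0 to 100 volume scale and the clamping are assumptions made for the example; the text does not fix a volume scale or say how out-of-range results are handled.

```python
def target_volume(current: int, direction: int, amplitude: float,
                  kind: str = "step", lowest: int = 0, highest: int = 100) -> int:
    """Compute the new volume from the current one.

    direction: +1 to raise the volume, -1 to lower it (used for "step" only).
    kind: "step" adds or subtracts the amplitude; "multiple" scales the
          current value by the amplitude.
    """
    if kind == "step":
        raw = current + direction * amplitude
    elif kind == "multiple":
        raw = current * amplitude
    else:
        raise ValueError(f"unknown amplitude kind: {kind}")
    # Clamp to the assumed valid range so the result is always usable.
    return max(lowest, min(highest, round(raw)))


print(target_volume(20, +1, 5))              # raise by a step of 5
print(target_volume(20, -1, 3))              # lower by a step of 3
print(target_volume(20, +1, 2, "multiple"))  # twice the current value
print(target_volume(60, +1, 3, "multiple"))  # three times, clamped to the maximum
```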

Specifically, the smart speaker recognizes the voice information through its internal voice recognition engine or the voice recognition engine of a third-party device, and obtains a recognition result. The smart speaker may determine whether audio and corresponding audio parameters exist in the recognition result. When the smart speaker detects that an audio parameter exists in the recognition result but no audio does, it may further determine the type of the audio parameter, for example volume or sound effect. When the smart speaker detects that volume is the only audio parameter in the recognition result, it determines the volume adjustment amplitude and the volume adjustment direction. The smart speaker then generates a corresponding control instruction according to the volume adjustment direction and the volume adjustment amplitude.

In this embodiment, when the smart speaker detects that a volume adjustment direction exists in the recognition result, it determines that the audio parameter the user wants to adjust is the volume. The smart speaker then checks the recognition result for a volume adjustment amplitude corresponding to the volume adjustment direction. When the recognition result contains a volume adjustment direction but no volume adjustment amplitude, the smart speaker can obtain the preset volume adjustment amplitude corresponding to that direction. The smart speaker then converts the volume adjustment direction and the corresponding preset volume adjustment amplitude into a control instruction.

In the above voice control method, when the audio parameters include volume, the smart speaker determines the volume adjustment direction and the volume adjustment amplitude. When the voice information contains no volume adjustment amplitude, the corresponding amplitude can be obtained according to the volume adjustment direction, so that the volume adjustment information is completed and the user's needs are met. The generated control instruction comprising the volume adjustment direction and the volume adjustment amplitude is acquired and sent to the mobile terminal, so that the mobile terminal is explicitly instructed to adjust its own volume.

In this embodiment, when the smart speaker detects that the recognition result contains a volume adjustment direction but no volume adjustment amplitude, it matches the volume adjustment direction in the recognition result against the preset volume adjustment directions. The preset volume adjustment directions include a positive direction and a negative direction, where the positive direction means increasing the volume and the negative direction means decreasing it. When the volume adjustment direction in the recognition result matches the positive direction, the volume adjustment amplitude corresponding to the positive direction is obtained and used as the amplitude for the recognition result. When it matches the negative direction, the volume adjustment amplitude corresponding to the negative direction is obtained and used as the amplitude for the recognition result. The smart speaker then generates a control instruction according to the volume adjustment direction in the recognition result and the corresponding volume adjustment amplitude.
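The direction matching and preset fallback can be sketched as a lookup. The phrase lists, the preset amplitude of 5 for both directions, and the dictionary layout of the resulting instruction are hypothetical values chosen for the example; the text only states that each preset direction has a corresponding amplitude.

```python
# Hypothetical presets: one default amplitude per preset direction.
PRESET_AMPLITUDE = {"positive": 5, "negative": 5}

# Hypothetical phrases used to match a recognition result to a direction.
DIRECTION_PHRASES = {
    "positive": ("turn up", "louder", "increase"),
    "negative": ("turn down", "quieter", "decrease"),
}


def match_direction(recognized_text):
    """Return "positive", "negative", or None if no direction is mentioned."""
    text = recognized_text.lower()
    for direction, phrases in DIRECTION_PHRASES.items():
        if any(phrase in text for phrase in phrases):
            return direction
    return None


def build_volume_instruction(recognized_text, amplitude=None):
    """Build a volume instruction, using the preset amplitude if none was spoken."""
    direction = match_direction(recognized_text)
    if direction is None:
        return None
    if amplitude is None:
        amplitude = PRESET_AMPLITUDE[direction]
    return {"parameter": "volume", "direction": direction, "amplitude": amplitude}


print(build_volume_instruction("a bit louder please"))
print(build_volume_instruction("turn down the volume by 3", amplitude=3))
```

A request such as "a bit louder" carries a direction but no amplitude, so the preset fills the gap; an explicit amplitude always takes precedence over the preset.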

In one embodiment, after acquiring the voice information, the method further includes: the smart speaker uploads the voice information to the cloud;

the smart speaker acquiring a control instruction generated according to the voice information includes: the smart speaker receives a control instruction that is returned by the cloud and generated according to the voice information.

Specifically, after receiving the user's voice information, the smart speaker uploads it to the cloud, where a cloud voice recognition engine performs voice recognition on it to determine the user's intention. After the cloud voice recognition engine recognizes the voice information, a recognition result is obtained. The cloud voice recognition engine can then generate a corresponding control instruction according to the recognition result and return the control instruction to the smart speaker. The smart speaker receives the control instruction that is returned by the cloud and generated according to the voice information.

In the above voice control method, the voice information is uploaded to the cloud so that it is recognized in the cloud and the result is returned, which saves storage space on the smart speaker and improves recognition efficiency.

In one embodiment, as shown in FIG. 3, a method of voice control is provided. The embodiment is mainly illustrated by applying the method to the mobile terminal 120 in fig. 1. Referring to fig. 3, the voice control method specifically includes the following steps:

Step 302: the mobile terminal receives a control instruction sent by the smart speaker through a Bluetooth protocol, where the control instruction is generated according to voice information, and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer.

Step 304: the mobile terminal adjusts its own audio parameters according to the control instruction.

Specifically, the mobile terminal is communicatively connected to the smart speaker through the Bluetooth Audio/Video Remote Control Profile (AVRCP). After the smart speaker acquires the generated control instruction, it sends the control instruction to the mobile terminal through AVRCP. The mobile terminal can listen for AVRCP callbacks in real time, and once a callback is detected, it can receive the control instruction sent by the smart speaker through AVRCP, where the control instruction is generated according to the voice information. After receiving the control instruction, the mobile terminal adjusts its own audio parameters accordingly. By receiving the control instruction sent by the smart speaker and adjusting its own audio parameters according to it, the mobile terminal lets the user adjust its volume directly by voice through the smart speaker, which gives the user more convenient voice control.

In one embodiment, as shown in fig. 4, the method further comprises:

Step 402: the mobile terminal parses the control instruction and determines the audio in the control instruction.

Specifically, after receiving the control instruction, the mobile terminal parses it to determine which audio types it contains.

Step 404: the mobile terminal collects the audio parameters corresponding to the same audio in the control instruction.

Specifically, the mobile terminal may extract the audio types present in the control instruction and determine the audio parameters corresponding to each. The mobile terminal can then compare the audio types one by one to determine whether the same audio type occurs more than once in the control instruction. When it does, the audio parameters corresponding to that audio type are grouped together, so that the audio parameters corresponding to the same audio in the control instruction are collected.

In this embodiment, the mobile terminal may extract the audio types present in the control instruction and compare them one by one to determine whether the same audio type occurs more than once. When it does, the mobile terminal can keep any one of the duplicate entries and gather all the audio parameters corresponding to that audio type, so that the audio parameters corresponding to the same audio in the control instruction are collected.
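Collecting the parameters that belong to the same audio type amounts to grouping by key while keeping the order of arrival. The sketch below is a minimal illustration; the `(audio, parameter)` tuple representation and the sample values are assumptions.

```python
def collect_by_audio(entries):
    """Group parameters by audio type.

    entries: (audio, parameter) pairs in the order they appear in the
    control instruction. Each audio type is kept once, with all of its
    parameters listed in their original order.
    """
    collected = {}
    for audio, parameter in entries:
        collected.setdefault(audio, []).append(parameter)
    return collected


received = [
    ("ringtone", 5),
    ("media", 60),
    ("alarm", 100),
    ("media", 50),
]
print(collect_by_audio(received))
```

Since Python dictionaries preserve insertion order, each audio type stays at the position of its first mention, and its parameter list keeps the order in which the adjustments were requested.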

The mobile terminal adjusting its own audio parameters according to the control instruction includes:

Step 406, the mobile terminal adjusts the audio parameters of the corresponding audio in the mobile terminal according to the audio parameters corresponding to the audio.

Specifically, from among its own plurality of audios, the mobile terminal determines the audios that are the same as each counted audio in the control instruction, and thereby determines which of its audios need their audio parameters adjusted. The mobile terminal then obtains the current audio parameter of each such audio and modifies it to the audio parameter corresponding to the same audio in the control instruction.

For example, the control instruction received by the mobile terminal is: adjust the ringtone volume of the mobile terminal to 5, adjust the media volume to 60, adjust the alarm clock volume to 100, and adjust the media volume to 50. The mobile terminal identifies the control instruction and learns that the audios in the control instruction are the ringtone, the media, and the alarm clock, each with its corresponding audio parameters. The mobile terminal then counts the volume parameters corresponding to the same audio: media appears twice in the control instruction and both occurrences are the same audio, so the mobile terminal keeps one media entry and obtains its audio parameters as: adjust to 60 and adjust to 50.

Then, the mobile terminal determines its own ringtone, media, and alarm clock from among its plurality of audios and obtains their current volume values. The mobile terminal adjusts the current volume value of its ringtone to 5 and the current volume value of the alarm clock to 100; it adjusts the current volume value of the media to 60 and then adjusts the resulting media volume value to 50.
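The ringtone, media, and alarm clock example can be sketched in Python as follows. This is only an illustrative sketch: representing the control instruction as a list of (audio, volume) pairs, the function names `group_by_audio` and `apply_grouped`, and the starting volume values are assumptions made for the illustration and are not specified by the method.

```python
from typing import Dict, List, Tuple


def group_by_audio(instruction: List[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Collect the parameters of identical audios under one key, in order of appearance."""
    grouped: Dict[str, List[int]] = {}
    for audio, volume in instruction:
        grouped.setdefault(audio, []).append(volume)
    return grouped


def apply_grouped(current: Dict[str, int], grouped: Dict[str, List[int]]) -> Dict[str, int]:
    """Visit each audio once and apply all of its parameters in turn."""
    result = dict(current)
    for audio, volumes in grouped.items():
        if audio not in result:
            continue  # the terminal has no such audio, so there is nothing to adjust
        for volume in volumes:
            result[audio] = volume
    return result


# Hypothetical instruction and starting volumes mirroring the example in the text.
instruction = [("ringtone", 5), ("media", 60), ("alarm", 100), ("media", 50)]
grouped = group_by_audio(instruction)
current = {"ringtone": 30, "media": 40, "alarm": 70, "call": 20}
adjusted = apply_grouped(current, grouped)
```

Here the media audio is selected once and receives both of its adjustments (60, then 50) before the next audio is handled; the call audio, which the instruction does not mention, is left unchanged.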

In the voice control method, the mobile terminal identifies the control instruction, determines the audios in the control instruction, counts the audio parameters corresponding to the same audio in the control instruction, and adjusts the audio parameters of the corresponding audio in the mobile terminal accordingly. When the control instruction requires the same audio to be adjusted multiple times, all of those adjustments can be made after switching to that audio once. This avoids the overhead of switching back and forth between audios, speeds up the adjustment of audio parameters, and improves the response speed of voice control.

In this embodiment, the mobile terminal may select an audio and its corresponding audio parameter from the control instruction, and determine, from among the plurality of audios of the mobile terminal, the audio that is the same as the selected audio. The mobile terminal may obtain the current audio parameter of that audio and modify it to the audio parameter corresponding to the selected audio. In the same manner, for each audio in the control instruction, the current audio parameter of the same audio in the mobile terminal can be adjusted to the corresponding audio parameter in the control instruction.

In one embodiment, after counting the audio parameters corresponding to the same audio in the control instruction, the method further includes: the mobile terminal determines the adjustment order of each occurrence among at least two occurrences of the same audio, and adjusts the audio parameter of the corresponding audio in the mobile terminal according to the audio parameter of the occurrence that is last in the adjustment order. For example, media appears twice in the control instruction received by the mobile terminal: the media volume is to be adjusted to 60 the first time and to 50 the second time. The mobile terminal may take the second adjustment, "media to 50", as the target for the media in the mobile terminal and directly adjust the current media volume to 50.
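A minimal sketch of this "last adjustment wins" variant is given below. The list-of-pairs instruction format and the function name `final_targets` are assumptions made for the illustration.

```python
from typing import Dict, List, Tuple


def final_targets(instruction: List[Tuple[str, int]]) -> Dict[str, int]:
    """Keep only the last requested value for each audio."""
    targets: Dict[str, int] = {}
    for audio, volume in instruction:
        # A later entry for the same audio overwrites the earlier one.
        targets[audio] = volume
    return targets


# Hypothetical instruction: media is requested twice, first 60 and then 50.
targets = final_targets([("ringtone", 5), ("media", 60), ("alarm", 100), ("media", 50)])
```

With this approach the intermediate value 60 is never written to the media audio; only the final value 50 is applied.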

In one embodiment, as shown in fig. 5, the method further comprises:

Step 502, the mobile terminal identifies the control instruction and determines the audio in the control instruction.

Step 504, the mobile terminal counts the audio parameters corresponding to each of at least two audios in the control instruction, and determines the adjustment sequence corresponding to each audio.

Specifically, after receiving the control instruction, the mobile terminal identifies the control instruction to determine the audios present in it. When at least two audios are present, the mobile terminal counts the audio parameters corresponding to each of the at least two audios in the control instruction. The mobile terminal may then use the order in which the audios appear in the control instruction as the adjustment sequence corresponding to the audios.

In this embodiment, the mobile terminal may detect whether the control instruction carries an adjustment sequence corresponding to each audio; if not, the mobile terminal may randomly set the adjustment sequence for each of the at least two audios in the control instruction.

In this embodiment, the mobile terminal may determine the adjustment sequence of the audios according to the number of audio parameters corresponding to each of the at least two audios in the control instruction. The more audio parameters an audio has, the earlier it is placed in the adjustment sequence.

The adjusting of the audio parameters according to the control instruction includes:

Step 506, the mobile terminal sequentially adjusts the audio parameters of the corresponding audios in the mobile terminal according to the audio parameters corresponding to the audios and the adjustment sequence.

Specifically, following the determined adjustment sequence, the mobile terminal obtains in turn the current audio parameter of each matching audio in the mobile terminal and adjusts it to the audio parameter corresponding to that audio in the control instruction.

In this embodiment, the mobile terminal may detect whether the control instruction carries an adjustment sequence corresponding to each audio; if so, the mobile terminal adjusts the current audio parameter of each corresponding audio in the mobile terminal according to that adjustment sequence and the audio parameter corresponding to each audio in the control instruction, so as to meet the user's requirements.

According to the voice control method, the mobile terminal identifies the control instruction, determines the audios in it, counts the audio parameters corresponding to each of at least two audios, determines the adjustment sequence corresponding to each audio, and sequentially adjusts the audio parameters of the corresponding audios in the mobile terminal accordingly. When the control instruction carries an adjustment sequence, the adjustment follows it; when it does not, the mobile terminal can quickly determine an adjustment sequence for the plurality of audios. The user's request is thus answered quickly and in order.
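The choice of adjustment sequence can be illustrated with the following Python sketch. The `Item` record with an optional `order` field and the function name `resolve_order` are assumptions for the illustration. When no sequence is carried, the sketch falls back to the order of appearance in the instruction, which is one of the options described above; a random order would be another.

```python
from typing import List, NamedTuple, Optional


class Item(NamedTuple):
    """One entry of a hypothetical control instruction."""
    audio: str
    volume: int
    order: Optional[int] = None  # adjustment sequence, if the instruction carries one


def resolve_order(items: List[Item]) -> List[Item]:
    """Return the items in the sequence in which they should be adjusted."""
    if items and all(item.order is not None for item in items):
        # The instruction carries an explicit sequence: follow it.
        return sorted(items, key=lambda item: item.order)
    # Otherwise fall back to the order of appearance in the instruction.
    return list(items)


with_order = resolve_order([Item("media", 60, order=2), Item("alarm", 100, order=1)])
without_order = resolve_order([Item("media", 60), Item("alarm", 100)])
```

In the first call the alarm is adjusted before the media because the instruction says so; in the second call the media comes first because it appears first.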

In one embodiment, as shown in fig. 6, the control instruction includes an adjustment direction and a volume change amount;

Step 602, the mobile terminal obtains the current volume value of each audio of the mobile terminal.

Specifically, after receiving the control instruction sent by the smart speaker, the mobile terminal identifies the control instruction. When the mobile terminal detects that the control instruction contains only the adjustment direction and the volume change amount, it determines that the user wants to adjust the volume of all audios in the mobile terminal according to that direction and amount. The mobile terminal then obtains the current volume value of each of its audios.

Step 604, the mobile terminal converts the volume change amount in the control instruction into a volume change amount corresponding to each audio of the mobile terminal.

Specifically, different audios have different volume adjustment ranges, so the mobile terminal needs to convert the volume change amount in the control instruction into the volume change amount corresponding to each audio of the mobile terminal through a conversion function. For example, if the volume change amount in the control instruction is 5, the volume change amount for audios such as media, alarm clock, and call is not necessarily 5; it has to be obtained for each of them through the conversion function.

Step 606, the mobile terminal adjusts the current volume value of each audio of the mobile terminal according to the adjustment direction and the volume change amount corresponding to each audio.

Specifically, the mobile terminal adjusts the current volume value of each audio according to the adjustment direction in the control instruction and the volume change amount corresponding to each audio in the mobile terminal.

Further, the mobile terminal determines whether the adjustment direction is positive or negative. A positive direction indicates that the volume is to be increased, so the mobile terminal adds the volume change amount corresponding to each audio to the current volume value of that audio. A negative direction indicates that the volume is to be decreased, so the mobile terminal subtracts the volume change amount corresponding to each audio from the current volume value of that audio.

According to the voice control method, the mobile terminal obtains the current volume value of each of its audios, converts the volume change amount in the control instruction into the volume change amount corresponding to each audio, and adjusts the current volume value of each audio according to the adjustment direction and the corresponding volume change amount. In this way the volume change amount corresponding to each audio in the mobile terminal, and hence the extent to which each audio needs to be adjusted, can be determined, which achieves accurate adjustment.
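Steps 602 to 606 can be sketched as follows. The text does not define the conversion function, so the proportional rescaling used here is only one plausible choice. The per-audio maximum levels, the 0-100 scale assumed for the instruction, the clamping to the valid range, and the names `convert_change` and `adjust_all` are likewise assumptions made for the illustration.

```python
from typing import Dict

# Hypothetical per-audio maximum volume levels; real ranges depend on the platform.
AUDIO_MAX = {"media": 15, "alarm": 7, "call": 5}
# Assumption: the control instruction expresses its change amount on a 0-100 scale.
INSTRUCTION_MAX = 100


def convert_change(change: int, target_max: int, source_max: int = INSTRUCTION_MAX) -> int:
    """Rescale a change amount from the instruction's range to one audio's range."""
    # Integer arithmetic with round-half-up avoids floating-point surprises.
    return (change * target_max + source_max // 2) // source_max


def adjust_all(current: Dict[str, int], direction: str, change: int) -> Dict[str, int]:
    """Apply the converted change to every audio in the given direction."""
    sign = 1 if direction == "up" else -1
    adjusted = {}
    for audio, value in current.items():
        step = convert_change(change, AUDIO_MAX[audio])
        # Clamping to the valid range is an added safeguard, not stated in the text.
        adjusted[audio] = max(0, min(AUDIO_MAX[audio], value + sign * step))
    return adjusted


current = {"media": 10, "alarm": 3, "call": 5}
louder = adjust_all(current, "up", 20)
quieter = adjust_all(current, "down", 20)
```

A change of 20 on the assumed 0-100 scale becomes 3 levels for media but only 1 level for the alarm clock and the call audio, which illustrates why the same change amount cannot be applied to every audio unchanged.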

Fig. 7 is a schematic diagram illustrating the mobile terminal adjusting the volume according to the control instruction in one embodiment. The mobile terminal is connected to the smart speaker through the Bluetooth audio/video remote control protocol and, once connected, monitors callbacks of that protocol in real time. When the mobile terminal detects a control instruction sent by the smart speaker through the audio/video remote control protocol, it calls the volume setting function to pass the control instruction to the audio service. After receiving the control instruction, the audio service determines which audio in the mobile terminal needs its volume adjusted. Because different audio types have different volume adjustment ranges, the volume change amount needs to be converted from the source stream type to the target stream type. The audio service converts the volume change amount corresponding to the audio in the control instruction into the volume change amount corresponding to the audio to be adjusted in the mobile terminal through the conversion function. The mobile terminal then, through the audio service, adjusts the current volume value of that audio according to the adjustment direction in the control instruction and the converted volume change amount, and obtains and stores the adjusted volume value. Finally, the audio service sends the adjusted volume value to the audio mixer, which stores it and outputs it through an audio mixing thread.
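The hand-off chain of fig. 7 (callback, audio service, mixer) can be pictured with the short sketch below. All class and function names (`Mixer`, `AudioService`, `on_remote_control_callback`) and the dictionary-shaped instruction are hypothetical stand-ins; they do not correspond to any real platform API, and the change amount is assumed to have already been converted to the target audio's range.

```python
from typing import Dict


class Mixer:
    """Stand-in for the audio mixer: stores the volume it is given."""

    def __init__(self) -> None:
        self.volumes: Dict[str, int] = {}

    def set_volume(self, audio: str, value: int) -> None:
        self.volumes[audio] = value


class AudioService:
    """Stand-in for the audio service: adjusts, stores, and forwards volumes."""

    def __init__(self, mixer: Mixer, current: Dict[str, int]) -> None:
        self.mixer = mixer
        self.current = dict(current)

    def adjust(self, audio: str, direction: str, step: int) -> int:
        sign = 1 if direction == "up" else -1
        new_value = max(0, self.current[audio] + sign * step)
        self.current[audio] = new_value           # store the adjusted value
        self.mixer.set_volume(audio, new_value)   # forward it to the mixer
        return new_value


def on_remote_control_callback(service: AudioService, instruction: Dict) -> int:
    """Called when a control instruction arrives; passes it to the audio service."""
    return service.adjust(instruction["audio"], instruction["direction"], instruction["step"])


mixer = Mixer()
service = AudioService(mixer, {"media": 10})
new_volume = on_remote_control_callback(
    service, {"audio": "media", "direction": "up", "step": 2}
)
```

After the callback runs, both the audio service and the mixer hold the adjusted media volume.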

In one embodiment, the voice control method includes:

The smart speaker acquires voice information.

Then, the smart speaker uploads the voice information to the cloud.

Further, the smart sound box receives a control instruction which is returned by the cloud and comprises audio parameters corresponding to at least two audios and an adjusting sequence, wherein the adjusting sequence is determined according to the time sequence of the at least two audios in the voice message.

And then, the smart sound box determines a mobile terminal corresponding to the control instruction, and sends the control instruction to the corresponding mobile terminal, wherein the control instruction is used for instructing the mobile terminal to adjust the audio parameter of the mobile terminal, and the mobile terminal comprises at least one of a smart phone, a tablet computer and a notebook computer.

Optionally, the mobile terminal receives a control instruction sent by the smart sound box, where the control instruction is generated according to the voice information.

Then, the mobile terminal identifies the control instruction and determines the audio in the control instruction.

Then, the mobile terminal counts the audio parameters corresponding to each of at least two audios in the control instruction, and determines the adjustment sequence corresponding to each audio.

Further, the mobile terminal sequentially adjusts the audio parameters of the corresponding audio in the mobile terminal according to the audio parameters and the adjustment sequence corresponding to the audio.

Optionally, when the control instruction includes an adjustment direction and a volume change, the mobile terminal obtains a current volume value of each audio in the mobile terminal; the mobile terminal converts the volume variation in the control instruction into the volume variation corresponding to each audio in the mobile terminal; and the mobile terminal respectively adjusts the current volume value of each audio of the mobile terminal according to the adjusting direction and the volume variable quantity corresponding to each audio of the mobile terminal.

According to the voice control method, the smart speaker receives the user's voice information and uploads it to the cloud, and the cloud recognizes the voice information and returns a control instruction according to the recognition result. Recognizing the voice information in the cloud saves storage space on the smart speaker. The smart speaker sends the control instruction to the mobile terminal to instruct the mobile terminal to adjust the corresponding audio parameters. After receiving the control instruction, the mobile terminal determines the audios in the control instruction and counts the audio parameters corresponding to each of at least two audios together with the adjustment sequence corresponding to each audio, so that the audio parameters of the audios in the mobile terminal can be adjusted according to the information in the control instruction. In this scheme, the smart speaker receives the user's voice information, a control instruction is generated through recognition and sent to the mobile terminal, and the smart speaker thereby controls the mobile terminal to adjust its own audio parameters, which provides more convenience for the user and improves the user experience.
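As an illustration of how an adjustment sequence might be derived from the time order in which the audios occur in the voice information, consider the sketch below. The input format (a list of recognized mentions, each with a timestamp in seconds), the output dictionary layout, and the function name `build_control_instruction` are assumptions for the illustration; the text does not specify how the cloud represents either.

```python
from typing import Dict, List, Tuple


def build_control_instruction(mentions: List[Tuple[str, int, float]]) -> List[Dict]:
    """Turn recognized (audio, volume, time) mentions into an ordered instruction."""
    # Earlier mentions in the voice information are adjusted first.
    ordered = sorted(mentions, key=lambda mention: mention[2])
    return [
        {"audio": audio, "volume": volume, "order": index}
        for index, (audio, volume, _time) in enumerate(ordered, start=1)
    ]


# Hypothetical recognition result; the timestamps are in seconds from the start of speech.
mentions = [("media", 60, 2.4), ("ringtone", 5, 0.8), ("alarm", 100, 4.1)]
instruction = build_control_instruction(mentions)
```

The ringtone, which was spoken first, receives order 1, followed by the media and then the alarm clock.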

Figs. 2-6 are flow diagrams illustrating a voice control method according to an embodiment. It should be understood that although the steps in the flowcharts of figs. 2-6 are shown in the order indicated by the arrows, they are not necessarily performed in that order. Unless explicitly stated otherwise, the order of the steps is not strictly limited, and the steps may be performed in other orders. Moreover, at least some of the steps in figs. 2-6 may include multiple sub-steps or stages, which are not necessarily performed at the same time but may be performed at different times, and which are not necessarily performed sequentially but may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.

In one embodiment, as shown in fig. 8, a voice control apparatus is provided for use in a smart speaker. The voice control apparatus includes a voice information obtaining module 802, a control instruction obtaining module 804, and a sending module 806, wherein:

The voice information obtaining module 802 is configured to obtain voice information.

The control instruction obtaining module 804 is configured to obtain a control instruction generated according to the voice information.

The sending module 806 is configured to send the control instruction to a mobile terminal through a Bluetooth protocol, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smart phone, a tablet computer, and a notebook computer.

With the voice control apparatus, voice information is obtained, a control instruction generated according to the voice information is obtained, and the control instruction is sent to the mobile terminal, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameters and the mobile terminal includes at least one of a smart phone, a tablet computer, and a notebook computer. The mobile terminal can therefore be voice-controlled through the smart speaker, which provides more convenience for users and meets more of their needs.
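The three-module structure of fig. 8 can be pictured as three pluggable functions chained together. The sketch below is purely illustrative: the class name `VoiceControlApparatus`, the `run_once` method, the stand-in recognizer `fake_recognize`, and the use of a Python list in place of a Bluetooth link are all assumptions, not part of the described apparatus.

```python
from typing import Callable, Dict, List


class VoiceControlApparatus:
    """Chains the three modules: obtain voice, obtain instruction, send."""

    def __init__(
        self,
        obtain_voice: Callable[[], str],
        obtain_instruction: Callable[[str], Dict],
        send: Callable[[Dict], None],
    ) -> None:
        self.obtain_voice = obtain_voice              # role of module 802
        self.obtain_instruction = obtain_instruction  # role of module 804
        self.send = send                              # role of module 806

    def run_once(self) -> Dict:
        voice = self.obtain_voice()
        instruction = self.obtain_instruction(voice)
        self.send(instruction)
        return instruction


def fake_recognize(voice: str) -> Dict:
    """Stand-in recognizer: only distinguishes 'up' from anything else."""
    direction = "up" if voice.endswith("up") else "down"
    return {"audio": "media", "direction": direction}


sent: List[Dict] = []  # stands in for the link to the mobile terminal
apparatus = VoiceControlApparatus(
    obtain_voice=lambda: "turn the media volume up",
    obtain_instruction=fake_recognize,
    send=sent.append,
)
result = apparatus.run_once()
```

Keeping the three roles as separate callables mirrors the module split in the text: the instruction may be produced locally or fetched from elsewhere without changing the sending step.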

In one embodiment, the control instruction obtaining module 804 is further configured to: obtain the generated control instruction including an audio and its corresponding audio parameter, where the audio parameter corresponding to the audio is determined according to the voice information. By recognizing the voice information, the voice control apparatus determines the audio and the corresponding audio parameter in the voice information, and thereby determines the audio the user wants adjusted and the audio parameter to adjust it to. The control instruction including the audio and the corresponding audio parameter, generated either by the smart speaker or by a third-party device, is obtained so as to instruct the mobile terminal to make the corresponding adjustment.

In one embodiment, the control instruction obtaining module 804 is further configured to: obtain the generated control instruction including the audio parameters corresponding to at least two audios and an adjustment sequence, where the adjustment sequence is determined according to the time order in which the at least two audios appear in the voice information. With the voice control apparatus, based on recognition of the voice information, the smart speaker generates, either by itself or through a third-party device, a control instruction including the audio parameters corresponding to at least two audios and an adjustment sequence determined according to the time order in which the at least two audios appear in the voice information, so as to instruct the mobile terminal to adjust the audios in that sequence.

In one embodiment, the audio parameter includes a volume, and the control instruction obtaining module 804 is further configured to:

obtain the generated control instruction including a volume adjustment direction and a volume adjustment amplitude, where the volume adjustment direction is determined according to the voice information and the volume adjustment amplitude is obtained according to the volume adjustment direction. When the audio parameter includes a volume, the voice control apparatus determines the volume adjustment direction and the volume adjustment amplitude; when the voice information contains no volume adjustment amplitude, the corresponding amplitude can be obtained according to the volume adjustment direction, so that the missing volume adjustment information is supplemented and the user's needs are met. The generated control instruction including the volume adjustment direction and the volume adjustment amplitude is obtained and sent to the mobile terminal, so that the mobile terminal is explicitly instructed to adjust its own volume.
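Supplementing a missing amplitude from the adjustment direction could look like the sketch below. The default amplitudes, the direction labels "up" and "down", and the function name `build_volume_instruction` are assumptions for the illustration; the text only states that an amplitude is obtained according to the direction.

```python
from typing import Dict, Optional

# Hypothetical default amplitude for each direction, used when the user names none.
DEFAULT_AMPLITUDE = {"up": 10, "down": 10}


def build_volume_instruction(direction: str, amplitude: Optional[int] = None) -> Dict:
    """Build an instruction, filling in the amplitude from the direction if absent."""
    if direction not in DEFAULT_AMPLITUDE:
        raise ValueError(f"unknown adjustment direction: {direction!r}")
    if amplitude is None:
        # The voice information gave no amplitude, so look one up by direction.
        amplitude = DEFAULT_AMPLITUDE[direction]
    return {"direction": direction, "amplitude": amplitude}


implicit = build_volume_instruction("up")         # e.g. "turn it up"
explicit = build_volume_instruction("down", 25)   # e.g. "turn it down by 25"
```

An utterance that names an amplitude keeps it, while one that only names a direction still yields a complete instruction.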

In one embodiment, the apparatus further includes an uploading module. The uploading module is configured to upload the voice information to the cloud after the voice information is obtained.

The control instruction obtaining module 804 is further configured to: receive the control instruction returned by the cloud and generated according to the voice information. By uploading the voice information to the cloud, having the cloud recognize it, and receiving the recognition result, storage space on the smart speaker is saved and recognition efficiency is improved.

In one embodiment, as shown in fig. 9, a voice control apparatus is provided for use in a mobile terminal. The voice control apparatus includes a receiving module 902 and an adjusting module 904, wherein:

The receiving module 902 is configured to receive a control instruction sent by the smart speaker, where the control instruction is generated according to the voice information.

The adjusting module 904 is configured to adjust the audio parameters of the mobile terminal itself according to the control instruction.

In this embodiment, the mobile terminal receives the control instruction sent by the smart speaker and adjusts the audio parameter of the mobile terminal according to the control instruction, so that the user can directly adjust the volume of the mobile terminal by using voice through the smart speaker, thereby providing more convenient voice control for the user.

In one embodiment, the apparatus further includes an identification module configured to: identify the control instruction and determine the audios in the control instruction; and count the audio parameters corresponding to the same audio in the control instruction.

The adjusting module 904 is further configured to: adjust the audio parameters of the corresponding audio in the mobile terminal according to the audio parameters corresponding to the audio.

With the voice control apparatus, the audios in the control instruction are determined by identifying the control instruction, the audio parameters corresponding to the same audio in the control instruction are counted, and the audio parameters of the corresponding audio in the mobile terminal are adjusted accordingly. When the control instruction requires the same audio to be adjusted multiple times, all of those adjustments can be made after switching to that audio once. This avoids the overhead of switching back and forth between audios, speeds up the adjustment of audio parameters, and improves the response speed of voice control.

In one embodiment, the identification module is configured to: identify the control instruction and determine the audios in the control instruction; and count the audio parameters corresponding to each of at least two audios in the control instruction, and determine the adjustment sequence corresponding to each audio.

The adjusting module 904 is further configured to: sequentially adjust the audio parameters of the corresponding audios in the mobile terminal according to the audio parameters corresponding to the audios and the adjustment sequence.

The voice control apparatus identifies the control instruction, determines the audios in it, counts the audio parameters corresponding to each of at least two audios, determines the adjustment sequence corresponding to each audio, and sequentially adjusts the audio parameters of the corresponding audios in the mobile terminal accordingly. When the control instruction carries an adjustment sequence, the adjustment follows it; when it does not, an adjustment sequence for the plurality of audios can be determined quickly. The user's request is thus answered quickly and in order.

In one embodiment, the control instruction comprises an adjustment direction and a volume change;

The adjusting module 904 is further configured to: obtain the current volume value of each audio in the mobile terminal; convert the volume change amount in the control instruction into the volume change amount corresponding to each audio in the mobile terminal; and adjust the current volume value of each audio of the mobile terminal according to the adjustment direction and the volume change amount corresponding to each audio.

The voice control apparatus obtains the current volume value of each audio in the mobile terminal, converts the volume change amount in the control instruction into the volume change amount corresponding to each audio, and adjusts the current volume value of each audio according to the adjustment direction and the corresponding volume change amount. In this way the volume change amount corresponding to each audio in the mobile terminal, and hence the extent to which each audio needs to be adjusted, can be determined, which achieves accurate adjustment.

Fig. 10 shows an internal structural diagram of the mobile terminal in one embodiment. As shown in fig. 10, the mobile terminal includes a processor, a memory, a network interface, an input device, and a display screen connected through a system bus. Wherein the memory includes a non-volatile storage medium and an internal memory. The non-volatile storage medium of the mobile terminal stores an operating system and may also store a computer program that, when executed by the processor, causes the processor to implement the voice control method. The internal memory may also have a computer program stored therein, which when executed by the processor, causes the processor to perform the speech control method. The display screen of the mobile terminal can be a liquid crystal display screen or an electronic ink display screen, and the input device of the mobile terminal can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on a shell of the mobile terminal, an external keyboard, a touch pad or a mouse and the like.

Those skilled in the art will appreciate that the architecture shown in fig. 10 is merely a block diagram of the part of the structure relevant to the disclosed solution and does not limit the computing devices to which the disclosed solution applies; a particular computing device may include more or fewer components than those shown, may combine certain components, or may have a different arrangement of components.

In one embodiment, the voice control apparatus provided in the present application may be implemented in a form of a computer program, and the computer program may be executed on the smart speaker. The memory of the smart speaker may store various program modules constituting the voice control apparatus, such as a voice information obtaining module 802, a control instruction obtaining module 804 and a sending module 806 shown in fig. 8. The computer program constituted by the respective program modules causes the processor to execute the steps in the voice control method of the respective embodiments of the present application described in the present specification.

For example, the smart speaker may perform the step of acquiring the voice information through the voice information acquiring module 802 in the voice control apparatus as shown in fig. 8. The smart speaker may execute the step of obtaining the control instruction generated according to the voice information through the control instruction obtaining module 804. The smart speaker may perform the step of sending the control instruction to the mobile terminal through the sending module 806, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameter.

In one embodiment, the voice control apparatus provided in the present application may be implemented in the form of a computer program, and the computer program may be run on the mobile terminal as described in fig. 10. The memory of the mobile terminal may store various program modules constituting the voice control apparatus, such as a receiving module 902 and an adjusting module 904 shown in fig. 9. The computer program constituted by the respective program modules causes the processor to execute the steps in the voice control method of the respective embodiments of the present application described in the present specification.

For example, the mobile terminal may perform, through the receiving module 902 in the voice control apparatus shown in fig. 9, the step of receiving a control instruction sent by the smart speaker, where the control instruction is generated according to the voice information. The mobile terminal may perform, through the adjusting module 904, the step of adjusting its own audio parameters according to the control instruction.

In one embodiment, a computer device is provided. The computer device is a smart speaker or a mobile terminal and includes a memory and a processor. The memory stores a computer program that, when executed by the processor, causes the processor to execute the steps of the voice control method. Here, the steps of the voice control method may be the steps of the voice control method in any of the above embodiments.

In one embodiment, a computer-readable storage medium is provided, in which a computer program is stored. The computer program, when executed by a processor, causes the processor to execute the steps of the above voice control method. Here, the steps of the voice control method may be the steps of the voice control method in any of the above embodiments.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium and which, when executed, can include the processes of the method embodiments described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).

The technical features of the above embodiments may be combined arbitrarily. For brevity, not all possible combinations of these technical features are described; however, any combination of these features should be considered within the scope of this specification as long as it involves no contradiction.

The above embodiments express only several implementations of the present application. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the present application. It should be noted that a person skilled in the art can make several variations and modifications without departing from the concept of the present application, all of which fall within the scope of protection of the present application. Therefore, the scope of protection of this patent shall be subject to the appended claims.

Claims

1. A voice control method, the method comprising:

the smart speaker acquires voice information;

2. The method of claim 1, wherein the smart speaker obtaining the control instruction generated according to the voice information comprises:

the smart speaker obtains a generated control instruction comprising an audio and a corresponding audio parameter, wherein the audio parameter corresponding to the audio is determined according to the voice information.

3. The method of claim 2, wherein the smart speaker obtaining the generated control instruction comprising the audio and the corresponding audio parameter comprises:

the smart speaker obtains a generated control instruction comprising audio parameters corresponding to at least two audios and an adjustment sequence, wherein the adjustment sequence is determined according to the chronological order of the at least two audios in the voice information.

4. The method of claim 1, wherein the audio parameter comprises volume;

the smart speaker obtaining the control instruction generated according to the voice information comprises:

the smart speaker obtains a generated control instruction comprising a volume adjustment direction and a volume adjustment amplitude, wherein the volume adjustment direction is determined according to the voice information, and the volume adjustment amplitude is obtained according to the volume adjustment direction.

5. The method of claim 1, wherein after the smart speaker acquires the voice information, the method further comprises:

the smart speaker uploads the voice information to a cloud;

and the smart speaker receives a control instruction that is returned by the cloud and generated according to the voice information.

6. A method of voice control, the method comprising:

7. The method of claim 6, further comprising:

the mobile terminal identifies the control instruction and determines the audio in the control instruction;

the mobile terminal counts audio parameters corresponding to the same audio in the control instruction;

and the mobile terminal adjusts the audio parameters of the corresponding audio in the mobile terminal according to the audio parameters corresponding to the audio.

8. The method of claim 6, further comprising:

the mobile terminal counts audio parameters corresponding to each audio in at least two audios in the control instruction, and determines an adjustment sequence corresponding to each audio;

and the mobile terminal sequentially adjusts the audio parameters of the corresponding audio in the mobile terminal according to the audio parameters and the adjustment sequence corresponding to the audio.

9. The method of claim 6, wherein the control instruction comprises an adjustment direction and a volume variation;

the mobile terminal acquires the current volume value of each audio of the mobile terminal;

the mobile terminal converts the volume variation in the control instruction into a volume variation corresponding to each audio of the mobile terminal;

and the mobile terminal adjusts the current volume value of each audio of the mobile terminal according to the adjustment direction and the volume variation corresponding to that audio.

10. A voice control apparatus, applied to a smart speaker, the apparatus comprising:

11. A voice control apparatus, applied to a mobile terminal, the apparatus comprising:

12. A computer-readable storage medium, storing a computer program which, when executed by a processor, causes the processor to carry out the steps of the method according to any one of claims 1 to 9.

13. A computer device comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the steps of the method according to any one of claims 1 to 9.