CN110459215A - Sound control method, device, computer equipment and computer readable storage medium - Google Patents

Info

Publication number
CN110459215A
Authority
CN
China
Prior art keywords
mobile terminal
audio
control instruction
sound box
volume
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910740319.0A
Other languages
Chinese (zh)
Other versions
CN110459215B (en)
Inventor
邓衍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201910740319.0A priority Critical patent/CN110459215B/en
Publication of CN110459215A publication Critical patent/CN110459215A/en
Application granted granted Critical
Publication of CN110459215B publication Critical patent/CN110459215B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04B TRANSMISSION
    • H04B5/00 Near-field transmission systems, e.g. inductive or capacitive transmission systems
    • H04B5/70 Near-field transmission systems, e.g. inductive or capacitive transmission systems specially adapted for specific purposes
    • H04B5/72 Near-field transmission systems, e.g. inductive or capacitive transmission systems specially adapted for specific purposes for local intradevice communication
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72409 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories
    • H04M1/72412 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories using two-way short-range wireless interfaces
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Telephone Function (AREA)

Abstract

The present application relates to a voice control method, apparatus, computer device, and computer-readable storage medium. The method includes: a smart speaker acquires voice information; the smart speaker acquires a control instruction generated according to the voice information; and the smart speaker sends the control instruction to a mobile terminal through a Bluetooth protocol, where the control instruction instructs the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smart phone, a tablet computer, and a notebook computer. The solution provided by the present application enables voice control of the mobile terminal through the smart speaker, so the audio of the mobile terminal need not be adjusted manually, which is more convenient for the user and meets more of the user's needs.

Description

Voice control method, device, computer equipment and computer readable storage medium
Technical Field
The present application relates to the field of computer technologies, and in particular, to a voice control method, apparatus, computer device, and computer-readable storage medium.
Background
With the development of computer technology, voice control technology has emerged: a technology by which a machine converts voice signals into corresponding text or commands through recognition and understanding. Many voice recognition and control products have entered the market and service fields, for example products that adjust an air conditioner or the volume of a smart speaker by voice.
When a conventional smart speaker is connected to a terminal through Bluetooth, voice commands generally adjust only the volume of the speaker. If the volume of the terminal is too low, the volume the user wants cannot be reached even when the speaker volume is set to its maximum. The volume of the mobile phone then has to be adjusted by hand, which is inconvenient.
Disclosure of Invention
In view of the above, it is necessary to provide a voice control method, apparatus, computer device, and computer-readable storage medium that address the above technical problem.
A voice control method, the method comprising:
a smart speaker acquires voice information;
the smart speaker acquires a control instruction generated according to the voice information;
the smart speaker sends the control instruction to a mobile terminal through a Bluetooth protocol, wherein the control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smart phone, a tablet computer, and a notebook computer.
A voice control method, applied to a mobile terminal, the method comprising:
the mobile terminal receives a control instruction sent by a smart speaker through a Bluetooth protocol, wherein the control instruction is generated according to voice information, and the mobile terminal comprises at least one of a smart phone, a tablet computer, and a notebook computer;
the mobile terminal adjusts its own audio parameters according to the control instruction.
A voice control apparatus, applied to a smart speaker, the apparatus comprising:
a voice information acquisition module, configured to acquire voice information;
a control instruction acquisition module, configured to acquire a control instruction generated according to the voice information;
a sending module, configured to send the control instruction to a mobile terminal through a Bluetooth protocol, wherein the control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smart phone, a tablet computer, and a notebook computer.
A voice control apparatus, applied to a mobile terminal, the apparatus comprising:
a receiving module, configured to receive a control instruction sent by a smart speaker, wherein the control instruction is generated according to voice information, and the mobile terminal comprises at least one of a smart phone, a tablet computer, and a notebook computer;
an adjusting module, configured to adjust the audio parameters of the mobile terminal according to the control instruction.
A computer-readable storage medium storing a computer program which, when executed by a processor, causes the processor to perform the steps of any of the above voice control methods.
A computer device comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the steps of any of the above voice control methods.
In the above voice control method, apparatus, computer device, and computer-readable storage medium, the smart speaker acquires voice information, acquires a control instruction generated according to the voice information, and sends the control instruction to the mobile terminal. The control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smart phone, a tablet computer, and a notebook computer. The mobile terminal can therefore be controlled by voice through the smart speaker without manually adjusting its audio, which is more convenient for the user and meets more of the user's needs.
Drawings
FIG. 1 is a diagram of an exemplary implementation of a voice control method;
FIG. 2 is a flow diagram illustrating a voice control method according to one embodiment;
FIG. 3 is a flow chart illustrating a voice control method according to another embodiment;
FIG. 4 is a flowchart illustrating a process of adjusting audio parameters by a mobile terminal according to a control command according to an embodiment;
FIG. 5 is a schematic diagram illustrating a process of adjusting audio parameters according to a control command by the mobile terminal in another embodiment;
FIG. 6 is a schematic diagram illustrating a process of adjusting audio parameters according to a control command by the mobile terminal in another embodiment;
FIG. 7 is a diagram illustrating a mobile terminal adjusting volume according to a control command according to an embodiment;
FIG. 8 is a block diagram showing the structure of a voice control apparatus according to an embodiment;
FIG. 9 is a block diagram showing the construction of a voice control apparatus according to another embodiment;
FIG. 10 is a block diagram of a mobile terminal in one embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
FIG. 1 is a diagram of an exemplary implementation of a voice control method. Referring to FIG. 1, the voice control method is applied to a voice control system. The voice control system includes a smart speaker 110 and a mobile terminal 120, which are connected through a Bluetooth protocol. The smart speaker 110 may be a speaker with voice recognition and Bluetooth capabilities. The mobile terminal 120 may be at least one of a smart phone, a tablet computer, a notebook computer, and the like.
In this embodiment, the smart speaker 110 and the mobile terminal 120 may communicate via the Bluetooth audio/video remote control protocol. The smart speaker 110 receives the user's voice information and uploads it to the cloud, where a voice recognition engine performs voice recognition to determine the user's intention. The cloud voice recognition engine obtains a recognition result and returns it to the smart speaker 110. The smart speaker 110 receives the recognition result, generates a control instruction according to it, and sends the control instruction to the mobile terminal 120. The mobile terminal 120 receives the control instruction and adjusts its own audio parameters accordingly.
In this embodiment, the smart speaker 110 collects the user's voice information through a microphone array and passes it to the voice recognition engine, which sends a recognition request to the cloud to determine the user's intention.
In this embodiment, after the cloud voice recognition engine obtains the recognition result, it generates a control instruction according to the recognition result and returns the control instruction to the smart speaker 110. The smart speaker 110 receives the control instruction and sends it to the mobile terminal 120 through the Bluetooth protocol.
In one embodiment, after receiving the user's voice information, the smart speaker 110 may recognize the voice information through its internal voice recognition engine. Once a recognition result is obtained, the smart speaker 110 may generate a control instruction according to the recognition result and then send the control instruction to the mobile terminal 120.
In one embodiment, as shown in FIG. 2, a voice control method is provided. This embodiment is mainly illustrated by applying the method to the smart speaker 110 in FIG. 1. Referring to FIG. 2, the voice control method specifically includes the following steps:
Step 202, the smart speaker acquires voice information.
Specifically, the smart speaker and the mobile terminal establish a communication connection through a Bluetooth protocol, and the smart speaker receives voice information uttered by a user.
Step 204, the smart speaker acquires a control instruction generated according to the voice information.
Specifically, the smart speaker performs voice recognition on the voice information through its internal voice recognition engine to identify the user's intention, and generates a corresponding control instruction according to the recognition result.
In this embodiment, the smart speaker may alternatively have a third-party device perform voice recognition on the voice information, receive the recognition result returned by the third-party device, and generate a corresponding control instruction according to the recognition result.
Step 206, the smart speaker sends a control instruction to the mobile terminal through a bluetooth protocol, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smart phone, a tablet computer, and a notebook computer.
Specifically, the smart speaker sends the control instruction to the mobile terminal, so that the mobile terminal adjusts its own audio parameters according to the control instruction.
In this embodiment, when the control instruction involves at least two mobile terminals, the smart speaker may identify which parts of the control instruction correspond to the same mobile terminal, group the control instruction by mobile terminal, and send each group to its corresponding mobile terminal.
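The grouping step described above can be sketched as follows. This is only an illustration under stated assumptions: the patent does not define a data format, so the `(terminal_id, instruction)` pair representation, the function name `group_by_terminal`, and the sample strings are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def group_by_terminal(instructions: List[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group (terminal_id, instruction) pairs by target terminal.

    The pair format is an assumption made for this sketch. The original
    order of instructions is preserved within each terminal's group.
    """
    grouped: Dict[str, List[str]] = defaultdict(list)
    for terminal_id, instruction in instructions:
        grouped[terminal_id].append(instruction)
    return dict(grouped)


# Hypothetical instructions addressed to two different mobile terminals.
instructions = [
    ("notebook", "music: stop playing"),
    ("smartphone", "alarm clock: off"),
    ("smartphone", "video: volume 70%"),
]
grouped = group_by_terminal(instructions)
```

Each resulting group could then be sent to its terminal over that terminal's own Bluetooth connection.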
In this embodiment, the method further includes: receiving response information from the mobile terminal. The response information is generated by the mobile terminal in response to the control instruction after receiving it, and is returned to the smart speaker.
Specifically, after sending the control instruction to the mobile terminal, the smart speaker waits for the mobile terminal to return the corresponding response information. When the smart speaker receives the response information, it converts the response information into speech and plays it to notify the user.
In the above voice control method, the smart speaker acquires voice information, acquires a control instruction generated according to the voice information, and sends the control instruction to the mobile terminal. The control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal comprises at least one of a smart phone, a tablet computer, and a notebook computer. The mobile terminal can therefore be controlled through the smart speaker without manually adjusting its audio, which is more convenient for the user and meets more of the user's needs.
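As a rough sketch of the speaker-side flow (steps 202 to 206, plus the spoken response prompt), the function below wires the stages together. Recognition, Bluetooth transport, and speech output are passed in as plain callables because the patent does not specify their interfaces; the names `run_voice_control`, `fake_recognize`, and `fake_send`, and the dictionary layout of the instruction, are assumptions for illustration only.

```python
from typing import Callable, List


def run_voice_control(
    voice_info: str,
    recognize: Callable[[str], dict],
    send_over_bluetooth: Callable[[dict], str],
    speak: Callable[[str], None],
) -> dict:
    """Speaker-side flow: acquire instruction, send it, announce the response."""
    # Step 204: obtain the control instruction generated from the voice information.
    instruction = recognize(voice_info)
    # Step 206: send the instruction to the mobile terminal and wait for its response.
    response = send_over_bluetooth(instruction)
    # Convert the response information to speech to prompt the user.
    speak(response)
    return instruction


# Stand-in stubs; a real device would call a recognition engine and a Bluetooth stack.
sent: List[dict] = []
spoken: List[str] = []


def fake_recognize(text: str) -> dict:
    return {"audio": "media", "volume": 70}


def fake_send(instruction: dict) -> str:
    sent.append(instruction)
    return "media volume set to 70"


result = run_voice_control("set the media volume to 70", fake_recognize, fake_send, spoken.append)
```

Passing the stages in as callables keeps the sketch independent of whether recognition runs on the speaker, on a third-party device, or in the cloud.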
In one embodiment, the smart speaker acquiring the control instruction generated according to the voice information includes: the smart speaker acquires a generated control instruction comprising an audio and a corresponding audio parameter, wherein the audio parameter corresponding to the audio is determined according to the voice information.
The control instruction comprises an audio and its corresponding audio parameter. The control instruction may contain a plurality of audios and the audio parameter corresponding to each. The audio includes, but is not limited to, audio types such as media, alarm clock, system ringtone of the mobile terminal, and call.
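One way to picture such a control instruction is as a small data structure holding one entry per audio type. The sketch below is hypothetical: the class names `AudioType`, `AudioAdjustment`, and `ControlInstruction`, and the parameter keys `volume` and `enabled`, are not taken from the patent.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class AudioType(Enum):
    """Audio types named in the description (the list is not exhaustive)."""
    MEDIA = "media"
    ALARM_CLOCK = "alarm clock"
    SYSTEM_RINGTONE = "system ringtone"
    CALL = "call"


@dataclass
class AudioAdjustment:
    audio: AudioType
    parameters: Dict[str, object]  # e.g. {"volume": 70}; key names are assumed


@dataclass
class ControlInstruction:
    adjustments: List[AudioAdjustment] = field(default_factory=list)


# A control instruction carrying two audios, each with its own parameters.
instruction = ControlInstruction(
    adjustments=[
        AudioAdjustment(AudioType.MEDIA, {"volume": 70}),
        AudioAdjustment(AudioType.ALARM_CLOCK, {"enabled": False}),
    ]
)
```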
Specifically, the smart speaker may perform voice recognition on the voice information through its internal voice recognition engine to identify the user's intention and obtain a recognition result. The smart speaker then converts the recognition result into a corresponding control instruction.
In this embodiment, the smart speaker may instead have a third-party device perform voice recognition on the voice information. The smart speaker then receives the recognition result returned by the third-party device, which includes the audio the user wants to adjust and the audio parameter to be adjusted for that audio, and generates a corresponding control instruction from them.
In this embodiment, when the recognition result involves at least two mobile terminals, the smart speaker determines, for each of the at least two mobile terminals, the audio to be adjusted and the audio parameter to be adjusted. The smart speaker then generates, for each mobile terminal, a control instruction containing that terminal's audio and audio parameter.
In the above voice control method, the smart speaker determines the audio and the corresponding audio parameter by recognizing the voice information, so the audio the user wants to adjust and the audio parameter to be adjusted can be determined. The smart speaker acquires a control instruction, generated either by itself or by a third-party device, that comprises the audio and the corresponding audio parameter, so as to instruct the mobile terminal to make the corresponding adjustment.
In one embodiment, the smart speaker acquiring the generated control instruction comprising the audio and the corresponding audio parameter includes: the smart speaker acquires a generated control instruction comprising the audio parameters corresponding to at least two audios and an adjustment order, wherein the adjustment order is determined according to the chronological order in which the at least two audios appear in the voice information.
Specifically, the smart speaker may perform voice recognition on the voice information through its internal voice recognition engine to identify the user's intention and obtain a recognition result. The smart speaker can then determine how many audios are present in the recognition result and, when at least two are present, determine the audio parameter corresponding to each of them. Next, the smart speaker determines the adjustment order of the at least two audios: it determines the time at which each audio appears in the voice information and compares these times to obtain the order of appearance, which is used as the adjustment order. Finally, the smart speaker generates the corresponding control instruction according to the at least two audios, the audio parameter corresponding to each audio, and the adjustment order.
In this embodiment, the smart speaker may instead have a third-party device perform voice recognition on the voice information to obtain a recognition result. Through the third-party device, the smart speaker can determine how many audios are present in the recognition result and, when at least two are present, determine the audio parameter corresponding to each of them. The third-party device also determines the time at which each of the at least two audios appears in the voice information and compares these times to obtain the order of appearance, which is used as the adjustment order. The third-party device then converts the at least two audios, the audio parameter corresponding to each audio, and the adjustment order into a control instruction, and the smart speaker acquires that control instruction from the third-party device.
In this embodiment, after the voice information is recognized, when the recognition result involves at least two mobile terminals, the smart speaker determines the audio and audio parameter corresponding to each of the at least two mobile terminals. When one mobile terminal corresponds to at least two audios, the smart speaker determines the audio parameter and adjustment order for each of those audios. The audio, audio parameters, and adjustment order corresponding to each mobile terminal are then converted into the control instruction for that mobile terminal.
For example, suppose the voice information received by the smart speaker is "turn off the music on the notebook computer, turn off the alarm clock on the smart phone, turn on video playback on the smart phone, and adjust the volume to 70%". Recognizing this voice information may yield the result "notebook computer-music-stop playing", "smart phone-alarm clock-off", "smart phone-video-adjust volume to 70%". The smart speaker can then determine that the audios to be adjusted on the smart phone are the alarm clock and the video, and determine the adjustment parameter corresponding to each. The smart speaker may further determine the adjustment order of the alarm clock and the video according to the order in which they appear in the voice information.
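The ordering step in this example can be sketched as below. The patent compares the times at which each audio appears in the voice information; this sketch substitutes the character position of each audio's name in the recognized text as a stand-in for that time, which is an assumption, as are the function name `adjustment_order` and the sample utterance string.

```python
from typing import List


def adjustment_order(voice_text: str, audios: List[str]) -> List[str]:
    """Order audios by where they first appear in the recognized text.

    Character position is used here as a proxy for time of occurrence.
    """
    positions = {audio: voice_text.find(audio) for audio in audios}
    missing = [audio for audio, pos in positions.items() if pos < 0]
    if missing:
        raise ValueError(f"audio not mentioned in voice text: {missing}")
    return sorted(audios, key=lambda audio: positions[audio])


# The smart phone portion of the example utterance.
utterance = (
    "turn off the alarm clock of the smart phone, "
    "turn on video playing of the smart phone, and adjust the volume to 70%"
)
order = adjustment_order(utterance, ["video", "alarm clock"])
```

Here `order` places the alarm clock before the video, matching the order in which the user mentioned them.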
In the above voice control method, based on recognition of the voice information, the smart speaker, either by itself or through a third-party device, generates a control instruction comprising the audio parameters corresponding to at least two audios and an adjustment order, where the adjustment order is determined according to the chronological order in which the at least two audios appear in the voice information. The mobile terminal is thereby instructed to adjust the audios in that order.
In one embodiment, the audio parameter includes volume;
the smart speaker acquiring the control instruction generated according to the voice information includes: the smart speaker acquires a generated control instruction comprising a volume adjustment direction and a volume adjustment amplitude, wherein the volume adjustment direction is determined according to the voice information, and the volume adjustment amplitude is obtained according to the volume adjustment direction.
The volume adjustment amplitude refers to the amount by which the volume changes, a volume multiplier, or the like. For example, the volume adjustment amplitude may be 3, 5, or 6, or it may be 2 times or 3 times the current volume value.
Specifically, the smart speaker recognizes the voice information through its internal voice recognition engine or the voice recognition engine of a third-party device, and obtains a recognition result. The smart speaker may determine whether an audio and a corresponding audio parameter are present in the recognition result. When the smart speaker detects that an audio parameter is present but no audio is specified, it may further determine the type of the audio parameter, for example volume or sound effect. When the smart speaker detects that volume is the only audio parameter in the recognition result, it determines the volume adjustment amplitude and the volume adjustment direction. The smart speaker then generates a corresponding control instruction according to the volume adjustment direction and the volume adjustment amplitude.
In this embodiment, when the smart speaker detects a volume adjustment direction in the recognition result, it determines that the audio parameter the user wants to adjust is the volume. The smart speaker then checks the recognition result for a volume adjustment amplitude corresponding to the volume adjustment direction. When a volume adjustment direction is present but no volume adjustment amplitude is, the smart speaker can obtain the preset volume adjustment amplitude corresponding to that direction, and convert the volume adjustment direction and the preset amplitude into a control instruction.
In the above voice control method, when the audio parameter includes volume, the smart speaker determines the volume adjustment direction and the volume adjustment amplitude. When the voice information contains no volume adjustment amplitude, the corresponding amplitude can be obtained according to the volume adjustment direction, so the volume adjustment information is completed and the user's needs are met. The generated control instruction comprising the volume adjustment direction and the volume adjustment amplitude is then sent to the mobile terminal, explicitly instructing the mobile terminal to adjust its volume.
In this embodiment, when the smart speaker detects that the recognition result contains a volume adjustment direction but no volume adjustment amplitude, it matches the volume adjustment direction in the recognition result against the preset volume adjustment directions. The preset volume adjustment directions include a positive direction, which indicates increasing the volume, and a negative direction, which indicates decreasing the volume. When the volume adjustment direction in the recognition result matches the positive direction, the smart speaker obtains the volume adjustment amplitude corresponding to the positive direction and uses it as the amplitude for the recognition result. When the volume adjustment direction matches the negative direction, the smart speaker likewise obtains and uses the volume adjustment amplitude corresponding to the negative direction. The smart speaker then generates a control instruction according to the volume adjustment direction in the recognition result and the corresponding volume adjustment amplitude.
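A minimal sketch of this fallback, assuming the preset amplitudes are kept in a lookup table keyed by direction. The table name `PRESET_AMPLITUDE`, the preset values 3 and 5, and the function name `resolve_volume_adjustment` are hypothetical; the patent does not state what the preset amplitudes are.

```python
from typing import Dict, Optional

# Hypothetical preset amplitudes for each preset direction.
PRESET_AMPLITUDE: Dict[str, int] = {
    "positive": 3,  # increase the volume
    "negative": 5,  # decrease the volume
}


def resolve_volume_adjustment(direction: str, amplitude: Optional[int] = None) -> dict:
    """Fill in a preset amplitude when the recognition result has only a direction."""
    if direction not in PRESET_AMPLITUDE:
        raise ValueError(f"unknown volume adjustment direction: {direction!r}")
    if amplitude is None:
        # No amplitude was recognized, so fall back to the preset for this direction.
        amplitude = PRESET_AMPLITUDE[direction]
    return {"direction": direction, "amplitude": amplitude}


louder = resolve_volume_adjustment("positive")        # amplitude taken from the preset
quieter = resolve_volume_adjustment("negative", 6)    # recognized amplitude is kept
```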
In one embodiment, after the voice information is obtained, the method further includes: the smart speaker uploads the voice information to a cloud;
the smart speaker obtaining the control instruction generated according to the voice information includes: the smart speaker receives the control instruction that is returned by the cloud and generated according to the voice information.
Specifically, after receiving the user's voice information, the smart speaker uploads it to the cloud, where a cloud speech recognition engine performs speech recognition on it to determine the user's intention and produces a recognition result. The smart speaker can then have the cloud speech recognition engine generate the corresponding control instruction from the recognition result, and the control instruction is returned to the smart speaker. The smart speaker thus receives the control instruction that is returned by the cloud and generated according to the voice information.
In the voice control method described above, the voice information is uploaded to the cloud, recognized there, and a recognition result is returned, which saves storage space on the smart speaker and improves recognition efficiency.
In one embodiment, as shown in FIG. 3, a method of voice control is provided. The embodiment is mainly illustrated by applying the method to the mobile terminal 120 in fig. 1. Referring to fig. 3, the voice control method specifically includes the following steps:
Step 302, the mobile terminal receives a control instruction sent by the smart speaker through a Bluetooth protocol, where the control instruction is generated according to the voice information, and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer.
Step 304, the mobile terminal adjusts its own audio parameters according to the control instruction.
Specifically, the mobile terminal is connected to the smart speaker through the Bluetooth Audio/Video Remote Control Profile (AVRCP). After the smart speaker obtains the generated control instruction, it sends the control instruction to the mobile terminal through AVRCP. The mobile terminal can monitor AVRCP callbacks in real time; once callback information is detected, the mobile terminal receives the control instruction sent by the smart speaker through AVRCP, where the control instruction is generated according to the voice information. After receiving the control instruction, the mobile terminal adjusts its own audio parameters accordingly. Because the mobile terminal receives the control instruction from the smart speaker and adjusts its audio parameters according to it, the user can adjust the volume of the mobile terminal directly by voice through the smart speaker, which provides more convenient voice control.
In one embodiment, as shown in fig. 4, the method further comprises:
Step 402, the mobile terminal identifies the control instruction and determines the audio in the control instruction.
Specifically, after receiving the control instruction, the mobile terminal identifies it to determine which audios are present in the control instruction.
Step 404, the mobile terminal counts the audio parameters corresponding to the same audio in the control instruction.
Specifically, the mobile terminal may extract the audios present in the control instruction and determine the audio parameter corresponding to each of them. The mobile terminal can then compare these audios one by one to determine whether the same audio appears more than once in the control instruction. When the same audio appears more than once, the audio parameters corresponding to it are grouped together, so that the audio parameters corresponding to the same audio in the control instruction are counted.
In this embodiment, the mobile terminal may extract the audios present in the control instruction and compare them one by one to determine whether the same audio appears more than once. When it does, the mobile terminal can keep any one occurrence of that audio and collect all of the audio parameters corresponding to it, so that the audio parameters corresponding to the same audio in the control instruction are counted.
The mobile terminal adjusting its own audio parameters according to the control instruction includes:
Step 406, the mobile terminal adjusts the audio parameter of the corresponding audio in the mobile terminal according to the audio parameter corresponding to the audio.
Specifically, from among its own audios, the mobile terminal identifies those that match each counted audio in the control instruction, and thereby determines which of its audios need their audio parameters adjusted. The mobile terminal then obtains the current audio parameter of each such audio and changes it to the audio parameter corresponding to the same audio in the control instruction.
For example, the control instruction received by the mobile terminal is: adjust the ringtone volume of the mobile terminal to 5, adjust the media volume to 60, adjust the alarm clock volume to 100, and adjust the media volume to 50. The mobile terminal identifies the control instruction and learns that the audios in it are the ringtone, the media and the alarm clock, each with its corresponding audio parameters. The mobile terminal can count the volume parameters corresponding to the same audio: the media appears twice in the control instruction and both occurrences are the same audio, so the mobile terminal keeps one media entry and obtains its audio parameters, namely "adjust to 60" and "adjust to 50".
The mobile terminal then identifies its own system ringtone, media and alarm clock from among its audios and obtains their current volume values. The mobile terminal adjusts the current volume value of its ringtone to 5 and that of the alarm clock to 100; it adjusts the current volume value of the media to 60 and then adjusts the resulting media volume value to 50.
In the voice control method described above, the mobile terminal identifies the control instruction, determines the audio in it, counts the audio parameters corresponding to the same audio, and adjusts the audio parameter of the corresponding audio in the mobile terminal accordingly. When the control instruction requires the same audio to be adjusted several times, all of those adjustments can be made after switching to that audio once, which avoids the cost of switching back and forth between audios, speeds up the adjustment of audio parameters and improves the response speed of voice control.
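The grouping step in the example above can be sketched as follows. This is an illustrative sketch only: the control instruction is assumed to have already been parsed into `(audio, target_volume)` pairs, and the function names are hypothetical.

```python
from typing import Dict, List, Tuple


def group_by_audio(items: List[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Collect every target value that belongs to the same audio."""
    grouped: Dict[str, List[int]] = {}
    for audio, value in items:
        # Repeated audios share one key, so each audio is visited once later.
        grouped.setdefault(audio, []).append(value)
    return grouped


def apply_grouped(current: Dict[str, int],
                  grouped: Dict[str, List[int]]) -> Dict[str, int]:
    """Apply all values of one audio before moving on to the next audio."""
    for audio, values in grouped.items():
        if audio not in current:
            continue  # the terminal has no such audio; nothing to adjust
        for value in values:
            current[audio] = value
    return current


instruction = [("ringtone", 5), ("media", 60), ("alarm", 100), ("media", 50)]
grouped = group_by_audio(instruction)
volumes = apply_grouped({"ringtone": 30, "media": 20, "alarm": 40}, grouped)
```

With the example instruction, `media` is handled once with the two values 60 and 50 applied back to back, instead of being revisited after the alarm clock.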
In this embodiment, the mobile terminal may select an audio and its corresponding audio parameter from the control instruction, and identify the matching audio from among the audios of the mobile terminal. The mobile terminal can obtain the current audio parameter of that matching audio and change it to the audio parameter corresponding to the selected audio. In the same way, for each audio in the control instruction, the current audio parameter of the matching audio in the mobile terminal can be adjusted to the corresponding audio parameter in the control instruction.
In one embodiment, after counting the audio parameters corresponding to the same audio in the control instruction, the method further includes: the mobile terminal determines the adjustment sequence of each occurrence among at least two occurrences of the same audio, and adjusts the audio parameter of the corresponding audio in the mobile terminal according to the audio parameter of the occurrence that is last in the adjustment sequence. For example, the media appears twice in the control instruction received by the mobile terminal: the first occurrence requires the media volume to be adjusted to 60, and the second requires it to be adjusted to 50. The mobile terminal may take the second occurrence, "adjust the media to 50", as the target of the media adjustment and adjust the current media volume in the mobile terminal directly to 50.
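A minimal sketch of this "last occurrence wins" variant, again assuming the control instruction has been parsed into `(audio, target_volume)` pairs listed in adjustment sequence; the function name is hypothetical.

```python
from typing import Dict, List, Tuple


def last_target_per_audio(items: List[Tuple[str, int]]) -> Dict[str, int]:
    """Keep only the final target value for each audio."""
    targets: Dict[str, int] = {}
    for audio, value in items:
        # A later occurrence of the same audio overwrites the earlier one.
        targets[audio] = value
    return targets


targets = last_target_per_audio(
    [("ringtone", 5), ("media", 60), ("alarm", 100), ("media", 50)]
)
```

Here the intermediate media value 60 is never applied; the terminal only needs to set media to 50.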
In one embodiment, as shown in fig. 5, the method further comprises:
Step 502, the mobile terminal identifies the control instruction and determines the audio in the control instruction.
Step 504, the mobile terminal counts audio parameters corresponding to each of at least two audios in the control instruction, and determines an adjustment sequence corresponding to each audio.
Specifically, after receiving the control instruction, the mobile terminal identifies it to determine which audios are present in the control instruction. When at least two audios are present, the mobile terminal counts the audio parameter corresponding to each of them. The mobile terminal may then use the order in which the audios appear in the control instruction as the adjustment sequence corresponding to each audio.
In this embodiment, the mobile terminal may detect whether the control command carries an adjustment sequence corresponding to each audio, and if not, the mobile terminal may randomly set the adjustment sequence corresponding to each audio in at least two audios in the control command.
In this embodiment, the mobile terminal may determine the adjustment sequence of each audio according to the number of audio parameters corresponding to each of the at least two audios in the control instruction. The more audio parameters an audio has, the earlier it is placed in the adjustment sequence.
The mobile terminal adjusting its own audio parameters according to the control instruction includes:
Step 506, the mobile terminal sequentially adjusts the audio parameters of the corresponding audios in the mobile terminal according to the audio parameters corresponding to the audios and the adjustment sequence.
Specifically, the mobile terminal sequentially obtains the current audio parameters of the same audio in the mobile terminal according to the audio parameters corresponding to the audio and the determined adjustment sequence, and adjusts the current audio parameters to the audio parameters corresponding to the audio in the control instruction.
In this embodiment, the mobile terminal may detect whether the control command carries an adjustment sequence corresponding to each audio, and if so, the mobile terminal adjusts the current audio parameter of the corresponding audio in the mobile terminal according to the adjustment sequence and the audio parameter corresponding to each audio in the control command, so as to meet the requirement of the user.
In the voice control method described above, the mobile terminal identifies the control instruction, determines the audio in it, counts the audio parameter corresponding to each of at least two audios, determines the adjustment sequence corresponding to each audio, and sequentially adjusts the audio parameters of the corresponding audios in the mobile terminal according to those audio parameters and the adjustment sequence. When the control instruction carries an adjustment sequence, the adjustment follows it; when it does not, the mobile terminal can quickly determine an adjustment sequence for the audios itself, so that the user's request is answered quickly and in an orderly manner.
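The ordering logic can be sketched as follows. The `AudioEntry` type and function names are hypothetical, and of the fallbacks mentioned in this embodiment (order of appearance, random order, number of parameters) the sketch assumes order of appearance.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class AudioEntry:
    audio: str
    value: int
    order: Optional[int] = None  # adjustment sequence carried by the instruction, if any


def resolve_sequence(entries: List[AudioEntry]) -> List[AudioEntry]:
    """Use the carried adjustment sequence, else the order of appearance."""
    if entries and all(e.order is not None for e in entries):
        return sorted(entries, key=lambda e: e.order)
    return list(entries)


def adjust_in_sequence(current: Dict[str, int],
                       entries: List[AudioEntry]) -> List[str]:
    """Adjust each matching audio in turn; return the audios in applied order."""
    applied: List[str] = []
    for entry in resolve_sequence(entries):
        if entry.audio in current:
            current[entry.audio] = entry.value
            applied.append(entry.audio)
    return applied
```

The returned list makes the effective adjustment sequence visible, which is convenient when checking that a carried sequence was honored.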
In one embodiment, as shown in fig. 6, the control instruction includes an adjustment direction and a volume change amount;
the mobile terminal adjusting its own audio parameters according to the control instruction includes:
Step 602, the mobile terminal obtains the current volume value of each audio of the mobile terminal.
Specifically, after receiving a control instruction sent by the smart speaker, the mobile terminal identifies the control instruction. When the mobile terminal detects that the control instruction contains only an adjustment direction and a volume change amount, it determines that the user wants the volume of all audios in the mobile terminal adjusted according to that direction and amount. The mobile terminal then obtains the current volume value of each of its audios.
Step 604, the mobile terminal converts the volume change amount in the control instruction into a volume change amount corresponding to each audio of the mobile terminal.
Specifically, different audios have different volume adjustment ranges, so the mobile terminal needs to convert the volume change amount in the control instruction into the volume change amount corresponding to each of its audios through a conversion function. For example, if the volume change amount in the control instruction is 5, the volume change amount for audios such as media, alarm clock and call is not necessarily 5; the amount for each of them must be obtained through the conversion function.
Step 606, the mobile terminal adjusts the current volume value of each audio of the mobile terminal according to the adjustment direction and the volume change amount corresponding to that audio.
Specifically, the mobile terminal adjusts the current volume value of each audio according to the adjustment direction in the control instruction and the volume change amount corresponding to each audio in the mobile terminal.
Further, the mobile terminal determines whether the adjustment direction is positive or negative. When the adjustment direction is positive, the volume is to be increased, and the mobile terminal adds the volume change amount corresponding to each audio to the current volume value of that audio. When the adjustment direction is negative, the volume is to be decreased, and the mobile terminal subtracts the volume change amount corresponding to each audio from the current volume value of that audio.
In the voice control method described above, the mobile terminal obtains the current volume value of each of its audios, converts the volume change amount in the control instruction into the volume change amount corresponding to each audio, and adjusts the current volume value of each audio according to the adjustment direction and the corresponding volume change amount. In this way the volume change amount for each audio in the mobile terminal, and therefore the extent to which each audio needs to be adjusted, is determined, and accurate adjustment is achieved.
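Steps 602 to 606 can be sketched as follows. The description does not define the conversion function, so this sketch assumes a simple proportional rescaling; the source scale `SOURCE_MAX`, the per-audio ranges in `MAX_VOLUME`, the rule of moving at least one step, and the clamping to the valid range are all assumptions made for illustration.

```python
from typing import Dict

SOURCE_MAX = 100  # assumed scale of the volume change amount in the instruction
MAX_VOLUME = {"media": 15, "alarm": 7, "call": 5}  # hypothetical per-audio ranges


def convert_delta(delta: int, audio: str) -> int:
    """Rescale a change amount from the source scale to one audio's range."""
    if delta == 0:
        return 0
    scaled = round(delta * MAX_VOLUME[audio] / SOURCE_MAX)
    return max(1, scaled)  # assumption: a non-zero request moves at least one step


def adjust_all(current: Dict[str, int], direction: str, delta: int) -> Dict[str, int]:
    """Add or subtract the converted change amount for every audio."""
    sign = 1 if direction == "positive" else -1
    result: Dict[str, int] = {}
    for audio, value in current.items():
        step = convert_delta(delta, audio)
        # Clamp so the adjusted value stays inside the audio's own range.
        result[audio] = min(MAX_VOLUME[audio], max(0, value + sign * step))
    return result


before = {"media": 10, "alarm": 7, "call": 2}
louder = adjust_all(before, "positive", 20)
quieter = adjust_all(before, "negative", 20)
```

With these assumed ranges, a change amount of 20 becomes 3 steps for media but only 1 step for the alarm clock and the call, which illustrates why the same amount cannot be applied to every audio unchanged.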
Fig. 7 is a schematic diagram of the mobile terminal adjusting the volume according to the control instruction in one embodiment. The mobile terminal connects to the smart speaker through the Bluetooth audio/video remote control protocol and, once connected, monitors callbacks of that protocol in real time. When the mobile terminal detects a control instruction sent by the smart speaker through the protocol, it calls the volume setting function to pass the control instruction to the audio service. After receiving the control instruction, the audio service determines which audio in the mobile terminal needs its volume adjusted. Because different audio types have different volume adjustment ranges, the volume change amount must be converted from the source stream type to the target stream type. The audio service uses the conversion function to convert the volume change amount for the audio in the control instruction into the volume change amount for the audio to be adjusted in the mobile terminal. The mobile terminal then, through the audio service, adjusts the current volume value of that audio according to the adjustment direction in the control instruction and the converted volume change amount, and obtains and stores the adjusted volume value. Finally, the audio service sends the adjusted volume value to the audio mixer, which stores it and outputs it through the audio mixing thread.
In one embodiment, the voice control method includes:
The smart speaker acquires voice information.
Then, the smart speaker uploads the voice information to the cloud.
Further, the smart sound box receives a control instruction which is returned by the cloud and comprises audio parameters corresponding to at least two audios and an adjusting sequence, wherein the adjusting sequence is determined according to the time sequence of the at least two audios in the voice message.
And then, the smart sound box determines a mobile terminal corresponding to the control instruction, and sends the control instruction to the corresponding mobile terminal, wherein the control instruction is used for instructing the mobile terminal to adjust the audio parameter of the mobile terminal, and the mobile terminal comprises at least one of a smart phone, a tablet computer and a notebook computer.
Optionally, the mobile terminal receives a control instruction sent by the smart sound box, where the control instruction is generated according to the voice information.
Then, the mobile terminal identifies the control instruction and determines the audio in the control instruction.
Then, the mobile terminal counts the audio parameters corresponding to each of at least two audios in the control instruction, and determines the adjustment sequence corresponding to each audio.
Further, the mobile terminal sequentially adjusts the audio parameters of the corresponding audio in the mobile terminal according to the audio parameters and the adjustment sequence corresponding to the audio.
Optionally, when the control instruction includes an adjustment direction and a volume change, the mobile terminal obtains a current volume value of each audio in the mobile terminal; the mobile terminal converts the volume variation in the control instruction into the volume variation corresponding to each audio in the mobile terminal; and the mobile terminal respectively adjusts the current volume value of each audio of the mobile terminal according to the adjusting direction and the volume variable quantity corresponding to each audio of the mobile terminal.
In the voice control method described above, the smart speaker receives the user's voice information and uploads it to the cloud; the cloud recognizes the voice information and returns a control instruction based on the recognition result. Recognizing the voice information in the cloud saves storage space on the smart speaker. The control instruction is sent to the mobile terminal to instruct it to adjust the corresponding audio parameters. After receiving the control instruction, the mobile terminal determines the audio in it and counts the audio parameter and the adjustment sequence corresponding to each of at least two audios, so that the audio parameters of the audios in the mobile terminal can be adjusted according to the information in the control instruction. In this scheme, the smart speaker receives the user's voice information, a control instruction is generated through recognition and sent to the mobile terminal, and the smart speaker thereby controls the mobile terminal to adjust its audio parameters, which is more convenient for the user and improves the user experience.
Figs. 2-6 are flow diagrams illustrating the voice control method in one embodiment. It should be understood that although the steps in the flowcharts of Figs. 2-6 are shown in the order indicated by the arrows, they are not necessarily performed in that order. Unless explicitly stated otherwise herein, the steps are not strictly limited to the order shown and may be performed in other orders. Moreover, at least some of the steps in Figs. 2-6 may include multiple sub-steps or stages, which are not necessarily performed at the same time but may be performed at different times; these sub-steps or stages are not necessarily performed sequentially, and may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
In one embodiment, as shown in fig. 8, a voice control apparatus is provided for use in a smart speaker. The voice control device includes a voice information obtaining module 802, a control instruction obtaining module 804 and a sending module 806, wherein:
The voice information obtaining module 802 is configured to obtain voice information.
The control instruction obtaining module 804 is configured to obtain a control instruction generated according to the voice information.
The sending module 806 is configured to send the control instruction to a mobile terminal through a Bluetooth protocol, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameters, and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer.
The voice control device described above obtains the voice information, obtains the control instruction generated according to the voice information, and sends the control instruction to the mobile terminal, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameters and the mobile terminal includes at least one of a smartphone, a tablet computer and a notebook computer. Voice control of the mobile terminal can therefore be achieved through the smart speaker, which is more convenient for users and meets more of their needs.
In one embodiment, the control instruction obtaining module 804 is further configured to: obtain the generated control instruction that includes an audio and its corresponding audio parameter, where the audio parameter corresponding to the audio is determined according to the voice information. By recognizing the voice information, the voice control device determines the audio and the corresponding audio parameter in the voice information, and thereby determines which audio the user wants adjusted and to what audio parameter. The device obtains the control instruction, generated either by the smart speaker or by a third-party device, that includes the audio and the corresponding audio parameter, so as to instruct the mobile terminal to make the corresponding adjustment.
In one embodiment, the control instruction obtaining module 804 is further configured to: obtain the generated control instruction that includes the audio parameters corresponding to at least two audios and an adjustment sequence, where the adjustment sequence is determined according to the chronological order in which the at least two audios appear in the voice information. In the voice control device described above, based on recognition of the voice information, the control instruction including the audio parameters corresponding to at least two audios and the adjustment sequence is generated either by the smart speaker itself or by a third-party device on its behalf, so that the mobile terminal is instructed to adjust the audios in that adjustment sequence.
In one embodiment, the audio parameter includes a volume, and the control instruction obtaining module 804 is further configured to:
obtain the generated control instruction that includes a volume adjustment direction and a volume adjustment amplitude, where the volume adjustment direction is determined according to the voice information and the volume adjustment amplitude is obtained according to the volume adjustment direction. When the audio parameters include volume, the voice control device determines the volume adjustment direction and the volume adjustment amplitude; when the voice information contains no volume adjustment amplitude, the corresponding amplitude can be obtained from the volume adjustment direction, which supplements the missing volume adjustment information and meets the user's needs. The generated control instruction, which includes the volume adjustment direction and the volume adjustment amplitude, is then obtained and sent to the mobile terminal, so that the mobile terminal is explicitly instructed to adjust its volume.
In one embodiment, the apparatus further includes an uploading module, which is configured to upload the voice information to the cloud after the voice information is obtained.
The control instruction obtaining module 804 is further configured to: receive the control instruction that is returned by the cloud and generated according to the voice information. Because the voice information is uploaded to the cloud, recognized there, and a recognition result is returned, storage space on the smart speaker is saved and recognition efficiency is improved.
In one embodiment, as shown in fig. 9, a voice control apparatus is provided for use in a mobile terminal. The voice control device includes a receiving module 902 and an adjusting module 904, wherein:
The receiving module 902 is configured to receive a control instruction sent by the smart speaker, where the control instruction is generated according to the voice information.
The adjusting module 904 is configured to adjust the audio parameters of the mobile terminal according to the control instruction.
In this embodiment, the mobile terminal receives the control instruction sent by the smart speaker and adjusts the audio parameter of the mobile terminal according to the control instruction, so that the user can directly adjust the volume of the mobile terminal by using voice through the smart speaker, thereby providing more convenient voice control for the user.
In one embodiment, the apparatus further includes an identification module configured to: identify the control instruction and determine the audio in the control instruction; and count the audio parameters corresponding to the same audio in the control instruction.
The adjusting module 904 is further configured to: adjust the audio parameter of the corresponding audio in the mobile terminal according to the audio parameter corresponding to the audio.
The voice control device described above identifies the control instruction, determines the audio in it, counts the audio parameters corresponding to the same audio, and adjusts the audio parameter of the corresponding audio in the mobile terminal accordingly. When the control instruction requires the same audio to be adjusted several times, all of those adjustments can be made after switching to that audio once, which avoids the cost of switching back and forth between audios, speeds up the adjustment of audio parameters and improves the response speed of voice control.
In one embodiment, the identification module is configured to: identify the control instruction and determine the audio in the control instruction; and count the audio parameter corresponding to each of at least two audios in the control instruction and determine the adjustment sequence corresponding to each audio.
The adjusting module 904 is further configured to: sequentially adjust the audio parameters of the corresponding audios in the mobile terminal according to the audio parameters corresponding to the audios and the adjustment sequence.
The voice control device described above identifies the control instruction, determines the audio in it, counts the audio parameter corresponding to each of at least two audios, determines the adjustment sequence corresponding to each audio, and sequentially adjusts the audio parameters of the corresponding audios in the mobile terminal according to those audio parameters and the adjustment sequence. When the control instruction carries an adjustment sequence, the adjustment follows it; when it does not, an adjustment sequence for the audios can be determined quickly, so that the user's request is answered quickly and in an orderly manner.
In one embodiment, the control instruction comprises an adjustment direction and a volume change amount.
The adjusting module 904 is further configured to: acquire the current volume value of each audio in the mobile terminal; convert the volume change amount in the control instruction into a volume change amount corresponding to each audio in the mobile terminal; and adjust the current volume value of each audio of the mobile terminal according to the adjustment direction and the volume change amount corresponding to that audio.
The voice control apparatus acquires the current volume value of each audio in the mobile terminal, converts the volume change amount in the control instruction into a volume change amount corresponding to each audio in the mobile terminal, and adjusts the current volume value of each audio according to the adjustment direction and the volume change amount corresponding to that audio. Because the change amount is converted for each audio individually, the range by which each audio in the mobile terminal needs to be adjusted is known, and the adjustment is accurate.
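A minimal sketch of the conversion step is given below, assuming the control instruction expresses the volume change as a percentage and each audio in the mobile terminal has its own integer volume scale. The percentage representation, the `"up"`/`"down"` direction values, the rounding rule, and all function and variable names are assumptions made for illustration only.

```python
def convert_change(change_percent, max_level):
    """Convert a percentage change into whole steps on one audio's own scale."""
    return round(change_percent / 100 * max_level)


def adjust_volumes(current, max_levels, direction, change_percent):
    """Return the new volume of every audio after applying the instruction.

    current:    dict audio name -> current volume level
    max_levels: dict audio name -> maximum level of that audio's scale
    direction:  "up" or "down" (hypothetical encoding of the adjustment direction)
    """
    sign = 1 if direction == "up" else -1
    adjusted = {}
    for audio, level in current.items():
        step = convert_change(change_percent, max_levels[audio])
        # Clamp so the result stays within the audio's valid range.
        adjusted[audio] = min(max(level + sign * step, 0), max_levels[audio])
    return adjusted


current = {"media": 10, "ringtone": 3}
max_levels = {"media": 15, "ringtone": 7}
louder = adjust_volumes(current, max_levels, "up", 20)
muted = adjust_volumes(current, max_levels, "down", 100)
```

The same 20 percent request becomes three steps on the 15-level media scale and one step on the 7-level ringtone scale, which is the kind of per-audio conversion the paragraph above refers to.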
Fig. 10 shows an internal structural diagram of the mobile terminal in one embodiment. As shown in fig. 10, the mobile terminal includes a processor, a memory, a network interface, an input device, and a display screen connected through a system bus. The memory includes a non-volatile storage medium and an internal memory. The non-volatile storage medium of the mobile terminal stores an operating system and may also store a computer program that, when executed by the processor, causes the processor to implement the voice control method. The internal memory may also store a computer program that, when executed by the processor, causes the processor to perform the voice control method. The display screen of the mobile terminal may be a liquid crystal display or an electronic ink display. The input device of the mobile terminal may be a touch layer covering the display screen; a key, a trackball or a touch pad arranged on the housing of the mobile terminal; or an external keyboard, touch pad, mouse or the like.
Those skilled in the art will appreciate that the architecture shown in fig. 10 is merely a block diagram of some of the structures associated with the disclosed aspects and does not limit the computing devices to which the disclosed aspects apply. A particular computing device may include more or fewer components than those shown, may combine certain components, or may have a different arrangement of components.
In one embodiment, the voice control apparatus provided in the present application may be implemented in the form of a computer program, and the computer program may be run on the smart speaker. The memory of the smart speaker may store the program modules constituting the voice control apparatus, such as the voice information obtaining module 802, the control instruction obtaining module 804 and the sending module 806 shown in fig. 8. The computer program constituted by these program modules causes the processor to execute the steps of the voice control method of the embodiments of the present application described in this specification.
For example, the smart speaker may perform the step of acquiring the voice information through the voice information acquiring module 802 in the voice control apparatus as shown in fig. 8. The smart speaker may execute the step of obtaining the control instruction generated according to the voice information through the control instruction obtaining module 804. The smart speaker may perform the step of sending the control instruction to the mobile terminal through the sending module 806, where the control instruction is used to instruct the mobile terminal to adjust its own audio parameter.
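The three smart speaker steps above can be sketched as a small pipeline. In this illustrative Python example, the class name and the three injected callables are hypothetical stand-ins for modules 802, 804 and 806; real audio capture, instruction generation (which may happen in the cloud) and Bluetooth transmission are deliberately replaced with stubs.

```python
class SmartSpeakerVoiceControl:
    """Hypothetical wiring of the three program modules shown in fig. 8."""

    def __init__(self, capture_voice, build_instruction, send):
        self.capture_voice = capture_voice          # stands in for module 802
        self.build_instruction = build_instruction  # stands in for module 804
        self.send = send                            # stands in for module 806

    def run_once(self):
        voice = self.capture_voice()                 # acquire voice information
        instruction = self.build_instruction(voice)  # obtain the control instruction
        self.send(instruction)                       # send it to the mobile terminal
        return instruction


# Stubs: a fixed utterance, a fixed instruction, and a list used as the "link".
sent = []
speaker = SmartSpeakerVoiceControl(
    capture_voice=lambda: "turn the media volume up",
    build_instruction=lambda voice: {"audio": "media", "direction": "up"},
    send=sent.append,
)
result = speaker.run_once()
```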
In one embodiment, the voice control apparatus provided in the present application may be implemented in the form of a computer program, and the computer program may be run on the mobile terminal as described in fig. 10. The memory of the mobile terminal may store various program modules constituting the voice control apparatus, such as a receiving module 902 and an adjusting module 904 shown in fig. 9. The computer program constituted by the respective program modules causes the processor to execute the steps in the voice control method of the respective embodiments of the present application described in the present specification.
For example, the mobile terminal may perform, through the receiving module 902 in the voice control apparatus shown in fig. 9, the step of receiving a control instruction sent by the smart speaker, where the control instruction is generated according to the voice information. The mobile terminal may perform the step of adjusting its own audio parameter according to the control instruction through the adjusting module 904.
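The mobile terminal side can be sketched in the same spirit. The following Python example assumes, purely for illustration, that the control instruction arrives as a JSON object mapping audio names to target values; the payload format, the class name and the method names are not taken from the application.

```python
import json


class MobileTerminalVoiceControl:
    """Hypothetical counterpart of the two program modules shown in fig. 9."""

    def __init__(self, audio_params):
        self.audio_params = dict(audio_params)

    def receive(self, payload):
        """Stands in for receiving module 902: decode the received instruction."""
        return json.loads(payload)

    def adjust(self, instruction):
        """Stands in for adjusting module 904: apply the instruction locally."""
        for audio, value in instruction.items():
            # Audios the terminal does not have are ignored (an assumption).
            if audio in self.audio_params:
                self.audio_params[audio] = value
        return self.audio_params


terminal = MobileTerminalVoiceControl({"media": 5, "ringtone": 3})
instruction = terminal.receive('{"media": 8, "navigation": 4}')
new_params = terminal.adjust(instruction)
```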
In one embodiment, a computer device is provided. The computer device is a smart speaker or a mobile terminal and includes a memory and a processor. The memory stores a computer program that, when executed by the processor, causes the processor to execute the steps of the voice control method. Here, the steps of the voice control method may be the steps in the voice control method of any of the above embodiments.
In one embodiment, a computer-readable storage medium is provided, in which a computer program is stored. When executed by a processor, the computer program causes the processor to carry out the steps of the above voice control method. Here, the steps of the voice control method may be the steps in the voice control method of any of the above embodiments.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program instructing the relevant hardware. The program can be stored in a non-volatile computer-readable storage medium and, when executed, can include the processes of the method embodiments described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).
The technical features of the above embodiments can be combined arbitrarily. For the sake of brevity, not all possible combinations of these technical features are described; however, any combination of them should be considered to fall within the scope of this specification as long as it involves no contradiction.
The above embodiments express only several implementations of the present application. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the present application. It should be noted that a person skilled in the art can make several variations and modifications without departing from the concept of the present application, and these all fall within the scope of protection of the present application. Therefore, the scope of protection of this patent shall be subject to the appended claims.

Claims (13)

1. A voice control method, the method comprising:
a smart speaker acquiring voice information;
the smart speaker acquiring a control instruction generated according to the voice information; and
the smart speaker sending the control instruction to a mobile terminal through a Bluetooth protocol, wherein the control instruction is used to instruct the mobile terminal to adjust an audio parameter of the mobile terminal, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer.
2. The method of claim 1, wherein the smart speaker acquiring the control instruction generated according to the voice information comprises:
the smart speaker acquiring a generated control instruction comprising an audio and a corresponding audio parameter, wherein the audio parameter corresponding to the audio is determined according to the voice information.
3. The method of claim 2, wherein the smart speaker acquiring the generated control instruction comprising the audio and the corresponding audio parameter comprises:
the smart speaker acquiring a generated control instruction comprising audio parameters corresponding to at least two audios and an adjustment order, wherein the adjustment order is determined according to the time order in which the at least two audios appear in the voice information.
4. The method of claim 1, wherein the audio parameter comprises volume; and
the smart speaker acquiring the control instruction generated according to the voice information comprises:
the smart speaker acquiring a generated control instruction comprising a volume adjustment direction and a volume adjustment amplitude, wherein the volume adjustment direction is determined according to the voice information, and the volume adjustment amplitude is obtained according to the volume adjustment direction.
5. The method of claim 1, wherein after the smart speaker acquires the voice information, the method further comprises:
the smart speaker uploading the voice information to a cloud; and
the smart speaker acquiring the control instruction generated according to the voice information comprises:
the smart speaker receiving a control instruction that is returned by the cloud and generated according to the voice information.
6. A voice control method, the method comprising:
a mobile terminal receiving a control instruction sent by a smart speaker through a Bluetooth protocol, wherein the control instruction is generated according to voice information, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer; and
the mobile terminal adjusting an audio parameter of the mobile terminal according to the control instruction.
7. The method of claim 6, further comprising:
the mobile terminal identifying the control instruction and determining the audios in the control instruction; and
the mobile terminal counting the audio parameters corresponding to the same audio in the control instruction;
wherein the mobile terminal adjusting the audio parameter of the mobile terminal according to the control instruction comprises:
the mobile terminal adjusting the audio parameter of the corresponding audio in the mobile terminal according to the audio parameters corresponding to the audio.
8. The method of claim 6, further comprising:
the mobile terminal identifying the control instruction and determining the audios in the control instruction; and
the mobile terminal counting the audio parameter corresponding to each of at least two audios in the control instruction, and determining an adjustment order corresponding to each audio;
wherein the mobile terminal adjusting the audio parameter of the mobile terminal according to the control instruction comprises:
the mobile terminal sequentially adjusting the audio parameters of the corresponding audios in the mobile terminal according to the audio parameters corresponding to the audios and the adjustment order.
9. The method of claim 6, wherein the control instruction comprises an adjustment direction and a volume change amount; and
the mobile terminal adjusting the audio parameter of the mobile terminal according to the control instruction comprises:
the mobile terminal acquiring the current volume value of each audio of the mobile terminal;
the mobile terminal converting the volume change amount in the control instruction into a volume change amount corresponding to each audio of the mobile terminal; and
the mobile terminal adjusting the current volume value of each audio of the mobile terminal according to the adjustment direction and the volume change amount corresponding to that audio.
10. A voice control apparatus, applied to a smart speaker, the apparatus comprising:
a voice information acquisition module, configured to acquire voice information;
a control instruction acquisition module, configured to acquire a control instruction generated according to the voice information; and
a sending module, configured to send the control instruction to a mobile terminal through a Bluetooth protocol, wherein the control instruction is used to instruct the mobile terminal to adjust an audio parameter of the mobile terminal, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer.
11. A voice control apparatus, applied to a mobile terminal, the apparatus comprising:
a receiving module, configured to receive a control instruction sent by a smart speaker, wherein the control instruction is generated according to voice information, and the mobile terminal comprises at least one of a smartphone, a tablet computer and a notebook computer; and
an adjusting module, configured to adjust an audio parameter of the mobile terminal according to the control instruction.
12. A computer-readable storage medium, storing a computer program which, when executed by a processor, causes the processor to carry out the steps of the method according to any one of claims 1 to 9.
13. A computer device comprising a memory and a processor, the memory storing a computer program that, when executed by the processor, causes the processor to perform the steps of the method according to any one of claims 1 to 9.
CN201910740319.0A 2019-08-12 2019-08-12 Voice control method, device, computer equipment and computer readable storage medium Active CN110459215B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910740319.0A CN110459215B (en) 2019-08-12 2019-08-12 Voice control method, device, computer equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910740319.0A CN110459215B (en) 2019-08-12 2019-08-12 Voice control method, device, computer equipment and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN110459215A true CN110459215A (en) 2019-11-15
CN110459215B CN110459215B (en) 2024-05-14

Family

ID=68485983

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910740319.0A Active CN110459215B (en) 2019-08-12 2019-08-12 Voice control method, device, computer equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110459215B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111190566A (en) * 2019-12-27 2020-05-22 拉克诺德(深圳)科技有限公司 Volume control method and device, computer equipment and storage medium
CN111243589A (en) * 2020-01-14 2020-06-05 佛山市顺德区美的洗涤电器制造有限公司 Control method of household appliance, household appliance and computer readable storage medium
CN111341310A (en) * 2020-02-19 2020-06-26 北京声智科技有限公司 System, method and device for controlling mobile phone based on smart sound box and storage medium

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105578351A (en) * 2016-01-07 2016-05-11 北京乐动卓越科技有限公司 Music playing method for intelligent sound box system and intelligent sound box system
WO2017035967A1 (en) * 2015-09-01 2017-03-09 广东欧珀移动通信有限公司 Smart speaker volume control method, device, smart speaker and mobile terminal
CN107368278A (en) * 2017-06-30 2017-11-21 北京小米移动软件有限公司 Information processing method and device
CN108091331A (en) * 2017-12-27 2018-05-29 北京镁客创新工场科技有限公司 A kind of intelligent domestic system and its control method based on voice control
CN108509177A (en) * 2018-04-12 2018-09-07 出门问问信息科技有限公司 A kind of volume adjusting method, device, electronic equipment and storage medium
CN108563941A (en) * 2018-07-02 2018-09-21 信利光电股份有限公司 A kind of intelligent home equipment control method, intelligent sound box and intelligent domestic system
CN108810595A (en) * 2018-06-29 2018-11-13 四川斐讯信息技术有限公司 A kind of video broadcasting method and system based on speaker
CN109493865A (en) * 2018-10-17 2019-03-19 北京车和家信息技术有限公司 Signal processing method, terminal and vehicle
CN109841215A (en) * 2018-12-26 2019-06-04 珠海格力电器股份有限公司 Voice broadcasting method and device, storage medium and voice household appliance
CN209017333U (en) * 2018-06-20 2019-06-21 深圳市领芯者科技有限公司 Blue tooth voice controls equipment

Also Published As

Publication number Publication date
CN110459215B (en) 2024-05-14

Similar Documents

Publication Publication Date Title
CN110459215B (en) Voice control method, device, computer equipment and computer readable storage medium
CN107452386B (en) Voice data processing method and system
CN108320744B (en) Voice processing method and device, electronic equipment and computer readable storage medium
US20180103376A1 (en) Device and method for authenticating a user of a voice user interface and selectively managing incoming communications
CN107749299B (en) Multi-audio output method and device
CN111182390B (en) Volume data processing method and device, computer equipment and storage medium
CN107171412B (en) Dynamic adjusts method, apparatus, mobile terminal and the storage medium of charging current
CN105975063B (en) A kind of method and apparatus controlling intelligent terminal
CN101453598A (en) Electronic device and method capable of sound effect regulation according to customer location
CN110992963A (en) Network communication method, device, computer equipment and storage medium
CN107482710A (en) Wireless charging method and terminal
CN105120063A (en) Volume prompting method of input voice and electronic device
CN203689892U (en) Sightsinging place special-purpose personal audio frequency data adjusting apparatus
CN105744609B (en) Improve the method and device of mobile terminal power consumption
US10574707B1 (en) Reducing latency associated with communications
CN105139848A (en) Data conversion method and apparatus
CN105098882B (en) input current distribution method and device
WO2022262366A1 (en) Cross-device dialogue service connection method, system, electronic device, and storage medium
CN103905837A (en) Image processing method and device and terminal
CN109196936A (en) A kind of resource allocation indicating method and device, base station and terminal
CN106856339A (en) Charging method, device and electronic equipment
CN105721676B (en) A kind of terminal processing method and device
CN104038626B (en) The communication means of mobile computing device and accessory device
CN113451669A (en) Charging method and device, electronic device and storage medium
CN103634321B (en) Voice recognition result display method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant