CN108962259A - Processing method and the first electronic equipment - Google Patents

Processing method and the first electronic equipment

Publication number
CN108962259A
Authority
CN
China
Prior art keywords
sound
condition
electronic equipment
voice control
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810825087.4A
Other languages
Chinese (zh)
Other versions
CN108962259B (en)
Inventor
付宏让
赵佩璐
钱泰良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CN201810825087.4A priority Critical patent/CN108962259B/en
Publication of CN108962259A publication Critical patent/CN108962259A/en
Application granted granted Critical
Publication of CN108962259B publication Critical patent/CN108962259B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/34 Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The embodiments of the present application disclose a processing method and a first electronic equipment. The method first monitors voice input and, if a first sound is detected to meet a first condition among multiple preset conditions, starts a voice control function, where each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers. The method then acquires a second sound input after the first sound and obtains the analysis result of the first server associated with the first condition for the second sound. The same first electronic equipment can therefore access different servers depending on which preset condition a sound meets, so that one first electronic equipment can access multiple servers.

Description

Processing method and the first electronic equipment
Technical field
The present application relates to the technical field of data transmission, and more specifically to a processing method and a first electronic equipment.
Background
Voice control is widely used; for example, electronic equipment such as smart speakers has a voice control function.
Currently, electronic equipment with a voice control function can access only a single server. For example, when Amazon's Alexa smart speaker detects voice input, it sends the voice to the server corresponding to Amazon; when Google's smart speaker detects voice input, it sends the voice to the server corresponding to Google.
Summary of the invention
In view of this, the present application provides a processing method and a first electronic equipment.
To achieve the above object, the present application provides the following technical solutions:
A processing method, applied to a first electronic equipment, the processing method comprising:
Monitoring voice input;
If a first sound is detected to meet a first condition among multiple preset conditions, starting a voice control function;
Wherein each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers;
Acquiring a second sound input after the first sound;
Obtaining the analysis result of the first server associated with the first condition for the second sound.
Wherein starting the voice control function if the first sound is detected to meet the first condition among the multiple preset conditions comprises:
Detecting whether the first sound meets each of the multiple preset conditions, to obtain detection results of the first sound corresponding to the multiple preset conditions;
If the detection results include that the first sound meets the first condition, starting the voice control function.
Wherein the method further comprises:
Determining a first voice control mode from multiple voice control modes;
Wherein each voice control mode characterizes the preset condition that a sound must meet to start the voice control function of the first electronic equipment, and different voice control modes characterize different preset conditions;
The preset condition characterized by the first voice control mode, which a sound must meet to start the voice control function of the first electronic equipment, is the first condition.
Wherein starting the voice control function if the first sound is detected to meet the first condition comprises:
Detecting whether the first sound meets the first condition;
If the first sound meets the first condition, starting the voice control function.
Wherein starting the voice control function comprises:
Determining that the server that analyzes subsequently input voice is the first server associated with the first condition.
A first electronic equipment, comprising:
A voice receiver, configured to monitor voice input;
A processing chip, configured to start a voice control function if the voice receiver detects that a first sound meets a first condition among multiple preset conditions;
Wherein each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers;
The voice receiver is further configured to acquire a second sound input after the first sound;
The processing chip is further configured to obtain the analysis result of the first server associated with the first condition for the second sound;
An output device, configured to output the analysis result.
Wherein the first electronic equipment further comprises:
A transmission device, configured to carry out signal transmission with a second electronic equipment;
When starting the voice control function upon detecting that the first sound meets the first condition among the multiple preset conditions, the processing chip is specifically configured to:
Detect whether the first sound meets each of the multiple preset conditions, to obtain detection results of the first sound corresponding to the multiple preset conditions;
If the detection results include that the first sound meets the first condition, start the voice control function.
Wherein:
The processing chip is further configured to determine a first voice control mode from multiple voice control modes;
Wherein each voice control mode characterizes the preset condition that a sound must meet to start the voice control function of the first electronic equipment, and different voice control modes characterize different preset conditions;
The preset condition characterized by the first voice control mode, which a sound must meet to start the voice control function of the first electronic equipment, is the first condition.
Wherein, when starting the voice control function upon detecting that the first sound meets the first condition among the multiple preset conditions, the processing chip is specifically configured to:
Detect whether the first sound meets the first condition;
If the first sound meets the first condition, start the voice control function.
A first electronic equipment, comprising:
A memory, configured to store a program;
A processor, configured to execute the program, the program being specifically used for:
Monitoring voice input;
If a first sound is detected to meet a first condition among multiple preset conditions, starting a voice control function;
Wherein each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers;
Acquiring a second sound input after the first sound;
Obtaining the analysis result of the first server associated with the first condition for the second sound.
As can be seen from the above technical solutions, compared with the prior art, the present application discloses a processing method that first monitors voice input and, if a first sound is detected to meet a first condition among multiple preset conditions, starts a voice control function, where each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers. The method then acquires a second sound input after the first sound and obtains the analysis result of the first server associated with the first condition for the second sound. The same first electronic equipment can therefore access different servers depending on which preset condition a sound meets, so that one first electronic equipment can access multiple servers.
Description of the drawings
To describe the technical solutions in the embodiments of the present application or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Apparently, the drawings in the following description are merely embodiments of the present application, and those of ordinary skill in the art can obtain other drawings from the provided drawings without creative effort.
Fig. 1 is a structure diagram of one implementation of the processing system provided by the embodiments of the present application;
Fig. 2 is a structure diagram of another implementation of the processing system provided by the embodiments of the present application;
Fig. 3 is a signaling diagram of one implementation of the processing method provided by the embodiments of the present application;
Fig. 4 is a schematic diagram of one implementation of the voice control mode selection page provided by the embodiments of the present application;
Fig. 5 is a structure diagram of one implementation of the first electronic equipment provided by the embodiments of the present application;
Fig. 6 is a structure diagram of another implementation of the first electronic equipment provided by the embodiments of the present application;
Fig. 7 is a structure diagram of another implementation of the first electronic equipment provided by the embodiments of the present application;
Fig. 8 is a structure diagram of another implementation of the processing system provided by the embodiments of the present application;
Fig. 9 is a structure diagram of another implementation of the first electronic equipment provided by the embodiments of the present application.
Detailed description
The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings in the embodiments of the present application. Apparently, the described embodiments are only some, rather than all, of the embodiments of the present application. All other embodiments obtained by those of ordinary skill in the art based on the embodiments in the present application without creative effort shall fall within the protection scope of the present application.
The processing method provided by the embodiments of the present application can be applied to a processing system. Fig. 1 is a structure diagram of one implementation of the processing system provided by the embodiments of the present application.
The processing system includes a first electronic equipment 11 and multiple servers 12.
The first electronic equipment 11 can be a PAD, a smartphone, a laptop, a desktop computer, a smart speaker, a smart home device, or similar equipment.
Each server 12 can be a single background server, a server cluster composed of several servers, or a cloud computing service center.
In an optional embodiment, different servers correspond to different suppliers. The service functions that the servers of different suppliers can implement may differ. For example, the server corresponding to the supplier Amazon may implement a shopping service function, such as analyzing sounds that express shopping requests; as another example, the server corresponding to the supplier Google may implement a music service function, such as responding to sounds that request music playback.
Optionally, the service functions that the servers of different suppliers can implement may be the same.
Optionally, in the embodiments of the present application, servers can be divided by supplier, with different servers corresponding to different suppliers.
At present, electronic equipment (taking a smart speaker as an example) can access only one server. For example, if a smart speaker can access only the server of the supplier Amazon and it receives a sound intended for the server of the supplier Google or the server of the supplier Baidu, the smart speaker cannot respond to that sound, because it cannot access the server of Google or of Baidu.
The first electronic equipment 11 in the embodiments of the present application can access multiple servers, and can therefore respond to voice control instructions directed at different types of servers.
Fig. 2 is a structure diagram of another implementation of the processing system provided by the embodiments of the present application.
The processing system includes a first electronic equipment 21, a second electronic equipment 22, and multiple servers 12.
The first electronic equipment 21 can be equipment such as a docking station.
The second electronic equipment 22 can be a PAD, a smartphone, a laptop, a desktop computer, or similar equipment.
The first electronic equipment 21 and the second electronic equipment 22 can be connected wirelessly or by wire.
The first electronic equipment 21 and the second electronic equipment 22 can each have a wireless data transmission device and be wirelessly connected through these devices.
The wireless data transmission device can be a low-speed long-range transmission device or a high-speed short-range transmission device. The high-speed short-range transmission device includes a device that transmits data over ultra-high-frequency wireless signals and an NFC (Near Field Communication) device; the low-speed long-range transmission device includes a Wi-Fi (Wireless Fidelity) device and a Bluetooth device.
The data transmission rate of a device that transmits data over ultra-high-frequency wireless signals can reach up to 6 GB/s.
The frequency range of the ultra-high-frequency wireless signal is 3 GHz to 30 GHz.
In an optional embodiment, as shown in Fig. 2, the first electronic equipment 21 may include a carrying device that can carry the second electronic equipment 22. In another optional embodiment, the first electronic equipment 21 may not include a carrying device, that is, the first electronic equipment 21 does not carry the second electronic equipment 22. The first electronic equipment 21 and the second electronic equipment 22 need not be in contact and can be a certain distance apart.
Because the first electronic equipment 21 and the second electronic equipment 22 may be a certain distance apart, the wireless data transmission device can be a Wi-Fi device or a Bluetooth device. In practical applications, if the distance between the first electronic equipment 21 and the second electronic equipment 22 is less than a preset distance, such as 10 cm, the wireless data transmission device can be a high-speed short-range transmission device.
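The distance rule above can be sketched as a small selection function. This is only an illustrative sketch: the function name `choose_transport`, the returned labels, and the way the distance is measured are assumptions that do not come from the application; only the 10 cm example threshold is taken from the text.

```python
def choose_transport(distance_cm: float, preset_distance_cm: float = 10.0) -> str:
    """Pick a wireless link type from the distance between the two pieces of equipment.

    Hypothetical helper: the labels returned here are illustrative names only.
    """
    if distance_cm < 0:
        raise ValueError("distance must be non-negative")
    if distance_cm < preset_distance_cm:
        # Closer than the preset distance: a high-speed short-range link can be used.
        return "high-speed short-range"
    # Otherwise fall back to a longer-range link such as Wi-Fi or Bluetooth.
    return "wifi-or-bluetooth"
```

For example, `choose_transport(5)` selects the short-range link, while `choose_transport(200)` falls back to Wi-Fi or Bluetooth.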
Each server 12 can be a single server, a server cluster composed of several servers, or a cloud computing service center. For details, refer to the description of the server 12 given for Fig. 1, which is not repeated here.
In the embodiments of the present application, the first electronic equipment 21 can control the second electronic equipment 22 to access multiple servers, and can therefore respond to voice control instructions directed at different types of servers.
The processing method provided by the embodiments of the present application is described as a whole with reference to Fig. 1 or Fig. 2. Fig. 3 is a signaling diagram of one implementation of the processing method provided by the embodiments of the present application. The method includes:
Step S301: the first electronic equipment 31 monitors voice input.
The first electronic equipment 31 can be the first electronic equipment 11 described in Fig. 1 or the first electronic equipment 21 described in Fig. 2.
In an optional embodiment, the voice monitoring function of the first electronic equipment 31 can be kept on at all times, that is, the first electronic equipment can monitor external sound in real time.
In an optional embodiment, the first electronic equipment 31 can process the first sound, for example by performing noise reduction and/or encoding on the first sound, or by performing noise reduction and/or decoding on the first sound.
Step S302: if the first electronic equipment 31 detects that the first sound meets a first condition among multiple preset conditions, it starts the voice control function.
Wherein each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers.
In an optional embodiment, a preset condition can be: the first sound includes a wake-up word, or the first sound is a control instruction that starts the voice control function.
The first condition is any one of the multiple preset conditions.
Different preset conditions correspond to different wake-up words or control instructions. For example, the multiple preset conditions are preset condition 1, preset condition 2, and preset condition 3. Preset condition 1 can be: the first sound includes wake-up word 1, or the first sound is a first control instruction that starts the voice control function. Preset condition 2 can be: the first sound includes wake-up word 2, or the first sound is a second control instruction that starts the voice control function. Preset condition 3 can be: the first sound includes wake-up word 3, or the first sound is a third control instruction that starts the voice control function.
The first control instruction, the second control instruction, and the third control instruction are different control instructions; wake-up word 1, wake-up word 2, and wake-up word 3 are different wake-up words.
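One way to picture the association between preset conditions and servers is a small lookup table. The sketch below is an assumption-laden illustration, not the applicant's implementation: sounds are represented as already-transcribed text, and the class `PresetCondition`, the placeholder wake-up words, control instructions, and server names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PresetCondition:
    """One preset condition and the server associated with it (all values are placeholders)."""
    name: str
    wake_word: str
    control_instruction: str
    server: str

    def is_met_by(self, first_sound: str) -> bool:
        # Met if the sound contains the wake-up word or is exactly the control instruction.
        text = first_sound.strip().lower()
        return self.wake_word in text or text == self.control_instruction


PRESET_CONDITIONS = [
    PresetCondition("preset condition 1", "wake word 1", "control instruction 1", "server-1"),
    PresetCondition("preset condition 2", "wake word 2", "control instruction 2", "server-2"),
    PresetCondition("preset condition 3", "wake word 3", "control instruction 3", "server-3"),
]


def find_met_condition(first_sound: str) -> Optional[PresetCondition]:
    """Return the first preset condition met by the sound, or None if none is met."""
    for condition in PRESET_CONDITIONS:
        if condition.is_met_by(first_sound):
            return condition
    return None
```

Because each condition carries its own server, finding the met condition also identifies which server should analyze the subsequent input.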
If the first sound meets any one of the multiple preset conditions, the voice control function of the first electronic equipment is started. In the embodiments of the present application, after detecting a first sound that meets the first condition, the first electronic equipment enters a state of waiting for subsequent voice input; after receiving the subsequent voice input, it can send the subsequent sound to the server corresponding to the first condition so that the server performs speech recognition on it. In the embodiments of the present application, the functions that the first electronic equipment can perform after detecting a first sound that meets the first condition are called the voice control function.
In an optional embodiment, before detecting a first sound that meets the first condition, the first electronic equipment may detect multiple sounds that meet none of the preset conditions. Suppose a first sound that meets a preset condition is one that contains any of multiple wake-up words, and that these are wake-up word 1, wake-up word 2, and wake-up word 3. Before detecting a first sound containing wake-up word 1, wake-up word 2, or wake-up word 3, the first electronic equipment may also detect sounds such as "I have finished eating" or "very nice", which cannot start the voice control function of the first electronic equipment. If the voice control function has not been started, the first electronic equipment keeps checking whether the current voice input meets any of the multiple preset conditions, that is, it keeps looking for a first sound input that meets any of the multiple preset conditions.
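The monitoring behaviour described above, where non-matching sounds are ignored until one contains a wake-up word, can be sketched as a simple loop. This is a minimal sketch under stated assumptions: sounds are modelled as text strings, and `WAKE_WORDS` and `wait_for_first_sound` are hypothetical names with placeholder wake-up words.

```python
from typing import Iterable, Optional, Tuple

# Placeholder wake-up words; the application does not specify real ones here.
WAKE_WORDS = ("wake word 1", "wake word 2", "wake word 3")


def wait_for_first_sound(sounds: Iterable[str]) -> Optional[Tuple[int, str]]:
    """Scan monitored sounds until one contains a wake-up word.

    Returns (index of the matching sound, matched wake-up word), or None if
    the stream ends without any sound meeting a preset condition.
    """
    for index, sound in enumerate(sounds):
        text = sound.lower()
        for wake_word in WAKE_WORDS:
            if wake_word in text:
                return index, wake_word
        # No wake-up word found: the voice control function stays off
        # and monitoring continues with the next sound.
    return None
```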
Step S303: the first electronic equipment 31 acquires the second sound input after the first sound.
Step S304: the first electronic equipment 31 sends the second sound to the first server 12 corresponding to the first condition.
Different preset conditions characterize different servers: when the condition met by the first sound differs, the first server 12 differs. Optionally, when the condition met by the first sound differs, the supplier corresponding to the first server differs.
The first electronic equipment can send the second sound to the first server; the first server analyzes the second sound and feeds the analysis result back to the first electronic equipment.
In an optional embodiment, the first electronic equipment 31 can process the second sound, for example by performing noise reduction and/or encoding on the second sound, or by performing noise reduction and/or decoding on the second sound, and then send the processed sound to the first server 12.
In an optional embodiment, the first electronic equipment 31 can send the second sound directly to the first server 12 without processing it.
Step S305: the first server 12 analyzes the second sound, obtains an analysis result, and feeds the result back to the first electronic equipment 31.
In an optional embodiment, each server can be in the speech recognition state at all times: when it receives a second sound sent by the first electronic equipment, it analyzes that second sound; when it has not received a second sound from the first electronic equipment, it waits to receive one.
In an optional embodiment, each server may initially be in a non-speech-recognition state. When the first sound is detected to meet the first condition among the multiple preset conditions, the first server enters the speech recognition state because the first condition corresponds to it; because the first sound does not meet the preset conditions other than the first condition, the servers other than the first server remain in the non-speech-recognition state.
Step S306: the first electronic equipment 31 outputs the analysis result.
In an optional embodiment, the first electronic equipment 31 can output the analysis result by voice or show it on a display screen.
The present application discloses a processing method that first monitors voice input and, if a first sound is detected to meet a first condition among multiple preset conditions, starts a voice control function, where each preset condition is associated at least with a server that analyzes subsequently input voice, and different preset conditions are associated with different servers. The method then acquires a second sound input after the first sound and obtains the analysis result of the first server associated with the first condition for the second sound. That is, the same first electronic equipment can access different servers depending on which preset condition a sound meets, so that one first electronic equipment can access multiple servers.
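Steps S301 to S306 can be summarised in a short end-to-end sketch. It is only an illustration under several assumptions: sounds are text strings, the servers are local stand-in functions rather than real network services, and the names `server_a`, `server_b`, `SERVER_BY_WAKE_WORD`, and `process` are hypothetical.

```python
from typing import Callable, Dict, Iterable, Optional


def server_a(second_sound: str) -> str:
    """Stand-in for the server associated with the first placeholder wake-up word."""
    return f"server A analysed: {second_sound}"


def server_b(second_sound: str) -> str:
    """Stand-in for the server associated with the second placeholder wake-up word."""
    return f"server B analysed: {second_sound}"


# Each preset condition (here: a placeholder wake-up word) maps to its own server.
SERVER_BY_WAKE_WORD: Dict[str, Callable[[str], str]] = {
    "wake word 1": server_a,
    "wake word 2": server_b,
}


def process(sounds: Iterable[str]) -> Optional[str]:
    """Run the flow once and return the analysis result, or None if nothing matched."""
    stream = iter(sounds)
    for first_sound in stream:                          # S301: monitor voice input
        for wake_word, server in SERVER_BY_WAKE_WORD.items():
            if wake_word in first_sound.lower():        # S302: a preset condition is met
                second_sound = next(stream, None)       # S303: acquire the second sound
                if second_sound is None:
                    return None
                return server(second_sound)             # S304/S305: send and analyse
    return None                                         # S306 (output) is left to the caller
```

With this sketch, `process(["very nice", "wake word 2", "play music"])` routes "play music" to the second stand-in server, while the same second sound after "wake word 1" would go to the first.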
In an optional embodiment, "starting the voice control function" may include: determining that the server that analyzes subsequently input voice is the first server associated with the first condition.
With reference to Fig. 1, implementations of "determining that the server that analyzes subsequently input voice is the first server associated with the first condition" are described below. The embodiments of the present application provide, but are not limited to, the following ways.
First way: establish a communication connection between the first electronic equipment and the first server.
In an optional embodiment, before a first sound that meets the first condition among the multiple preset conditions is received, none of the servers associated with the multiple preset conditions is connected to the first electronic equipment. Optionally, when the first sound is detected to meet the first condition among the multiple preset conditions, a communication connection is established with the first server associated with the first condition. Optionally, the other servers, which are not associated with the first condition, remain unconnected to the first electronic equipment.
Optionally, in the first way, after the first electronic equipment establishes a communication connection with the first server, the first server can automatically enter the speech recognition state.
Second way: control the first server to enter the speech recognition state.
In an optional embodiment, before a first sound that meets the first condition among the multiple preset conditions is received, the servers associated with the multiple preset conditions are already connected to the first electronic equipment, but these servers may not have entered the speech recognition state. Optionally, when a first sound that meets the first condition among the multiple preset conditions is detected, the first server associated with the first condition is controlled to enter the speech recognition state. Optionally, the other servers, which are not associated with the first condition, still do not enter the speech recognition state.
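The two ways can be contrasted with a small state model. This is a hedged sketch: the `Server` class with its `connected` and `recognizing` flags and both function names are hypothetical, and no real networking is performed.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Server:
    """Toy model of a server's state as seen from the first electronic equipment."""
    name: str
    connected: bool = False
    recognizing: bool = False


def select_by_connecting(servers: Dict[str, Server], first: str) -> Server:
    """First way: connect only to the first server; other servers stay unconnected."""
    server = servers[first]
    server.connected = True
    # Optionally, the first server enters the speech recognition state after connecting.
    server.recognizing = True
    return server


def select_by_triggering(servers: Dict[str, Server], first: str) -> Server:
    """Second way: all servers are already connected; only the first one starts recognizing."""
    server = servers[first]
    if not server.connected:
        raise RuntimeError("the second way assumes the server is already connected")
    server.recognizing = True
    return server
```

In both cases only the first server ends up in the speech recognition state; the ways differ in whether the connection exists before the first sound is detected.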
With reference to Fig. 2, implementations of "determining that the server that analyzes subsequently input voice is the first server associated with the first condition" are described below. The embodiments of the present application provide, but are not limited to, the following ways.
First way: generate a first instruction, where the first instruction instructs the second electronic equipment to establish a communication connection with the first server.
In an optional embodiment, before a first sound that meets the first condition among the multiple preset conditions is received, none of the servers associated with the multiple preset conditions is connected to the second electronic equipment 22. Optionally, when the first sound is detected to meet the first condition among the multiple preset conditions, a first instruction can be generated to instruct the second electronic equipment 22 to establish a communication connection with the first server associated with the first condition. Optionally, the other servers, which are not associated with the first condition, remain unconnected to the second electronic equipment 22.
Optionally, in the first way, after the first electronic equipment establishes a communication connection with the first server, the first server can automatically enter the speech recognition state.
Second way: generate a second instruction, where the second instruction instructs the second electronic equipment to trigger the first server associated with the first condition to enter the speech recognition state.
In an optional embodiment, before a first sound that meets the first condition among the multiple preset conditions is received, the servers associated with the multiple preset conditions are already connected to the second electronic equipment, but these servers may not have entered the speech recognition state. Optionally, when a first sound that meets the first condition among the multiple preset conditions is detected, a second instruction is generated to instruct the second electronic equipment 22 to trigger the first server associated with the first condition to enter the speech recognition state. Optionally, the other servers, which are not associated with the first condition, still do not enter the speech recognition state.
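For the Fig. 2 arrangement, the choice between the first instruction and the second instruction might be sketched as follows. The types `InstructionKind` and `Instruction`, the function `build_instruction`, and the `already_connected` flag are hypothetical names used only for illustration; the application does not define an instruction format.

```python
from dataclasses import dataclass
from enum import Enum


class InstructionKind(Enum):
    ESTABLISH_CONNECTION = "first instruction"   # connect to the first server
    ENTER_RECOGNITION = "second instruction"     # trigger the speech recognition state


@dataclass(frozen=True)
class Instruction:
    """Message the first electronic equipment would send to the second electronic equipment."""
    kind: InstructionKind
    server: str


def build_instruction(first_server: str, already_connected: bool) -> Instruction:
    """Pick the instruction based on whether the servers are already connected."""
    if already_connected:
        kind = InstructionKind.ENTER_RECOGNITION
    else:
        kind = InstructionKind.ESTABLISH_CONNECTION
    return Instruction(kind, first_server)
```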
The mode of " the first sound of detection meets the first condition in multiple preset conditions, starts voice control function " has more Kind, the embodiment of the present application is provided but is not limited to following several:
The first: whether detection first sound meets multiple preset conditions respectively, with obtain first sound with The corresponding testing result of multiple preset conditions;If the testing result includes that first sound meets described first Part starts voice control function.
It can detect whether the first sound meets multiple preset conditions respectively.The power consumption of the first electronic equipment is increased, if First electronic equipment does not connect to power supply in real time, then can also reduce the cruising ability of the first electronic equipment.
To reduce the power consumption of the first electronic equipment, the embodiments of the present application also provide a second implementation.
The second implementation:
It can be understood that, when using the first electronic equipment, a user may, within a given period (for example, one or more weeks), only issue first sounds that meet the first condition among the multiple preset conditions. For example, the user may use the first electronic equipment only to play music for one or more weeks, with the music provided through the server corresponding to the supplier Google; in this case, the first condition may be that the first sound contains the wake-up word for Google.
With the first implementation, after the user issues the first sound "hi, Google", the first electronic equipment still checks the first sound against each of the multiple preset conditions, which slows processing and increases the power consumption of the first electronic equipment.
The second implementation includes:
From multiple voice control modes, determine a first voice control mode;
Wherein, a voice control mode characterizes the preset condition that a sound must meet in order to start the voice control function of the first electronic equipment, and different voice control modes characterize different preset conditions;
The first voice control mode characterizes that the preset condition a sound must meet in order to start the voice control function of the first electronic equipment is the first condition.
Correspondingly, "if the first sound is detected to meet the first condition, start the voice control function" includes:
Detect whether the first sound meets the first condition;
If the first sound meets the first condition, start the voice control function.
In summary, after the first sound is received, it is only necessary to detect whether the first sound meets the first condition, without detecting whether it meets any of the other conditions; this improves processing speed and reduces the power consumption of the first electronic equipment.
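A minimal Python sketch of the second implementation follows. It assumes, for illustration only, that each voice control mode maps to one wake-up word and that the first sound is available as a text transcript; the class `WakeDetector` and its members are hypothetical names.

```python
class WakeDetector:
    """Checks only the condition of the currently selected voice control mode."""

    def __init__(self, conditions):
        self.conditions = dict(conditions)  # mode name -> wake-up word
        self.mode = None                    # no voice control mode selected yet
        self.checks = 0                     # number of comparisons performed

    def select_mode(self, mode):
        """Determine the first voice control mode from the available modes."""
        if mode not in self.conditions:
            raise ValueError(f"unknown voice control mode: {mode}")
        self.mode = mode

    def should_start(self, first_sound):
        """Return True if the first sound meets the selected mode's condition."""
        if self.mode is None:
            return False
        self.checks += 1
        return self.conditions[self.mode] in first_sound.lower()
```

The `checks` counter makes the saving visible: each first sound costs one comparison regardless of how many modes are configured.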
There are multiple ways to implement "from multiple voice control modes, determine the first voice control mode"; the embodiments of the present application provide, but are not limited to, the following. First, at least one button is provided on the first electronic equipment, and the user can change the voice control mode of the first electronic equipment through the at least one button. Second, the first electronic equipment can display a voice control mode selection page that presents multiple voice control modes, and the first voice control mode is determined from among them.
In an optional embodiment, the voice control mode selection page may be as shown in Fig. 4. Optionally, a voice control mode may be identified by the name of a supplier; in Fig. 4 the voice control mode selection page may include Google, Baidu, Amazon, Xiaomi, and so on.
The methods are described in detail in the embodiments disclosed above. The methods of the present application can be implemented by devices of various forms; therefore, the present application also discloses a device, and specific embodiments are described in detail below.
Fig. 5 is a structure diagram of one implementation of the first electronic equipment provided by the embodiments of the present application. The first electronic equipment may include:
Pronunciation receiver 51, for monitoring voice input.
Optionally, pronunciation receiver 51 can be microphone.
Processing chip 52, configured to start the voice control function if it detects that the first sound received by the pronunciation receiver meets a first condition among multiple preset conditions.
Wherein, a preset condition is associated at least with a server for analyzing subsequently input voice, and different preset conditions are associated with different servers.
Optionally, the processing chip 52 may be a CPU (central processing unit) or a Bluetooth chip.
The pronunciation receiver 51 is further configured to acquire a second sound input after the first voice input.
The processing chip 52 is further configured to obtain the analysis result of the first server associated with the first condition for the second sound.
Output device 53, for exporting the analysis result.
Optionally, the output device may be a voice playing device or a display, and the voice playing device may be a loudspeaker. Optionally, the pronunciation receiver and the voice playing device are in the working state at the same time; that is, the first electronic equipment can monitor sound input while playing voice, or can play voice while monitoring sound input.
Optionally, the first electronic equipment may further include a noise processing device, configured to process the sound collected by the pronunciation receiver so as to reduce the noise contained in the sound.
Optionally, the first electronic equipment further includes a transmitting device, for signal transmission with the second electronic equipment;
When starting the voice control function upon detecting that the first sound meets the first condition among multiple preset conditions, the processing chip is specifically configured to:
Detect whether the first sound meets each of the multiple preset conditions, so as to obtain detection results of the first sound corresponding to the multiple preset conditions;
If the detection results include that the first sound meets the first condition, start the voice control function.
The transmitting device may be a wireless data transmission device or a wired transmission device.
Optionally, the processing chip is further configured to: from multiple voice control modes, determine a first voice control mode;
Wherein, a voice control mode characterizes the preset condition that a sound must meet in order to start the voice control function of the first electronic equipment, and different voice control modes characterize different preset conditions;
The first voice control mode characterizes that the preset condition a sound must meet in order to start the voice control function of the first electronic equipment is the first condition.
Optionally, when starting the voice control function upon detecting that the first sound meets the first condition among multiple preset conditions, the processing chip is specifically configured to:
Detect whether the first sound meets the first condition;
If the first sound meets the first condition, start the voice control function.
Optionally, when starting the voice control function, the processing chip is specifically configured to:
Determine that the server for analyzing subsequently input voice is the first server associated with the first condition.
The first electronic equipment shown in Fig. 5 can be implemented in multiple specific ways; the embodiments of the present application provide, but are not limited to, the following.
The first implementation: an implementation of the first electronic equipment is described with reference to Fig. 1. Fig. 6 is a structure diagram of another implementation of the first electronic equipment provided by the embodiments of the present application.
The first electronic equipment 11 includes at least one microphone 51, at least one loudspeaker 53, and a processing chip 52; optionally, it may further include a codec 61 and/or at least one amplifier 62. The connections among these components may be as shown in Fig. 6.
Optionally, the codec 61 may further include a DSP (Digital Signal Processing) 63, which processes the sound collected by the at least one microphone 51 to reduce the noise contained in the sound. Optionally, the DSP 63 and the codec 61 may be independent of each other, or the DSP may be integrated in the codec 61.
Optionally, the processing chip 52 may also integrate a storage unit, for example, DRAM (Dynamic Random Access Memory) and/or FLASH.
The wake-up words or control instructions corresponding to the multiple preset conditions can be stored in the storage unit.
The working principle of the first electronic equipment 11 shown in Fig. 6 is described below.
Any microphone among the at least one microphone 51 monitors voice input and sends the detected first sound to the codec 61. Optionally, the DSP in the codec 61 processes the first sound to obtain a noise-reduced first sound; the codec 61 encodes the noise-reduced first sound and sends the encoded first sound to the processing chip 52. The processing chip 52 detects whether the first sound meets any of the multiple preset conditions; if it detects that the first sound meets the first condition among the multiple preset conditions, it establishes a communication connection with the first server 12 and/or controls the first server 12 to enter the speech recognition state.
Any microphone among the at least one microphone 51 continues to collect the second sound and sends the detected second sound to the codec 61. Optionally, the DSP in the codec 61 processes the second sound to obtain a noise-reduced second sound; the codec 61 encodes the noise-reduced second sound and sends the encoded second sound to the processing chip 52, and the processing chip 52 sends the second sound to the first server 12 associated with the first condition.
The first server 12 performs speech analysis on the second sound to obtain an analysis result and sends the analysis result to the processing chip 52. The processing chip 52 sends the analysis result to the codec 61; optionally, the DSP 63 in the codec 61 performs noise reduction on the analysis result. The codec 61 decodes the noise-reduced analysis result to obtain the voice data corresponding to the analysis result, and the voice data is sent to the loudspeaker 53 through the amplifier 62, so that the first electronic equipment plays the analysis result as voice.
Optionally, the first electronic equipment 11 may have a display screen that can display the analysis result.
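The signal path of Fig. 6 for the second sound can be sketched in Python as follows. The sketch rests on several assumptions that are not in the application: sound is modelled as a list of 16-bit samples, noise reduction is a crude noise gate with an invented threshold `NOISE_FLOOR`, the codec simply packs samples into bytes, and the first server is any callable that maps encoded audio to an encoded result. All function names are hypothetical.

```python
import struct

NOISE_FLOOR = 50  # hypothetical threshold; quieter samples are treated as noise


def dsp_denoise(samples):
    """Crude noise gate standing in for the DSP 63."""
    return [s if abs(s) >= NOISE_FLOOR else 0 for s in samples]


def codec_encode(samples):
    """Pack 16-bit signed samples into little-endian bytes (stand-in for codec 61)."""
    return struct.pack(f"<{len(samples)}h", *samples)


def codec_decode(payload):
    """Inverse of codec_encode."""
    return list(struct.unpack(f"<{len(payload) // 2}h", payload))


def handle_second_sound(samples, server):
    """Microphone -> DSP -> codec -> processing chip -> server -> codec -> loudspeaker."""
    encoded = codec_encode(dsp_denoise(samples))
    result = server(encoded)      # analysis result returned by the first server
    return codec_decode(result)   # samples handed to the amplifier and loudspeaker
```

Passing an echoing callable as `server` is enough to exercise the whole path without a network connection.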
In an optional embodiment, if the storage capacity of the storage unit integrated in the processing chip 52 is small, an external storage unit 71 (for example, DRAM and/or FLASH) can be connected to the processing chip 52, and the wake-up words or control instructions corresponding to the multiple preset conditions are stored in the external storage unit 71. Fig. 7 is a structure diagram of yet another implementation of the first electronic equipment provided by the embodiments of the present application.
Fig. 7 differs from Fig. 6 in that, in Fig. 7, the storage unit that stores the wake-up words or control instructions corresponding to the multiple preset conditions is external, whereas in Fig. 6 it is integrated in the processing chip 52.
The second implementation: another implementation of the first electronic equipment is described with reference to Fig. 2. Fig. 8 is a structure diagram of another implementation of the processing system provided by the embodiments of the present application.
The first electronic equipment 21 shown in Fig. 8 includes a microphone 51 and a processing chip 52; optionally, it further includes a DSP 81, an amplifier 82, and a loudspeaker 84. In Fig. 8, the processing chip 52 is illustrated as a Bluetooth chip.
Optionally, the Bluetooth chip 52 includes a wireless data transmission device 83 and a processing unit 84; optionally, the wireless data transmission device 83 includes a wireless data receiving device 831 and a wireless data sending device 832.
Optionally, the Bluetooth chip 52 may integrate a storage unit (for example, DRAM and/or FLASH), or the wake-up words or control instructions corresponding to the multiple preset conditions may be stored in an external storage unit.
The working principle of the system shown in Fig. 8 is described below.
The microphone 51 monitors voice input and sends the detected first sound to the DSP 81. The DSP performs noise reduction on the first sound to obtain a processed first sound and sends the processed first sound to the Bluetooth chip 52. The processing unit 84 in the Bluetooth chip 52 detects whether the first sound meets any of the multiple preset conditions; if it detects that the first sound meets the first condition among the multiple preset conditions, it generates a control instruction (which may be the first instruction or the second instruction) and sends the control instruction to the second electronic equipment 22 through the wireless data sending device 832 in the wireless data transmission device 83.
Based on the control instruction, the second electronic equipment 22 establishes a communication connection with the first server 12 and/or controls the first server 12 to enter the speech recognition state.
The microphone 51 continues to collect the second sound and sends the second sound to the DSP 81. The DSP performs noise reduction on the second sound to obtain a processed second sound and sends the processed second sound to the Bluetooth chip 52. The processing unit 84 in the Bluetooth chip 52 controls the wireless data sending device 832 in the wireless data transmission device 83 to send the second sound to the second electronic equipment 22.
The second electronic equipment 22 sends the second sound to the first server 12.
The first server 12 performs speech analysis on the second sound to obtain an analysis result and sends the analysis result to the second electronic equipment 22; the second electronic equipment 22 sends the analysis result to the first electronic equipment 21.
The first electronic equipment 21 receives the analysis result through the wireless data receiving device 831 in the wireless data transmission device 83. The processing unit 84 in the first electronic equipment 21 controls the wireless data sending device 832 to send the voice data corresponding to the analysis result through the amplifier 82 to the loudspeaker 84, so as to play the voice data corresponding to the analysis result.
Optionally, the first electronic equipment 21 may have a display screen that can display the analysis result.
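The relayed flow of Fig. 8 can be sketched in Python as follows. This is an illustration under assumptions: sounds are text transcripts, the Bluetooth link and the network are replaced by direct method calls, a single wake-up word is configured, and the class names `FirstServer`, `RelayDevice` and `HeadsetDevice` are hypothetical.

```python
class FirstServer:
    """Stands in for the first server 12."""

    def __init__(self):
        self.recognizing = False

    def analyze(self, second_sound):
        if not self.recognizing:
            raise RuntimeError("server is not in the speech recognition state")
        return f"result for: {second_sound}"


class RelayDevice:
    """Stands in for the second electronic equipment 22."""

    def __init__(self, server):
        self.server = server

    def on_control_instruction(self):
        # Trigger the associated server to enter the speech recognition state.
        self.server.recognizing = True

    def forward(self, second_sound):
        # Pass the second sound on and hand the analysis result back.
        return self.server.analyze(second_sound)


class HeadsetDevice:
    """Stands in for the first electronic equipment 21, which talks only to the relay."""

    def __init__(self, relay, wake_word):
        self.relay = relay
        self.wake_word = wake_word
        self.awake = False

    def hear(self, sound):
        if not self.awake:
            if self.wake_word in sound.lower():
                self.relay.on_control_instruction()
                self.awake = True
            return None
        return self.relay.forward(sound)
```

The first device never addresses the server directly in this sketch; every message passes through `RelayDevice`, mirroring the role of the second electronic equipment 22.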
Fig. 9 is a structure diagram of yet another implementation of the first electronic equipment provided by the embodiments of the present application. The first electronic equipment includes:
Memory 91, for storing a program;
Processor 92, for executing the program, the program being specifically used for:
Monitoring voice input;
If it is detected that the first sound meets the first condition among multiple preset conditions, starting the voice control function;
Wherein, a preset condition is associated at least with a server for analyzing subsequently input voice, and different preset conditions are associated with different servers;
Acquiring a second sound input after the first voice input;
Obtaining the analysis result of the first server associated with the first condition for the second sound.
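The program steps listed above can be sketched as one loop in Python. The sketch assumes that sounds arrive as text transcripts, that each preset condition is a wake-up word, that exactly one second sound follows each wake-up, and that each server is a callable; the function `run` and its parameters are hypothetical names.

```python
def run(utterances, condition_to_server, servers):
    """Process utterances in order and collect analysis results.

    utterances: iterable of text transcripts heard by the device.
    condition_to_server: wake-up word -> server name.
    servers: server name -> callable that analyses a second sound.
    """
    results = []
    active_server = None  # None means the voice control function is not started
    for sound in utterances:
        if active_server is None:
            text = sound.lower()
            for wake_word, server_name in condition_to_server.items():
                if wake_word in text:
                    active_server = server_name  # start the voice control function
                    break
            continue
        # The sound following a wake-up is the second sound; send it for analysis.
        results.append(servers[active_server](sound))
        active_server = None  # wait for the next wake-up word
    return results
```

Because the server is chosen by the condition that was met, different wake-up words route the following sound to different servers.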
The memory 91 may include high-speed RAM, and may also include non-volatile memory, for example at least one magnetic disk storage.
The processor 92 may be a central processing unit (CPU), an application-specific integrated circuit (ASIC), or one or more integrated circuits configured to implement the embodiments of the present application.
Optionally, the electronic equipment may further include a communication bus 93 and a communication interface 94, where the memory 91, the processor 92, and the communication interface 94 communicate with one another through the communication bus 93;
Optionally, the communication interface 94 may be the interface of a communication module, such as the interface of a GSM module.
The embodiments of the present application also provide a readable storage medium on which a computer program is stored, characterized in that, when the computer program is executed by a processor, each step of any of the above processing methods is implemented.
It should be noted that the embodiments in this specification are described in a progressive manner: each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments can be referred to one another. The device and system embodiments are described relatively briefly because they are basically similar to the method embodiments; for related details, refer to the description of the method embodiments.
It should also be noted that, herein, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise", and any other variants are intended to cover a non-exclusive inclusion, so that a process, method, article, or equipment that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or equipment. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or equipment that includes the element.
The steps of the methods or algorithms described in connection with the embodiments disclosed herein can be implemented directly in hardware, in a software module executed by a processor, or in a combination of the two. The software module can be placed in random access memory (RAM), memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, a register, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the technical field.
The foregoing description of the disclosed embodiments enables those skilled in the art to implement or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the present application. Therefore, the present application is not limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A processing method, characterized in that it is applied to a first electronic equipment, the processing method comprising:
Monitoring voice input;
If it is detected that a first sound meets a first condition among multiple preset conditions, starting a voice control function;
Wherein, a preset condition is associated at least with a server for analyzing subsequently input voice, and different preset conditions are associated with different servers;
Acquiring a second sound input after the first voice input;
Obtaining the analysis result of the first server associated with the first condition for the second sound.
2. The processing method according to claim 1, characterized in that starting the voice control function if it is detected that the first sound meets the first condition among multiple preset conditions comprises:
Detecting whether the first sound meets each of the multiple preset conditions, so as to obtain detection results of the first sound corresponding to the multiple preset conditions;
If the detection results include that the first sound meets the first condition, starting the voice control function.
3. The processing method according to claim 1, characterized by further comprising:
From multiple voice control modes, determining a first voice control mode;
Wherein, a voice control mode characterizes the preset condition that a sound must meet in order to start the voice control function of the first electronic equipment, and different voice control modes characterize different preset conditions;
The first voice control mode characterizes that the preset condition a sound must meet in order to start the voice control function of the first electronic equipment is the first condition.
4. The processing method according to claim 3, characterized in that starting the voice control function if it is detected that the first sound meets the first condition comprises:
Detecting whether the first sound meets the first condition;
If the first sound meets the first condition, starting the voice control function.
5. The processing method according to claim 2 or 4, characterized in that starting the voice control function comprises:
Determining that the server for analyzing subsequently input voice is the first server associated with the first condition.
6. A first electronic equipment, characterized by comprising:
A pronunciation receiver, for monitoring voice input;
A processing chip, configured to start a voice control function if it detects that a first sound received by the pronunciation receiver meets a first condition among multiple preset conditions;
Wherein, a preset condition is associated at least with a server for analyzing subsequently input voice, and different preset conditions are associated with different servers;
The pronunciation receiver is further configured to acquire a second sound input after the first voice input;
The processing chip is further configured to obtain the analysis result of the first server associated with the first condition for the second sound;
An output device, for outputting the analysis result.
7. The first electronic equipment according to claim 6, characterized by further comprising:
A transmitting device, for signal transmission with a second electronic equipment;
When starting the voice control function upon detecting that the first sound meets the first condition among multiple preset conditions, the processing chip is specifically configured to:
Detect whether the first sound meets each of the multiple preset conditions, so as to obtain detection results of the first sound corresponding to the multiple preset conditions;
If the detection results include that the first sound meets the first condition, start the voice control function.
8. The first electronic equipment according to claim 6, characterized in that:
The processing chip is further configured to: from multiple voice control modes, determine a first voice control mode;
Wherein, a voice control mode characterizes the preset condition that a sound must meet in order to start the voice control function of the first electronic equipment, and different voice control modes characterize different preset conditions;
The first voice control mode characterizes that the preset condition a sound must meet in order to start the voice control function of the first electronic equipment is the first condition.
9. The first electronic equipment according to claim 8, characterized in that, when starting the voice control function upon detecting that the first sound meets the first condition among multiple preset conditions, the processing chip is specifically configured to:
Detect whether the first sound meets the first condition;
If the first sound meets the first condition, start the voice control function.
10. A first electronic equipment, characterized by comprising:
A memory, for storing a program;
A processor, for executing the program, the program being specifically used for:
Monitoring voice input;
If it is detected that a first sound meets a first condition among multiple preset conditions, starting a voice control function;
Wherein, a preset condition is associated at least with a server for analyzing subsequently input voice, and different preset conditions are associated with different servers;
Acquiring a second sound input after the first voice input;
Obtaining the analysis result of the first server associated with the first condition for the second sound.
CN201810825087.4A 2018-07-25 2018-07-25 Processing method and first electronic device Active CN108962259B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810825087.4A CN108962259B (en) 2018-07-25 2018-07-25 Processing method and first electronic device

Publications (2)

Publication Number Publication Date
CN108962259A true CN108962259A (en) 2018-12-07
CN108962259B CN108962259B (en) 2021-06-15

Family

ID=64464137

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810825087.4A Active CN108962259B (en) 2018-07-25 2018-07-25 Processing method and first electronic device

Country Status (1)

Country Link
CN (1) CN108962259B (en)

Also Published As

Publication number Publication date
CN108962259B (en) 2021-06-15

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant