CN102999161A - Implementation method and application of voice awakening module - Google Patents

Implementation method and application of voice awakening module Download PDF

Info

Publication number
CN102999161A
CN102999161A CN2012104551752A CN201210455175A CN102999161A CN 102999161 A CN102999161 A CN 102999161A CN 2012104551752 A CN2012104551752 A CN 2012104551752A CN 201210455175 A CN201210455175 A CN 201210455175A CN 102999161 A CN102999161 A CN 102999161A
Authority
CN
China
Prior art keywords
word
voice
score
wake
phoneme
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012104551752A
Other languages
Chinese (zh)
Other versions
CN102999161B (en
Inventor
操文祥
王海坤
康怀茂
钱勇
谢信珍
黄海兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Science And Technology University Information Flying South China Institute Of Artificial Intelligence (guangzhou) Co Ltd
Original Assignee
iFlytek Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by iFlytek Co Ltd filed Critical iFlytek Co Ltd
Priority to CN201210455175.2A priority Critical patent/CN102999161B/en
Publication of CN102999161A publication Critical patent/CN102999161A/en
Application granted granted Critical
Publication of CN102999161B publication Critical patent/CN102999161B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses an implementation method and application of a voice awakening module. The implementation method comprises the following steps of: voice input (1), voice awakening algorithm (2) and awakening actuation (3), wherein the voice awakening algorithm (2) is implemented through the following main steps of: acoustic feature extraction (4), awakening word detection (5), awakening word confirmation (6), construction of an awakening word detection network (7), training of an acoustic model (8) and construction of an awakening word confirming network (9) and the like. The invention has the advantages that even under a noisy environment, no matter whether the music is played, the voice awakening function can be started by the voice awakening word, and the recognition awakening effect is good; and the implementation method can be planted onto an ARM or DSP universal process for operation and is applied in the fields related to vehicle mounting and household appliances.

Description

A kind of implementation method of voice wake module and application
Technical field
The invention discloses a kind of implementation method and application of voice wake module, be specifically related to a kind ofly say that by the user predetermined voice wake word up and come triggering system to carry out next step operation of user, can use with needs and realize the fields such as vehicle-mounted and household electrical appliances that voice wake up.
Background technology
The present invention relates to one and applied for the invention disclosed patent, publication number is: CN102645977A, and the applying date is 2012.03.26, the inventor is Yin Jianhong, Wang Zhong, Zhou Yanhuang, name is called " a kind of vehicle-mounted voice wakes man-machine interactive system and method up ", at this it is drawn to be list of references.The vehicle-mounted voice of this invention wakes up realizes that principle is: deposit the information such as sound bank, vehicle-mounted noise storehouse, speech engine in the flash storer that sets in advance, compare via the phonetic order relevant information of master controller MCU and memory stores by the phonetic order of microphone input and to carry out speech recognition, and with the phonetic order relevant information determined behind the matching identification as carrying out instruction control vehicle-mounted control functional unit block, realize its corresponding function.What involved flash deposited in this invention all is the data of fixing, and under the vehicle environment, because road speed, road conditions, weather, the variation such as the vehicle-mounted noise storehouse that all can cause engine noise and tyre noise that opens a window of whether turning on the aircondition, the music of playing in the car is different, the difference of speaker can cause the sound bank of institute's reference to change, so realize the voice arousal function under the scene that this invention is only applicable to fix.And the present invention trains a kind of acoustic model by gathering different speaker recording datas under all kinds of scenes, wakes the word Sampling network up and confirms network by structure simultaneously, so that the present invention adapts to scene is more extensive, the voice wake-up effect is good simultaneously.
Summary of the invention
The objective of the invention is in order to solve the deficiencies in the prior art, a kind of implementation method of voice waken system is provided, no matter even whether music playing is arranged, can wake word opening voice arousal function up by voice under noisy environment, the voice wake-up effect is good simultaneously; The present invention also provides the application of voice waken system in addition, comprises being applied to application vehicle-mounted and the household electrical appliances association area.
The present invention is achieved by the following technical solutions: a kind of implementation method of voice wake module comprises: phonetic entry 1, voice wake algorithm 2 up and wake up and carry out 3 steps, voice wake the voice signal that algorithm 2 obtains phonetic entry 1 up, after carrying out the voice wake up process, the result exported to wake up carry out 3, thereby finish wake operation;
Described voice wake algorithm 2 up and extract 4, wake word up and detect 5, wake word up and confirm 6, make up and wake word Sampling network 7, training acoustic model 8 and structure up and wake word up and confirm that network 9 realizes that the specific implementation process is as follows by acoustic feature:
The first step, acoustic feature extracts 4: obtain the voice signal input by phonetic entry 1, extraction has the property distinguished and feature that be based on the human hearing characteristic extraction, usually choose MFCC (Mel-Frequency Cepstrum Coefficient, the Mel frequency cepstrum coefficient) feature used in the speech recognition as acoustic feature;
Second step, wake word up and detect 5: the acoustic feature that extraction is obtained, adopt the acoustic model 8 of training waking word Sampling network 7 calculating acoustics scores up, if comprise the word that wakes up that will detect in the path of score optimum, then determine to have detected to wake word up, enter the operation of the 3rd step, re-start extraction acoustic feature 4 steps otherwise get back to the first step;
In the 3rd step, wake word up and confirm 6: with the acoustic feature that extraction obtains, the acoustic model 8 that adopts training confirms that network 9 wakes word up and confirms waking word up, is finally confirmed score; Whether that judges that this detects wakes word up for waking really word up, being about to this final affirmation score and predefined thresholding that wakes word up compares, if confirm that finally score is more than or equal to thresholding, think that then this wakes word up is to wake really word up, voice wake up successfully, the result exported to wake up carry out 3, thereby finish the voice wake operation; If finally confirm score less than thresholding, think that then this wakes word up and is the false word that wakes up, come back to the first step and re-start acoustic feature and extract 4 steps.
The training of described acoustic model 8 is divided into two parts, is respectively phoneme acoustic model and garbage model (being the Garbage model); The phoneme acoustic model adopts the acoustic training model method in traditional speech recognition, choose database, utilization is based on MLE (Maximum Likelihood Estimation, maximal possibility estimation) and under MPE (Minimum Phone Error, the minimum phoneme mistake) property the distinguished training criterion obtain; The Garbage model is used for absorbing the irrelevant voice except waking word up, use and train the same database of phoneme model, by calculating the similarity between each phoneme model, each phoneme is divided into 20 classes, use all training datas corresponding to every class phoneme to merge, adopt Garbage model corresponding to MLE criterion training, just obtain 20 class Garbage models.
The described implementation method of waking word Sampling network 7 up is to adopt optimum score path computing to draw, and the described optimum computing formula that gets sub-path is:
W = arg max W P ( W ) P ( X | W ) - - - ( 2 )
Wherein X represents the acoustic feature vector that extracts from the input voice, and W represents the optimum word sequence of score maximum; Conditional probability P (X|W) is the acoustic model score, calculates by the acoustic model 8 that trains; Prior probability P (W) is the language model score, is the added PenaltyP of different acoustic models (X) as total probability, when acoustic model be exactly definite value after waking the word Sampling network up and deciding.
The described word that wakes up confirms that network (9) implementation method is:
The word that wakes up that a. will detect is decoded to the phoneme one-level, and records all score (Score Phone1, Score Phone2..., Score PhoneN), wherein N wakes phoneme number total in the word up;
Score Phone1, Score Phone2..., Score PhoneNWhat represent respectively that this wakes all phonemes in the word up is the decoding score, and wherein subscript represents the sign of N phoneme of phoneme.
B. use and wake word up and detect same feature, obtain corresponding acoustics score, and be accurate to frame one-level (Score Frame1, Score Frame2..., Score FrameM), wherein M is the total duration of this feature, take frame as unit;
C. calculate and wake each phoneme of word up and really recognize minute, account form is as follows:
C M phonei = ( Score phonei - Σ k = K istart K iend Score framek ) / ( K iend - K istart ) - - - ( 3 )
K wherein IstartAnd K IendBe respectively zero-time and the concluding time of i phoneme;
CM PhoneiRepresent that i phoneme recognize minute really, subscript phonei represents i phoneme, Score PhoneiThe decoding score of i phone as shown above, Score FramekExpression is used and is waken the score that the k frame that network decoding obtains confirmed in word up.
D. calculate the final affirmation score that this wakes word up, account form is as follows:
C M word = 1 N Σ i = 1 N C M phonei - - - ( 4 )
Method of the present invention can be transplanted on ARM or the DSP general processor, is applied to vehicle-mounted and the household electrical appliances association area.
A kind of vehicle-mounted voice waken system is characterized in that comprising: microprocessor, voice wake module, audio conversion device, recording device, apparatus for processing audio, public address system; Wherein the voice wake module operates in the microprocessor, and the specific implementation process is as follows:
The first step, microprocessor and apparatus for processing audio interconnection, control apparatus for processing audio output audio information, and apparatus for processing audio and public address system interconnection are carried out power amplification with required playback of audio information to promote the loudspeaker playback, finish the audio frequency play operation;
Second step, the interconnection of recording device and audio conversion device when the user says voice when waking word up, is carried out the voice typing and is passed to the audio conversion device conversion by recording device, finishes the voice collecting operation;
In the 3rd step, audio conversion device carries out data-switching to the voice messaging of recording device typing, and the data after will changing are simultaneously passed to the computing that microprocessor carries out the described voice wake module of claim 1, finish the voice data conversion operations;
The 4th step, microprocessor and audio conversion device interconnection, the voice messaging that audio conversion device is inputted carries out the computing of voice wake module, wakes information up if correctly identify voice, then control apparatus for processing audio and play the voice suggestion sound, finish that vehicle-mounted voice wakes up and the prompt tone play operation; If identification makes mistakes, then proceed the operation of second step voice collecting.
The present invention's advantage compared with prior art is:
(1) the present invention wakes word up as trigger source by user's voice, adds that waking word up detects and wake up the word affirmation, even no matter whether music playing is arranged, can wake word opening voice arousal function up by voice under noisy environment, the voice wake-up effect is good; Simultaneously also need not the user and utilize bimanualness, only realize fast arousal function by voice command, carry out next step interactive operation.
(2) the present invention realizes that cost is low, and code is transplanted convenient, has good application value.
(3) the present invention can be widely used in the fields such as vehicle-mounted and household electrical appliances, can also be widely used in each field that other audio plays needs voice to wake up simultaneously.Under vehicle environment, do not use and want to start recognition function in user's driving process before the native system and need to manually remove operation push-button, suspend the music of current broadcast, cause the driving process to have potential safety hazard; User's experience effect is poor simultaneously.
(4) value brought of the present invention is, can wake word opening voice arousal function up by the voice of saying agreement after using native system, need not to suspend in advance audio frequency and plays, and simultaneously by actual testing authentication, correctly identifies and wakes rate up and can reach more than 90%; Such as field of household appliances, the user just when TV reception, looks on the bright side of things and opens speech identifying function at other, also can wake word up by voice and realize, so that interactive voice is more convenient, more humane.
(5) the voice arousal function among the present invention is all realized by software algorithm, can be transplanted to very easily on the general processors such as ARM or DSP.
Description of drawings
Fig. 1 is the schematic block diagram that the present invention realizes;
Fig. 2 is that structure of the present invention wakes word Sampling network schematic block diagram up;
Fig. 3 is that structure of the present invention wakes word affirmation network schematic block diagram up;
Fig. 4 is that the present invention is at the implementation synoptic diagram of automotive field.
Embodiment
As shown in Figure 1, the realization of voice wake module of the present invention wakes algorithm 2 up by phonetic entry 1, voice and wakes up and carry out the realization of 3 steps.
Voice wake algorithm 2 up and realize mainly being extracted 4, being waken up word and detect 5, wake word up and confirm 6, make up and wake word Sampling network 7, training acoustic model 8 and structure up and wake word up and confirm that network 9 finishes by acoustic feature, and the specific implementation process is:
(1) training acoustic model 8: the training of acoustic model is divided into two parts, is respectively phoneme acoustic model and garbage model (being the Garbage model).The phoneme acoustic model adopts the acoustic training model method in traditional speech recognition, choose suitable database, utilization is based on MLE (Maximum Likelihood Estimation, maximal possibility estimation) and MPE (Minimum Phone Error, minimum phoneme mistake) distinguish obtaining under the property training criterion.The Garbage model is used for absorbing the irrelevant voice except waking word up, use and train the same database of phoneme model, by calculating the similarity between each phoneme model, each phoneme is divided into 20 classes, use all training datas corresponding to every class phoneme to merge, adopt Garbage model corresponding to MLE criterion training, so namely obtain 20 class Garbage models.The Garbage model has adopted the phoneme training data combined training of cluster, and two kinds of purposes are arranged, and is used for absorbing other voice except waking word up in waking the word Sampling network up, is used for calculating the score of confirming network in waking word affirmation network up.
(2) acoustic feature extracts 4: obtain the voice signal input by phonetic entry 1, extraction can have certain differentiation, and be based on the feature that human hearing characteristic extracts, generally choose MFCC (Mel-Frequency Cepstrum Coefficient, the Mel frequency cepstrum coefficient) feature of using in the speech recognition.
(3) wake word up and detect 5: with the acoustic feature that extraction obtains, use acoustic model 8 waking word Sampling network 7 calculating acoustics scores up, if comprise the word that wakes up that will detect in the path of score optimum, then detect and wake word up, enter next step operation; Otherwise again extract the acoustic feature operation.In order to guarantee that waking word up can be detected normally, invalid voice can effectively be absorbed again simultaneously.The structure that wakes Sampling network up mainly by the user select wake word up and the Garbage model forms, as shown in Figure 2, this network is also referred to as recognition network in speech recognition, to detect network configuration very simple owing to wake up, or can by simple program manual construction.Because the complicacy of practical service environment, under many circumstances, what receive wakes voice up by noise pollution, wake a lot of that the score of feature on the phoneme acoustic model of acoustics corresponding to voice will reduce this moment up, and because the Garbage model is to use more phoneme combined training to obtain, itself be not very accurate, the amplitude that the score of acoustic feature on the Garbage model reduces is limited, wake voice this moment up and just absorbed by Garbage model mistake, the system wake-up rate will reduce.
In order to prevent the generation of above-mentioned situation, when waking the word Sampling network up and decode, the decoding score of the arc at Garbage place is certain punishment, i.e. Penalty, make its can not with the fair competition of phoneme acoustic model, also can normally be detected to ensure by the voice that wake up of noise pollution.Concrete punishment amplitude need to be done experimental adjustment for the different words that wakes up.
The implementation method of waking word Sampling network 7 up is to adopt optimum score path computing to draw.
Optimum that obtaining of sub-path adopted classical Bayesian formula, as follows:
Figure BDA00002396147200051
The acoustic feature vector that the X representative is extracted from the input voice in the following formula, W represents the optimum word sequence of score maximum.Conditional probability P (X|W) is the acoustic model score, can calculate by phoneme acoustic model and the garbage model that trains, and prior probability P (W) is the language model score, can be understood as here the added Penalty of different acoustic models.P (X) is total probability, and when acoustic model be exactly definite value after waking the word Sampling network up and deciding, so formula (1) can be written as:
W = arg max W P ( W ) P ( X | W ) - - - ( 2 )
(4) wake word up and confirm 6: because the complicacy that has inexactness and practical service environment of acoustic model itself, not necessarily wake really word up by waking the word that wakes up that the word detection obtains up.In order to reduce the non-problem that the false wake-up that brings and back can cause of waking up, need to do further to confirm to the word that wakes up that detection obtains.The present invention adopts the mode of accompanying drawing 3 to make up and wakes word affirmation network 9 up, wake that network confirmed in word and to wake the word Sampling network up the same up, all belong to the recognition network in the speech recognition, confirm only to comprise the Garbage model in the network, can use simple program or manual construction.
The key step of waking the word affirmation up is as follows:
A) will wake word up and detect and to obtain waking up word and be decoded to the phoneme one-level, and record its all score (Score Phone1, Score Phone2..., Score PhoneN), wherein N wakes phoneme number total in the word up.
B) use and wake word up and detect same feature, confirm that network obtains corresponding acoustics score waking word up, and be accurate to frame one-level (Score Frame1, Score Frame2..., Score FrameM), wherein M is the total duration of this feature, take frame as unit.
C) calculate and wake each phoneme of word up and really recognize minute, account form is as follows:
C M phonei = ( Score phonei - Σ k = K istart K iend Score framek ) / ( K iend - K istart ) - - - ( 3 )
K wherein IstartAnd K IendBe respectively zero-time and the concluding time of i phoneme.
D) calculate the final affirmation score that this wakes word up, account form is as follows:
C M word = 1 N Σ i = 1 N C M phonei - - - ( 4 )
E) judge that whether this wakes word up for waking really word up, contrast final affirmation score and predefined thresholding that this wakes word up, if confirm score C M WordThink then that greater than thresholding T this wakes word up for waking really word up, wakes up successfully; If CM WordThink then that less than thresholding T this wakes word up and is the false word that wakes up, re-start acoustic feature and extract.
Realize the voice arousal function by above work, result feedback is given to wake up and is carried out 3 the most at last, carries out wake operation.
As shown in Figure 4, provided the implementation synoptic diagram of the present invention in automotive field, the vehicle-mounted voice waken system, its structure comprises: microprocessor 11, preferentially select the ARM9 processor, but be not limited to this microprocessor; The voice wake module operates in the microprocessor 11; Audio conversion device 12 is preferentially selected WM8731, but is not limited to this audio conversion device; Recording device 13 is preferentially selected the high electret microphone of cost performance, but is not limited to this recording device; Apparatus for processing audio 14 is preferentially selected TDA7419, but is not limited to this apparatus for processing audio; Public address system 15, the four unit loudspeaker (left front loudspeaker, left back loudspeaker, right front loudspeaker, right back loudspeaker) that adopt power amplifier TDA7388 and automobile to carry, but be not limited to this power amplifier and vehicle-mounted loudspeaker unit; Voice wake command word, preferential select " automobile language point " do not wake word up but be not limited to these voice.
Realize that principle mainly comprises audio frequency broadcast, data under voice, voice data conversion, voice wake up and the step such as prompt tone broadcast is finished.Specific as follows:
The, when the user uses native system to listen to music when driving, music can be radio/TV/other sources of sound such as DVD/line in of the audio frequency that provides of the broadcast module by microprocessor ARM9 or accessing to audio processor TDA7419; The music of all broadcasts promotes vehicle-mounted loudspeaker by power amplifier TDA7388 again and broadcasts after carrying out the audio processing by audio process first, finishes audio frequency broadcast work;
The second, when saying specific voice, the user wakes word up---when " automobile language point ", user's speaking volume should keep the level of normally speaking, the too little meeting of sound causes the electret microphone record less than voice signal, and sound is crossed conference and caused recording to cut the top, all can cause the arousal function failure; Include the microphone signal that voice wake word information up, through carrying out analog to digital conversion among the audio converter WM8731, finish speech signal collection work;
Three, the voice acquisition module of microprocessor ARM9 is carried out analog to digital conversion work by iic bus control audio converter WM8731, convert the microphone location signal to digital signal, and return to microprocessor by the IIS bus, finish the voice data conversion work;
Four, microprocessor training acoustic model extracts user's acoustic feature of microphone signal input, after waking the word Sampling network up and waking word affirmation network up, realizes the voice arousal function.Simultaneously by audio process play cuing tone signal, finish that whole voice wake up and the prompt tone play operation.
More than be the preferential embodiment of the present invention, the user can wake word opening voice recognition function up by special sound equally when not music playing or non-driving.
The non-elaborated part of the present invention belongs to techniques well known.And above-described embodiment does not limit the present invention in any form, and all employings are equal to replaces or technical scheme that the form of equivalent transformation obtains, all drops within protection scope of the present invention.

Claims (6)

1. the implementation method of a voice wake module, it is characterized in that comprising: phonetic entry (1), voice wake algorithm (2) up and wake execution (3) step up, voice wake algorithm (2) up and obtain the voice signal of phonetic entry (1), after carrying out the voice wake up process, the result exported to wake execution (3) up, thereby finish wake operation;
Described voice wake algorithm (2) up and extract (4), wake word up and detect (5), wake word up and confirm (6), make up and wake word Sampling network (7), training acoustic model (8) and structure up and wake word affirmation network (9) up and realize that the specific implementation process is as follows by acoustic feature:
The first step, acoustic feature extracts (4): obtain the voice signal input by phonetic entry (1), extraction has the property distinguished and feature that be based on the human hearing characteristic extraction, usually choose MFCC (Mel-Frequency Cepstrum Coefficient, the Mel frequency cepstrum coefficient) feature used in the speech recognition as acoustic feature;
Second step, wake word up and detect (5): the acoustic feature that extraction is obtained, adopt the acoustic model (8) of training waking word Sampling network (7) calculating acoustics score up, if comprise the word that wakes up that will detect in the path of acoustics score optimum, then determine to have detected to wake word up, enter the operation of the 3rd step, re-start extraction acoustic feature (4) step otherwise get back to the first step;
In the 3rd step, wake word up and confirm (6): with the acoustic feature that extraction obtains, the acoustic model (8) that adopts training confirms that network (9) wakes word up and confirms waking word up, is finally confirmed score; Whether that judges that this detects wakes word up for waking really word up, be about to this and wake final affirmation score and the predefined thresholding of word up, if confirm that finally score is more than or equal to thresholding, think that then this wakes word up is to wake really word up, voice wake up successfully, the result exported to wake execution (3) up, thereby finish the voice wake operation; If finally confirm score less than thresholding, think that then this wakes word up and is the false word that wakes up, come back to the first step and re-start acoustic feature extraction (4) step.
2. the implementation method of voice wake module according to claim 1, it is characterized in that: the training of described acoustic model (8) is divided into two parts, is respectively phoneme acoustic model and garbage model (being the Garbage model); The phoneme acoustic model adopts the acoustic training model method in traditional speech recognition, choose database, utilization is based on MLE (Maximum Likelihood Estimation, maximal possibility estimation) and under MPE (Minimum Phone Error, the minimum phoneme mistake) property the distinguished training criterion obtain; The Garbage model is used for absorbing the irrelevant voice except waking word up, use and train the same database of phoneme model, by calculating the similarity between each phoneme model, each phoneme is divided into 20 classes, use all training datas corresponding to every class phoneme to merge, adopt Garbage model corresponding to MLE criterion training, just obtain 20 class Garbage models.
3. the implementation method of voice wake module according to claim 1 is characterized in that: the described implementation method of waking word Sampling network (7) up is to adopt optimum score path computing to draw, and the computing formula of described optimum sub-path is:
W = arg max W P ( W ) P ( X | W )
Wherein X represents the acoustic feature vector that extracts from the input voice, and W represents the optimum word sequence of score maximum; Conditional probability P (X|W) is the acoustic model score, calculates by the acoustic model (8) that trains; Prior probability P (W) is the language model score, is the added PenaltyP of different acoustic models (X) as total probability, when acoustic model with to wake up after the word Sampling network is decided namely be definite value.
4. the implementation method of voice wake module according to claim 1 is characterized in that: the described word that wakes up confirms that network (9) implementation method is:
The word that wakes up that a. will detect is decoded to the phoneme one-level, and records all score (Score Phone1, Score Phone2..., Score PhoneN), wherein N wakes phoneme number total in the word, Score up Phone1, Score Phone2..., Score PhoneNWhat represent respectively that this wakes all phonemes in the word up is the decoding score, and wherein subscript represents the sign of N phoneme of phoneme;
B. use and wake word up and detect same feature, obtain corresponding acoustics score, and be accurate to frame one-level (Score Frame1, Score Frame2..., Score FrameM), wherein M is the total duration of this feature, take frame as unit;
C. calculate and wake each phoneme of word up and really recognize minute, account form is as follows:
C M phonei = ( Score phonei - Σ k = K istart K iend Score framek ) / ( K iend - K istart )
K wherein IstartAnd K IendBe respectively zero-time and the concluding time of i phoneme;
CM PhoneiRepresent that i phoneme recognize minute really, subscript phonei represents i phoneme, Score PhoneiThe decoding score of i phone as shown above, Score FramekExpression is used and is waken the score that the k frame that network decoding obtains confirmed in word up;
D. calculate the final affirmation score that this wakes word up, account form is as follows:
C M word = 1 N Σ i = 1 N C M phonei .
5. the implementation method of a kind of voice wake module according to claim 1, it is characterized in that: described method can be transplanted on ARM or the DSP general processor and move, and is applied to vehicle-mounted and the household electrical appliances association area.
6. vehicle-mounted voice waken system, it is characterized in that comprising: microprocessor, the described voice wake module of claim 1, audio conversion device, recording device, apparatus for processing audio, public address system, described voice wake module operates in the microprocessor, and the specific implementation process is as follows:
The first step, microprocessor and apparatus for processing audio interconnection, control apparatus for processing audio output audio information, and apparatus for processing audio and public address system interconnection are carried out power amplification with required playback of audio information to promote the loudspeaker playback, finish the audio frequency play operation;
Second step, the interconnection of recording device and audio conversion device when the user says voice when waking word up, is carried out the voice typing and is passed to the audio conversion device conversion by recording device, finishes the voice collecting operation;
In the 3rd step, audio conversion device carries out data-switching to the voice messaging of recording device typing, and the data after will changing are simultaneously passed to the computing that microprocessor carries out the voice wake module, finish the voice data conversion operations;
The 4th step, microprocessor and audio conversion device interconnection, the voice messaging that audio conversion device is inputted carries out the computing of voice wake module, wakes information up if correctly identify voice, then control apparatus for processing audio and play the voice suggestion sound, finish that vehicle-mounted voice wakes up and the prompt tone play operation; If identification makes mistakes, then proceed the operation of second step voice collecting.
CN201210455175.2A 2012-11-13 2012-11-13 A kind of implementation method of voice wake-up module and application Active CN102999161B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210455175.2A CN102999161B (en) 2012-11-13 2012-11-13 A kind of implementation method of voice wake-up module and application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210455175.2A CN102999161B (en) 2012-11-13 2012-11-13 A kind of implementation method of voice wake-up module and application

Publications (2)

Publication Number Publication Date
CN102999161A true CN102999161A (en) 2013-03-27
CN102999161B CN102999161B (en) 2016-03-02

Family

ID=47927817

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210455175.2A Active CN102999161B (en) 2012-11-13 2012-11-13 A kind of implementation method of voice wake-up module and application

Country Status (1)

Country Link
CN (1) CN102999161B (en)

Cited By (93)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714815A (en) * 2013-12-09 2014-04-09 何永 Voice control method and device thereof
CN103943105A (en) * 2014-04-18 2014-07-23 安徽科大讯飞信息科技股份有限公司 Voice interaction method and system
CN104464723A (en) * 2014-12-16 2015-03-25 科大讯飞股份有限公司 Voice interaction method and system
CN104616653A (en) * 2015-01-23 2015-05-13 北京云知声信息技术有限公司 Word match awakening method, work match awakening device, voice awakening method and voice awakening device
WO2015154412A1 (en) * 2014-09-05 2015-10-15 中兴通讯股份有限公司 Method and device for awakening voice control system, and terminal
CN105096939A (en) * 2015-07-08 2015-11-25 百度在线网络技术(北京)有限公司 Voice wake-up method and device
CN105141919A (en) * 2015-09-01 2015-12-09 武汉同迅智能科技有限公司 Monitoring terminal device remotely controlled by voice
CN105556595A (en) * 2013-09-17 2016-05-04 高通股份有限公司 Method and apparatus for adjusting detection threshold for activating voice assistant function
CN105575395A (en) * 2014-10-14 2016-05-11 中兴通讯股份有限公司 Voice wake-up method and apparatus, terminal, and processing method thereof
CN105632486A (en) * 2015-12-23 2016-06-01 北京奇虎科技有限公司 Voice wake-up method and device of intelligent hardware
CN105654949A (en) * 2016-01-07 2016-06-08 北京云知声信息技术有限公司 Voice wake-up method and device
CN105702253A (en) * 2016-01-07 2016-06-22 北京云知声信息技术有限公司 Voice awakening method and device
CN105812573A (en) * 2016-04-28 2016-07-27 努比亚技术有限公司 Voice processing method and mobile terminal
CN106094673A (en) * 2016-08-30 2016-11-09 奇瑞商用车(安徽)有限公司 Automobile wakes up word system and control method thereof up
CN106161755A (en) * 2015-04-20 2016-11-23 钰太芯微电子科技(上海)有限公司 A kind of key word voice wakes up system and awakening method and mobile terminal up
CN106297777A (en) * 2016-08-11 2017-01-04 广州视源电子科技股份有限公司 A kind of method and apparatus waking up voice service up
CN106469554A (en) * 2015-08-21 2017-03-01 科大讯飞股份有限公司 A kind of adaptive recognition methodss and system
CN106611597A (en) * 2016-12-02 2017-05-03 百度在线网络技术(北京)有限公司 Voice wakeup method and voice wakeup device based on artificial intelligence
CN106653010A (en) * 2015-11-03 2017-05-10 络达科技股份有限公司 Electronic device and method for waking up electronic device through voice recognition
CN106653022A (en) * 2016-12-29 2017-05-10 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN106847273A (en) * 2016-12-23 2017-06-13 北京云知声信息技术有限公司 The wake-up selected ci poem selection method and device of speech recognition
CN107123417A (en) * 2017-05-16 2017-09-01 上海交通大学 Optimization method and system are waken up based on the customized voice that distinctive is trained
CN107220532A (en) * 2017-04-08 2017-09-29 网易(杭州)网络有限公司 For the method and apparatus by voice recognition user identity
CN107591151A (en) * 2017-08-22 2018-01-16 百度在线网络技术(北京)有限公司 Far field voice awakening method, device and terminal device
CN107767863A (en) * 2016-08-22 2018-03-06 科大讯飞股份有限公司 voice awakening method, system and intelligent terminal
CN107767861A (en) * 2016-08-22 2018-03-06 科大讯飞股份有限公司 voice awakening method, system and intelligent terminal
CN107895573A (en) * 2017-11-15 2018-04-10 百度在线网络技术(北京)有限公司 Method and device for identification information
CN108039175A (en) * 2018-01-29 2018-05-15 北京百度网讯科技有限公司 Audio recognition method, device and server
CN108122556A (en) * 2017-08-08 2018-06-05 问众智能信息科技(北京)有限公司 Reduce the method and device that driver's voice wakes up instruction word false triggering
CN108198548A (en) * 2018-01-25 2018-06-22 苏州奇梦者网络科技有限公司 A kind of voice awakening method and its system
CN108320733A (en) * 2017-12-18 2018-07-24 上海科大讯飞信息科技有限公司 Voice data processing method and device, storage medium, electronic equipment
CN108352168A (en) * 2015-11-24 2018-07-31 英特尔Ip公司 The low-resource key phrase detection waken up for voice
CN108447472A (en) * 2017-02-16 2018-08-24 腾讯科技(深圳)有限公司 Voice awakening method and device
CN108536668A (en) * 2018-02-26 2018-09-14 科大讯飞股份有限公司 Wake up word appraisal procedure and device, storage medium, electronic equipment
CN108597506A (en) * 2018-03-13 2018-09-28 广州势必可赢网络科技有限公司 A kind of intelligent wearable device alarming method for power and intelligent wearable device
CN108962240A (en) * 2018-06-14 2018-12-07 百度在线网络技术(北京)有限公司 A kind of sound control method and system based on earphone
CN109102806A (en) * 2018-09-29 2018-12-28 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for interactive voice
CN109119078A (en) * 2018-10-26 2019-01-01 北京石头世纪科技有限公司 Automatic robot's control method, device, automatic robot and medium
CN109192210A (en) * 2018-10-25 2019-01-11 腾讯科技(深圳)有限公司 A kind of method of speech recognition, the method and device for waking up word detection
CN109243426A (en) * 2018-09-19 2019-01-18 易诚博睿(南京)科技有限公司 A kind of automatization judgement voice false wake-up system and its judgment method
CN109448720A (en) * 2018-12-18 2019-03-08 维拓智能科技(深圳)有限公司 Convenience service self-aided terminal and its voice awakening method
CN109672775A (en) * 2017-10-16 2019-04-23 腾讯科技(北京)有限公司 Adjust the method, apparatus and terminal of wakeup sensitivity
CN109753665A (en) * 2019-01-30 2019-05-14 北京声智科技有限公司 Wake up the update method and device of model
CN109878218A (en) * 2019-01-30 2019-06-14 厦门爱立得科技有限公司 A kind of printer and its Method of printing with intelligent sound control
WO2019113911A1 (en) * 2017-12-15 2019-06-20 海尔优家智能科技(北京)有限公司 Device control method, cloud device, smart device, computer medium and device
CN110033758A (en) * 2019-04-24 2019-07-19 武汉水象电子科技有限公司 A kind of voice wake-up implementation method based on small training set optimization decoding network
CN110097870A (en) * 2018-01-30 2019-08-06 阿里巴巴集团控股有限公司 Method of speech processing, device, equipment and storage medium
CN110177317A (en) * 2019-05-17 2019-08-27 腾讯科技(深圳)有限公司 Echo cancel method, device, computer readable storage medium and computer equipment
CN110390933A (en) * 2018-04-20 2019-10-29 比亚迪股份有限公司 State methods of exhibiting, device and the displaying vehicle system of vehicle intelligent voice system
CN110473536A (en) * 2019-08-20 2019-11-19 北京声智科技有限公司 A kind of awakening method, device and smart machine
CN110600008A (en) * 2019-09-23 2019-12-20 苏州思必驰信息科技有限公司 Voice wake-up optimization method and system
CN110727821A (en) * 2019-10-12 2020-01-24 深圳海翼智新科技有限公司 Method, apparatus, system and computer storage medium for preventing device from being awoken by mistake
CN110770093A (en) * 2017-08-07 2020-02-07 微芯片技术股份有限公司 Voice activated actuation of automotive features
CN110809796A (en) * 2017-10-24 2020-02-18 北京嘀嘀无限科技发展有限公司 Speech recognition system and method with decoupled wake phrases
CN110989963A (en) * 2019-11-22 2020-04-10 北京梧桐车联科技有限责任公司 Awakening word recommendation method and device and storage medium
CN111128134A (en) * 2018-10-11 2020-05-08 阿里巴巴集团控股有限公司 Acoustic model training method, voice awakening method, device and electronic equipment
CN111247582A (en) * 2018-09-28 2020-06-05 搜诺思公司 System and method for selective wake word detection using neural network models
CN111739513A (en) * 2020-07-22 2020-10-02 江苏清微智能科技有限公司 Automatic voice awakening test system and test method thereof
CN111819533A (en) * 2018-10-11 2020-10-23 华为技术有限公司 Method for triggering electronic equipment to execute function and electronic equipment
CN111862963A (en) * 2019-04-12 2020-10-30 阿里巴巴集团控股有限公司 Voice wake-up method, device and equipment
CN112382303A (en) * 2016-08-05 2021-02-19 搜诺思公司 Playback device, method for playback device, and computer-readable medium
CN112420051A (en) * 2020-11-18 2021-02-26 青岛海尔科技有限公司 Equipment determination method, device and storage medium
CN112655043A (en) * 2018-09-11 2021-04-13 日本电信电话株式会社 Keyword detection device, keyword detection method, and program
CN113038048A (en) * 2021-03-02 2021-06-25 海信视像科技股份有限公司 Far-field voice awakening method and display device
CN113535913A (en) * 2021-06-02 2021-10-22 科大讯飞股份有限公司 Answer scoring method and device, electronic equipment and storage medium
WO2023029442A1 (en) * 2021-08-30 2023-03-09 佛山市顺德区美的电子科技有限公司 Smart device control method and apparatus, smart device, and readable storage medium
US11727933B2 (en) 2016-10-19 2023-08-15 Sonos, Inc. Arbitration-based voice recognition
US11778259B2 (en) 2018-09-14 2023-10-03 Sonos, Inc. Networked devices, systems and methods for associating playback devices based on sound codes
US11792590B2 (en) 2018-05-25 2023-10-17 Sonos, Inc. Determining and adapting to changes in microphone performance of playback devices
US11790937B2 (en) 2018-09-21 2023-10-17 Sonos, Inc. Voice detection optimization using sound metadata
US11797263B2 (en) 2018-05-10 2023-10-24 Sonos, Inc. Systems and methods for voice-assisted media content selection
US11798553B2 (en) 2019-05-03 2023-10-24 Sonos, Inc. Voice assistant persistence across multiple network microphone devices
US11816393B2 (en) 2017-09-08 2023-11-14 Sonos, Inc. Dynamic computation of system response volume
US11817076B2 (en) 2017-09-28 2023-11-14 Sonos, Inc. Multi-channel acoustic echo cancellation
US11817083B2 (en) 2018-12-13 2023-11-14 Sonos, Inc. Networked microphone devices, systems, and methods of localized arbitration
US11832068B2 (en) 2016-02-22 2023-11-28 Sonos, Inc. Music service selection
US11854547B2 (en) 2019-06-12 2023-12-26 Sonos, Inc. Network microphone device with command keyword eventing
US11863593B2 (en) 2016-02-22 2024-01-02 Sonos, Inc. Networked microphone device control
US11862161B2 (en) 2019-10-22 2024-01-02 Sonos, Inc. VAS toggle based on device orientation
US11869503B2 (en) 2019-12-20 2024-01-09 Sonos, Inc. Offline voice control
WO2024011885A1 (en) * 2022-07-15 2024-01-18 北京百度网讯科技有限公司 Voice wakeup method and apparatus, electronic device, and storage medium
US11881222B2 (en) 2020-05-20 2024-01-23 Sonos, Inc Command keywords with input detection windowing
US11881223B2 (en) 2018-12-07 2024-01-23 Sonos, Inc. Systems and methods of operating media playback systems having multiple voice assistant services
US11887598B2 (en) 2020-01-07 2024-01-30 Sonos, Inc. Voice verification for media playback
US11893308B2 (en) 2017-09-29 2024-02-06 Sonos, Inc. Media playback system with concurrent voice assistance
US11900937B2 (en) 2017-08-07 2024-02-13 Sonos, Inc. Wake-word detection suppression
US11899519B2 (en) 2018-10-23 2024-02-13 Sonos, Inc. Multiple stage network microphone device with reduced power consumption and processing load
US11947870B2 (en) 2016-02-22 2024-04-02 Sonos, Inc. Audio response playback
US11961519B2 (en) 2020-02-07 2024-04-16 Sonos, Inc. Localized wakeword verification
US11973893B2 (en) 2018-08-28 2024-04-30 Sonos, Inc. Do not disturb feature for audio notifications
US11979960B2 (en) 2016-07-15 2024-05-07 Sonos, Inc. Contextualization of voice inputs
US11983463B2 (en) 2016-02-22 2024-05-14 Sonos, Inc. Metadata exchange involving a networked playback system and a networked microphone system
US11984123B2 (en) 2020-11-12 2024-05-14 Sonos, Inc. Network device interaction by range

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1256460A (en) * 1999-11-19 2000-06-14 清华大学 Phonetic command controller
US20020143540A1 (en) * 2001-03-28 2002-10-03 Narendranath Malayath Voice recognition system using implicit speaker adaptation
CN101516005A (en) * 2008-02-23 2009-08-26 华为技术有限公司 Speech recognition channel selecting system, method and channel switching device
KR20090123396A (en) * 2008-05-28 2009-12-02 (주)파워보이스 System for robust voice activity detection and continuous speech recognition in noisy environment using real-time calling key-word recognition

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1256460A (en) * 1999-11-19 2000-06-14 清华大学 Phonetic command controller
US20020143540A1 (en) * 2001-03-28 2002-10-03 Narendranath Malayath Voice recognition system using implicit speaker adaptation
CN101516005A (en) * 2008-02-23 2009-08-26 华为技术有限公司 Speech recognition channel selecting system, method and channel switching device
KR20090123396A (en) * 2008-05-28 2009-12-02 (주)파워보이스 System for robust voice activity detection and continuous speech recognition in noisy environment using real-time calling key-word recognition

Cited By (114)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105556595A (en) * 2013-09-17 2016-05-04 高通股份有限公司 Method and apparatus for adjusting detection threshold for activating voice assistant function
CN103714815A (en) * 2013-12-09 2014-04-09 何永 Voice control method and device thereof
CN103943105A (en) * 2014-04-18 2014-07-23 安徽科大讯飞信息科技股份有限公司 Voice interaction method and system
WO2015154412A1 (en) * 2014-09-05 2015-10-15 中兴通讯股份有限公司 Method and device for awakening voice control system, and terminal
CN105575395A (en) * 2014-10-14 2016-05-11 中兴通讯股份有限公司 Voice wake-up method and apparatus, terminal, and processing method thereof
CN104464723A (en) * 2014-12-16 2015-03-25 科大讯飞股份有限公司 Voice interaction method and system
CN104616653A (en) * 2015-01-23 2015-05-13 北京云知声信息技术有限公司 Word match awakening method, work match awakening device, voice awakening method and voice awakening device
CN106161755A (en) * 2015-04-20 2016-11-23 钰太芯微电子科技(上海)有限公司 A kind of key word voice wakes up system and awakening method and mobile terminal up
CN105096939A (en) * 2015-07-08 2015-11-25 百度在线网络技术(北京)有限公司 Voice wake-up method and device
CN106469554A (en) * 2015-08-21 2017-03-01 科大讯飞股份有限公司 A kind of adaptive recognition methodss and system
CN105141919A (en) * 2015-09-01 2015-12-09 武汉同迅智能科技有限公司 Monitoring terminal device remotely controlled by voice
CN106653010A (en) * 2015-11-03 2017-05-10 络达科技股份有限公司 Electronic device and method for waking up electronic device through voice recognition
CN106653010B (en) * 2015-11-03 2020-07-24 络达科技股份有限公司 Electronic device and method for waking up electronic device through voice recognition
CN108352168A (en) * 2015-11-24 2018-07-31 英特尔Ip公司 The low-resource key phrase detection waken up for voice
CN108352168B (en) * 2015-11-24 2023-08-04 英特尔公司 Low resource key phrase detection for voice wakeup
CN105632486A (en) * 2015-12-23 2016-06-01 北京奇虎科技有限公司 Voice wake-up method and device of intelligent hardware
CN105632486B (en) * 2015-12-23 2019-12-17 北京奇虎科技有限公司 Voice awakening method and device of intelligent hardware
CN105654949B (en) * 2016-01-07 2019-05-07 北京云知声信息技术有限公司 A kind of voice awakening method and device
CN105654949A (en) * 2016-01-07 2016-06-08 北京云知声信息技术有限公司 Voice wake-up method and device
CN105702253A (en) * 2016-01-07 2016-06-22 北京云知声信息技术有限公司 Voice awakening method and device
US11832068B2 (en) 2016-02-22 2023-11-28 Sonos, Inc. Music service selection
US11863593B2 (en) 2016-02-22 2024-01-02 Sonos, Inc. Networked microphone device control
US11947870B2 (en) 2016-02-22 2024-04-02 Sonos, Inc. Audio response playback
US11983463B2 (en) 2016-02-22 2024-05-14 Sonos, Inc. Metadata exchange involving a networked playback system and a networked microphone system
CN105812573A (en) * 2016-04-28 2016-07-27 努比亚技术有限公司 Voice processing method and mobile terminal
US11979960B2 (en) 2016-07-15 2024-05-07 Sonos, Inc. Contextualization of voice inputs
CN112382303A (en) * 2016-08-05 2021-02-19 搜诺思公司 Playback device, method for playback device, and computer-readable medium
US11934742B2 (en) 2016-08-05 2024-03-19 Sonos, Inc. Playback device supporting concurrent voice assistants
CN106297777A (en) * 2016-08-11 2017-01-04 广州视源电子科技股份有限公司 A kind of method and apparatus waking up voice service up
CN106297777B (en) * 2016-08-11 2019-11-22 广州视源电子科技股份有限公司 A kind of method and apparatus waking up voice service
CN107767861A (en) * 2016-08-22 2018-03-06 科大讯飞股份有限公司 voice awakening method, system and intelligent terminal
CN107767863A (en) * 2016-08-22 2018-03-06 科大讯飞股份有限公司 voice awakening method, system and intelligent terminal
CN106094673A (en) * 2016-08-30 2016-11-09 奇瑞商用车(安徽)有限公司 Automobile wakes up word system and control method thereof up
US11727933B2 (en) 2016-10-19 2023-08-15 Sonos, Inc. Arbitration-based voice recognition
CN106611597B (en) * 2016-12-02 2019-11-08 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN106611597A (en) * 2016-12-02 2017-05-03 百度在线网络技术(北京)有限公司 Voice wakeup method and voice wakeup device based on artificial intelligence
CN106847273B (en) * 2016-12-23 2020-05-05 北京云知声信息技术有限公司 Awakening word selection method and device for voice recognition
CN106847273A (en) * 2016-12-23 2017-06-13 北京云知声信息技术有限公司 The wake-up selected ci poem selection method and device of speech recognition
CN106653022A (en) * 2016-12-29 2017-05-10 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN106653022B (en) * 2016-12-29 2020-06-23 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN108447472B (en) * 2017-02-16 2022-04-05 腾讯科技(深圳)有限公司 Voice wake-up method and device
CN108447472A (en) * 2017-02-16 2018-08-24 腾讯科技(深圳)有限公司 Voice awakening method and device
CN107220532A (en) * 2017-04-08 2017-09-29 网易(杭州)网络有限公司 For the method and apparatus by voice recognition user identity
CN107123417A (en) * 2017-05-16 2017-09-01 上海交通大学 Optimization method and system are waken up based on the customized voice that distinctive is trained
CN107123417B (en) * 2017-05-16 2020-06-09 上海交通大学 Customized voice awakening optimization method and system based on discriminant training
US11900937B2 (en) 2017-08-07 2024-02-13 Sonos, Inc. Wake-word detection suppression
CN110770093A (en) * 2017-08-07 2020-02-07 微芯片技术股份有限公司 Voice activated actuation of automotive features
CN108122556A (en) * 2017-08-08 2018-06-05 问众智能信息科技(北京)有限公司 Reduce the method and device that driver's voice wakes up instruction word false triggering
CN107591151A (en) * 2017-08-22 2018-01-16 百度在线网络技术(北京)有限公司 Far field voice awakening method, device and terminal device
US11816393B2 (en) 2017-09-08 2023-11-14 Sonos, Inc. Dynamic computation of system response volume
US11817076B2 (en) 2017-09-28 2023-11-14 Sonos, Inc. Multi-channel acoustic echo cancellation
US11893308B2 (en) 2017-09-29 2024-02-06 Sonos, Inc. Media playback system with concurrent voice assistance
CN109672775A (en) * 2017-10-16 2019-04-23 腾讯科技(北京)有限公司 Adjust the method, apparatus and terminal of wakeup sensitivity
CN109672775B (en) * 2017-10-16 2021-10-29 腾讯科技(北京)有限公司 Method, device and terminal for adjusting awakening sensitivity
US10789946B2 (en) 2017-10-24 2020-09-29 Beijing Didi Infinity Technology And Development Co., Ltd. System and method for speech recognition with decoupling awakening phrase
CN110809796A (en) * 2017-10-24 2020-02-18 北京嘀嘀无限科技发展有限公司 Speech recognition system and method with decoupled wake phrases
CN107895573A (en) * 2017-11-15 2018-04-10 百度在线网络技术(北京)有限公司 Method and device for identification information
WO2019113911A1 (en) * 2017-12-15 2019-06-20 海尔优家智能科技(北京)有限公司 Device control method, cloud device, smart device, computer medium and device
CN108320733A (en) * 2017-12-18 2018-07-24 上海科大讯飞信息科技有限公司 Voice data processing method and device, storage medium, electronic equipment
CN108198548A (en) * 2018-01-25 2018-06-22 苏州奇梦者网络科技有限公司 A kind of voice awakening method and its system
CN108039175A (en) * 2018-01-29 2018-05-15 北京百度网讯科技有限公司 Audio recognition method, device and server
US11398228B2 (en) 2018-01-29 2022-07-26 Beijing Baidu Netcom Science And Technology Co., Ltd. Voice recognition method, device and server
CN110097870A (en) * 2018-01-30 2019-08-06 阿里巴巴集团控股有限公司 Method of speech processing, device, equipment and storage medium
CN108536668A (en) * 2018-02-26 2018-09-14 科大讯飞股份有限公司 Wake up word appraisal procedure and device, storage medium, electronic equipment
CN108536668B (en) * 2018-02-26 2022-06-07 科大讯飞股份有限公司 Wake-up word evaluation method and device, storage medium and electronic equipment
CN108597506A (en) * 2018-03-13 2018-09-28 广州势必可赢网络科技有限公司 A kind of intelligent wearable device alarming method for power and intelligent wearable device
CN110390933A (en) * 2018-04-20 2019-10-29 比亚迪股份有限公司 State methods of exhibiting, device and the displaying vehicle system of vehicle intelligent voice system
US11797263B2 (en) 2018-05-10 2023-10-24 Sonos, Inc. Systems and methods for voice-assisted media content selection
US11792590B2 (en) 2018-05-25 2023-10-17 Sonos, Inc. Determining and adapting to changes in microphone performance of playback devices
CN108962240A (en) * 2018-06-14 2018-12-07 百度在线网络技术(北京)有限公司 A kind of sound control method and system based on earphone
US11973893B2 (en) 2018-08-28 2024-04-30 Sonos, Inc. Do not disturb feature for audio notifications
CN112655043A (en) * 2018-09-11 2021-04-13 日本电信电话株式会社 Keyword detection device, keyword detection method, and program
US11778259B2 (en) 2018-09-14 2023-10-03 Sonos, Inc. Networked devices, systems and methods for associating playback devices based on sound codes
CN109243426A (en) * 2018-09-19 2019-01-18 易诚博睿(南京)科技有限公司 A kind of automatization judgement voice false wake-up system and its judgment method
US11790937B2 (en) 2018-09-21 2023-10-17 Sonos, Inc. Voice detection optimization using sound metadata
CN111247582A (en) * 2018-09-28 2020-06-05 搜诺思公司 System and method for selective wake word detection using neural network models
US11790911B2 (en) 2018-09-28 2023-10-17 Sonos, Inc. Systems and methods for selective wake word detection using neural network models
CN109102806A (en) * 2018-09-29 2018-12-28 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for interactive voice
CN111128134A (en) * 2018-10-11 2020-05-08 阿里巴巴集团控股有限公司 Acoustic model training method, voice awakening method, device and electronic equipment
CN111819533B (en) * 2018-10-11 2022-06-14 华为技术有限公司 Method for triggering electronic equipment to execute function and electronic equipment
CN111819533A (en) * 2018-10-11 2020-10-23 华为技术有限公司 Method for triggering electronic equipment to execute function and electronic equipment
US11899519B2 (en) 2018-10-23 2024-02-13 Sonos, Inc. Multiple stage network microphone device with reduced power consumption and processing load
CN109192210A (en) * 2018-10-25 2019-01-11 腾讯科技(深圳)有限公司 A kind of method of speech recognition, the method and device for waking up word detection
CN109192210B (en) * 2018-10-25 2023-09-22 腾讯科技(深圳)有限公司 Voice recognition method, wake-up word detection method and device
CN109119078A (en) * 2018-10-26 2019-01-01 北京石头世纪科技有限公司 Automatic robot's control method, device, automatic robot and medium
US11881223B2 (en) 2018-12-07 2024-01-23 Sonos, Inc. Systems and methods of operating media playback systems having multiple voice assistant services
US11817083B2 (en) 2018-12-13 2023-11-14 Sonos, Inc. Networked microphone devices, systems, and methods of localized arbitration
CN109448720A (en) * 2018-12-18 2019-03-08 维拓智能科技(深圳)有限公司 Convenience service self-aided terminal and its voice awakening method
CN109878218A (en) * 2019-01-30 2019-06-14 厦门爱立得科技有限公司 A kind of printer and its Method of printing with intelligent sound control
CN109753665A (en) * 2019-01-30 2019-05-14 北京声智科技有限公司 Wake up the update method and device of model
CN111862963A (en) * 2019-04-12 2020-10-30 阿里巴巴集团控股有限公司 Voice wake-up method, device and equipment
CN111862963B (en) * 2019-04-12 2024-05-10 阿里巴巴集团控股有限公司 Voice wakeup method, device and equipment
CN110033758A (en) * 2019-04-24 2019-07-19 武汉水象电子科技有限公司 A kind of voice wake-up implementation method based on small training set optimization decoding network
CN110033758B (en) * 2019-04-24 2021-09-24 武汉水象电子科技有限公司 Voice wake-up implementation method based on small training set optimization decoding network
US11798553B2 (en) 2019-05-03 2023-10-24 Sonos, Inc. Voice assistant persistence across multiple network microphone devices
CN110177317A (en) * 2019-05-17 2019-08-27 腾讯科技(深圳)有限公司 Echo cancel method, device, computer readable storage medium and computer equipment
US11854547B2 (en) 2019-06-12 2023-12-26 Sonos, Inc. Network microphone device with command keyword eventing
CN110473536A (en) * 2019-08-20 2019-11-19 北京声智科技有限公司 A kind of awakening method, device and smart machine
CN110600008A (en) * 2019-09-23 2019-12-20 苏州思必驰信息科技有限公司 Voice wake-up optimization method and system
CN110727821A (en) * 2019-10-12 2020-01-24 深圳海翼智新科技有限公司 Method, apparatus, system and computer storage medium for preventing device from being awoken by mistake
US11862161B2 (en) 2019-10-22 2024-01-02 Sonos, Inc. VAS toggle based on device orientation
CN110989963A (en) * 2019-11-22 2020-04-10 北京梧桐车联科技有限责任公司 Awakening word recommendation method and device and storage medium
US11869503B2 (en) 2019-12-20 2024-01-09 Sonos, Inc. Offline voice control
US11887598B2 (en) 2020-01-07 2024-01-30 Sonos, Inc. Voice verification for media playback
US11961519B2 (en) 2020-02-07 2024-04-16 Sonos, Inc. Localized wakeword verification
US11881222B2 (en) 2020-05-20 2024-01-23 Sonos, Inc Command keywords with input detection windowing
CN111739513A (en) * 2020-07-22 2020-10-02 江苏清微智能科技有限公司 Automatic voice awakening test system and test method thereof
US11984123B2 (en) 2020-11-12 2024-05-14 Sonos, Inc. Network device interaction by range
CN112420051A (en) * 2020-11-18 2021-02-26 青岛海尔科技有限公司 Equipment determination method, device and storage medium
CN113038048A (en) * 2021-03-02 2021-06-25 海信视像科技股份有限公司 Far-field voice awakening method and display device
CN113535913B (en) * 2021-06-02 2023-12-01 科大讯飞股份有限公司 Answer scoring method and device, electronic equipment and storage medium
CN113535913A (en) * 2021-06-02 2021-10-22 科大讯飞股份有限公司 Answer scoring method and device, electronic equipment and storage medium
WO2023029442A1 (en) * 2021-08-30 2023-03-09 佛山市顺德区美的电子科技有限公司 Smart device control method and apparatus, smart device, and readable storage medium
WO2024011885A1 (en) * 2022-07-15 2024-01-18 北京百度网讯科技有限公司 Voice wakeup method and apparatus, electronic device, and storage medium

Also Published As

Publication number Publication date
CN102999161B (en) 2016-03-02

Similar Documents

Publication Publication Date Title
CN102999161B (en) A kind of implementation method of voice wake-up module and application
CN103021409B (en) A kind of vice activation camera system
US11967323B2 (en) Hotword suppression
CN111161714B (en) Voice information processing method, electronic equipment and storage medium
CN104575504A (en) Method for personalized television voice wake-up by voiceprint and voice identification
CN109166575A (en) Exchange method, device, smart machine and the storage medium of smart machine
CN106463112A (en) Voice recognition method, voice wake-up device, voice recognition device and terminal
CN103208284A (en) Method and system for using sound related vehicle information to enhance speech recognition
CN112397065A (en) Voice interaction method and device, computer readable storage medium and electronic equipment
CN107600075A (en) The control method and device of onboard system
CN103198829A (en) Method, device and equipment of reducing interior noise and improving voice recognition rate
CN111145763A (en) GRU-based voice recognition method and system in audio
CN110696756A (en) Vehicle volume control method and device, automobile and storage medium
CN110970020A (en) Method for extracting effective voice signal by using voiceprint
CN111833870A (en) Awakening method and device of vehicle-mounted voice system, vehicle and medium
CN112185425A (en) Audio signal processing method, device, equipment and storage medium
CN110689887A (en) Audio verification method and device, storage medium and electronic equipment
CN112927688B (en) Voice interaction method and system for vehicle
CN113643704A (en) Test method, upper computer, system and storage medium of vehicle-mounted machine voice system
CN110808050A (en) Voice recognition method and intelligent equipment
CN110737422B (en) Sound signal acquisition method and device
CN111477226A (en) Control method, intelligent device and storage medium
CN106094673A (en) Automobile wakes up word system and control method thereof up
CN101350196A (en) On-chip system for confirming role related talker identification and confirming method thereof
TW202029181A (en) Method and apparatus for specific user to wake up by speech recognition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Wangjiang Road high tech Development Zone Hefei city Anhui province 230088 No. 666

Applicant after: Iflytek Co., Ltd.

Address before: Wangjiang Road high tech Development Zone Hefei city Anhui province 230088 No. 666

Applicant before: Anhui USTC iFLYTEK Co., Ltd.

COR Change of bibliographic data
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20190212

Address after: 511458 X1301-G5145 (Cluster Registration) (JM) No. 106 Fengze East Road, Nansha District, Guangzhou, Guangdong Province

Patentee after: Science and Technology University Information Flying South China Institute of Artificial Intelligence (Guangzhou) Co., Ltd.

Address before: 230088 666 Wangjiang West Road, Hefei hi tech Development Zone, Anhui

Patentee before: Iflytek Co., Ltd.

TR01 Transfer of patent right