CN103366740A - Voice command recognition method and voice command recognition device - Google Patents

Voice command recognition method and voice command recognition device

Info

Publication number
CN103366740A
Authority
CN
China
Prior art keywords
voice command
sound
command recognition
signal
domain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012100848242A
Other languages
Chinese (zh)
Other versions
CN103366740B (en)
Inventor
袁媛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd
Priority to CN201210084824.2A
Publication of CN103366740A
Application granted
Publication of CN103366740B
Legal status: Active
Anticipated expiration

Landscapes

  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a voice command recognition method and a voice command recognition device and relates to the technical field of voice control. The method and the device can improve the voice recognition rate and make the operation more convenient. The method of the invention comprises the steps of receiving an audio signal, decomposing and filtering the audio signal according to an effective voice command feature to obtain a voice sample, and carrying out semantic recognition on the voice sample and determining a corresponding voice command. The voice command recognition method and the voice command recognition device are mainly used in the process of voice command recognition.

Description

Voice command recognition method and device
Technical field
The present invention relates to the field of voice control technology, and in particular to a voice command recognition method and device.
Background
With the development of voice control technology, it has come into wide use in daily life and work. Voice control is a control technology that takes human speech as the input command. In use, the user's voice is inevitably mixed with ambient noise, other people's voices, and similar interference. How to filter out speech from unimportant sound sources and accurately recognize the voice command from the important sound source has therefore become a major problem that voice-controlled devices need to solve. Accordingly, the accuracy of speech recognition and the user-friendliness of voice-controlled devices have become important topics in the industry.
In the prior art, a voice-controlled device can recognize only predetermined voices. For example, if the operator of the device is owner A, a large number of owner A's speech samples are recorded in advance and stored as a standard command database, which serves as the basis for voice command recognition. When owner B operates the device, owner B's voice differs from owner A's in features such as frequency and timbre, so owner B's command cannot be recognized even when it is the same voice command.
In the course of implementing the above voice command recognition, the inventor found at least the following problems in the prior art: because recognition relies on speech samples recorded in advance from the operator, the people who can operate the device are restricted, which lowers the speech recognition rate; and every operator must record a large standard command library before using the device, which makes operation harder and the experience unfriendly.
Summary of the invention
Embodiments of the invention provide a voice command recognition method and device that can improve the speech recognition rate and make operation more convenient.
To achieve the above object, embodiments of the invention adopt the following technical solutions:
A voice command recognition method, comprising:
receiving an audio signal;
decomposing and filtering the audio signal according to effective voice command features to obtain a speech sample;
performing semantic recognition on the speech sample to determine the corresponding voice command.
A voice command recognition device, comprising:
an audio receiving unit, configured to receive an audio signal;
a sample extraction unit, configured to decompose and filter the audio signal according to effective voice command features to obtain a speech sample;
a command recognition unit, configured to perform semantic recognition on the speech sample to determine the corresponding voice command.
With the voice command recognition method and device provided by the embodiments of the invention, the received audio signal is decomposed and filtered according to effective voice command features, and semantic recognition is then performed to determine the voice command. Compared with the existing technique of matching the received audio signal against pre-recorded speech samples of the owner, this places no restriction on who may use the voice command recognition device, improves the recognition rate for voice commands, and requires no large set of pre-recorded speech samples, so operation is more convenient.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the invention; those of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of the voice command recognition method in Embodiment 1 of the invention;
Fig. 2 is a flowchart of one voice command recognition method in Embodiment 2 of the invention;
Fig. 3 is a flowchart of another voice command recognition method in Embodiment 2 of the invention;
Fig. 4 is a schematic diagram of the composition of one voice command recognition device in Embodiment 3 of the invention;
Fig. 5 is a schematic diagram of the composition of another voice command recognition device in Embodiment 3 of the invention;
Fig. 6 is a schematic diagram of the composition of a further voice command recognition device in Embodiment 3 of the invention.
Detailed description
The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the invention, not all of them. All other embodiments that those of ordinary skill in the art obtain from the embodiments of the invention without creative effort fall within the scope of protection of the invention.
Embodiment 1
This embodiment of the invention provides a voice command recognition method. As shown in Fig. 1, the method may include:
101. Receive an audio signal.
The source of the audio signal is not limited to a specific user; the speaker may be an adult or a child, male or female, and so on. The voice command recognition method provided by the invention can receive and recognize voice commands spoken in human voices of various timbres. In special cases, for example when the voice command recognition device is not meant to be used by children, or when children's voices are not to be treated as voice commands, the unwanted sounds can be filtered out during decomposition and filtering.
In actual operation, the received audio signal is filtered and recognized directly, with no need to record a sound sample library for the user in advance. This makes the voice command recognition device simpler to use and improves the user experience.
102. Decompose and filter the audio signal according to effective voice command features to obtain a speech sample.
The effective voice command features can be set according to the needs of the application. For example, a sound that is high in frequency and very brief may be regarded as a child's voice, and a low-frequency sound that persists throughout the audio signal may be regarded as ambient noise. Neither matches the effective voice command features, so these irrelevant sound components can be filtered out to obtain an effective voice command that meets the requirements.
103. Perform semantic recognition on the speech sample to determine the corresponding voice command.
After the speech sample is obtained in step 102, semantic recognition can be performed on it to determine the corresponding voice command in one of two ways. In the first, the sound feature points of the speech sample are matched against the sound feature points corresponding to the voice commands in a voice command material database, and the voice command that has the highest matching rate and reaches the specified matching rate is determined. In the second, the sound feature points of the speech sample are matched against the keyword feature points in the voice command material database, the keywords that reach the specified matching rate are determined, and the corresponding voice command is determined from those keywords.
With the voice command recognition method provided by this embodiment of the invention, the received audio signal is decomposed and filtered according to effective voice command features, and semantic recognition is then performed to determine the voice command. Compared with the existing technique of matching the received audio signal against pre-recorded speech samples of the owner, this places no restriction on who may use the voice command recognition device, improves the recognition rate for voice commands, and requires no large set of pre-recorded speech samples, so operation is more convenient.
Embodiment 2
This embodiment of the invention provides a voice command recognition method. As shown in Fig. 2, the method may include:
201. Receive an audio signal.
202. Analyze the audio signal received within an audio receiving period, identify the starting point of human speech in the time-domain signal, and intercept the time-domain signal of effective human speech.
The duration of a complete audio receiving period may differ from the duration of a single voice command, and several human utterances, or several voice commands, may be received within one complete audio receiving period. The audio signal received within the audio receiving period can therefore be analyzed to identify the starting point of human speech in the time-domain signal and intercept the time-domain signal of effective human speech.
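Step 202 can be pictured with a simple energy-based endpoint detector. This is only a minimal sketch: the patent does not say how the starting point of speech is found, so the short-time energy criterion, the function names, and every parameter (`frame_len`, `energy_threshold`, `min_frames`) are assumptions made for illustration.

```python
from typing import List, Tuple


def frame_energies(samples: List[float], frame_len: int) -> List[float]:
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def find_speech_segments(
    samples: List[float],
    frame_len: int = 160,            # assumed frame size (20 ms at 8 kHz)
    energy_threshold: float = 0.01,  # assumed level separating speech from silence
    min_frames: int = 3,             # runs shorter than this are ignored
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of runs of high-energy frames."""
    segments = []
    run_start = None
    energies = frame_energies(samples, frame_len)
    # A trailing zero-energy sentinel closes a run that lasts to the end
    # (this relies on energy_threshold being greater than zero).
    for idx, energy in enumerate(energies + [0.0]):
        if energy >= energy_threshold:
            if run_start is None:
                run_start = idx
        elif run_start is not None:
            if idx - run_start >= min_frames:
                segments.append((run_start * frame_len, idx * frame_len))
            run_start = None
    return segments
```

Each returned pair marks one intercepted candidate segment. When more than one pair comes back for a single receiving period, the segments would go on to the screening of step 203.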
203. If time-domain signals of at least two effective human speech segments are intercepted within the audio receiving period, select the audio signal that meets the time-domain requirements according to the time-domain features of an effective voice command.
If more than one time-domain signal of effective human speech is intercepted within one audio receiving period, that is, the audio signal received within the period contains at least two time-domain signals, the time-domain signal that meets the time-domain requirements can be selected according to the time-domain features of an effective voice command and used as the audio signal for subsequent processing. Specifically, if adult speech is taken as the effective voice command, the time-domain signal of adult speech can be preliminarily selected on the basis that children's voices are higher in frequency and their utterances slightly shorter than those of adults.
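The screening in step 203 might look like the following sketch, which keeps only segments that are long enough and low enough in frequency to be adult speech. The patent names the two cues (frequency and duration) but not how they are measured, so the zero-crossing frequency estimate, the function names, and both limits (`min_duration_s`, `max_frequency_hz`) are assumptions.

```python
import math
from typing import List


def estimated_frequency_hz(samples: List[float], sample_rate: int) -> float:
    """Rough dominant frequency from the zero-crossing count (two crossings per cycle)."""
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return crossings * sample_rate / (2 * len(samples))


def screen_segments(
    segments: List[List[float]],
    sample_rate: int,
    min_duration_s: float = 0.3,      # assumed: shorter utterances are screened out
    max_frequency_hz: float = 300.0,  # assumed: higher-pitched segments are screened out
) -> List[List[float]]:
    """Keep segments whose duration and estimated frequency fit the assumed adult profile."""
    kept = []
    for segment in segments:
        if not segment:
            continue
        duration_s = len(segment) / sample_rate
        if (duration_s >= min_duration_s
                and estimated_frequency_hz(segment, sample_rate) <= max_frequency_hz):
            kept.append(segment)
    return kept


def tone(frequency_hz: float, duration_s: float, sample_rate: int) -> List[float]:
    """Helper for trying the sketch out: a pure sine tone."""
    n = int(duration_s * sample_rate)
    return [math.sin(2 * math.pi * frequency_hz * i / sample_rate) for i in range(n)]
```

For example, `screen_segments([tone(150, 0.5, 8000), tone(400, 0.2, 8000)], 8000)` keeps only the first tone. Real speech is far from a pure tone, so a practical system would likely use a proper pitch estimator in place of the zero-crossing count.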
204. Perform frequency-domain decomposition on the audio signal and discard the bands whose frequency is too high and/or too low.
After the time-domain analysis and filtering of steps 202-203, the filtered audio signal can be further analyzed and filtered in the frequency domain. Specifically, sound with a frequency above a first threshold can be filtered out as the noise of children making a commotion, sound with a frequency below a second threshold can be filtered out as ambient noise, or both can be filtered out. The specific frequency thresholds and filtering criteria can be set according to the environment in which the voice command recognition device is actually used; the embodiments of the invention place no restriction on them.
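A minimal sketch of the band rejection in step 204 follows. It uses a naive discrete Fourier transform so that it needs only the standard library; the patent does not prescribe a transform, and the function names and the parameters `low_cut_hz` (playing the role of the second threshold) and `high_cut_hz` (playing the role of the first threshold) are illustrative assumptions.

```python
import cmath
from typing import List


def dft(x: List[float]) -> List[complex]:
    """Naive O(n^2) discrete Fourier transform (fine for short illustrative signals)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum: List[complex]) -> List[float]:
    """Inverse of dft(); returns the real part of each reconstructed sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def band_limit(
    samples: List[float],
    sample_rate: float,
    low_cut_hz: float,   # assumed "second threshold": lower frequencies are discarded
    high_cut_hz: float,  # assumed "first threshold": higher frequencies are discarded
) -> List[float]:
    """Zero every frequency bin outside [low_cut_hz, high_cut_hz] and resynthesize."""
    n = len(samples)
    spectrum = dft(samples)
    for k in range(n):
        # Bins k and n - k carry the same physical frequency for a real signal.
        freq_hz = min(k, n - k) * sample_rate / n
        if freq_hz < low_cut_hz or freq_hz > high_cut_hz:
            spectrum[k] = 0
    return idft(spectrum)
```

Passing `low_cut_hz=0` keeps all low frequencies and so rejects only the high band, which covers the "and/or" wording of the step. A real device would more likely use an FFT or a time-domain band-pass filter for speed.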
205. Perform independent component analysis on the audio signal filtered in the frequency domain and filter out noise to obtain the speech sample.
The audio signal obtained after the frequency-domain filtering of step 204 may still contain sounds from multiple sources. Independent component analysis can be further applied to the audio signal to filter out noise that does not match the effective voice command features. For example, the noise may include background music, pet sounds, children's voices, and so on.
In one application scenario of this embodiment, the speech sample obtained after decomposition and filtering can be matched directly to determine the voice command. The specific method may include:
206. Match the sound feature points of the speech sample against the sound feature points corresponding to the voice commands in the voice command material database.
The voice command material database is configured in advance and may contain voice commands and the sound feature points corresponding to each voice command. The sound feature points of the speech sample are matched against the sound feature points corresponding to the voice commands in the database. If the matching rate reaches the specified matching rate, for example 75%, the corresponding voice command can be determined. If the matching rate is below the specified matching rate, the speech sample can be treated as invalid, and the voice command recognition process exits or the user is prompted to input again.
It will be understood that the specific value of the specified matching rate can be adjusted according to the sensitivity required for voice command recognition in the actual application; the embodiments of the invention place no restriction on it.
207. Determine the voice command that has the highest matching rate and reaches the specified matching rate.
If exactly one speech sample satisfies the specified matching rate, the corresponding voice command can be determined directly. If at least two speech samples satisfy it, the speech sample with the highest matching rate can be selected, and the voice command corresponding to that speech sample is determined.
Alternatively, the voice commands that reach the specified matching rate can be displayed so that the user can select the desired voice command or input again. Specifically, if at least two speech samples satisfy the specified matching rate, at least two corresponding voice commands can be determined and presented, so that the user can select the operation corresponding to the desired voice command or choose to input the voice command again.
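Steps 206 and 207 can be sketched as follows. The patent does not define what a sound feature point is or how the matching rate is computed, so this sketch assumes that feature points are hashable identifiers and that the matching rate is the fraction of a command's feature points found in the sample; the function names and the example database are hypothetical.

```python
from typing import Dict, List, Optional, Set, Tuple


def matching_rate(sample_points: Set[int], command_points: Set[int]) -> float:
    """Assumed definition: share of the command's feature points present in the sample."""
    if not command_points:
        return 0.0
    return len(sample_points & command_points) / len(command_points)


def rank_commands(
    sample_points: Set[int],
    database: Dict[str, Set[int]],
    required_rate: float = 0.75,  # the 75% example value from the description
) -> List[Tuple[str, float]]:
    """Commands that reach the specified matching rate, best match first."""
    scored = [(matching_rate(sample_points, pts), name) for name, pts in database.items()]
    qualified = sorted((s for s in scored if s[0] >= required_rate), reverse=True)
    return [(name, rate) for rate, name in qualified]


def recognize(
    sample_points: Set[int],
    database: Dict[str, Set[int]],
    required_rate: float = 0.75,
) -> Optional[str]:
    """Best qualifying command, or None when the sample is treated as invalid."""
    ranked = rank_commands(sample_points, database, required_rate)
    return ranked[0][0] if ranked else None
```

`recognize` corresponds to picking the highest matching rate automatically, while the full list from `rank_commands` could be shown to the user for the select-or-retry variant. A `None` result corresponds to exiting the recognition process or prompting the user to input again.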
208. Execute the operation corresponding to the voice command.
The operation corresponding to a voice command can be set according to the device actually being controlled. For example, the operation corresponding to "next page" may be turning a page in a PPT or an e-book; voice commands such as "start", "pause", and "exit" may correspond to the relevant control operations of an application.
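One common way to realize step 208 is a lookup table from command text to an operation on the controlled device. The sketch below assumes a hypothetical `Presentation` class standing in for the controlled device; neither that class nor `execute_command` comes from the patent.

```python
from typing import Callable, Dict


class Presentation:
    """Hypothetical controlled device: a slide deck with a current page."""

    def __init__(self) -> None:
        self.page = 1
        self.running = True

    def next_page(self) -> None:
        self.page += 1

    def exit(self) -> None:
        self.running = False


def execute_command(command: str, operations: Dict[str, Callable[[], None]]) -> bool:
    """Run the operation bound to the command; return False if none is bound."""
    operation = operations.get(command)
    if operation is None:
        return False
    operation()
    return True
```

With `deck = Presentation()` and `operations = {"next page": deck.next_page, "exit": deck.exit}`, calling `execute_command("next page", operations)` advances the deck by one page. Keeping the table separate from the recognizer lets the same recognizer drive different devices.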
In another application scenario of this embodiment, the speech sample obtained after decomposition and filtering can be matched to obtain corresponding keywords, from which the corresponding voice command is determined. As shown in Fig. 3, steps 206 and 207 above can be replaced with the following steps:
209. Match the sound feature points of the speech sample against the keyword feature points in the voice command material database, and determine the keywords that reach the specified matching rate.
The voice command material database is configured in advance and may contain voice commands, the keywords corresponding to each voice command, and the keyword feature points. The sound feature points of the speech sample are matched against the keyword feature points in the database. If the matching rate reaches the specified matching rate, for example 75%, the corresponding keyword can be determined. If no speech sample's sound feature points reach the specified matching rate against the keyword feature points in the database, the speech sample can be treated as invalid, and the voice command recognition process exits or the user is prompted to input again.
It will be understood that the specific value of the specified matching rate can be adjusted according to the sensitivity required for voice command recognition in the actual application; the embodiments of the invention place no restriction on it.
210. Determine the corresponding voice command according to the keywords.
If matching yields one keyword that meets the specified matching rate, the voice command can be determined from that keyword. If matching yields several such keywords, the corresponding voice command can be determined from the matched keywords taken together.
Alternatively, the voice commands related to the keywords can be displayed so that the user can select the desired voice command or input again. Specifically, several corresponding voice commands can be determined from the keywords and presented, so that the user can select the operation corresponding to the desired voice command or choose to input the voice command again.
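The keyword route of steps 209 and 210 can be sketched in the same spirit. As before, representing feature points as sets of identifiers, defining the matching rate as an overlap fraction, and ranking commands by how many matched keywords they contain are all assumptions; the function names and example data are hypothetical.

```python
from typing import Dict, List, Set


def matching_rate(sample_points: Set[int], reference_points: Set[int]) -> float:
    """Assumed definition: share of the reference feature points present in the sample."""
    if not reference_points:
        return 0.0
    return len(sample_points & reference_points) / len(reference_points)


def matched_keywords(
    sample_points: Set[int],
    keyword_points: Dict[str, Set[int]],
    required_rate: float = 0.75,  # the 75% example value from the description
) -> Set[str]:
    """Step 209: keywords whose feature points reach the specified matching rate."""
    return {
        keyword
        for keyword, points in keyword_points.items()
        if matching_rate(sample_points, points) >= required_rate
    }


def candidate_commands(
    keywords: Set[str],
    command_keywords: Dict[str, Set[str]],
) -> List[str]:
    """Step 210: commands sharing at least one matched keyword, most shared keywords first."""
    scored = [(len(keywords & kws), name) for name, kws in command_keywords.items()]
    ranked = sorted(scored, key=lambda item: (-item[0], item[1]))
    return [name for count, name in ranked if count > 0]
```

The first entry of `candidate_commands` plays the role of the command determined from all matched keywords together, and the whole list is what could be displayed for the user to choose from.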
With the voice command recognition method provided by this embodiment of the invention, the received audio signal is decomposed and filtered according to effective voice command features, and semantic recognition is then performed to determine the voice command. Compared with the existing technique of matching the received audio signal against pre-recorded speech samples of the owner, this places no restriction on who may use the voice command recognition device, improves the recognition rate for voice commands, and requires no large set of pre-recorded speech samples, so operation is more convenient.
Embodiment 3
This embodiment of the invention provides a voice command recognition device. As shown in Fig. 4, the device may include an audio receiving unit 31, a sample extraction unit 32, and a command recognition unit 33.
The audio receiving unit 31 is configured to receive an audio signal.
The sample extraction unit 32 is configured to decompose and filter the audio signal according to effective voice command features to obtain a speech sample.
The command recognition unit 33 is configured to perform semantic recognition on the speech sample to determine the corresponding voice command.
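The three units can be wired together as a simple pipeline. The sketch below models each unit as a callable so that any concrete implementation can be plugged in; the class name `VoiceCommandRecognizer`, the field names, and the method `run_once` are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class VoiceCommandRecognizer:
    """Hypothetical composition of units 31, 32 and 33 as pluggable callables."""

    receive: Callable[[], List[float]]                  # audio receiving unit 31
    extract: Callable[[List[float]], List[float]]       # sample extraction unit 32
    recognize: Callable[[List[float]], Optional[str]]   # command recognition unit 33

    def run_once(self) -> Optional[str]:
        """Receive audio, extract a speech sample, and return the recognized command."""
        audio = self.receive()
        sample = self.extract(audio)
        return self.recognize(sample)
```

The optional units described for the device (time-domain interception, time-domain screening, execution) could be added in the same way as further callables between or after these three stages.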
Further, as shown in Fig. 5, the voice command recognition device may also include a time-domain interception unit 34.
The time-domain interception unit 34 is configured to, after the audio receiving unit 31 receives the audio signal, analyze the audio signal received within the receiving period, identify the starting point of human speech in the time-domain signal, and intercept the time-domain signal of effective human speech.
Correspondingly, the sample extraction unit 32 may also be configured to decompose and filter the time-domain signal of the effective human speech according to the effective voice command features to obtain the speech sample.
Further, the voice command recognition device may also include a time-domain screening unit 35.
The time-domain screening unit 35 is configured to, after the time-domain interception unit 34 intercepts the time-domain signal of effective human speech, and when time-domain signals of at least two effective human speech segments are intercepted within one audio receiving period, select the audio signal that meets the time-domain requirements according to the time-domain features of an effective voice command.
Further, the time-domain screening unit 35 is specifically also configured to preliminarily select the time-domain signal of adult speech on the basis that children's voices are higher in frequency and their utterances slightly shorter than those of adults.
Further, the sample extraction unit 32 may include a first filtering module 321 and a second filtering module 322.
The first filtering module 321 is configured to perform frequency-domain decomposition on the audio signal and discard the bands whose frequency is too high and/or too low.
The second filtering module 322 is configured to perform independent component analysis on the audio signal filtered in the frequency domain and filter out noise to obtain the speech sample.
The noise includes background music, pet sounds, and children's voices.
In one application scenario of this embodiment, the command recognition unit 33 may include a first matching module 331 and a first determination module 332.
The first matching module 331 is configured to match the sound feature points of the speech sample against the sound feature points corresponding to the voice commands in the voice command material database.
The first determination module 332 is configured to determine the voice command that has the highest matching rate and reaches the specified matching rate, or to display the voice commands that reach the specified matching rate so that the user can select the desired voice command or input again.
As shown in Fig. 6, in another application scenario of this embodiment, the command recognition unit 33 may include a second matching module 333 and a second determination module 334.
The second matching module 333 is configured to match the sound feature points of the speech sample against the keyword feature points in the voice command material database and determine the keywords that reach the specified matching rate.
The second determination module 334 is configured to determine the corresponding voice command according to the keywords, or to display the voice commands related to the keywords so that the user can select the desired voice command or input again.
Further, the voice command recognition device may also include an execution unit 36.
The execution unit 36 is configured to execute the operation corresponding to the voice command after the command recognition unit 33 determines the corresponding voice command.
With the voice command recognition device provided by this embodiment of the invention, the received audio signal is decomposed and filtered according to effective voice command features, and semantic recognition is then performed to determine the voice command. Compared with the existing technique of matching the received audio signal against pre-recorded speech samples of the owner, this places no restriction on who may use the voice command recognition device, improves the recognition rate for voice commands, and requires no large set of pre-recorded speech samples, so operation is more convenient.
From the above description of the embodiments, those skilled in the art will clearly understand that the invention can be implemented by software plus the necessary general-purpose hardware, or of course by hardware alone, though in many cases the former is the better implementation. Based on this understanding, the technical solution of the invention in essence, or the part of it that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a readable storage medium, such as a computer floppy disk, hard disk, or optical disc, and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, etc.) to execute the methods described in the embodiments of the invention.
The above are only specific embodiments of the invention, but the scope of protection of the invention is not limited to them. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the invention shall fall within the scope of protection of the invention. The scope of protection of the invention shall therefore be determined by the scope of protection of the claims.

Claims (18)

1. A voice command recognition method, characterized by comprising:
receiving an audio signal;
decomposing and filtering the audio signal according to effective voice command features to obtain a speech sample;
performing semantic recognition on the speech sample to determine the corresponding voice command.
2. The voice command recognition method according to claim 1, characterized in that, after the receiving of the audio signal, the method further comprises:
analyzing the audio signal received within an audio receiving period, identifying the starting point of human speech in the time-domain signal, and intercepting the time-domain signal of effective human speech;
correspondingly, the decomposing and filtering of the audio signal according to effective voice command features is specifically: decomposing and filtering the time-domain signal of the effective human speech according to the effective voice command features.
3. The voice command recognition method according to claim 2, characterized in that,
after the intercepting of the time-domain signal of effective human speech, the method further comprises:
if time-domain signals of at least two effective human speech segments are intercepted within the audio receiving period, selecting the audio signal that meets the time-domain requirements according to the time-domain features of an effective voice command.
4. The voice command recognition method according to claim 3, characterized in that the selecting of the audio signal that meets the time-domain requirements according to the time-domain features of an effective voice command comprises:
preliminarily selecting the time-domain signal of adult speech on the basis that children's voices are higher in frequency and their utterances slightly shorter than those of adults.
5. The voice command recognition method according to claim 1, characterized in that
the decomposing and filtering of the audio signal according to effective voice command features to obtain a speech sample comprises:
performing frequency-domain decomposition on the audio signal and discarding the bands whose frequency is too high and/or too low;
performing independent component analysis on the audio signal filtered in the frequency domain and filtering out noise to obtain the speech sample.
6. The voice command recognition method according to claim 5, characterized in that the noise comprises: background music, pet sounds, and children's voices.
7. The voice command recognition method according to claim 1, characterized in that
the performing of semantic recognition on the speech sample to determine the corresponding voice command comprises:
matching the sound feature points of the speech sample against the sound feature points corresponding to the voice commands in a voice command material database;
determining the voice command that has the highest matching rate and reaches the specified matching rate, or displaying the voice commands that reach the specified matching rate so that the user can select the desired voice command or input again.
8. The voice command recognition method according to claim 1, characterized in that
the performing of semantic recognition on the speech sample to determine the corresponding voice command comprises:
matching the sound feature points of the speech sample against the keyword feature points in a voice command material database, and determining the keywords that reach the specified matching rate;
determining the corresponding voice command according to the keywords, or displaying the voice commands related to the keywords so that the user can select the desired voice command or input again.
9. The voice command recognition method according to any one of claims 1 to 8, characterized in that,
after the corresponding voice command is determined, the method further comprises:
executing the operation corresponding to the voice command.
10. a voice command recognition device is characterized in that, comprising:
The audio frequency receiving element is used for received audio signal;
The sample extraction unit for according to the efficient voice command characteristics described sound signal being decomposed and filtering, obtains speech samples;
The command recognition unit is used for described speech samples is carried out semanteme identification, determines corresponding voice command.
11. The voice command recognition device according to claim 10, characterized by further comprising:
a time-domain interception unit, configured to, after the audio receiving unit receives the audio signal, analyze the audio signal received within an audio receiving cycle, identify the starting point of human speech in the time-domain signal, and intercept the time-domain signal of valid human speech;
correspondingly, the sample extraction unit is further configured to decompose and filter the time-domain signal of the valid human speech according to the effective voice command features to obtain a speech sample.
12. The voice command recognition device according to claim 11, characterized by further comprising:
a time-domain screening unit, configured to, after the time-domain interception unit intercepts the time-domain signal of valid human speech and when the time-domain signals of at least two valid human speech segments have been intercepted within the audio receiving cycle, select the audio signal that meets the time-domain requirements according to the time-domain features of an effective voice command.
13. The voice command recognition device according to claim 12, characterized in that the time-domain screening unit is further specifically configured to: preliminarily select the time-domain signal of an adult voice, based on the characteristics that children's voices are higher in frequency and that children's utterances are slightly shorter than those of adults.
14. The voice command recognition device according to claim 11, characterized in that
the sample extraction unit comprises:
a first filtering module, configured to perform frequency-domain decomposition on the audio signal and discard the bands whose frequency is too high and/or too low;
a second filtering module, configured to perform independent component analysis on the frequency-domain-filtered audio signal and filter out noise to obtain a speech sample.
15. The voice command recognition device according to claim 14, characterized in that the noise comprises: background music, pet sounds, and children's voices.
16. The voice command recognition device according to claim 11, characterized in that
the command recognition unit comprises:
a first matching module, configured to match the sound feature points of the speech sample against the sound feature points corresponding to the voice commands in a voice command material library;
a first determination module, configured to determine the voice command whose matching rate is the highest and reaches a specified matching rate, or to display the voice commands that reach the specified matching rate so that the user can select the desired voice command or re-enter it.
17. The voice command recognition device according to claim 11, characterized in that
the command recognition unit comprises:
a second matching module, configured to match the sound feature points of the speech sample against the keyword feature points in a voice command material library and determine the keywords that reach a specified matching rate;
a second determination module, configured to determine the corresponding voice command according to the keywords, or to display the voice commands related to the keywords so that the user can select the desired voice command or re-enter it.
18. The voice command recognition device according to any one of claims 11 to 17, characterized by further comprising:
an execution unit, configured to execute the operation corresponding to the voice command after the command recognition unit determines the corresponding voice command.
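The first filtering module of claim 14 (frequency-domain decomposition followed by discarding bands that are too high or too low) might look roughly like the Python sketch below. The claims give no cutoff frequencies, so the 300 Hz and 3400 Hz limits, the function name `discard_out_of_band`, and the use of a naive discrete Fourier transform are assumptions chosen to keep the example self-contained; the independent component analysis of the second filtering module is not shown.

```python
import cmath
import math

def discard_out_of_band(signal, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Zero every DFT bin outside [low_hz, high_hz] and return the filtered signal.

    Uses a naive O(n^2) DFT, which is only practical for short frames.
    The band limits are illustrative assumptions, not values from the claims.
    """
    n = len(signal)
    # Forward DFT: decompose the frame into n frequency bins.
    spectrum = [
        sum(signal[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    for k in range(n):
        # Bins above n/2 mirror the negative frequencies, so fold them back.
        freq = min(k, n - k) * sample_rate / n
        if not (low_hz <= freq <= high_hz):
            spectrum[k] = 0
    # Inverse DFT: rebuild the time-domain signal from the retained bins.
    return [
        (sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]

# A 50 Hz hum mixed with a 1000 Hz tone, sampled at 8000 Hz for 160 samples.
rate, n = 8000, 160
mixed = [math.sin(2 * math.pi * 50 * t / rate) + math.sin(2 * math.pi * 1000 * t / rate)
         for t in range(n)]
filtered = discard_out_of_band(mixed, rate)
```

In this example the 50 Hz component falls below the assumed lower limit and is removed, while the 1000 Hz tone passes through. A practical implementation would more likely use an FFT routine or a band-pass filter from a signal-processing library.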
CN201210084824.2A 2012-03-27 2012-03-27 Voice command identification method and device Active CN103366740B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210084824.2A CN103366740B (en) 2012-03-27 2012-03-27 Voice command identification method and device

Publications (2)

Publication Number Publication Date
CN103366740A (en) 2013-10-23
CN103366740B (en) 2016-12-14

Family

ID=49367941

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210084824.2A Active CN103366740B (en) 2012-03-27 2012-03-27 Voice command identification method and device

Country Status (1)

Country Link
CN (1) CN103366740B (en)

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714815A (en) * 2013-12-09 2014-04-09 何永 Voice control method and device thereof
CN103903091A (en) * 2014-03-19 2014-07-02 江苏科技大学 Vehicle ignition control system and method based on cloud computing platform
CN103971681A (en) * 2014-04-24 2014-08-06 百度在线网络技术(北京)有限公司 Voice recognition method and system
CN104103272A (en) * 2014-07-15 2014-10-15 无锡中星微电子有限公司 Voice recognition method and device and blue-tooth earphone
CN105100460A (en) * 2015-07-09 2015-11-25 上海斐讯数据通信技术有限公司 Method and system for controlling intelligent terminal by use of sound
WO2015180231A1 (en) * 2014-05-29 2015-12-03 中兴通讯股份有限公司 Voice interaction method and apparatus
CN105679318A (en) * 2015-12-23 2016-06-15 珠海格力电器股份有限公司 Display method and device based on speech recognition, display system and air conditioner
CN106127631A (en) * 2016-06-15 2016-11-16 汤美 The teaching-course manager method and system of network courses
CN106369773A (en) * 2016-11-15 2017-02-01 北京小米移动软件有限公司 Method and device for controlling air supply of air conditioner
CN106792003A (en) * 2016-12-27 2017-05-31 西安石油大学 A kind of intelligent advertisement inserting method, device and server
CN106874185A (en) * 2016-12-27 2017-06-20 中车株洲电力机车研究所有限公司 A kind of automated testing method driven based on voiced keyword and system
CN106940996A (en) * 2017-04-24 2017-07-11 维沃移动通信有限公司 The recognition methods of background music and mobile terminal in a kind of video
WO2017157067A1 (en) * 2016-03-16 2017-09-21 广州阿里巴巴文学信息技术有限公司 Page turning method and device for use in electronic book
CN107195300A (en) * 2017-05-15 2017-09-22 珠海格力电器股份有限公司 Sound control method and system
CN107271963A (en) * 2017-06-22 2017-10-20 广东美的制冷设备有限公司 The method and apparatus and air conditioner of auditory localization
CN108595143A (en) * 2018-03-30 2018-09-28 联想(北京)有限公司 Electronic equipment, sound pick-up and signal processing method
WO2018196231A1 (en) * 2017-04-26 2018-11-01 海信集团有限公司 Method for smart terminal displaying user manipulation instruction, and smart terminal
CN108771491A (en) * 2018-05-24 2018-11-09 宁波国盛电器有限公司 A kind of sandwich unit
CN108806672A (en) * 2017-04-28 2018-11-13 辛雪峰 A kind of control method for fan of voice double mode
CN108961525A (en) * 2017-05-17 2018-12-07 北京博瑞彤芸文化传播股份有限公司 A kind of acquisition methods selecting information
CN109036461A (en) * 2017-06-12 2018-12-18 杭州海康威视数字技术股份有限公司 A kind of output method of notification information, server and monitoring system
CN109074808A (en) * 2018-07-18 2018-12-21 深圳魔耳智能声学科技有限公司 Sound control method, control device and storage medium
CN109241332A (en) * 2018-10-19 2019-01-18 广东小天才科技有限公司 It is a kind of to determine semantic method and system by voice
CN109344231A (en) * 2018-10-31 2019-02-15 广东小天才科技有限公司 A kind of method and system of the semantic incomplete corpus of completion
CN110310623A (en) * 2017-09-20 2019-10-08 Oppo广东移动通信有限公司 Sample generating method, model training method, device, medium and electronic equipment
CN110634486A (en) * 2018-06-21 2019-12-31 阿里巴巴集团控股有限公司 Voice processing method and device
CN111477206A (en) * 2020-04-16 2020-07-31 北京百度网讯科技有限公司 Noise reduction method and device for vehicle-mounted environment, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6456977B1 (en) * 1998-10-15 2002-09-24 Primax Electronics Ltd. Voice control module for controlling a game controller
CN101405739A (en) * 2002-12-26 2009-04-08 摩托罗拉公司(在特拉华州注册的公司) Identification apparatus and method
CN101516005A (en) * 2008-02-23 2009-08-26 华为技术有限公司 Speech recognition channel selecting system, method and channel switching device
CN101867742A (en) * 2010-05-21 2010-10-20 中山大学 Television system based on sound control
CN102237087A (en) * 2010-04-27 2011-11-09 中兴通讯股份有限公司 Voice control method and voice control device
CN102254558A (en) * 2011-07-01 2011-11-23 重庆邮电大学 Control method of intelligent wheel chair voice recognition based on end point detection

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714815A (en) * 2013-12-09 2014-04-09 何永 Voice control method and device thereof
CN103903091B (en) * 2014-03-19 2017-10-03 江苏科技大学 A kind of control method of the vehicle ignition control device based on cloud computing platform
CN103903091A (en) * 2014-03-19 2014-07-02 江苏科技大学 Vehicle ignition control system and method based on cloud computing platform
CN103971681A (en) * 2014-04-24 2014-08-06 百度在线网络技术(北京)有限公司 Voice recognition method and system
CN105138110A (en) * 2014-05-29 2015-12-09 中兴通讯股份有限公司 Voice interaction method and voice interaction device
WO2015180231A1 (en) * 2014-05-29 2015-12-03 中兴通讯股份有限公司 Voice interaction method and apparatus
CN104103272A (en) * 2014-07-15 2014-10-15 无锡中星微电子有限公司 Voice recognition method and device and blue-tooth earphone
CN104103272B (en) * 2014-07-15 2017-10-10 无锡中感微电子股份有限公司 Audio recognition method, device and bluetooth earphone
CN105100460A (en) * 2015-07-09 2015-11-25 上海斐讯数据通信技术有限公司 Method and system for controlling intelligent terminal by use of sound
CN105679318A (en) * 2015-12-23 2016-06-15 珠海格力电器股份有限公司 Display method and device based on speech recognition, display system and air conditioner
WO2017157067A1 (en) * 2016-03-16 2017-09-21 广州阿里巴巴文学信息技术有限公司 Page turning method and device for use in electronic book
CN107205076A (en) * 2016-03-16 2017-09-26 广州阿里巴巴文学信息技术有限公司 The page turning method and device of a kind of e-book
CN106127631A (en) * 2016-06-15 2016-11-16 汤美 The teaching-course manager method and system of network courses
CN106369773A (en) * 2016-11-15 2017-02-01 北京小米移动软件有限公司 Method and device for controlling air supply of air conditioner
CN106792003A (en) * 2016-12-27 2017-05-31 西安石油大学 A kind of intelligent advertisement inserting method, device and server
CN106874185A (en) * 2016-12-27 2017-06-20 中车株洲电力机车研究所有限公司 A kind of automated testing method driven based on voiced keyword and system
CN106940996A (en) * 2017-04-24 2017-07-11 维沃移动通信有限公司 The recognition methods of background music and mobile terminal in a kind of video
WO2018196231A1 (en) * 2017-04-26 2018-11-01 海信集团有限公司 Method for smart terminal displaying user manipulation instruction, and smart terminal
CN108806672A (en) * 2017-04-28 2018-11-13 辛雪峰 A kind of control method for fan of voice double mode
CN107195300A (en) * 2017-05-15 2017-09-22 珠海格力电器股份有限公司 Sound control method and system
CN107195300B (en) * 2017-05-15 2019-03-19 珠海格力电器股份有限公司 Sound control method and system
CN108961525A (en) * 2017-05-17 2018-12-07 北京博瑞彤芸文化传播股份有限公司 A kind of acquisition methods selecting information
CN109036461A (en) * 2017-06-12 2018-12-18 杭州海康威视数字技术股份有限公司 A kind of output method of notification information, server and monitoring system
US11275628B2 (en) 2017-06-12 2022-03-15 Hangzhou Hikvision Digital Technology Co., Ltd. Notification information output method, server and monitoring system
CN107271963A (en) * 2017-06-22 2017-10-20 广东美的制冷设备有限公司 The method and apparatus and air conditioner of auditory localization
CN110310623A (en) * 2017-09-20 2019-10-08 Oppo广东移动通信有限公司 Sample generating method, model training method, device, medium and electronic equipment
CN110310623B (en) * 2017-09-20 2021-12-28 Oppo广东移动通信有限公司 Sample generation method, model training method, device, medium, and electronic apparatus
CN108595143A (en) * 2018-03-30 2018-09-28 联想(北京)有限公司 Electronic equipment, sound pick-up and signal processing method
CN108771491A (en) * 2018-05-24 2018-11-09 宁波国盛电器有限公司 A kind of sandwich unit
CN110634486A (en) * 2018-06-21 2019-12-31 阿里巴巴集团控股有限公司 Voice processing method and device
CN109074808A (en) * 2018-07-18 2018-12-21 深圳魔耳智能声学科技有限公司 Sound control method, control device and storage medium
CN109241332A (en) * 2018-10-19 2019-01-18 广东小天才科技有限公司 It is a kind of to determine semantic method and system by voice
CN109241332B (en) * 2018-10-19 2021-09-24 广东小天才科技有限公司 Method and system for determining semantics through voice
CN109344231A (en) * 2018-10-31 2019-02-15 广东小天才科技有限公司 A kind of method and system of the semantic incomplete corpus of completion
CN109344231B (en) * 2018-10-31 2021-08-17 广东小天才科技有限公司 Method and system for completing corpus of semantic deformity
CN111477206A (en) * 2020-04-16 2020-07-31 北京百度网讯科技有限公司 Noise reduction method and device for vehicle-mounted environment, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN103366740B (en) 2016-12-14

Similar Documents

Publication Publication Date Title
CN103366740A (en) Voice command recognition method and voice command recognition device
CN110085251B (en) Human voice extraction method, human voice extraction device and related products
CN107210040B (en) Method for operating voice function and electronic device supporting the same
US20170140750A1 (en) Method and device for speech recognition
DE102011054197B4 (en) Selective transmission of voice data
US10811005B2 (en) Adapting voice input processing based on voice input characteristics
DE112018006101T5 (en) Dynamic registration of a user-defined wake-up key phrase for a speech-enabled computer system
CN109817220A (en) Audio recognition method, apparatus and system
CN103730120A (en) Voice control method and system for electronic device
CN103165129B (en) Method and system for optimizing voice recognition acoustic model
CN106448654A (en) Robot speech recognition system and working method thereof
CN103971681A (en) Voice recognition method and system
CN103106061A (en) Voice input method and device
CN108461081B (en) Voice control method, device, equipment and storage medium
CN111462741B (en) Voice data processing method, device and storage medium
EP4044178A2 (en) Method and apparatus of performing voice wake-up in multiple speech zones, method and apparatus of performing speech recognition in multiple speech zones, device, and storage medium
CN106228047B (en) A kind of application icon processing method and terminal device
JP2015007850A (en) Filter coefficient group calculation apparatus and filter coefficient group calculation method
CN109979446A (en) Sound control method, storage medium and device
CN115719592A (en) Voice information processing method and device
CN102693721A (en) Simple and easy voice and gender detection device and method
CN111345016A (en) Start control method and start control system of intelligent terminal
CN108231074A (en) A kind of data processing method, voice assistant equipment and computer readable storage medium
US20230015112A1 (en) Method and apparatus for processing speech, electronic device and storage medium
US20230186943A1 (en) Voice activity detection method and apparatus, and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant