CN108986806A - Sound control method and system based on Sounnd source direction - Google Patents
Sound control method and system based on Sounnd source direction Download PDFInfo
- Publication number
- CN108986806A CN108986806A CN201810702505.0A CN201810702505A CN108986806A CN 108986806 A CN108986806 A CN 108986806A CN 201810702505 A CN201810702505 A CN 201810702505A CN 108986806 A CN108986806 A CN 108986806A
- Authority
- CN
- China
- Prior art keywords
- audio
- frequency information
- sound
- instruction
- sound source
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 238000009434 installation Methods 0.000 claims description 3
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 238000004891 communication Methods 0.000 description 25
- 230000006870 function Effects 0.000 description 14
- 230000008569 process Effects 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 5
- 238000010295 mobile communication Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 230000007774 longterm Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 238000004378 air conditioning Methods 0.000 description 2
- 230000005611 electricity Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- XEGGRYVFLWGFHI-UHFFFAOYSA-N bendiocarb Chemical compound CNC(=O)OC1=CC=CC2=C1OC(C)(C)O2 XEGGRYVFLWGFHI-UHFFFAOYSA-N 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 238000009739 binding Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/10—Network architectures or network communication protocols for network security for controlling access to devices or network resources
- H04L63/107—Network architectures or network communication protocols for network security for controlling access to devices or network resources wherein the security policies are location-dependent, e.g. entities privileges depend on current location or allowing specific operations only from locally connected terminals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Abstract
This application provides a kind of sound control method and system based on Sounnd source direction, is related to intelligent terminal technical field, this method comprises: obtaining audio-frequency information;Determine the sound source position of the audio-frequency information;Semantic parsing is carried out to audio-frequency information, generates control instruction;According to sound source position and control instruction, executive control operation.Compared to the prior art, sound control method provided by the present application based on Sounnd source direction, the audio-frequency information of acquisition is analyzed, while generating control instruction, the sound source position of audio-frequency information is determined, according to two aspect content of control instruction and sound source position, execute operation, keep operation more intelligent, closer to actual demand, is also beneficial to improve the safety of operation.
Description
Technical field
This application involves intelligent terminal technical field, more particularly, to a kind of sound control method based on Sounnd source direction and
System.
Background technique
Voice control is a kind of comparative maturity control technology, is widely used in various intelligent terminals, but be applied to vapour
When the intelligence control system of vehicle, since interior space is narrow and closes, personnel's distribution is again relatively intensive, make intelligence control system into
It when row speech recognition, is easy to misread instruction, for example, the conventional dialogue between passenger is identified as audio-frequency information.
In addition, passenger and driver are differentiated to the control authority of intelligence control system, for example, under normal circumstances, department
Machine can consider that passenger is not suitable for voice control, and existing skill by the various vehicle parameters of voice control in traffic safety
Voice control in art cannot achieve above-mentioned difference control.
Summary of the invention
The application's is designed to provide a kind of sound control method and system based on Sounnd source direction, by identifying audio
The sound source position of information, more accurately executive control operation.
To achieve the above object, the sound control method provided by the present application based on Sounnd source direction, comprising:
Obtain audio-frequency information;
Determine the sound source position of the audio-frequency information;
Semantic parsing is carried out to audio-frequency information, generates control instruction;
According to sound source position and control instruction, executive control operation.
In the above-mentioned technical solutions, further, using being mounted on multiple sound receivers of automobile different location simultaneously
It obtains audio-frequency information and the audio-frequency information is determined according to the volume difference for the audio-frequency information that multiple sound receivers receive
Sound source position.
In the above-mentioned technical solutions, further, after the sound source position for determining the audio-frequency information, further includes:
According to sound source position, the identity of audio-frequency information sender is judged;
Judge whether this audio-frequency information is effective according to identity;
When judging that audio-frequency information is effective, ability executive control operation.
In the above-mentioned technical solutions, further, judge whether this audio-frequency information effectively refers to according to identity:
Permission needed for determining the corresponding control operation of the control instruction;
According to the identity of audio-frequency information sender, judge whether the audio-frequency information sender has and have permission;
When having permission, judge that this audio-frequency information is effective.
In the above-mentioned technical solutions, further, according to sound source position, judge whether audio-frequency information sender is driver;
Only when audio-frequency information sender is driver, ability executive control operation.
In the above-mentioned technical solutions, further, after generating control instruction, further includes:
According to preset rules, judge the control instruction for normal instruction or restricted instruction;
When for normal instruction, direct executive control operation;
When for restricted instruction, according to sound source position, the identity of audio-frequency information sender is judged;According to authentication, sentence
Whether staccato frequency delivering person has corresponding operating right;When having operating right, ability executive control operation.
In the above-mentioned technical solutions, further, when control instruction is to open or close vehicle window, executive control operation
Refer to: opening or closing and the hithermost vehicle window of sound source position.
In the above-mentioned technical solutions, further, it is opposite to be separately mounted to interior different seats for multiple sound receivers
Position.
In the above-mentioned technical solutions, further, multiple sound receiver installations are in the car and outside vehicle;Determine the audio
The sound source position of information comprises determining that sound source is in the car or outside vehicle.
In addition, the application provides a kind of speech control system based on Sounnd source direction, comprising:
One memory, is configured as storing data and instruction;
One is established the processor communicated with memory, wherein when executing the instruction in memory, the processor quilt
It is configured that
Obtain audio-frequency information;
Determine the sound source position of the audio-frequency information;
Semantic parsing is carried out to audio-frequency information, generates control instruction;
According to sound source position and control instruction, executive control operation.
Compared to the prior art, the sound control method provided by the present application based on Sounnd source direction believes the audio of acquisition
Breath is analyzed, and while generating control instruction, the sound source position of audio-frequency information is determined, according to control instruction and sound source position
Two aspect contents, execute operation, keep operation more intelligent, closer to actual demand, are also beneficial to improve the safety of operation.
The additional aspect and advantage of the application will become obviously in following description section, or the practice for passing through the application
Recognize.
Detailed description of the invention
It, below will be to specific in order to illustrate more clearly of the application specific embodiment or technical solution in the prior art
Embodiment or attached drawing needed to be used in the description of the prior art be briefly described, it should be apparent that, it is described below
Attached drawing is some embodiments of the application, for those of ordinary skill in the art, before not making the creative labor
It puts, is also possible to obtain other drawings based on these drawings.
Fig. 1 is the illustrative diagram of the Environment System provided according to some embodiments of the present application.
Fig. 2 is the exemplary cell schematic diagram of electronic functionalities configuration shown in FIG. 1.
Fig. 3 is the exemplary stream of the sound control method based on Sounnd source direction provided according to some embodiments of the present application
Cheng Tu;
Fig. 4 is the exemplary stream of the sound control method based on Sounnd source direction provided according to some embodiments of the present application
Cheng Tu;
Fig. 5 is the exemplary stream of the sound control method based on Sounnd source direction provided according to some embodiments of the present application
Cheng Tu.
Specific embodiment
It is described as the application defined in requirement and its equivalent that has the right convenient for Integrated Understanding below with reference to attached drawing
Various embodiments.These embodiments include various specific details in order to understand, but these are considered only as illustratively.Cause
This, it will be appreciated by those skilled in the art that carrying out variations and modifications without departing from this to various embodiments described herein
The scope and spirit of application.In addition, briefly and to be explicitly described the application, the application will be omitted to known function and structure
Description.
The term used in following description and claims and phrase are not limited to literal meaning, and being merely can
Understand and consistently understands the application.Therefore, for those skilled in the art, it will be understood that provide to the various implementations of the application
The description of example is only the purpose to illustrate, rather than limits the application of appended claims and its Equivalent definitions.
Below in conjunction with the attached drawing in some embodiments of the application, technical solutions in the embodiments of the present application carries out clear
Chu is fully described by, it is clear that described embodiments are only a part of embodiments of the present application, instead of all the embodiments.
Based on the embodiment in the application, obtained by those of ordinary skill in the art without making creative efforts all
Other embodiments shall fall in the protection scope of this application.
It should be noted that the term used in the embodiment of the present application is the mesh for being only in description specific embodiment
, it is not intended to be limiting the application." one " of the embodiment of the present application and singular used in the attached claims,
"one", "an", " described " and "the" be also intended to including most forms, unless the context clearly indicates other meaning.Also
It should be appreciated that term "and/or" used herein refer to and include one or more mutually bindings list any of project or
All possible combinations.Express " first ", " second ", " first " and " second " is for modifying respective element without examining
Worry sequence or importance are used only for distinguishing a kind of element and another element, without limiting respective element.
Can be electronic equipment according to the terminal of some embodiments of the application, the electronic equipment may include smart phone,
PC (PC, such as tablet computer, desktop computer, notebook, net book, palm PC PDA), mobile phone, e-book
Reader, portable media player (PMP), audio/video player (MP3/MP4), video camera, virtual reality device
(VR) and the combination of one or more of wearable device etc..According to some embodiments of the present application, the wearable device
Including type of attachment (such as wrist-watch, ring, bracelet, eyes or wear-type device (HMD)), integrated type (such as electronics clothes
Dress), decorated type (such as pad skin, tatoo or built in electronic device) etc. one or more of combination.The application's
In some embodiments, the electronic equipment be can be flexibly, be not limited to above equipment, or can be in above-mentioned various equipment
One or more of combinations.In this application, term " user " can indicate the people using electronic equipment or use electronic equipment
Equipment (such as artificial intelligence electronic equipment).
The embodiment of the present application provides a kind of sound control method based on Sounnd source direction.The application is real in order to facilitate understanding
Example is applied, the embodiment of the present application is described in detail below with reference to attached drawing.
Fig. 1 is the illustrative diagram of the Environment System 100 provided according to some embodiments of the present application.Such as Fig. 1
Shown, Environment System 100 may include electronic equipment 110, network 120 and server 130 etc..Electronic equipment 110 can be with
Including bus 111, processor 112, memory 113, input/output module 114, display 115, communication module 116 and physics
Key 117 etc..In some embodiments of the present application, electronic equipment 110 can be omitted one or more elements, or can be into one
Step includes one or more other elements.
Bus 111 may include circuit.The circuit can be with one or more element (examples in interconnection electronics 110
Such as, bus 111, processor 112, memory 113, input/output module 114, display 115 and communication module 116).It is described
Circuit can also realize communication (for example, obtaining and/or sending number between one or more elements in electronic equipment 110
According to).
Processor 112 may include one or more coprocessors (Co-processor), application processor (AP,
Application Processor) and communication processor (Communication Processor).As an example, processor
112 can execute the control and/or data processing with one or more elements of electronic equipment 110.
Memory 113 can store data.The data may include other with one or more of electronic equipment 110
The relevant instruction of element or data.For example, the data may include the initial data before processor 112 is handled, intermediate data
And/or treated data.Specifically, memory 113 can store photo, image, iris information etc..Memory 113 can
To include impermanent memory memory and/or permanent memory memory.
According to some embodiments of the present application, memory 113 can store software and/or program.Described program can wrap
It includes kernel, middleware, Application Programming Interface (API, Application Programming Interface) and/or applies journey
Sequence.At least part of the kernel, the middleware or the Application Programming Interface may include operating system (OS,
Operating System).As an example, the kernel be can control or be managed for executing other programs (for example, intermediate
Part, Application Programming Interface and application program) in realize operation or function system resource (for example, bus 111, processor
112, memory 113 etc.).In addition, the kernel can provide interface.The interface can by the middleware, described answer
With one or more elements of programming interface or application program access electronic equipment 110 to control or management system resource.
The middleware can be used as the middle layer of data transmission.Data transmission can permit Application Programming Interface or
Application program is with the kernel communication to exchange data.As an example, the middleware can handle from the application program
One or more task requests of acquisition.For example, the middleware can distribute electronic equipment to one or more application program
The priority of 110 system resource (for example, bus 111, processor 112, memory 113 etc.), and processing it is one or
Multiple tasks request.The Application Programming Interface can be the application program for control from the kernel or the middleware
The interface of function is provided.The Application Programming Interface also may include one or more interfaces or function.The function can be used
In security control, communication control, document control, window control, text control, image procossing, signal processing etc..
What input/output module 114 can be inputted to the transmission of the other elements of electronic equipment 110 from user or external equipment
Instruction or data.Input/output module 114 can also be defeated by the instruction or data that obtain from the other elements of electronic equipment 110
Out to user or external equipment.
Display 115 can show content.The content can to user show various types (for example, text, image,
Video, icon and/or symbol).Display 115 may include liquid crystal display (LCD, Liquid Crystal Display),
Light emitting diode (LED, Light-Emitting Diode) display, Organic Light Emitting Diode (OLED, Organic Light
Emitting Diode) display, Micro Electro Mechanical System (MEMS, Micro Electro Mechanical Systems) display
Device or electric paper display etc. or several combinations.Display 115 may include touch screen.In some embodiments, display
115 can show virtual key.The input of the available virtual key of touch screen.Display 115 can pass through the touching
It touches screen and obtains input.The input may include touch input, gesture input, action input, close input, electronic pen or user
The input of body part.
Communication module 116 can configure the communication between equipment.In some embodiments, network environment 100 can be into one
Step includes electronic equipment 140.As an example, the communication between the equipment may include electronic equipment 110 and other equipment (example
Such as, server 130 or electronic equipment 140) between communication.For example, communication module 116 can by wireless communication or cable modem
Letter is connected to network 120, communicates with other equipment (for example, server 130 or electronic equipment 140) realization.
The wireless communication may include microwave communication and/or satellite communication etc..The wireless communication may include honeycomb
Communication is (for example, global mobile communication (GSM, Global System for Mobile Communications), CDMA
(CDMA, Code Division Multiple Access), 3G (Third Generation) Moblie (3G, The 3rd Generation
Telecommunication), forth generation mobile communication (4G), the 5th third-generation mobile communication (5G)), Long Term Evolution (LTE,
Long Term Evolution), Long Term Evolution upgrade version (LTE-A, LTE-Advanced), wideband code division multiple access
(WCDMA, Wideband Code Division Multiple Access), Universal Mobile Communication System (UMTS,
Universal Mobile Telecommunications System), WiMAX (WiBro, Wireless
) etc. or several combinations Broadband.According to some embodiments of the present application, the wireless communication may include wireless local area
Net (WiFi, Wireless Fidelity), bluetooth, low-power consumption bluetooth (BLE, Bluetooth Low Energy), ZigBee protocol
(ZigBee), near-field communication (NFC, Near Field Communication), magnetic safe transmission, radio frequency and body area network (BAN,
Body Area Network) etc. or several combinations.According to some embodiments of the present application, the wire communication may include
Global Navigation Satellite System (Glonass/GNSS, Global Navigation Satellite System), global positioning system
System (GPS, Global Position System), Beidou navigation satellite system or Galileo (European Global Satellite Navigation System)
Deng.The wire communication may include universal serial bus (USB, Universal Serial Bus), high-definition media interface
(HDMI, High-Definition Multimedia Interface), proposed standard 232 (RS-232, Recommend
Standard 232), and/or plain old telephone service (POTS, Plain Old Telephone Service) etc. in one
Kind or several combinations.
Secondary or physical bond 117 can be used for user's interaction.Secondary or physical bond 117 may include one or more entity keys.In some realities
It applies in example, user can be with the function of customized secondary or physical bond 117.
Network 120 may include communication network.The communication network may include computer network (for example, local area network
(LAN, Local Area Network) or wide area network (WAN, Wide Area Network)), internet and/or telephone network
Deng or several combinations.Network 120 can be to the other equipment in Environment System 100 (for example, electronic equipment 110, clothes
Business device 130, electronic equipment 140 etc.) send information.
Server 130 can connect the other equipment in Environment System 100 (for example, electronic equipment by network 120
110, electronic equipment 140 etc.).
Electronic equipment 140 can be identical or different with electronic equipment 110 type.According to some embodiments of the present application,
Some or all of execution operation can be in another equipment or multiple equipment (for example, electronic equipment 140 in electronic equipment 110
And/or server 130) in execute.In some embodiments, when electronic equipment 110 be automatically or in response to request execute it is a kind of or
When multiple functions and/or service, electronic equipment 110 can request other equipment (for example, electronic equipment 140 and/or server
130) substitution executes function and/or service.In some embodiments, electronic equipment 110 is in addition to executing function or service, further
Execute relative one or more functions.In some embodiments, other equipment are (for example, electronic equipment 140 and/or clothes
Business device 130) requested function or other relevant one or more functions can be executed, implementing result can be sent to electricity
Sub- equipment 110.Electronic equipment 110 can repeat result or be further processed implementing result, to provide requested function
Or service.
It should be noted that the description for Environment System 100 above only for convenience of description can not be this Shen
It please be limited within the scope of illustrated embodiment.It is appreciated that the principle based on this system can for those skilled in the art
Any combination can be carried out to each element, or constitute subsystem and connect with other elements under the premise of without departing substantially from the principle,
Various modifications and variations in form and details are carried out to the implementation above method and systematic difference field.For example, network environment
System 100 may further include database etc..Suchlike deformation, within the scope of protection of this application.
Fig. 2 is the exemplary cell block diagram of the electronic functionalities configuration provided according to some embodiments of the present application.Such as
Shown in Fig. 2, processor 112 may include processing module 200, and the processing module 200 may include acquiring unit 210, analysis
Unit 220, control unit 230.
According to some embodiments of the present application, the available information of acquiring unit 210.The information may include but unlimited
In text, picture, audio, video, movement, gesture etc. or several combinations.In some embodiments, acquiring unit 210 can be with
Input information is obtained by input/output module 114, the touch screen of display 115 and/or secondary or physical bond 117.As an example, obtaining
Take the input information of the available electronic equipment 110 of unit 210.The input information may include key-press input, touch-control input,
Gesture input, action input, remote input, transmission input etc. or several combinations.
In some embodiments, the available audio-frequency information of acquiring unit 210, audio-frequency information derive from and are mounted on automobile not
With multiple sound receivers of position.
According to some embodiments of the present application, analytical unit 220 can at least be carried out the information that acquiring unit 210 obtains
Analysis.In some embodiments, analytical unit 220 can analyze the audio-frequency information of the acquisition of acquiring unit 210, with the determination sound
The control instruction for including in the sound source position and audio-frequency information of frequency information.
According to some embodiments of the present application, control unit 230 can control electricity according to the analysis result of analytical unit 220
Sub- equipment.The controlling electronic devices may include that controlling electronic devices 110 executes movement.
In some embodiments, control unit 230 can according to analytical unit 220 to the analysis of image information as a result, control
Automobile processed executes operation.For example, open vehicle window, open multimedia, ring loudspeaker, control car light etc..
It should be noted that the unit in processing module 200 is described above, it only for convenience of description, can not be this
Application is limited within the scope of illustrated embodiment.It is appreciated that for those skilled in the art, the principle based on this system,
Any combination may be carried out to each unit, or constitute submodule and other units company under the premise of without departing substantially from the principle
It connects, various modifications and variations in form and details is carried out to the function of implementing above-mentioned module and unit.For example, electronic equipment
110 may further include sensor etc., and acquiring unit 210 can obtain information by sensor.In another example processing unit
220 may further include division subelement etc..Suchlike deformation, within the scope of protection of this application.
Fig. 3 is the exemplary stream of the sound control method based on Sounnd source direction provided according to some embodiments of the present application
Cheng Tu.As shown in figure 3, process 300 can be realized by processing module 200.
In step 310, audio-frequency information is obtained.
Audio-frequency information is phonetic order.According to some embodiments of the present application, audio-frequency information derives from and is mounted on automobile not
With multiple sound receivers of position, such as microphone.
In step 320, the sound source position of the audio-frequency information is determined.
According to some embodiments of the present application, using the multiple sound receivers for being mounted on automobile different location while obtaining
Audio-frequency information determines the sound source of the audio-frequency information according to the volume difference for the audio-frequency information that multiple sound receivers receive
Position.
Furthermore, it is understood that in some embodiments, it is opposite that multiple sound receivers are separately mounted to interior different seats
Position, can more accurately determine be which seat passenger issue audio-frequency information, in order to according to different seats execute not
Biconditional operation.
Furthermore, it is understood that in some embodiments, multiple sound receiver installations are in the car and outside vehicle;Determine the audio
The sound source position of information comprises determining that sound source is the sound in the car or outside vehicle, outside interior audio-frequency information sender and vehicle
Frequency delivering person is corresponding to execute operation difference.
In step 330, semantic parsing is carried out to audio-frequency information, generates control instruction.
Control instruction can execute various operations by the central control system of automobile, such as: it opens or closes vehicle window, open
Or close virgin lock, open or close air-conditioning, open or close double sudden strains of a muscle, ring loudspeaker etc..
Operation is executed according to sound source position and control instruction in step 340.
According to some embodiments of the present application, when control instruction is to open or close vehicle window, execute operation and refer to: opening or
It closes and the hithermost vehicle window of sound source position.Above scheme is more accurate to the control of vehicle window, more intelligently, more meets practical need
It asks.
According to some embodiments of the present application, when control instruction is to open air-conditioning, executing operation includes: for some position
The passenger set carries out the adjustment in interior air-conditioner air outlet direction.
According to some embodiments of the present application, car has multiple display screens, and user opens/closes display by phonetic order
Screen, judges the position of passenger in the car at this time, opened according to command information/close display screen;For example user needs to see that Shanghai and Shenzhen refers to
When number, display screen is waken up by voice, judges the location of user at this time, opens the display screen of user present position.
According to some embodiments of the present application, in order to guarantee safety, need to judge to determine that sound source is located at interior or vehicle
Outside, operation can just only be executed when sound source is located at interior.
Compared to the prior art, the sound control method provided by the present application based on Sounnd source direction believes the audio of acquisition
Breath is analyzed, and while generating control instruction, the sound source position of audio-frequency information is determined, according to control instruction and sound source position
Two aspect contents, execute operation, keep operation more intelligent, closer to actual demand, are also beneficial to improve the safety of operation.
Fig. 4 is the exemplary stream of the sound control method based on Sounnd source direction provided according to some embodiments of the present application
Cheng Tu.As shown in figure 4, process 300 can be realized by processing module 200.
In step 410, audio-frequency information is obtained.The content of step 410 is identical as above-mentioned steps 310, therefore details are not described herein.
In step 420, the sound source position of the audio-frequency information is determined.The content of step 420 is identical as above-mentioned steps 320,
Therefore details are not described herein.
The identity of audio-frequency information sender is judged according to sound source position in step 430.
According to some embodiments of the present application, judges that audio-frequency information sender is driver or passenger, can also preset
Different seats correspond to different identity.
In step 440, judge whether this audio-frequency information is effective according to identity;When judging that audio-frequency information is effective, just hold
Row step 450.
According to some embodiments of the present application, the audio-frequency information sender of different identity has different permissions.It is specific next
It says, in some embodiments, only when the identity of audio-frequency information sender is driver, just executes step 450.
In step 450, semantic parsing is carried out to audio-frequency information, generates control instruction.The content and above-mentioned steps of step 450
330 is identical, therefore details are not described herein.
Operation is executed according to sound source position and control instruction in step 460.The content and above-mentioned steps of step 460
340 is identical, therefore details are not described herein.
Fig. 5 is the exemplary stream of the sound control method based on Sounnd source direction provided according to some embodiments of the present application
Cheng Tu.As shown in figure 5, process 300 can be realized by processing module 200.
In step 510, audio-frequency information is obtained.The content of step 510 is identical as above-mentioned steps 310, therefore details are not described herein.
In step 520, the sound source position of the audio-frequency information is determined.The content of step 520 is identical as above-mentioned steps 320,
Therefore details are not described herein.
In step 530, semantic parsing is carried out to audio-frequency information, generates control instruction.The content and above-mentioned steps of step 530
330 is identical, therefore details are not described herein.
In step 540, judge whether the control instruction is restricted instruction.
When for restricted instruction, step 550 is executed;When for untethered instruction, i.e. normal instruction, 560 are thened follow the steps;
Normal instruction and restricted instruction can be system default setting, also can independently be set by user.For example, can will close
It is to be set as restricted instruction to the control instruction of traffic safety, sets normal instruction for other instructions.
The identity of audio-frequency information sender is judged according to sound source position in step 550;
Judge whether audio-frequency information sender has corresponding operating right according to authentication in step 560;Work as tool
When standby operating right, step 570 is executed.
Operation is executed according to sound source position and control instruction in step 570.The content and above-mentioned steps of step 470
340 is identical, therefore details are not described herein.
Compared with embodiment shown in Fig. 4, sound control method shown in fig. 5, without equal for all control instructions
It carries out authentication and carries out relevant authentication only to the control instruction of traffic safety is related to, a certain amount of fortune can be saved
It calculates, and then improves operating efficiency.
It should be noted that the above embodiments are intended merely as example, the application is not limited to such example, but can
To carry out various change.
It should be noted that in the present specification, the terms "include", "comprise" or its any other variant are intended to
Non-exclusive inclusion, so that the process, method, article or equipment including a series of elements is not only wanted including those
Element, but also including other elements that are not explicitly listed, or further include for this process, method, article or equipment
Intrinsic element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that
There is also other identical elements in process, method, article or equipment including the element.
Finally, it is to be noted that, it is above-mentioned it is a series of processing not only include with sequence described here in temporal sequence
The processing of execution, and the processing including executing parallel or respectively rather than in chronological order.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with
It is completed by the relevant hardware of computer program instructions, the program can be stored in a computer readable storage medium,
The program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, the storage medium can for magnetic disk,
CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM)
Deng.
Above disclosed is only some preferred embodiments of the application, and the right model of the application cannot be limited with this
It encloses, those skilled in the art can understand all or part of the processes for realizing the above embodiment, and wants according to the application right
Made equivalent variations is sought, is still belonged to the scope covered by the invention.
Claims (10)
1. a kind of sound control method based on Sounnd source direction characterized by comprising
Obtain audio-frequency information;
Determine the sound source position of the audio-frequency information;
Semantic parsing is carried out to audio-frequency information, generates control instruction;
According to sound source position and control instruction, executive control operation.
2. the sound control method according to claim 1 based on Sounnd source direction, which is characterized in that using being mounted on automobile
Multiple sound receivers of different location obtain audio-frequency information simultaneously, the audio-frequency information received according to multiple sound receivers
Volume difference determines the sound source position of the audio-frequency information.
3. the sound control method according to claim 1 based on Sounnd source direction, which is characterized in that determining the audio
After the sound source position of information, further includes:
According to sound source position, the identity of audio-frequency information sender is judged;
Judge whether this audio-frequency information is effective according to identity;
When judging that audio-frequency information is effective, ability executive control operation.
4. the sound control method according to claim 3 based on Sounnd source direction, which is characterized in that should according to identity judgement
Whether audio-frequency information effectively refers to:
Permission needed for determining the corresponding control operation of the control instruction;
According to the identity of audio-frequency information sender, judge whether the audio-frequency information sender has and have permission;
When having permission, judge that this audio-frequency information is effective.
5. the sound control method according to claim 3 based on Sounnd source direction, which is characterized in that according to sound source position,
Judge whether audio-frequency information sender is driver;
Only when audio-frequency information sender is driver, ability executive control operation.
6. the sound control method according to claim 1 based on Sounnd source direction, which is characterized in that generating control instruction
Afterwards, further includes:
According to preset rules, judge the control instruction for normal instruction or restricted instruction;
When for normal instruction, direct executive control operation;
When for restricted instruction, according to sound source position, the identity of audio-frequency information sender is judged;According to authentication, sound is judged
Whether frequency delivering person has corresponding operating right;When having operating right, ability executive control operation.
7. the sound control method according to claim 1 based on Sounnd source direction, which is characterized in that when control instruction is to beat
When vehicle window is closed on or off, executive control operation refers to: opening or closing and the hithermost vehicle window of sound source position.
8. the sound control method according to claim 7 based on Sounnd source direction, which is characterized in that multiple sound receivers
It is separately mounted to the opposite position in interior different seats.
9. the sound control method according to claim 2 based on Sounnd source direction, which is characterized in that multiple sound receivers
Installation is in the car and outside vehicle;Determine that the sound source position of the audio-frequency information comprises determining that sound source is in the car or outside vehicle.
10. a kind of speech control system based on Sounnd source direction characterized by comprising
One memory, is configured as storing data and instruction;
One is established the processor communicated with memory, wherein when executing the instruction in memory, the processor is configured
Are as follows:
Obtain audio-frequency information;
Determine the sound source position of the audio-frequency information;
Semantic parsing is carried out to audio-frequency information, generates control instruction;
According to sound source position and control instruction, executive control operation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810702505.0A CN108986806A (en) | 2018-06-30 | 2018-06-30 | Sound control method and system based on Sounnd source direction |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810702505.0A CN108986806A (en) | 2018-06-30 | 2018-06-30 | Sound control method and system based on Sounnd source direction |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108986806A true CN108986806A (en) | 2018-12-11 |
Family
ID=64539692
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810702505.0A Pending CN108986806A (en) | 2018-06-30 | 2018-06-30 | Sound control method and system based on Sounnd source direction |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108986806A (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109606260A (en) * | 2018-12-26 | 2019-04-12 | 北京蓦然认知科技有限公司 | A kind of method and device of the outer auditory tone cues of vehicle |
CN109781134A (en) * | 2018-12-29 | 2019-05-21 | 百度在线网络技术(北京)有限公司 | Navigation control method, device, engine end and storage medium |
CN109801623A (en) * | 2018-12-20 | 2019-05-24 | 合肥凌极西雅电子科技有限公司 | A kind of interactive projection equipment intelligent control method and system |
CN110001549A (en) * | 2019-04-17 | 2019-07-12 | 百度在线网络技术(北京)有限公司 | Method for controlling a vehicle and device |
CN111599366A (en) * | 2020-05-19 | 2020-08-28 | 科大讯飞股份有限公司 | Vehicle-mounted multi-sound-zone voice processing method and related device |
CN111653277A (en) * | 2020-06-10 | 2020-09-11 | 北京百度网讯科技有限公司 | Vehicle voice control method, device, equipment, vehicle and storage medium |
CN111660773A (en) * | 2020-05-29 | 2020-09-15 | 奇瑞汽车股份有限公司 | Sound control window method and system applied to automobile |
CN111948807A (en) * | 2019-05-14 | 2020-11-17 | Oppo广东移动通信有限公司 | Control method, control device, wearable device and storage medium |
WO2022001347A1 (en) * | 2020-07-03 | 2022-01-06 | 华为技术有限公司 | In-vehicle voice instruction control method, and related device |
CN114242072A (en) * | 2021-12-21 | 2022-03-25 | 上海帝图信息科技有限公司 | Voice recognition system for intelligent robot |
CN115214541A (en) * | 2022-08-10 | 2022-10-21 | 海南小鹏汽车科技有限公司 | Vehicle control method, vehicle, and computer-readable storage medium |
WO2023116087A1 (en) * | 2021-12-21 | 2023-06-29 | 北京地平线机器人技术研发有限公司 | Processing method and apparatus for speech interaction instruction, and computer-readable storage medium |
WO2024051592A1 (en) * | 2022-09-05 | 2024-03-14 | 华为技术有限公司 | Vehicle control method and control apparatus |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003345389A (en) * | 2002-05-22 | 2003-12-03 | Nissan Motor Co Ltd | Voice recognition device |
JP2009020423A (en) * | 2007-07-13 | 2009-01-29 | Fujitsu Ten Ltd | Speech recognition device and speech recognition method |
US9583119B2 (en) * | 2015-06-18 | 2017-02-28 | Honda Motor Co., Ltd. | Sound source separating device and sound source separating method |
CN106878281A (en) * | 2017-01-11 | 2017-06-20 | 上海蔚来汽车有限公司 | In-car positioner, method and vehicle-mounted device control system based on mixed audio |
CN107554456A (en) * | 2017-08-31 | 2018-01-09 | 上海博泰悦臻网络技术服务有限公司 | Vehicle-mounted voice control system and its control method |
-
2018
- 2018-06-30 CN CN201810702505.0A patent/CN108986806A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003345389A (en) * | 2002-05-22 | 2003-12-03 | Nissan Motor Co Ltd | Voice recognition device |
JP2009020423A (en) * | 2007-07-13 | 2009-01-29 | Fujitsu Ten Ltd | Speech recognition device and speech recognition method |
US9583119B2 (en) * | 2015-06-18 | 2017-02-28 | Honda Motor Co., Ltd. | Sound source separating device and sound source separating method |
CN106878281A (en) * | 2017-01-11 | 2017-06-20 | 上海蔚来汽车有限公司 | In-car positioner, method and vehicle-mounted device control system based on mixed audio |
CN107554456A (en) * | 2017-08-31 | 2018-01-09 | 上海博泰悦臻网络技术服务有限公司 | Vehicle-mounted voice control system and its control method |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109801623A (en) * | 2018-12-20 | 2019-05-24 | 合肥凌极西雅电子科技有限公司 | A kind of interactive projection equipment intelligent control method and system |
CN109606260A (en) * | 2018-12-26 | 2019-04-12 | 北京蓦然认知科技有限公司 | A kind of method and device of the outer auditory tone cues of vehicle |
CN109781134A (en) * | 2018-12-29 | 2019-05-21 | 百度在线网络技术(北京)有限公司 | Navigation control method, device, engine end and storage medium |
CN110001549A (en) * | 2019-04-17 | 2019-07-12 | 百度在线网络技术(北京)有限公司 | Method for controlling a vehicle and device |
CN111948807A (en) * | 2019-05-14 | 2020-11-17 | Oppo广东移动通信有限公司 | Control method, control device, wearable device and storage medium |
CN111599366A (en) * | 2020-05-19 | 2020-08-28 | 科大讯飞股份有限公司 | Vehicle-mounted multi-sound-zone voice processing method and related device |
CN111599366B (en) * | 2020-05-19 | 2024-04-12 | 科大讯飞股份有限公司 | Vehicle-mounted multitone region voice processing method and related device |
CN111660773B (en) * | 2020-05-29 | 2023-02-03 | 奇瑞汽车股份有限公司 | Sound control window method and system applied to automobile |
CN111660773A (en) * | 2020-05-29 | 2020-09-15 | 奇瑞汽车股份有限公司 | Sound control window method and system applied to automobile |
CN111653277A (en) * | 2020-06-10 | 2020-09-11 | 北京百度网讯科技有限公司 | Vehicle voice control method, device, equipment, vehicle and storage medium |
WO2022001347A1 (en) * | 2020-07-03 | 2022-01-06 | 华为技术有限公司 | In-vehicle voice instruction control method, and related device |
WO2023116087A1 (en) * | 2021-12-21 | 2023-06-29 | 北京地平线机器人技术研发有限公司 | Processing method and apparatus for speech interaction instruction, and computer-readable storage medium |
CN114242072A (en) * | 2021-12-21 | 2022-03-25 | 上海帝图信息科技有限公司 | Voice recognition system for intelligent robot |
CN115214541A (en) * | 2022-08-10 | 2022-10-21 | 海南小鹏汽车科技有限公司 | Vehicle control method, vehicle, and computer-readable storage medium |
CN115214541B (en) * | 2022-08-10 | 2024-01-09 | 海南小鹏汽车科技有限公司 | Vehicle control method, vehicle, and computer-readable storage medium |
WO2024051592A1 (en) * | 2022-09-05 | 2024-03-14 | 华为技术有限公司 | Vehicle control method and control apparatus |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108986806A (en) | Sound control method and system based on Sounnd source direction | |
CN107657953A (en) | Sound control method and system | |
CN108303903A (en) | The control method and system of smart home | |
CN109492412A (en) | The encryption storage method and system of file | |
CN108228811A (en) | Information recommendation method and system | |
CN108319408A (en) | Stereogram target operating method and system | |
CN108921855A (en) | Image processing method and system based on information | |
CN107846508A (en) | For the assisted memory method and system of forgetful crowd | |
CN107786979A (en) | A kind of multiple terminals shared communication method and system | |
CN107423585A (en) | The concealed application method and system of a kind of application | |
CN109714479A (en) | Conducive to the terminal control method and system improved efficiency | |
CN208673193U (en) | A kind of intelligent multimedia system | |
CN108897479A (en) | A kind of terminal touch control method and system | |
CN109189536A (en) | A kind of terminal applies display methods and system | |
CN108428455A (en) | The acquisition method and system of vocal print feature | |
CN108040088A (en) | Event arrangement method and system based on stroke route | |
CN108021350A (en) | A kind of terminal output volume method of adjustment and system | |
CN108874465A (en) | A kind of application starting method and system based on caching | |
CN108010519A (en) | A kind of information search method and system | |
CN108536409A (en) | A kind of terminal display adjusting method and system | |
CN107395900A (en) | The multiple based reminding method of missed call | |
CN107071182A (en) | A kind of communication means | |
CN108881417A (en) | A kind of user interaction approach and system based on local area network | |
CN108509017A (en) | A kind of control method and system of terminal applies | |
CN109101292A (en) | A kind of terminal shortcut operation method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20181211 |
|
WD01 | Invention patent application deemed withdrawn after publication |