CN108389584A - Sound analysis method and device - Google Patents

Info

Publication number
CN108389584A
CN108389584A (application CN201810096118.7A)
Authority
CN
China
Prior art keywords
audio
frequency information
sound
categories
information
Prior art date
Legal status
Granted
Application number
CN201810096118.7A
Other languages
Chinese (zh)
Other versions
CN108389584B (en)
Inventor
Yuan Hui (袁晖)
Current Assignee
Shenzhen Comexe Ikang Science And Technology Co Ltd
Original Assignee
Shenzhen Comexe Ikang Science And Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Shenzhen Comexe Ikang Science And Technology Co Ltd filed Critical Shenzhen Comexe Ikang Science And Technology Co Ltd
Priority to CN201810096118.7A priority Critical patent/CN108389584B/en
Priority to PCT/CN2018/091108 priority patent/WO2019148737A1/en
Publication of CN108389584A publication Critical patent/CN108389584A/en
Application granted granted Critical
Publication of CN108389584B publication Critical patent/CN108389584B/en
Legal status: Active

Classifications

    • G10L 21/0208 — Noise filtering (speech enhancement, e.g. noise reduction or echo cancellation)
    • G06F 16/638 — Presentation of query results (information retrieval of audio data)
    • G06F 16/683 — Retrieval characterised by metadata automatically derived from the audio content
    • G10L 17/26 — Recognition of special voice characteristics, e.g. for use in lie detectors; recognition of animal voices
    • G10L 21/028 — Voice signal separating using properties of the sound source
    • G10L 25/51 — Speech or voice analysis techniques specially adapted for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Stereophonic System (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

The present invention provides a sound analysis method and device. Audio information to be analyzed is obtained, the audio information containing audio of multiple different audio categories; the audio category of each piece of audio information is determined; feature information of the audio information, including source distance and decibel value, is obtained according to the audio category; and the audio information is distributed into layers of a sound map according to its source distance, the sound map being layered by distance. With the sound analysis method and device provided by the present invention, sounds from different sources are displayed in distance layers on the sound map, giving a display that is both intuitive and rich in content.

Description

Sound analysis method and device
Technical field
The present invention relates to the field of voice recognition technology, and in particular to a sound analysis method and device.
Background technology
Video monitoring is an important source of information for safety management and dispute handling in many industries. However, video monitoring has blind spots: persons with bad intentions can deliberately avoid monitored areas when carrying out illegal activities, which greatly hinders discovery of the truth.
Sound propagation, by contrast, has almost no blind areas, and small-scale obstructions do not cause significant masking. Analysis of sound can therefore provide a valuable supplement of information.
The sound maps in common use today generally refer to city or regional noise maps, in which the magnitude of noise at each location is presented on a map with different visual effects, showing the noise distribution intuitively. However, such sound maps present little information: they only express the decibel value of sound and are generally used merely to judge the distribution of noise; the sound itself cannot be further analyzed or exploited.
Summary of the invention
The main object of the present invention is to provide a sound analysis method that distributes audio of different distances into a sound map.
The present invention proposes a sound analysis method, comprising the following steps:
obtaining audio information to be analyzed, the audio information to be analyzed containing audio of multiple different audio categories;
determining the audio category of each piece of audio information;
obtaining feature information of the audio information according to the audio category, the feature information including source distance and decibel value;
distributing the audio information into the layers of a sound map according to its source distance, wherein the sound map is layered by distance.
Further, after the step of distributing the audio information into the layers of the sound map according to its source distance, the method includes:
when a preset trigger signal is received on a layer, performing source denoising and audio optimization on the audio information in that layer, and playing the processed audio information.
Further, after the step of distributing the audio information into the layers of the sound map according to its source distance, the method includes:
if audio information of multiple audio categories is distributed in one layer, displaying the audio information corresponding to each of the multiple audio categories separately.
Further, the step of determining the audio category of each piece of audio information includes:
comparing the audio information with feature audio pre-stored in a database to determine its audio category.
Further, after the step of obtaining the audio information to be analyzed, the method further includes:
obtaining the source location and time information of the audio information;
selecting first audio categories according to the source location and time information;
the step of determining the audio category of each piece of audio information then includes:
comparing the frequency of the audio information with the frequency of the feature audio corresponding to the first audio categories pre-stored in the database, and calculating the similarity between the audio information and the feature audio;
when the similarity reaches a preset value, determining the audio category of the audio information.
The present invention also provides a sound analysis device, including:
a first acquisition unit for obtaining audio information to be analyzed, the audio information to be analyzed containing audio of multiple different audio categories;
a determination unit for determining the audio category of the audio information;
a second acquisition unit for obtaining feature information of the audio information according to the audio category, the feature information including source distance and decibel value;
a distribution unit for distributing the audio information into the layers of a sound map according to its source distance, wherein the sound map is layered by distance.
Further, the device includes:
a playback unit for, when a preset trigger signal is received on a layer, performing source denoising and audio optimization on the audio information in that layer, and playing the processed audio information.
Further, the device includes:
a separation unit for, if audio information of multiple audio categories is distributed in one layer, displaying the audio information corresponding to each of the multiple audio categories separately.
Further, the determination unit is specifically used for:
comparing the audio information with feature audio pre-stored in a database to determine its audio category.
Further, the device includes:
a third acquisition unit for obtaining the source location and time information of the audio information;
a selection unit for selecting first audio categories according to the source location and time information;
the determination unit then includes:
a comparison subunit for comparing the frequency of the audio information with the frequency of the feature audio corresponding to the first audio categories pre-stored in the database, and calculating the similarity between the audio information and the feature audio;
a determination subunit for determining the audio category of the audio information when the similarity reaches a preset value.
The sound analysis method and device provided by the present invention have the following beneficial effects:
audio information to be analyzed is obtained, the audio information containing audio of multiple different audio categories; the audio category of each piece of audio information is determined; feature information of the audio information, including source distance and decibel value, is obtained according to the audio category; and the audio information is distributed into the layers of a sound map according to its source distance, the sound map being layered by distance. Sounds from different sources are displayed in distance layers on the sound map, giving a display that is both intuitive and rich in content.
Description of the drawings
Fig. 1 is a schematic diagram of the steps of the sound analysis method in one embodiment of the present invention;
Fig. 2 is a schematic diagram of the steps of the sound analysis method in another embodiment of the present invention;
Fig. 3 is a schematic diagram of the steps of the sound analysis method in a further embodiment of the present invention;
Fig. 4 is a structural diagram of the sound analysis device in one embodiment of the present invention;
Fig. 5 is a structural diagram of the sound analysis device in another embodiment of the present invention;
Fig. 6 is a structural diagram of the determination unit in one embodiment of the present invention.
The realization of the object, the functions and the advantages of the present invention will be further described below in conjunction with the embodiments and with reference to the accompanying drawings.
Detailed description of the embodiments
It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention and are not intended to limit it.
Those skilled in the art will appreciate that, unless expressly stated otherwise, the singular forms "a", "an", "the", "said" and "above-mentioned" used herein may also include the plural forms. It should be further understood that the word "comprising" used in the specification of the present invention means that the stated features, integers, steps, operations, elements, units, modules and/or components are present, but does not preclude the presence or addition of one or more other features, integers, steps, operations, elements, units, modules, components and/or groups thereof. It should be understood that when an element is said to be "connected" or "coupled" to another element, it can be directly connected or coupled to the other element, or intermediate elements may be present. In addition, "connection" or "coupling" as used herein may include a wireless connection or wireless coupling. The word "and/or" used herein includes all or any unit of, and all combinations of, one or more of the associated listed items.
Those skilled in the art will appreciate that, unless otherwise defined, all terms used herein (including technical and scientific terms) have the same meaning as generally understood by one of ordinary skill in the field to which the present invention belongs. It should also be understood that terms such as those defined in common dictionaries should be understood to have meanings consistent with their meaning in the context of the prior art, and unless specifically defined as herein, will not be interpreted in an idealized or excessively formal sense.
Referring to Fig. 1, which is a schematic diagram of the steps of the sound analysis method in one embodiment of the present invention.
One embodiment of the present invention proposes a sound analysis method, applied to an intelligent terminal, including the following steps:
Step S1: obtaining audio information to be analyzed, the audio information to be analyzed containing audio of multiple different audio categories;
Step S2: determining the audio category of each piece of audio information;
Step S3: obtaining feature information of the audio information according to the audio category, the feature information including source distance and decibel value;
Step S4: distributing the audio information into the layers of a sound map according to its source distance, wherein the sound map is layered by distance.
A conventional sound map usually shows only the decibel values of various sounds, so the information displayed is limited. In this embodiment, a segment of audio information containing audio of multiple audio categories is obtained. An audio category in this embodiment is delimited according to the sound source of the audio: for example, the sound made by a moving vehicle, the wind, birdsong and the speech of pedestrians are all different audio categories. The audio information to be analyzed is analyzed, and the audio category of each piece of audio information it contains is determined.
It is understood that some audio, such as the sound of wind or thunder, may have no definite source distance; in this embodiment the source distance of such audio is set to infinity. The source distance obtained thus differs with the audio category, and in this embodiment it is determined according to different rules depending on the category. Source distance is one item of feature information of the audio information; the feature information may also include the audio category, decibel value, loudness, clarity, etc. Specifically, the source distance of the audio information can be determined by sound source localization.
Finally, in this embodiment the sound map is provided with multiple layers by source distance, each layer representing audio information of a different source distance. The audio information whose source distance has been obtained is distributed into the sound map according to the corresponding source distance; meanwhile, feature information such as the audio category and decibel value corresponding to the audio information is also included in the sound map, so that the relevant information of the audio can be viewed intuitively.
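As an illustration only (the patent gives no implementation; the layer boundaries, field names and values below are assumptions), steps S1–S4 can be sketched as assigning each classified piece of audio to a distance layer of the sound map:

```python
import math

# Hypothetical distance boundaries (metres) separating the map's layers.
LAYER_BOUNDS = [10.0, 50.0, 200.0, 1000.0]

def layer_for_distance(distance):
    """Return the index of the sound-map layer for a source distance.
    Audio with no definite source (wind, thunder) uses distance=inf
    and falls into the outermost layer."""
    for i, bound in enumerate(LAYER_BOUNDS):
        if distance < bound:
            return i
    return len(LAYER_BOUNDS)  # outermost layer, including infinity

def build_sound_map(audio_items):
    """audio_items: list of dicts with 'category', 'distance', 'decibel'.
    Returns {layer_index: [items]}, keeping category/decibel info so the
    map can display it alongside the layer."""
    layers = {}
    for item in audio_items:
        layers.setdefault(layer_for_distance(item["distance"]), []).append(item)
    return layers

items = [
    {"category": "voice",   "distance": 5.0,      "decibel": 60},
    {"category": "traffic", "distance": 120.0,    "decibel": 75},
    {"category": "wind",    "distance": math.inf, "decibel": 40},
]
sound_map = build_sound_map(items)
```

Each layer then holds the audio of one distance band together with its feature information, matching the layered display described above.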
After step S2, the method may also include the following step:
processing the audio information in a predetermined manner according to its audio category. In this embodiment, different processing routines are used for audio information of different audio categories; the processing routines mainly include sound source localization, source denoising and audio optimization.
For example, in this embodiment, the following modes of processing audio information of different audio categories are preset:
(1) Voice: judge the number of voice sources, judge the direction and distance of each, and optimize each voice in terms of clarity and sound intensity before presentation.
(2) Traffic sound: traffic sound mainly includes the three classes of land, sea and air, covering the traffic noise of automobiles, trains, ships, aircraft, etc., as well as the friction of wheels against the ground or rails, engine sound, exhaust sound, horn sound, etc. For traffic sound, the type of traffic sound in the audio information is judged, and the audio is optimized before presentation.
(3) Construction-site sound: includes the operating noise of construction equipment such as pile drivers and bulldozers. For construction-site sound, mainly the source distance and audio category are judged.
(4) Music: in an ordinary environment, music usually comes from loudspeakers played by businesses or individuals. For music, mainly the source distance and audio category are judged.
(5) Natural sound: includes the sound of wind and rain caused by the weather, thunder, flowing water, etc. When the sound of a certain characteristic frequency is judged to be a natural sound, no source distance is judged; it is directly displayed as a sound at infinite distance.
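The five category-specific routines above can be sketched as a simple dispatch table. The function names and returned fields below are invented for the example and are not the patent's implementation:

```python
def process_voice(audio):
    # (1) Judge number of speakers, direction and distance; optimize clarity/intensity.
    return {"localize": True, "optimize": ["clarity", "intensity"]}

def process_traffic(audio):
    # (2) Identify the traffic type (land/sea/air) and optimize before presentation.
    return {"localize": True, "optimize": ["denoise"]}

def process_distance_and_category(audio):
    # (3)/(4) Construction-site sound and music: judge source distance and category only.
    return {"localize": True, "optimize": []}

def process_natural(audio):
    # (5) Natural sounds: no distance judgment; display at infinite distance.
    return {"localize": False, "distance": float("inf")}

DISPATCH = {
    "voice": process_voice,
    "traffic": process_traffic,
    "construction": process_distance_and_category,
    "music": process_distance_and_category,
    "natural": process_natural,
}

def process(audio):
    """Route a classified audio item to its category-specific routine."""
    return DISPATCH[audio["category"]](audio)

result = process({"category": "natural"})
```

The table makes the "different processing routines for different categories" explicit: adding a category means adding one entry, without touching the other routines.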
The processing of the audio information may take place before the audio information is distributed into the layers of the sound map, or after distribution. If processing takes place before distribution, then when a preset trigger signal is received on a layer, the audio information in that layer is played directly. If processing takes place after distribution, then, referring to Fig. 2, in one embodiment, after step S4 of distributing the audio information into the layers of the sound map according to its source distance, the method includes:
Step S5: when a preset trigger signal is received on a layer, performing source denoising and audio optimization on the audio information in that layer, and playing the processed audio information.
After the audio information is distributed in layers on the sound map, the user operates the intelligent terminal to trigger the preset trigger signal; when the intelligent terminal receives the trigger signal on the region occupied by a layer, it automatically plays the audio information distributed in that layer. The preset trigger signal may be set to a single-click trigger, a double-click trigger, etc. Preferably, before the audio information is played, it is filtered, denoised and optimized.
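As a minimal sketch of the pre-playback denoising step (a real system would use spectral or source-separation methods; the frame size and threshold here are arbitrary assumptions), a simple noise gate silences low-amplitude frames before playback:

```python
def noise_gate(samples, threshold, frame=4):
    """Greatly simplified denoising sketch: split the signal into frames
    and silence any frame whose peak amplitude is below the threshold.
    Only illustrates the filter/denoise/optimize step before playback."""
    out = []
    for start in range(0, len(samples), frame):
        chunk = samples[start:start + frame]
        peak = max(abs(s) for s in chunk)
        out.extend(chunk if peak >= threshold else [0.0] * len(chunk))
    return out

# First frame is low-level background noise, second frame is the signal.
signal = [0.01, -0.02, 0.01, 0.0, 0.5, -0.4, 0.6, -0.3]
cleaned = noise_gate(signal, threshold=0.1)
```

The gated output keeps the loud frames untouched and zeroes the quiet ones, which is the general shape of the "denoise then play" flow, not a production algorithm.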
In another embodiment, after step S4 of distributing the audio information into the layers of the sound map according to its source distance, the method includes:
Step S6: if audio information of multiple audio categories is distributed in one layer, displaying the audio information corresponding to each of the multiple audio categories separately. This makes it easy to distinguish audio information with the same source distance: when it is distributed in the same layer, the audio information corresponding to the multiple audio categories is separated for display, for example by using different line colors.
Specifically, in one embodiment, step S2 of determining the audio category of each piece of audio information includes:
comparing the audio information with feature audio pre-stored in a database to determine its audio category.
In this embodiment, feature audio of all categories, such as voice, traffic sound, construction-site sound, music and natural sound, is pre-stored in the database of the system. When the audio category of the audio information to be analyzed needs to be determined, it only needs to be compared with the feature audio pre-stored in the database to judge whether they are similar; if the similarity reaches a preset value, the audio information is judged to belong to the audio category corresponding to that feature audio.
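A minimal sketch of this comparison, assuming the feature audio is represented as a frequency-magnitude vector (the patent specifies neither the representation nor the similarity measure; cosine similarity and the 0.9 preset value are assumptions):

```python
def spectrum_similarity(a, b):
    """Cosine similarity between two frequency-magnitude vectors, used
    here as a stand-in for comparing a clip against stored feature audio."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical pre-stored feature audio, one vector per category.
FEATURE_DB = {
    "voice":   [0.1, 0.8, 0.6, 0.2],
    "traffic": [0.9, 0.3, 0.1, 0.7],
}

def classify(spectrum, preset=0.9):
    """Return the category whose feature audio is most similar, or None
    when no similarity reaches the preset value."""
    best_cat, best_sim = None, 0.0
    for cat, feat in FEATURE_DB.items():
        sim = spectrum_similarity(spectrum, feat)
        if sim > best_sim:
            best_cat, best_sim = cat, sim
    return best_cat if best_sim >= preset else None

label = classify([0.1, 0.8, 0.6, 0.2])
```

Returning None when nothing reaches the preset mirrors the text: a category is assigned only when the similarity reaches the preset value.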
Referring to Fig. 3, in another embodiment, after step S1 of obtaining the audio information to be analyzed, the method further includes:
Step S1a: obtaining the source location and time information of the audio information;
Step S1b: selecting first audio categories according to the source location and time information.
Step S2 of determining the audio category of each piece of audio information then includes:
Step S2a: comparing the frequency of the audio information with the frequency of the feature audio corresponding to the first audio categories pre-stored in the database, and calculating the similarity between the audio information and the feature audio;
Step S2b: when the similarity reaches a preset value, determining the audio category of the audio information.
If the audio information were compared one by one with every feature audio pre-stored in the database, the amount of computation would be large. In this embodiment, in order to reduce computation, the geographic information (source location) and time information at the moment the audio was captured are first obtained from the audio information to be analyzed; according to the source location and time information, the main audio categories appearing at that place in that period (i.e. the first audio categories) are selected. For example, when the location is an urban traffic road, the audio information should mainly be traffic sound, followed by voice, so traffic sound can be taken as a first audio category. Then the frequency of the audio information is compared with the frequency of the feature audio corresponding to the first audio categories pre-stored in the database, the similarity between the audio information and the feature audio is obtained, and when the similarity reaches the preset value, the audio category of the audio information is determined. In this way, the amount of computation can be greatly reduced and the analysis speed improved.
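A sketch of this pre-selection, under the assumption that place/time priors are stored as a simple lookup table (the keys and category lists are invented for illustration):

```python
# Hypothetical priors: the main categories expected at a place and time of day.
CATEGORY_PRIORS = {
    ("urban_road", "day"):   ["traffic", "voice"],
    ("urban_road", "night"): ["traffic"],
    ("park",       "day"):   ["voice", "natural", "music"],
}

ALL_CATEGORIES = ["voice", "traffic", "construction", "music", "natural"]

def first_audio_categories(place, time_of_day):
    """Select the 'first audio categories' to compare against first,
    falling back to all categories when the place/time is unknown."""
    return CATEGORY_PRIORS.get((place, time_of_day), ALL_CATEGORIES)

candidates = first_audio_categories("urban_road", "day")
```

Only the feature audio of the returned candidates needs to be compared first, which is where the computation saving described above comes from; the fallback preserves correctness for unmapped locations.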
In summary, with the sound analysis method provided in the embodiments of the present invention, audio information to be analyzed is obtained, the audio information containing audio of multiple different audio categories; the audio category of each piece of audio information is determined; feature information of the audio information, including source distance and decibel value, is obtained according to the audio category; and the audio information is distributed into the layers of a sound map according to its source distance, the sound map being layered by distance. Sounds from different sources are displayed in distance layers on the sound map, giving a display that is both intuitive and rich in content.
Referring to Fig. 4, one embodiment of the present invention further provides a sound analysis device, applied to an intelligent terminal, including:
a first acquisition unit 10 for obtaining audio information to be analyzed, the audio information to be analyzed containing audio of multiple different audio categories;
a determination unit 20 for determining the audio category of the audio information;
a second acquisition unit 30 for obtaining feature information of the audio information according to the audio category, the feature information including source distance and decibel value;
a distribution unit 40 for distributing the audio information into the layers of a sound map according to its source distance, wherein the sound map is layered by distance.
A conventional sound map usually shows only the decibel values of various sounds, so the information displayed is limited. In this embodiment, a segment of audio information containing audio of multiple audio categories is obtained. An audio category in this embodiment is delimited according to the sound source of the audio: for example, the sound made by a moving vehicle, the wind, birdsong and the speech of pedestrians are all different audio categories. The audio information to be analyzed is analyzed, and the determination unit 20 determines the audio category of each piece of audio information it contains.
It is understood that some audio, such as the sound of wind or thunder, may have no definite source distance; in this embodiment the source distance of such audio is set to infinity. The source distance obtained thus differs with the audio category. In this embodiment, the second acquisition unit 30 determines the source distance according to different rules depending on the audio category. Source distance is one item of feature information of the audio information; the feature information may also include the audio category, decibel value, loudness, clarity, etc. Specifically, the source distance of the audio information can be determined by sound source localization.
Finally, in this embodiment the sound map is provided with multiple layers by source distance, each layer representing audio information of a different source distance. The distribution unit 40 distributes the audio information whose source distance has been obtained into the sound map according to the corresponding source distance; meanwhile, feature information such as the audio category and decibel value corresponding to the audio information is also included in the sound map, so that the relevant information of the audio can be viewed intuitively.
The above sound analysis device may also include:
a processing unit for processing the audio information in a predetermined manner according to its audio category. In this embodiment, different processing routines are used for audio information of different audio categories; the processing routines mainly include sound source localization, source denoising and audio optimization.
For example, in the present embodiment, being preset with the mode handled the audio-frequency information of a variety of different audio categories:
(1) voice:Judge voice number of sources, carries out sound bearing and Distance Judgment respectively, and by each voice clear Presentation is optimized in terms of degree, sound intensity.
(2) traffic sound:Traffic sound includes mainly land, sea and sky three classes;It is each including automobile, train, ship, aircraft etc. Class traffic noise and wheel and ground/rail grating, engine sound, exhaust sound, whistle sound etc..Judge for traffic sound Traffic sound type in audio-frequency information, and presented after optimizing.
(3) building site sound:Include the running noises of the construction equipments such as piling machine, bull-dozer.Mainly judge sound for building site sound Source distance and audio categories.
(4) musical sound:The source of musical sound is usually that businessman or personal broadcasting loudspeaker generate under conventional environment.For music Sound mainly judges source of sound distance and audio categories.
(5) natural phonation:Including wind and rain sound caused by weather, thunder and lightning sound, flow sound etc..When the sound for judging a certain characteristic frequency When sound is natural phonation, without judging sound source distance, it is directly displayed as the sound of unlimited distance.
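The five rules above amount to a small per-category dispatch; the sketch below follows their behaviour, but the category names and returned attributes are illustrative assumptions:

```python
def process(category, distance_m):
    """Return display attributes for one piece of audio information."""
    if category == "natural":
        # Rule (5): natural sounds skip the distance judgment and
        # are displayed as infinitely distant.
        return {"category": category, "distance_m": float("inf")}
    if category in ("construction", "music"):
        # Rules (3) and (4): only source distance and category are judged.
        return {"category": category, "distance_m": distance_m}
    # Rules (1) and (2): voice and traffic additionally receive bearing/type
    # judgment and presentation optimisation.
    return {"category": category, "distance_m": distance_m, "optimized": True}
```

The key design point is that the expensive steps (localization, optimisation) are only applied to the categories that need them.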
The processing of the audio information may take place either before or after the audio information is distributed into the layers of the sound map. If processing occurs before distribution, then when a preset trigger signal is received in a layer, the audio information in that layer is played directly. If processing occurs after distribution, then when a preset trigger signal is received in a layer before playback, sound-source denoising and audio optimization are applied to the audio information in that layer, and the processed audio information is then played.
Specifically, referring to Fig. 5, in one embodiment the device further includes:
Broadcast unit 50, for, when a preset trigger signal is received in a layer, applying sound-source denoising and audio optimization to the audio information in the layer, and playing the processed audio information.
After the audio information is distributed in layers on the sound map, the user operates the intelligent terminal to trigger the preset trigger signal. When the intelligent terminal receives the trigger signal through the region occupied by a layer, the audio information distributed in that layer is played automatically; the preset trigger signal may be configured as a single-click trigger, a double-click trigger, or the like. Preferably, before the audio information is played, it is filtered, denoised and audio-optimized.
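A minimal sketch of this tap-to-play flow is given below. The denoising and optimisation steps are simple stand-ins (amplitude gating and peak normalisation), since the patent does not fix concrete signal-processing algorithms:

```python
def denoise(samples):
    """Stand-in denoiser: drop low-amplitude samples treated as noise."""
    return [s for s in samples if abs(s) > 0.05]

def optimize(samples):
    """Stand-in optimiser: normalise the clip to unit peak amplitude."""
    peak = max((abs(s) for s in samples), default=0.0)
    return [s / peak for s in samples] if peak else samples

def on_layer_trigger(layer_samples, play):
    """On a single or double click in a layer's region, process then play."""
    play(optimize(denoise(layer_samples)))
```

`play` here is any playback callback supplied by the terminal; in a real device it would hand the processed samples to the audio output.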
In another embodiment, the above device further includes:
Separation unit, for, when audio information of multiple audio categories is distributed in one layer, displaying the audio information corresponding to the multiple audio categories separately. This makes it easy to distinguish audio information at the same source distance: when such information is distributed in the same layer, the audio information corresponding to each of the multiple audio categories is displayed separately, for example by using different line colors.
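One way to realise the separated display is to assign each category in a layer its own line color; the palette below is an illustrative assumption, as the patent only says "different line colors":

```python
from itertools import cycle

def color_by_category(categories):
    """Assign a distinct line color to each audio category found in a layer."""
    palette = cycle(["red", "green", "blue", "orange"])
    colors = {}
    for c in categories:
        if c not in colors:
            colors[c] = next(palette)
    return colors
```

Cycling the palette guarantees every category gets a color even if the layer holds more categories than the palette has entries.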
Specifically, the determination unit 20 is specifically configured to:
compare the audio information with the feature audio pre-stored in a database, to determine the audio category of the audio information.
In this embodiment, feature audio of various kinds, such as voice, traffic sound, construction-site sound, music and natural sound, is pre-stored in the system database. When the audio category of the audio information to be analyzed needs to be determined, the information only needs to be compared with the feature audio pre-stored in the database to judge whether they are similar; if the similarity reaches a preset value, the audio information is judged to belong to the audio category corresponding to that feature audio.
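As a sketch of this comparison, the snippet below matches an unknown clip against pre-stored feature vectors. Cosine similarity over a fixed-length feature vector is an assumed stand-in for the patent's unspecified similarity measure, and the database contents and threshold are illustrative:

```python
import math

# Illustrative pre-stored feature audio (as feature vectors) and preset value.
FEATURE_DB = {
    "voice":   [0.9, 0.3, 0.1, 0.0],
    "traffic": [0.1, 0.2, 0.8, 0.6],
}
SIMILARITY_THRESHOLD = 0.9  # the "preset value"

def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def classify(features, db=FEATURE_DB, threshold=SIMILARITY_THRESHOLD):
    """Return the category of the best-matching feature audio, or None."""
    best_cat, best_sim = None, 0.0
    for cat, ref in db.items():
        sim = cosine(features, ref)
        if sim > best_sim:
            best_cat, best_sim = cat, sim
    return best_cat if best_sim >= threshold else None
```

Returning None when no match reaches the preset value mirrors the judgment step: a category is only assigned once the similarity threshold is met.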
Further, the above sound analysis device further includes:
Third acquiring unit, for acquiring the source place and time information of the audio information;
Selecting unit, for selecting a first audio category according to the source place and time information;
Referring to Fig. 6, the determination unit 20 then includes:
Comparison subunit 201, for comparing the frequency of the audio information with the frequency of the feature audio corresponding to the first audio category pre-stored in the database, and calculating the similarity between the audio information and the feature audio;
Determination subunit 202, for determining the audio category of the audio information when the similarity reaches the preset value.
If the audio information were compared one by one with all the feature audio pre-stored in the database, the amount of computation would be large. In this embodiment, to reduce the computation, the third acquiring unit first obtains geographic information (the source place) and time information from the audio information to be analyzed when the audio is acquired; the selecting unit then selects, according to the source place and time information, the dominant audio category occurring at that place and time (i.e. the first audio category). For example, when the place is an urban traffic road, the audio information should mainly be traffic sound, followed by voice, so traffic sound can be taken as the first audio category. The comparison subunit 201 then compares the frequency of the audio information with the frequency of the feature audio corresponding to the first audio category pre-stored in the database, and obtains the similarity between the audio information and the feature audio; when the similarity reaches the preset value, the determination subunit 202 determines the audio category of the audio information. In this way, the amount of computation can be greatly reduced and the analysis speed improved.
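The place/time pre-selection can be sketched as a simple lookup table that narrows the candidate set before the similarity comparison; the priority table below is an illustrative assumption (only the urban-road example comes from the text):

```python
# Dominant categories per (place, time); illustrative except for urban_road.
CONTEXT_PRIORS = {
    ("urban_road", "day"):   ["traffic", "voice"],
    ("urban_road", "night"): ["traffic"],
    ("park", "day"):         ["voice", "natural", "music"],
}

ALL_CATEGORIES = ["voice", "traffic", "construction", "music", "natural"]

def candidate_categories(place, time_of_day):
    """First audio categories to try; fall back to all when context is unknown."""
    return CONTEXT_PRIORS.get((place, time_of_day), ALL_CATEGORIES)
```

Comparing against only the first (dominant) categories, and falling back to the full database on a miss, is what yields the computation saving described above.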
In conclusion for the sound analysis method and device that are provided in the embodiment of the present invention, audio-frequency information to be analyzed is obtained, It include the audio-frequency information of multiple and different audio categories in the audio-frequency information to be analyzed;Determine the sound of each audio-frequency information Frequency classification;According to the audio categories, the characteristic information of the audio-frequency information is obtained;The characteristic information include source of sound distance, Decibel value;According to the source of sound distance of the audio-frequency information, the audio-frequency information is distributed in the layering of sound map, wherein institute Sound map is stated to be layered according to distance;Sound in sound map according to source of sound apart from Layering manifestation separate sources, directly Display is seen, and is shown abundant in content.
Those skilled in the art of the present technique be appreciated that can with computer program instructions come realize these structure charts and/or The combination of each frame and these structure charts and/or the frame in block diagram and/or flow graph in block diagram and/or flow graph.This technology is led Field technique personnel be appreciated that these computer program instructions can be supplied to all-purpose computer, special purpose computer or other The processor of programmable data processing method is realized, to pass through the processing of computer or other programmable data processing methods Device come execute structure chart and/or block diagram and/or flow graph disclosed by the invention frame or multiple frames in specify scheme.
Those skilled in the art of the present technique are appreciated that in the various operations crossed by discussion in the present invention, method, flow Steps, measures, and schemes can be replaced, changed, combined or be deleted.Further, each with having been crossed by discussion in the present invention Other steps, measures, and schemes in kind operation, method, flow may also be alternated, changed, rearranged, decomposed, combined or deleted. Further, in the prior art to have and step, measure, the scheme in various operations, method, flow disclosed in the present invention It may also be alternated, changed, rearranged, decomposed, combined or deleted.
The foregoing is merely the preferred embodiment of the present invention, are not intended to limit the scope of the invention, every utilization Equivalent structure or equivalent flow shift made by description of the invention and accompanying drawing content is applied directly or indirectly in other correlations Technical field, be included within the scope of the present invention.

Claims (10)

1. A sound analysis method, characterized by comprising the following steps:
acquiring audio information to be analyzed, the audio information to be analyzed comprising audio information of multiple different audio categories;
determining the audio category of each piece of audio information;
acquiring characteristic information of the audio information according to the audio category, the characteristic information comprising source distance and decibel value; and
distributing the audio information into layers of a sound map according to the source distance of the audio information, wherein the sound map is layered by distance.
2. The sound analysis method according to claim 1, characterized in that, after the step of distributing the audio information into the layers of the sound map according to the source distance of the audio information, the method comprises:
when a preset trigger signal is received in a layer, applying sound-source denoising and audio optimization to the audio information in the layer, and playing the processed audio information.
3. The sound analysis method according to claim 1, characterized in that, after the step of distributing the audio information into the layers of the sound map according to the source distance of the audio information, the method comprises:
when audio information of multiple audio categories is distributed in a layer, displaying the audio information corresponding to the multiple audio categories separately.
4. The sound analysis method according to claim 1, characterized in that the step of determining the audio category of each piece of audio information comprises:
comparing the audio information with feature audio pre-stored in a database, to determine the audio category of the audio information.
5. The sound analysis method according to claim 1, characterized in that, after the step of acquiring the audio information to be analyzed, the method further comprises:
acquiring the source place and time information of the audio information; and
selecting a first audio category according to the source place and time information;
wherein the step of determining the audio category of each piece of audio information then comprises:
comparing the frequency of the audio information with the frequency of the feature audio corresponding to the first audio category pre-stored in the database, and calculating the similarity between the audio information and the feature audio; and
determining the audio category of the audio information when the similarity reaches a preset value.
6. A sound analysis device, characterized by comprising:
a first acquiring unit, for acquiring audio information to be analyzed, the audio information to be analyzed comprising audio information of multiple different audio categories;
a determination unit, for determining the audio category of the audio information;
a second acquiring unit, for acquiring characteristic information of the audio information according to the audio category, the characteristic information comprising source distance and decibel value; and
a distribution unit, for distributing the audio information into layers of a sound map according to the source distance of the audio information, wherein the sound map is layered by distance.
7. The sound analysis device according to claim 6, characterized by further comprising:
a broadcast unit, for, when a preset trigger signal is received in a layer, applying sound-source denoising and audio optimization to the audio information in the layer, and playing the processed audio information.
8. The sound analysis device according to claim 6, characterized by further comprising:
a separation unit, for, when audio information of multiple audio categories is distributed in a layer, displaying the audio information corresponding to the multiple audio categories separately.
9. The sound analysis device according to claim 6, characterized in that the determination unit is specifically configured to:
compare the audio information with feature audio pre-stored in a database, to determine the audio category of the audio information.
10. The sound analysis device according to claim 6, characterized by further comprising:
a third acquiring unit, for acquiring the source place and time information of the audio information; and
a selecting unit, for selecting a first audio category according to the source place and time information;
wherein the determination unit then comprises:
a comparison subunit, for comparing the frequency of the audio information with the frequency of the feature audio corresponding to the first audio category pre-stored in the database, and calculating the similarity between the audio information and the feature audio; and
a determination subunit, for determining the audio category of the audio information when the similarity reaches the preset value.
CN201810096118.7A 2018-01-31 2018-01-31 Sound analysis method and device Active CN108389584B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201810096118.7A CN108389584B (en) 2018-01-31 2018-01-31 Sound analysis method and device
PCT/CN2018/091108 WO2019148737A1 (en) 2018-01-31 2018-06-13 Sound analysis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810096118.7A CN108389584B (en) 2018-01-31 2018-01-31 Sound analysis method and device

Publications (2)

Publication Number Publication Date
CN108389584A true CN108389584A (en) 2018-08-10
CN108389584B CN108389584B (en) 2021-03-19

Family

ID=63074916

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810096118.7A Active CN108389584B (en) 2018-01-31 2018-01-31 Sound analysis method and device

Country Status (2)

Country Link
CN (1) CN108389584B (en)
WO (1) WO2019148737A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108986056A (en) * 2018-08-24 2018-12-11 潘小亮 Content requirements judge system
CN109741609A (en) * 2019-02-25 2019-05-10 南京理工大学 A kind of motor vehicle whistle sound monitoring method based on microphone array
CN113496709A (en) * 2020-04-07 2021-10-12 上海擎感智能科技有限公司 In-vehicle sound effect remote online evaluation method and system, storage medium and server

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1033557A3 (en) * 1999-03-04 2001-08-22 Sony Corporation Navigation apparatus
JP2003140546A (en) * 2001-11-07 2003-05-16 Chiri Geographic Information Service Co Ltd Map recognition voice device for visually impaired person
CN102404667A (en) * 2010-07-28 2012-04-04 株式会社泛泰 Apparatus and method for merging acoustic object information
CN102435198A (en) * 2010-09-28 2012-05-02 索尼公司 Position information providing device, position information providing method, position information providing system, and program
CN103946733A (en) * 2011-11-14 2014-07-23 谷歌公司 Displaying sound indications on a wearable computing system
CN106251878A (en) * 2016-08-26 2016-12-21 彭胜 Meeting affairs voice recording device
CN106601260A (en) * 2016-11-30 2017-04-26 中山大学 Method for representing virtual sound of traffic noise map
WO2017155968A1 (en) * 2016-03-07 2017-09-14 3M Innovative Properties Company Intelligent safety monitoring and analytics system for personal protective equipment

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1195972C (en) * 2002-01-09 2005-04-06 英华达(上海)电子有限公司 Method and devices for monitoring environment noise by using mobile phones
US20130094656A1 (en) * 2011-10-16 2013-04-18 Hei Tao Fung Intelligent Audio Volume Control for Robot
CN107231476A (en) * 2017-05-31 2017-10-03 深圳市邦华电子有限公司 Mobile terminal and its scene mode setting method, device
CN107592129B (en) * 2017-09-26 2019-10-18 广东小天才科技有限公司 Early warning method and device for wearable equipment

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ROBERT ALBRECHT: ""Auditory Distance Presentation in an Urban Augmented Reality Environment"", 《ACM TRANSACTIONS ON APPLIED PERCEPTION》 *
哈哈呵呵好的: "《https://post.smzdm.com/p/632616/》", 1 December 2017 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108986056A (en) * 2018-08-24 2018-12-11 潘小亮 Content requirements judge system
CN109741609A (en) * 2019-02-25 2019-05-10 南京理工大学 A kind of motor vehicle whistle sound monitoring method based on microphone array
CN109741609B (en) * 2019-02-25 2021-05-04 南京理工大学 Motor vehicle whistling monitoring method based on microphone array
CN113496709A (en) * 2020-04-07 2021-10-12 上海擎感智能科技有限公司 In-vehicle sound effect remote online evaluation method and system, storage medium and server

Also Published As

Publication number Publication date
CN108389584B (en) 2021-03-19
WO2019148737A1 (en) 2019-08-08


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant