CN113707159A - Electric network bird-involved fault bird species identification method based on Mel language graph and deep learning - Google Patents

Electric network bird-involved fault bird species identification method based on Mel language graph and deep learning Download PDF

Info

Publication number
CN113707159A
CN113707159A CN202110878327.9A CN202110878327A CN113707159A CN 113707159 A CN113707159 A CN 113707159A CN 202110878327 A CN202110878327 A CN 202110878327A CN 113707159 A CN113707159 A CN 113707159A
Authority
CN
China
Prior art keywords
bird
mel
network
training
singing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110878327.9A
Other languages
Chinese (zh)
Other versions
CN113707159B (en
Inventor
邱志斌
卢祖文
廖才波
王海祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanchang University
Original Assignee
Nanchang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanchang University filed Critical Nanchang University
Priority to CN202110878327.9A priority Critical patent/CN113707159B/en
Publication of CN113707159A publication Critical patent/CN113707159A/en
Application granted granted Critical
Publication of CN113707159B publication Critical patent/CN113707159B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/26Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/20Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/45Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/02Preprocessing
    • G06F2218/04Denoising
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08Feature extraction
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

The invention discloses a method for identifying bird species with bird-involved faults in a power grid based on Mel language maps and deep learning. Firstly, establishing a singing sample database of bird species related to bird-related faults of a power grid, preprocessing the singing signals, calculating the energy of each frame of signal in each Mel filter to obtain an M multiplied by N matrix containing signal energy information, and mapping the energy and the color depth one by one to obtain a Mel language map of the singing signals. The convolutional neural network is trained through the Mel tone map, the Mel tone map characteristic of the bird singing signal is continuously captured and learned in the convolution-pooling process, the internal parameters of the network are adjusted through repeated iterative training, the training is finished when the loss between the predicted output value and the actual value of the network is minimum, and finally the prediction and identification of the tested bird species are realized. The method can effectively distinguish the characteristics of different bird song sounds, realize bird identification and provide reference for carrying out bird-related fault differential control on the power grid.

Description

Electric network bird-involved fault bird species identification method based on Mel language graph and deep learning
Technical Field
The invention relates to the field of power transmission lines, in particular to a method for identifying bird species with bird-involved faults in a power grid based on Mel language maps and deep learning.
Background
Bird activity is one of the important causes of overhead transmission line faults, although various bird prevention devices are widely applied, the bird prevention devices still have great blindness, the rising trend of bird-related faults cannot be effectively inhibited, and line tripping faults caused by the failure of the bird prevention devices also occur frequently. In addition, because bird-involved faults are transient, operation and maintenance personnel often have difficulty in judging the bird species causing the faults after the faults occur, an intelligent bird species identification and fault cause judgment method is lacked, and bird-involved fault prevention measures are difficult to take pertinently. Therefore, it is necessary to develop intelligent bird species identification research related to bird-involved faults of overhead transmission lines, and provide basis for line operation and maintenance personnel to correctly identify birds.
At present, the traditional method for identifying the twitter is to extract the characteristics of Linear Prediction Cepstrum Coefficient (LPCC), Mel cepstrum coefficient (MFCC), power spectral density and the like of a sound signal and carry out classification prediction by combining classification algorithms such as Random Forest (RF), Support Vector Machine (SVM), Hidden Markov Model (HMM), Gaussian Mixture Model (GMM) and the like, and the traditional methods are difficult in characteristic extraction and low in identification accuracy.
Disclosure of Invention
Aiming at the problems in the prior art, the invention aims to provide a method for identifying bird-involved fault bird species in a power grid based on Mel language graph and deep learning, which identifies the bird species according to a singing signal and provides a basis for developing targeted and differential bird prevention for a power transmission line.
In order to achieve the purpose, the invention adopts the following technical scheme, which comprises the following steps:
s1: establishing a related bird seed singing database according to the counted main bird seeds with bird-involved faults of the power grid and the actual situation of the power grid;
s2: carrying out denoising, framing and windowing preprocessing on samples in a singing database, removing noise in the singing signal by adopting an improved spectral subtraction method of multi-window spectral estimation, framing the singing signal by setting the frame length and the frame shift, and multiplying the frame length and the frame shift by a window function to increase the continuity of two ends of the frame;
s3: calculating the energy of each frame of the birdsong signal in each Mel filter to obtain the Mel energy of the birdsong sample, obtaining an MxN order matrix containing signal energy information, mapping the energy and the color depth one by one to obtain a Mel language map of the birdsong signal, and dividing the Mel language map into a training set, a verification set and a test set;
s4: building a convolutional neural network classification model, performing repeated iterative training by taking a Mel (Mel) morphogram of a training set as input, testing a verification set in the training process to adjust parameters of the model, and finishing the training when the loss between a predicted output value and an actual value of the network reaches the minimum;
s5: and predicting and identifying the bird species in the test set by using the trained network, and outputting the corresponding bird species category.
Further, the calculation process of the Mel language graph in S3 is as follows: for a section of M-frame birdsong signals, N Mel filters are arranged, an MxN matrix is obtained through Mel energy calculation, a Mel language graph is obtained through coloring according to the energy, horizontal and vertical coordinates in the Mel language graph are the number of frames and the number of the filters respectively, only the MxN data size needs to be calculated, output is simplified, and meanwhile calculation time is shortened.
Further, the convolutional neural network in S4 includes a plurality of convolution-pooling processes for capturing Mel-language graph features, the training set is trained by adjusting network parameters and network iteration times, the model performs prediction on the validation set once and adjusts parameters accordingly according to the prediction result of the validation set every time the model is trained for a certain number of rounds, and the model is corrected in the direction of high prediction accuracy until the loss function value of the network is minimized, and the network training is completed.
The invention has the beneficial effects that:
the method for identifying bird-involved fault bird species of the power grid based on the Mel language graph and the deep learning overcomes the defects of redundant characteristics, large data volume and insufficient discrimination of the traditional voice feature extraction technology, and further promotes accurate bird identification, thereby providing guidance for differential bird prevention and improving the accuracy and effectiveness of bird-involved fault prevention and control of the power transmission line and the transformer substation.
Drawings
FIG. 1 is a flow chart of an implementation of a method for identifying bird species with bird-involved faults in a power grid based on Mel language maps and deep learning;
FIG. 2 is a diagram illustrating a denoising effect of a birdsong signal according to an embodiment of the present invention;
FIG. 3 is a partial birdsong signal waveform and Mel's chart in accordance with an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a convolutional neural network in an embodiment of the present invention.
Detailed Description
The present invention is further described in the following examples, which should not be construed as limiting the scope of the invention, but rather as providing the following examples which are set forth to illustrate and not limit the scope of the invention.
With the rapid development of deep learning, emerging speech recognition methods tend to convert an acoustic signal into a spectrogram such as a chirp spectrogram or a fourier spectrogram as a feature input of a deep learning model. The invention adopts a method of converting a bird song sound signal into a Mel language graph and then combining a convolutional neural network for classification and identification to predict and classify bird species related to bird-related faults of a power transmission line.
The following processes of singing signal processing, Mel language graph calculation and convolutional neural network training of typical bird species related to bird-related faults of the power transmission line are explained in detail, as shown in fig. 1, and the method comprises the following steps:
s1: and establishing a related bird seed singing database according to the statistical main bird species with bird-involved faults of the power grid and the actual condition of the power grid.
In the embodiment, according to the main bird species of the bird-related faults of the power transmission line counted by operation and maintenance personnel of a certain power saving network and by combining the actual situation of the power network, 40 typical birds causing four failure types, namely bird nest type, bird dung type, bird body short-circuit type and bird pecking type, are selected as research objects, and comprise Wu \40491, phoenix-headed wheat chicken, magpie, four-tone rhododendron, night aigren, doodle, swan goose, azalea, bird swallow, dobby, cliff, dycep, spotted dog, ordinary gull, crow, pool aigre, grey-head green pecker, grey-tail 26891bird, grey goose, gray crane, giraffe, white-head spotted dove, white-head bulbul, white aigren, crow, red-mouth gull, red-tail burrower, red-horn, red falcon, belly small owl, wanny duck, cocky bird, cocket, eagle, red-eagle gull, silver carving, chincey, bone-top chicken, quail and a sample database is established and a sample database is disclosed by the birds is collected and public.
S2: the method comprises the steps of carrying out preprocessing such as denoising, framing and windowing on samples in a singing database, removing noise in a singing signal by adopting an improved spectral subtraction method of multi-window spectral estimation, framing the singing signal by setting the frame length and the frame shift, and multiplying the frame length and the frame shift by a window function to increase the continuity of two ends of the frame.
In the embodiment, preprocessing operations such as format unification, denoising, framing, windowing and the like are carried out on all the bird song audio signals, the sampling frequency of all audios is set to 16000Hz by utilizing GoldWave and Sox software, a sound channel is set to be a single sound channel, the length of the audios is uniformly cut to be 1 second, and the audios are stored in a wav format; framing the audio, respectively setting the frame length and the frame shift to be 0.025 second and 0.01 second, and dividing each audio sample into 98 frames; then selecting a Hamming window to carry out windowing operation so as to increase the continuity of two ends of the frame; the speech is denoised by adopting an improved spectral subtraction method of multi-window spectral estimation, and the denoising effect is shown in fig. 2, (a) is the rhododendron speech containing noise, and (b) is the denoised rhododendron speech.
S3: the method comprises the steps of calculating the energy of each frame of the birdsong signal in each Mel filter, calculating the Mel energy of the birdsong sample to obtain an MxN-order matrix containing signal energy information, mapping the energy and the color depth one by one to obtain a Mel spectrogram of the birdsong signal, and dividing the Mel spectrogram into a training set, a verification set and a test set.
The Mel language map is an image representation of the singing signal, and the Mel language maps formed by different bird species have differences. In this embodiment, the bird song signal is divided into 98 frames, 40 Mel filters are provided, a 98 × 40 data matrix is obtained through Mel energy calculation, a Mel language graph of the bird song signal can be obtained through coloring according to the energy, horizontal and vertical coordinates in the Mel language graph are the number of frames and the number of the filters, only 98 × 40 data volumes need to be calculated, output is simplified, and calculation time is shortened. Fig. 3 shows a part of bird song signal waveforms and their corresponding Mel language maps, (a), (b), (c) are the voice waveforms of azalea, red owl and red-bill gull, respectively, (d), (e) and (f) are the Mel language maps of azalea, red-horn owl and red-bill gull, respectively, which describe a section of bird song signal in terms of frame number combined with the number of Mel filters, and can distinguish the song of different bird species.
In this embodiment, the acquired Mel-language graph is divided into a training set, a verification set and a test set according to a ratio of 8: 1.
S4: and (3) building a convolutional neural network classification model, performing repeated iterative training by taking the Mel tone map of the training set as input, testing the verification set in the training process to adjust the parameters of the model, and finishing the training when the loss between the predicted output value and the actual value of the network is minimum.
In this embodiment, a 24-layer convolutional neural network model is constructed, as shown in fig. 4, training is performed with a training set as an input, and the convolutional neural network includes a plurality of convolutions-pooling for capturing Mel-language graph featuresThe process is that an initial learning rate of 0.01 is set to train a training set, the learning rate is reduced to 1/10 after 30 rounds of training, and when a model is trained for a certain round, the model carries out prediction on a verification set once and correspondingly adjusts parameters in a network according to the prediction result of the verification set, and corrects the parameters in the direction with high prediction accuracy. The training of the convolutional neural network is essentially a process of minimizing a loss function, and the aim of learning the optimal class of image feature matching is achieved by continuously iteratively optimizing to seek the minimum loss between the predicted output value and the actual value of the network. The loss function used by the convolutional neural network in this embodiment is a cross entropy function, and the expression is:
Figure BDA0003191031410000041
m is the total number of samples, k is the number of classes of samples, 1{ y }iJ is an indicative function, the output is 1 when the value in parenthesis is true, otherwise 0,
Figure BDA0003191031410000042
indicating the probability that the ith sample is predicted as the jth class. And when the loss function value of the network is reduced to the minimum, the network training is finished.
S5: and predicting and identifying the bird species in the test set by using the trained network, and outputting the corresponding bird species category.
In this embodiment, since the training samples and the testing samples are randomly selected, in order to avoid the contingency of the classification result, classification tests under 3 groups of different training sample sets are performed, and the average prediction accuracy is 96.7%. Therefore, the Mel spectrogram of the singing signal is used as the characteristic quantity, the voice is denoised by using the improved spectral subtraction of multi-window spectral estimation, and the deep learning model is established by using the convolutional neural network, so that the related bird species threatening the safe operation of the power transmission line can be accurately identified, and guidance is provided for differential bird prevention.
It should be understood that parts of the specification not set forth in detail are well within the prior art.
Although specific embodiments of the present invention have been described above with reference to the accompanying drawings, it will be appreciated by those skilled in the art that these are merely illustrative and that various changes or modifications may be made to these embodiments without departing from the principles and spirit of the invention. The scope of the invention is only limited by the appended claims.

Claims (3)

1. A method for identifying bird species with bird-involved faults in a power grid based on Mel language maps and deep learning is characterized by comprising the following steps: the method comprises the following steps:
s1: establishing a related bird seed singing database according to the counted main bird seeds with bird-involved faults of the power grid and the actual situation of the power grid;
s2: carrying out denoising, framing and windowing preprocessing on samples in a singing database, removing noise in the singing signal by adopting an improved spectral subtraction method of multi-window spectral estimation, framing the singing signal by setting the frame length and the frame shift, and multiplying the frame length and the frame shift by a window function to increase the continuity of two ends of the frame;
s3: calculating the energy of each frame of the birdsong signal in each Mel filter to obtain the Mel energy of the birdsong sample, obtaining an MxN order matrix containing signal energy information, mapping the energy and the color depth one by one to obtain a Mel language map of the birdsong signal, and dividing the Mel language map into a training set, a verification set and a test set;
s4: building a convolutional neural network classification model, performing repeated iterative training by taking a Mel (Mel) morphogram of a training set as input, testing a verification set in the training process to adjust parameters of the model, and finishing the training when the loss between a predicted output value and an actual value of the network reaches the minimum;
s5: and predicting and identifying the bird species in the test set by using the trained network, and outputting the corresponding bird species category.
2. The method for identifying the bird species with bird-involved fault in the power grid based on the Mel language map and the deep learning as claimed in claim 1, wherein: the calculation process of the Mel language graph in S3 is as follows: for a section of M-frame birdsong signals, N Mel filters are arranged, an MxN matrix is obtained through Mel energy calculation, a Mel language graph is obtained through coloring according to the energy, horizontal and vertical coordinates in the Mel language graph are the number of frames and the number of the filters respectively, only the MxN data size needs to be calculated, output is simplified, and meanwhile calculation time is shortened.
3. The method for identifying the bird species with bird-involved fault in the power grid based on the Mel language map and the deep learning as claimed in claim 1, wherein: and S4, the convolutional neural network comprises a plurality of convolution-pooling processes for capturing Mel language map features, network parameters and network iteration times are adjusted to train a training set, when the model is trained for a certain turn, the model carries out prediction on a verification set once and correspondingly adjusts the parameters according to the prediction result of the verification set, and the parameters are corrected in the direction with high prediction accuracy until the loss function value of the network is reduced to the minimum, and the network training is finished.
CN202110878327.9A 2021-08-02 2021-08-02 Power grid bird-involved fault bird species identification method based on Mel language graph and deep learning Active CN113707159B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110878327.9A CN113707159B (en) 2021-08-02 2021-08-02 Power grid bird-involved fault bird species identification method based on Mel language graph and deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110878327.9A CN113707159B (en) 2021-08-02 2021-08-02 Power grid bird-involved fault bird species identification method based on Mel language graph and deep learning

Publications (2)

Publication Number Publication Date
CN113707159A true CN113707159A (en) 2021-11-26
CN113707159B CN113707159B (en) 2024-05-03

Family

ID=78651104

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110878327.9A Active CN113707159B (en) 2021-08-02 2021-08-02 Power grid bird-involved fault bird species identification method based on Mel language graph and deep learning

Country Status (1)

Country Link
CN (1) CN113707159B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106093612A (en) * 2016-05-26 2016-11-09 国网江苏省电力公司电力科学研究院 A kind of method for diagnosing fault of power transformer
CN107393542A (en) * 2017-06-28 2017-11-24 北京林业大学 A kind of birds species identification method based on binary channels neutral net
CN108197591A (en) * 2018-01-22 2018-06-22 北京林业大学 A kind of birds individual discrimination method based on multiple features fusion transfer learning
CN109409308A (en) * 2018-11-05 2019-03-01 中国科学院声学研究所 A method of the birds species identification based on birdvocalization
CN109979441A (en) * 2019-04-03 2019-07-05 中国计量大学 A kind of birds recognition methods based on deep learning
CN110120224A (en) * 2019-05-10 2019-08-13 平安科技(深圳)有限公司 Construction method, device, computer equipment and the storage medium of bird sound identification model
CN111626093A (en) * 2020-03-27 2020-09-04 国网江西省电力有限公司电力科学研究院 Electric transmission line related bird species identification method based on sound power spectral density
WO2020177371A1 (en) * 2019-03-06 2020-09-10 哈尔滨工业大学(深圳) Environment adaptive neural network noise reduction method and system for digital hearing aids, and storage medium
CN112331220A (en) * 2020-11-17 2021-02-05 中国计量大学 Bird real-time identification method based on deep learning
WO2021051608A1 (en) * 2019-09-20 2021-03-25 平安科技(深圳)有限公司 Voiceprint recognition method and device employing deep learning, and apparatus

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106093612A (en) * 2016-05-26 2016-11-09 国网江苏省电力公司电力科学研究院 A kind of method for diagnosing fault of power transformer
CN107393542A (en) * 2017-06-28 2017-11-24 北京林业大学 A kind of birds species identification method based on binary channels neutral net
CN108197591A (en) * 2018-01-22 2018-06-22 北京林业大学 A kind of birds individual discrimination method based on multiple features fusion transfer learning
CN109409308A (en) * 2018-11-05 2019-03-01 中国科学院声学研究所 A method of the birds species identification based on birdvocalization
WO2020177371A1 (en) * 2019-03-06 2020-09-10 哈尔滨工业大学(深圳) Environment adaptive neural network noise reduction method and system for digital hearing aids, and storage medium
CN109979441A (en) * 2019-04-03 2019-07-05 中国计量大学 A kind of birds recognition methods based on deep learning
CN110120224A (en) * 2019-05-10 2019-08-13 平安科技(深圳)有限公司 Construction method, device, computer equipment and the storage medium of bird sound identification model
WO2021051608A1 (en) * 2019-09-20 2021-03-25 平安科技(深圳)有限公司 Voiceprint recognition method and device employing deep learning, and apparatus
CN111626093A (en) * 2020-03-27 2020-09-04 国网江西省电力有限公司电力科学研究院 Electric transmission line related bird species identification method based on sound power spectral density
CN112331220A (en) * 2020-11-17 2021-02-05 中国计量大学 Bird real-time identification method based on deep learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
谢将剑;李文彬;张军国;丁长青: "基于Chirplet语图特征和深度学习的鸟类物种识别方法", 北京林业大学学报, vol. 40, no. 003, pages 122 - 127 *

Also Published As

Publication number Publication date
CN113707159B (en) 2024-05-03

Similar Documents

Publication Publication Date Title
Jiang et al. Whistle detection and classification for whales based on convolutional neural networks
CN107610707B (en) A kind of method for recognizing sound-groove and device
CN110120224B (en) Method and device for constructing bird sound recognition model, computer equipment and storage medium
CN101710490B (en) Method and device for compensating noise for voice assessment
CN106611604B (en) Automatic voice superposition detection method based on deep neural network
Clemins et al. Automatic classification and speaker identification of African elephant (Loxodonta africana) vocalizations
CN109493874A (en) A kind of live pig cough sound recognition methods based on convolutional neural networks
CN110718232B (en) Speech enhancement method for generating countermeasure network based on two-dimensional spectrogram and condition
CN107680582A (en) Acoustic training model method, audio recognition method, device, equipment and medium
CN109065027A (en) Speech differentiation model training method, device, computer equipment and storage medium
CN109979486B (en) Voice quality assessment method and device
CN106128477B (en) A kind of spoken identification correction system
CN103065620A (en) Method with which text input by user is received on mobile phone or webpage and synthetized to personalized voice in real time
CN108847252B (en) Acoustic feature extraction method based on acoustic signal spectrogram texture distribution
CN113111786B (en) Underwater target identification method based on small sample training diagram convolutional network
CN112802494B (en) Voice evaluation method, device, computer equipment and medium
CN106558308A (en) A kind of internet audio quality of data auto-scoring system and method
CN114373452A (en) Voice abnormity identification and evaluation method and system based on deep learning
CN113707158A (en) Power grid harmful bird seed singing recognition method based on VGGish migration learning network
CN113345443A (en) Marine mammal vocalization detection and identification method based on mel-frequency cepstrum coefficient
CN111626093B (en) Method for identifying related bird species of power transmission line based on sound power spectral density
CN114863938A (en) Bird language identification method and system based on attention residual error and feature fusion
CN110473548B (en) Classroom interaction network analysis method based on acoustic signals
CN113707159A (en) Electric network bird-involved fault bird species identification method based on Mel language graph and deep learning
CN111091816B (en) Data processing system and method based on voice evaluation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant