CN112926380A

CN112926380A - Novel underwater laser target intelligent recognition system

Info

Publication number: CN112926380A
Application number: CN202110023538.4A
Authority: CN
Inventors: 刘兴高; 田子健; 王文海; 张志猛; 张泽银
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2021-01-08
Filing date: 2021-01-08
Publication date: 2021-06-08
Anticipated expiration: 2041-01-08
Also published as: CN112926380B

Abstract

The invention discloses a novel underwater laser target intelligent identification system which comprises an underwater laser radar, a three-dimensional sample library, a two-dimensional image database and an underwater laser target identification upper computer system, wherein the underwater laser target identification upper computer system comprises a sample distributed image generation module, a feature extraction module, a convolution attention module, a feature selection module, a feature fusion module, a decision module, an output and display module and the like. And provides an algorithm for carrying out target identification by using the underwater three-dimensional stereo sample. The invention has the advantages of realizing the extraction of a plurality of images from a three-dimensional sample and the effective characteristic extraction, identifying the moving target object in a turbid water area, having high identification accuracy and high intelligent degree, and simultaneously solving the problems of characteristic loss, high noise of the image in the turbid water area, identification of the number of small samples of the moving target and the like when a three-dimensional sample is converted into a two-dimensional image, which are commonly existed in the traditional underwater laser identification system.

Description

Novel underwater laser target intelligent recognition system

Technical Field

The invention relates to the fields of computer vision, intelligent recognition algorithm, laser radar and underwater target recognition, in particular to a novel underwater target intelligent recognition technology based on laser radar.

Background

The ocean area accounts for more than seven percent of the total area of the earth and has very much resources and space, but the ocean space which is explored by human beings currently is only about 5 percent of the total space of the earth, the ocean has important military strategic position and resource exploitation and utilization position, and due to the fact that the ocean environment is inexperienced and changeable, people are used for exploring the ocean environment, and casualties can be caused with high probability. Therefore, the underwater world promotion schedule is explored with modern intelligent technology. At present, the commonly used method for acquiring underwater target information comprises acoustic detection and optical detection, and due to the difference of frequency and applicable wave band, the two detection modes are also commonly used in different detection scenes. The acoustic frequency is low, so that the method is commonly used for judging whether a target exists in a long distance, and the optical frequency is high, so that the method is commonly used for identifying a short-distance target.

Due to the strong attenuation of the water body to the incident light, the incident light presents exponential attenuation when propagating in water, and the common light source is difficult to achieve ideal detection depth when propagating in water due to poor collimation of the common light source. The appearance of laser provides a new underwater detection means, the low attenuation of the laser at an ocean window can reach the detection depth which is difficult to reach by the conventional optical means by utilizing the high collimation of the laser, particularly, the application of the conventional laser radar technology can not only carry out target detection at larger water depth, but also overcome the problem that the acoustic resistance of water and air is not matched, and the direct detection from water to underwater is realized. However, due to the complex underwater condition, the disordered motion of various organisms in water and a plurality of imaging noise points, an intelligent underwater recognition system is urgently needed to solve the problem of target recognition in image processing when a target needs to be accurately recognized in an underwater image obtained by laser.

The sample that laser radar drawed is three-dimensional stereo sample, because the target is often in the moving state under water, laser radar hardly gets a large amount of samples at this in-process of sample, consequently must adopt the small sample to carry out the analysis, and, under muddy complicated waters environment, visibility is extremely low, to the judgement of same object, we often need through different angles to three-dimensional sample carry out the projection to carry out characteristic's contrast and analysis with the two-dimensional image that the projection obtained, thereby obtain the judgement of generality to the discernment thing.

Disclosure of Invention

Aiming at the problem of mismatching of characteristic values when a three-dimensional sample is sampled into a two-dimensional image at present, the method solves the problems that an underwater target is in a moving state and a large sample is difficult to obtain in a snapshot mode, overcomes the problem of insufficient imaging definition in a turbid water area,

the technical scheme adopted by the invention for solving the technical problems is as follows: the novel underwater laser target intelligent identification system is characterized by comprising an underwater laser radar, a three-dimensional sample library, a two-dimensional image database and an underwater laser target identification upper computer system, wherein the underwater laser target identification upper computer system comprises a sample distributed image generation module, a convolution attention module, a feature extraction module, a feature selection module, a feature fusion module, a decision module, an output and display module and the like. The method is characterized in that an underwater laser radar performs laser scanning on a target recognition object in a selected water area, three-dimensional sample data returned by the radar is stored in a three-dimensional sample bank, a sample distributed image generation module in an upper computer system for underwater laser target recognition performs automatic multi-angle shooting on the three-dimensional sample data in the three-dimensional sample bank to obtain two-dimensional image samples, data enhancement is performed on the two-dimensional image samples, then the processed two-dimensional image samples are stored in a two-dimensional image database, a feature extraction module performs feature extraction on the enhanced two-dimensional image and transfers the extracted features and the two-dimensional image to a convolution attention module, the convolution attention module automatically performs convolution coding on the features in a plurality of two-dimensional images from the same three-dimensional sample in the two-dimensional image database, and automatically establishes a mapping relation between a two-dimensional image space and a three-dimensional sample space, and the processed data is sent to a feature selection module, the feature selection module further screens the two-dimensional image features according to the attention coefficients generated in the convolutional codes, the feature fusion module further matches and corrects the screened features with the convolutional codes added with the attention coefficients after receiving the screened features, the features are delivered to a decision module for judgment of target recognition, and the decision module sends the final decision to an output and display module for a man-machine interaction and display link.

Furthermore, the sample distributed image generation module can automatically take two-dimensional image samples from multiple angles for three-dimensional sample data in the three-dimensional sample library, and perform image enhancement processing on the images, and the main processing algorithm is as follows:

2.1){x_ij,(i,j)∈I²i is a set or subset of natural numbers) to represent the gray value at each point of the digital image, where I represents the row position of the point in the lattice for which I represents the row position, j represents the column position of the point in the lattice for which j represents the column position, the gray value y at one point is replaced by the median of the point values in the neighborhood of that point: taking the gray value of the neighborhood of the point according to the size sequence x₁≤x₂≤x₃≤...≤x_nArrangement in which { x with a single numerical subscript_nN is a non-zero natural number } represents the label of the gray value of n points in the neighborhood, and then the intermediate value is taken according to the following mode:

2.2) processing the degraded image polluted by noise underwater by adopting a linear filtering mode, and completing the following steps: and the resulting gray value y of the two-dimensional median filter with the filter window A_ijCan be expressed as:

wherein (r, s) is the position information of the point in the filtering window, r is the abscissa of the position information, s is the ordinate of the position information,

indicating taking an intermediate value from each point gray value.

2.3) the calculation formula of the loss function L for carrying out the noise reduction operation on the image is as follows:

wherein i represents an image label ranging from 1 to N, N represents the number of training samples of the convolutional neural network of the image noise reduction module, and X_iRepresenting the ith true noise-free picture, Y_iDenotes that the ith sheet is at X_iAdding noise to the picture, R representing the original picture and Y_iThe residual pictures in between. The subscript F and superscript 2 in the formula indicate that the formula takes a 2 norm.

Further, the feature extraction module is used for analyzing the values of the RGB three-color channels and extracting features, and the mathematical basis for realizing the feature extraction module is as follows:

3.1) carrying out red channel compensation, wherein the compensation formula is as follows:

wherein, x is a certain pixel point in the image, alpha₁In order to compensate for the coefficients of the coefficients,

is the average of the green color components,

is the mean of the red component, I_R(x) Is the red component value, I, of the point_G(x) Is the green component value, I, of the point_d(x) To compensate for the red channel compensation value of the area where the pixel is located before,

the red channel compensation value of the area where the pixel is located after compensation.

3.2) to the image

Correction of color shift, maximum value per channel

And minimum value

The definition formula is:

and

are respectively images

Mean and mean square error, α, in channel d₂Corrected underwater color image for attention of coefficient

Comprises the following steps:

3.3) HSV equalization and deblurring, gamma correction of the luminance, hue and saturation components to enhance hue and luminance contrast, and deblurring of the luminance.

3.4) carrying out feature extraction on the image.

Furthermore, the convolution attention module is used for carrying out convolution coding on the characteristics of a plurality of two-dimensional images from the same three-dimensional sample, generating an attention coefficient, and automatically establishing a mapping relation between a two-dimensional image space and a three-dimensional sample space so as to solve the problem of characteristic mismatching caused by shooting the two-dimensional images at various angles. The mathematical principle expression is as follows:

in the formula I_c(x) Is a three-dimensional sample function, x is a certain pixel point in the image, J_c(x) For a picture in a two-dimensional image function set, c ═ R, G, B },

as background light or backscattered light, t_c(x) The attention coefficient corresponding to the two-dimensional image. By transforming the formula, a two-dimensional image function J is obtained_c(x) Comprises the following steps:

further, the feature selection module and the feature fusion module adopt a long-time memory recurrent neural network (LSTM-RNN) model to perform feature selection and feature fusion, and the state expression at the time t is as follows:

s_t＝f(Ux_t+Ws_t-1)

h_t＝g(Vs_t)

where t denotes the time, s_tIndicating the hidden layer state at time t, s_t-1Indicating the state of the hidden layer at the unit time immediately preceding time t. h is_tIndicating output layer h at time t_tStatus. The hidden layer activation function is f, and U is expressed as the weight between the input layer and the hidden layerAnd W is a weight matrix between the hidden layer and the hidden layer from the time t-1 to the time t, and V is a weight matrix between the output layer and the hidden layer.

Further, the decision module is used for analyzing and fitting the extracted features and finally transmitting the analysis result to the output and display module, and the mathematical support of the decision module is as follows:

6.1) output layer weight equation

6.2) equation of weights of Loop layers

6.3) input layer weight equation

Wherein, w_hy(i, j) represents a weight matrix from the hidden layer to the output layer, w'_hyAnd (i, j) represents a weight matrix from the hidden layer to the output layer after the weight is updated. w is a_hh(i, j) represents a weight matrix between hidden layers, w'_hhAnd (i, j) represents a weight matrix between hidden layers after the weight is updated. w is a_xh(i, j) represents a weight matrix, w ', between the input layer and the hidden layer'_xhAnd (i, j) represents a weight matrix between the input layer and the hidden layer after the weight is updated. Eta is expressed as a learning rate,

is the output layer error.

The technical conception of the invention is as follows: aiming at the problems that the moving frequency of a current underwater target is high, and the number of three-dimensional space samples obtained by a laser radar is small, a sample distributed image generation module is designed to perform projection analysis on three-dimensional space small samples from different angles. Aiming at the problem that a traditional neural network structure cannot link the same characteristics of a plurality of pictures for learning in the process of projecting a three-dimensional space to a two-dimensional screen, a convolution attention module is introduced to carry out convolution coding on the characteristics from the three-dimensional space to the two-dimensional space, the mapping relation between the two-dimensional image space and the three-dimensional sample space is automatically established, and a long-time memory recurrent neural network (LSTM-RNN) model neural network is set up for learning. Aiming at the phenomena of low visibility under turbid water and serious noise of the obtained image, a feature extraction mode of red channel enhancement is adopted, and the feature extraction mode is improved.

The invention has the following beneficial effects: 1. the method combines the traditional CNN and the characteristic fusion mode, provides a new technical scheme applied to the underwater target identification of three-dimensional small sample number, and has the advantages of lower detection cost, higher efficiency and better effect compared with the existing underwater target identification technical scheme. 2. Based on an attention mechanism, convolution attention is introduced into the established long-time memory cyclic neural network, the relation between the characteristic channel and the space is enhanced, and the relation between the two-dimensional image and the three-dimensional space is established. 3. A large number of experiments prove that the method is high in efficiency and stability, and compared with a common underwater target recognition system, the method is high in recognition accuracy and high in feature analysis efficiency.

Drawings

Fig. 1 is a structural model diagram of the system proposed by the present invention.

Detailed Description

The present invention will be described in detail below with reference to the accompanying drawings.

Referring to fig. 1, the underwater laser radar 1, the three-dimensional sample library 2, the two-dimensional image database 3 and the underwater laser target recognition upper computer system 4 are formed, and the underwater laser target recognition upper computer system 4 is formed by sequentially connecting a sample distributed image generation module 5, a convolution attention module 6, a feature extraction module 7, a feature selection module 8, a feature fusion module 9, a decision module 10 and an output and display module 11.

The underwater laser radar 1 obtains a three-dimensional sample of a target object to be detected by emitting laser and receiving the laser reflected by the target, and stores the three-dimensional sample into a three-dimensional sample library 2.

A sample distributed image generation module 5 in the underwater laser target recognition upper computer system automatically shoots three-dimensional sample data in a three-dimensional sample library 2 from multiple angles to obtain two-dimensional image samples, performs data enhancement on the two-dimensional image samples, and then stores the processed two-dimensional image samples into a two-dimensional image database 3. In this step, the image enhancement processing on the image sample is processed according to the following formula:

{x_ij,(i,j)∈I²i is a set or subset of natural numbers) to represent the gray value at each point of the digital image, where I represents the row position of the point in the lattice for which I represents the row position, j represents the column position of the point in the lattice for which j represents the column position, the gray value y at one point is replaced by the median of the point values in the neighborhood of that point: taking the gray value of the neighborhood of the point according to the size sequence x₁≤x₂≤x₃≤...≤x_nArrangement in which { x with a single numerical subscript_nN is a non-zero natural number } represents the label of the gray value of n points in the neighborhood, and then the intermediate value is taken according to the following mode:

processing the degraded image polluted by noise underwater by adopting a linear filtering mode, and completing the following steps: and the resulting gray value y of the two-dimensional median filter with the filter window A_ijCan be expressed as:

indicating taking an intermediate value from each point gray value. To proceed with imageThe loss function L of the noise reduction operation is calculated as follows:

The feature extraction module 6 is used for analyzing the RGB three-color channel values, extracting features, and compensating the red channel, wherein the compensation formula is as follows:

is the average of the green color components,

the red channel compensation value of the area where the pixel is located after compensation. For images

Correction of color shift, maximum value per channel

And minimum value

The definition formula is:

and

are respectively images

Comprises the following steps:

HSV equalization and deblurring are adopted, gamma correction is carried out on the brightness, hue and saturation components to enhance hue and brightness contrast, deblurring is carried out on the brightness, and feature extraction is carried out on the image.

The convolution attention module 7 is used for performing convolution coding on the characteristics of a plurality of two-dimensional images from the same three-dimensional sample, generating an attention coefficient, and automatically establishing a mapping relation between a two-dimensional image space and a three-dimensional sample space so as to solve the problem of characteristic mismatching caused by shooting the two-dimensional images at various angles. The mathematical principle expression is as follows:

the feature selection module 8 and the feature fusion module 9 adopt a long-time memory recurrent neural network (LSTM-RNN) model to perform feature selection and feature fusion, and the state expression at the time t is as follows:

s_t＝f(Ux_t+Ws_t-1)

h_t＝g(Vs_t)

where t denotes the time, s_tIndicating the hidden layer state at time t, s_t-1Indicating the state of the hidden layer at the unit time immediately preceding time t. h is_tIndicating output layer h at time t_tStatus. The hidden layer activation function is f, U is expressed as a weight matrix between the input layer and the hidden layer, W is expressed as a weight matrix between the hidden layer and the hidden layer from the time t-1 to the time t, and V is expressed as a weight matrix between the output layer and the hidden layer.

The decision module 10 is used for analyzing and fitting the extracted features and finally transmitting the analysis result to the output and display module 11, and the mathematical support of the decision module is as follows:

output layer weight equation

Circulating layerWeight equation

Input layer weight equation

is the output layer error.

The output and display module 11 displays the final result of the classification recognition of the image features.

The above examples are intended to illustrate the invention, but not to limit the invention, and any modifications and variations of the invention within the spirit of the invention and the scope of the claims are within the scope of the invention.

Claims

1. The novel underwater laser target intelligent identification system is characterized by comprising an underwater laser radar, a three-dimensional sample library, a two-dimensional image database and an underwater laser target identification upper computer system, wherein the underwater laser target identification upper computer system comprises a sample distributed image generation module, a convolution attention module, a feature extraction module, a feature selection module, a feature fusion module, a decision module, an output and display module and the like. The underwater laser radar performs laser scanning on a target recognition object in a selected water area, stores three-dimensional sample data returned by the radar into a three-dimensional sample library, a sample distributed image generation module performs multi-angle shooting on the three-dimensional sample data in the three-dimensional sample library to obtain two-dimensional image samples, performs data enhancement on the two-dimensional image samples, then stores the processed two-dimensional image samples into a two-dimensional image database, a feature extraction module performs feature extraction on the enhanced two-dimensional image and transfers the extracted features and the two-dimensional image to a convolution attention module, the convolution attention module performs convolution coding on the features in a plurality of two-dimensional images from the same three-dimensional sample in the two-dimensional image database, automatically establishes a mapping relation between a two-dimensional image space and a three-dimensional sample space, and sends the processed data to a feature selection module, the feature selection module further screens the two-dimensional image features according to the attention coefficients generated in the convolutional coding; after receiving the screened features, the feature fusion module further matches and corrects the screened features with the convolution codes added with attention coefficients, and then the selected features are sent to a decision module for judgment of target identification; the decision module sends the final decision to the display module for human-computer interaction and display.

2. The novel underwater laser target intelligent identification system according to claim 1, wherein the sample distributed image generation module is capable of performing multi-angle shooting of two-dimensional image samples on three-dimensional sample data in a three-dimensional sample library, and performing image enhancement processing on the images, and specifically comprises:

(2.1){x_ij,(i,j)∈I²i is a set or subset of natural numbers) to represent the gray value at each point of the digital image, where I represents the row position of the point in the lattice for which I represents the row position, j represents the column position of the point in the lattice for which j represents the column position, the gray value y at one point is replaced by the median of the point values in the neighborhood of that point: taking the gray value of the neighborhood of the point according to the size sequence x₁≤x₂≤x₃≤...≤x_nArrangement in which { x with a single numerical subscript_nN is a non-zero natural number } represents the label of the gray value of n points in the neighborhood, and then the intermediate value is taken according to the following mode:

(2.2) processing the degraded image polluted by noise underwater by adopting a linear filtering mode, and completing the following steps: and the resulting gray value y of the two-dimensional median filter with the filter window A_ijCan be expressed as:

indicating taking an intermediate value from each point gray value.

(2.3) the loss function L for performing the noise reduction operation on the image is calculated as follows:

3. The system for intelligently identifying the underwater laser target according to claim 1, wherein the feature extraction module is specifically configured to perform feature extraction on the enhanced two-dimensional image:

(3.1) carrying out red channel compensation, wherein the compensation formula is as follows:

is the average of the green color components,

(3.2) for the image

Correction of color shift, maximum value per channel

And minimum value

The definition formula is:

and

are respectively images

Comprises the following steps:

(3.3) HSV equalization and deblurring, gamma correction of the luminance, hue and saturation components to enhance hue and luminance contrast, and deblurring of the luminance.

And (3.4) carrying out feature extraction on the image.

4. The system of claim 1, wherein the convolution attention module is configured to perform convolution encoding on a plurality of features of two-dimensional images from the same three-dimensional sample, generate an attention coefficient, and automatically establish a mapping relationship between a two-dimensional image space and a three-dimensional sample space, so as to solve a problem of feature mismatch caused by shooting two-dimensional images at various angles:

5. the system for intelligently identifying the underwater laser target according to claim 1, wherein the feature selection module and the feature fusion module adopt a long-time memory cyclic neural network model for feature extraction and selection, and a state expression at the time t is as follows:

s_t＝f(Ux_t+Ws_t-1)

h_t＝g(Vs_t)

6. The system for intelligently recognizing the underwater laser target according to claim 1, wherein the decision module is configured to analyze and fit the extracted features, and finally transmit the analysis result to the output and display module, and specifically:

(6.1) output layer weight equation

(6.2) equation of weights of Loop layers

(6.3) input layer weight equation

is the output layer error.