CN108171200B - SAR image classification method based on SAR image statistical distribution and DBN - Google Patents


Info

Publication number
CN108171200B
CN108171200B (application CN201810031937.3A)
Authority
CN
China
Prior art keywords
boltzmann machine
layer
hidden layer
node
dbn
Prior art date
Legal status
Active
Application number
CN201810031937.3A
Other languages
Chinese (zh)
Other versions
CN108171200A (en)
Inventor
侯彪 (Biao Hou)
焦李成 (Licheng Jiao)
梁亚敏 (Yamin Liang)
马晶晶 (Jingjing Ma)
马文萍 (Wenping Ma)
王爽 (Shuang Wang)
白静 (Jing Bai)
Current Assignee
Xidian University
Original Assignee
Xidian University
Priority date
Filing date
Publication date
Application filed by Xidian University
Priority to CN201810031937.3A
Publication of CN108171200A
Application granted
Publication of CN108171200B
Legal status: Active
Anticipated expiration

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 - Scenes; Scene-specific elements
    • G06V20/10 - Terrestrial scenes
    • G06V20/13 - Satellite images
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/24 - Classification techniques
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/044 - Recurrent networks, e.g. Hopfield networks
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/08 - Learning methods
    • G06N3/084 - Backpropagation, e.g. using gradient descent

Abstract

The invention discloses an SAR image classification method based on the statistical distribution of SAR images and a deep belief network (DBN), which mainly addresses the poor regional consistency and incomplete edge information that arise when a conventional DBN is used for SAR image classification. The method is realized as follows: preprocess the SAR image to be classified to obtain the input matrix of the DBN; design a DBN composed of 3 restricted Boltzmann machines; pre-train the designed network with the input matrix to obtain a trained DBN; randomly select a subset of labeled pixels from the SAR image's class-label map and fine-tune the trained DBN with the back-propagation algorithm; classify the image pixel by pixel with the fine-tuned DBN, then color and output the classification result. The method yields excellent classification results with good regional consistency and complete edge information, and can be applied to terrain classification and target recognition in SAR images.

Description

SAR image classification method based on SAR image statistical distribution and DBN
Technical Field
The invention belongs to the field of image processing, and particularly relates to an SAR image classification method which can be applied to terrain classification and target identification of an SAR image.
Background
Synthetic aperture radar (SAR) is an airborne or spaceborne radar capable of producing high-resolution images; it operates in all weather conditions, day and night, and is widely used in remote sensing, mapping, and related fields. SAR images are terrain images of the ground surface formed from the radar signals reflected by the illuminated scene; with their contrasts of light and dark tones they resemble black-and-white photographs. Compared with optical images, SAR images are harder to interpret, so SAR image classification is difficult. The goal of SAR image classification is to label different ground-object classes with different colors through a designed classification algorithm.
The statistical model of an SAR image plays a very important role in ground-object target detection and recognition, speckle-noise suppression, ground-object classification, and related tasks. According to the modeling process, statistical models of SAR images fall roughly into two categories: parametric models and non-parametric models. A parametric model estimates the unknown parameters of a probability density function from the available SAR image data and then selects the best-fitting density as the statistical model of the ground-feature region of the SAR image; examples include the Gamma, Weibull, and Log-normal distributions. A non-parametric model selects the best probability density function directly from the data of the ground-feature region of the SAR image according to some rule.
The feature extraction method for the classification of the SAR image can be roughly classified into three types: statistical methods, transform domain methods, and model-based methods. Wherein:
A typical statistical method is the gray-level co-occurrence matrix feature proposed by U. Kandaswamy et al. (U. Kandaswamy, D. A. Adjeroh, and M. C. Lee, "Efficient texture analysis of SAR imagery," IEEE Trans. Geosci. Remote Sens., 2005, 43(9): 2075-2083), which includes variance, contrast, energy, and so on. Because these features are based only on the statistics of the SAR image and ignore its spatial-neighborhood characteristics, the classification results obtained by statistical methods have poor spatial consistency.
Transform-domain methods, including Fourier transforms, wavelet transforms, and Gabor wavelet transforms, are mainly used for texture analysis of SAR images; see G. Akbarizadeh, "A new statistical-based kurtosis wavelet energy feature for texture recognition of SAR images," IEEE Trans. Geosci. Remote Sens., 2012, 50(11): 4358-4368. These transforms cannot extract sufficient discriminative information and rely on a large amount of high-quality label information, so they do not yield good results when applied to SAR image classification.
Model-based methods include Markov-random-field models and Bag-of-Words models. Markov-random-field models mainly study the spatial consistency of neighboring pixels; in practice they cannot effectively extract the texture information of the image, and because they neither mine the statistical relationships across the different resolutions of an SAR image nor address the prior-probability problem beyond a single resolution, they are unsuited to classifying multi-resolution SAR images. When the Bag-of-Words model proposed by Jie Feng et al. is used for SAR image classification, low-level features must be extracted manually, which makes the process cumbersome.
Disclosure of Invention
The invention aims to provide an SAR image classification method based on the statistical distribution of SAR images and a deep belief network (DBN), overcoming the defects of the prior art and improving the classification accuracy and regional consistency of SAR image classification.
The technical scheme for realizing the aim of the invention comprises the following steps:
1) reading in an SAR image to be classified, forming a neighborhood matrix by one pixel in the image and pixels around the pixel, taking a neighborhood matrix for each pixel of the image, and forming an input matrix of a neural network by the neighborhood matrices;
2) designing a depth confidence network DBN consisting of 3 restricted Boltzmann machines, wherein the 1 st restricted Boltzmann machine is a Gamma restricted Boltzmann machine Gamma RBM, and the 2 nd and 3 rd restricted Boltzmann machines are Bernoulli-Bernoulli restricted Boltzmann machines BBRBM; each limited Boltzmann machine consists of a visible layer and a hidden layer, nodes in the layers are not connected, nodes between the layers are all connected, and a parameter set of each limited Boltzmann machine is { W, b, c }, wherein W is a weight matrix connecting nodes of the visible layer and nodes of the hidden layer, and b and c are offsets of the visible layer and the hidden layer respectively;
3) training the network designed in the step 2 to obtain a trained deep belief network DBN:
3a) the input matrix is used as the input of a deep confidence network DBN, the 1 st limited Boltzmann machine is pre-trained to obtain the output of the 1 st limited Boltzmann machine, and the weight matrix and the bias of the output are stored;
3b) the output of the 1 st limited Boltzmann machine is used as the input of the second limited Boltzmann machine of the deep confidence network DBN, the 2 nd limited Boltzmann machine is pre-trained to obtain the output of the 2 nd limited Boltzmann machine, and the weight matrix and the bias of the output are stored;
3c) setting the values of the 3rd restricted Boltzmann machine's weight matrix to normally distributed random numbers with mean 0 and variance 0.1, and setting the hidden-layer bias c to random numbers in [0, 1];
4) according to the labeled category information in the ground object category reference map of the SAR image, randomly selecting a part of pixel sets with category labels for each category;
5) using the back-propagation algorithm (BP), performing supervised fine-tuning of the trained deep belief network DBN's parameters with the labeled pixel set to obtain the adjusted deep belief network DBN;
6) classifying all pixels in the image to be classified one by using the adjusted depth confidence network DBN to obtain a classification result;
7) marking the same color for each class on the classified SAR image, using the red, green and blue primaries, to obtain and output a colored classification-result image.
Compared with the prior art, the invention has the following advantages:
1. the invention improves the application of the deep learning method to the SAR image, combines the statistical distribution characteristic of the SAR image with the classic deep learning model DBN, realizes the automatic extraction of the SAR image characteristic through multi-layer characteristic learning, and overcomes the difficulty of manually extracting the characteristic.
2. The invention can achieve better classification performance by classifying through a deep confidence network DBN formed by a novel limited Boltzmann machine after supervised fine adjustment.
Simulation results show that the method has the advantages of excellent classification effect, good regional consistency and complete edge information.
Drawings
FIG. 1 is a flow chart of an implementation of the present invention;
FIG. 2 is a SAR image to be classified containing three types of ground object backgrounds;
FIG. 3 is a real terrain labeling diagram of a three-class terrain background SAR image;
FIG. 4 is a graph of the results of a prior art simulation of the classification of FIG. 2;
FIG. 5 is a diagram of the classification simulation results of the present invention with respect to FIG. 2.
Detailed Description
Embodiments and effects of the present invention will be described in further detail below with reference to the accompanying drawings.
Referring to fig. 1, the implementation steps of the present invention are as follows:
firstly, preprocessing an image to be classified to obtain an input matrix of a Deep Belief Network (DBN).
1a) Reading in an SAR image to be classified as shown in fig. 2;
1b) let e be a pixel in the image, and form a 3×3 neighborhood matrix Y from pixel e and its surrounding pixels a, b, c, d, f, g, h, i:

Y = [ a  b  c
      d  e  f
      g  h  i ]
1c) converting the neighborhood matrix Y into a row vector X:
X=[a d g b e h c f i],
1d) repeating 1b) and 1c) for all pixels of the SAR image to be classified to obtain a row vector x_i for each pixel, i = 1, 2, ..., n, where n is the number of pixels of the SAR image;
1e) the input matrix D of the deep belief network DBN is formed by the row vectors:
D = [x_1; x_2; ...; x_i; ...; x_n].
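The preprocessing of steps 1a)-1e) can be sketched as follows. This is a minimal Python illustration (the authors' experiments were run in MATLAB); the function name and the replicate-padding used for border pixels are assumptions of this sketch, not specified by the patent.

```python
import numpy as np

def build_input_matrix(img):
    """Build the DBN input matrix D: one row per pixel, each row the
    column-wise flattening of that pixel's 3x3 neighborhood Y
    (border pixels handled by replicate-padding, an assumption here)."""
    padded = np.pad(img, 1, mode="edge")       # so every pixel has 8 neighbors
    h, w = img.shape
    rows = []
    for r in range(h):
        for c in range(w):
            Y = padded[r:r + 3, c:c + 3]       # 3x3 neighborhood matrix Y
            rows.append(Y.flatten(order="F"))  # column-wise: [a d g b e h c f i]
    return np.vstack(rows)                     # D has n = h*w rows, 9 columns

img = np.arange(16, dtype=float).reshape(4, 4)
D = build_input_matrix(img)
print(D.shape)  # (16, 9)
```

Note the column-wise (`order="F"`) flattening, which matches the ordering X = [a d g b e h c f i] in step 1c).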
and step two, designing a deep confidence network DBN consisting of 3 limited Boltzmann machines.
The 1 st restricted Boltzmann machine is a Gamma restricted Boltzmann machine, a Gamma RBM, and the 2 nd and 3 rd restricted Boltzmann machines are Bernoulli-Bernoulli restricted Boltzmann machines, a BBRBM;
each limited Boltzmann machine consists of a visible layer and a hidden layer, nodes in the layers are not connected, nodes between the layers are all connected, and a parameter set of each limited Boltzmann machine is { W, b, c }, wherein W is a weight matrix connecting nodes of the visible layer and nodes of the hidden layer, and b and c are offsets of the visible layer and the hidden layer respectively;
stacking the three restricted Boltzmann machines as follows to obtain the deep belief network DBN:
since the hidden-layer output of the 1st restricted Boltzmann machine coincides with the visible-layer input of the 2nd, the hidden layer of the 1st restricted Boltzmann machine and the visible layer of the 2nd are merged into the second layer of the DBN;
likewise, the hidden layer of the 2nd restricted Boltzmann machine and the visible layer of the 3rd are merged into the third layer of the DBN, and the hidden layer of the 3rd restricted Boltzmann machine serves as the fourth layer, yielding a deep belief network DBN with 1 visible layer and 3 hidden layers in which adjacent layers are fully connected.
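The stacked structure amounts to three weight matrices chained so that each hidden layer feeds the next visible layer. Below is a hedged Python sketch; the layer sizes are made up for illustration, as the patent does not specify node counts.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class DBN:
    """Minimal stack of 3 RBMs: hidden layer l doubles as visible layer l+1.
    sizes = [n_visible, n_h1, n_h2, n_h3] (illustrative, not from the patent)."""
    def __init__(self, sizes, rng):
        self.W = [rng.standard_normal((m, n)) * 0.1
                  for m, n in zip(sizes[:-1], sizes[1:])]   # one W per RBM
        self.c = [np.zeros(n) for n in sizes[1:]]           # hidden biases
        self.b = [np.zeros(m) for m in sizes[:-1]]          # visible biases

    def forward(self, v):
        """Propagate input through the 3 stacked RBMs (mean activations)."""
        h = v
        for W, c in zip(self.W, self.c):
            h = sigmoid(h @ W + c)
        return h

rng = np.random.default_rng(0)
dbn = DBN([9, 50, 50, 3], rng)      # 9-dim input rows from the 3x3 neighborhoods
out = dbn.forward(np.ones((4, 9)))
print(out.shape)  # (4, 3)
```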
And step three, training the deep confidence network DBN designed in the step 2 to obtain the trained deep confidence network DBN.
3a) Taking the input matrix D as the input of a deep confidence network DBN, pre-training a1 st limited Boltzmann machine to obtain hidden layer output of the 1 st limited Boltzmann machine, and storing a weight matrix and bias thereof:
(3a1) for a specific set of data (v, h), a new energy function is derived from the Gamma distribution of the SAR image and the energy formula of the restricted Boltzmann machine:
Figure GDA0001628675890000041
where v and h denote the visible and hidden layers of the restricted Boltzmann machine, and L denotes the number of looks of the SAR image; W_ij is the element in row i, column j of the weight matrix, i.e. the weight connecting the i-th visible node and the j-th hidden node; n_v is the number of visible-layer nodes and n_h the number of hidden-layer nodes; v_i is the state of the i-th visible node and h_j the state of the j-th hidden node; b_i and c_j are the biases of visible node v_i and hidden node h_j, respectively;
(3a2) deriving an activation value formula of the jth hidden node from the new energy function as follows:
Figure GDA0001628675890000051
wherein the sigmoid function is an activation function in the neural network, which is defined as:
sigmoid(x) = 1/(1 + e^(-x))
the activation-value formula gives, for a known visible-layer state v^(k), the probability obtained by sampling that the j-th hidden node takes the value 1;
(3a3) calling the CD-k algorithm to perform k steps of Gibbs sampling: generate a random number R_j in [0, 1]; if P(h_j = 1 | v) > R_j, set h_j = 1, otherwise set h_j = 0. After all hidden nodes have been sampled, the hidden-layer state h^(k) of the restricted Boltzmann machine at the k-th iteration is obtained;
(3a4) After sampling to obtain a hidden layer state, reconstructing an activation value of an ith visible layer node:
Figure GDA0001628675890000052
wherein
Figure GDA0001628675890000053
Where Γ () represents a Gamma function, the activation value formula for the ith visible layer node is expressed in the known hidden state h(k)Sampling to obtain the probability that the ith visible layer node value is x, and sampling all visible layer nodes to obtain the visible layer state v(k+1)
(3a5) let the training sample set of the first restricted Boltzmann machine be:
Figure GDA0001628675890000054
where n_s is the number of training samples and v_t is the t-th training sample:
Figure GDA0001628675890000055
(3a6) calculating the probability of the t training sample:
Figure GDA0001628675890000056
where θ is the initialized parameter set of the restricted Boltzmann machine, i.e. θ = {W_ij, b_i, c_j}, and Z is the partition function of the restricted Boltzmann machine;
(3a7) taking the partial derivatives of P(v_t | θ) with respect to W_ij, b_i and c_j to obtain the gradient of each parameter at the k-th iteration:
Figure GDA0001628675890000061
Figure GDA0001628675890000062
Figure GDA0001628675890000063
wherein Δ wij kRepresenting the gradient of the weight matrix of the first restricted Boltzmann machine, Δ bi kGradient, Δ c, representing the visual layer bias of the first restricted Boltzmann machinej kA gradient representing a hidden layer bias of a first restricted boltzmann machine;
(3a8) from the gradients of the k-th iteration, obtaining the trained weight matrix W_ij^k, visible-layer bias b_i^k and hidden-layer bias c_j^k of the first restricted Boltzmann machine:
W_ij^k = W_ij + η·Δw_ij^k
b_i^k = b_i + η·Δb_i^k
c_j^k = c_j + η·Δc_j^k
where η is the learning rate of each parameter;
3b) taking the hidden layer output of the 1 st limited Boltzmann machine as the visible layer input of the second limited Boltzmann machine of the deep confidence network DBN, pre-training the 2 nd limited Boltzmann machine to obtain the hidden layer output of the 2 nd limited Boltzmann machine, and storing the weight matrix and the bias thereof:
(3b1) for a given pair of data (v^(1), h^(1)), the energy function of the second restricted Boltzmann machine in the network is:
Figure GDA0001628675890000064
where v^(1) and h^(1) denote the visible and hidden layers of the restricted Boltzmann machine, W^(1) is the weight matrix connecting the visible and hidden nodes, and W_ij^(1) is its element in row i, column j; n_v is the number of visible-layer nodes and n_h the number of hidden-layer nodes; v_i^(1) is the state of the i-th visible node and h_j^(1) the state of the j-th hidden node; b_i^(1) and c_j^(1) are the biases of visible node v_i^(1) and hidden node h_j^(1), respectively;
(3b2) based on the energy function of (3b1), obtaining an activation value formula of the j hidden layer node of the second limited Boltzmann machine:
P(h_j^(1) = 1 | v^(1)) = sigmoid(c_j^(1) + Σ_i v_i^(1)·W_ij^(1))
the activation-value formula gives, for a known visible-layer state v^(0), the probability obtained by sampling that the j-th hidden node takes the value 1;
(3b3) calling the CD-k algorithm to perform k steps of Gibbs sampling: first generate a random number R_j in [0, 1]; if P(h_j^(1) = 1 | v^(0)) > R_j, set h_j^(1) = 1, otherwise set h_j^(1) = 0. After all hidden nodes have been sampled, the hidden-layer state h^(0) of the second restricted Boltzmann machine is obtained, where k is set to 1;
(3b4) after sampling to obtain a hidden layer state, reconstructing an activation value formula of an ith visible layer node of a second limited Boltzmann machine as follows:
P(v_i^(1) = 1 | h^(1)) = sigmoid(b_i^(1) + Σ_j W_ij^(1)·h_j^(1))
the activation-value formula gives, for a known hidden-layer state h^(0), the probability obtained by sampling that the i-th visible node takes the value 1; after all visible nodes have been sampled, the visible-layer state v^(1) is obtained;
(3b5) let the training sample set of the second restricted Boltzmann machine be:
Figure GDA0001628675890000073
where n_M is the number of training samples and v_m is the m-th training sample:
Figure GDA0001628675890000074
Figure GDA0001628675890000075
(3b6) given the initialized parameter set θ^(1) of the second restricted Boltzmann machine, obtaining the probability of the m-th training sample:
Figure GDA0001628675890000076
where θ^(1) = {W_ij^(0), b_i^(0), c_j^(0)} and Z^(1) is the partition function of the second restricted Boltzmann machine;
(3b7) taking the partial derivatives of P(v_m | θ^(1)) with respect to W_ij^(1), b_i^(1) and c_j^(1) to obtain:
Figure GDA0001628675890000081
Figure GDA0001628675890000082
Figure GDA0001628675890000083
wherein Δ wij (0)Representing the gradient of the weight matrix of the second restricted Boltzmann machine, Δ bi (0)Gradient, Δ c, representing the visual layer bias of the second restricted Boltzmann machinej (0)A gradient representing a hidden layer bias of the second limited boltzmann machine;
(3b8) from the gradients obtained in (3b7), obtaining the trained weight matrix W_ij^*, visible-layer bias b_i^* and hidden-layer bias c_j^* of the second restricted Boltzmann machine:
W_ij^* = W_ij^(0) + λ·ΔW_ij^(0)
b_i^* = b_i^(0) + λ·Δb_i^(0)
c_j^* = c_j^(0) + λ·Δc_j^(0)
where λ denotes the learning rate of each parameter;
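Steps (3b2)-(3b8) amount to one step of contrastive divergence (CD-1) for a Bernoulli-Bernoulli RBM. The Python sketch below is the textbook form of that update, not the patent's exact code; the 1st (Gamma) restricted Boltzmann machine would differ only in its visible-layer conditional, whose exact form the patent gives in figures not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

def cd1_step(v0, W, b, c, lam=0.1):
    """One CD-1 update for a Bernoulli-Bernoulli RBM (cf. steps 3b2-3b8):
    sample h from p(h|v0), reconstruct v, recompute hidden activations,
    then take a gradient step with learning rate lam."""
    p_h0 = sigmoid(v0 @ W + c)                          # (3b2) hidden activations
    h0 = (p_h0 > rng.random(p_h0.shape)).astype(float)  # (3b3) Gibbs sampling
    p_v1 = sigmoid(h0 @ W.T + b)                        # (3b4) reconstruction
    v1 = (p_v1 > rng.random(p_v1.shape)).astype(float)
    p_h1 = sigmoid(v1 @ W + c)
    n = v0.shape[0]
    dW = (v0.T @ p_h0 - v1.T @ p_h1) / n                # (3b7) CD gradients
    db = (v0 - v1).mean(axis=0)
    dc = (p_h0 - p_h1).mean(axis=0)
    return W + lam * dW, b + lam * db, c + lam * dc     # (3b8) parameter update

n_v, n_h = 9, 16
W = rng.standard_normal((n_v, n_h)) * 0.1
b, c = np.zeros(n_v), np.zeros(n_h)
v = (rng.random((20, n_v)) > 0.5).astype(float)         # toy batch of 20 samples
W, b, c = cd1_step(v, W, b, c)
print(W.shape)  # (9, 16)
```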
3c) initializing a third restricted boltzmann machine: and setting the numerical value in the 3 rd limited Boltzmann machine weight matrix as a normally distributed random number with the mean value of 0 and the variance of 0.1, and setting the hidden layer bias c as a random number in [0,1], thereby obtaining the trained deep confidence network DBN.
And step four, extracting the pixel set with the category label.
According to the labeled category information in the ground-object class-label map of the SAR image shown in fig. 3, 15% of the labeled pixels in each class region are randomly selected to form the labeled pixel set.
And step five, using the back-propagation algorithm (BP), performing supervised fine-tuning of the trained deep belief network DBN with the labeled pixel set to obtain the adjusted deep belief network DBN.
5a) forward propagation: the input matrix D is propagated from the first restricted Boltzmann machine to the third, i.e. the hidden-layer output of each restricted Boltzmann machine serves as the visible-layer input of the next, yielding the actual output of the hidden layer of the third restricted Boltzmann machine;
5b) backward propagation: the error between the actual output of step 5a) and the expected output is propagated backwards layer by layer through the two preceding restricted Boltzmann machines, and the weights and biases of the trained deep belief network DBN are fine-tuned, giving the fine-tuned deep belief network DBN;
the weights and biases of the trained deep belief network DBN are fine-tuned as follows:
5b1) calculating the sensitivity of the jth node of the hidden layer of each restricted Boltzmann machine:
δ_j = o_j·(1 − o_j)·(d_j − o_j),  j = 1, 2, ..., n_h
where o_j denotes the actual output of the j-th node, d_j the expected output of the j-th node, and n_h the number of hidden-layer nodes;
5b2) calculating the sensitivity δ_i^l of the hidden layer of the l-th restricted Boltzmann machine from the sensitivities of the hidden-layer nodes of the layer above:
Figure GDA0001628675890000091
where y_i^l denotes the actual output of the hidden layer of the l-th restricted Boltzmann machine, w_ij^l its weight matrix, and δ_j^(l+1) the sensitivity of the j-th hidden node of the (l+1)-th restricted Boltzmann machine;
5b3) from the hidden-layer sensitivities, obtaining the fine-tuned weight matrix w_ij^l and bias c_j^l of each restricted Boltzmann machine:
w_ij^l = w_ij^l + ε·y_i^l·δ_j^(l+1)
c_j^l = c_j^l + ε·δ_j^(l+1)
where ε denotes the learning rate of the fine-tuning.
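The fine-tuning formulas of 5b1)-5b3) can be checked numerically. The sketch below assumes the usual sigmoid-derivative factor y(1 − y) in the hidden-layer sensitivity, consistent with the output-layer formula of 5b1); the patent's own hidden-layer formula appears only as a figure, so this is a hedged reconstruction.

```python
import numpy as np

def output_sensitivity(o, d):
    """(5b1) Sensitivity of output nodes: delta_j = o_j*(1-o_j)*(d_j-o_j)."""
    return o * (1.0 - o) * (d - o)

def hidden_sensitivity(delta_next, W, y):
    """(5b2) Back-propagate sensitivities one layer down; the y*(1-y)
    factor is the sigmoid derivative (assumed, see lead-in)."""
    return y * (1.0 - y) * (W @ delta_next)

def finetune_layer(W, c, y_prev, delta_next, eps=0.01):
    """(5b3) Gradient step on weights and hidden bias, learning rate eps."""
    W = W + eps * np.outer(y_prev, delta_next)
    c = c + eps * delta_next
    return W, c

o = np.array([0.8, 0.3])   # actual outputs of two nodes
d = np.array([1.0, 0.0])   # expected outputs
delta = output_sensitivity(o, d)
print(np.round(delta, 4))  # [ 0.032 -0.063]
```

For the first node, 0.8 × 0.2 × 0.2 = 0.032, matching the formula term by term.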
And step six, carrying out pixel-by-pixel classification on the image to be classified to obtain a classification result.
All pixels of the SAR image to be classified are classified one by one with the supervised, fine-tuned deep belief network DBN, and the class label of each pixel is output, giving the classification result for the whole SAR image.
And step seven, obtaining a colored classification result graph according to the classification result of the step six.
The pixels of each class in the classified SAR image are marked with the same color, using the red, green and blue primaries, to obtain and output the colored classification-result image.
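As a sketch, the coloring step can be done with a label-to-RGB lookup. The assignment of red, green and blue to the three classes below is an arbitrary assumption; the patent only states that each class receives one of the primaries.

```python
import numpy as np

# Hypothetical palette: class 1 -> red, 2 -> green, 3 -> blue.
PALETTE = {1: (255, 0, 0), 2: (0, 255, 0), 3: (0, 0, 255)}

def colorize(labels):
    """Map a 2-D array of class labels to an RGB image."""
    rgb = np.zeros(labels.shape + (3,), dtype=np.uint8)
    for cls, color in PALETTE.items():
        rgb[labels == cls] = color
    return rgb

labels = np.array([[1, 2], [3, 1]])   # toy 2x2 label map
img = colorize(labels)
print(img[0, 0])  # [255   0   0]
```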
The effects of the present invention can be further illustrated by the following simulations:
1. simulation conditions
The methods used in the simulation are the traditional DBN-SVM classification method and the method of the invention;
in the simulation experiments, both the method of the invention and the comparison method were implemented in MATLAB R2016b.
2. Emulated content and results
Simulation 1, a conventional DBN-SVM classification method is used to perform a classification experiment on the SAR image shown in fig. 2, and the result is shown in fig. 4.
Simulation 2, the method of the invention is used to perform classification experiments on the SAR image shown in FIG. 2, and the result is shown in FIG. 5.
Comparing the classification result graphs shown in fig. 4 and 5 with the real ground object labeling graph shown in fig. 3, it can be seen from the visual effect that:
the classification result of the traditional DBN-SVM method is affected by speckle noise: misclassification is severe, many pixels are wrongly classified, the edge information of each region is incomplete, regional consistency is poor, and classification accuracy is low;
in the classification result of the method of the invention, regional consistency within each class region is improved, the edge information is clear and complete, few pixels are misclassified, and classification accuracy is high.

Claims (3)

1. The SAR image classification method based on the SAR image statistical distribution characteristics and the DBN comprises the following steps:
1) reading in an SAR image to be classified, forming a neighborhood matrix by one pixel in the image and pixels around the pixel, taking a neighborhood matrix for each pixel of the image, and forming an input matrix of a neural network by the neighborhood matrices;
2) designing a depth confidence network DBN consisting of 3 restricted Boltzmann machines, wherein the 1 st restricted Boltzmann machine is a Gamma restricted Boltzmann machine Gamma RBM, and the 2 nd and 3 rd restricted Boltzmann machines are Bernoulli-Bernoulli restricted Boltzmann machines BBRBM; each limited Boltzmann machine consists of a visible layer and a hidden layer, nodes in the layers are not connected, nodes between the layers are all connected, and a parameter set of each limited Boltzmann machine is { W, b, c }, wherein W is a weight matrix connecting nodes of the visible layer and nodes of the hidden layer, and b and c are offsets of the visible layer and the hidden layer respectively;
3) training the network designed in the step 2 to obtain a trained deep belief network DBN:
3a) the input matrix is used as the input of a deep confidence network DBN, the 1 st limited Boltzmann machine is pre-trained to obtain the output of the 1 st limited Boltzmann machine, and the weight matrix and the bias of the output are stored; the implementation is as follows:
(3a1) for a specific set of data (v, h), a new energy function is derived from the Gamma distribution of the SAR image and the energy formula of the restricted Boltzmann machine:
Figure FDA0003513214460000011
where v and h denote the visible and hidden layers of the restricted Boltzmann machine, and L denotes the number of looks of the SAR image; W_ij is the element in row i, column j of the weight matrix, i.e. the weight connecting the i-th visible node and the j-th hidden node; n_v is the number of visible-layer nodes and n_h the number of hidden-layer nodes; v_i is the state of the i-th visible node and h_j the state of the j-th hidden node; b_i and c_j are the biases of visible node v_i and hidden node h_j, respectively;
(3a2) deriving the activation value of the jth hidden node from the new energy function as follows:
Figure FDA0003513214460000012
wherein the sigmoid function is an activation function in the neural network, which is defined as:
sigmoid(x) = 1/(1 + e^(-x))
the activation-value formula gives, for a known visible-layer state v^(k), the probability obtained by sampling that the j-th hidden node takes the value 1;
(3a3) calling the CD-k algorithm to perform k steps of Gibbs sampling: generate a random number R_j in [0, 1]; if P(h_j = 1 | v) > R_j, set h_j = 1, otherwise set h_j = 0. After all hidden nodes have been sampled, the hidden-layer state h^(k) of the restricted Boltzmann machine at the k-th iteration is obtained;
(3a4) After sampling to obtain a hidden layer state, reconstructing an activation value of an ith visible layer node:
Figure FDA0003513214460000021
wherein
Figure FDA0003513214460000022
Where Γ () represents a Gamma function, the activation value formula represents at a known hidden state h(k)Sampling to obtain the probability that the ith visible layer node value is x, and sampling all visible layer nodes to obtain the visible layer state v(k+1)
(3a5) let the training sample set of the first restricted Boltzmann machine be:
[Formula image FDA0003513214460000023: training sample set of the first restricted Boltzmann machine]
wherein n_s is the number of training samples and v_t is the t-th training sample:
[Formula image FDA0003513214460000024]
(3a6) calculating the probability of the t-th training sample:
[Formula image FDA0003513214460000025: probability P(v_t | θ) of the t-th training sample]
where θ is the initialized parameter set of the restricted Boltzmann machine, i.e. θ = {W_ij, b_i, c_j}, and Z is the partition function of the restricted Boltzmann machine;
(3a7) for P (v)t| θ) to Wij、biAnd cjThe gradient of each parameter of the kth iteration is obtained:
[Formula images FDA0003513214460000026, FDA0003513214460000027 and FDA0003513214460000028: gradients Δw_ij^k, Δb_i^k and Δc_j^k]
wherein Δw_ij^k represents the gradient of the weight matrix of the first restricted Boltzmann machine, Δb_i^k represents the gradient of its visible layer bias, and Δc_j^k represents the gradient of its hidden layer bias;
(3a8) obtaining the trained parameters W_ij^(k+1), b_i^(k+1) and c_j^(k+1) from the gradient of each parameter at the k-th iteration:
W_ij^(k+1) = W_ij + ηΔw_ij^k
b_i^(k+1) = b_i + ηΔb_i^k
c_j^(k+1) = c_j + ηΔc_j^k
wherein η is the learning rate of each parameter;
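The gradient step of (3a7)-(3a8) can be sketched as a single contrastive-divergence update. The sketch assumes the usual CD gradient estimates ⟨v·hᵀ⟩_data − ⟨v·hᵀ⟩_reconstruction (the Gamma energy changes the sampling steps above, not the outer form of these updates); v0/h0 denote the data-driven states and vk/hk the states after k Gibbs rounds, and all names are illustrative.

```python
import numpy as np

def update_parameters(W, b, c, v0, h0, vk, hk, eta=0.01):
    """Steps (3a7)-(3a8): CD-k gradient estimates and gradient-ascent update.

    dW, db, dc are the standard contrastive-divergence estimates of the
    log-likelihood gradients; eta is the learning rate of each parameter.
    """
    dW = np.outer(v0, h0) - np.outer(vk, hk)   # Δw_ij^k
    db = v0 - vk                               # Δb_i^k
    dc = h0 - hk                               # Δc_j^k
    return W + eta * dW, b + eta * db, c + eta * dc
```

One call performs the update W_ij^(k+1) = W_ij + ηΔw_ij^k (and likewise for the biases) for the whole parameter set at once.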
3b) the output of the 1st restricted Boltzmann machine is used as the input of the 2nd restricted Boltzmann machine of the deep belief network DBN; the 2nd restricted Boltzmann machine is pre-trained to obtain its output, and its weight matrix and biases are stored;
3c) setting the values in the weight matrix of the 3rd restricted Boltzmann machine to normally distributed random numbers with mean 0 and variance 0.1, and setting the hidden layer bias c to random numbers in [0, 1];
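The initialization of step 3c) can be sketched as follows. The patent specifies the weight distribution (mean 0, variance 0.1) and the hidden bias range [0, 1]; initializing the visible bias to zero is an added assumption, since the claim does not state it.

```python
import numpy as np

def init_rbm(n_v, n_h, rng):
    """Step 3c)-style initialization of one restricted Boltzmann machine.

    Weights: normal with mean 0 and variance 0.1 (std = sqrt(0.1)).
    Hidden bias c: uniform random numbers in [0, 1].
    Visible bias b: zeros (an assumption; not stated in the claim).
    """
    W = rng.normal(loc=0.0, scale=np.sqrt(0.1), size=(n_v, n_h))
    c = rng.random(n_h)
    b = np.zeros(n_v)
    return W, b, c
```

Note that "variance 0.1" means the standard deviation passed to the normal sampler is sqrt(0.1), not 0.1.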
4) according to the labeled category information in the ground-object category reference map of the SAR image, randomly selecting for each category a set of pixels with category labels;
5) using the back-propagation algorithm BP, performing supervised fine-tuning of the parameters of the trained deep belief network DBN with the category-labeled pixel set, to obtain the tuned deep belief network DBN;
6) classifying all pixels in the image to be classified one by one with the tuned deep belief network DBN to obtain a classification result;
7) marking the same color for each category on the classified SAR image according to the three primary colors red, green and blue, to obtain and output a colored classification result image.
2. The method of claim 1, wherein step 3b) pre-trains the 2nd restricted Boltzmann machine with the hidden layer output of the 1st restricted Boltzmann machine as the input of the 2nd restricted Boltzmann machine of the deep belief network DBN, implemented as follows:
(3b1) for a specific set of data (v^(1), h^(1)), the energy function of the second restricted Boltzmann machine in the network is:
[Formula image FDA0003513214460000031: energy function of the second restricted Boltzmann machine]
wherein v^(1) and h^(1) respectively represent the visible layer and the hidden layer of the restricted Boltzmann machine; W^(1) is the weight matrix connecting visible and hidden nodes, and W_ij^(1) is the element in row i and column j of the weight matrix; n_v is the number of visible layer nodes and n_h is the number of hidden layer nodes; v_i^(1) represents the state of the i-th visible layer node and h_j^(1) represents the state of the j-th hidden layer node; b_i^(1) and c_j^(1) are the biases of visible layer node v_i^(1) and hidden layer node h_j^(1), respectively;
(3b2) based on the energy function of (3b1), obtaining an activation value formula of the jth hidden node:
[Formula image FDA0003513214460000032: activation probability of the j-th hidden node]
the activation value formula gives, for a known visible layer state v^(0), the sampled probability that the j-th hidden layer node takes the value 1;
(3b3) calling the CD-k algorithm to perform k rounds of Gibbs sampling: first generate a random number R_j in [0, 1]; when P(h_j^(1) = 1 | v^(0)) > R_j, set h_j^(1) = 1, otherwise set h_j^(1) = 0; after sampling all hidden layer nodes, the hidden layer state h^(0) of the restricted Boltzmann machine at the k-th iteration is obtained, where k is set to 1;
(3b4) after the hidden layer state is obtained through sampling, the activation value formula for reconstructing the ith visible layer node is as follows:
[Formula image FDA0003513214460000041: activation probability of the i-th visible layer node]
the activation value formula gives, for a known hidden layer state h^(0), the sampled probability that the i-th visible layer node takes the value 1; after sampling all visible layer nodes, the visible layer state v^(1) is obtained;
(3b5) let the training sample set of the second restricted Boltzmann machine be:
[Formula image FDA0003513214460000042: training sample set of the second restricted Boltzmann machine]
wherein n_M is the number of training samples and v_m is the m-th training sample:
[Formula image FDA0003513214460000043]
(3b6) with the initialized parameter set θ^(1) of the second restricted Boltzmann machine known, obtaining the probability of the m-th training sample:
[Formula image FDA0003513214460000044: probability P(v_m | θ^(1)) of the m-th training sample]
wherein θ^(1) = {W_ij^(0), b_i^(0), c_j^(0)} and Z^(1) is the partition function of the second restricted Boltzmann machine;
(3b7) for P (v)m| θ) to Wij (1),bi (1),cj (1)The partial derivative of each parameter is obtained:
[Formula images FDA0003513214460000045, FDA0003513214460000046 and FDA0003513214460000047: gradients Δw_ij^(0), Δb_i^(0) and Δc_j^(0)]
wherein Δw_ij^(0) represents the gradient of the weight matrix of the second restricted Boltzmann machine, Δb_i^(0) represents the gradient of its visible layer bias, and Δc_j^(0) represents the gradient of its hidden layer bias;
(3b8) obtaining the trained parameters W_ij^*, b_i^* and c_j^* of the second restricted Boltzmann machine from the gradients obtained in (3b7):
W_ij^* = W_ij^(0) + λΔW_ij^(0)
b_i^* = b_i^(0) + λΔb_i^(0)
c_j^* = c_j^(0) + λΔc_j^(0)
wherein λ represents the learning rate of each parameter.
3. The method of claim 1, wherein step 5) uses the back-propagation algorithm BP to perform supervised fine-tuning of the parameters of the trained deep belief network DBN with the category-labeled pixel set, implemented as follows:
5a) forward propagation: the input matrix is propagated from the first restricted Boltzmann machine to the third restricted Boltzmann machine, i.e. the hidden layer output of each restricted Boltzmann machine serves as the visible layer input of the next, yielding the actual output of the hidden layer of the third restricted Boltzmann machine;
5b) backward propagation: the error between the actual output of step 5a) and the expected output is propagated backward layer by layer through the restricted Boltzmann machines, and the weights and biases of the trained deep belief network DBN are fine-tuned to obtain the fine-tuned deep belief network DBN, implemented as follows:
5b1) calculating the sensitivity of the jth node of the hidden layer of each restricted Boltzmann machine:
δ_j = o_j(1 − o_j)(d_j − o_j),
wherein o_j represents the actual output of the j-th node and d_j represents the expected output of the j-th node;
5b2) calculating the sensitivity of the hidden layer of the l-th restricted Boltzmann machine from the sensitivity of the j-th node of the hidden layer:
[Formula image FDA0003513214460000051: sensitivity δ_i^l of the hidden layer of the l-th restricted Boltzmann machine]
wherein δ_i^l represents the sensitivity of the hidden layer of the l-th restricted Boltzmann machine, y_i^l represents the actual output of the hidden layer of the l-th restricted Boltzmann machine, and δ_j^(l+1) represents the sensitivity of the hidden layer of the (l+1)-th restricted Boltzmann machine;
5b3) obtaining the fine-tuned weight matrix w_ij^l and bias c_j^l of each restricted Boltzmann machine from the sensitivity of its hidden layer:
w_ij^l = w_ij^l + ε × y_i^l δ_j^(l+1)
c_j^l = c_j^l + ε × δ_j^(l+1)
wherein ε represents the learning rate of the fine-tuning.
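The backward pass of steps 5b1)-5b3) can be sketched as one fine-tuning step over the stacked hidden layers. This is an illustrative sketch: acts[l] stands for the actual output y^l of RBM l's hidden layer, weights[l] connects hidden layer l to hidden layer l+1, and propagating sensitivities through the pre-update weights is a conventional back-propagation choice the claim does not spell out.

```python
import numpy as np

def finetune_step(weights, biases, acts, expected, eps=0.1):
    """Steps 5b1)-5b3): sensitivity back-propagation and parameter fine-tuning.

    acts[l]  : actual hidden-layer output y^l of RBM l (acts[-1] is the top output o)
    expected : expected top-layer output d
    eps      : learning rate of the fine-tuning
    """
    o = acts[-1]
    delta = o * (1.0 - o) * (expected - o)                 # 5b1): δ_j = o_j(1-o_j)(d_j-o_j)
    for l in range(len(weights) - 1, -1, -1):
        y = acts[l]
        W_old = weights[l].copy()                          # keep pre-update weights
        weights[l] = weights[l] + eps * np.outer(y, delta) # 5b3): w += ε y_i^l δ_j^(l+1)
        biases[l] = biases[l] + eps * delta                # 5b3): c += ε δ_j^(l+1)
        delta = y * (1.0 - y) * (W_old @ delta)            # 5b2): sensitivity of layer l
    return weights, biases
```

With eps = 0 the step leaves all parameters unchanged, which is a convenient sanity check.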
CN201810031937.3A 2018-01-12 2018-01-12 SAR image classification method based on SAR image statistical distribution and DBN Active CN108171200B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810031937.3A CN108171200B (en) 2018-01-12 2018-01-12 SAR image classification method based on SAR image statistical distribution and DBN


Publications (2)

Publication Number Publication Date
CN108171200A CN108171200A (en) 2018-06-15
CN108171200B true CN108171200B (en) 2022-05-17

Family

ID=62514696

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810031937.3A Active CN108171200B (en) 2018-01-12 2018-01-12 SAR image classification method based on SAR image statistical distribution and DBN

Country Status (1)

Country Link
CN (1) CN108171200B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109596942A (en) * 2018-12-06 2019-04-09 华北电力大学 A kind of voltage sag reason recognition methods based on depth confidence network
CN109507655B (en) * 2018-12-11 2022-09-02 西北工业大学 SAR target recognition algorithm based on guiding reconstruction and norm constraint DBN
CN113344046A (en) * 2021-05-24 2021-09-03 电子科技大学 Method for improving SAR image ship classification precision

Citations (5)

Publication number Priority date Publication date Assignee Title
CN104318245A (en) * 2014-10-20 2015-01-28 西安电子科技大学 Sparse depth network based polarization SAR (Synthetic Aperture Radar) image classification
CN105701503A (en) * 2016-01-07 2016-06-22 西安电子科技大学 Interferometric synthetic aperture radar (SAR) image classification method based on deep belief network (DBN) model
CN107132516A (en) * 2017-05-16 2017-09-05 电子科技大学 A kind of Radar range profile's target identification method based on depth confidence network
CN107407682A (en) * 2015-04-10 2017-11-28 社会福祉法人恩赐财团济生会 Differentiate the ill method of liver disease
CN107451565A (en) * 2017-08-01 2017-12-08 重庆大学 A kind of semi-supervised small sample deep learning image model classifying identification method

Family Cites Families (2)

Publication number Priority date Publication date Assignee Title
WO2012030872A1 (en) * 2010-09-02 2012-03-08 Edge3 Technologies Inc. Method and apparatus for confusion learning
US10650508B2 (en) * 2014-12-03 2020-05-12 Kla-Tencor Corporation Automatic defect classification without sampling and feature selection


Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
The Poisson Gamma Belief Network; Mingyuan Zhou et al.; NIPS 2015; 2015-12-31; pp. 1-9 *
Wishart RBM based DBN for polarimetric synthetic radar data classification; Yanhe Guo et al.; 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS); 2015-11-12; pp. 1841-1844 *
An improved deep belief network and its application to natural image classification; Chai Ruimin et al.; Computer Applications and Software; 2016-09-15; vol. 33, no. 09; section 1 and fig. 2 *
SAR image target recognition algorithm based on deep learning neural networks; Liang Xin et al.; Journal of Jianghan University (Natural Science Edition); 2016-07-08; vol. 44, no. 02; pp. 131-136 *
Statistical characteristics analysis of clutter in high-resolution SAR images; Gao Gui et al.; Signal Processing; 2008-08-15; no. 04; section 2 *



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant