CN112365554A - Compressed sensing image reconstruction method based on multi-scale residual error neural network - Google Patents
Compressed sensing image reconstruction method based on multi-scale residual error neural network
- Publication number: CN112365554A
- Application number: CN202011153555.1A
- Authority: CN (China)
- Legal status: Pending (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G06T11/00 — 2D [Two Dimensional] image generation
- G06N3/045 — Neural networks: combinations of networks
- G06N3/061 — Physical realisation of neural networks using biological neurons
- G06N3/084 — Learning methods: backpropagation, e.g. using gradient descent
Abstract
The invention belongs to the field of image processing and aims to maximize the reconstruction quality and reconstruction effect of an image. The compressive sampling model comprises a reshape layer and a fully connected layer. In the initial reconstruction model, one fully connected layer FC up-samples the observation vector y to obtain output data of the same size as x_v, and another reshape layer rearranges the output of the up-sampling process to form a preliminary reconstructed image x_ir. x_ir is then further processed with a deep multi-scale residual reconstruction model comprising multiple multi-scale residual blocks (MSRBs); the output features of each MSRB are mapped to the end of the network, where this feature information is concatenated with the output of the initial reconstruction model. The invention is mainly applied to image processing scenarios.
Description
Technical Field
The invention belongs to the field of image processing and relates to structural optimization of convolutional neural networks in deep learning, image compressive sampling, and inverse reconstruction. It specifically concerns a compressed sensing image reconstruction method based on a multi-scale residual neural network.
Background
Compressed Sensing (CS) theory is a promising emerging technology which shows that, when a signal is sparse in some transform domain, the original signal can be reconstructed with high probability from fewer samples than the Nyquist sampling theorem requires [1,2]. In addition, CS theory performs sampling and compression of a signal simultaneously, which helps relieve hardware pressure in data acquisition, storage, and transmission. Owing to these advantages, CS theory is widely used in many practical applications such as medical image scanners [3], single-pixel cameras [4], and cognitive radio communication [5].
To fully exploit the potential of CS theory, many CS-based image reconstruction methods have been proposed in the last decade. They can be roughly divided into two types: conventional optimization-based methods and more recent methods based on convolutional neural networks (CNNs). To better analyze these methods and show the advantages of ours, we first state compressed sensing and its inverse reconstruction in mathematical form: the goal is to reconstruct the original signal x ∈ R^N from the compressed observation y = φx ∈ R^M, where φ ∈ R^(M×N) is called the measurement matrix and implements the compressive sampling of the original signal x, and y ∈ R^M is the observation vector obtained by compressively sampling x according to CS theory. Since M << N, reconstructing x from y is a highly ill-posed problem.
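As a concrete illustration (not part of the patent text), the compressive sampling y = φx can be sketched in a few lines of NumPy. The dimensions below (a 33×33 block, measurement rate 0.25) follow the patch size and measurement rates used later in the document, and the random Gaussian φ stands in for the conventional measurement matrix:

```python
import numpy as np

rng = np.random.default_rng(0)

N = 33 * 33              # length of a vectorized 33x33 image block
alpha = 0.25             # measurement rate (compressive sampling rate)
M = int(alpha * N)       # number of measurements, M << N

x = rng.standard_normal(N)           # original signal x in R^N
phi = rng.standard_normal((M, N))    # random Gaussian measurement matrix, R^(M x N)

y = phi @ x                          # compressed observation y = phi x in R^M

# The linear system phi x = y has fewer equations (M) than unknowns (N),
# which is why recovering x from y alone is an ill-posed problem.
```

The sketch makes the ill-posedness concrete: with M = 272 measurements of N = 1089 unknowns, infinitely many signals are consistent with y, and a reconstruction method must supply the missing prior.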
For this problem, most conventional CS image reconstruction methods [6,7,8,9] assume that the original image signal is structurally sparse in some transform domain and then apply iterative computation to solve an optimization problem. However, real-world natural images are not exactly sparse in their transform domains, which limits the reconstruction performance of these algorithms. Moreover, the iterative computation has high complexity and takes a long time to execute a reconstruction task, which also limits real-time application of image CS theory. Recently, inspired by the application of CNNs to image super-resolution reconstruction [10], researchers have developed a new class of CNN-based CS image reconstruction methods. Kulkarni et al. first proposed ReconNet [11], a simple convolutional neural network that recovers the original image from the observation vector. Yao et al. [12] further refined the reconstruction network structure to form DR2-Net [12], which introduces a fully connected layer as a linear mapping model and a residual network as the reconstruction model. In [13], Zhang et al. proposed a CNN model called ISTA-Net that appropriately fuses the structural advantages of conventional iterative algorithms with the reconstruction-speed advantages of CNN methods.
Disclosure of Invention
To overcome the shortcomings of the prior art, the invention aims to achieve efficient, high-quality reconstruction and restoration of compressed sensing images through a multi-scale residual neural network, maximizing reconstruction quality and reconstruction effect while keeping the reconstruction time cost under control. The technical scheme adopted by the invention, a compressed sensing image reconstruction method based on a multi-scale residual neural network, comprises the following steps: the compressive sampling model contains a reshape layer that transforms an input image block x_p of size n×n into a single-column vector x_v of size n²×1; a fully connected layer then compressively samples x_v and generates the image observation vector y of size α×n², where α denotes the measurement rate, i.e. the compressive sampling rate. The output of the compressive sampling model is fed into the initial reconstruction model, in which a fully connected layer FC containing n² neurons up-samples the observation vector y to obtain output data of the same size as x_v, and another reshape layer rearranges the output of the up-sampling process to form a preliminary reconstructed image x_ir. x_ir is then further processed by a deep multi-scale residual reconstruction model comprising multiple multi-scale residual blocks (MSRBs); the output features of each MSRB are mapped to the end of the network, where this feature information is concatenated with the output of the initial reconstruction model.
Compressive sampling model:
The input image block x_p is of size n×n. Denote the reshaping operation of the reshape layer in the compressive sampling model by f_re1; the reshape layer changes the size of the input image block while preserving the value of each pixel. The output of the reshape layer is then expressed as:
x_v = f_re1(x_p)   (1)
where x_v is a single-column vector of size n²×1. Next, a fully connected layer containing α×n² neurons is added after x_v, where α denotes the measurement rate. The output of the fully connected layer is y, and the relationship between y and x_v is expressed as:
y = f_full1(x_v)   (2)
where f_full1 denotes the fully connected operation.
This fully connected layer performs a function similar to a conventional random Gaussian measurement matrix: each of the α×n² neurons in the layer holds a weight vector of the same size as x_v, whose values are learned adaptively by training the network. The output of each neuron is the inner product of x_v and its weight vector, so the output of the whole layer of α×n² neurons can be represented as the product of x_v with a weight matrix of size (α×n²)×n² formed from the α×n² weight vectors.
The output of the compressive sampling model is sent to the initial reconstruction model, whose output is denoted x_ir; the relationship between y and x_ir can be expressed as:
x_ir = f_re2(f_full2(y))   (3)
where f_full2 denotes the operation of a fully connected layer, which can be seen as simulating the linear mapping of the initial reconstruction from y to the original signal x_v, and f_re2 denotes the reshaping operation of the reshape layer.
Multi-scale residual reconstruction module (MSRB):
The MSRB contains three convolution channels; each channel extracts information at one scale from the input feature map using a convolution kernel of fixed size, and the image information extracted by the three channels is further shared and merged.
The operations of the MSRB unit are formulated as follows:
where F denotes the output of a convolutional layer (Conv Layer) in the MSRB unit, w and b denote the weight and bias of that layer, the superscript denotes the position of the layer within the unit, and the subscript denotes the size of its convolution kernel. M_{i-1} and M_i denote the input and output of the i-th MSRB unit, [·] denotes a channel concatenation operation, and max(0, x) denotes the ReLU [16] function. w_o and b_o denote the weight and bias of the last convolutional layer of the MSRB unit.
Assuming the MSRNet contains N MSRB units and the weight and bias of the final convolutional layer are w_f and b_f, the final reconstruction result x_fr of the MSRNet is expressed as:
x_fr = max(0, w_f * [M_1, …, M_N] + b_f) + x_ir   (11)
training and execution of MSRNet network:
for the training dataset, we use the same document [10,11 ]]The same dataset, comprising 91 images, these original images are rotated 90, 180, 270 to expand and enhance the dataset, 33 x 33 image patches are extracted from the original images by tiling, the tiling step size is set to 14, a network training set is made, for the training strategy, the mean square error MSE is used as the loss function to minimize the predicted loss of the network, and Adam is used as the network optimization algorithm to train the network, the initial learning rate is set to 0.001, the learning rate is reduced by a factor of 10 every 5 ten thousand, the learning rate is less than 10-5The network training process is terminated.
The invention has the characteristics and beneficial effects that:
the end-to-end multi-scale residual error reconstruction network provided by the method realizes the reverse reconstruction process of the compressed sensing image. Through training of the convolutional neural network based on end-to-end optimization, the difficulty of generating a compression measurement matrix is greatly reduced, and meanwhile, the reconstruction precision of the network is improved. In addition, multi-scale residual error learning is introduced, the learning capability of multi-scale information and characteristics of the network is improved, better image reconstruction quality is facilitated, and meanwhile, lower image reconstruction time cost is kept. Experiments on a standard image test set show that the MSRNET is an efficient and excellent compressed sensing image reconstruction system.
Description of the drawings:
FIG. 1 shows the PSNR values and time cost of reconstructing the "Parrots" image with different algorithms at a compressed measurement rate of 10%. The reconstruction results of our method are 5.61 dB and 4.30 dB higher than ReconNet [11] and DR2-Net [12], respectively, while the time cost remains well controlled.
FIG. 2: image compressive sensing and inverse reconstruction process based on the multi-scale residual neural network
FIG. 3: structure of the multi-scale residual block (MSRB)
FIG. 4: the first set of images shows reconstruction results at a measurement rate of 25%; the second set shows reconstruction results at a measurement rate of 4%.
Detailed Description
For existing CNN-based models, they are typically able to complete the reconstruction task very quickly due to the strong learning capabilities of CNNs. However, these existing CNN-based methods often use simple and basic network frameworks such as residual networks or deep networks to form a reconstruction model, and cannot fully utilize and mine the learning and mapping capabilities of complex convolutional networks. In order to improve the learning capability of the reconstruction network and further improve the recovery performance of the compressed sensing image, a new multi-scale residual error reconstruction network MSRNet is proposed. In MSRNet, we construct a multi-scale residual block (MSRB) as the basic block for reconstructing the network. There are three parallel channels of convolution kernels of different sizes in one MSRB, each of which can be used to extract one scale information from the input feature map. In addition, the information extracted by the current channel is shared with other channels, so that the network learning capability of the characteristic information with different scales is obviously improved. In addition, the MSRNet also introduces jump connection and residual learning, and the prediction accuracy of the network is improved. Another contribution of MSRNet is the measurement matrix. In previous CNN-based reconstruction methods, the corresponding compressed perceptual observation vector y was generated by compression sampling the original image x with a random measurement matrix of appropriate size, which was artificially generated before training the network. However, in MSRNet, we use a compressive sampling network to implement the compressive sampling process, and this model can perform parameter update and learning by training the network, which has two advantages: (1) generating a matrix of random nature is very difficult for hardware. 
By training the MSRNet, a deterministic measurement matrix can be generated, which is self-learned by the network and is easy to implement in hardware. (2) By integrating the compressive sampling and reverse recovery processes into one network, we actually build and optimize a complete end-to-end CNN, rather than optimizing the image compressive sensing and reconstruction components separately.
Overall, the main contributions of our MSRNet are threefold: (1) Multi-scale learning: through the multi-scale convolution channels, the network can extract and exploit feature information at different scales and has stronger learning ability. (2) End-to-end optimization: a complete end-to-end network simulates image compression and inverse reconstruction, avoiding difficulties in network optimization and hardware implementation. (3) Accuracy and time complexity: as shown in FIG. 1, our MSRNet achieves a significant improvement in reconstruction performance on the standard image set with low time complexity.
As an end-to-end network that simulates the process of image compressed sensing and reverse reconstruction, we designed MSRNet to include three sub-models: the device comprises a compression sampling model, an initial reconstruction model and a deep multi-scale residual error reconstruction model.
The method comprises the following specific steps:
as shown in fig. 2, the three submodels are cascaded and integrated to form a complete MSRNet. In MSRNet, the compressed sampling model comprises a reshaping (reshape) layer for transforming the size of an input image block xp of size n × n, resulting in a single-column vector x of size n2 × 1vThen one full connection layer is used for pair vector xvPerforming a compressive sampling process and generating a corresponding size of α × n2And (α represents the measurement rate, i.e., the compressed sampling rate) of the image observation vector y. The output of the compressed sample model is then fed into an initial reconstruction model, where a fully-connected layer (FC) contains n2Neuron) for up-sampling the observation vector y to obtain the sum xvWith the same size of the output data, another reshaping (reshape) layer rearranges the output of the upsampling process to form a preliminary reconstructed image xir. Considering the initialThe reconstructed image xir is not ideal in terms of PSNR values, human vision and the like, and the performance and visual quality of the reconstructed image are further improved by using a deep multi-scale residual reconstruction model including a plurality of multi-scale residual blocks (MSRB). In addition, the output characteristic mapping of each MSRB is sent to the end of the network by using skip connection, and the characteristic information is connected with the output of the initial reconstruction model and sent to the end of the network, so that the reconstruction performance of the network is improved.
We now analyze and present the compressive sampling and deep multi-scale residual reconstruction modules in detail.
1. Compressive sampling model in MSRNet
In previous methods, researchers have implemented sampling operations using a random gaussian matrix as the measurement matrix phi. Although this is an effective compressive sampling measurement matrix, it is difficult to implement a random matrix in practical applications. In MSRNet, we propose a compressed sampling model to replace the random gaussian matrix.
Assume the input image block x_p is of size n×n and denote the reshaping operation of the reshape layer in the compressive sampling model by f_re1. The reshape layer changes the size of the input image block while preserving the value of each pixel, so its output can be expressed as:
x_v = f_re1(x_p)   (1)
where x_v is a single-column vector of size n²×1. Next, a fully connected layer containing α×n² neurons (α denotes the measurement rate) is added after x_v. Assuming the output of the fully connected layer is y, the relationship between y and x_v can be expressed as:
y = f_full1(x_v)   (2)
where f_full1 denotes the fully connected operation.
This fully connected layer performs a function similar to a conventional random Gaussian measurement matrix: each of the α×n² neurons in the layer holds a weight vector of the same size as x_v. The values of the weight vectors are learned adaptively by training the network, and the output of each neuron is the inner product of x_v and its weight vector. The output of the whole layer of α×n² neurons can therefore be represented as the product of x_v with a weight matrix formed from the α×n² weight vectors. This weight matrix is of size (α×n²)×n², the same as a conventional measurement matrix. In MSRNet, however, the weight matrix is generated by training the network end to end, which is more advantageous in terms of hardware implementation and compressive sampling efficiency.
The output of the compressive sampling model is sent to the initial reconstruction model, whose output can be denoted x_ir; the relationship between y and x_ir can be expressed as:
x_ir = f_re2(f_full2(y))   (3)
where f_full2 denotes the operation of a fully connected layer, which can be seen as simulating the linear mapping of the initial reconstruction from y to the original signal x_v, and f_re2 denotes the reshaping operation of the reshape layer.
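The pipeline of equations (1)-(3) can be sketched as follows. This is an illustrative NumPy stand-in, not the patent's implementation: the weight matrices here are random placeholders, whereas in MSRNet they are learned end to end.

```python
import numpy as np

rng = np.random.default_rng(0)

n = 33                      # block side length
alpha = 0.25                # measurement rate
m = int(alpha * n * n)      # number of neurons in the sampling FC layer

# Random stand-ins for the trained FC weights.
W_full1 = rng.standard_normal((m, n * n)) * 0.01  # plays the role of the measurement matrix
W_full2 = rng.standard_normal((n * n, m)) * 0.01  # linear mapping back to n^2 values

def compressive_sample(xp):
    """Eqs. (1)-(2): reshape the n x n block to a column vector, then apply the FC layer."""
    xv = xp.reshape(n * n)          # f_re1
    y = W_full1 @ xv                # f_full1: inner products with m learned weight vectors
    return y

def initial_reconstruction(y):
    """Eq. (3): FC up-sampling back to n^2 values, then reshape to an n x n image."""
    return (W_full2 @ y).reshape(n, n)   # f_re2(f_full2(y))

xp = rng.random((n, n))          # input image block x_p
y = compressive_sample(xp)       # observation vector, size alpha * n^2
x_ir = initial_reconstruction(y) # preliminary reconstructed image
```

Note that the two fully connected layers are plain matrix products, which is exactly why the sampling layer can replace a hand-crafted random Gaussian measurement matrix once its weights are trained.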
2. Multi-scale residual reconstruction module in MSRNet
In MSRNet, we construct the multi-scale residual block (MSRB) as the basic building block of the deep reconstruction model. As shown in FIG. 2, there are three convolution channels in the MSRB, and each channel extracts information at one scale from the input feature map using a convolution kernel of fixed size. In addition, the image information extracted by the three channels is further shared and fused, improving the network's ability to learn and express features at different scales.
The operations of the MSRB unit in FIG. 2 can be formulated as follows:
where F denotes the output of a convolutional layer (Conv Layer) in the MSRB unit, w and b denote the weight and bias of that layer, the superscript denotes the position of the layer within the unit, and the subscript denotes the size of its convolution kernel. M_{i-1} and M_i denote the input and output of the i-th MSRB unit, [·] denotes a channel concatenation (concat) operation, and max(0, x) denotes the ReLU [16] function. w_o and b_o denote the weight and bias of the last convolutional layer of the MSRB unit.
Assuming the MSRNet contains N MSRB units and the weight and bias of the final convolutional layer are w_f and b_f, the final reconstruction result of MSRNet (denoted x_fr) can be expressed as:
x_fr = max(0, w_f * [M_1, …, M_N] + b_f) + x_ir   (11)
training and execution details for a three, MSRNet network
For the training dataset we use the same dataset as [10,11], comprising 91 images. Inspired by [10], we rotate these original images by 90°, 180°, and 270° to augment the dataset. Following common practice in existing CNN-based compressed sensing methods, 33×33 image patches are extracted from the original images with a tiling stride of 14 to build the training set. For the training strategy, we use the mean square error (MSE) as the loss function to minimize the network's prediction loss and train with Adam [17] as the optimization algorithm. The initial learning rate is set to 0.001 and is divided by 10 every 50,000 iterations; when the learning rate falls below 10⁻⁵, training is terminated.
In addition, the number of convolution kernels in each convolutional layer of MSRNet is set to 64, except for the last convolutional layer, which is set to 1 to form the final reconstructed image. The batch size of image patches fed into network training is set to 64. All experiments were run on a Titan X GPU with 12 GB of memory using the deep learning framework Caffe [18].
Like other compressed sensing methods, we use Set11 [11] as the test dataset and test the reconstruction performance of the different methods at four measurement rates (MR = 0.25, 0.1, 0.04, 0.01). Considering the balance between performance and computational complexity, our experiments use an MSRNet containing 2 MSRBs. We compare our approach with the most advanced existing methods, including TVAL3 [7], D-AMP [14], SDA [15], ReconNet [11], DR2-Net [12], and ISTA-Net [13]. The average PSNR results on the Set11 [11] test set are reported in Table 1:
table 1: PSNR values of reconstruction results of different methods at different measurement rates (Set11 test Set)
As can be seen from Table 1, our MSRNet significantly improves reconstruction performance over the previous best methods; for example, at MR = 0.25 our method is 7.82 dB and 4.70 dB higher than the CNN-based ReconNet [11] and DR2-Net [12], respectively. At MR = 0.01 our method improves PSNR by 2.78 dB over ISTA-Net [13]. MSRNet is also robust: even at very small compressed measurement rates it achieves the best inverse reconstruction of compressed sensing images currently available. More example images reconstructed at different measurement rates are shown in FIG. 4. Compared with other known methods, our method produces detailed texture and edges in the reconstructed image with better visual effect.
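PSNR, the metric behind Table 1 and the dB comparisons above, is a simple function of the mean square error; a standard definition (assuming an 8-bit peak value of 255) is:

```python
import numpy as np

def psnr(reference, reconstruction, peak=255.0):
    """Peak signal-to-noise ratio in dB between a reference image and its
    reconstruction: 10 * log10(peak^2 / MSE)."""
    ref = np.asarray(reference, dtype=np.float64)
    rec = np.asarray(reconstruction, dtype=np.float64)
    mse = np.mean((ref - rec) ** 2)
    if mse == 0.0:
        return float("inf")   # identical images
    return 10.0 * np.log10(peak ** 2 / mse)
```

Since PSNR is logarithmic, a 1 dB gain corresponds to roughly a 21% reduction in MSE, which is why margins of several dB are substantial.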
For the time complexity of the algorithms, we use a single 256×256 image as the test image to measure the reconstruction time of the different methods; the results are shown in Table 2.
Table 2: different methods realize the time required by the reconstruction of a single frame (256X256) image under different measurement rates
For a fair comparison, we ran all tests on devices of similar performance. From Table 2 we see that our method is much faster than the conventional image compressed sensing reconstruction algorithms TVAL3 [7] and D-AMP [14]. SDA [15], as a non-iterative deep-learning-based algorithm, has a large speed advantage over the other CNN-based compressed sensing image reconstruction algorithms. Although our method is 5-6 times slower than ReconNet [11] and 2 times slower than DR2-Net [12], the time complexity of MSRNet remains acceptable and meets the requirements of practical applications; this limited increase in time complexity is acceptable given the large advantage in reconstruction performance.
Reference to the literature
[1]E.J.Candes,T.Tao,“Near-optimal signal recovery from random projections:Universal encoding strategies?,”IEEE Transactions on Information Theory,52(12):5406-5425,2006.
[2]M.F.Duarte,M.A.Davenport,D.Takbar,J.N.Laska,T.Sun,K.F.Kelly,and R.G.Baraniuk,“Single-pixel imaging via compressive sampling,”IEEE Signal Processing Magazine,25(2):83–91,2008.
[3]M.Lustig,D.Donoho,and J.M.Pauly,“Sparse mri:The application of compressed sensing for rapid mr imaging,”Magnetic Resonance in Medicine,58(6):1182–1195,2007.
[4]F.Rousset,N.Ducros,A.Farina,G.Valentini,C.D'Andrea,F.Peyrin,“Adaptive basis scan by wavelet prediction for single-pixel imaging,”IEEE Transactions on Computational Imaging,3(1):36–46,2017.
[5]S.K.Sharma,E.Lagunas,S.Chatzinotas,and B.Ottersten,“Application of compressive sensing in cognitive radio communications:A survey,”IEEE Communication Surveys&Tutorials,18(3):1838–1860,2016.
[6]Y.Kim,M.S.Nadar,and A.Bilgin,“Compressed sensing using a gaussian scale mixtures model in wavelet domain,”In ICIP,IEEE,pages 3365–3368.,2010.
[7]C.Li,W.Yin,H.Jiang,and Y.Zhang,“An efficient augmented lagrangian method with applications to total variation minimization,”Computational Optimization and Applications,56(3):507–530,2013.
[8]C.A.Metzler,A.Maleki,and R.G.Baraniuk,“From denoising to compressed sensing,”IEEE Transactions on Information Theory,62(9):5117–5144,2016.
[9]J.Zhang,D.Zhao,and W.Gao,“Group-based sparse representation for image restoration,”IEEE Transactions on Image Processing,23(8):3336–3351,2014.
[10]C.Dong,C.C.Loy,K.He,and X.Tang,“Learning a deep convolutional network for image superresolution,”In Euro.Conf.Comp.Vision,Springer,2014,pages.184-199.
[11]K.Kulkarni,S.Lohit,P.Turaga,R.Kerviche,and A.Ashok,“Reconnet:Non-iterative reconstruction of images from compressively sensed random measurements,”CVPR,2016.
[12]H.Yao,F.Dai,D.Zhang,Y.Ma and S.Zhang,“DR2-Net:deep residual reconstruction network for image compressive sensing,”IEEE Conference on Computer Vision and Pattern Recognition,2017.
[13]J.Zhang,B.Ghanem,“ISTA-Net:Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing,”CVPR,2018.
[14]C.A.Metzler,A.Maleki,and R.G.Baraniuk,“From denois-ing to compressed sensing,”IEEE Trans.Information Theory,2016,62(9):5117–5144.
[15]A.Mousavi,A.B.Patel,and R.G.Baraniuk,“A deep learning approach to structured signal recovery,”in 2015 53rd Annual Allerton Conference on Communication,Control,and Computing,IEEE,2015,pp.1336–1343.
[16]X.Glorot,A.Bordes,and Y.Bengio,“Deep sparse rectifier neural networks,”In AISTATS,2011.3.
[17]D.Kingma,J.Ba.Adam,“A method for stochastic optimization,”ICLR,2014.5.
[18]Y.Jia,E.Shelhamer,J.Donahue,S.Karayev,J.Long,R.Girshick,S.Guadarrama,and T.Darrell,“Caffe:Convolutional architecture for fast feature embedding,”arXiv:1408.5093,2014.5。
Claims (5)
1. A compressed sensing image reconstruction system based on a multi-scale residual neural network, characterized by comprising: a compressed sampling model containing a reshape layer for size-transforming an input image block x_p of size n × n into a single column vector x_v of size n^2 × 1; a fully connected layer then performs compressed sampling on the vector x_v and generates a corresponding image observation vector y of size (α × n^2) × 1, where α denotes the measurement rate, i.e. the compressed sampling rate. The output of the compressed sampling model is fed into an initial reconstruction model, in which a fully connected layer FC containing n^2 neurons up-samples the observation vector y to obtain a vector of the same size as x_v; another reshape layer rearranges the output of the up-sampling process to form a preliminary reconstructed image x_ir. A deep multi-scale residual reconstruction model containing multiple multi-scale residual blocks (MSRBs) further processes x_ir; the output features of each MSRB are mapped to the end of the network, where they are concatenated with the output of the initial reconstruction model.
2. The compressed sensing image reconstruction system based on a multi-scale residual neural network of claim 1, wherein in the compressed sampling model:
the input image block x_p is n × n in size, and the shaping operation of the reshape layer in the compressed sampling model is f_re1; the reshape layer reshapes the input image block while preserving the value of each pixel, and its output is then expressed as:
x_v = f_re1(x_p)   (1)
where x_v is a single column vector of size n^2 × 1. Next, a fully connected layer containing α × n^2 neurons is added to compressively sample x_v, where α denotes the measurement rate; the output of the fully connected layer is y, and the relationship between y and x_v is expressed as:
y = f_full1(x_v)   (2)
where f_full1 denotes the fully connected operation;
this fully connected layer performs a function similar to a conventional random Gaussian measurement matrix: each of the α × n^2 neurons in the layer generates a weight vector of the same size as the vector x_v, and the network is trained to adaptively learn the values of these weight vectors; the output of each neuron is then the inner product of the vector x_v and its weight vector, so the output of the α × n^2 neurons can be represented by the inner product of the vector x_v and a weight matrix of size (α × n^2) × n^2 formed from the α × n^2 weight vectors;
the output of the compressed sampling model is sent to the subsequent initial reconstruction model, whose output is denoted x_ir; the relationship between y and x_ir can then be expressed as:
x_ir = f_re2(f_full2(y))   (3)
where f_full2 denotes the operation of a fully connected layer, which can be seen as simulating the linear mapping from y to the original signal x_v in the initial reconstruction process, and f_re2 denotes the shaping operation of the reshape layer.
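The sampling and initial-reconstruction pipeline of claims 1-2 reduces to two matrix products between reshapes. A minimal numpy sketch, where random matrices stand in for the trained fully connected weights (the names `W_sample` and `W_up` are illustrative, not from the patent):

```python
import numpy as np

n = 33            # image block size (33 x 33 patches, per claim 5)
alpha = 0.25      # measurement rate

rng = np.random.default_rng(0)

# f_re1: reshape the n x n block into an n^2 x 1 column vector
x_p = rng.random((n, n))
x_v = x_p.reshape(-1, 1)                      # (n^2, 1) = (1089, 1)

m = int(alpha * n * n)                        # number of measurements

# f_full1: the fully connected sampling layer is an (alpha*n^2) x n^2
# weight matrix; a random matrix stands in for the learned weights
W_sample = rng.standard_normal((m, n * n))
y = W_sample @ x_v                            # observation vector, (m, 1)

# f_full2: fully connected up-sampling layer with n^2 neurons,
# a learned linear mapping from y back toward x_v
W_up = rng.standard_normal((n * n, m))
x_up = W_up @ y                               # (n^2, 1)

# f_re2: reshape back into an n x n preliminary reconstruction x_ir
x_ir = x_up.reshape(n, n)

print(x_v.shape, y.shape, x_ir.shape)         # (1089, 1) (272, 1) (33, 33)
```

In the patent the two weight matrices are learned jointly end to end, which is what distinguishes this layer from a fixed random Gaussian measurement matrix.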
3. The compressed sensing image reconstruction system based on a multi-scale residual neural network of claim 1, wherein in the multi-scale residual block (MSRB): the MSRB contains three convolution channels, each channel extracting information at one scale from the input feature map using a convolution kernel of a fixed size, and the image information extracted by the three channels is further shared and merged.
4. The system of claim 3, wherein the MSRB unit is expressed by the following formulas:
where F denotes the output of a convolutional layer (Conv Layer) in the MSRB unit, w and b denote the weight and bias of the convolutional layer, respectively, the superscript denotes the position of the convolutional layer in the MSRB unit, and the subscript denotes the size of the convolution kernel in that layer; M_{i-1} and M_i represent the input and output of the i-th MSRB unit; [·] denotes the channel concatenation operation, and max(0, x) denotes the ReLU [16] function; w_o and b_o respectively denote the weight and bias of the last convolutional layer of the MSRB unit;
assuming the number of MSRB units in the MSRNet is N, and the weight and bias of the final convolutional layer are w_f and b_f, the final reconstruction result x_fr of the MSRNet is expressed as:
x_fr = max(0, w_f * [M_1, ..., M_N] + b_f) + x_ir   (11).
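Equation (11) amounts to one fusion convolution over the concatenated MSRB outputs followed by a global residual connection to the initial reconstruction. A numpy sketch with a 1×1-style fusion, where random values stand in for the learned w_f and b_f and for the MSRB feature maps:

```python
import numpy as np

rng = np.random.default_rng(2)
H = W = 8
N = 3                                  # number of MSRB units

# stand-ins for the MSRB outputs M_1..M_N and the initial reconstruction
M = rng.random((N, H, W))              # channel concatenation [M_1, ..., M_N]
x_ir = rng.random((H, W))

# 1x1-convolution-like fusion: per-channel weights w_f and bias b_f
w_f = rng.standard_normal(N)
b_f = 0.1

# x_fr = max(0, w_f * [M_1, ..., M_N] + b_f) + x_ir   (Eq. 11)
fused = np.tensordot(w_f, M, axes=(0, 0)) + b_f      # (H, W)
x_fr = np.maximum(0.0, fused) + x_ir

print(x_fr.shape)    # (8, 8)
```

Because the ReLU output is non-negative, the final result can only add detail on top of x_ir, never subtract the initial reconstruction away, which matches the residual-learning reading of Eq. (11).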
5. The compressed sensing image reconstruction system based on a multi-scale residual neural network of claim 1, wherein the MSRNet network is trained as follows: the training dataset comprises 91 images; the original images are rotated by 90°, 180°, and 270° to augment and enhance the dataset, and 33 × 33 image patches are extracted from the original images with a sliding step of 14 to build the network training set. For the training strategy, the mean square error (MSE) is used as the loss function to minimize the prediction loss of the network, and the Adam optimization algorithm is used to train the network; the initial learning rate is set to 0.001 and is reduced by a factor of 10 every 50,000 iterations, and when the learning rate falls below 10^-5 the training process is terminated.
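The patch extraction and step-decay schedule of claim 5 can be checked numerically. The 321 × 481 image size below is only an illustrative example, not a size stated in the patent:

```python
# patch positions along one axis for a 33-pixel patch with stride 14
def num_positions(length, patch=33, stride=14):
    return (length - patch) // stride + 1

# example for a hypothetical 321 x 481 training image
rows, cols = num_positions(321), num_positions(481)
print(rows * cols)               # 693 patches from one such image

# step-decay learning-rate schedule from claim 5:
# start at 0.001, divide by 10 every 50,000 iterations
def learning_rate(iteration, base_lr=0.001, drop_every=50_000, factor=10.0):
    return base_lr / factor ** (iteration // drop_every)

lr = learning_rate(120_000)            # two drops have occurred
print(lr)                              # 1e-05
stop = learning_rate(150_000) < 1e-5   # third drop goes below the threshold
print(stop)                            # True
```

Under this schedule, training therefore runs for at most 150,000 iterations before the termination condition triggers.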
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011153555.1A CN112365554A (en) | 2020-10-26 | 2020-10-26 | Compressed sensing image reconstruction method based on multi-scale residual error neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112365554A true CN112365554A (en) | 2021-02-12 |
Family
ID=74512162
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011153555.1A Pending CN112365554A (en) | 2020-10-26 | 2020-10-26 | Compressed sensing image reconstruction method based on multi-scale residual error neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112365554A (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113034414A (en) * | 2021-03-22 | 2021-06-25 | 上海交通大学 | Image reconstruction method, system, device and storage medium |
CN113077044A (en) * | 2021-03-18 | 2021-07-06 | 北京工业大学 | General lossless compression and acceleration method for convolutional neural network |
CN113658282A (en) * | 2021-06-25 | 2021-11-16 | 陕西尚品信息科技有限公司 | Image compression and decompression method and device |
CN113688747A (en) * | 2021-08-27 | 2021-11-23 | 国网浙江省电力有限公司双创中心 | Method, system, device and storage medium for detecting personnel target in image |
CN114066825A (en) * | 2021-10-29 | 2022-02-18 | 浙江工商大学 | Improved complex texture image flaw detection method based on deep learning |
CN114494569A (en) * | 2022-01-27 | 2022-05-13 | 光线云(杭州)科技有限公司 | Cloud rendering method and device based on lightweight neural network and residual streaming transmission |
CN115330635A (en) * | 2022-08-25 | 2022-11-11 | 苏州大学 | Image compression artifact removing method and device and storage medium |
CN116468812A (en) * | 2023-05-16 | 2023-07-21 | 山东省计算中心(国家超级计算济南中心) | Image compressed sensing reconstruction method and system based on multiple branches and multiple scales |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107358576A (en) * | 2017-06-24 | 2017-11-17 | 天津大学 | Depth map super resolution ratio reconstruction method based on convolutional neural networks |
CN109741260A (en) * | 2018-12-29 | 2019-05-10 | 天津大学 | A kind of efficient super-resolution method based on depth back projection network |
Non-Patent Citations (1)
Title |
---|
RENHE LIU 等: "AN END-TO-END MULTI-SCALE RESIDUAL RECONSTRUCTION NETWORK FOR IMAGE COMPRESSIVE SENSING", 《ICIP 2019》 * |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113077044A (en) * | 2021-03-18 | 2021-07-06 | 北京工业大学 | General lossless compression and acceleration method for convolutional neural network |
CN113034414B (en) * | 2021-03-22 | 2022-11-11 | 上海交通大学 | Image reconstruction method, system, device and storage medium |
CN113034414A (en) * | 2021-03-22 | 2021-06-25 | 上海交通大学 | Image reconstruction method, system, device and storage medium |
CN113658282A (en) * | 2021-06-25 | 2021-11-16 | 陕西尚品信息科技有限公司 | Image compression and decompression method and device |
CN113688747A (en) * | 2021-08-27 | 2021-11-23 | 国网浙江省电力有限公司双创中心 | Method, system, device and storage medium for detecting personnel target in image |
CN113688747B (en) * | 2021-08-27 | 2024-04-09 | 国网浙江省电力有限公司双创中心 | Method, system, device and storage medium for detecting personnel target in image |
CN114066825A (en) * | 2021-10-29 | 2022-02-18 | 浙江工商大学 | Improved complex texture image flaw detection method based on deep learning |
CN114066825B (en) * | 2021-10-29 | 2024-05-28 | 浙江工商大学 | Improved complex texture image flaw detection method based on deep learning |
CN114494569A (en) * | 2022-01-27 | 2022-05-13 | 光线云(杭州)科技有限公司 | Cloud rendering method and device based on lightweight neural network and residual streaming transmission |
CN114494569B (en) * | 2022-01-27 | 2023-09-19 | 光线云(杭州)科技有限公司 | Cloud rendering method and device based on lightweight neural network and residual streaming |
CN115330635A (en) * | 2022-08-25 | 2022-11-11 | 苏州大学 | Image compression artifact removing method and device and storage medium |
CN115330635B (en) * | 2022-08-25 | 2023-08-15 | 苏州大学 | Image compression artifact removing method, device and storage medium |
CN116468812A (en) * | 2023-05-16 | 2023-07-21 | 山东省计算中心(国家超级计算济南中心) | Image compressed sensing reconstruction method and system based on multiple branches and multiple scales |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112365554A (en) | Compressed sensing image reconstruction method based on multi-scale residual error neural network | |
Guo et al. | Deep wavelet prediction for image super-resolution | |
Wang et al. | Ultra-dense GAN for satellite imagery super-resolution | |
CN111932461B (en) | Self-learning image super-resolution reconstruction method and system based on convolutional neural network | |
Chen et al. | Content-aware scalable deep compressed sensing | |
Zhu et al. | Efficient single image super-resolution via hybrid residual feature learning with compact back-projection network | |
CN104199627B (en) | Gradable video encoding system based on multiple dimensioned online dictionary learning | |
CN107123094B (en) | Video denoising method mixing Poisson, Gaussian and impulse noise | |
CN110111256A (en) | Image Super-resolution Reconstruction method based on residual error distillation network | |
CN109949217B (en) | Video super-resolution reconstruction method based on residual learning and implicit motion compensation | |
Yang et al. | Image super-resolution based on deep neural network of multiple attention mechanism | |
CN116168067B (en) | Supervised multi-modal light field depth estimation method based on deep learning | |
CN112950480A (en) | Super-resolution reconstruction method integrating multiple receptive fields and dense residual attention | |
Zhao et al. | Deep equilibrium models for video snapshot compressive imaging | |
CN109146785A (en) | A kind of image super-resolution method based on the sparse autocoder of improvement | |
CN109672885B (en) | Video image coding and decoding method for intelligent monitoring of mine | |
Zeng et al. | Image reconstruction of IoT based on parallel CNN | |
CN110288529A (en) | A kind of single image super resolution ratio reconstruction method being locally synthesized network based on recurrence | |
Liu et al. | An end-to-end multi-scale residual reconstruction network for image compressive sensing | |
CN113284202A (en) | Image compression sensing method of scalable network based on content self-adaption | |
Xu et al. | Fast and accurate image super-resolution using a combined loss | |
Zhang et al. | Image super-resolution via RL-CSC: when residual learning meets convolutional sparse coding | |
Qin et al. | Remote sensing image super-resolution using multi-scale convolutional neural network | |
Cai et al. | Deep Sampling Networks | |
CN106534861B (en) | Compression of images and decompressing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
Application publication date: 20210212 |