CN117054754A

CN117054754A - Quick radio storm signal searching method based on target detection model

Info

Publication number: CN117054754A
Application number: CN202311024924.0A
Authority: CN
Inventors: 王洪丰; 姜文俐; 郝传辉
Original assignee: Dezhou University
Current assignee: Dezhou University
Priority date: 2023-08-15
Filing date: 2023-08-15
Publication date: 2023-11-14

Abstract

The invention relates to a rapid radio storm signal searching method based on a target detection model, and belongs to the technical field of astronomy and artificial intelligence. Aiming at the problems of large workload, simplicity and repetition, time waste caused by DM value violent cracking and the like when a rapid electric storm signal searching task is manually completed, a trainable target detection model is introduced to replace manual completion of rapid electric storm signal searching work, a training set and a testing set are divided into 1:1, model training is completed under the condition of less existing data, a better testing result is obtained, and DM value cracking speed and rapid electric storm signal searching efficiency are remarkably improved. The method has wide application prospect in astronomical signal searching and other fields.

Description

Quick radio storm signal searching method based on target detection model

Technical Field

The invention relates to a rapid radio storm signal searching method based on a target detection model, and belongs to the technical field of astronomy and artificial intelligence.

Background

The fast radio storm (Fast Radio Bursts, FRB) is an explosive, pulsed radio radiation astronomical phenomenon of only a few milliseconds in duration, which is one of the current astronomical hotspot fronts. Because of the characteristics of extremely short burst time and few repetition, the rapid radio storm signal is difficult to capture.

Most of the existing fast radio storm signal searching methods firstly perform dispersion elimination treatment on signals, and the traditional technology generally adopts an automatic and high-performance software pipeline based on the dispersion elimination theory to perform FRB event searching, such as HEIMDALL, FDMT (Fast Dispersion Measure Transform), presto (Pulsar Exploration and Search Toolkit), BEAR (Burst Emission Automatic Roger) and the like. These algorithms face noise and RFI-induced false positive challenges, subject to radio frequency interference. In addition, the traditional searching method needs to perform dispersion elimination processing firstly, namely, the DM value needs to be known firstly, but the DM value is mostly cracked by adopting an exhaustion method at present, namely, experiments are performed by a traversing method, and the method is low in efficiency and wastes computational resources. With the rapid increase of the FRB observed data quantity, the number of false positive candidates is correspondingly increased, the workload of DM value calculation is very huge, and the screening work of FRB events from a large number of candidates is manually carried out, so that the search of rapid electric storm consumes manpower and material resources greatly.

Along with the development of artificial intelligence, a machine learning target detection technology is applied to the field of rapid electric storm, and the cycle-frequency diagram of the rapid electric storm without dispersion is directly learned, so that the current situation that a great deal of manpower is consumed in rapid electric storm searching is changed. The constructed convolutional neural network is used for carrying out regression to predict the DM value to replace violent cracking, so that the current situation that a large amount of material resources are consumed in the process of decoloring can be changed.

Currently, the transient non-relevant publications relate to the acquisition of a fast-shot storm signal by directly performing target detection on a fast-shot storm period-frequency map without dispersion. By adopting the traditional method, the quick electric storm searching is carried out on the basis of achromatizing, and for the DM regression problem, the violent cracking is adopted at present, and related publications such as:

fast radio storm candidate classification based on convolutional neural networks. And carrying out dispersion elimination processing and DM conversion on the dynamic spectrum data to respectively generate a dispersion elimination dynamic spectrum array and a DM-time intensity array. Finally input it into trainingThe convolutional neural network classifier is used for identifying and classifying, so that the classification of the rapid radio storm is completed, and the conventional violent cracking is adopted for the DM regression problem; the algorithm of the write channel brute force DM value as in Applying Deep Learning to Fast Radio Burst Classification requires that all N be _t N of NDM DM test of time samples _f And summing the frequency channels. The computational complexity is O (N) _f N _t log ₂ N _f ). The de-dispersion of trees employs a divide-by-conquer technique that exploits redundancy of adjacent channel de-dispersion (Taylor 1974). By using an FFT-like approach, the problem is reduced to having log ₂ N _f Branched tree, allowing O (N _t N _t log ₂ N _f ) Is not limited by the complexity of (a). A highly optimized CPU-based version of this algorithm has been used for the FRB search of the prime. Other algorithms, such as fast DM transforms (FDMT), exploit the same redundancy and attempt to maintain optimality. The algorithm is used for the FREDDA pipe of ASKAP.

The convolutional neural network is used for regression, the most classical CNN structure using the convolutional neural network is generally a plurality of convolutional layers, the convolutional layers are connected with full connection layers, and finally, the Softmax layer is connected with the output predicted classification probability. If the matrix of the image is also regarded as a vector, whether convolution or FC is performed in the CNN, one tensor is continuously transformed into another tensor, and finally the output is a probability vector with the number of categories formulated for classification as the dimension, and the value of the vector finally output is directly taken as regression, and the final optimized objective function is not cross entry and the like any more, but is directly based on the error of real values.

Disclosure of Invention

The invention aims at solving the problems and the defects existing in the prior art, and creatively provides a rapid shot storm signal searching method based on a target detection model, aiming at solving the technical problems of consumption of manpower and material resources, time waste caused by forced cracking of a DM value in the process of achromatizing in the process of firstly eliminating chromatic dispersion and then searching in the process of manually searching the rapid shot storm signal.

The method can effectively solve the problems of consumption of manpower and material resources, time waste caused by forced cracking of the DM value in the process of achromatizing in the process of manually searching the rapid electric storm signal, and the like, and can be used for applying target detection to the searching work of the rapid electric storm signal, thereby remarkably improving the cracking speed of the DM value and the searching efficiency of the rapid electric storm signal.

The innovation points of the invention include: aiming at the problems of large workload, simple repetition, time waste caused by DM value violent cracking and the like when a rapid electric storm signal searching task is manually completed, a trainable target detection model is introduced to replace manual completion of the rapid electric storm signal searching work, a training set and a testing set are divided into 1:1, model training is completed under the condition of less existing data, a good testing result is obtained, and the efficiency of the rapid electric storm signal searching work is improved. Aiming at the problem of time waste caused by the forced cracking of the DM value in the existing achromatizing process, a DM value regression model is introduced to replace manual completion of the searching work of the rapid radio storm signal, so that achromatizing efficiency is improved.

Advantageous effects

Compared with the prior art, the invention has the following characteristics:

firstly, the rapid radio storm detection is carried out on the non-dispersion signals which are directly captured, so that extremely time-consuming processes such as cracking of DM values and the like are avoided, a large amount of time and resources are saved, and the target detection technology is directly utilized to detect the non-dispersion time-frequency two-dimensional intensity map to judge whether the signals are rapid radio storm signals.

And secondly, carrying out DM value regression by using the constructed convolutional neural network to replace the traditional brute force cracking method of traversing the DM value in a certain step length within the range of 0-2000.

Drawings

FIG. 1 is a schematic flow chart of the method of the present invention.

Fig. 2 is an effect diagram of the method.

Detailed Description

The method of the invention is verified in further detail below with reference to the accompanying drawings.

As shown in fig. 1, a fast shot storm signal searching method based on a target detection model includes the following steps:

step 1: a dataset is constructed.

In particular, the data set is generated by means of a code-generated fast-shot analogue signal. For example, the DM value, the signal-to-noise ratio, and the full width at half maximum are randomly generated as set ranges. And the data sets are partitioned by the training set and the validation set in a set ratio (e.g., a ratio of 9: 1).

In addition, a batch of analog data (e.g., 880 sheets) is regenerated as a test set.

Step 2: the Mask R-CNN model was trained with simulation data.

Firstly, setting parameters to obtain the best effect.

Then, a backbone feature extraction network Resnet101 and a feature pyramid network FPN (Feature Pyramid Networks) are used as feature extraction layers.

For example, the feature map of Conv2_1 (Convolt2_1, the first Convolution layer inside the second Convolution block), conv3_2 (Convolt3_2, the second Convolution layer inside the third Convolution block), conv4_3 (Convolt4_3, the third Convolution layer inside the fourth Convolution block), conv5_4 (Convolt5_4, the fourth Convolution layer inside the fifth Convolution block) of ResNet is taken out and put into the FPN for operation.

The FPN takes out the feature images of each layer of the original trunk network and then operates the feature images, so that the problem that small-size objects in an object detection scene are difficult to detect can be effectively solved.

FPN takes the form of a Feature Map within a pyramid like SSD (Single Shot MultiBox Detector). Unlike SSD, FPN uses not only Feature maps of deep layer in VGG (Visual Geometry Group), but also Feature maps of shallow layer are applied to FPN. By means of bottom-up (bottom-up), top-down (top-down) and transverse connection (lateral connection), the Feature maps are integrated efficiently, the detection time is not greatly increased while the accuracy is improved, SSD is used as a target detection algorithm, and is one of main detection frames up to the present; VGG is a convolutional neural network structure that is widely used; resnet101 represents a backbone feature extraction network, resNet101 has two basic blocks, namely Conv Block and Identity Block, wherein the input and output dimensions of Conv Block are different, so that the network cannot be continuously connected in series, the network dimension is changed, the Identity Block is the same in input dimension and output dimension, so that the network can be connected in series, and the network is deepened.

Step 3: the model is tested with simulated data.

And using the cross-over ratio IoU and the Recall ratio Recall as judging model effects, wherein:

a represents a prediction frame region, B represents a label frame region, A.u.B represents a portion where two regions overlap, and A.u.B represents a set of the two regions. The larger the cross-over ratio, the more accurate the model searches for the FRB signal location.

TP represents an FRB signal predicted as an FRB signal, FP represents a non-FRB signal predicted as an FRB signal, TN represents a non-FRB signal predicted as a non-FRB signal, and FN represents an FRB signal predicted as a non-FRB signal. The higher the recall, the lower the probability that the representative model missed the FRB signal.

Step 4: and constructing a DM value regression model, wherein the DM value regression model comprises a Resnet50 model and a full link layer.

And finally adding a full connection layer by using the Resnet50 model to adapt to the DM value problem, and sending the model into a network for training.

During training, the pictures and the characteristic labels thereof cut by the rapid radio storm search model based on target detection are written into the GPU, the GPU is used for accelerating training, and the pictures are sent into the model training, including flattening matrix, calculating mean square error, gradient zero clearing, back propagation and updating optimizer operation.

Step 5: combining the search model and the DM value regression model, taking the output of the search model as the input of the regression model, and training the model.

Step 6: the entire fast-shot basic processing model is tested.

By R ² As the judgment model effect, the following is concrete:

wherein m represents the number of samples, y ⁽ⁱ⁾ The true value is represented by a value that is true,representing predicted values +.>Representing the sample mean. Var (y) represents the mean-generated error, < >>Representing the prediction-induced error. R is R ² When=1, it means that the predicted value and the true value in the sample are completely equal, without any error. The best-performing model, meaning R ² The value also reaches a maximum.

Step 7: data, which may contain a fast-shot signal, is input into the model, and signal location information and DM values are output.

Specifically, data captured by the radio telescope, which may contain a rapid radio storm signal, is first input into a rapid radio storm base processing model. The data possibly containing the rapid radio storm signal is firstly passed through the RPN network of the search model to generate a candidate region, a binding box (boundary box) is obtained, the data containing the rapid radio storm signal is detected, and the search model integrates the data containing the rapid radio storm signal and is used as the input of the regression model. And after the regression model is calculated, outputting DM values of all the rapid radio storm data in the data.

The method introduces a target detection technology to search the rapid electric-radiation storm signal instead of manual searching, and searches the rapid electric-radiation storm signal without dispersion, thereby improving the searching efficiency of the rapid electric-radiation signal. The DM value regression model is introduced to replace the traditional method for violently cracking the DM value, so that the achromatizing efficiency is improved, and the searching efficiency of the rapid radio storm signal is indirectly improved, as shown in figure 2. The method has wide application prospect in the field of astronomical signal search.

The foregoing is a preferred embodiment of the invention and the invention should not be limited to this embodiment and the disclosure of the drawings. All equivalents and modifications that come within the spirit of the disclosure are desired to be protected.

Claims

1. A rapid radio storm signal searching method based on a target detection model is characterized by comprising the following steps:

step 1: constructing a data set;

step 2: training a Mask R-CNN model by using simulation data;

firstly, setting parameters to obtain the best effect;

then, using a backbone feature extraction network Resnet101 and a feature pyramid network FPN as feature extraction layers;

the FPN takes out each layer of side feature images of the original trunk network and then operates the feature images;

the FPN adopts a form of Feature Map in the pyramid; the FPN not only uses the Feature Map with deep hierarchy in the convolutional neural network structure VGG, but also is applied to the FPN, and the Feature Map is integrated through bottom-up, top-down and transverse connection;

resnet101 represents a backbone feature extraction network, resNet101 has two basic blocks, namely Conv Block and Identity Block, wherein the input and output dimensions of Conv Block are different, the effect is to change the dimension of the network, and the input dimension and output dimension of Identity Block are the same, the effect is to deepen the network;

step 3: testing the model with simulated data;

wherein A represents a predicted frame region, B represents a marked frame region, A n B represents a part where two regions overlap, A n B represents a set of the two regions, and the larger the intersection ratio is, the more accurate the model searches FRB signal positions are:

where TP represents an FRB signal predicted as an FRB signal, FP represents a non-FRB signal predicted as an FRB signal, TN represents a non-FRB signal predicted as a non-FRB signal, and FN represents an FRB signal predicted as a non-FRB signal; the higher the recall rate, the lower the probability of missing FRB signals of the representative model;

step 4: constructing a DM value regression model, including a Resnet50 model and a full link layer;

using a Resnet50 model, and finally adding a full connection layer to adapt to the problem of DM values, and sending the model into a network for training;

during training, writing the pictures and the characteristic labels thereof cut by the rapid radio storm search model based on target detection into a GPU, using the GPU to accelerate training, and sending the pictures into the model training, wherein the operations comprise flattening matrix, calculating mean square error, gradient zero clearing, back propagation and updating an optimizer;

step 5: combining the search model and the DM value regression model, taking the output of the search model as the input of the regression model, and training the model;

step 6: testing the whole rapid radio storm basic processing model;

2. The method of claim 1, wherein step 1 comprises:

generating a data set by adopting a mode of generating a rapid radio storm analog signal by means of codes, and dividing the data set according to a training set and a verification set in a set proportion; in addition, a batch of simulation data is regenerated as a test set.

3. The method for searching fast radio storm signals based on object detection model as claimed in claim 2, wherein in step 1.2, the DM value, signal to noise ratio, full width at half maximum are randomly generated into analog signals according to a set range; the training set and the validation set are partitioned into data sets at a 9:1 ratio.

4. The method of claim 1, wherein in step 6, R is used to search for a fast-speed radio storm signal based on a target detection model ² As a judgment model effect:

wherein m represents the number of samples, y ⁽ⁱ⁾ The true value is represented by a value that is true,representing predicted values +.>Representing a sample mean; var (y) represents the mean-generated error, < >>Representing the prediction-generated error; r is R ² When=1, it means that the predicted value and the true value in the sample are completely equal, without any error.

5. The method of claim 1, wherein in step 7, data captured by the radio telescope and possibly containing the rapid-response storm signal is input into a rapid-response storm basic processing model;

the data possibly containing the rapid radio storm signals firstly pass through an RPN network of a search model to generate candidate areas, a Bounding box is obtained, the data containing the rapid radio storm signals are detected, and then the data containing the rapid radio storm signals are integrated by the search model to be used as the input of a regression model;

and after the regression model is calculated, outputting DM values of all the rapid radio storm data in the data.