CN110443207B - Target progressive detection and identification method based on hierarchical feature tensor - Google Patents

Target progressive detection and identification method based on hierarchical feature tensor Download PDF

Info

Publication number
CN110443207B
CN110443207B CN201910727379.9A CN201910727379A CN110443207B CN 110443207 B CN110443207 B CN 110443207B CN 201910727379 A CN201910727379 A CN 201910727379A CN 110443207 B CN110443207 B CN 110443207B
Authority
CN
China
Prior art keywords
target
tensor
sub
false
resolution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910727379.9A
Other languages
Chinese (zh)
Other versions
CN110443207A (en
Inventor
陈浩
高通
陈稳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Institute of Technology
Original Assignee
Harbin Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Institute of Technology filed Critical Harbin Institute of Technology
Priority to CN201910727379.9A priority Critical patent/CN110443207B/en
Publication of CN110443207A publication Critical patent/CN110443207A/en
Application granted granted Critical
Publication of CN110443207B publication Critical patent/CN110443207B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

A target progressive detection and identification method based on hierarchical feature tensor belongs to the technical field of target detection and identification in remote sensing images. The method solves the problem that the accuracy of the existing method for detecting and identifying the target in the remote sensing image is low. According to the method, a target identification model based on a tensor mode is established, and the problem that a typical vector mode identification method ignores internal structural information of a remote sensing image target and is easy to overfit under a small sample is solved. The method realizes accurate identification of the target by learning the hierarchical feature tensor with the most discrimination capability and utilizing a soft and hard interval support tensor machine. The method is suitable for detecting and identifying the target in the large-scene remote sensing image. The method can obtain more than 98% of detection accuracy rate for the remote sensing image with large information amount and complex background, and can be applied to the technical field of target detection and identification in the remote sensing image.

Description

Target progressive detection and identification method based on hierarchical feature tensor
Technical Field
The invention belongs to the technical field of target detection and identification in remote sensing images, and particularly relates to a target progressive detection and identification method.
Background
With the gradual maturity of remote sensing technology, the quality of the obtained remote sensing image is greatly improved, and the target identification based on the remote sensing image becomes a hot research problem. And for various ground targets, the target detection and identification have important research significance and value and wide application scenes.
At present, a great deal of research is developed for remote sensing image target detection and identification at home and abroad, certain achievements are obtained, and various target detection methods aiming at optical image slices are divided into static target detection and moving target detection according to different target types; according to different implementation strategies, the method can be divided into bottom-up data-driven target detection and top-down knowledge (hypothesis) -driven target detection; according to the type of the target features, the method can be divided into target detection based on local features and target detection based on global features; in addition, from the viewpoint of human visual recognition, the utilization of visual saliency is an important means for realizing target detection.
However, the conventional spectral feature can only describe the characteristic of one pixel, and the spatial feature which can be extracted from the local area of the image is ignored. The vector itself is only a one-dimensional array, and the spatial spectrum information of the image is arranged in the vector, so that the structural information of the pixel characteristics is necessarily lost. The accuracy rate of target detection and identification in the remote sensing image is low.
Disclosure of Invention
The invention aims to solve the problem that the accuracy of detection and identification of a target in a remote sensing image is low in the existing method, and provides a target progressive detection and identification method based on hierarchical feature tensor.
The technical scheme adopted by the invention for solving the technical problems is as follows: a target progressive detection and identification method based on hierarchical feature tensor comprises the following steps:
the method comprises the following steps of firstly, extracting real target slices and false target slices to construct a training sample set, and constructing a multi-resolution feature tensor corresponding to each slice in the training sample set;
step two, constructing a characteristic discrimination criterion, and extracting characteristic sub-tensors with the most discrimination capability in different resolutions according to a projection matrix from the multi-resolution characteristic tensor intermediate mathematics Xi Touying matrix constructed in the step one;
training a soft and hard interval support tensor machine by using the extracted characteristic sub tensor until all false targets are correctly classified or the number of layers of a multi-resolution layer reaches a threshold value Q, and obtaining a trained target identification model;
thirdly, quickly detecting a target of the remote sensing image to be detected and identified to obtain a candidate target position; then, acquiring a candidate target slice by intercepting an image block at the position of the candidate target;
step four, respectively carrying out direction estimation on each obtained candidate target slice, and carrying out rotation transformation on each candidate target slice according to the direction estimation result to obtain all candidate target slices after the rotation transformation;
step five, extracting the feature tensors with different resolutions of each slice obtained in the step four, and extracting the feature sub-tensors with the most discriminative ability with different resolutions from the feature tensors with different resolutions by using the projection matrix obtained in the step two;
and inputting the extracted characteristic sub tensor into the target recognition model trained in the second step to obtain a target recognition result of the remote sensing image to be detected and recognized, which is output by the target recognition model.
The beneficial effects of the invention are: the invention provides a target progressive detection and identification method based on hierarchical feature tensor, which establishes a target identification model based on a tensor mode, and solves the problem that a typical vector mode identification method ignores internal structural information of a remote sensing image target and is easy to over-fit under a small sample. The method realizes accurate identification of the target by learning the hierarchical feature tensor with the most discrimination capability and utilizing a soft and hard interval support tensor machine.
The method is suitable for detecting and identifying the target in the large-scene remote sensing image. The method can obtain more than 98% of detection accuracy for the remote sensing image with large information amount and complex background, and has very important significance in the military and civil fields.
Drawings
FIG. 1 is a flow chart of a hierarchical feature tensor-based progressive detection identification method of a target of the present invention;
FIG. 2 is a schematic representation of a standard STM and soft-hard interval support tensor machine classification hyperplane of the present invention;
fig. 3 is a schematic illustration of an experimental slice training sample.
Detailed Description
The first embodiment is as follows: as shown in fig. 1, the method for detecting and identifying a target progressive based on a hierarchical feature tensor according to the present embodiment includes the following steps:
the method comprises the following steps of firstly, extracting a real target slice and a false target slice to construct a training sample set, and constructing a multi-resolution feature tensor corresponding to each slice in the training sample set;
the multi-resolution feature tensor is: constructing a Q-layer image pyramid, extracting images of different layers of the pyramid to respectively construct a feature tensor, and gradually increasing the resolution from the first layer to the Q-th layer of the pyramid to form a multi-resolution layer;
step two, constructing a characteristic discrimination criterion, and extracting characteristic sub-tensors with the most discrimination capability in different resolutions according to a projection matrix from the multi-resolution characteristic tensor intermediate mathematics Xi Touying matrix constructed in the step one;
training a soft and hard interval support tensor machine by using the extracted characteristic sub tensor until all false targets are correctly classified or the number of layers of a multi-resolution layer reaches a threshold value Q, and obtaining a trained target identification model;
thirdly, after the remote sensing image to be detected and identified is subjected to target rapid detection, a candidate target position is obtained; then, acquiring a candidate target slice by intercepting an image block at the position of the candidate target;
step four, respectively carrying out direction estimation on each obtained candidate target slice, and carrying out rotation transformation on each candidate target slice according to the direction estimation result to obtain all candidate target slices after the rotation transformation;
direction estimation refers to estimating the direction of an object that may be included in a candidate object slice. Dividing the target slices of the training sample set into M types according to the direction of the included target, constructing a K-neighbor classifier, and finishing direction estimation of the candidate target by constructing a feature tensor and inputting the feature tensor to the K-neighbor classifier for any candidate target slice. If the target direction in the target slice is X degrees, it is ceil (X × M/360) class.
Step five, extracting the feature tensors with different resolutions of each slice obtained in the step four, and extracting the feature sub-tensors with the most discriminative ability with different resolutions from the feature tensors with different resolutions by using the projection matrix obtained in the step two;
and inputting the extracted characteristic sub tensor into the target recognition model trained in the second step to obtain a target recognition result of the remote sensing image to be detected and recognized, which is output by the target recognition model.
The objects in the digital images are in tensor forms essentially, the extracted image features are described by the tensors, the spatial features of the images are reserved, and the target identification precision can be improved to a certain extent.
Conventional vector mode feature selection selects a subset of features from a set of candidate features, resulting in the feature selection algorithm of conventional vector mode not being applicable to tensor mode data. The invention designs a target progressive identification technology based on a hierarchical feature tensor model, and utilizes a hierarchical feature tensor from low spatial resolution to high spatial resolution to remove false alarms with obvious difference between the current resolution and a target layer by layer, and finally outputs a target identification result meeting specific features of a specified model.
The method fully analyzes the specific characteristics of the typical target in the large-scene remote sensing image, constructs the characteristic tensor according to the specific characteristics, obtains a better detection and identification effect, and has a better application prospect. Experiments against typical aircraft targets identified the aircraft on a *** earth data source. Experimental results show that the method not only can reliably and effectively identify the airplane target in the high-resolution remote sensing image, but also can have a strong robust effect in a complex scene.
The second embodiment is as follows: the first difference between the present embodiment and the specific embodiment is: the specific process of the second step is as follows:
and (3) extracting the feature tensors with different resolutions in the first step, wherein the feature tensors always have great contribution of local sub tensors to target identification, so that the sub tensors are extracted by using a tensor mode feature selection algorithm based on tensor manifold discrimination and arrangement to realize effective target identification.
The manifold learning theory considers that a manifold structure with low dimensionality is contained in a high-dimensional feature, namely, the original sample close relation can still be maintained in the low dimensionality. However, some conventional manifold learning methods ignore the difficult sample problem and do not analyze the separation of heterogeneous neighbors in depth.
A difficult sample refers to a sample that is easily confused with other classes of samples. Due to the complexity of the remote sensing image background, the situation that an object is mixed with the background (false object) easily occurs (for example, the similarity of an airplane object and a cross-shaped road intersection under the spatial feature is high), especially under the description of the low-resolution feature. The reason is that the low resolution feature tensor describes the rough characteristics of the target, and the feature dimensionality is lower, which is more likely to generate a false target that is difficult to distinguish. The task of the multi-resolution identification layer is to separate false targets which can be distinguished by the current resolution, and difficult samples should be distinguished by a subsequent high-resolution feature identification layer.
Extracting multi-resolution characteristic tensor and constructing a training set according to a real target and a false target slice, wherein the false target slice is a chi false_t arg et The method is divided into two types, one is a false target (namely a difficult sample) x which is difficult to separate from a real target hard_samples The other is the easy-to-remove false target (i.e. simple sample) χ simple_samples (ii) a The goal of each multi-resolution identification layer is to remove χ as much as possible according to a soft and hard interval support tensor machine corresponding to the feature tensor training of the current resolution on the premise of keeping the correct identification of a real target sample simple_samples And then, constructing a training set of the next layer by using the real target samples and the false target samples which cannot be correctly classified by the previous layer.
And then constructing a characteristic discrimination criterion, defining samples which are difficult to meet the characteristic discrimination criterion as sample samples which are difficult to be instantiated, and using the rest samples as simple samples. The feature tensor selection algorithm should enable the feature sub-tensors of the same type of samples to still keep the manifold structure of the original feature tensor, that is, the neighbors of the same type of samples of the original feature tensor are still neighbors in the sub-tensors, and the sub-tensors of different types are separated as far as possible. Whereas a difficult sample is a false target that is difficult to separate from a real target.
Step two, aiming at the first layer of the multi-resolution layer, a false target x j′ ∈χ false_t arg et ,χ false_t arg et Representing a set of false objects, x j′ With each real target
Figure BDA0002159375200000041
The sum of squares of the Frobenius norms, and the sum result represents the false target χ j′ The difference from the real target is that,
Figure BDA0002159375200000042
χ t arg et represents a set of real targets, 1 ≦ i' ≦ N t ,N t The total number of real targets;
false target χ j′ Maximum separation from the totality of real objects
Figure BDA0002159375200000043
Comprises the following steps:
Figure BDA0002159375200000044
wherein: u shape i′ In order to be a projection matrix, the projection matrix,
Figure BDA0002159375200000045
is a false target x j′ Maximum separation from the whole real target, all false targets
Figure BDA0002159375200000046
In descending order, the top 80% of the dummy objects are the first layer of hard samples, the other dummy objects are the first layer of simple samples,
Figure BDA0002159375200000047
represents U i′ And chi j′ In the mode product of the i' th mode,
Figure BDA0002159375200000048
represents U i′ And
Figure BDA0002159375200000049
the mode product at the i' th mode;
to ensure
Figure BDA0002159375200000051
In that
Figure BDA0002159375200000052
Neighbor feature tensor or
Figure BDA0002159375200000053
In that
Figure BDA0002159375200000054
The adjacent feature tensors are far away from each other as much as possible, and adjacent samples of the original feature tensors are selected to construct adjacent constraint;
for the
Figure BDA0002159375200000055
The same group (means
Figure BDA0002159375200000056
False target or true target together with neighbor set NH) neighbor set NH representation
Figure BDA0002159375200000057
K neighbors in the same group, i.e.
Figure BDA0002159375200000058
To be separated (i.e. the different groups,
Figure BDA0002159375200000059
unlike the neighbor set NH, the neighbor set NM is a dummy target,
Figure BDA00021593752000000510
is a true target, or the neighbor set NM is a true target,
Figure BDA00021593752000000511
as a false target) represents: if it is
Figure BDA00021593752000000512
Then
Figure BDA00021593752000000513
K neighbors to be separated are
Figure BDA00021593752000000514
If it is
Figure BDA00021593752000000515
Then
Figure BDA00021593752000000516
K neighbors to be separated are
Figure BDA00021593752000000517
If it is
Figure BDA00021593752000000518
For the real target, the same group of k neighbors means, and
Figure BDA00021593752000000519
the k nearest real targets with different k adjacent groups mean
Figure BDA00021593752000000520
The k false objects that are closest in distance,
when in use
Figure BDA00021593752000000521
Is composed of
Figure BDA00021593752000000522
(
Figure BDA00021593752000000523
Possibly a false target, and possibly a true target) or
Figure BDA00021593752000000524
Is composed of
Figure BDA00021593752000000525
When k is adjacent to each other in the same group, the weight value
Figure BDA00021593752000000526
Is 1, otherwise the weight value
Figure BDA00021593752000000527
Is 0;
when the temperature is higher than the set temperature
Figure BDA00021593752000000528
Is composed of
Figure BDA00021593752000000529
Of different groups k of neighbors or
Figure BDA00021593752000000530
Is composed of
Figure BDA00021593752000000531
When k is adjacent to other groups, the weight value
Figure BDA00021593752000000532
Is 1, otherwise weight
Figure BDA00021593752000000533
Is 0;
according to the relationship of the adjacent neighbors,
Figure BDA00021593752000000534
sub tensor of
Figure BDA00021593752000000535
Satisfying the minimum following formula:
Figure BDA00021593752000000536
wherein: k is a regularization parameter, the purpose of which is to balance the importance of the same set of neighbors remaining separate from different sets of samples,
Figure BDA00021593752000000537
is composed of
Figure BDA00021593752000000538
The sub-tensors of (a) are,
Figure BDA00021593752000000539
the meaning of the above equation is the neutralization of the original tensor
Figure BDA00021593752000000540
The nearest neighbor of the same group of samples in the sub-tensor is the nearest possible, and the chi to be separated simple_samples And with
Figure BDA00021593752000000541
Are to be moved away from each other.
The equation (2) is modified as:
Figure BDA00021593752000000542
in the formula: intermediate variables
Figure BDA0002159375200000061
Intermediate variables
Figure BDA0002159375200000062
Is a matrix of the units,
Figure BDA0002159375200000063
g and h represent the g-th row and h-th column of the matrix respectively,
Figure BDA0002159375200000064
represents the first
Figure BDA0002159375200000065
Sub-tensor set formed by samples and 2k neighbors thereof
Figure BDA0002159375200000066
In (1)The sub-tensor of the g-th sample (which may be a false target, or a target),
Figure BDA0002159375200000067
represents the first
Figure BDA0002159375200000068
Sub-tensor set composed of samples and 2k neighbors thereof
Figure BDA0002159375200000069
The sub-tensor of the h-th sample in (a); diag (·) represents a diagonal matrix with input parameters as diagonals;
the optimization goal of equation (3) is the sample
Figure BDA00021593752000000610
The target function of the samples adjacent to it, and the overall target function is the target function of all samples
Figure BDA00021593752000000611
Stacking the optimization objective functions corresponding to the N with the i being more than or equal to 1 and less than or equal to N;
to select a sub-tensor
Figure BDA00021593752000000612
And with
Figure BDA00021593752000000613
Neighbor (including same group neighbor and different group neighbor) sub tensor, defining selection matrix
Figure BDA00021593752000000614
As shown in equation (4):
Figure BDA00021593752000000615
wherein:
Figure BDA00021593752000000616
to select a matrix
Figure BDA00021593752000000617
I of (1) 0 Line kth 0 Elements of a column;
Figure BDA00021593752000000618
represents
Figure BDA00021593752000000619
Is composed of
Figure BDA00021593752000000620
Of the same group of neighbors 0 The number of the samples is one,
Figure BDA00021593752000000621
represents
Figure BDA00021593752000000622
Is composed of
Figure BDA00021593752000000623
Of the same group of neighbors 0 The number of the samples is one,
Figure BDA00021593752000000624
represents
Figure BDA00021593752000000625
Is composed of
Figure BDA00021593752000000626
Of a different group of neighbors 0 The number of the samples is one,
Figure BDA00021593752000000627
represents
Figure BDA00021593752000000628
Is composed of
Figure BDA00021593752000000629
Of a different group of neighbors 0 A sample is obtained;
the overall feature discrimination objective function is expressed as equation (5):
Figure BDA00021593752000000630
intermediate variables
Figure BDA00021593752000000631
A set of sub-tensor representing the total of N samples (i.e. the total number of false targets and real targets is N)
Figure BDA00021593752000000632
The sub-tensor of the g-th sample (which may be a false target or a true target),
Figure BDA0002159375200000071
subset set of sub-tensor representing the entire N samples
Figure BDA0002159375200000072
The sub-tensor of the h-th sample in (a);
the final feature discrimination criterion is:
Figure BDA0002159375200000073
in the formula: w is an intermediate variable matrix, an
Figure BDA0002159375200000074
Figure BDA0002159375200000075
Is W to the middle
Figure BDA0002159375200000076
Go to the first
Figure BDA0002159375200000077
The elements of the column are,
Figure BDA0002159375200000078
represents
Figure BDA0002159375200000079
Is composed of
Figure BDA00021593752000000710
Are close to each other in the same group k,
Figure BDA00021593752000000711
represents
Figure BDA00021593752000000712
Is composed of
Figure BDA00021593752000000713
Are close to each other in the same group k,
Figure BDA00021593752000000714
represents
Figure BDA00021593752000000715
Is composed of
Figure BDA00021593752000000716
Are close to each other in the different groups k,
Figure BDA00021593752000000717
represents
Figure BDA00021593752000000718
Is composed of
Figure BDA00021593752000000719
K, D is a diagonal matrix,
Figure BDA00021593752000000720
equation (6) is equivalent to the form of equation (7):
Figure BDA00021593752000000721
L g,h is a Laplace matrix;
by using
Figure BDA00021593752000000722
Replacement of
Figure BDA00021593752000000723
By using
Figure BDA00021593752000000724
Replacement of
Figure BDA00021593752000000725
i =1,2, …, l; transforming equation (7) into the form of equation (8):
Figure BDA00021593752000000726
then, the alternative iteration method is utilized to respectively and sequentially solve the U i I =1,2, …, l, and based on the determined projection matrix U i Calculating the most discriminative characteristic sub-tensor of the first layer of the multi-resolution layer
Figure BDA00021593752000000727
J' is more than or equal to 1 and less than or equal to N; (remote sensing image tensor expression and manifold learning method research _ Zhang Lefei)
Fig. 2 (a) shows that the standard STM uses soft gaps, and all the black and white segments are segmented incorrectly. Assuming that black is the class of targets and white is the false target, for the classifier of the multi-resolution recognition layer, the situation of fig. 2 (b) (SHM-STM) occurs, i.e., targets can be classified correctly and have hard intervals, while the false target can tolerate false separation under the condition of maximizing soft intervals. Establishing a soft-hard interval support tensor machine (SHM-STM), according to the objective function of the standard STM for maximizing class interval and minimizing relaxation variable and the soft-hard interval constraint required by us, then:
step two, in order to obtain the parameters of the soft-hard interval support tensor machine, the optimization problem of constructing the soft-hard interval support tensor machine is as follows:
Figure BDA0002159375200000081
wherein: w is a i Parameters representing the i-th hyperplane of the soft-hard interval support tensor machine, i =1,2, …, l, l represents the number of hyperplanes of the soft-hard interval support tensor machine,
Figure BDA00021593752000000811
representing the outer product of the vectors, C representing the penalty factor of the soft-hard interval support tensor machine, ξ i″ Represents the relaxation variable for the ith "simple sample, 1 ≦ i ≦ N sft ,N sft For the number of false target samples,
Figure BDA0002159375200000082
is the sub tensor of the ith sample with the category of the real target, i' is more than or equal to 1 and less than or equal to N t ,N t B represents the bias of the hyperplane for the true target sample number;
Figure BDA0002159375200000083
is a sub-tensor of the ith "sample of the class of false objects,
Figure BDA0002159375200000084
denotes w i And
Figure BDA0002159375200000085
in the mode product of the i-th mode,
Figure BDA0002159375200000086
denotes w i And
Figure BDA0002159375200000087
the mode product at the ith mode;
the soft-hard interval support tensor machine can be considered as an extreme form of the C-STM, which can be expressed as follows;
Figure BDA0002159375200000088
Figure BDA0002159375200000089
Figure BDA00021593752000000810
ξ i ≥0,1≤i≤N simple +N t arg et
when C is present 1 The formula is + ∞, and the C-STM is equivalent to the SHM-STM. The SHM-STM ensures the maximum target recognition rate of the training process.
Respectively identifying the characteristic sub-tensors with the highest discrimination capability of the first layer, and obtaining the category to which each characteristic sub-tensor belongs:
Figure BDA0002159375200000091
wherein:
Figure BDA0002159375200000092
if it is
Figure BDA0002159375200000093
Is 1, then the sub-tensor
Figure BDA0002159375200000094
The corresponding sample is classified as a real target if
Figure BDA0002159375200000095
Is-1, the sub-tensor
Figure BDA0002159375200000096
The category of the corresponding sample is a false target;
and step two, repeating the process from the step one to the step two for other layers of the multi-resolution layer until all the false targets are correctly classified or the number of layers of the multi-resolution layer reaches a threshold value Q, and stopping training to obtain a trained target recognition model.
For the second layer of the multi-resolution layer, false targets which cannot be correctly classified by the first layer and all real target samples are used as a training set of the second layer of the multi-resolution layer, and the like, for the Nth layer of the multi-resolution layer, false targets which cannot be correctly classified by the (N-1) th layer and all real target samples are used as a training set of the Nth layer of the multi-resolution layer;
and (3) adopting a mode of sequentially executing a first layer, a second layer, a third layer, … …, an Nth layer and … … until all the false targets are correctly classified or the number of layers of the multi-resolution layer reaches a threshold value Q, stopping training, and obtaining a trained target recognition model.
The third concrete implementation mode: the second embodiment is different from the first embodiment in that: the specific process of the third step is as follows:
and rapidly detecting a target of the remote sensing image based on the visual saliency to obtain candidate target positions in the remote sensing image, and intercepting an image block at each candidate target position according to the center of each candidate target position, the size of a real target and the prior of the resolution of the remote sensing image to obtain an image slice of each candidate target.
In the large-scene airplane target detection, after rapid detection based on visual saliency, a large number of candidate targets exist, and image slices corresponding to the size of an airplane are intercepted to be used as slices to be identified; for the remote sensing image to be detected, the suspected target position can be marked by methods such as selective search and the like.
And taking the center of the candidate target position as a center, and intercepting a square image block, wherein the side length of the image block is between 1 and 2 times of the target length.
The fourth concrete implementation mode: the third difference between the present embodiment and the specific embodiment is that: the multi-resolution feature tensor in the first step is a multi-resolution Gabor (Gabor) feature tensor, a multi-resolution Gist (Generalized search trees) feature, or a multi-resolution morphological difference operator feature tensor.
The fifth concrete implementation mode is as follows: the fourth difference between this embodiment and the specific embodiment is that: the threshold Q of the number of layers of the multi-resolution layer is 6.
The following examples were used to demonstrate the beneficial effects of the present invention:
the method for progressively detecting and identifying the target based on the hierarchical feature tensor has the advantages that a large number of experiments are conducted, experimental data sources are optical remote sensing images of 20 airports of a Google Earth data source, a slice result drawing in the experiments is shown in figure 3, (a) is a real target training sample, and (b) is a false target training sample, and sliding window identification is conducted with the step length of 10.
According to the overall training algorithm of the target progressive recognition model of the hierarchical feature tensor, which is provided by the invention, the model is trained, and relevant training parameters are enabled to be delta S cellsize =[40,40],δ 1 =[16,16],δ 2 =20,δ 3 =[4,4]. Setting the difficult sample of the multi-resolution identification layer as
Figure BDA0002159375200000101
The dimension of the eigen-tensor is 20% of the smaller samples
Figure BDA0002159375200000102
Where round () represents rounding and size () is the dimension from which the input tensor is extracted. In the target attitude estimation layer
Figure BDA0002159375200000103
I.e. we have
Figure BDA0002159375200000104
Target poses are classified into 10 classes for separation, and sigma in target pose estimation is the minimum distance of different classes of samples in single pose
Figure BDA0002159375200000105
Each level of the model can correctly identify a part of background samples till the last background sample is almost allPartial removal, and since the proposed SHM-STM and OC-STM ensure 100% correct classification of the targets in the training process, 4 of 912 background samples are considered as false alarms and 600 target samples are correctly classified at the end of the final model training.
The above-described calculation examples of the present invention are merely to explain the calculation model and the calculation flow of the present invention in detail, and are not intended to limit the embodiments of the present invention. It will be apparent to those skilled in the art that other variations and modifications can be made on the basis of the foregoing description, and it is not intended to exhaust all of the embodiments, and all obvious variations and modifications which fall within the scope of the invention are intended to be included within the scope of the invention.

Claims (5)

1. A target progressive detection and identification method based on hierarchical feature tensor is characterized by comprising the following steps:
the method comprises the following steps of firstly, extracting a real target slice and a false target slice to construct a training sample set, and constructing a multi-resolution feature tensor corresponding to each slice in the training sample set;
step two, constructing a characteristic discrimination criterion, and extracting characteristic sub-tensors with the most discrimination capability in different resolutions according to a projection matrix from the multi-resolution characteristic tensor intermediate mathematics Xi Touying matrix constructed in the step one;
training a soft and hard interval support tensor machine by using the extracted characteristic sub tensor until all false targets are correctly classified or the number of layers of a multi-resolution layer reaches a threshold value Q, and obtaining a trained target identification model;
thirdly, after the remote sensing image to be detected and identified is subjected to target rapid detection, a candidate target position is obtained; then, acquiring a candidate target slice by intercepting an image block at the position of the candidate target;
step four, respectively carrying out direction estimation on each obtained candidate target slice, and carrying out rotation transformation on each candidate target slice according to a direction estimation result to obtain all candidate target slices after rotation transformation;
step five, extracting the feature tensors with different resolutions of each slice obtained in the step four, and extracting the feature sub-tensors with the most discriminative ability with different resolutions from the feature tensors with different resolutions by using the projection matrix obtained in the step two;
and inputting the extracted characteristic sub tensor into the target recognition model trained in the second step to obtain a target recognition result of the remote sensing image to be detected and recognized, which is output by the target recognition model.
2. The method for progressively detecting and identifying the target based on the hierarchical feature tensor of claim 1, wherein the specific process of the second step is as follows:
step two, regarding the first layer of the multi-resolution layer, the false target x j′ ∈χ false_target Will falsely target x j′ With each real target
Figure FDA0002159375190000011
The sum of squares of the Frobenius norms, and the sum result represents the false target χ j′ The difference from the real target is that,
Figure FDA0002159375190000012
1≤i′≤N t ,N t the total number of real targets;
false target χ j′ Maximum separation from the totality of real objects
Figure FDA0002159375190000013
Comprises the following steps:
Figure FDA0002159375190000014
wherein: u shape i′ In order to be a projection matrix, the projection matrix,
Figure FDA0002159375190000015
is a false target x j′ To all bodiesMaximum separation of targets, all false targets
Figure FDA0002159375190000016
In descending order, the top 80% of the dummy objects are the first layer of hard samples, the other dummy objects are the first layer of simple samples,
Figure FDA0002159375190000017
represents U i′ And chi j′ In the mode product of the i' th mode,
Figure FDA0002159375190000021
represents U i′ And
Figure FDA0002159375190000022
the mode product at the i' th mode;
when in use
Figure FDA0002159375190000023
Is composed of
Figure FDA0002159375190000024
Of the same group k neighbor or
Figure FDA0002159375190000025
Is composed of
Figure FDA0002159375190000026
When k is adjacent to the same group, the weight value
Figure FDA0002159375190000027
Is 1, otherwise the weight value
Figure FDA0002159375190000028
Is 0;
when in use
Figure FDA0002159375190000029
Is composed of
Figure FDA00021593751900000210
Of different groups k of neighbors or
Figure FDA00021593751900000211
Is composed of
Figure FDA00021593751900000212
When k is adjacent to other groups, the weight value
Figure FDA00021593751900000213
Is 1, otherwise the weight value
Figure FDA00021593751900000214
Is 0;
according to the relationship of the adjacent neighbors,
Figure FDA00021593751900000215
sub tensor of
Figure FDA00021593751900000216
Satisfying the minimum following formula:
Figure FDA00021593751900000217
wherein: k is a regularization parameter and is,
Figure FDA00021593751900000218
is composed of
Figure FDA00021593751900000219
The sub-tensors of (a) are,
Figure FDA00021593751900000220
the equation (2) is modified as:
Figure FDA00021593751900000221
in the formula: intermediate variables
Figure FDA00021593751900000222
Intermediate variables
Figure FDA00021593751900000223
Figure FDA00021593751900000224
Figure FDA00021593751900000225
Is a matrix of the units,
Figure FDA00021593751900000226
g and h represent the g-th row and h-th column of the matrix respectively,
Figure FDA00021593751900000227
Figure FDA00021593751900000228
represents the first
Figure FDA00021593751900000229
Sub-tensor set formed by samples and 2k neighbors thereof
Figure FDA00021593751900000230
The sub-tensor of the g-th sample in (b),
Figure FDA00021593751900000231
represents the first
Figure FDA00021593751900000232
A sub-tensor composed of one sample and 2k neighbors thereofCollection
Figure FDA00021593751900000233
The sub-tensor of the h-th sample in (a);
to select a sub-tensor
Figure FDA00021593751900000234
And with
Figure FDA00021593751900000235
Neighbor tensor, defining a selection matrix
Figure FDA00021593751900000236
As shown in equation (4):
Figure FDA00021593751900000237
wherein:
Figure FDA00021593751900000238
to select a matrix
Figure FDA00021593751900000239
I of (1) 0 Line kth 0 Elements of a column;
Figure FDA00021593751900000240
represents
Figure FDA00021593751900000241
Is composed of
Figure FDA00021593751900000242
Of the same group of neighbors 0 The number of the samples is one,
Figure FDA00021593751900000243
represent
Figure FDA00021593751900000244
Is composed of
Figure FDA00021593751900000245
Of the same group of neighbors 0 The number of the samples is one,
Figure FDA0002159375190000031
represent
Figure FDA0002159375190000032
Is composed of
Figure FDA0002159375190000033
Of different groups of neighbors 0 The number of the samples is one,
Figure FDA0002159375190000034
represents
Figure FDA0002159375190000035
Is composed of
Figure FDA0002159375190000036
Of different groups of neighbors 0 A sample is obtained;
the overall feature discrimination objective function is expressed as equation (5):
Figure FDA0002159375190000037
intermediate variables
Figure FDA0002159375190000038
Figure FDA0002159375190000039
Subset set of sub-tensor representing the entire N samples
Figure FDA00021593751900000310
The sub-tensor of the g-th sample in (b),
Figure FDA00021593751900000311
subset set of sub-tensor representing the entire N samples
Figure FDA00021593751900000312
The sub-tensor of the h-th sample in (a);
the final feature discrimination criterion is:
Figure FDA00021593751900000313
in the formula: w is an intermediate variable matrix, an
Figure FDA00021593751900000314
Is W to the middle
Figure FDA00021593751900000315
Go to the first
Figure FDA00021593751900000316
The elements of the column are,
Figure FDA00021593751900000317
represents
Figure FDA00021593751900000318
Is composed of
Figure FDA00021593751900000319
Are close to each other in the same group k,
Figure FDA00021593751900000320
represents
Figure FDA00021593751900000321
Is composed of
Figure FDA00021593751900000322
Are close to each other in the same group k,
Figure FDA00021593751900000323
represents
Figure FDA00021593751900000324
Is composed of
Figure FDA00021593751900000325
Are close to each other in the different groups k,
Figure FDA00021593751900000326
represent
Figure FDA00021593751900000327
Is composed of
Figure FDA00021593751900000328
K, D is a diagonal matrix,
Figure FDA00021593751900000329
equation (6) is equivalent to the form of equation (7):
Figure FDA00021593751900000330
L g,h is a Laplace matrix;
by using
Figure FDA00021593751900000331
Replacement of
Figure FDA00021593751900000332
By using
Figure FDA00021593751900000333
Replacement of
Figure FDA00021593751900000334
Transforming equation (7) into the form of equation (8):
Figure FDA00021593751900000335
then, the alternative iteration method is utilized to respectively and sequentially solve the U i I =1,2, …, l, and based on the determined projection matrix U i Calculating the most discriminative characteristic sub-tensor of the first layer of the multi-resolution layer
Figure FDA0002159375190000041
Step two, in order to obtain the parameters of the soft-hard interval support tensor machine, the optimization problem of constructing the soft-hard interval support tensor machine is as follows:
Figure FDA0002159375190000042
wherein: w is a i Parameters representing the i-th hyperplane of the soft-hard interval support tensor machine, i =1,2, …, l, l represents the number of hyperplanes of the soft-hard interval support tensor machine,
Figure FDA00021593751900000415
representing the outer product of the vectors, C representing the penalty factor of the soft-hard interval support tensor machine, ξ i″ Represents the relaxation variable for the ith "simple sample, 1 ≦ i ≦ N sft ,N sft For the number of false target samples,
Figure FDA0002159375190000043
is the sub tensor of the ith sample with the category of the real target, i' is more than or equal to 1 and less than or equal to N t ,N t B generation for the number of real target samplesA bias of the watch hyperplane;
Figure FDA0002159375190000044
is a sub-tensor of the ith "sample of the class of false objects,
Figure FDA0002159375190000045
denotes w i And
Figure FDA0002159375190000046
in the mode product of the i-th mode,
Figure FDA0002159375190000047
denotes w i And
Figure FDA0002159375190000048
the mode product at the ith mode;
respectively identifying the characteristic sub-tensors with the highest discrimination capability of the first layer, and obtaining the category to which each characteristic sub-tensor belongs:
Figure FDA0002159375190000049
wherein:
Figure FDA00021593751900000410
if it is
Figure FDA00021593751900000411
Is 1, the sub-tensor
Figure FDA00021593751900000412
The corresponding sample is classified as a real target if
Figure FDA00021593751900000413
Is-1, then the sub-tensor
Figure FDA00021593751900000414
The category of the corresponding sample is a false target;
and step two, repeating the process from the step one to the step two for other layers of the multi-resolution layer until all the false targets are correctly classified or the number of layers of the multi-resolution layer reaches a threshold value Q, and stopping training to obtain a trained target recognition model.
3. The method for progressively detecting and identifying the target based on the hierarchical feature tensor of claim 2, wherein the specific process of the third step is as follows:
and carrying out target rapid detection on the remote sensing image based on the visual saliency to obtain candidate target positions in the remote sensing image, and intercepting an image block at each candidate target position according to the center of each candidate target position, the size of a real target and the prior of the resolution of the remote sensing image to obtain an image slice of each candidate target.
4. The method as claimed in claim 3, wherein the multi-resolution feature tensor in the first step is a multi-resolution Gabor feature tensor, a multi-resolution Gist feature tensor or a multi-resolution morphological difference operator feature tensor.
5. The method according to claim 4, wherein the threshold Q of the number of layers of the multi-resolution layer is 6.
CN201910727379.9A 2019-08-07 2019-08-07 Target progressive detection and identification method based on hierarchical feature tensor Active CN110443207B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910727379.9A CN110443207B (en) 2019-08-07 2019-08-07 Target progressive detection and identification method based on hierarchical feature tensor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910727379.9A CN110443207B (en) 2019-08-07 2019-08-07 Target progressive detection and identification method based on hierarchical feature tensor

Publications (2)

Publication Number Publication Date
CN110443207A CN110443207A (en) 2019-11-12
CN110443207B true CN110443207B (en) 2022-10-11

Family

ID=68433971

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910727379.9A Active CN110443207B (en) 2019-08-07 2019-08-07 Target progressive detection and identification method based on hierarchical feature tensor

Country Status (1)

Country Link
CN (1) CN110443207B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111666925B (en) * 2020-07-02 2023-10-17 北京爱笔科技有限公司 Training method and device for face recognition model

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103761510A (en) * 2014-01-02 2014-04-30 华南理工大学 Method for motion recognition for simulating human visual cortex perception mechanism
CN106529511A (en) * 2016-12-13 2017-03-22 北京旷视科技有限公司 Image structuring method and device
CN107563447A (en) * 2017-09-08 2018-01-09 哈尔滨工业大学 Hierarchical identification method of the target to target site in a kind of remote sensing images
CN107609507A (en) * 2017-09-08 2018-01-19 哈尔滨工业大学 Feature based tensor sum supports the Remote Sensing Target recognition methods of tensor machine
CN109614936A (en) * 2018-12-12 2019-04-12 哈尔滨工业大学 The layered recognition method of remote sensing images Aircraft Targets

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9031317B2 (en) * 2012-09-18 2015-05-12 Seiko Epson Corporation Method and apparatus for improved training of object detecting system
US10078794B2 (en) * 2015-11-30 2018-09-18 Pilot Ai Labs, Inc. System and method for improved general object detection using neural networks
US10726244B2 (en) * 2016-12-07 2020-07-28 Samsung Electronics Co., Ltd. Method and apparatus detecting a target

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103761510A (en) * 2014-01-02 2014-04-30 华南理工大学 Method for motion recognition for simulating human visual cortex perception mechanism
CN106529511A (en) * 2016-12-13 2017-03-22 北京旷视科技有限公司 Image structuring method and device
CN107563447A (en) * 2017-09-08 2018-01-09 哈尔滨工业大学 Hierarchical identification method of the target to target site in a kind of remote sensing images
CN107609507A (en) * 2017-09-08 2018-01-19 哈尔滨工业大学 Feature based tensor sum supports the Remote Sensing Target recognition methods of tensor machine
CN109614936A (en) * 2018-12-12 2019-04-12 哈尔滨工业大学 The layered recognition method of remote sensing images Aircraft Targets

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Hao Chen,等.A hierarchical support tensor machine structure for target detection on high-resolution remote sensing images.《IEEE》.2017, *
基于卷积神经网络的 Webshell 检测方法研究;姜天,等;《网络与信息安全》;20190710;第27-31页 *

Also Published As

Publication number Publication date
CN110443207A (en) 2019-11-12

Similar Documents

Publication Publication Date Title
US11195051B2 (en) Method for person re-identification based on deep model with multi-loss fusion training strategy
CN111860612B (en) Unsupervised hyperspectral image hidden low-rank projection learning feature extraction method
Zhao et al. Spectral–spatial feature extraction for hyperspectral image classification: A dimension reduction and deep learning approach
Huang et al. An SVM ensemble approach combining spectral, structural, and semantic features for the classification of high-resolution remotely sensed imagery
CN105389550B (en) It is a kind of based on sparse guide and the remote sensing target detection method that significantly drives
CN103839065B (en) Extraction method for dynamic crowd gathering characteristics
Nie et al. Pavement distress detection based on transfer learning
CN109871875B (en) Building change detection method based on deep learning
CN105528794A (en) Moving object detection method based on Gaussian mixture model and superpixel segmentation
CN105718866B (en) A kind of detection of sensation target and recognition methods
CN101140624A (en) Image matching method
CN106023257A (en) Target tracking method based on rotor UAV platform
CN104077605A (en) Pedestrian search and recognition method based on color topological structure
CN111860106B (en) Unsupervised bridge crack identification method
CN108427919B (en) Unsupervised oil tank target detection method based on shape-guided saliency model
Mo et al. Vehicles detection in traffic flow
CN115527269B (en) Intelligent human body posture image recognition method and system
Kaluri et al. Sign gesture recognition using modified region growing algorithm and adaptive genetic fuzzy classifier
CN111274964B (en) Detection method for analyzing water surface pollutants based on visual saliency of unmanned aerial vehicle
CN105069471A (en) Hyperspectral data subspace projection and classification method based on fuzzy label
CN110443207B (en) Target progressive detection and identification method based on hierarchical feature tensor
CN107679467B (en) Pedestrian re-identification algorithm implementation method based on HSV and SDALF
Peng et al. Semi-supervised bolt anomaly detection based on local feature reconstruction
CN100416592C (en) Human face automatic identifying method based on data flow shape
CN117437691A (en) Real-time multi-person abnormal behavior identification method and system based on lightweight network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant