CN112906710A - Visual image feature extraction method based on BAKAZE-MAGSAC - Google Patents

Visual image feature extraction method based on BAKAZE-MAGSAC

Info

Publication number
CN112906710A
Authority
CN
China
Prior art keywords
image
point
determining
model
points
Prior art date
Legal status
Pending
Application number
CN202110327237.0A
Other languages
Chinese (zh)
Inventor
胡燕祝
王松
贺琬婧
Current Assignee
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications
Priority to CN202110327237.0A
Publication of CN112906710A

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/30Noise filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a visual image feature extraction method based on BAKAZE-MAGSAC, a method for extracting features from disaster visual images that belongs to the fields of image processing and machine learning, characterized by comprising the following steps: (1) determining a nonlinear diffusion filter; (2) determining the nonlinear scale space of the image; (3) determining the main direction of the feature points; (4) matching the feature points; (5) determining a model quality function; (6) determining the probability that each point is an inlier. The method effectively solves the problem that the number of extracted feature points drops because feature points cannot be extracted reliably when the image undergoes scale and blur changes, greatly reduces the influence of interference noise on feature point extraction, and improves both matching time efficiency and matching precision. It provides a high-accuracy method for the field of disaster visual image feature extraction.

Description

Visual image feature extraction method based on BAKAZE-MAGSAC
Technical Field
The invention relates to the fields of image processing and machine learning, and in particular to a method for extracting features from disaster visual images.
Background
At present, image feature extraction mainly relies on fuzzy extraction of image contours: the interference noise of the image is filtered and the key feature values are then screened out. Conventional approaches include image feature matching based on the AKAZE algorithm and image feature identification based on the Relief algorithm. Because the AKAZE algorithm describes the image's feature points with a binary descriptor, the number of extracted feature points is unstable and drops sharply when the scale or blur of the image changes; the accuracy of the feature points decreases at the same time, and deviations easily arise. The Relief algorithm performs feature detection with a differential frame-scanning technique that marks edge-contour features and identifies ambiguity, so it has difficulty suppressing nonlinear interference noise and describing changes of image brightness across different scale spaces. Owing to changes in image scale and the uncertainty of interference noise, these techniques are prone to failed feature point extraction and interference-noise masking in practical applications, which in turn affects the accurate extraction of image features.
For feature extraction from complex images, the key points are to suppress and filter the interference noise of the image and to improve extraction accuracy when the image undergoes scale and blur changes. Aiming at these problems, an improved visual image feature extraction algorithm based on BAKAZE-MAGSAC is provided. By constructing a nonlinear scale space, the edge information of the image is largely preserved and the interference noise in the image is effectively filtered out; meanwhile, mismatches are eliminated in the coarse-matching and fine-matching stages using feature point scale-information constraints and root-mean-square-error constraints respectively, improving matching time efficiency and matching precision.
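To make the two constraints concrete, the following minimal sketch filters tentative matches first by keypoint scale-ratio consistency (coarse stage) and then by a root-mean-square-error threshold on residuals (fine stage). The OpenCV-style match and keypoint objects and both thresholds are assumptions, since the patent does not specify them.

```python
import numpy as np

def filter_matches(matches, kp1, kp2, scale_tol=1.5, rmse_factor=1.5):
    """Coarse stage: keep matches whose keypoint scale ratio lies within
    [1/scale_tol, scale_tol]. Fine stage: drop matches whose coordinate
    residual exceeds rmse_factor times the RMSE over the coarse survivors
    (a stand-in; the patent measures residuals against the fitted model)."""
    coarse = [m for m in matches
              if 1.0 / scale_tol
              <= kp1[m.queryIdx].size / kp2[m.trainIdx].size
              <= scale_tol]
    d = np.array([np.hypot(kp1[m.queryIdx].pt[0] - kp2[m.trainIdx].pt[0],
                           kp1[m.queryIdx].pt[1] - kp2[m.trainIdx].pt[1])
                  for m in coarse])
    rmse = np.sqrt(np.mean(d ** 2)) if len(d) else 0.0
    return [m for m, di in zip(coarse, d) if di <= rmse_factor * rmse]
```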
Disclosure of Invention
In view of the problems in the prior art, the technical problem to be solved by the present invention is to provide a visual image feature extraction method based on BAKAZE-MAGSAC; the specific flow of the method is shown in Fig. 1.
The technical scheme comprises the following implementation steps:
(1) Determining the nonlinear diffusion filtering:
∂M/∂t = div(c(x, y, t)·∇M)
c(x, y, t) = g(|∇M_σ(x, y, t)|)
where M represents the image brightness, t the evolution time, c(x, y, t) the conduction function, (x, y) the pixel coordinates, x the pixel abscissa and y the pixel ordinate. The scale parameter σ_i, given in pixel units, is converted to the evolution time t_i:
t_i = σ_i²/2
where i is the index of the scale-space level.
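As an illustration, the following minimal NumPy sketch shows one explicit step of the diffusion equation above and the scale-to-time conversion. The Perona–Malik g2 conductivity used for g is an assumption, since the patent does not name a specific conduction function, and the function names are hypothetical.

```python
import numpy as np

def perona_malik_g2(grad_mag, k):
    """Assumed conductivity c = g(|grad M|): the Perona-Malik g2 form
    commonly used by KAZE-family methods."""
    return 1.0 / (1.0 + (grad_mag / k) ** 2)

def diffusion_step(M, dt, k):
    """One explicit step of dM/dt = div(c(x, y, t) * grad M)."""
    gy, gx = np.gradient(M)                     # image gradients
    c = perona_malik_g2(np.hypot(gx, gy), k)    # conduction function
    # Divergence of the conductivity-weighted gradient.
    div = np.gradient(c * gx, axis=1) + np.gradient(c * gy, axis=0)
    return M + dt * div

def scale_to_time(sigma):
    """Convert the pixel-unit scale sigma_i to evolution time t_i = sigma_i^2 / 2."""
    return 0.5 * sigma ** 2
```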
(2) Determining the nonlinear scale space L_{i+1} of the image:
L_{i+1} = (I + τ·A(L_i))·L_i
where L represents the image, I the identity matrix, τ the time step, A(L_i) the conduction matrix of the image at scale level i, and L_i the image L at scale level i. In the nonlinear scale space, a non-maximum suppression method is used to compute the Hessian response of each pixel in the image pyramid; each response is compared with its p neighbouring pixels on the same layer and on the adjacent layers above and below, and the scale-normalized Hessian maxima are located.
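As an illustration, a minimal sketch of this extremum search follows: each pixel's Hessian response is compared with its 26 neighbours (p = 26 in the embodiment below) across the same and adjacent scale layers. The names and the dense-array layout are assumptions.

```python
import numpy as np

def nonmax_suppress_3d(hessian_stack, threshold):
    """Keypoints are pixels whose scale-normalized Hessian response exceeds
    a threshold and all 26 neighbours in the same, previous and next scale
    layers. `hessian_stack` is a (scales, height, width) array."""
    D, H, W = hessian_stack.shape
    keypoints = []
    for s in range(1, D - 1):
        for y in range(1, H - 1):
            for x in range(1, W - 1):
                v = hessian_stack[s, y, x]
                cube = hessian_stack[s - 1:s + 2, y - 1:y + 2, x - 1:x + 2]
                # Strict maximum: v tops the 3x3x3 cube and is unique in it.
                if v > threshold and v >= cube.max() and (cube == v).sum() == 1:
                    keypoints.append((s, y, x))
    return keypoints
```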
(3) Determining the main direction of the feature points:
On the gradient image, taking the feature point as the center and the radius r as the statistical range, the first-order derivatives L_x and L_y of the feature point neighbourhood are computed and Gaussian-weighted; a sector of angular size θ is then rotated around the origin, the vector sum within the sector is computed, and the direction of the longest vector is taken as the main direction of the feature point. The feature points in the image are then described with the BRISK feature descriptor.
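A minimal sketch of this dominant-orientation search follows, with the embodiment's values (radius 6σ, 60° sector) as defaults. The integer radius, the Gaussian sigma and the border handling are assumptions.

```python
import numpy as np

def dominant_orientation(Lx, Ly, cx, cy, radius, sector=np.pi / 3, step=0.1):
    """Main direction at (cx, cy): Gaussian-weight the first-order
    derivatives Lx, Ly in a disc of integer `radius` (the point is assumed
    to lie away from the image border), slide a sector of angular width
    `sector` around the origin, and return the direction of the longest
    vector sum."""
    ys, xs = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    mask = (xs ** 2 + ys ** 2 <= radius ** 2).astype(float)
    w = np.exp(-(xs ** 2 + ys ** 2) / (2.0 * (radius / 2.0) ** 2)) * mask
    gx = Lx[cy - radius:cy + radius + 1, cx - radius:cx + radius + 1] * w
    gy = Ly[cy - radius:cy + radius + 1, cx - radius:cx + radius + 1] * w
    ang = np.arctan2(gy, gx)
    best_norm, best_dir = -1.0, 0.0
    for a0 in np.arange(-np.pi, np.pi, step):
        sel = (ang - a0) % (2 * np.pi) < sector   # pixels inside the sector
        vx, vy = gx[sel].sum(), gy[sel].sum()
        n = np.hypot(vx, vy)
        if n > best_norm:
            best_norm, best_dir = n, np.arctan2(vy, vx)
    return best_dir
```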
(4) Matching the feature points:
d(a, b) = Σ_{i=1}^{j} (m_i ⊕ n_i)
where a and b are the corresponding feature point descriptors, m_i is the i-th bit of descriptor a, n_i is the i-th bit of descriptor b, j is the number of bits in the descriptors, and ⊕ denotes the bitwise exclusive-or operation.
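Since BRISK descriptors are binary strings, the distance above is the Hamming distance: XOR the descriptors and count the differing bits. A minimal sketch with byte-packed descriptors follows; the 64-byte length is an assumption matching BRISK's default 512 bits.

```python
import numpy as np

def hamming_distance(a: np.ndarray, b: np.ndarray) -> int:
    """d(a, b) = sum over the j bits of (m_i XOR n_i) for two binary
    descriptors stored as uint8 byte arrays."""
    return int(np.unpackbits(np.bitwise_xor(a, b)).sum())

# Example with two random 512-bit (64-byte) BRISK-like descriptors.
rng = np.random.default_rng(0)
a = rng.integers(0, 256, 64, dtype=np.uint8)
b = rng.integers(0, 256, 64, dtype=np.uint8)
print(hamming_distance(a, b))   # number of differing bits
```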
(5) Determining a model quality function Q*(α,P):
s feature points are selected from the input data point set to compute a model W, the homography matrix parameters α are obtained, and the model quality function is determined:
[Equation image: definition of the model quality function Q*(α, P), marginalized over the noise scale σ up to σ_max.]
where P is the set of data points, l the degree of image blur, σ_max the maximum projection error, k the number of feature points, σ_i the projection error of feature point i, d a counting parameter, p_d the probability of the corresponding image feature point for count d, and D the residual. Whether the current model is the optimal model is judged according to the model quality function.
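The exact expression of Q*(α, P) is not recoverable from the text above, so the sketch below is only a simplified stand-in in the spirit of MAGSAC's σ-marginalization: it averages a robust per-point score over noise scales up to σ_max. All names and the truncated-quadratic score are assumptions.

```python
import numpy as np

def magsac_like_quality(residuals, sigma_max, n_steps=10):
    """Average an inlier score over noise scales sigma in (0, sigma_max],
    approximating a marginalization over sigma. `residuals` holds the
    projection errors of the k feature points under the current model."""
    sigmas = np.linspace(sigma_max / n_steps, sigma_max, n_steps)
    score = 0.0
    for s in sigmas:
        # Truncated quadratic gain per point: 1 at zero residual,
        # falling to 0 at 3*sigma (a common robust weighting).
        score += np.clip(1.0 - (residuals / (3.0 * s)) ** 2, 0.0, None).sum()
    return score / n_steps
```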
(6) Determining the probability L(p|α) that each point is an inlier:
[Equation image: likelihood L(p|α) that each point is an inlier under the model parameters α.]
where p is the probability that each point is an inlier. The probability of each point is taken as its weight, and the optimal model is fitted by weighted least squares according to these weights. Marginalizing over σ, the number of iterations h is determined:
[Equation image: formula for the number of iterations h after marginalizing over σ.]
If the current model is the best so far, the iteration count h is updated; otherwise iteration stops. The best model found before termination is output as the optimal model of the data set, the correct matching point pairs are computed from it, and erroneous points are eliminated.
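To illustrate the weighted least-squares fit described above, the following sketch refits a homography by a weighted direct linear transform, scaling each correspondence's two DLT rows by the square root of its inlier probability. This is an assumed formulation, not the patent's exact estimator.

```python
import numpy as np

def weighted_homography(src, dst, w):
    """Weighted DLT: fit H mapping src -> dst, with per-point weights w
    (the inlier probabilities). src and dst are sequences of (x, y) pairs."""
    A = []
    for (x, y), (u, v), wi in zip(src, dst, w):
        sw = np.sqrt(wi)
        A.append(sw * np.array([-x, -y, -1, 0, 0, 0, u * x, u * y, u]))
        A.append(sw * np.array([0, 0, 0, -x, -y, -1, v * x, v * y, v]))
    # The homography is the null vector of A: last row of V^T from the SVD.
    _, _, Vt = np.linalg.svd(np.asarray(A))
    H = Vt[-1].reshape(3, 3)
    return H / H[2, 2]
```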
Compared with the prior art, the invention has the advantages that:
(1) The problem that the number of extracted feature points drops because feature points cannot be extracted reliably when the image undergoes scale and blur changes is effectively solved.
(2) The influence of interference noise on feature point extraction is greatly reduced, and matching time efficiency and matching precision are improved at the same time.
Drawings
For a better understanding of the present invention, reference is made to the following further description taken in conjunction with the accompanying drawings.
FIG. 1 is a flow chart of the steps of the BAKAZE-MAGSAC-based visual image feature extraction method;
FIG. 2 is an overall flow chart of the BAKAZE-MAGSAC-based visual image feature extraction method;
FIG. 3 shows the feature extraction results of the invention on four sets of visual images.
Detailed Description of the Preferred Embodiments
The present invention will be described in further detail below with reference to examples.
The data selected for this embodiment come from a typical demonstration area of underground shielded space, with 1000 groups of samples in total covering 5 scenes: an underground tunnel, a railway tunnel, an underground shopping mall, an underground parking lot and a subway station, with 200 groups of data per scene. Using random sampling, 140 groups of samples are drawn from each of the 5 scenes as the training set and the rest are used as the test set, giving 700 samples in total for training and 300 for testing.
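For concreteness, the described split can be sketched as follows; the scene identifiers and the random seed are hypothetical, since the patent gives only the counts (5 scenes × 200 groups, 140 per scene for training).

```python
import random

scenes = ["underground_tunnel", "railway_tunnel", "underground_mall",
          "underground_parking_lot", "subway_station"]
random.seed(42)                        # assumed; no seed is specified
train, test = [], []
for scene in scenes:
    groups = [f"{scene}_{i:03d}" for i in range(200)]  # placeholder group IDs
    picked = set(random.sample(groups, 140))           # random sampling
    train += sorted(picked)
    test += [g for g in groups if g not in picked]
assert len(train) == 700 and len(test) == 300
```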
The overall flow of the visual image feature extraction method provided by the invention is shown in Fig. 1, and the specific steps are as follows:
(1) Determining the nonlinear diffusion filtering:
∂M/∂t = div(c(x, y, t)·∇M)
c(x, y, t) = g(|∇M_σ(x, y, t)|)
where M represents the image brightness, t the evolution time, c(x, y, t) the conduction function, (x, y) the pixel coordinates, x the pixel abscissa and y the pixel ordinate. The scale parameter σ_i, given in pixel units, is converted to the evolution time t_i:
t_i = σ_i²/2
where i is the index of the scale-space level.
(2) Determining the nonlinear scale space L_{i+1} of the image:
L_{i+1} = (I + 3·A(L_i))·L_i
where L represents the image, I the identity matrix, and τ the time step, here taking the value 3; A(L_i) is the conduction matrix of the image at scale level i, and L_i is the image L at scale level i. In the nonlinear scale space, a non-maximum suppression method is used to compute the Hessian response of each pixel in the image pyramid; each response is compared with its 26 neighbouring pixels on the same layer and the adjacent layers above and below, and the scale-normalized Hessian maxima are located.
(3) Determining the main direction of the feature points:
On the gradient image, taking the feature point as the center and a radius of 6σ as the statistical range, the first-order derivatives L_x and L_y of the feature point neighbourhood are computed and Gaussian-weighted; a 60° sector is then rotated around the origin, the vector sum within the sector is computed, and the direction of the longest vector is taken as the main direction of the feature point. The feature points in the image are then described with the BRISK feature descriptor.
(4) Matching the feature points:
d(a, b) = Σ_{i=1}^{j} (m_i ⊕ n_i)
where a and b are the corresponding feature point descriptors, m_i is the i-th bit of descriptor a, n_i is the i-th bit of descriptor b, j is the number of bits in the descriptors, and ⊕ denotes the bitwise exclusive-or operation.
(5) Determining a model quality function Q*(α,P):
4 pairs of feature points are selected from the input data point set to compute a model W, the homography matrix parameters α are obtained, and the model quality function is determined:
[Equation image: definition of the model quality function Q*(α, P), marginalized over the noise scale σ up to σ_max.]
where P is the set of data points, l the degree of image blur, σ_max the maximum projection error, k the number of feature points, σ_i the projection error of feature point i, d a counting parameter, p_d the probability of the corresponding image feature point for count d, and D the residual. Whether the current model is the optimal model is judged according to the model quality function.
(6) Determining the probability L(p|α) that each point is an inlier:
[Equation image: likelihood L(p|α) that each point is an inlier under the model parameters α.]
where p is the probability that each point is an inlier. The probability of each point is taken as its weight, and the optimal model is fitted by weighted least squares according to these weights. Marginalizing over σ, the number of iterations h is determined:
[Equation image: formula for the number of iterations h after marginalizing over σ.]
If the current model is the best so far, the iteration count h is updated; otherwise iteration stops. The best model found before termination is output as the optimal model of the data set, the correct matching point pairs are computed from it, and erroneous points are eliminated.
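As a rough, assumption-laden stand-in for the overall pipeline, stock OpenCV components can approximate it: the standard AKAZE detector (not the patent's BAKAZE variant with BRISK descriptors), brute-force Hamming matching, and MAGSAC++ homography estimation via cv2.USAC_MAGSAC (available in OpenCV 4.5 and later). The image paths and the reprojection threshold are hypothetical.

```python
import cv2
import numpy as np

img1 = cv2.imread("scene_a.png", cv2.IMREAD_GRAYSCALE)   # hypothetical paths
img2 = cv2.imread("scene_b.png", cv2.IMREAD_GRAYSCALE)

akaze = cv2.AKAZE_create()     # binary features in a nonlinear scale space
kp1, des1 = akaze.detectAndCompute(img1, None)
kp2, des2 = akaze.detectAndCompute(img2, None)

# Hamming-distance matching of the binary descriptors, with cross-checking.
matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
matches = sorted(matcher.match(des1, des2), key=lambda m: m.distance)

src = np.float32([kp1[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
dst = np.float32([kp2[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)

# MAGSAC++-based robust homography estimation; outliers are masked out.
H, inlier_mask = cv2.findHomography(src, dst, cv2.USAC_MAGSAC,
                                    ransacReprojThreshold=3.0)
```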
In order to verify the accuracy of visual image feature extraction, four groups of visual image feature extraction experiments were performed with the invention; the experimental results are shown in Fig. 3. As can be seen from Fig. 3, the accuracy of visual image feature extraction with the established method stays above 98%, achieving high accuracy while ensuring stability, with a good feature extraction effect. The visual image feature extraction method established by the invention is therefore effective, provides a sound method for accurate image feature extraction, and has practical value.

Claims (1)

1. A visual image feature extraction method based on BAKAZE-MAGSAC, characterized by: (1) determining a nonlinear diffusion filter; (2) determining the nonlinear scale space of the image; (3) determining the main direction of the feature points; (4) matching the feature points; (5) determining a model quality function; (6) determining the probability that each point is an inlier; the method specifically comprises the following six steps:
step one: determining the nonlinear diffusion filtering
∂M/∂t = div(c(x, y, t)·∇M)
c(x, y, t) = g(|∇M_σ(x, y, t)|)
where M represents the image brightness, t the evolution time, c(x, y, t) the conduction function, (x, y) the pixel coordinates, x the pixel abscissa and y the pixel ordinate; the scale parameter σ_i, given in pixel units, is converted to the evolution time t_i:
t_i = σ_i²/2
where i is the index of the scale-space level;
step two: determining a non-linear scale space L of an imagei+1
L_{i+1} = (I + τ·A(L_i))·L_i
where L represents the image, I the identity matrix, τ the time step, A(L_i) the conduction matrix of the image at scale level i, and L_i the image L at scale level i; in the nonlinear scale space, the Hessian response of each pixel in the image pyramid is computed using non-maximum suppression, each response is compared with its p neighbouring pixels on the same layer and the adjacent layers above and below, and the scale-normalized Hessian maxima are located;
step three: determining the main direction of the feature points:
on the gradient image, taking the feature point as the center and the radius r as the statistical range, computing the first-order derivatives L_x and L_y of the feature point neighbourhood and applying Gaussian weighting; then rotating a sector of angular size θ around the origin, computing the vector sum within the sector, and taking the direction of the longest vector as the main direction of the feature point; describing the feature points in the image with the BRISK feature descriptor;
step four: matching the characteristic points:
d(a, b) = Σ_{i=1}^{j} (m_i ⊕ n_i)
where a and b are the corresponding feature point descriptors, m_i is the i-th bit of descriptor a, n_i is the i-th bit of descriptor b, j is the number of bits in the descriptors, and ⊕ denotes the bitwise exclusive-or operation;
step five: determining a model quality function Q*(α,P):
s feature points are selected from the input data point set to compute a model W, the homography matrix parameters α are obtained, and the model quality function is determined:
[Equation image: definition of the model quality function Q*(α, P), marginalized over the noise scale σ up to σ_max.]
where P is the set of data points, l the degree of image blur, σ_max the maximum projection error, k the number of feature points, σ_i the projection error of feature point i, d a counting parameter, p_d the probability of the corresponding image feature point for count d, and D the residual; whether the current model is the optimal model is judged according to the model quality function;
step six: determine the probability L (p | α) that each point is an inlier:
[Equation image: likelihood L(p|α) that each point is an inlier under the model parameters α.]
where p is the probability that each point is an inlier; the probability of each point is taken as its weight, and the optimal model is fitted by weighted least squares according to these weights; marginalizing over σ, the number of iterations h is determined:
[Equation image: formula for the number of iterations h after marginalizing over σ.]
if the current model is the best so far, the iteration count h is updated, otherwise iteration stops; the best model found before termination is output as the optimal model of the data set, the correct matching point pairs are computed from it, and erroneous points are eliminated.
CN202110327237.0A 2021-03-26 2021-03-26 Visual image feature extraction method based on BAKAZE-MAGSAC Pending CN112906710A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110327237.0A CN112906710A (en) 2021-03-26 2021-03-26 Visual image feature extraction method based on BAKAZE-MAGSAC

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110327237.0A CN112906710A (en) 2021-03-26 2021-03-26 Visual image feature extraction method based on BAKAZE-MAGSAC

Publications (1)

Publication Number Publication Date
CN112906710A (en) 2021-06-04

Family

ID=76108835

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110327237.0A Pending CN112906710A (en) 2021-03-26 2021-03-26 Visual image feature extraction method based on BAKAZE-MAGSAC

Country Status (1)

Country Link
CN (1) CN112906710A (en)


Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160350618A1 (en) * 2015-04-01 2016-12-01 Take-Two Interactive Software, Inc. System and method for image capture and modeling
CN106022342A (en) * 2016-05-05 2016-10-12 南京邮电大学 Image feature extraction method based on KAZE algorithm
CN109146833A (en) * 2018-08-02 2019-01-04 广州市鑫广飞信息科技有限公司 A kind of joining method of video image, device, terminal device and storage medium
CN111476737A (en) * 2020-04-15 2020-07-31 腾讯科技(深圳)有限公司 Image processing method, intelligent device and computer readable storage medium

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
WU Hanqian et al.: "Application of an Improved A-KAZE Algorithm in Image Registration", Journal of Southeast University (Natural Science Edition) *
SONG Wei et al.: "UAV Image Matching Based on an Improved AKAZE Algorithm", Journal of Electronic Measurement and Instrumentation *
ZHANG Yu: "Research on Image Registration Algorithms Based on AKAZE Features", China Master's Theses Full-text Database, Information Science and Technology *
CHENG Yu et al.: "Image Feature Matching Method Based on an Improved AKAZE Algorithm", Journal of Xi'an Polytechnic University *

Similar Documents

Publication Publication Date Title
CN109740603B (en) Vehicle character recognition method based on CNN convolutional neural network
CN108230278B (en) Image raindrop removing method based on generation countermeasure network
CN106548169B (en) Fuzzy literal Enhancement Method and device based on deep neural network
CN113808180B (en) Heterologous image registration method, system and device
CN109344845B (en) Feature matching method based on triple deep neural network structure
CN111242026B (en) Remote sensing image target detection method based on spatial hierarchy perception module and metric learning
CN105184225B (en) A kind of multinational banknote image recognition methods and device
CN111209858A (en) Real-time license plate detection method based on deep convolutional neural network
CN114091606A (en) Tunnel blasting blast hole half-hole mark identification and damage flatness evaluation classification method
CN111401358A (en) Instrument dial plate correction method based on neural network
CN117036737A (en) Feature extraction and matching method based on information entropy, GMS and LC significant detection
CN113610024B (en) Multi-strategy deep learning remote sensing image small target detection method
CN113421223B (en) Industrial product surface defect detection method based on deep learning and Gaussian mixture
CN114627424A (en) Gait recognition method and system based on visual angle transformation
CN113673515A (en) Computer vision target detection algorithm
CN114065798A (en) Visual identification method and device based on machine identification
CN110991501B (en) Improved ORB feature point matching method based on Hessian matrix
CN115880683B (en) Urban waterlogging ponding intelligent water level detection method based on deep learning
CN112070714A (en) Method for detecting copied image based on local ternary counting characteristics
CN116778164A (en) Semantic segmentation method for improving deep V < 3+ > network based on multi-scale structure
CN113850748A (en) Point cloud quality evaluation system and method
CN108960285B (en) Classification model generation method, tongue image classification method and tongue image classification device
CN112906710A (en) Visual image feature extraction method based on BAKAZE-MAGSAC
CN117079272A (en) Bullet bottom socket mark feature identification method combining manual features and learning features
CN115205155A (en) Distorted image correction method and device and terminal equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210604