CN112435296A - Image matching method for VSLAM indoor high-precision positioning - Google Patents
Image matching method for VSLAM indoor high-precision positioning
- Publication number
- CN112435296A (application number CN202011380479.8A)
- Authority
- CN
- China
- Prior art keywords
- value
- matching
- image
- illumination
- vslam
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
- G06T7/74—Determining position or orientation of objects or cameras using feature-based methods involving reference images or patches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/246—Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
- G06T7/248—Analysis of motion using feature-based methods, e.g. the tracking of corners or segments involving reference images or patches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30241—Trajectory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30244—Camera pose
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
The invention relates to an image matching method for VSLAM indoor high-precision positioning. By adding an illumination suppression factor to the matching similarity criterion, the method can reliably detect repeatable key points under severe illumination changes, is robust to illumination, and improves the accuracy of image matching. In addition, the method combines cosine similarity with the Euclidean metric, so that both the direction and the absolute magnitude of key points are measured during matching, further improving matching accuracy.
Description
Technical Field
The invention relates to an image matching method for VSLAM indoor high-precision positioning.
Background
A visual SLAM system is mainly used for robot visual positioning. Its workflow includes front-end image acquisition, back-end matching and optimization, and loop-closure detection. Image matching consists of extracting feature points from each frame of image and then matching frames using the feature-point coordinates and descriptors to obtain the camera's motion trajectory. Computing the matching relationship between the current frame and adjacent frames, and from it the relative spatial relationship between the two scenes, is therefore the key step in recovering the camera trajectory.
In the image matching process, the Euclidean distance or an improved variant of it is the common similarity criterion. The Euclidean distance measures the absolute difference between two points in m-dimensional space, that is, their real distance, whereas cosine similarity measures the difference in direction within the m-dimensional space and is insensitive to absolute magnitude.
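For two m-dimensional feature vectors x and y, these two standard measures can be written as follows (standard definitions, stated here for reference rather than taken from the patent text):
d(x, y) = sqrt( ∑ (x_i − y_i)² ), i = 1, 2, …, m
cos θ = ∑ x_i·y_i / ( sqrt(∑ x_i²) · sqrt(∑ y_i²) )
d grows with the difference in magnitude between the two vectors, while cos θ is close to 1 whenever they point in nearly the same direction, regardless of their lengths.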
Disclosure of Invention
The inventors have found that, during photographing or video recording with a depth camera, lighting conditions change dynamically because of day-night cycles, weather changes, and the unpredictability of the lighting environment. Severe illumination changes affect the accuracy of image feature-point extraction, and identifying and matching feature points between two images taken under very different illumination causes many problems in the loop-closure detection and back-end optimization stages, easily leading to positioning failure or reduced positioning accuracy.
In addition, considering only the numerical difference of feature points, or only the difference of their direction vectors, during matching also limits the accuracy of image matching.
To address these problems, the invention provides an image matching method for VSLAM indoor high-precision positioning that has good illumination robustness and high image matching accuracy.
The technical scheme adopted by the invention is as follows:
an image matching method for VSLAM indoor high-precision positioning comprises the following steps:
step one, extracting feature points of each frame of image;
step two, establishing a matching similarity criterion, which combines the Euclidean metric and the cosine similarity metric and adds an illumination suppression factor; the matching similarity criterion function E is defined by formula (1);
in formula (1), α and β are numerical coefficients with value range (0, 1); sqrt denotes the positive square root; γ(t) is the illumination suppression factor with value range (0, 1); w_i is a weighting coefficient with value range (0, 1); x_i1 denotes the i-th dimension coordinate of the first feature point and x_i2 the i-th dimension coordinate of the second feature point, i = 1, 2, …, n;
the illumination suppression factor γ(t) is expressed by formula (2);
in formula (2), η is a suppression coefficient with value range (0, 1); ΔI(t) is the change in the luminous intensity of the light source over time, and r is the distance from the camera to the light source;
step three, calculating the matching similarity metric value E between adjacent frame images with the matching similarity criterion function, sorting the candidate images by the calculated metric values, and selecting the image with the smallest E value as the matching frame;
and step four, quantitatively estimating the inter-frame camera motion from the motion-vector change between adjacent matched frames to obtain the position and attitude of the camera.
Further, in step three, when the positioning error exceeds 1 m it is determined that the illumination conditions have changed drastically, and the step size used to adjust the illumination suppression factor γ(t) is tuned dynamically within the range (0, 1); otherwise, candidate values of the α and β parameters are generated by stepping each of them through the range (0, 1), all combinations are traversed, and the optimal parameter values are returned.
Further, the initial value of β is set to 0.9 and the initial value of α to 0.1; alternatively, the initial value of β is set to 0.1 and the initial value of α to 0.9.
The invention has the beneficial effects that:
By adding the illumination suppression factor to the matching similarity criterion, the method can reliably detect repeatable key points under severe illumination changes, is robust to illumination, and improves the accuracy of image matching. In addition, the method combines cosine similarity with the Euclidean metric, so that both the direction and the absolute magnitude of key points (their difference in direction and in distance) are measured during matching, further improving matching accuracy.
Drawings
Fig. 1 is a flow chart of the image matching method for VSLAM indoor high-precision positioning according to the present invention.
Detailed Description
The image matching method for VSLAM indoor high-precision positioning according to the present invention is further described below with reference to the accompanying drawings and specific embodiments.
As shown in Fig. 1, the image matching method for VSLAM indoor high-precision positioning includes the following steps:
Step one: extracting the feature points of each frame of image (including coordinate information, descriptors, and direction information in 8 dimensions).
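The patent does not prescribe a particular detector; purely as an illustration, the sketch below uses ORB from OpenCV (an assumed choice) to obtain the key-point coordinates, orientations and descriptors of one frame.

```python
import cv2

def extract_features(frame_gray):
    """Extract key points and descriptors from one grayscale frame.

    ORB is an illustrative choice only; the patent requires coordinate
    information, descriptors and direction information for each key point.
    """
    orb = cv2.ORB_create(nfeatures=1000)
    keypoints, descriptors = orb.detectAndCompute(frame_gray, None)
    # Each key point carries pixel coordinates (kp.pt) and an orientation
    # (kp.angle); descriptor rows encode the local appearance.
    coords = [kp.pt for kp in keypoints]
    angles = [kp.angle for kp in keypoints]
    return coords, angles, descriptors
```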
Step two: establishing a matching similarity criterion, which combines the Euclidean metric and the cosine similarity metric and adds an illumination suppression factor. The matching similarity criterion function E is defined by formula (1).
in the formula (1), both α and β are numeric coefficients, and the numeric range is (0, 1). sqrt denotes taking the positive square root. Gamma (t) is an illumination inhibition factor, and the value range is (0, 1). w is aiThe value range is (0,1) for the weighting coefficient. x is the number ofi1I-dimensional coordinate, x, representing the first feature pointi2And i-th-dimension coordinates representing the second feature point, i being 1,2, … n.
The illumination suppression factor γ(t) is expressed by formula (2).
In formula (2), η is a suppression coefficient with value range (0, 1); ΔI(t) is the change in the luminous intensity of the light source over time, and r is the distance from the camera to the light source.
The matching similarity criterion of the invention considers both the difference in the absolute distance between feature points, measured by the Euclidean metric d, and the influence of the feature points' direction information, measured by the cosine similarity cos.
d = sqrt( ∑ (x_i1 − x_i2)² )    (3)
The Euclidean metric d measures the absolute distance between feature points.
The closer the cosine value is to 1, the closer the included angle is to 0 degrees, i.e., the more similar the two feature points are.
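Formula (1) itself is published only as an image in the original document, so the sketch below should be read as a hedged illustration: it assumes one plausible way of combining the quantities described above (a weighted Euclidean term scaled by β, a cosine-direction term scaled by α, and the illumination suppression factor γ(t)), not the exact formula of the patent.

```python
import numpy as np

def matching_similarity(x1, x2, w, alpha, beta, gamma_t):
    """Assumed form of the matching similarity criterion E (smaller = more similar).

    x1, x2      : n-dimensional coordinate/descriptor vectors of two key points
    w           : per-dimension weighting coefficients w_i, each in (0, 1)
    alpha, beta : numerical coefficients in (0, 1)
    gamma_t     : illumination suppression factor in (0, 1); formula (2) is not
                  reproduced here, so the value is supplied by the caller
    """
    x1, x2, w = (np.asarray(v, dtype=float) for v in (x1, x2, w))
    # Weighted Euclidean term: absolute distance between the two key points.
    d = np.sqrt(np.sum(w * (x1 - x2) ** 2))
    # Cosine term: 1 - cos(theta) is small when the directions agree.
    cos_sim = np.dot(x1, x2) / (np.linalg.norm(x1) * np.linalg.norm(x2) + 1e-12)
    # Assumed combination: the illumination factor scales the whole criterion.
    return gamma_t * (beta * d + alpha * (1.0 - cos_sim))
```

A frame-level score can then be obtained by accumulating E over the matched key-point pairs of two frames, and the candidate frame with the smallest accumulated value is selected as the matching frame in step three.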
Step three: calculating the matching similarity metric value E between adjacent frame images with the matching similarity criterion function, sorting the candidate images by the calculated metric values, and selecting the image with the smallest E value as the matching frame.
When the image quality does not meet the requirement and the positioning error is large (exceeds 1 m), it is judged that the illumination conditions have changed drastically, and the step size used to adjust the illumination suppression factor γ(t) is tuned dynamically within the range (0, 1). Otherwise, candidate values of the α and β parameters are generated by stepping each of them through the range (0, 1), all combinations are traversed, and the optimal parameter values are returned.
β acts as the distance penalty coefficient of the matching criterion, while α reflects how the data are distributed after being mapped into the high-dimensional feature space. The larger β is, the more easily the model overfits; the smaller β is, the more easily it underfits. The larger α is, the more feature-point descriptor directions are supported; the smaller α is, the fewer are supported. A smaller α improves the generalization of the model, but if it is too small the model in practice degenerates into a linear model; a larger α can, in theory, fit arbitrary non-linear data.
Before the simulation starts, α and β are initialized within the range 0.1–1. In the preferred configuration the initial value of β is set to 0.9 and the initial value of α to 0.1; alternatively, β is initialized to 0.1 and α to 0.9. Then, according to the observed model accuracy, β or α is increased (but not both at the same time) or decreased, multiplying by 10 or by 0.1 at each step; once the approximate range has been determined, the search interval is refined, as in the sketch below.
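The following sketch illustrates such a coarse-to-fine search under stated assumptions: the coarse candidate grid and the scoring function (e.g., the positioning error obtained for a given α, β pair) are not defined by the patent and are left as user-supplied choices here.

```python
import itertools
import numpy as np

def coarse_to_fine_search(score_fn, coarse=(0.1, 0.3, 0.5, 0.7, 0.9), fine_step=0.02):
    """Search alpha and beta in (0, 1).

    score_fn(alpha, beta) returns the positioning error (smaller is better)
    and is application-specific.
    """
    # Coarse phase: traverse all combinations of the candidate values.
    a0, b0 = min(itertools.product(coarse, coarse), key=lambda ab: score_fn(*ab))
    # Fine phase: refine the search interval around the best coarse pair.
    alphas = np.clip(np.arange(a0 - 0.1, a0 + 0.1 + 1e-9, fine_step), 0.01, 0.99)
    betas = np.clip(np.arange(b0 - 0.1, b0 + 0.1 + 1e-9, fine_step), 0.01, 0.99)
    return min(itertools.product(alphas, betas), key=lambda ab: score_fn(*ab))
```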
Step four: quantitatively estimating the inter-frame camera motion from the motion-vector change between adjacent matched frames to obtain the position and attitude of the camera.
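The patent does not specify how the inter-frame motion is computed from the matched points; one common option, shown here only as an assumed illustration, is to estimate the essential matrix from the matched key-point coordinates and decompose it into a rotation and translation with OpenCV.

```python
import cv2
import numpy as np

def estimate_camera_motion(pts_prev, pts_curr, K):
    """Recover the relative rotation R and (unit-scale) translation t between
    two frames from matched key points.

    pts_prev, pts_curr : Nx2 arrays of matched pixel coordinates
    K                  : 3x3 camera intrinsic matrix
    """
    pts_prev = np.asarray(pts_prev, dtype=np.float64)
    pts_curr = np.asarray(pts_curr, dtype=np.float64)
    # RANSAC rejects mismatched pairs while fitting the essential matrix.
    E, mask = cv2.findEssentialMat(pts_prev, pts_curr, K,
                                   method=cv2.RANSAC, prob=0.999, threshold=1.0)
    # Decompose E into the camera rotation and translation direction.
    _, R, t, _ = cv2.recoverPose(E, pts_prev, pts_curr, K, mask=mask)
    return R, t
```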
The above description is only one embodiment of the present invention, and the scope of the present invention is not limited thereto; any alternative or variation that can be easily conceived by those skilled in the art within the technical scope disclosed herein shall fall within the scope of protection of the present invention.
Claims (3)
1. An image matching method for VSLAM indoor high-precision positioning is characterized by comprising the following steps:
step one, extracting feature points of each frame of image;
step two, establishing a matching similarity criterion, which combines the Euclidean metric and the cosine similarity metric and adds an illumination suppression factor; the matching similarity criterion function E is defined by formula (1);
in formula (1), α and β are numerical coefficients with value range (0, 1); sqrt denotes the positive square root; γ(t) is the illumination suppression factor with value range (0, 1); w_i is a weighting coefficient with value range (0, 1); x_i1 denotes the i-th dimension coordinate of the first feature point and x_i2 the i-th dimension coordinate of the second feature point, i = 1, 2, …, n;
the illumination suppression factor γ(t) is expressed by formula (2);
in formula (2), η is a suppression coefficient with value range (0, 1); ΔI(t) is the change in the luminous intensity of the light source over time, and r is the distance from the camera to the light source;
step three, calculating the matching similarity metric value E between adjacent frame images with the matching similarity criterion function, sorting the candidate images by the calculated metric values, and selecting the image with the smallest E value as the matching frame;
and step four, quantitatively estimating the inter-frame camera motion from the motion-vector change between adjacent matched frames to obtain the position and attitude of the camera.
2. The image matching method for VSLAM indoor high-precision positioning according to claim 1, wherein in step three, when the positioning error exceeds 1 m it is determined that the illumination conditions have changed drastically, and the step size used to adjust the illumination suppression factor γ(t) is tuned dynamically within the range (0, 1); otherwise, candidate values of the α and β parameters are generated by stepping each of them through the range (0, 1), all combinations are traversed, and the optimal parameter values are returned.
3. The image matching method for VSLAM indoor high-precision positioning according to claim 2, wherein the initial value of β is set to 0.9 and the initial value of α is set to 0.1; alternatively, the initial value of β is set to 0.1 and the initial value of α is set to 0.9.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011380479.8A CN112435296B (en) | 2020-12-01 | 2020-12-01 | Image matching method for VSLAM indoor high-precision positioning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011380479.8A CN112435296B (en) | 2020-12-01 | 2020-12-01 | Image matching method for VSLAM indoor high-precision positioning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112435296A true CN112435296A (en) | 2021-03-02 |
CN112435296B CN112435296B (en) | 2024-04-19 |
Family
ID=74699003
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011380479.8A Active CN112435296B (en) | 2020-12-01 | 2020-12-01 | Image matching method for VSLAM indoor high-precision positioning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112435296B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107330357A (en) * | 2017-05-18 | 2017-11-07 | 东北大学 | Vision SLAM closed loop detection methods based on deep neural network |
US20180297207A1 (en) * | 2017-04-14 | 2018-10-18 | TwoAntz, Inc. | Visual positioning and navigation device and method thereof |
CN110097093A (en) * | 2019-04-15 | 2019-08-06 | 河海大学 | A kind of heterologous accurate matching of image method |
CN111797938A (en) * | 2020-07-15 | 2020-10-20 | 燕山大学 | Semantic information and VSLAM fusion method for sweeping robot |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180297207A1 (en) * | 2017-04-14 | 2018-10-18 | TwoAntz, Inc. | Visual positioning and navigation device and method thereof |
CN107330357A (en) * | 2017-05-18 | 2017-11-07 | 东北大学 | Vision SLAM closed loop detection methods based on deep neural network |
CN110097093A (en) * | 2019-04-15 | 2019-08-06 | 河海大学 | A kind of heterologous accurate matching of image method |
CN111797938A (en) * | 2020-07-15 | 2020-10-20 | 燕山大学 | Semantic information and VSLAM fusion method for sweeping robot |
Non-Patent Citations (2)
Title |
---|
MING FAN, SEUNG-WON JUNG, SUNG-JEA KO: "Highly Accurate Scale Estimation from Multiple Keyframes Using RANSAC Plane Fitting With a Novel Scoring Method", IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY ( VOLUME: 69, ISSUE: 12, DECEMBER 2020), 24 November 2020 (2020-11-24) * |
Li Bo: "Research on Visual Loop-Closure Detection for Mobile Robots Based on Scene Appearance Modeling", China Doctoral Dissertations Full-text Database, 15 December 2011 (2011-12-15) * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023044661A1 (en) * | 2021-09-23 | 2023-03-30 | Intel Corporation | Learning reliable keypoints in situ with introspective self-supervision |
Also Published As
Publication number | Publication date |
---|---|
CN112435296B (en) | 2024-04-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1871622B (en) | Image collation system and image collation method | |
CN110807809B (en) | Light-weight monocular vision positioning method based on point-line characteristics and depth filter | |
Ellis | Performance metrics and methods for tracking in surveillance | |
US7706601B2 (en) | Object posture estimation/correlation system using weight information | |
CN106682603B (en) | Real-time driver fatigue early warning system based on multi-source information fusion | |
CN110889324A (en) | Thermal infrared image target identification method based on YOLO V3 terminal-oriented guidance | |
CN111476827B (en) | Target tracking method, system, electronic device and storage medium | |
CN113554125B (en) | Object detection apparatus, method and storage medium combining global and local features | |
CN108197604A (en) | Fast face positioning and tracing method based on embedded device | |
CN107403451B (en) | Self-adaptive binary characteristic monocular vision odometer method, computer and robot | |
CN107622502A (en) | The path extraction of robot vision leading system and recognition methods under the conditions of complex illumination | |
CN101777129A (en) | Image matching method based on feature detection | |
CN108871290B (en) | visible light dynamic positioning method based on optical flow detection and Bayesian prediction | |
CN103093458A (en) | Detecting method and detecting device for key frame | |
CN116363694A (en) | Multi-target tracking method of unmanned system crossing cameras matched with multiple pieces of information | |
CN112435296A (en) | Image matching method for VSLAM indoor high-precision positioning | |
CN113362377B (en) | VO weighted optimization method based on monocular camera | |
Savinykh et al. | Darkslam: Gan-assisted visual slam for reliable operation in low-light conditions | |
CN111476314B (en) | Fuzzy video detection method integrating optical flow algorithm and deep learning | |
CN109064497B (en) | Video tracking method based on color clustering supplementary learning | |
CN111161323A (en) | Complex scene target tracking method and system based on correlation filtering | |
JP2004054947A (en) | Object collation system, object collation method and object collation program | |
Peltomäki et al. | Evaluation of long-term LiDAR place recognition | |
CN116403085A (en) | Multi-pointer instrument identification method and device for transformer substation based on multitasking | |
CN115359124A (en) | High-precision binocular vision measuring method and device used under complex environment condition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||