CN108230389A - Unmanned aerial vehicle positioning method based on color space learning - Google Patents

Unmanned aerial vehicle positioning method based on color space learning

Info

Publication number
CN108230389A
Authority
CN
China
Prior art keywords
layers
image
cooperative target
color space
coordinate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201611153787.0A
Other languages
Chinese (zh)
Other versions
CN108230389B (en)
Inventor
赖百胜
章磊
朱纯午
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Ant Network Technology Co Ltd
Original Assignee
Hangzhou Ant Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Ant Network Technology Co Ltd
Priority to CN201611153787.0A
Publication of CN108230389A
Application granted
Publication of CN108230389B
Legal status: Active

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06V - IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 - Arrangements for image or video recognition or understanding
    • G06V10/40 - Extraction of image or video features
    • G06V10/56 - Extraction of image or video features relating to colour
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/24 - Classification techniques
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 - Indexing scheme for image analysis or image enhancement
    • G06T2207/10 - Image acquisition modality
    • G06T2207/10024 - Color image
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 - Indexing scheme for image analysis or image enhancement
    • G06T2207/20 - Special algorithmic details
    • G06T2207/20081 - Training; Learning
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 - Indexing scheme for image analysis or image enhancement
    • G06T2207/20 - Special algorithmic details
    • G06T2207/20084 - Artificial neural networks [ANN]
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 - Indexing scheme for image analysis or image enhancement
    • G06T2207/30 - Subject of image; Context of image processing
    • G06T2207/30204 - Marker
    • G06T2207/30208 - Marker matrix

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses an unmanned aerial vehicle (UAV) positioning method based on color space learning, comprising the following steps: select a cooperative target; define a deep neural network; minimize the value of L with the SGD algorithm to solve for the parameter values of the deep neural network; take a test image Z_t, input Z_t into the model M, obtain the corresponding output S, and compute Y_t; obtain the coordinates of the center of the cooperative target image in the camera coordinate system; obtain the coordinates of the centers of four cooperative target images in the camera coordinate system and take the mean of those coordinates, the mean being the positioning result. The invention requires no manually set thresholds and can adapt to a wide range of environmental changes given sufficient samples. Compared with existing color space conversion methods, it can learn from samples a better, more complex, and nonlinear color space.

Description

Unmanned aerial vehicle positioning method based on color space learning
Technical field
The present invention relates to unmanned aerial vehicle (UAV) positioning methods.
Background art
Existing UAV positioning methods photograph a target with a camera and detect the position of the target in the image, from which the position of the target in the UAV coordinate system can be inferred; the target is a cooperative target with known features. The drawback is that image-based target detection is easily affected by environmental changes. For example, when the cooperative target is a red square, the target images captured by the camera vary considerably under different illumination and weather conditions. With traditional binarization methods, different thresholds must be set manually for different environments, and threshold setting relies on experience, which greatly reduces the reliability of positioning.
Summary of the invention
The purpose of the present invention is to overcome the deficiencies of existing products and to provide a UAV positioning method based on color space learning.
In order to achieve the above object, the present invention adopts the following technical solution:
A UAV positioning method based on color space learning comprises the following steps:
(1) Select a cooperative target. The cooperative target is a square color block Q of color C, where the physical side length of Q is l. Label the set of N cooperative target images as I and the set of N label images as Y; in a label image, the region with pixel value 1 represents the region of color C, and the pixel value of all other regions is 0.
(2) Define a deep neural network. Input the set I of cooperative target images and the set Y of label images into the deep neural network, which then outputs L, S and S_{i,j,k}, where S_{i,j,k} is the element of S at coordinates (i, j, k), and S is the representation of each pixel in the learned color space.
(3) Minimize the value of L with the SGD algorithm to solve for the parameter values of the deep neural network; the deep neural network with the resulting parameter values is denoted model M.
(4) Take a test image Z_t and input Z_t into model M, which outputs the corresponding S; then compute Y_t = argmax_k S_{:,:,k}, where S_{:,:,k} is the matrix formed by all elements of S whose third coordinate is k. The region where Y_t equals 1 is the predicted region Q of color C. Compute the minimum enclosing square of region Q, label the pixel coordinates of the center of the square as (c_x, c_y), and label the pixel width of the square as w.
(5) Label the intrinsic matrix of the camera as K,

K = | f  0  u_0 |
    | 0  f  v_0 |
    | 0  0   1  |

where f is the focal length of the camera and (u_0, v_0) is the center of the camera image. The homogeneous coordinates of the center of the cooperative target image on the camera imaging plane are then p = K^{-1} (c_x, c_y, 1)^T, where K^{-1} is the inverse of the matrix K.
(6) The coordinates of the center of the cooperative target image in the camera coordinate system are obtained as P = (f·l / w) K^{-1} (c_x, c_y, 1)^T, where the depth f·l / w follows from the known physical side length l and the observed pixel width w.
(7) Choose four cooperative targets of different colors C and arrange the four cooperative targets in a matrix pattern. Obtain the coordinates of the centers of the four cooperative target images in the camera coordinate system and take the mean of those coordinates; the mean is the positioning result.
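A worked numeric example may clarify steps (5) and (6); every value below (focal length, image center, detected square) is an illustrative assumption, not a figure from the patent:

    assumed: f = 500 px, (u_0, v_0) = (320, 240), l = 0.2 m
    detected: (c_x, c_y) = (400, 200), pixel width w = 50 px
    depth: z = f·l / w = (500 × 0.2) / 50 = 2.0 m
    X = (c_x - u_0)·z / f = (400 - 320) × 2.0 / 500 = 0.32 m
    Y = (c_y - v_0)·z / f = (200 - 240) × 2.0 / 500 = -0.16 m

The target center therefore lies at (0.32, -0.16, 2.0) m in the camera coordinate system, which equals (f·l/w) K^{-1} (c_x, c_y, 1)^T for these values.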
The deep neural network of the present invention is: input layer - 1x1 convolutional layer A - ReLU1 layer - 1x1 convolutional layer B - ReLU2 layer - 1x1 convolutional layer C - softmax layer - cross-entropy layer, where the softmax layer is a classifier. The image enters through the input layer and passes through 1x1 convolutional layer A to output C1; C1 passes through the ReLU1 layer to output R1; R1 passes through 1x1 convolutional layer B to output C2; C2 passes through the ReLU2 layer to output R2; R2 passes through 1x1 convolutional layer C to output C3, which is the new color space; C3 passes through the softmax layer to output S; S and the set Y of label images are input to the cross-entropy layer, which outputs L. The image here is the set I of cooperative target images or a test image Z_t. The ReLU1 and ReLU2 layers are both defined as f(x) = max(0, x), where x is the argument of the function f.
The softmax layer is defined as:

S_{i,j,k} = exp(θ_k^T C3_{i,j,:}) / Σ_{k'=1..2} exp(θ_{k'}^T C3_{i,j,:}),

where θ is the classification coefficient of the softmax layer, θ_k denotes the k-th row of θ, θ_k^T is the transpose of θ_k, and C3_{i,j,:} is the column vector formed by all elements of C3 whose first two coordinates are i and j. In the present invention, θ ∈ R^{2×128}.
In the present invention, C1 ∈ R^{H×W×16}, R1 ∈ R^{H×W×16}, C2 ∈ R^{H×W×64}, R2 ∈ R^{H×W×64}, C3 ∈ R^{H×W×128}, and S ∈ R^{H×W×2}, where H is the height of the image, W is the width of the image, and R denotes the real numbers.
The cross-entropy layer is defined as L = -Σ_{i,j} [ Y_{i,j} log S_{i,j,1} + (1 - Y_{i,j}) log S_{i,j,2} ], where Y_{i,j} is the element of Y at coordinates (i, j), S_{i,j,1} is the value of S_{i,j,k} when k = 1, and S_{i,j,2} is the value of S_{i,j,k} when k = 2.
The convolution kernel of 1x1 convolutional layer A is K1, with K1 ∈ R^{1×1×3×16}; the convolution kernel of 1x1 convolutional layer B is K2, with K2 ∈ R^{1×1×16×64}; the convolution kernel of 1x1 convolutional layer C is K3, with K3 ∈ R^{1×1×64×128}.
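For concreteness, the following is a minimal PyTorch sketch of the network described above; it is not code from the patent. The class name ColorSpaceNet is an invented illustration, and everything beyond the stated structure (1x1 convolutions with 3 -> 16 -> 64 -> 128 channels and a 2-class softmax) is an assumption:

    import torch
    import torch.nn as nn

    class ColorSpaceNet(nn.Module):
        # 1x1 convolutions only, so each pixel is mapped independently; the
        # 128-channel activation C3 is the learned per-pixel color space.
        def __init__(self):
            super().__init__()
            self.convA = nn.Conv2d(3, 16, kernel_size=1)    # kernel K1 in R^{1x1x3x16}
            self.convB = nn.Conv2d(16, 64, kernel_size=1)   # kernel K2 in R^{1x1x16x64}
            self.convC = nn.Conv2d(64, 128, kernel_size=1)  # kernel K3 in R^{1x1x64x128}
            self.theta = nn.Conv2d(128, 2, kernel_size=1, bias=False)  # theta in R^{2x128}
            self.relu = nn.ReLU()

        def forward(self, x):  # x: (batch, 3, H, W) image
            c3 = self.convC(self.relu(self.convB(self.relu(self.convA(x)))))  # C3
            return torch.softmax(self.theta(c3), dim=1)  # S: (batch, 2, H, W)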
The beneficial effects of the present invention are as follows. Through a deep neural network, the present invention learns a new color space from a large number of samples and simultaneously learns a classifier, so that in the learned color space the color features under different scenes can be distinguished automatically. The output of the softmax layer of the neural network is the representation of each pixel in the learned color space. Compared with existing threshold-based methods, this method requires no manually set thresholds and can adapt to a wide range of environmental changes given sufficient samples. Compared with existing color space conversion methods, this method can learn from samples a better, more complex, and nonlinear color space.
Specific embodiment
The technical solution of the present invention is described further below:
A UAV positioning method based on color space learning comprises the following steps:
(1) Select a cooperative target. The cooperative target is a square color block Q of color C, where the physical side length of Q is l. Label the set of N cooperative target images as I and the set of N label images as Y; in a label image, the region with pixel value 1 represents the region of color C, and the pixel value of all other regions is 0.
(2) Define a deep neural network. Input the set I of cooperative target images and the set Y of label images into the deep neural network, which then outputs L, S and S_{i,j,k}, where S_{i,j,k} is the element of S at coordinates (i, j, k), and S is the representation of each pixel in the learned color space.
(3) Minimize the value of L with the SGD algorithm to solve for the parameter values of the deep neural network; the deep neural network with the resulting parameter values is denoted model M.
(4) Take a test image Z_t and input Z_t into model M, which outputs the corresponding S; then compute Y_t = argmax_k S_{:,:,k}, where S_{:,:,k} is the matrix formed by all elements of S whose third coordinate is k. The region where Y_t equals 1 is the predicted region Q of color C. Compute the minimum enclosing square of region Q, label the pixel coordinates of the center of the square as (c_x, c_y), and label the pixel width of the square as w.
(5) Label the intrinsic matrix of the camera as K,

K = | f  0  u_0 |
    | 0  f  v_0 |
    | 0  0   1  |

where f is the focal length of the camera and (u_0, v_0) is the center of the camera image. The homogeneous coordinates of the center of the cooperative target image on the camera imaging plane are then p = K^{-1} (c_x, c_y, 1)^T, where K^{-1} is the inverse of the matrix K.
(6) The coordinates of the center of the cooperative target image in the camera coordinate system are obtained as P = (f·l / w) K^{-1} (c_x, c_y, 1)^T, where the depth f·l / w follows from the known physical side length l and the observed pixel width w.
(7) Choose four cooperative targets of different colors C and arrange the four cooperative targets in a matrix pattern. Obtain the coordinates of the centers of the four cooperative target images in the camera coordinate system and take the mean of those coordinates; the mean is the positioning result.
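The positioning steps (4) through (7) can be sketched as follows, assuming a trained instance of the ColorSpaceNet module sketched above serves as model M, that OpenCV's minAreaRect stands in for the minimum enclosing square computation, and that class index 1 marks color C; this is an illustrative sketch, not the patent's implementation:

    import cv2
    import numpy as np
    import torch

    def locate_center(model, image, f, u0, v0, l):
        # steps (4)-(6): segment color C, fit the minimum enclosing square,
        # and back-project the square's center into the camera frame
        x = torch.from_numpy(image).float().permute(2, 0, 1).unsqueeze(0)
        with torch.no_grad():
            s = model(x)                                           # S: (1, 2, H, W)
        y_t = (s.argmax(dim=1)[0].numpy() == 1).astype(np.uint8)   # Y_t mask
        contours, _ = cv2.findContours(y_t, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
        (cx, cy), (rw, rh), _ = cv2.minAreaRect(max(contours, key=cv2.contourArea))
        w = max(rw, rh)            # pixel side length of the enclosing square
        z = f * l / w              # depth from the known physical side length l
        return np.array([(cx - u0) * z / f, (cy - v0) * z / f, z])

    # step (7): average the four target centers; how the four colors are
    # separated is not shown here (one model per color is assumed)
    # result = np.mean([locate_center(m, img, f, u0, v0, l) for m in models], axis=0)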
The deep neural network of the present invention is: input layer - 1x1 convolutional layer A - ReLU1 layer - 1x1 convolutional layer B - ReLU2 layer - 1x1 convolutional layer C - softmax layer - cross-entropy layer, where the softmax layer is a classifier. The image enters through the input layer and passes through 1x1 convolutional layer A to output C1; C1 passes through the ReLU1 layer to output R1; R1 passes through 1x1 convolutional layer B to output C2; C2 passes through the ReLU2 layer to output R2; R2 passes through 1x1 convolutional layer C to output C3, which is the new color space; C3 passes through the softmax layer to output S; S and the set Y of label images are input to the cross-entropy layer, which outputs L. The image here is the set I of cooperative target images or a test image Z_t.
The softmax layer is defined as:

S_{i,j,k} = exp(θ_k^T C3_{i,j,:}) / Σ_{k'=1..2} exp(θ_{k'}^T C3_{i,j,:}),

where θ is the classification coefficient of the softmax layer, θ_k denotes the k-th row of θ, θ_k^T is the transpose of θ_k, and C3_{i,j,:} is the column vector formed by all elements of C3 whose first two coordinates are i and j. In the present invention, θ ∈ R^{2×128}.
In the present invention, C1 ∈ R^{H×W×16}, R1 ∈ R^{H×W×16}, C2 ∈ R^{H×W×64}, R2 ∈ R^{H×W×64}, C3 ∈ R^{H×W×128}, and S ∈ R^{H×W×2}, where H is the height of the image, W is the width of the image, and R denotes the real numbers.
The cross-entropy layer is defined as L = -Σ_{i,j} [ Y_{i,j} log S_{i,j,1} + (1 - Y_{i,j}) log S_{i,j,2} ], where Y_{i,j} is the element of Y at coordinates (i, j), S_{i,j,1} is the value of S_{i,j,k} when k = 1, and S_{i,j,2} is the value of S_{i,j,k} when k = 2.
The convolution kernel of 1x1 convolutional layer A is K1, with K1 ∈ R^{1×1×3×16}; the convolution kernel of 1x1 convolutional layer B is K2, with K2 ∈ R^{1×1×16×64}; the convolution kernel of 1x1 convolutional layer C is K3, with K3 ∈ R^{1×1×64×128}. Minimizing the value of L with the SGD algorithm solves for the parameter values of the deep neural network; solving the parameter values here means solving for the values of K1, K2, K3 and θ, thereby obtaining the model M.
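A minimal training sketch of this solving step follows, again using the illustrative ColorSpaceNet module; the stand-in data, learning rate, and iteration count are assumptions, and negative log-likelihood over the softmax output reproduces the cross-entropy L:

    import torch
    import torch.nn as nn

    model = ColorSpaceNet()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)  # SGD minimizes L

    images = torch.rand(4, 3, 64, 64)    # stand-in for the training image set I
    masks = (images[:, 0] > 0.5).long()  # stand-in for the 0/1 label set Y

    for step in range(100):
        s = model(images)                # per-pixel class probabilities S
        # cross-entropy L between S and the labels Y, via log-probabilities
        loss = nn.functional.nll_loss(torch.log(s + 1e-8), masks)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    # the solved parameters are K1, K2, K3 and theta; the trained net is model M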
Through a deep neural network, the present invention learns a new color space from a large number of samples and simultaneously learns a classifier, so that in the learned color space the color features under different scenes can be distinguished automatically. The output of the softmax layer of the neural network is the representation of each pixel in the learned color space. Compared with existing threshold-based methods, this method requires no manually set thresholds, can adapt to a wide range of environmental changes given sufficient samples, and positions more reliably. Compared with existing color space conversion methods, this method can learn from samples a better, more complex, and nonlinear color space.
By the method for deep learning, from sample learning to a more preferably color space, in this color space, Can the easier color characteristics distinguished under varying environment, increase unmanned plane positioning robustness under various circumstances.
It should be noted that the above is only one specific embodiment of the present invention. Obviously, the invention is not limited to the above embodiment and may be varied in many ways; in short, all variations that a person of ordinary skill in the art can derive or infer directly from the disclosure of the present invention are regarded as falling within the protection scope of the present invention.

Claims (7)

1. An unmanned aerial vehicle positioning method based on color space learning, characterized in that it comprises the following steps:
(1) select a cooperative target, the cooperative target being a square color block Q of color C, where the physical side length of Q is l; label the set of N cooperative target images as I and the set of N label images as Y, where in a label image the region with pixel value 1 represents the region of color C and the pixel value of all other regions is 0;
(2) define a deep neural network; input the set I of cooperative target images and the set Y of label images into the deep neural network, which then outputs L, S and S_{i,j,k}, where S_{i,j,k} is the element of S at coordinates (i, j, k) and S is the representation of each pixel in the learned color space;
(3) minimize the value of L with the SGD algorithm to solve for the parameter values of the deep neural network; the deep neural network with the resulting parameter values is denoted model M;
(4) take a test image Z_t and input Z_t into model M, which outputs the corresponding S; then compute Y_t = argmax_k S_{:,:,k}, where S_{:,:,k} is the matrix formed by all elements of S whose third coordinate is k; the region where Y_t equals 1 is the predicted region Q of color C; compute the minimum enclosing square of region Q, label the pixel coordinates of the center of the square as (c_x, c_y), and label the pixel width of the square as w;
(5) label the intrinsic matrix of the camera as K,

K = | f  0  u_0 |
    | 0  f  v_0 |
    | 0  0   1  |

where f is the focal length of the camera and (u_0, v_0) is the center of the camera image; the homogeneous coordinates of the center of the cooperative target image on the camera imaging plane are then p = K^{-1} (c_x, c_y, 1)^T, where K^{-1} is the inverse of the matrix K;
(6) obtain the coordinates of the center of the cooperative target image in the camera coordinate system as P = (f·l / w) K^{-1} (c_x, c_y, 1)^T;
(7) choose four cooperative targets of different colors C and arrange the four cooperative targets in a matrix pattern; obtain the coordinates of the centers of the four cooperative target images in the camera coordinate system and take the mean of those coordinates; the mean is the positioning result.
2. The unmanned aerial vehicle positioning method based on color space learning according to claim 1, characterized in that the deep neural network is: input layer - 1x1 convolutional layer A - ReLU1 layer - 1x1 convolutional layer B - ReLU2 layer - 1x1 convolutional layer C - softmax layer - cross-entropy layer, wherein the softmax layer is a classifier; the image enters through the input layer and passes through 1x1 convolutional layer A to output C1; C1 passes through the ReLU1 layer to output R1; R1 passes through 1x1 convolutional layer B to output C2; C2 passes through the ReLU2 layer to output R2; R2 passes through 1x1 convolutional layer C to output C3, which is the new color space; C3 passes through the softmax layer to output S; S and the set Y of label images are input to the cross-entropy layer, which outputs L; the image is the set I of cooperative target images or a test image Z_t; the ReLU1 and ReLU2 layers are both defined as f(x) = max(0, x), where x is the argument of the function f.
3. The unmanned aerial vehicle positioning method based on color space learning according to claim 2, characterized in that the softmax layer is defined as:

S_{i,j,k} = exp(θ_k^T C3_{i,j,:}) / Σ_{k'=1..2} exp(θ_{k'}^T C3_{i,j,:}),

where θ is the classification coefficient of the softmax layer, θ_k denotes the k-th row of θ, θ_k^T is the transpose of θ_k, and C3_{i,j,:} is the column vector formed by all elements of C3 whose first two coordinates are i and j.
4. The unmanned aerial vehicle positioning method based on color space learning according to claim 3, characterized in that θ ∈ R^{2×128}.
5. The unmanned aerial vehicle positioning method based on color space learning according to claim 2, characterized in that C1 ∈ R^{H×W×16}, R1 ∈ R^{H×W×16}, C2 ∈ R^{H×W×64}, R2 ∈ R^{H×W×64}, C3 ∈ R^{H×W×128}, and S ∈ R^{H×W×2}, where H is the height of the image, W is the width of the image, and R denotes the real numbers.
6. The unmanned aerial vehicle positioning method based on color space learning according to claim 2, characterized in that the cross-entropy layer is defined as L = -Σ_{i,j} [ Y_{i,j} log S_{i,j,1} + (1 - Y_{i,j}) log S_{i,j,2} ], where Y_{i,j} is the element of Y at coordinates (i, j), S_{i,j,1} is the value of S_{i,j,k} when k = 1, and S_{i,j,2} is the value of S_{i,j,k} when k = 2.
7. The unmanned aerial vehicle positioning method based on color space learning according to claim 2, characterized in that the convolution kernel of 1x1 convolutional layer A is K1, with K1 ∈ R^{1×1×3×16}; the convolution kernel of 1x1 convolutional layer B is K2, with K2 ∈ R^{1×1×16×64}; and the convolution kernel of 1x1 convolutional layer C is K3, with K3 ∈ R^{1×1×64×128}.
CN201611153787.0A 2016-12-14 2016-12-14 Unmanned aerial vehicle positioning method based on color space learning Active CN108230389B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611153787.0A CN108230389B (en) 2016-12-14 2016-12-14 Unmanned aerial vehicle positioning method based on color space learning


Publications (2)

Publication Number Publication Date
CN108230389A true CN108230389A (en) 2018-06-29
CN108230389B CN108230389B (en) 2021-07-13

Family

ID=62638966

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611153787.0A Active CN108230389B (en) 2016-12-14 2016-12-14 Unmanned aerial vehicle positioning method based on color space learning

Country Status (1)

Country Link
CN (1) CN108230389B (en)


Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102967305A (en) * 2012-10-26 2013-03-13 南京信息工程大学 Multi-rotor unmanned aerial vehicle pose acquisition method based on markers in shape of large and small square
CN103218607A (en) * 2013-04-11 2013-07-24 北京航空航天大学 Cooperative target designing and locating method for unmanned aerial vehicle autonomous landing
CN104392234A (en) * 2014-11-27 2015-03-04 中国人民解放军国防科学技术大学 Image fast Fourier transformation (FFT) symbol information based unmanned aerial vehicle autonomous landing target detection method
CN106054931A (en) * 2016-07-29 2016-10-26 北方工业大学 Unmanned aerial vehicle fixed-point flight control system based on visual positioning

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ALI PIRSHAYAN et al.: "Target Localization Using Cooperative Unmanned Aerial Vehicles", Advances in Computer Science: An International Journal *
刘全波 et al.: "Vision-based automatic landing positioning algorithm for unmanned aerial vehicles" (基于视觉的无人机自动着陆定位算法), 《电子产品世界》 (Electronic Products World) *
王保平 et al.: "Research on an image retrieval method based on local color-spatial features" (基于局部颜色_空间特征的图像检索方法研究), 《计算机应用与软件》 (Computer Applications and Software) *
陈盛福: "Research on technologies related to vision-based automatic landing of unmanned aerial vehicles" (基于视觉的无人机自动着陆相关技术研究), 《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑(月刊)》 (China Masters' Theses Full-text Database, Engineering Science and Technology II, monthly) *

Also Published As

Publication number Publication date
CN108230389B (en) 2021-07-13

Similar Documents

Publication Publication Date Title
CN106326937B (en) Crowd density distribution estimation method based on convolutional neural networks
CN105631880B (en) Lane line dividing method and device
CN106127702B (en) A kind of image defogging method based on deep learning
CN107507172A (en) Merge the extra high voltage line insulator chain deep learning recognition methods of infrared visible ray
CN101394573B (en) Panoramagram generation method and system based on characteristic matching
CN110188835B (en) Data-enhanced pedestrian re-identification method based on generative confrontation network model
CN108509920B (en) CNN-based face recognition method for multi-patch multi-channel joint feature selection learning
CN107993260A (en) A kind of light field image depth estimation method based on mixed type convolutional neural networks
CN108108746A (en) License plate character recognition method based on Caffe deep learning frames
CN107424142A (en) A kind of weld joint recognition method based on saliency detection
CN103679677B (en) A kind of bimodulus image decision level fusion tracking updating mutually based on model
CN108319973A (en) Detection method for citrus fruits on tree
CN108614994A (en) A kind of Human Head Region Image Segment extracting method and device based on deep learning
CN103426182A (en) Electronic image stabilization method based on visual attention mechanism
CN105719247A (en) Characteristic learning-based single image defogging method
CN110378247A (en) Virtual objects recognition methods and device, storage medium and electronic device
CN109045676B (en) Chinese chess recognition learning algorithm and robot intelligent system and method based on algorithm
CN109191429B (en) 3D printing nozzle detection method based on machine vision
CN107067015A (en) A kind of vehicle checking method and device based on multiple features deep learning
CN109754440A (en) A kind of shadow region detection method based on full convolutional network and average drifting
CN105825204A (en) Method for identifying power equipment meter at night through intelligent camera
CN106991428A (en) Insect image-recognizing method based on adaptive pool model
CN110298893A (en) A kind of pedestrian wears the generation method and device of color identification model clothes
CN107862333A (en) A kind of method of the judgment object combustion zone under complex environment
CN104751487B (en) A kind of moving target detecting method based on the plane discoloration frames of colored RGB tri- difference

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant