CN109886325B - Template selection and accelerated matching method for nonlinear color space classification - Google Patents

Template selection and accelerated matching method for nonlinear color space classification

Info

Publication number: CN109886325B
Authority: CN (China)
Prior art keywords: image, template, matched, region, color
Legal status: Active
Application number: CN201910105261.2A
Other languages: Chinese (zh)
Other versions: CN109886325A
Inventors: 贾迪, 王伟, 孟祥福, 朱宁丹, 杨宁华
Current Assignee: Liaoning Technical University
Original Assignee: Liaoning Technical University
Application filed by Liaoning Technical University
Priority to CN201910105261.2A; publication of CN109886325A; application granted; publication of CN109886325B

Landscapes

  • Image Analysis (AREA)

Abstract

The invention provides a template selection and accelerated matching method for nonlinear color space classification, comprising a model training process and an image matching process. The model training process: collect training image samples, extract their CIE chromaticity diagram, and manually label the color class numbers; obtain a five-layer feedforward neural network model. The image matching process: input a pair of color images and set a sampling rate; perform alternate-point down-sampling; obtain the classification result sets; calculate the similarity-probability measure; select the i corresponding to the top k highest scores as the preferred color class numbers, and from them determine the template regions in the template image and the regions to be matched in the image to be matched; compute the matching relation between the template regions in the template image and the regions to be matched in the image to be matched. Experimental results show that the method achieves a higher registration rate and execution speed, and it resolves the inconsistency, present in existing matching methods, between color distances measured in color space by a linear model and human visual judgment.

Description

Template selection and accelerated matching method for nonlinear color space classification
Technical Field
The invention belongs to the field of image processing, and particularly relates to a template selection and accelerated matching method for nonlinear color space classification.
Background
Template matching algorithms typically consider all possible transformations, including rotation, scale, and affine transformations. Alexe et al. provide an efficient way to process the high-dimensional vectors of two image-matching windows, extracting the boundary of the windows' overlapping portion and using it to constrain and match multiple windows. Tsai et al. propose wavelet decomposition with ring projection to improve matching accuracy under repeated rotation transformations. Kim et al. present a gray-scale template matching method that is more robust to rotation and scale changes. Yao et al. propose a color texture search method that likewise accounts for rotation and scaling. Under wide-baseline conditions, these latter three methods suffer from low matching quality. Another related study is the work of Tian et al., which estimates the parameters of a dense deformation field and obtains a minimum transformation distance in the target transformation parameter space. FAST-Match, proposed by Korman et al. in 2013, determines the matching result by sampling pixels to compute the minimum SAD over the matching region and accelerates the search with global template matching, but color images must be converted to grayscale before matching. Building on this method, one study realizes coarse-to-fine region selection and matching. CFAST-Match, proposed by Jia et al., improves the accuracy of color image template matching by computing the proportions of different colors within the template region; however, it requires some parameters to be set empirically, and because it uses DBSCAN density clustering, its execution time grows long on large images, reducing the method's practicality.
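As background, the SAD criterion that FAST-Match minimizes can be sketched as follows; this is an illustrative simplification, since the actual algorithm samples pixel subsets and searches a net of affine transformations, which is omitted here:

```python
import numpy as np

def sad(template: np.ndarray, window: np.ndarray) -> float:
    """Sum of absolute differences between a grayscale template and an
    equally sized window of the target image (lower means more similar)."""
    return float(np.abs(template.astype(np.float64)
                        - window.astype(np.float64)).sum())
```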
Disclosure of Invention
In view of these technical defects, the invention aims to use the nonlinear computing power of a neural network to express this complex relation: a training set for the neural network is constructed on the basis of the CIE chromaticity diagram to classify image colors, and the trained network is used to realize image segmentation. The invention provides a template selection and accelerated matching method for nonlinear color space classification, comprising a model training process and an image matching process, with the following specific steps:

Model training process:

Step 1: collect training image samples, extract their CIE chromaticity diagram, acquire each MacAdam ellipse and record it as a color region, extract the RGB value corresponding to each color region, and manually label the color class number to which it belongs;

Step 2: train a five-layer feedforward neural network model on the collected training image samples to obtain the five-layer feedforward neural network model, comprising: an input layer, a first hidden layer, a second hidden layer, a third hidden layer, and an output layer;

the input layer has 3 neurons, representing respectively the R, G, and B values corresponding to each color region extracted from the training image samples; the output layer represents the color class number;
Image matching process:

Step 4: input a pair of color images $I_1$ and $I_2$ and record the image pair $I_1$ and $I_2$, where $I_1$ is the template image and $I_2$ the image to be matched; set a sampling rate $\alpha$;

Step 5: using the set sampling rate $\alpha$, perform alternate-point down-sampling on the image pair $I_1$ and $I_2$ to obtain $I_1^D$ and $I_2^D$;

Step 6: process $I_1^D$ and $I_2^D$ with the obtained five-layer feedforward neural network model to obtain the classification result sets $C_{I_1}^D$ and $C_{I_2}^D$ and the numbers of categories $N_{I_1}^D$ and $N_{I_2}^D$, where $N_{I_1}^D$ and $N_{I_2}^D$ are respectively the numbers of elements in the sets $C_{I_1}^D$ and $C_{I_2}^D$;

the classification result sets $C_{I_1}^D$ and $C_{I_2}^D$ are calculated with the following formula:

$C_{I_i}^D = Cluster(Desend(I_i, \alpha)), \quad i = 1, 2$

where Cluster denotes classification with the five-layer feedforward neural network model and Desend denotes alternate-point down-sampling, i.e. $I^D = Desend(I, \alpha)$;

Step 7: establish an index of the similar clusters through the index matrix $IM[i][j]$ and calculate the similarity-probability measure $Score(i)$:

$Score(i) = \dfrac{1}{\varepsilon + count_j\left(IM[i][j] \neq 0\right)}$

where $IM[i][j]$ is the index matrix, $\varepsilon$ is a real number ensuring that the denominator is not 0, and count() counts the number of entries;

the index matrix $IM[i][j]$ is calculated as follows:

$IM[i][j] = \begin{cases} 1, & \text{the cluster centers } \bar{C}_{I_1}^D(i) \text{ and } \bar{C}_{I_2}^D(j) \text{ belong to the same color class} \\ 0, & \text{otherwise} \end{cases}$

Step 8: according to $Score(i)$, select the $i$ corresponding to the top $k$ highest scores as the preferred color class numbers; the obtained five-layer feedforward neural network model has already color-classified $I_1^D$ and $I_2^D$, region growing is performed on the classification result to establish the correspondence between color class numbers and regions, and the template regions in the template image and the regions to be matched in the image to be matched are determined according to the preferred color class numbers;

Step 9: calculate the similarity between the template region in the template image and the region to be matched in the image to be matched to obtain the matching relation between them: let the region similarity between the images $I_1^D$ and $I_2^D$ be $\Delta_T(I_1^D, I_2^D)$, with $T$ the affine transformation matrix from pixels $p$ in $I_1^D$ to pixels in $I_2^D$; the similarity between regions is calculated as follows:

$\Delta_T(I_1^D, I_2^D) = \dfrac{1}{n_1^2} \sum_{p \in I_1^D} \mathbb{1}\left[C_{I_1}^D(p) = C_{I_2}^D(T(p))\right]$

The similarity values $\Delta_T(I_1^D, I_2^D)$ corresponding to all affine transformations $T$ are calculated by the above formula, and the maximum over all similarity values $\Delta_T(I_1^D, I_2^D)$ is taken as the final result, indicating the best match between the template region in the template image and the region to be matched in the image to be matched.
Beneficial technical effects:

The invention classifies the color space with a neural network and provides a method for determining the matching template and the region to be matched from the classification result. Experimental results show that, compared with existing methods, the method achieves a higher registration rate and execution speed, and it addresses several problems of existing matching methods: 1) the inconsistency between color distances measured in color space by a linear model and human visual judgment; 2) the need to select the template region manually; 3) the low execution efficiency of template matching.
Drawings
FIG. 1 is a flowchart of a template selection and accelerated matching method for nonlinear color space classification according to an embodiment of the present invention;
FIG. 2 illustrates the problem with linear color distance calculation; FIG. 2(a) is an image of RGB color value (249, 255, 121), FIG. 2(b) is an image of RGB color value (221, 255, 121), and FIG. 2(c) is an image of RGB color value (212, 255, 81);
FIG. 3 shows the MacAdam ellipses on the CIE chromaticity diagram for an embodiment of the present invention;
FIG. 4 is a CIE chromaticity diagram according to an embodiment of the present invention;
FIG. 5 is a five-layer neural network model according to an embodiment of the present invention;
FIG. 6 is a comparison of experimental results for examples of the present invention; wherein, fig. 6 (a) and fig. 6 (c) are the same template region; FIG. 6 (b) is an experimental result obtained by the method of the present invention; FIG. 6 (d) is the experimental results obtained with CFAST; FIG. 6 (e) is an enlarged view of experimental results obtained by the method of the present invention; FIG. 6 (f) is an enlarged view of experimental results obtained using CFAST;
FIG. 7 illustrates template matching location selection according to an embodiment of the present invention; among them, fig. 7 (a) is an original image; fig. 7 (b) is an image obtained by affine transformation;
FIG. 8 shows the experimental results 1 of the examples of the present invention; wherein, fig. 8 (a) is the down-sampled matching image; FIG. 8 (b) is a down-sampled target image; FIG. 8 (c) is a score map; FIG. 8 (d) is a high score region in a matching image; FIG. 8 (e) is a possible matching region in the target image corresponding to a high score region; FIG. 8 (f) is a possible matching region in the target image corresponding to a high score region; FIG. 8 (g) is a possible matching region in the target image corresponding to a high score region; FIG. 8 (h) is a possible matching region in the target image corresponding to a high score region; FIG. 8 (i) is a template region automatically selected; FIG. 8 (j) is the matching result; FIG. 8 (k) is the corresponding enlarged region in (i) and (j);
FIG. 9 shows the experimental result 2 of the example of the present invention; wherein fig. 9 (a) and 9 (b) are down-sampled image pairs; FIG. 9 (c) is a score plot; FIG. 9 (d) is a diagram of 4 cluster regions selected by the score map; fig. 9 (e) is a clustering region where the cluster centers of the selected clustering region 1 in the image to be matched are similar; fig. 9 (f) is a cluster region where the cluster centers of the selected cluster region 2 in the image to be matched are similar; fig. 9 (g) is a cluster region where the cluster centers of the selected cluster region 3 in the image to be matched are similar; fig. 9 (h) is a clustering region where the cluster centers of the selected clustering region 4 in the image to be matched are similar; FIG. 9 (i) is a template selection position determined by the clustering region of FIG. 9 (d); FIG. 9 (j) is the matching result obtained by the template of FIG. 9 (i); fig. 9 (k) is a corresponding enlarged region of (i) and (j).
Detailed Description
The invention is further described below with reference to the accompanying drawings and specific embodiments. The invention provides a template selection and accelerated matching method for nonlinear color space classification, comprising a model training process and an image matching process, as shown in FIG. 1, with the following specific steps:
Model training process:

Step 1: collect training image samples, extract their CIE chromaticity diagram, acquire each MacAdam ellipse and record it as a color region, extract the RGB value corresponding to each color region, and manually label the color class number to which it belongs;
the color image has a richer information space than the gray image, and most of the existing color image matching methods usually use a linear formula to calculate the similarity between colors, for example, CFAST uses the euclidean distance between RGB to calculate the similarity F (I) between two pixels 1 (p),I 2 (T(p))):
Figure BDA0001966582000000041
Dist (, in the formula) is used for calculating the similarity between two input parameters, Δ s (p) is a score coefficient of a region where p is located, and r is a distance threshold radius.
Figure BDA0001966582000000042
Are respectively I 1 The channel values of R, G and B at the position of middle p,
Figure BDA0001966582000000043
are respectively I 2 The R, G and B channel values at the position of middle T (p), and C (I1 (p)) is I 1 The cluster center RGB value at the position of p in the middle, I2 (T (p)) is I 2 I 2 Cluster center value at the middle T (p) position. The method adopts the Euclidean distance of the RGB space to calculate the similarity between colors, no matter what color space: RGB, lab and HSV all relate to calculating similarity between colors, and Euclidean distance and Manhattan distance are common linear methods. TheseThe method has the problem that the calculation result is inconsistent with the observation result when the color distance is calculated, taking RGB and LAB color space as an example: 1) The colors represented by the RGB color space differ from the colors recognized by the human visual system HVS such that the colors with the smallest distance are not necessarily similar; 2) Colors having the same distance in the LAB color space are not necessarily similar. As shown in fig. 2, the RGB color values in fig. 2 (a), 2 (B), and 2 (C) are (249, 255, 121), (221, 255, 121), (212, 255, 81), respectively, which correspond to the three points a, B, and C in fig. 3. Wherein the euclidean distance between fig. 2 (a) and fig. 2 (b) is 28, the manhattan distance is 28, the euclidean distance between fig. 2 (b) and fig. 2 (c) is 41, and the manhattan distance is 49, and the calculation results in greater similarity between fig. 2 (a) and fig. 2 (b) in terms of both the euclidean distance and the manhattan distance, but from the human visual point of view, fig. 2 (b) and fig. 2 (c) are more similar, while fig. 2 (a) and fig. 2 (b) are not.
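The quoted distances are easy to verify; the following short Python check (illustrative only) reproduces them:

```python
import numpy as np

# RGB values of FIG. 2(a), 2(b), 2(c), i.e. points A, B, C in FIG. 3.
a = np.array([249.0, 255.0, 121.0])
b = np.array([221.0, 255.0, 121.0])
c = np.array([212.0, 255.0, 81.0])

euclid = lambda u, v: float(np.linalg.norm(u - v))
manhat = lambda u, v: float(np.abs(u - v).sum())

print(euclid(a, b), manhat(a, b))  # 28.0 28.0
print(euclid(b, c), manhat(b, c))  # 41.0 49.0
# Both linear metrics rank (a, b) as the closer pair, although the human
# eye judges (b) and (c) to be the more similar colors.
```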
The similarity between pixel colors is not a simple linear relation; the human eye perceives differences between spectral colors non-uniformly. The MacAdam ellipse fixes its boundary according to color tolerance, thereby providing guidance on the color-vision accuracy of the ordinary human eye and its ability to distinguish similar colors. The MacAdam ellipses on the CIE chromaticity diagram are shown in FIG. 3, drawn at ten times actual size. The center of each ellipse represents the standard color under test, the elliptical area represents the range of colors the human eye cannot distinguish from the center color, and the periphery consists of points whose colors the eye can distinguish from the center; the ellipses differ in size from region to region because the eye's color discrimination varies. Since the size and orientation of an ellipse vary with its center position, color difference cannot be measured by Euclidean or Manhattan distance in this space. The invention uses the nonlinear computing power of a neural network to express this complex relation: a training set for the neural network is constructed on the basis of the CIE chromaticity diagram to classify image colors, and the trained network realizes image segmentation.

Step 2: train a five-layer feedforward neural network model on the collected training image samples, the model comprising: an input layer, a first hidden layer, a second hidden layer, a third hidden layer, and an output layer, as shown in FIG. 5;

the input layer has 3 neurons, representing respectively the R, G, and B values corresponding to each color region extracted from the training image samples; the output layer represents the color class number;
the neural network training samples are from a CIE chromaticity diagram, and when the training samples are collected, RGB values corresponding to each color region are extracted, and the color class numbers to which the color regions belong are manually labeled, as shown in fig. 4. Since the figure does not relate to black and grey areas, it is expanded to 25 different colour classes, where class number 24 indicates grey and class number 25 indicates black. Five-layer feedforward neural network models were used for training, as shown in fig. 5. The number of the neurons of the input layer is 3, the neurons respectively represent input R, G and B values, and the first hidden layer comprises 51 neurons; the second hidden layer contains 60 neurons, the third hidden layer contains 42 neurons, and the number of output neurons is 25, which represents the color class number.
For a color image I, the R, G, and B data are input into the neural network for color classification, giving the classification result set $C_I$ over all pixels:

$C_I = Cluster(I_R, I_G, I_B)$

where Cluster() performs the classification, here by the neural network; $I_R$, $I_G$, and $I_B$ are respectively the R, G, and B channel values of image I. Template matching is based on calculating distances between pixels, and the overall similarity is accumulated from the similarities of all pixels in the template region; since the similarity between colors is nonlinear, this can cause mismatches. The method instead classifies the whole image pair to obtain a segmentation result: each position of the result matrix holds a class number from 1 to 25, and whether the colors of two pixels are consistent is decided by comparing their class numbers, which avoids distance-based calculation and improves the registration rate. Let the region similarity between images $I_1$ and $I_2$ be $\Delta_T(I_1, I_2)$, and let $T$ be the affine transformation matrix from pixels $p$ in $I_1$ to pixels in $I_2$; the similarity between regions is calculated as follows:
$\Delta_T(I_1, I_2) = \dfrac{1}{n_1^2} \sum_{p \in I_1} \mathbb{1}\left[C_{I_1}(p) = C_{I_2}(T(p))\right]$

According to $Score(i)$, the $i$ corresponding to the top $k$ highest scores are selected as the preferred classification numbers. Since the neural network has already color-classified $I_1^D$ and $I_2^D$, region growing is performed on the classification result to establish the correspondence between classification numbers and regions, and the template regions and the regions to be matched are obtained according to the preferred classification numbers. Let the region similarity between the images $I_1^D$ and $I_2^D$ be $\Delta_T(I_1^D, I_2^D)$, with $T$ the affine transformation matrix from pixels $p$ in $I_1^D$ to pixels in $I_2^D$; the similarity between the regions is calculated as follows:

$\Delta_T(I_1^D, I_2^D) = \dfrac{1}{n_1^2} \sum_{p \in I_1^D} \mathbb{1}\left[C_{I_1}^D(p) = C_{I_2}^D(T(p))\right]$

The degree of matching is determined by comparing the similarities between the regions.
FIGS. 6(a) and 6(c) show the same template region, used respectively as the input of the present method and of the CFAST method; the region contains rich color information. FIG. 6(b) is the experimental result obtained with the present method, and FIG. 6(e) is the corresponding partial enlargement. FIG. 6(d) is the experimental result obtained with CFAST, and FIG. 6(f) is the corresponding partial enlargement. The experimental results show that the present method attains higher matching accuracy than the CFAST method.
CFAST applies density clustering to the entire image, so it executes for a long time on high-resolution images. Define two color images $I_1$ and $I_2$ of sizes $n_1 \times n_1$ and $n_2 \times n_2$, and let $\Omega$ be the set of affine transformations from $I_1$ to $I_2$. When processing a large image, selecting a proper template and locating the position to be matched improve matching accuracy and shorten matching execution time. For example, the shape of FIG. 7(a) is composed mostly of the RGB value (255, 0) together with the RGB value (119, 117, 162), which occurs as rarely as possible. FIG. 7(b) is the image obtained from FIG. 7(a) by affine transformation; selecting as template an area whose colors or color combinations appear as rarely as possible in the target image readily improves matching accuracy. If the search area to be matched is a local rectangular area $n_2' \times n_2'$ of the image, then since $n_2 \times n_2 \gg n_2' \times n_2'$, reducing $\Omega$ effectively increases the search speed. The purposes are: 1) to provide the location of the best template selection; 2) to provide the location of the template selection and determine the matching region of the template in the target image.
The input color image I is down-sampled to obtain image $I^D$, which is then classified to obtain the set $C$ of per-pixel classification results:

$C_I^D = Cluster(Desend(I, \alpha))$

where Cluster classifies with the five-layer feedforward neural network model and Desend is alternate-point down-sampling, i.e. $I^D = Desend(I, \alpha)$.
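A minimal sketch of this pipeline, under the assumption that alternate-point down-sampling with rate $\alpha$ keeps every $\alpha$-th row and column, and that `classify` is a hypothetical wrapper around the trained network returning one class number per RGB triple:

```python
import numpy as np

def desend(image: np.ndarray, alpha: int) -> np.ndarray:
    """Alternate-point down-sampling: keep every alpha-th row and column."""
    return image[::alpha, ::alpha]

def cluster(image_d: np.ndarray, classify) -> np.ndarray:
    """Run the trained color classifier over every down-sampled pixel and
    return the per-pixel class-number map C (values 1..25)."""
    h, w, _ = image_d.shape
    rgb = image_d.reshape(-1, 3).astype(np.float32)
    return classify(rgb).reshape(h, w)

# C1 = cluster(desend(I1, alpha), classify)   # classification result set of I1
# C2 = cluster(desend(I2, alpha), classify)   # classification result set of I2
```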
The two images $I_1$ and $I_2$ are processed with the above formula to obtain the sets $C_{I_1}^D$ and $C_{I_2}^D$; the center $\bar{C}_{I_1}^D$ and $\bar{C}_{I_2}^D$ of each cluster is then calculated, and for each cluster center of $C_{I_1}^D$ the reciprocal of the number of similar clusters in $C_{I_2}^D$ is taken as the measure Score of the probability of similarity in the target image. The Top-k clusters selected by Score serve as the templates, improving the accuracy of template matching.
$Score(i) = \dfrac{1}{\varepsilon + count_j\left(IM[i][j] \neq 0\right)}$

where $IM[i][j]$ is the index matrix of the clusters in $C_{I_2}^D$ whose centers are similar to the cluster center $\bar{C}_{I_1}^D(i)$, calculated as

$IM[i][j] = \begin{cases} 1, & \bar{C}_{I_1}^D(i) \text{ and } \bar{C}_{I_2}^D(j) \text{ belong to the same color class} \\ 0, & \text{otherwise} \end{cases}$

where $\varepsilon$ is a minimum value used to ensure that the denominator is not 0, and $count(IM[i][j] \neq 0)$ counts the clusters of $C_{I_2}^D$ similar to cluster $i$. $Score(i)$ is a score table: the higher the score, the smaller the probability that the color or color combination appears in the target.
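Under the reconstruction above, the score table and the Top-k selection can be sketched as follows; `classify` is again the hypothetical per-RGB classifier, and `centers1` and `centers2` are the per-cluster mean RGB values $\bar{C}_{I_1}^D$ and $\bar{C}_{I_2}^D$:

```python
import numpy as np

def score_table(centers1, centers2, classify, eps=0.001):
    """IM[i][j] = 1 when the network puts cluster centers i (of I1) and
    j (of I2) in the same color class; Score(i) is the reciprocal of the
    number of similar clusters, eps guarding against a zero denominator."""
    c1 = classify(np.asarray(centers1, dtype=np.float32))
    c2 = classify(np.asarray(centers2, dtype=np.float32))
    im = (c1[:, None] == c2[None, :]).astype(int)          # index matrix IM
    score = 1.0 / (eps + np.count_nonzero(im, axis=1))     # Score(i)
    return im, score

def top_k_templates(score, k=4):
    """Indices i of the k highest Score(i): the preferred class numbers."""
    return np.argsort(score)[::-1][:k]

# Matching then only scans the bounding boxes of the clusters j with
# IM[i][j] == 1, shrinking the search area from g x h to a few g' x h'.
```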
$g \times h \rightarrow \sum_i g_i' \times h_i', \quad i = 1 \ldots n$

The Top-k clusters are selected by $Score(i)$. By the above formula, the search area is reduced from the $g \times h$ of the whole image to several $g' \times h'$ search areas, and the affine transformation set $\Omega$ is reduced. The specific steps are as follows:
Image matching process:

Step 4: input a pair of color images $I_1$ and $I_2$, record the image pair $I_1$ and $I_2$, and set the sampling rate $\alpha$;

Step 5: using the set sampling rate $\alpha$, perform alternate-point down-sampling on the image pair $I_1$ and $I_2$ to obtain $I_1^D$ and $I_2^D$;

Step 6: process $I_1^D$ and $I_2^D$ with the optimized neural network model to obtain the classification result sets $C_{I_1}^D$ and $C_{I_2}^D$ and the numbers of categories $N_{I_1}^D$ and $N_{I_2}^D$, where $N_{I_1}^D$ and $N_{I_2}^D$ are respectively the numbers of elements in the sets $C_{I_1}^D$ and $C_{I_2}^D$;

the classification result sets $C_{I_1}^D$ and $C_{I_2}^D$ are calculated with the following formula:

$C_{I_i}^D = Cluster(Desend(I_i, \alpha)), \quad i = 1, 2$

where Cluster classifies with the five-layer feedforward neural network model and Desend is alternate-point down-sampling, i.e. $I^D = Desend(I, \alpha)$;

Step 7: establish an index of the similar clusters through the index matrix $IM[i][j]$ and calculate the similarity-probability measure $Score(i)$:

$Score(i) = \dfrac{1}{\varepsilon + count_j\left(IM[i][j] \neq 0\right)}$

where $IM[i][j]$ is the index matrix, $\varepsilon$ is a minimum value used to ensure that the denominator is not 0, taken as 0.001 in this embodiment, and count() counts the number of entries;

the index matrix $IM[i][j]$ is calculated as follows:

$IM[i][j] = \begin{cases} 1, & \bar{C}_{I_1}^D(i) \text{ and } \bar{C}_{I_2}^D(j) \text{ belong to the same color class} \\ 0, & \text{otherwise} \end{cases}$

Step 8: according to $Score(i)$, select the $i$ corresponding to the top $k$ highest scores as the preferred color class numbers; the obtained five-layer feedforward neural network model has already color-classified $I_1^D$ and $I_2^D$, region growing is performed on the classification result to establish the correspondence between color class numbers and regions, and the template regions in the template image and the regions to be matched in the image to be matched are determined according to the preferred color class numbers;

Step 9: calculate the similarity between the template region in the template image and the region to be matched in the image to be matched to obtain the matching relation between them: let the region similarity between the images $I_1^D$ and $I_2^D$ be $\Delta_T(I_1^D, I_2^D)$, with $T$ the affine transformation matrix from pixels $p$ in $I_1^D$ to pixels in $I_2^D$; the similarity between regions is calculated as follows:

$\Delta_T(I_1^D, I_2^D) = \dfrac{1}{n_1^2} \sum_{p \in I_1^D} \mathbb{1}\left[C_{I_1}^D(p) = C_{I_2}^D(T(p))\right]$

The similarity values $\Delta_T(I_1^D, I_2^D)$ corresponding to all affine transformations $T$ are calculated by the above formula, and the maximum over all similarity values is taken as the final result, indicating the best match between the template region in the template image and the region to be matched in the image to be matched.
FIGS. 8(a) and 8(b) are the down-sampled image pair, and FIG. 8(c) is the score map, obtained with the calculation method of step 7. FIG. 8(d) shows the 4 cluster regions selected by the score map. FIGS. 8(e), 8(f), 8(g), and 8(h) are the cluster regions in the image to be matched whose cluster centers are similar to those of the selected cluster regions. As the figures show, the selected template regions all lie within cluster regions of the target image, so during matching the final result only needs to be searched within those cluster regions. FIG. 8(i) is the template selection position determined by the cluster regions of FIG. 8(d), FIG. 8(j) is the matching result obtained with the template of FIG. 8(i), and FIG. 8(k) shows the corresponding enlarged regions of (i) and (j); I, II, III, and IV are the corresponding region numbers.
FIGS. 9(a) and 9(b) are the down-sampled image pair, and FIG. 9(c) is the score map, obtained with the calculation method of step 7. FIG. 9(d) shows the 4 cluster regions selected by the score map. FIGS. 9(e), 9(f), 9(g), and 9(h) are the cluster regions in the image to be matched whose cluster centers are similar to those of the selected cluster regions. As the figures show, the selected template regions all lie within cluster regions of the target image, so during matching the final result only needs to be searched within those cluster regions. FIG. 9(i) is the template selection position determined by the cluster regions of FIG. 9(d), FIG. 9(j) is the matching result obtained with the template of FIG. 9(i), and FIG. 9(k) shows the corresponding enlarged regions of (i) and (j); I, II, III, and IV are the corresponding region numbers.

Claims (2)

1. A template selection and accelerated matching method for nonlinear color space classification, characterized by comprising a model training process and an image matching process, specifically comprising the following steps:

model training process:

step 1: collecting training image samples, extracting the CIE chromaticity diagram of the training image samples, acquiring each MacAdam ellipse and recording it as a color region, extracting the RGB value corresponding to each color region, and manually labeling the color class number to which it belongs;

step 2: training a five-layer feedforward neural network model on the collected training image samples to obtain the five-layer feedforward neural network model, comprising: an input layer, a first hidden layer, a second hidden layer, a third hidden layer, and an output layer;

the input layer having 3 neurons, representing respectively the R, G, and B values corresponding to each color region extracted from the training image samples; the output layer representing the color class number;

image matching process:

step 4: inputting a pair of color images $I_1$ and $I_2$ and recording the image pair $I_1$ and $I_2$, wherein $I_1$ is the template image and $I_2$ the image to be matched, and setting a sampling rate $\alpha$;

step 5: using the set sampling rate $\alpha$, performing alternate-point down-sampling on the image pair $I_1$ and $I_2$ to obtain $I_1^D$ and $I_2^D$;

step 6: processing $I_1^D$ and $I_2^D$ with the obtained five-layer feedforward neural network model to obtain the classification result sets $C_{I_1}^D$ and $C_{I_2}^D$ and the numbers of categories $N_{I_1}^D$ and $N_{I_2}^D$, wherein $N_{I_1}^D$ and $N_{I_2}^D$ are respectively the numbers of elements in the sets $C_{I_1}^D$ and $C_{I_2}^D$;

wherein the classification result sets $C_{I_1}^D$ and $C_{I_2}^D$ are calculated with the following formula:

$C_{I_i}^D = Cluster(Desend(I_i, \alpha)), \quad i = 1, 2$

wherein Cluster classifies with the five-layer feedforward neural network model and Desend is alternate-point down-sampling, i.e. $I^D = Desend(I, \alpha)$;

step 7: establishing an index of the similar clusters through the index matrix $IM[i][j]$, and calculating the similarity-probability measure $Score(i)$:

$Score(i) = \dfrac{1}{\varepsilon + count_j\left(IM[i][j] \neq 0\right)}$

wherein $IM[i][j]$ is the index matrix, $\varepsilon$ is a real number ensuring that the denominator is not 0, and count() counts the number of entries;

the index matrix $IM[i][j]$ being calculated as follows:

$IM[i][j] = \begin{cases} 1, & \bar{C}_{I_1}^D(i) \text{ and } \bar{C}_{I_2}^D(j) \text{ belong to the same color class} \\ 0, & \text{otherwise} \end{cases}$

step 8: according to $Score(i)$, selecting the $i$ corresponding to the top $k$ highest scores as the preferred color class numbers, and determining the template regions in the template image and the regions to be matched in the image to be matched according to the preferred color class numbers;

step 9: calculating the similarity between the template region in the template image and the region to be matched in the image to be matched to obtain the matching relation between the template region in the template image and the region to be matched in the image to be matched.

2. The template selection and accelerated matching method for nonlinear color space classification according to claim 1, wherein the specific process of step 9 is: let the region similarity between images $I_1^D$ and $I_2^D$ be $\Delta_T(I_1^D, I_2^D)$, with $T$ the affine transformation matrix from pixels $p$ in $I_1^D$ to pixels in $I_2^D$; the similarity between regions is calculated as follows:

$\Delta_T(I_1^D, I_2^D) = \dfrac{1}{n_1^2} \sum_{p \in I_1^D} \mathbb{1}\left[C_{I_1}^D(p) = C_{I_2}^D(T(p))\right]$

The similarity values $\Delta_T(I_1^D, I_2^D)$ corresponding to all affine transformations $T$ are calculated by the above formula, and the maximum over all similarity values is taken as the final result, indicating the best match between the template region in the template image and the region to be matched in the image to be matched.
CN201910105261.2A 2019-02-01 2019-02-01 Template selection and accelerated matching method for nonlinear color space classification Active CN109886325B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910105261.2A CN109886325B (en) 2019-02-01 2019-02-01 Template selection and accelerated matching method for nonlinear color space classification

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910105261.2A CN109886325B (en) 2019-02-01 2019-02-01 Template selection and accelerated matching method for nonlinear color space classification

Publications (2)

Publication Number Publication Date
CN109886325A CN109886325A (en) 2019-06-14
CN109886325B (en) 2022-11-29

Family

ID=66927930

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910105261.2A Active CN109886325B (en) 2019-02-01 2019-02-01 Template selection and accelerated matching method for nonlinear color space classification

Country Status (1)

Country Link
CN (1) CN109886325B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111583193B (en) * 2020-04-21 2021-04-23 广州番禺职业技术学院 Pistachio nut framework extraction device based on geometric contour template matching and algorithm thereof
CN113408365B (en) * 2021-05-26 2023-09-08 广东能源集团科学技术研究院有限公司 Safety helmet identification method and device under complex scene

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106204618A (en) * 2016-07-20 2016-12-07 南京文采科技有限责任公司 Product surface of package defects detection based on machine vision and sorting technique
CN106355607A (en) * 2016-08-12 2017-01-25 辽宁工程技术大学 Wide-baseline color image template matching method
CN107122776A (en) * 2017-04-14 2017-09-01 重庆邮电大学 A kind of road traffic sign detection and recognition methods based on convolutional neural networks
CN107133943A (en) * 2017-04-26 2017-09-05 贵州电网有限责任公司输电运行检修分公司 A kind of visible detection method of stockbridge damper defects detection
CN108229561A (en) * 2018-01-03 2018-06-29 北京先见科技有限公司 Particle product defect detection method based on deep learning
CN108830912A (en) * 2018-05-04 2018-11-16 北京航空航天大学 A kind of interactive grayscale image color method of depth characteristic confrontation type study
CN108876797A (en) * 2018-06-08 2018-11-23 长安大学 A kind of image segmentation system and method based on Spiking-SOM neural network clustering


Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Mineral identification using color spaces and artificial neural networks; Baykan N A et al.; Computers & Geosciences; Dec. 2010; vol. 36, no. 1, pp. 91-97 *
Template matching for detection & recognition of frontal view of human face through Matlab; N. Singh et al.; 2017 International Conference on Information Communication and Embedded Systems (ICICES); 2017; pp. 1-7 *
像对匹配的模板选择与匹配 (Template selection and matching for image-pair matching); 贾迪 et al.; 《中国图象图形学报》 (Journal of Image and Graphics); Nov. 2017; vol. 22, no. 11, pp. 1512-1520 *
基于颜色空间和模板匹配的交通标志检测方法 (Traffic sign detection method based on color space and template matching); 郝博闻 et al.; 《智能计算机与应用》 (Intelligent Computer and Applications); Dec. 2016; vol. 6, no. 4, pp. 20-22 *


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant