CN107085607A - Image feature point matching method - Google Patents

Image feature point matching method

Info

Publication number
CN107085607A
CN107085607A (application CN201710258205.3A)
Authority
CN
China
Prior art keywords
vector
matched
cluster centre
characteristic
storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710258205.3A
Other languages
Chinese (zh)
Other versions
CN107085607B (en)
Inventor
段翰聪
赵子天
谭春强
文慧
闵革勇
陈超
李博洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201710258205.3A priority Critical patent/CN107085607B/en
Publication of CN107085607A publication Critical patent/CN107085607A/en
Application granted granted Critical
Publication of CN107085607B publication Critical patent/CN107085607B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/51Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Library & Information Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an image feature point matching method comprising the following steps. Database image feature extraction: extract features from the database images to form database feature vectors, and reduce their dimensionality. Vector storage: split each reduced database feature vector into parts; for each part, first perform product quantization and then vector quantization, forming a product quantizer and a vector quantizer, and build a retrieval tree and hash tables. Query image feature extraction: extract features from the image to be matched to form a query feature vector, and reduce its dimensionality. Vector matching: split the reduced query feature vector; find the cluster centres of the product quantizer and vector quantizer closest to the query vector; look up, through the retrieval tree and hash tables, the images corresponding to those cluster centres to form a candidate set; and compute floating-point distances to find the image in the candidate set closest to the query feature vector. The method is both fast and accurate.

Description

Image feature point matching method
Technical field
The present invention relates to the technical field of image search, and in particular to an image feature point matching method.
Background technology
In image search, feature matching is a crucial link: its efficiency and precision determine the final search speed and accuracy. Existing image search typically proceeds as follows. First, a projection matrix is trained from a large amount of sample data; feature vectors are converted to binary codes by a hash function, the binary codes are segmented, and multiple hash tables are generated, with each code segment serving directly as a hash-table entry. Second, when a query vector arrives, it is converted to a binary code in the same way and mapped to the corresponding hash-table entry as well as to all entries within Hamming radius r; all images in those entries form the candidate set. Third, full Hamming distances are computed between the query and every image feature vector in the candidate set, and the results are re-ranked by distance. Converting floating-point feature vectors to binary codes through a hash function loses precision, and because the final re-ranking still computes Hamming distances over binary codes, recall is reduced to some extent: Hamming computation is very fast, but binary codes represent vectors less precisely than floating-point values.
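As an illustration of the precision loss described above (a hypothetical sketch, not part of the patent text): a coarse binary hash can map distinct vectors to identical codes, so Hamming-distance re-ranking cannot separate them, while exact floating-point distances can.

```python
# Illustration only: a 1-bit-per-dimension sign hash collapses distinct
# vectors onto one code, so Hamming ranking loses the true ordering.

def hamming(a: int, b: int) -> int:
    """Hamming distance between two binary codes stored as ints."""
    return bin(a ^ b).count("1")

def l2_sq(u, v):
    """Squared Euclidean distance between two float vectors."""
    return sum((x - y) ** 2 for x, y in zip(u, v))

def sign_hash(v) -> int:
    """One bit per dimension: 1 if the component is non-negative."""
    code = 0
    for x in v:
        code = (code << 1) | (1 if x >= 0 else 0)
    return code

q  = [0.9, 0.1, -0.1]
d1 = [1.0, 0.2, -0.2]   # truly closest to q
d2 = [0.1, 0.1, -0.9]   # farther from q in float space

# Float distances order d1 before d2 ...
assert l2_sq(q, d1) < l2_sq(q, d2)
# ... but all three vectors hash to the same code, so Hamming
# distance cannot distinguish them at all.
assert hamming(sign_hash(q), sign_hash(d1)) == 0
assert hamming(sign_hash(q), sign_hash(d2)) == 0
```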
With the rapid development of the Internet, the number of images online has reached the tens of billions or more. Existing feature point matching methods no longer scale to such rapidly growing image libraries, so performing efficient, high-precision search over massive image collections has become a focus of research.
Summary of the invention
To solve the above technical problem, the present invention provides an image feature point matching method with high matching precision and high speed.
The present invention is achieved through the following technical solutions:
An image feature point matching method comprises the following steps:
Database image feature extraction: extract the features of the database images to form database feature vectors, and reduce their dimensionality.
Vector storage: split each reduced database feature vector; for each part after splitting, first perform product quantization and then vector quantization, forming a product quantizer and a vector quantizer; and build a retrieval tree and hash tables.
Query image feature extraction: extract the features of the image to be matched to form a query feature vector, and reduce its dimensionality.
Vector matching: split the reduced query feature vector; find the cluster centres of the product quantizer and vector quantizer closest to the query feature vector; look up, through the retrieval tree and hash tables, the images corresponding to those cluster centres to form a candidate set; and compute floating-point distances to find the image in the candidate set closest to the query feature vector.
Unlike algorithms such as iterative quantization, this scheme does not compute binary codes; it builds the retrieval tree and hash tables by clustering after dimensionality reduction. In the first-level clustering, the data are not clustered as a whole but split into parts; because the parts are completely independent, multi-threaded parallel processing can be used, greatly reducing the quantizer training time. Matching is performed in two stages: the first stage selects a candidate set, and the second stage computes distances over full floating-point vectors. Provided the candidate set is reasonably large, the floating-point re-ranking brings recall within one percentage point of brute-force matching, and its ordering is more accurate than Hamming-distance ranking.
If the database holds N records, brute-force matching requires N distance computations; with this scheme, depending on the chosen parameters, the candidate set holds only N/100 to N/10 records, greatly reducing computation and increasing matching speed. Moreover, when the retrieval tree is built, the first-stage data are split into several parts whose clustering is completely independent, so multiple threads can cluster in parallel, improving clustering speed.
Preferably, the vector storage method is:
divide each database feature vector into P disjoint parts;
within each part, perform k-means clustering with k1 cluster centres;
for each cluster centre, apply vector quantization with k2 cluster centres to all data assigned to that centre;
record, in P hash tables, the IDs of all features (or the names of the corresponding images) mapped to each cluster centre.
Further, the specific vector matching method is:
divide the query feature vector into P disjoint parts;
within each part, compute the distances between the query feature vector and the k1 cluster centres, and select the W centres with the smallest distances;
for each of the W selected centres, compute the distance between the query feature vector and each of that centre's k2 second-level cluster centres, obtaining k2 distances;
sort the W*k2 distances and take the m smallest, where m is a natural number greater than 1;
take the cluster centres corresponding to the m distances, find the corresponding hash-table entries, and form the candidate set from the image names or IDs in those entries;
compute floating-point distances between the query feature vector and the feature vector of each image ID in the candidate set, and take the image with the smallest distance as the target.
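The matching steps above can be sketched end to end as follows. This is a minimal illustration over toy data for a single part of the vector; the helper names, the toy centres, and the parameters are assumptions for the example, not from the patent.

```python
# Minimal sketch of two-stage matching for ONE part of the vector.
# centres1: k1 first-layer centres; centres2[i]: the k2 second-layer centres
# under first-layer centre i; table[(i, j)]: image IDs stored under that code.

def l2_sq(u, v):
    """Squared Euclidean distance."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def match(query, centres1, centres2, table, db_vectors, W=2, m=2):
    # Stage 1a: W closest first-layer centres.
    first = sorted(range(len(centres1)),
                   key=lambda i: l2_sq(query, centres1[i]))[:W]
    # Stage 1b: distances to every second-layer centre under those W centres.
    dists = [(l2_sq(query, c2), (i, j))
             for i in first for j, c2 in enumerate(centres2[i])]
    # m smallest -> candidate set (union of the hash-table entries).
    candidates = set()
    for _, code in sorted(dists)[:m]:
        candidates.update(table.get(code, ()))
    # Stage 2: exact floating-point re-ranking inside the candidate set.
    return min(candidates, key=lambda img: l2_sq(query, db_vectors[img]))

centres1 = [[0.0, 0.0], [10.0, 10.0]]
centres2 = [[[0.0, 1.0], [1.0, 0.0]], [[9.0, 10.0], [10.0, 9.0]]]
table = {(0, 0): ["a"], (0, 1): ["b"], (1, 0): ["c"], (1, 1): ["d"]}
db_vectors = {"a": [0.1, 1.0], "b": [1.0, 0.1],
              "c": [9.0, 9.9], "d": [10.0, 9.1]}

print(match([0.9, 0.2], centres1, centres2, table, db_vectors))  # -> b
```

With real data the same procedure runs once per part, and the per-part candidate entries are unioned before the final floating-point re-ranking.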
Further, the k-means clustering uses parallel processing.
Preferably, principal component analysis is used to reduce the dimensionality of the features.
Compared with the prior art, the present invention has the following advantages and benefits:
The present invention builds the retrieval tree and hash tables by clustering after dimensionality reduction. In the first-level clustering the data are split into completely independent parts, so multi-threaded parallel processing can be used, greatly reducing the quantizer training time. Matching first selects a candidate set and then re-ranks it with distance computations over full floating-point vectors; provided the candidate set is reasonably large, the recall differs from that of brute-force matching by less than one percentage point. The retrieval is therefore both precise and efficient.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to embodiments. The exemplary embodiments and their explanations serve only to explain the present invention and are not intended to limit it.
Embodiment 1
An image feature point matching method comprises the following steps:
Database image feature extraction: extract the features of the database images to form database feature vectors, and reduce their dimensionality.
Vector storage: split each reduced database feature vector; for each part after splitting, first perform product quantization and then vector quantization, forming a product quantizer and a vector quantizer; and build the retrieval tree and hash tables.
Query image feature extraction: extract the features of the image to be matched to form a query feature vector, and reduce its dimensionality.
Vector matching: split the reduced query feature vector; find the cluster centres of the product quantizer and vector quantizer closest to the query feature vector; look up, through the retrieval tree and hash tables, the images corresponding to those cluster centres to form a candidate set; and compute floating-point distances to find the image in the candidate set closest to the query feature vector.
With this method, retrieval considers not only the exactly-hit hash entry but also several nearby entries, which increases the probability that the image most similar to the query appears in the candidate set and improves recall. During re-ranking, the original floating-point feature vectors are used for distance computation; this preserves all the information in the original vectors, whereas quantizing them into binary feature codes incurs some loss of precision.
Embodiment 2
Building on the idea of the previous embodiment, this embodiment refines each step.
Whether an image is being enrolled into the retrieval tree or matched as a query, feature points must be extracted first. There are many feature extraction methods, for example convolutional neural networks. The dimensionality of the output feature vector, call it n, is relatively large; n may be 128, 256, 512, and so on. A large dimensionality increases the computation during matching, so we reduce it: principal component analysis can reduce the output feature vector to d dimensions, where d ≤ n, for example d = 128 or 64. Dimensionality reduction not only removes the influence of noise but also reduces the amount of computation and saves time.
The detailed dimensionality reduction process is as follows:
Suppose there are S data points. In the original space, the S n-dimensional feature vectors can be represented by the matrix M = {D1, D2, …, Dn}, where each Di is an S×1 column vector. First compute the covariance matrix Var(M) = M^T M / n; then use an existing method to obtain the n eigenvalues of Var(M) and their corresponding eigenvectors; select the d largest eigenvalues and their eigenvectors to form the reduction matrix R. Computing MR yields an S×d matrix, completing the reduction.
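The reduction recipe above can be sketched with NumPy. This follows the text's formula literally (covariance as M^T M / n, no mean-centring); standard PCA would centre the data first and normalise by the sample count, so treat this as a sketch of the described recipe, not a reference PCA implementation.

```python
import numpy as np

def reduce_dim(M: np.ndarray, d: int) -> np.ndarray:
    """Project an S x n data matrix M down to S x d, as described:
    covariance Var(M) = M^T M / n, keep the eigenvectors of the d largest
    eigenvalues as the reduction matrix R, and return M @ R."""
    n = M.shape[1]
    var = M.T @ M / n
    eigvals, eigvecs = np.linalg.eigh(var)          # ascending eigenvalues
    R = eigvecs[:, np.argsort(eigvals)[::-1][:d]]   # top-d eigenvectors as columns
    return M @ R

rng = np.random.default_rng(0)
M = rng.standard_normal((100, 8))   # S = 100 samples, n = 8 dimensions
Md = reduce_dim(M, 3)
print(Md.shape)  # -> (100, 3)
```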
After the features of the database images are extracted and reduced, vector storage builds the retrieval tree. The specific vector storage method is:
Divide each database feature vector into P disjoint parts. For example, d = 128-dimensional data can be cut into P = 4 parts: floats 1–32 of the feature vector form the first part, floats 33–64 the second, floats 65–96 the third, and floats 97–128 the fourth.
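The split into equal contiguous parts can be sketched in a few lines (helper name is illustrative):

```python
def split_parts(vec, P):
    """Split a feature vector into P equal, disjoint, contiguous parts."""
    step = len(vec) // P
    return [vec[i * step:(i + 1) * step] for i in range(P)]

v = list(range(1, 129))           # stands in for a 128-dimensional vector
parts = split_parts(v, 4)
print(len(parts), len(parts[0]))  # -> 4 32
print(parts[0][0], parts[0][-1])  # -> 1 32
```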
Within each part, perform k-means clustering with k1 cluster centres; this is the first-layer quantization. Taking the first part as an example, the detailed procedure is:
1-1. From the S first-part features (32 floats each), choose k1 of them as the initial cluster centres.
1-2. Compute the Euclidean distance between each of the S feature vectors and each of the k1 cluster centres, and assign each vector to the centre it is closest to.
1-3. For each cluster centre from the previous step, sum the feature vectors assigned to it and take the component-wise average; the resulting mean vectors become the new k1 cluster centres.
1-4. If the iteration limit is reached or the clustering error has fallen within tolerance, stop clustering: the k1 cluster centres obtained in step 1-3 are the result. Otherwise, return to step 1-2.
1-5. Record the values of the k1 cluster centres; they are needed during retrieval.
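Steps 1-1 to 1-5 are plain Lloyd's k-means, which can be sketched as follows (toy data and a fixed iteration count are assumptions for the example):

```python
import random

def kmeans(data, k, iters=20, seed=0):
    """Plain Lloyd's k-means following steps 1-1 to 1-4: pick k points as
    initial centres, assign each point to its nearest centre by Euclidean
    distance, recompute each centre as the mean of its members, repeat."""
    rnd = random.Random(seed)
    centres = [list(p) for p in rnd.sample(data, k)]             # step 1-1
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in data:                                           # step 1-2
            i = min(range(k), key=lambda c: sum(
                (a - b) ** 2 for a, b in zip(p, centres[c])))
            clusters[i].append(p)
        for c, members in enumerate(clusters):                   # step 1-3
            if members:
                centres[c] = [sum(xs) / len(members) for xs in zip(*members)]
    return centres                                               # step 1-5

data = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
        (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
centres = kmeans(data, 2)
print(sorted(round(c[0], 2) for c in centres))  # -> [0.03, 5.03]
```

A production version would also track the clustering error to implement the early-stop condition of step 1-4; it is omitted here for brevity.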
On top of the first-layer quantization, for each cluster centre, vector quantization with k2 cluster centres is applied to all data assigned to that centre; this is the second-layer quantization. The detailed procedure is:
2-1. For each first-layer cluster centre, let Si be the number of features belonging to it (the Si sum to S); choose k2 of these features as the initial cluster centres.
2-2. Compute the Euclidean distance between each of the Si feature vectors and each of the k2 cluster centres, and assign each vector to the centre it is closest to.
2-3. For each cluster centre from the previous step, sum the feature vectors assigned to it and take the component-wise average; the resulting mean vectors become the new k2 cluster centres.
2-4. If the iteration limit is reached or the clustering error has fallen within tolerance, stop clustering: the k2 cluster centres obtained in step 2-3 are the result. Otherwise, return to step 2-2.
2-5. Record the values of the k2 cluster centres; they are needed during retrieval.
Record, in P hash tables, the IDs of all features (or the names of the corresponding images) mapped to each cluster centre. The hash code of each hash table is log2(k1) + log2(k2) bits long. For example:
1. Suppose k1 = 16 and k2 = 16. Then the hash code is 4 + 4 = 8 bits: the first 4 bits identify one of the 16 first-layer cluster centres, and the last 4 bits identify one of the 16 second-layer cluster centres under that first-layer centre.
2. For each feature mapped to a second-layer cluster centre, the corresponding image name or ID is added to the hash entry addressed by that centre's hash code. The hash here is simply an encoding of the cluster centres; it is a data structure used for storage.
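The 8-bit code of the example can be sketched directly (function names are illustrative):

```python
# k1 = k2 = 16 gives a 4 + 4 = 8-bit hash code: the high 4 bits select the
# first-layer centre, the low 4 bits the second-layer centre beneath it.
import math

k1, k2 = 16, 16
bits2 = int(math.log2(k2))

def hash_code(i: int, j: int) -> int:
    """Code for first-layer centre i and its second-layer centre j."""
    return (i << bits2) | j

def decode(code: int):
    """Recover (first-layer index, second-layer index) from a code."""
    return code >> bits2, code & (k2 - 1)

table = {}  # hash code -> list of image names/IDs (the inverted-index entries)
table.setdefault(hash_code(3, 5), []).append("img_001.jpg")

print(f"{hash_code(3, 5):08b}")   # -> 00110101
print(decode(hash_code(3, 5)))    # -> (3, 5)
```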
Vector storage exists to serve vector matching: once the retrieval tree and hash tables are built, vector matching can proceed. When a query image arrives, its feature points are extracted as described above, and then the vector matching steps begin. The specific vector matching method is:
Divide the query feature vector into P disjoint parts, using the same division as during storage.
Within each part, compute the distances between the query feature vector and the k1 cluster centres, sort them, and select the W centres with the smallest distances. Taking P = 4, k1 = 16, k2 = 16, W = 4 and the first part of the query vector as an example: compute the Euclidean distances between the first part of the query vector (floats 1–32) and the 16 first-layer cluster centres of the first part built during storage, sort the 16 distances, and select the W = 4 closest centres.
For each of the W selected centres, compute the distance between the query feature vector and each of that centre's k2 second-layer cluster centres one by one, obtaining k2 distances. Concretely, for each selected centre, Euclidean distances are computed between the 32-float query part and the 16 second-layer cluster centres inside it.
Push the W*k2 distances into a max-heap capped at m elements, where m is a natural number greater than 1; keeping m distances rather than a single one is what guarantees recall.
Take the m cluster centres corresponding to the distances left in the max-heap and find their hash-table entries; the image names or IDs in those entries form the candidate set. Each cluster centre was assigned a hash code during storage, and each hash code addresses a unique hash-table entry, so taking the union of the IDs in the m entries yields the candidate set.
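The size-m max-heap can be sketched with the standard library. Python's `heapq` is a min-heap, so distances are negated to keep the largest retained distance on top, where it can be evicted (the helper name is illustrative):

```python
import heapq

def m_smallest(distances, m):
    """Keep only the m smallest of a stream of (distance, code) pairs using
    a size-capped max-heap: store negated distances so the worst retained
    distance sits at the heap top and can be replaced cheaply."""
    heap = []  # holds (-distance, code)
    for dist, code in distances:
        if len(heap) < m:
            heapq.heappush(heap, (-dist, code))
        elif -heap[0][0] > dist:              # current worst > new candidate
            heapq.heapreplace(heap, (-dist, code))
    return sorted((-d, code) for d, code in heap)

stream = [(4.0, "c3"), (1.5, "c1"), (9.0, "c7"), (0.5, "c0"), (2.0, "c2")]
print(m_smallest(stream, 3))  # -> [(0.5, 'c0'), (1.5, 'c1'), (2.0, 'c2')]
```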
Compute the distance between the query feature vector and the feature vector of each image ID in the candidate set one by one, and take the image with the smallest distance as the target. Specifically:
For each ID in the candidate set, fetch its complete 128-dimensional floating-point feature vector and compute its distance to the query vector one by one;
select the K results with the smallest distances; the corresponding IDs are the most similar images. When K = 1 this is exact search; when K > 1 it is K-nearest-neighbour search.
Embodiment 3
A detailed embodiment building on the previous embodiment is now given.
The database and query feature point extraction steps are the same as above and are not repeated in this embodiment.
Vector storage: the product quantizer and vector quantizer are trained on a large number of feature vectors from the database, as follows:
Divide each D-dimensional reduced database feature vector into P disjoint parts. Taking D = 128 and P = 4 as an example, components 1–32 of the feature vector form the first segment, 33–64 the second, 65–96 the third, and 97–128 the fourth.
Within each part, perform k-means clustering with k1 cluster centres, giving P*k1 centres in total; the full set of centres is {[c1_i]_p : i = 1, …, k1; p = 1, …, P}, and these centres are stored. Because the features were segmented in the previous step, storing them costs only (D/P)*k1 per part. This step is called PQ, i.e. product quantization (the first-layer quantization), and yields the PQ quantizer. Moreover, because the parts are independent, this process can be parallelized across threads, processes, or even nodes, improving clustering speed.
On the basis of the first-layer quantization, the data assigned to each first-layer cluster centre are clustered again, generating k2 centres per first-layer centre and P*k1*k2 centres in total; the full set is {[c2_ij]_p : i = 1, …, k2; j = 1, …, k1; p = 1, …, P}, and these centres are stored. This step is called VQ, i.e. vector quantization (the second-layer quantization), and yields the VQ quantizer.
Build P hash tables, in one-to-one correspondence with the P parts separated in the first step. Each hash table has k1*k2 entries, and each hash code is log2(k1) + log2(k2) bits long. The k1*k2 cluster centres of each part are encoded, and the ID (or image name) of every sample feature vector mapped to a cluster centre is stored in that centre's hash-table entry, yielding an inverted index based on multiple hash tables.
After the above four steps are completed, the PQ quantizer, VQ quantizer, retrieval tree, and multi-hash-table inverted index, all trained from the large sample set, are obtained.
Once the retrieval tree and hash tables are built, a query image triggers vector matching retrieval.
Given a query feature vector y, the vectors closest to it are retrieved from the retrieval tree and the inverted index as follows:
Divide y into P disjoint parts: y = [y1, y2, y3, …, yP].
For each part y_p, compute one by one its distance to the k1 first-layer cluster centres of the p-th segment: define dist(y_p, [c1_i]_p) = ||y_p − [c1_i]_p||², the distance between part p of the query vector y and the i-th cluster centre of part p of the sample data.
Because of the uncertainty of clustering, a sample-space feature closest to y may well belong to a neighbouring cluster centre, so cluster centres near the closest one must also be considered. The distances between y_p and the cluster centres obtained in the previous step are re-sorted, and the w centres with the smallest distances are chosen as the scope of the query that follows.
For each of the w selected first-layer centres, each of which has k2 second-layer centres beneath it, define dist(y_p, [c2_ij]_p) = ||y_p − [c2_ij]_p||², the distance between part p of the query vector and the j-th second-layer centre under the i-th first-layer centre in the sample space.
Sort the resulting w*k2 distances. Because these are only distances over part p of the sample space, it is not enough to take the single closest one: take the m smallest distances, find the hash-table entries of the corresponding cluster centres, and take the union of the image IDs or names stored in the m entries, finally obtaining the candidate set of most-similar image IDs.
Fetch the feature vectors corresponding to the candidate image IDs and compute their distances to the complete query vector y one by one. Using a min-heap data structure here works well no matter how large the final data set is, and its memory consumption is tiny. Finally, the top K images with the smallest distances are returned as the search result; when K = 1, the single most similar image is obtained.
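The final heap-based top-K re-ranking can be sketched with the standard library's `heapq.nsmallest` (the function name and toy data are illustrative):

```python
import heapq

def rerank(query, candidate_ids, db_vectors, K=1):
    """Exact floating-point re-ranking of the candidate set: fetch each
    candidate's full vector, compute its distance to the query, and return
    the K closest IDs via a heap, so memory use stays small regardless of
    candidate-set size."""
    def dist(img_id):
        return sum((a - b) ** 2 for a, b in zip(query, db_vectors[img_id]))
    return heapq.nsmallest(K, candidate_ids, key=dist)

db_vectors = {"a": [0.0, 0.0], "b": [1.0, 1.0], "c": [0.2, 0.1]}
print(rerank([0.1, 0.1], ["a", "b", "c"], db_vectors, K=2))  # -> ['c', 'a']
```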
This embodiment uses product quantization, vector quantization, and multiple hash indexes to solve the nearest-neighbour search problem, exploiting parallel clustering and the two-stage retrieval process to improve retrieval recall.
The above embodiments further describe the objects, technical solutions, and beneficial effects of the present invention in detail. It should be understood that the above are merely embodiments of the present invention and are not intended to limit its scope of protection; any modification, equivalent substitution, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (6)

1. An image feature point matching method, characterised in that it comprises the following steps:
database image feature extraction: extracting features of database images to form database feature vectors and reducing their dimensionality; vector storage: splitting the reduced database feature vectors, and for each part after splitting, first performing product quantization and then vector quantization, forming a product quantizer and a vector quantizer, and building a retrieval tree and hash tables;
query image feature extraction: extracting features of the image to be matched to form a query feature vector and reducing its dimensionality;
vector matching: splitting the reduced query feature vector; finding the cluster centres of the product quantizer and vector quantizer closest to the query feature vector; finding, through the retrieval tree and hash tables, the images corresponding to those cluster centres to form a candidate set; and computing floating-point distances to find the image in the candidate set closest to the query feature vector.
2. The image feature point matching method according to claim 1, characterised in that the specific vector storage method is:
dividing the database feature vectors into P disjoint parts;
performing, within each part, k-means clustering with k1 cluster centres;
for each cluster centre, applying vector quantization with k2 cluster centres to all data assigned to that centre; and recording, in P hash tables, the IDs of all features (or the names of the corresponding images) mapped to each cluster centre.
3. The image feature point matching method according to claim 2, characterised in that the specific vector matching method is:
dividing the query feature vector into P disjoint parts;
computing, within each part, the distances between the query feature vector and the k1 cluster centres, and selecting the W centres with the smallest distances;
for each of the W selected centres, computing the distance between the query feature vector and each of that centre's k2 second-level cluster centres, obtaining k2 distances;
sorting the W*k2 distances and taking the m smallest, where m is a natural number greater than 1;
taking the cluster centres corresponding to the m distances, finding the corresponding hash-table entries, and forming the candidate set from the image names or IDs in those entries;
computing floating-point distances between the query feature vector and the feature vector of each image ID in the candidate set, and taking the image with the smallest distance as the target.
4. The image feature point matching method according to claim 2, characterized in that the k-means clustering is performed in parallel.
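One plausible reading of the parallel processing in claim 4 is that the P per-part clusterings are independent jobs that can run concurrently. A minimal sketch using a thread pool follows; the claim does not specify the mechanism, so the pool, worker count, and `kmeans_centres` helper are all assumptions.

```python
import numpy as np
from concurrent.futures import ThreadPoolExecutor

def kmeans_centres(data, k, iters=10, seed=0):
    """Minimal k-means that returns only the cluster centres."""
    rng = np.random.default_rng(seed)
    centres = data[rng.choice(len(data), size=k, replace=False)].copy()
    for _ in range(iters):
        assign = np.linalg.norm(data[:, None] - centres[None], axis=2).argmin(1)
        for j in range(k):
            if (assign == j).any():
                centres[j] = data[assign == j].mean(axis=0)
    return centres

rng = np.random.default_rng(1)
parts = [rng.normal(size=(300, 16)) for _ in range(4)]   # P=4 disjoint sub-spaces
# The P clustering jobs share no state, so they can run concurrently;
# NumPy releases the GIL inside the heavy array operations.
with ThreadPoolExecutor(max_workers=4) as pool:
    centre_sets = list(pool.map(lambda p: kmeans_centres(p, 8), parts))
```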
5. The image feature point matching method according to claim 1, characterized in that principal component analysis is used to reduce the dimensionality of the features.
6. The image feature point matching method according to claim 1, characterized in that the specific steps of the dimensionality reduction are:
forming a matrix M from L n-dimensional feature vectors and computing the covariance of M to obtain the matrix Var(M), where M = {D1, D2, … Dn}, n is the dimension of the feature vectors, and L is a natural number greater than 1;
computing the n eigenvalues of the covariance matrix Var(M) and the corresponding eigenvectors, and selecting the d largest eigenvalues and their corresponding eigenvectors as the dimensionality reduction matrix R;
computing MR to obtain an L×d matrix, thereby realizing the dimensionality reduction.
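The three dimensionality reduction steps of claim 6 map directly onto a covariance eigendecomposition. A minimal NumPy sketch (centring the data first, which the covariance computation implies; the sizes L, n, d are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
L, n, d = 200, 16, 4
M = rng.normal(size=(L, n))              # L feature vectors of dimension n
Mc = M - M.mean(axis=0)                  # centre the data
Var = np.cov(Mc, rowvar=False)           # n x n covariance matrix Var(M)
vals, vecs = np.linalg.eigh(Var)         # eigenvalues returned in ascending order
R = vecs[:, np.argsort(vals)[::-1][:d]]  # n x d: eigenvectors of the d largest eigenvalues
reduced = Mc @ R                         # the L x d matrix MR: dimensionality reduced
```

Because Var(M) is symmetric, `eigh` yields orthonormal eigenvectors, so R is an orthonormal projection onto the d directions of greatest variance.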
CN201710258205.3A 2017-04-19 2017-04-19 Image feature point matching method Active CN107085607B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710258205.3A CN107085607B (en) 2017-04-19 2017-04-19 Image feature point matching method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710258205.3A CN107085607B (en) 2017-04-19 2017-04-19 Image feature point matching method

Publications (2)

Publication Number Publication Date
CN107085607A true CN107085607A (en) 2017-08-22
CN107085607B CN107085607B (en) 2020-06-30

Family

ID=59611717

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710258205.3A Active CN107085607B (en) 2017-04-19 2017-04-19 Image feature point matching method

Country Status (1)

Country Link
CN (1) CN107085607B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107944046A (en) * 2017-12-15 2018-04-20 清华大学 Extensive high dimensional data method for quickly retrieving and system
CN108763481A (en) * 2018-05-29 2018-11-06 清华大学深圳研究生院 A kind of picture geographic positioning and system based on extensive streetscape data
CN110019096A (en) * 2017-12-29 2019-07-16 上海全土豆文化传播有限公司 The generation method and device of index file
CN110019905A (en) * 2017-10-13 2019-07-16 北京京东尚科信息技术有限公司 Information output method and device
CN110019907A (en) * 2017-12-01 2019-07-16 北京搜狗科技发展有限公司 A kind of image search method and device
CN110019875A (en) * 2017-12-29 2019-07-16 上海全土豆文化传播有限公司 The generation method and device of index file
CN110175546A (en) * 2019-05-15 2019-08-27 深圳市商汤科技有限公司 Image processing method and device, electronic equipment and storage medium
CN110795582A (en) * 2019-10-31 2020-02-14 京东方科技集团股份有限公司 Image recommendation method, system, terminal device and server
CN110889424A (en) * 2018-09-11 2020-03-17 阿里巴巴集团控股有限公司 Vector index establishing method and device and vector retrieval method and device
CN111324760A (en) * 2020-02-19 2020-06-23 名创优品(横琴)企业管理有限公司 Image retrieval method and device
CN112418298A (en) * 2020-11-19 2021-02-26 北京云从科技有限公司 Data retrieval method, device and computer readable storage medium
CN112988747A (en) * 2021-03-12 2021-06-18 山东英信计算机技术有限公司 Data retrieval method and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102842032A (en) * 2012-07-18 2012-12-26 郑州金惠计算机***工程有限公司 Method for recognizing pornography images on mobile Internet based on multi-mode combinational strategy
US9436758B1 (en) * 2011-12-27 2016-09-06 Google Inc. Methods and systems for partitioning documents having customer feedback and support content

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9436758B1 (en) * 2011-12-27 2016-09-06 Google Inc. Methods and systems for partitioning documents having customer feedback and support content
CN102842032A (en) * 2012-07-18 2012-12-26 郑州金惠计算机***工程有限公司 Method for recognizing pornography images on mobile Internet based on multi-mode combinational strategy

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110019905A (en) * 2017-10-13 2019-07-16 北京京东尚科信息技术有限公司 Information output method and device
CN110019905B (en) * 2017-10-13 2022-02-01 北京京东尚科信息技术有限公司 Information output method and device
CN110019907B (en) * 2017-12-01 2021-07-16 北京搜狗科技发展有限公司 Image retrieval method and device
CN110019907A (en) * 2017-12-01 2019-07-16 北京搜狗科技发展有限公司 A kind of image search method and device
CN107944046B (en) * 2017-12-15 2019-02-05 清华大学 Extensive high dimensional data method for quickly retrieving and system
CN107944046A (en) * 2017-12-15 2018-04-20 清华大学 Extensive high dimensional data method for quickly retrieving and system
CN110019096A (en) * 2017-12-29 2019-07-16 上海全土豆文化传播有限公司 The generation method and device of index file
CN110019875A (en) * 2017-12-29 2019-07-16 上海全土豆文化传播有限公司 The generation method and device of index file
CN108763481A (en) * 2018-05-29 2018-11-06 清华大学深圳研究生院 A kind of picture geographic positioning and system based on extensive streetscape data
CN110889424A (en) * 2018-09-11 2020-03-17 阿里巴巴集团控股有限公司 Vector index establishing method and device and vector retrieval method and device
CN110889424B (en) * 2018-09-11 2023-06-30 阿里巴巴集团控股有限公司 Vector index establishing method and device and vector retrieving method and device
CN110175546A (en) * 2019-05-15 2019-08-27 深圳市商汤科技有限公司 Image processing method and device, electronic equipment and storage medium
CN110795582A (en) * 2019-10-31 2020-02-14 京东方科技集团股份有限公司 Image recommendation method, system, terminal device and server
CN111324760A (en) * 2020-02-19 2020-06-23 名创优品(横琴)企业管理有限公司 Image retrieval method and device
CN111324760B (en) * 2020-02-19 2023-09-26 创优数字科技(广东)有限公司 Image retrieval method and device
CN112418298A (en) * 2020-11-19 2021-02-26 北京云从科技有限公司 Data retrieval method, device and computer readable storage medium
CN112988747A (en) * 2021-03-12 2021-06-18 山东英信计算机技术有限公司 Data retrieval method and system

Also Published As

Publication number Publication date
CN107085607B (en) 2020-06-30

Similar Documents

Publication Publication Date Title
CN107085607A (en) A kind of image characteristic point matching method
CN109960737B (en) Remote sensing image content retrieval method for semi-supervised depth confrontation self-coding Hash learning
Tao et al. Quality and efficiency in high dimensional nearest neighbor search
CN108491430A (en) It is a kind of based on the unsupervised Hash search method clustered to characteristic direction
CN107291895B (en) Quick hierarchical document query method
CN102254015A (en) Image retrieval method based on visual phrases
CN109886406A (en) A kind of complex convolution neural network compression method based on depth-compression
CN109886349A (en) A kind of user classification method based on multi-model fusion
CN109376772A (en) A kind of Combination power load forecasting method based on neural network model
CN108388656A (en) A kind of image searching method based on label correlation
CN101324926B (en) Method for selecting characteristic facing to complicated mode classification
CN108763295A (en) A kind of video approximate copy searching algorithm based on deep learning
CN114118369B (en) Image classification convolutional neural network design method based on group intelligent optimization
CN107180079B (en) Image retrieval method based on convolutional neural network and tree and hash combined index
CN110738053A (en) News theme recommendation algorithm based on semantic analysis and supervised learning model
CN112434553A (en) Video identification method and system based on deep dictionary learning
CN113742292B (en) Multithread data retrieval and access method of retrieved data based on AI technology
CN106919658B (en) A kind of large-scale image words tree search method and system accelerated based on GPU
CN113222109A (en) Internet of things edge algorithm based on multi-source heterogeneous data aggregation technology
CN107704872A (en) A kind of K means based on relatively most discrete dimension segmentation cluster initial center choosing method
CN105653686A (en) Domain name network address activeness statistics method and system
CN115618039A (en) Image retrieval method based on multi-label projection online Hash algorithm
CN115329116A (en) Image retrieval method based on multi-layer feature fusion
CN114610941A (en) Cultural relic image retrieval system based on comparison learning
KR20110080966A (en) An associative classification method for detecting useful knowledge from huge multi-attributes dataset

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant