CN108960251A

CN108960251A - A kind of images match description generates the hardware circuit implementation method of scale space

Info

Publication number: CN108960251A
Application number: CN201810495451.5A
Authority: CN
Inventors: 李广; 朱传杰; 朱恩; 邱晓冬; 朱方杰
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2018-05-22
Filing date: 2018-05-22
Publication date: 2018-12-07

Abstract

The invention discloses a kind of images match to describe the hardware circuit implementation method that son generates scale space, includes the following steps: under concrete application occasion, loading need to do matched image；Gaussian smoothing is done to the image and generates first group of image；Next group of image is generated using the method for down-sampling combination gaussian filtering；Difference of Gaussian pyramid is generated according to obtained gaussian pyramid.The present invention is redesigned by the scale space generation method to SIFT image matching algorithm, the characteristics of to adapt to FPGA operation fixed point, parallel, assembly line, it accelerates the speed of scale space generation and saves hardware resource, to effectively optimize the performance of entire characteristic matching module.

Description

A kind of images match description generates the hardware circuit implementation method of scale space

Technical field

The present invention relates to the graphical rule space generation methods based on SIFT algorithm family, and in particular to a kind of images match Description generates the hardware circuit implementation method of scale space.

Background technique

Currently, graphical rule space, which generates, is broadly divided into two kinds: one kind is multi-scale Representation (multi-scale Representation), using the mode for adding low-pass filtering based on down-sampling；Another kind of is that scale space indicates (scale-space representation), using the mode of Laplce's filtering under identical sample rate.It is based on The graphical rule space of SIFT algorithm family generates, and is that the combination of two kinds of above methods is taken its advantage, keeps away its disadvantage, will scheme As scale space divides (octave) and layer (layer) in groups^[1], so as to form the multiscale space of image.

Classical SIFT graphical rule space generation method is developed in Image Multiscale Space Theory, is had very Strong theories integration also has similar structure with human visual system, research shows that detection obtains under SIFT scale space Feature Descriptor comprehensive performance be optimal^[2], including scale invariability (scale invariant), rotational invariance (rotate invariant), illumination invariant (illumination invariant) and certain affine-invariant features (affine invariant)。

However, classical SIFT graphical rule space is had the disadvantage that based on Software Create

1) computationally intensive.For the image of a 512*512, generates for 3 groups 6 layers of scale space, only filter just 6*3=18 group filter is needed, carries out convolution algorithm, it is assumed that average filter template size is 16*16, needs 512*512*15* 15=2²⁶Secondary multiplication, calculation amount are surprising；

2) arithmetic speed is low.Scale space based on filtering operation generates, and relates generally to summation operation and product calculation, and The space complexity and input data of complexity (time complexity and the space complexity) and input data of both operations How much proportional, images of a 512*512, be intel core i5,8G DDR, 64 win7 operation system in CPU On the desktop computer of system, 3 groups 6 layers of scale space is established, runing time was also above 3 seconds.

Summary of the invention

Goal of the invention: the figure in the graphical rule space generation method based on SIFT algorithm family, for dimension calculation As being at least also required to two groups.But scale space establishes huge calculating (based on the smooth of convolution), becomes obstruction and improves image Matching speed, one of the principal element for promoting business throughput provide a kind of based on SIFT algorithm in order to overcome this problem Images match description generates the hardware circuit implementation method of scale space, accelerates the speed that scale space is established, and reduces and calculates Guarantee data precision while amount, efficiently use the characteristic of hardware configuration, including assembly line and computation capability, reduces algorithm Hardware resource occupancy, effectively optimize the performance of entire characteristic matching module, enhance the portability of algorithm.

Technical solution: to achieve the above object, the present invention provides a kind of images match description and generates the hard of scale space Part circuit implementing method, includes the following steps:

1) under concrete application occasion, loading need to do matched image；

2) Gaussian smoothing is done to the image and generates first group of image；

3) next group of image is generated using the method for down-sampling combination gaussian filtering；

4) difference of Gaussian pyramid (Difference of Gaussian, DOG) is generated according to obtained gaussian pyramid.

The detailed process of the step 1 are as follows:

1.1) the gray scale floating data of image is read by Ethernet into DDR；

1.2) image data in DDR is successively read according to the sequence of Row Column into Buffer.

The detailed process of the step 2 are as follows:

2.1) separability for utilizing Gaussian function, splits into two one-dimensional Gaussian filters for 2-d gaussian filters device Convolution will need the one-dimensional filtering coefficients used to carry out fixed point processing, and wherein the size of Gaussian template width should be with Scale where the image is directly proportional；

2.2) column filtering is carried out to image, it is using the symmetry of Gaussian template when column filter, the pixel of symmetric position is first It adds up, the utilization of less multiplier is then multiplied to template；

2.3) capable filtering is carried out to row filter result, using the reuse of data when row filters, is filtered using transposition FIR Device saves hardware resource with less output delay；

2.4) mode for smoothly obtaining next layer in change conventional method to upper one layer, meets semigroup knot using Gaussian function The property of structure, all layers all smoothly obtain the image under this layer of scale to first layer, and make its synchronism output of result, to save place Manage the time.

The detailed process of the step 3 are as follows:

While step 2 smoothly obtains first group of each tomographic image, to two sampling of input picture drop, sampled result is carried out With the consistent processing of process described in step 2, the image of other more large scale groups is obtained.

The detailed process of the step 4 are as follows:

For step 3 obtain as a result, i.e. gaussian pyramid, while a upper tomographic image is subtracted into next tomographic image and obtains height This difference pyramid, is equally parallel processing herein.

Mesoscale space of the present invention generating principle are as follows:

SIFT algorithm combines the respective advantage of scale space representation and pyramidal representation, innovatively proposes height This pyramid is for constructing scale space.

Gaussian pyramid is made of the different size of gaussian filtering layer of multiple groups, and first group since original image with specific ruler Spend σ₀Gaussian convolution core filter to obtain first group of the first floor, then with Gaussian convolution core by filtering layer by layer obtain one group it is discrete Scale space；Next group again by under the layer third from the bottom in upper one group two it is down-sampled obtain this group of first layer, and with same volume Product stratum nucleare layer filter to obtain the scale space of this group, and so on obtain other groups.

The scale image of sequential sampling in order to obtain, enables scale be incremented by with k times.If every group of needs are sought in S layers of scale Characteristic point is looked for, then needs the scale image of at least (S+3) layer.The empirical value obtained is tested according to David Lowe, taking S is 3, again Due to requiring every group of layer third from the bottom is down-sampled to obtain next group of first layer, then the scale that layer third from the bottom is S layers should be 2 times of first layer scale, then k=2^1/S。

Gauss quintar keeps picture size and pixel quantity constant in same group, does according to Scale-space theory to image Gaussian convolution simulates dimensional variation.When image is excessively fuzzy in this group, i.e., when having bulk redundancy information in image, then according to image Pyramidal representation generates next group of the first floor after down-sampled, and so on finally obtain gaussian pyramid.Gaussian pyramid It is exactly based on and combines the method for Scale-space theory and image pyramid, that is, ensure that the scale continuity of image, and big Reduce calculation amount greatly.

Laplce's Gaussian function σ of dimension normalization²▽²The Local Extremum that G is responded in scale space is very stable Characteristics of image.In order to reduce the calculation amount of the operator, difference Gaussian function DoG (Difference of is used in SIFT Gaussian) carry out Laplce's Gaussian function of approximate dimension normalization.Difference gaussian pyramid is by the Gauss gold word that has constructed Difference obtains tower two-by-two, and every group of S+2 layers of Gauss scale layer obtain S+1 layers of difference Gauss layer.

The utility model has the advantages that compared with prior art, the present invention having following advantage:

1, on the basis of previous method for producing software, by parallel computation and the pipeline design, by it is traditional based on Floating number, the scale space serially executed generation are converted into fixed point, parallel generation method, and filtering method is turned by two-dimensional convolution One-dimensional convolution is turned to, the speed of scale space generation is accelerated, the power consumption and hardware resource of floating-point processing are reduced, thus effectively Ground optimizes the performance of entire characteristic matching module；

2, optimization method is generated using the SIFT scale space based on FPGA, because having carried out parallel and flowing water line computation, And certain optimization has been carried out in process, each tomographic image can be with synchronism output, while greatly reducing calculation amount, Er Qie Wherein most stable of a collection of characteristic point has been retained in the extreme point that former algorithm detects, is reduced the pressure of subsequent match, is mentioned The high arithmetic speed of images match, and hardware resource occupancy is greatly reduced, it is with a wide range of applications.

Detailed description of the invention

Fig. 1 is overall flow schematic diagram of the invention；

Fig. 2 is that parallel filtering obtains the schematic diagram of gaussian pyramid；

Fig. 3 is gaussian filtering structural schematic diagram；

Fig. 4 is to obtain the pyramidal schematic diagram of difference of Gaussian by gaussian pyramid.

Specific embodiment

In the following with reference to the drawings and specific embodiments, the present invention is furture elucidated.

Embodiment 1:

As shown in Figure 1, the present invention provides a kind of hardware circuit implementation method of images match description generation scale space, Include the following steps:

1) under concrete application occasion, loading need to do matched image, detailed process are as follows:

1.1) the gray scale floating data of image is read by Ethernet into DDR；

2) Gaussian smoothing is done to the image and generates first group of image, detailed process are as follows:

3) next group of image, detailed process are generated using the method for down-sampling combination gaussian filtering are as follows:

4) difference of Gaussian pyramid, detailed process are generated according to obtained gaussian pyramid are as follows:

According to above method step, it is assumed that input is the picture of a 512*512 size, one byte (8 of each pixel A bit) it is indicated, the gaussian pyramid of 2 groups 6 layers of building, first layer graphical rule positioning 1.6.First by filter system Number fixed point processing, scale is respectively 1.6,1.6*2^1/3,1.6*2^2/3, 3.2,3.2*2^1/3,3.2*2^2/3, 1024 times of expansion, four House five finely tunes the sum of assurance coefficient after entering be 1024.Then Gaussian smoothing is carried out to 6*2=12 tomographic image respectively.Second group of image It is to be filtered on the basis of first group of first layer image drop sampling, specific gaussian filtering structure is as shown in figure 3, filtering is Rank of advanced units filtering, then carries out capable filtering, obtains gaussian pyramid as shown in Figure 2, the gaussian pyramid that filtering is obtained Difference of Gaussian pyramid is obtained, it is specific as shown in Figure 4.

Scale space hardware circuit realizes that step is specific as follows:

Gaussian pyramid is suitble to use spatial parallelism method, while doing gaussian filtering to all images in one group, so not only It can greatly accelerate the algorithm speed of service, moreover it is possible to while filter result is obtained, conveniently do the assembly line arrangement of algorithm.Due to same group In, every layer of Gauss scale image carries out convolution operation simultaneously, and every group of Gaussian convolution core is enabled to be superimposed convolution since first layer scale Remaining each layer, and separately design the coefficient of the convolution kernel of following each layer.

By the superposability of gaussian kernel function, from every group of first layer (relative scalar σ₀) filtering obtain remaining each layer, count Calculate the scale coefficient of the gaussian kernel function of other each layers:

By scale coefficient, the Filtering Template coefficient of each layer of gaussian kernel function finally can be obtained.Again by gaussian kernel function Detachable property be conducive to substantially reduce calculation amount, each 2-d gaussian filters device can be enabled to split two one-dimensional filtering devices, respectively For Gauss line filter and Gauss column filter, and their coefficients are identical.

It is much larger due to compare fixed-point calculation to the resource consumption of floating-point operation in logical device, it will be in algorithm Floating-point operation do fixed point processing.Under the premise of taking into account resource consumption and algorithm performance, this method is to each raw Gaussian Each template parameter of filter is multiplied by 1024, then rounds up, and obtains the gaussian filtering template of only integer.

Detachable dimensional Gaussian convolution kernel function is the identical row convolution kernel function of coefficients and column convolution kernel letter Number.Two-dimensional gaussian filtering is done equal to accumulating once row filtering on the image and once arranging filtering to image, splits dimensional Gaussian Calculation amount can be substantially reduced after core.

With G (x, y, σ₁) for, do the fixed point of gaussian filtering, it is known that σ₀=1.6, it can obtain:

In order to guarantee the adequacy of gaussian filtering, i.e. according to the energy for being included within the scope of 3 σ of radius of gaussian kernel function Integral in the radius is the 99.7% of entire gaussian kernel function, therefore Filtering Template width should also follow the principle, to protect Demonstrate,prove scale invariability.Then G (x, y, σ₁) corresponding to template width should be 6 σ₁≈ 7.358 rounds up and obtains template for odd number Width is 9.By Gaussian filter design factor:

[0.0016 0.0164 0.0861 0.2333 0.3252 0.2333 0.0861 0.0164 0.0016]

1024 are multiplied by each coefficients:

[1.6266 16.6945 88.0879 238.9582 333.2656 238.9582 88.0879 16.6945 1.6266]

Each coefficient is rounded up again:

[2 17 88 239 333 239 88 17 2]

But this can not as final coefficients because the coefficient of the template and be 1025, greater than the amplification of script Coefficient 1024.It is demonstrated experimentally that if coefficient between each layer gaussian filtering template and inconsistent, it will be each interlayer in difference Gauss Error amplification causes characteristic point detection that can not detect the characteristic point of real stabilization and Scale invariant.

Therefore, which also needs to adjust.It is as small as possible for principle to be changed to template, if coefficients and being less than 1024, then it finds and is less than before rounding up but rounds up closest to coefficients X.5, if coefficients and if being greater than 1024 Conversely, to take into account the central symmetry principle of template simultaneously.Due to 1025-1024=1, it is contemplated that central symmetry and change to template The smallest principle is moved, is changed to 333-1=2 for the 333 of template center, obtains final G (x, y, σ₁) template:

[2 17 88 239 332 239 88 17 2]

The Filtering Template coefficient of other scales obtains in the same way.

The centre symmetry of gaussian filtering template is utilized when carrying out convolution.It is filtered using a width for the Gauss of w (w is odd number) For wave template, convolution operation needs w-1 sub-addition and w times each time:

G (i)=G (w-1-i) is known by the centre symmetry of gaussian filtering template, then:

Finally, will successively be subtracted each other to obtain difference of Gaussian pyramid by each tomographic image of gaussian pyramid obtained by the above method.

Embodiment 2:

Scale invariability test, specific testing procedure are carried out to this method below are as follows:

A) the scale space generation step for optimizing source figure and matching figure respectively obtains belonging to respective graphical rule Space, i.e., final difference of Gaussian pyramid；

B) two groups of difference of Gaussian pyramids are obtained into description of characteristic point with the method for classics SIFT respectively；

C) description is schemed into matching and carries out Euclidean distance matching with source figure description, method is: description, recently Neighborhood distance/time nearest neighbor distance < 0.6 is determined as a matching pair, statistical match is to quantity and when completing consumed by matching Between.

By under different platform and scale space generating mode to the scale invariability of the images match of two 640*512 It is tested, obtains Tables 1 and 2:

Scale invariability is tested under 1 FPGA of table

Note: FPGA clock frequency 100MHz

Scale invariability is tested under 2 CPU of table

Note: CPU model core i5, dominant frequency 3.2GHz

Through Tables 1 and 2 as can be seen that at hardware platform FPGA, the scale space generation method after optimization is maintained SIFT possesses the characteristic of feature-rich point quantity, has very big promotion in processing speed, substantially meets the requirement of real-time.

Claims

1. the hardware circuit implementation method that a kind of images match description generates scale space, it is characterised in that: including walking as follows It is rapid:

1) under concrete application occasion, loading need to do matched image；

4) difference of Gaussian pyramid is generated according to obtained gaussian pyramid.

2. a kind of images match description according to claim 1 generates the hardware circuit implementation method of scale space, It is characterized in that: the detailed process of the step 1 are as follows:

1.1) the gray scale floating data of image is read into memory into (DDR) by Ethernet

1.2) image data in DDR is successively read according to the sequence of Row Column into buffer (Buffer).

3. a kind of images match description according to claim 1 generates the hardware circuit implementation method of scale space, It is characterized in that: the detailed process of the step 2 are as follows:

2.1) 2-d gaussian filters device, is split into the volume of two one-dimensional Gaussian filters by the separability for utilizing Gaussian function The one-dimensional filtering coefficients for needing to use are carried out fixed point processing by product, and wherein the size of Gaussian template width should be with this Scale where image is directly proportional；

2.2) column filtering is carried out to image first to have added the pixel of symmetric position using the symmetry of Gaussian template when column filter Come, the utilization of less multiplier is then multiplied to template；

2.3) capable filtering is carried out to row filter result, using the reuse of data when row filters, using transposition FIR filter；

2.4) mode for smoothly obtaining next layer in change conventional method to upper one layer, meets Semigroup Structure using Gaussian function Property, all layers all smoothly obtain the image under this layer of scale to first layer, and make its synchronism output of result.

4. a kind of images match description according to claim 3 generates the hardware circuit implementation method of scale space, It is characterized in that: the detailed process of the step 3 are as follows:

While step 2 smoothly obtains first group of each tomographic image, to two sampling of input picture drop, sampled result is carried out and is walked The rapid 2 consistent processing of process, obtains the image of other more large scale groups.

5. a kind of images match description according to claim 1 or 4 generates the hardware circuit implementation method of scale space, It is characterized by: the detailed process of the step 4 are as follows:

For step 3 obtain as a result, i.e. gaussian pyramid, while a upper tomographic image is subtracted into next tomographic image and obtains Gaussian difference Divide pyramid, is equally parallel processing herein.