CN108460403A - The object detection method and system of multi-scale feature fusion in a kind of image - Google Patents

The object detection method and system of multi-scale feature fusion in a kind of image Download PDF

Info

Publication number
CN108460403A
CN108460403A CN201810065807.1A CN201810065807A CN108460403A CN 108460403 A CN108460403 A CN 108460403A CN 201810065807 A CN201810065807 A CN 201810065807A CN 108460403 A CN108460403 A CN 108460403A
Authority
CN
China
Prior art keywords
scale
target
image
fusion
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810065807.1A
Other languages
Chinese (zh)
Inventor
张重阳
程浩
刘泽祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201810065807.1A priority Critical patent/CN108460403A/en
Publication of CN108460403A publication Critical patent/CN108460403A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/29Graphical models, e.g. Bayesian networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses the object detection method and system of multi-scale feature fusion in a kind of image, the method includes:The first step carries out the scaling of different scale using picture to be detected, constructs image pyramid;Second step obtains the multiple scale detecting template of one group of most of sample size of covering using statistics clustering method;Third walks, and is based on multiple scale detecting template, carries out the target context structure of dimension self-adaption;4th step, multiple dimensioned depth characteristic fusion;5th step, the non-maxima suppression based on soft-decision.The present invention is by constructing the serial of methods such as the sparse pyramid of image multiresolution, multiple scale detecting masterplate, masterplate dimension self-adaption context, the fusion of multiple dimensioned depth characteristic, the abundant excavation and fusion for realizing depth characteristic utilize, and can promote target detection performance.

Description

The object detection method and system of multi-scale feature fusion in a kind of image
Technical field
The present invention relates to a kind of method of object detection field in image, multiple features fusion in specifically a kind of image Object detection method and system.
Background technology
Target detection identification in image has extensive functional need in application scenarios such as intelligent video monitorings, It is also the more popular research direction of computer vision field.Existing image object detection method, because remaining following difficulty And challenge, testing result are also to be hoisted:(1) between similar target, there are larger more for the appearance features such as color and vein shape Sample, otherness.(2) similar target causes the structure feature of sample in class to there is large change there are the diversity of posture.Such as There is target upright, postures, the similar target of different postures such as drop to the ground, tilt will present different profiles, shape in reality Etc. structure features;(3) similar object height, width equidimension size and ratio constant interval are big.One side target physical height There will be larger distributed areas, on the other hand due to the different target of shooting distance also will present in the picture it is different big The dimensional variations such as small, ratio.(4) blocking for target can influence testing result.Its partial information is missing from after target is blocked, Increase detection difficulty.(5) target local environment background and the diversity of illumination cause flase drop to increase.Target appears in as outdoor When, such as urban road, entrance, background is often complex, and some complicated background such as trees, street lamp meeting and targets Generation is obscured, leads to flase drop.
Currently, more mature object detection method can be divided into two classes substantially:(1) it is based on background modeling.This method master It is used to detect moving target in video:The still image that will be inputted carries out scene cut, utilizes mixed Gauss model (GMM) Or the methods of motion detection, it is partitioned into its foreground and background, then extract special exercise target in the foreground.Such methods need to connect Continuous image sequence models to realize, the target detection being not suitable in single image.(2) it is based on statistical learning.Will own The image collection of known genera Mr. Yu's one kind target gets up to form training set, method (such as HOG, Harr based on an engineer Deng) to training set image zooming-out feature.The feature of extraction is generally the letters such as gray scale, texture, histogram of gradients, the edge of target Breath.Then pedestrian detection grader is built according to the feature database of a large amount of training sample.Grader is generally available SVM, The models such as Adaboost and neural network.
The object detection method performance based on statistical learning is more excellent in recent years in terms of comprehensive, the target inspection based on statistical learning Survey method can be divided into traditional artificial characteristic target detection method and depth characteristic machine learning object detection method.
Traditional artificial characteristic target detection method is primarily referred to as its feature for utilizing engineer, to carry out target target Modeling.The characterization method for showing outstanding engineer in recent years includes mainly:Pedro F.Felzenszwalb in 2010 etc. DPM (Deformable Part Model) method (the Object detection with discriminatively of proposition trained part-based models).ICF (the Integral Channel proposed in 2009 such as Piotr Doll á r Features), the ACF methods (Fast Feature Pyramids for Object Detection) proposed in 2014. Informed Harr methods (the Informed Haar-like Features of the propositions such as Shanshan Zhang in 2014 Improve Pedestrian Detection), the extraction more Harr features with characterization information are dedicated to be trained. Although the feature of these engineers achieves certain effect, but because manual features characterize scarce capacity, there are still detections The not high problem of precision.Due to feature learning and ability to express more powerful possessed by depth convolutional neural networks model, scheming As target classification context of detection obtains more and more extensive and successful application.The target detection operator on basis is R-CNN (Region-Convolutional Neural Network) model.Girshick in 2014 et al. proposes RCNN for general The detection of target is again later to propose fast-rcnn and faster-rcnn, improves based on deep learning target detection side The accuracy and speed of method.The methods of Yolo proposed and SSD in 2016, then realize the fast of single stage by thoughts such as Anchor Fast target detection.These target detections based on depth learning technology are all based on greatly single scale, fixed size context Depth characteristic, there are still depth characteristics, and insufficient problem, detection performance to be utilized to need to be further increased.
Invention content
For insufficient existing for the object detection method based on depth model, multiple dimensioned spy in a kind of image of present invention proposition The object detection method and system for levying fusion, by constructing the sparse pyramid of image multiresolution, multiple scale detecting masterplate, masterplate A series of creative methods such as dimension self-adaption context, the fusion of multiple dimensioned depth characteristic, realize the abundant excavation of depth characteristic It is utilized with fusion, promotes target detection performance.
According to the first aspect of the invention, a kind of object detection method of multi-scale feature fusion in image is provided, including:
S1:The scaling of different scale is carried out using picture to be detected, constructs an image pyramid;
S2:Based on the training image that described image pyramid obtains, it is big to obtain one group of covering using statistics clustering method The multiple scale detecting template of most sample sizes;
S3:On the basis of above-mentioned multiple scale detecting template, the target context structure of dimension self-adaption is carried out;
S4:According to target context structure as a result, carry out multiple dimensioned depth characteristic fusion, Analysis On Multi-scale Features figure is obtained;
S5:According to above-mentioned Analysis On Multi-scale Features figure, the non-maxima suppression based on soft-decision is carried out, is realized multiple dimensioned in image The target detection of Fusion Features.
Preferably, in the S1:In order to enable detection network utilizes the detection block of one or several limited sizes, it can be right In image different size of target completely can compactly carry out frame choosing sampling, need to original image carry out it is multiple dimensioned scaling, So that original object increases its by repeatedly scaling is detected probability of the complete compact frame choosing of frame, by picture to be trained by by than Example is scaled to the picture of L different resolution size, to the image pyramid of one resolution ratio of construction from high to low.Specifically , when training, for each original training image, the scaling of multiple scales is carried out, obtains L under different scale Image is for training.When test, for each image to be detected, the scaling of multiple scales is equally carried out, is obtained not With L image under scale for detecting, and to the testing result of this L image, amalgamation judging is carried out, final detection is obtained As a result.
Preferably, the multiple scale detecting mould that one group of most of sample size of covering is obtained using statistics clustering method Plate refers to:Based on K-medoids clustering methods, and using Jie Kade distances (Jaccard distance) as Cluster Assessment Index concentrates target to be clustered by different wide high level and the ratio of width to height training data, forms the width of one group of K cluster centre High ratio, the target template as the most wide high proportions of covering.
Preferably, the target context for carrying out dimension self-adaption is built, and refers to:By the output of CNN network convolutional layers The receptive field each put on characteristic pattern, as candidate target frame;The part that receptive field has more relative to pattern plate bolster is used as target The context of frame, the detection for being used for auxiliary mark identify.
It is highly preferred that it is described carry out dimension self-adaption target context structure, finally obtain a contextual information with Target scale changes and the detection model of variation, i.e.,:Small scaled target will obtain the contextual information of bigger, and large scale target Contextual information it is less, to meet different demands of the target to contextual information of different scale.
Preferably, described to carry out multiple dimensioned depth characteristic fusion, refer to:The characteristic pattern that CNN difference convolutional layers are exported, choosing Go out M layers to be merged, for constructing Analysis On Multi-scale Features pyramid.
It is highly preferred that described carry out multiple dimensioned depth characteristic fusion, specially:It is in the convolutional layer selected for M Last layer of CNN networks, the characteristic pattern output it up-sample it using deconvolution, it is made to be expanded to and last layer feature It after scheming same resolution sizes, does with last layer characteristic pattern and is added pixel-by-pixel, obtain the Analysis On Multi-scale Features figure of fusion adjacent two layers; Again and so on, deconvolution expands, is merged with more last layer characteristic pattern, until the fusion for all M layers of characteristic pattern for completing to select.
Preferably, the non-maxima suppression based on soft-decision refers to:
The maximum detection block of fiducial probability is first elected, other detection blocks and the maximum detection block meter of fiducial probability are passed through IOU (intersection over union) is calculated, then reduces its fiducial probability more than a certain threshold value;
After the maximum detection block of this fiducial probability is removed, then select the maximum detection of fiducial probability in remaining detection block Frame, and remaining detection block and the maximum detection block of fiducial probability are calculated into IOU, then its fiducial probability is dropped more than a certain threshold value It is low;
By the above continuous iteration, the detection block after screening to the end is obtained.
According to the second aspect of the invention, a kind of object detection system of multi-scale feature fusion in image is provided, including:
Image pyramid builds module:The scaling of different scale is carried out using picture to be detected, constructs an image gold word Tower;
Multiple scale detecting template builds module:The instruction that the described image pyramid of module obtains is built based on image pyramid Practice image, the multiple scale detecting template of one group of most of sample size of covering is obtained using statistics clustering method;
Target context builds module:In the multiple scale detecting template that above-mentioned multiple scale detecting template structure module obtains On the basis of, carry out the target context structure of dimension self-adaption;
Multiple dimensioned depth characteristic melts module:According to target context structure module as a result, carrying out multiple dimensioned depth characteristic Fusion, obtains Analysis On Multi-scale Features figure;
Module of target detection:The Analysis On Multi-scale Features figure for melting module according to above-mentioned multiple dimensioned depth characteristic sentenced based on soft Non-maxima suppression certainly realizes the target detection of multi-scale feature fusion in image.
Preferably, described image pyramid construction module, when training, for each original training image, into The scaling of the multiple scales of row obtains L image under different scale for training.When test, for each to be detected Image, equally carry out the scaling of multiple scales, obtain L image under different scale for detecting, and to this L image Testing result carries out amalgamation judging, obtains final testing result.
Preferably, the multiple scale detecting template builds module, which is based on K-medoids clustering methods, and utilizes Jie Kade distances (Jaccard distance) are used as Cluster Assessment index, concentrate target to press different wide high level training data And the ratio of width to height is clustered, and the ratio of width to height of one group of K cluster centre is formed, the target mould as the most wide high proportions of covering Plate.
Preferably, the target context builds module, on the characteristic pattern which export CNN network convolutional layers each The receptive field of point, as candidate target frame;The part that receptive field has more relative to pattern plate bolster, the i.e. context as target frame, It is identified for the detection of auxiliary mark.
Preferably, the multiple dimensioned depth characteristic melts module, the characteristic pattern which exports CNN difference convolutional layers, choosing Go out M layers to be merged, for constructing Analysis On Multi-scale Features pyramid.
It is highly preferred that the multiple dimensioned depth characteristic melts module, CNN is in the convolutional layer which selects M Last layer of network, the characteristic pattern output it up-sample it using deconvolution, it is made to be expanded to and last layer characteristic pattern It after same resolution sizes, does with last layer characteristic pattern and is added pixel-by-pixel, obtain the Analysis On Multi-scale Features figure of fusion adjacent two layers;Again And so on, deconvolution expands, is merged with more last layer characteristic pattern, until the fusion for all M layers of characteristic pattern for completing to select.
Preferably, the module of target detection first elects the maximum detection block of fiducial probability, is detected by others Frame calculates IOU (intersection over union) with the maximum detection block of fiducial probability, is then set more than a certain threshold value Believe that probability reduces;
After the maximum detection block of this fiducial probability is removed, then select the maximum detection of fiducial probability in remaining detection block Frame, and remaining detection block and the maximum detection block of fiducial probability are calculated into IOU, then its fiducial probability is dropped more than a certain threshold value It is low;
By the above continuous iteration, the detection block after screening to the end is obtained.
Compared with prior art, the invention has the advantages that:
According to the target identification of different scale to the difference of contextual information demand in the present invention, the structure in conjunction with CNN is special The concept of point and receptive field is modeled using the contextual information of dimension self-adaption, is identified for auxiliary mark.
In the present invention in such a way that K-medoids clustering methods optimize the selection of template, meets different templates and target is examined Survey model inspection effect.
Image pyramid solves the problems, such as that training picture needs fixed size, design to be obtained using the method for crop in the present invention Picture to fixed size is used to train.
Adequately utilize the feature of CNN different layers can be simultaneously by further feature and shallow-layer Fusion Features in the present invention Using the detailed information of the characterization ability and shallow-layer feature of further feature, the accuracy of detection of small scaled target is promoted.
Analysis On Multi-scale Features testing mechanism is established in the present invention, will again be merged after feature combination, is then examined again It surveys, finally merges the result of detection, realize Analysis On Multi-scale Features detection.
The non-maxima suppression based on soft-decision is utilized in the present invention, improves syncretizing effect.
To sum up, the present invention is by comprehensively utilizing multi-resolution pyramid, multiple dimensioned template cluster, dimension self-adaption context The technologies such as information, the fusion of multiple dimensioned depth characteristic, soft-decision non-maxima suppression, enhance feature learning and the expression of image object Ability, effectively promotes the precision of the target detections such as pedestrian, at the same preferably solve in the prior art target in small scale, long distance From when, the problems such as intensive target detection.
Description of the drawings
Upon reading the detailed description of non-limiting embodiments with reference to the following drawings, other feature of the invention, Objects and advantages will become more apparent upon:
Fig. 1 a, Fig. 1 b are that image pyramid builds flow chart in one embodiment of the invention;
Fig. 2 is that multiple dimensioned template obtains flow chart in one embodiment of the invention;
Fig. 3 is multi-scale feature fusion implementation flow chart in one embodiment of the invention;
Fig. 4 is the non-maxima suppression implementation flow chart of soft-decision in one embodiment of the invention.
Specific implementation mode
With reference to specific embodiment, the present invention is described in detail.Following embodiment will be helpful to the technology of this field Personnel further understand the present invention, but the invention is not limited in any way.It should be pointed out that the ordinary skill of this field For personnel, without departing from the inventive concept of the premise, various modifications and improvements can be made.These belong to the present invention Protection domain.
Existing object detection method can identify certain larger-size targets well, but size is larger Target small part is only accounted in actual life, for distance target farther out, testing result is not fine.Target Detection has following characteristics, by taking pedestrian target as an example:
Feature one, the diversity of scale.Have old man, a middle-aged person, child in one side pedestrian, physics height there will be compared with Big distributed area.On the other hand due to the difference of shooting distance, for the same pedestrian, higher, the remoter bat of distance It takes the photograph and a little takes that pedestrian its pixel is lower, and height is lower;And pedestrian's pixel that lower, closer shooting point takes is higher, it is high Degree is closer to its true height.The pedestrian detection effect that existing method is more than height 100 pixels is preferable, but for remote Distance, the low pedestrian detection effect of height are bad.All it is often desirable when pedestrian detection is applied in vehicle DAS (Driver Assistant System) The pedestrian that system detectio distant location occurs reminds driver, therefore solves the problems, such as that the remote pedestrian of detection is also one urgent The demand cut.
Feature two is blocked.The pedestrian taken in reality can have circumstance of occlusion, on the one hand can there is the feelings that crowd clusters round Condition.When multiple people walk together, look over always that someone will be blocked a part for body from all angles;Another aspect pedestrian May a part for body be sheltered from by the object in environment, such as number, vehicle, house.Its body after pedestrian is blocked at this time Body portion information is missing from, and the result of missing inspection can be then caused for the detector based on human body integrity profile feature.
The present invention proposes that object detection method based on multi-feature fusion is preferably solved from the above problem in image Target of having determined small scale, it is remote when, the problems such as intensive target detection.It is tired that the present invention is based on target detections present in reality It is difficult, it is proposed that it is merged for multiscale target self-adapting detecting, and using multilayer receptive field difference based on pyramid, and Using soft-decision non-maxima suppression for intensive target detection and then the effect of promotion target detection.Wherein:
The first step, the scaling of different scale is carried out using picture to be detected, and construction includes the figure from the multiple resolution ratio of size As pyramid;
Second step clusters sample according to the wide high size of training sample, passes through cluster using statistics clustering method Form K cluster centre, each cluster centre forms one using high mean value not lend oneself to worry and anxiety in this as the detection template of scale;K cluster The template group for forming one group of K different scale, as detector target template group;
Third walks, the target context structure of dimension self-adaption.
To the detection template frame of each scale, all vertically left and right directions uniform expansions expands to same convolutional neural networks The receptive field size that the characteristic pattern that (CNN, Convolutional Neural Network) is finally exported each is put is identical, extension Part forms the contextual information for including target, and the context size for including is adaptive with template size.
4th step, multi-scale feature fusion.
By the way that the more a convolutional layer characteristic patterns of CNN are added fusion pixel-by-pixel in same resolution ratio, target inspection is formed Survey the multiple dimensioned depth characteristic needed for identification.
5th step, the non-maxima suppression based on soft-decision.
Confidence level by reducing detection block replaces directly deleting detection block, then is sieved to detection block by continuous iteration Choosing.
Meanwhile the present invention constructs an image object detecting system, system synthesis profit by integrated above method step With multi-resolution pyramid, multiple dimensioned template cluster, the fusion of dimension self-adaption contextual information, multiple dimensioned depth characteristic, soft sentence Certainly the methods of non-maxima suppression enhances the feature learning and expression ability of image object, effectively promotes the target detections such as pedestrian Precision.
Specifically, object detection system based on multi-feature fusion in a kind of image, including:
Image pyramid builds module:The scaling of different scale is carried out using picture to be detected, constructs an image gold word Tower;
Multiple scale detecting template builds module:The instruction that the described image pyramid of module obtains is built based on image pyramid Practice image, the multiple scale detecting template of one group of most of sample size of covering is obtained using statistics clustering method;
Target context builds module:In the multiple scale detecting template that above-mentioned multiple scale detecting template structure module obtains On the basis of, carry out the target context structure of dimension self-adaption;
Multiple dimensioned depth characteristic melts module:According to target context structure module as a result, carrying out multiple dimensioned depth characteristic Fusion, obtains Analysis On Multi-scale Features figure;
Module of target detection:The Analysis On Multi-scale Features figure for melting module according to above-mentioned multiple dimensioned depth characteristic sentenced based on soft Non-maxima suppression certainly realizes the target detection of multi-scale feature fusion in image.
The above method of the present invention and system are described in detail by taking pedestrian detection as an example below, especially above-mentioned five Implementation involved in a step/module.
One, the scaling that different scale is carried out using picture to be detected constructs image pyramid.
In order to enable detection network utilizes the detection block of one or several limited sizes, it can be to different size of in image Target completely can compactly carry out frame choosing sampling, need to carry out multiple dimensioned scaling, so that original object is passed through to original image Repeatedly scaling increases its detected frame probability that completely compact frame selects.Will picture be trained by be scaled out into L difference The picture of resolution sizes, to the image pyramid of one resolution ratio of construction from high to low.
When the training of pedestrian's detection model and test, initial data is zoomed in and out using the principle of image pyramid Processing, as shown in attached drawing 1a, 1b.
It is exactly specifically, for each image to be trained, to carry out the contracting of 0.5X, 1X and 2X when training It puts, obtains the image under different scale for training, as shown in fig. la.When test, for each figure to be detected Picture carries out the scaling of 0.5X, 1X and 2X, obtains the image under different scale for detecting, and to the detection of three scale images As a result, carrying out amalgamation judging, final testing result is obtained, as shown in figure ib.
In the present embodiment, the gradient of backpropagation is calculated for convenience, so the picture of input needs fixed scale, Such as the size of all pictures is all 640*480 pixels in Caltech.After random scaling, training picture scale has occurred The picture of variation, 640*480 sizes becomes 320*240 after 0.5X scalings, and scale then becomes after passing through 2X scalings For 1280*960.In order to obtain the training picture of unified scale, when training, ruler is cut out from the picture by scaling Degree is the picture of 640*480, and concrete operations flow is as shown in Figure 1a.I.e. for the picture that size is 320*240, image completion is used To 640*480, and for the picture that size is 1280*960, then therefrom random cropping goes out the small figure of 640*480, then by this three Kind picture is simultaneously for training, as shown in fig. la.This way can be effectively increased training samples number, promote deep learning etc. The performance of data-driven method.
Two, best detector template group is obtained using statistics clustering method.
The present invention obtains best detector template group using statistics clustering method, has been demarcated in extraction training image Frame progress K-medoids clustering methods are obtained not same group of (K) multiple dimensioned template by target rectangle frame.Here scale is The size (long and wide) and ratio (the ratio of width to height) of feeling the pulse with the finger-tip mark frame.K multiple dimensioned templates are obtained using statistical method, had made it both The differentiation that most training sample scales take into account sample size in class again can be covered.To consider when choosing template The differentiation of sample size, avoids only being caused that matching template is very few, is difficult to meet multiple dimensioned mesh with single or a small number of scale templates Mark and the accurate match cognization problem of template, but statistical data concentrates wide high and ratio the distribution situation of target sample, by poly- Class forms multiple cluster centres, each cluster centre forms a scale (average the ratio of width to height but not limited to this) template, to root One group of multiple dimensioned template is formed according to sample size distribution;Meanwhile the quantity by limiting cluster centre, to avoid center excessive Lead to that the template training sample of a certain scale is very little, detector cannot train up problem.K- can be used in clustering method Medoids etc. is realized and is carried out cluster and the selection of multiple dimensioned template to wide high and the ratio of width to height of sample that data are concentrated.
When specific operation, can first definition template scale, cover the pedestrian target of different scale.Generally vertical pedestrian's The ratio of width to height is generally 1:3, according to this experience and the general height of pedestrian target in Caltech data sets is combined to be distributed, it can be with It manually goes to select typical template scale, such as 30*90,50*150, but this mode chosen by hand does not have not only Versatility, and very maximum probability cannot choose most suitable template.This is because:
First, in different data sets, due to the difference of photo resolution and monitoring visual angle, the distribution of pedestrian level is Different, this method for choosing template by hand does not have versatility;
Secondly, in true monitoring scene, because of situations such as the appearance of pedestrian's posture, camera angle, block information, Cause the ratio of width to height of pedestrian to vary widely, the template chosen is caused not have typicalness.Meanwhile when choosing template Consider the distribution situation of object height in data set, if the corresponding training sample of the template of a certain scale is very little, can cause The corresponding detector of this template cannot be trained adequately.
Therefore, propose that the method clustered by K-medoids is high to the width of the sample in pedestrian's data set in the present embodiment Clustered, using statistical method obtain the multiple dimensioned templates of K (the present embodiment K=32, certainly, in other embodiments Can also be other numbers).
As shown in Fig. 2, being based on K-medoids clustering methods, pedestrian level and width is concentrated to cluster training data. Using Jie Kade distances (Jaccard distance) as the Cluster Assessment index of K-medoids, i.e.,:
d(si,sj)=1-J (si,sj)
Wherein, si=(hi,wi);sj=(hj,wj) indicating two different pedestrian's frames in data set, h, w indicate capable respectively The height and the width of people's frame, J indicate the Jie Kade similarities (Jaccard similarity coefficient) of standard,
Three, the target context structure of dimension self-adaption.
The present invention using dimension self-adaption target context build, on the one hand, small scaled target because useful information compared with Less, more contextual informations are generally required to assist in identifying;And large scale target does not often need a large amount of context then.Separately On the one hand, since each characteristic point of the full articulamentums of CNN corresponds to a fixed receptive field of scale in artwork, receptive field can be based on To construct the context of different templates.
Specifically, the pattern plate bolster of the K different scale returned out to K-medoids clustering procedures, vertically controls and is expanded It opens up and to expand to the receptive field each put with the full articulamentums of CNN in the same size.Because receptive field size is fixed, then small scale Pattern plate bolster need to do the extension of large scale, thus the context of large scale can be obtained;And large scale template is then on the contrary, obtain The context obtained is smaller.Based on this method, the contextual information with pattern plate bolster dimension self-adaption is can get, is known for auxiliary mark Not.
Four, multiple dimensioned depth characteristic fusion.
In view of the characteristic pattern of CNN each convolutional layer output, all containing the useful feature on different scale, shallow-layer it is defeated Go out to usually contain more local detail features, and high-rise feature usually contains more of overall importance and Semantic information.It is right The feature for the different scale that these different layers obtain carries out fusion utilization and can be obtained more rich feature representation.Based on the think of Think that the present invention devises a kind of Multiscale Fusion method:
As shown in figure 3, M main convolutional layer (often rearmost M layer in CNN networks, this implementation for CNN Example is the characteristic pattern of 3) output, shallow from being deep to, and first carries out deconvolution up-sampling to the characteristic pattern of bottommost layer each with deconvolution, So that it is transformed in the same last layer output same resolution-scale of characteristic pattern, then does the characteristic pattern of this two layers same scale It is added pixel-by-pixel, the Analysis On Multi-scale Features figure after being merged;Analogize accordingly again, realize all M layers of fusions, obtains M layers of fusion The Analysis On Multi-scale Features figure of feature.Such as input picture passes through after Resnet networks, by res5 layers of output result deconvolution one It is secondary to obtain resolution ratio identical with res4, and with the results added of res4, then its deconvolution once obtained identical with res3 Resolution ratio simultaneously obtains final result and is tested with the results added of res3 progress Fusion Features.
Five, the non-maxima suppression of soft-decision.
In pervious method, final testing result is obtained with non-maxima suppression method fusion detection frame.Traditional Non-maxima suppression method be it is a kind of volume method is sentenced based on the hard of greed, correct detection is may result in during fusion Frame is suppressed, especially when IOU (Intersection of Units, the overlapping region between two rectangle frames and combined region Ratio) threshold value selection it is inappropriate when.
As shown in figure 4, the non-maxima suppression Integral Thought of soft-decision is exactly in the present invention:By reducing setting for detection block Reliability replaces directly deleting detection block.
In the present embodiment, concrete operations are:The maximum detection block of fiducial probability is first elected, is then detected by others The frame of frame and maximum fiducial probability calculates IOU, then reduces its fiducial probability more than a certain threshold value.Then this maximum frame is removed The maximum fiducial probability frame in selecting remaining detection block afterwards, and remaining detection block and the frame of maximum fiducial probability are calculated into IOU, Then its fiducial probability is reduced more than a certain threshold value, by continuous iteration, obtains the detection block after screening to the end.
The mode that linear weighted function may be used in the present embodiment reduces the score of detection block according to the value of iou:
In the formula, M indicates the maximum detection block of fiducial probability, biIndicate i-th of detection block, NiIndicate that IOU threshold values (can For empirical value or preset value), iou (M, bi) indicate M and biIOU values.
Therefore, soft-decision of the present invention is more preferable for dense population detection result.
Some details of the invention implemented and preferred feature have been carried out in detail above by for pedestrian detection Thin description.The present invention can also be applied to target detection in other images, however it is not limited to pedestrian detection, the behaviour of other target detections Make similar to the above embodiments, no longer separately illustrates herein.
To sum up, the present invention is by constructing the sparse pyramid of image multiresolution, the multiple scale detecting mould based on cluster optimization A series of creative methods such as version, the fusion of masterplate dimension self-adaption context, multiple dimensioned depth characteristic, realize filling for depth characteristic Divide to excavate and merge and utilize, obtains the promotion of target detection performance.
Specific embodiments of the present invention are described above.It is to be appreciated that the invention is not limited in above-mentioned Particular implementation, those skilled in the art can make various deformations or amendments within the scope of the claims, this not shadow Ring the substantive content of the present invention.

Claims (14)

1. the object detection method of multi-scale feature fusion in a kind of image, which is characterized in that including:
S1:The scaling of different scale is carried out using picture to be detected, constructs an image pyramid;
S2:Based on the training image that described image pyramid obtains, most of one group of covering is obtained using statistics clustering method The multiple scale detecting template of sample size;
S3:On the basis of above-mentioned multiple scale detecting template, the target context structure of dimension self-adaption is carried out;
S4:According to target context structure as a result, carry out multiple dimensioned depth characteristic fusion, Analysis On Multi-scale Features figure is obtained;
S5:According to above-mentioned Analysis On Multi-scale Features figure, the non-maxima suppression based on soft-decision is carried out, realizes Analysis On Multi-scale Features in image The target detection of fusion.
2. the object detection method of multi-scale feature fusion in image according to claim 1, which is characterized in that the S1 In:In order to enable detection network utilizes the detection block of one or several limited sizes, it can be to different size of target in image Frame choosing sampling completely can be compactly carried out, needs to carry out multiple dimensioned scaling to original image, make original object by multiple Scaling increases it and is detected the frame probability that completely compact frame selects, by original image by being scaled out into L different resolution The picture of size, to the image pyramid of one resolution ratio of construction from high to low.
3. the object detection method of multi-scale feature fusion in image according to claim 1, which is characterized in that the profit With the multiple scale detecting template of statistics clustering method one group of most of sample size of covering of acquisition, refer to:Based on K-medoids Clustering method, and Cluster Assessment index is used as using Jie Kade distances, to training data concentrate target press different wide high level and The ratio of width to height is clustered, and the ratio of width to height of one group of K cluster centre is formed, the target mould as the most wide high proportions of covering Plate.
4. the object detection method of multi-scale feature fusion in image according to claim 1, which is characterized in that it is described into The target context of row dimension self-adaption is built, and refers to:The impression that will each be put on the characteristic pattern of CNN network convolutional layers output Open country, as candidate target frame;The part that receptive field has more relative to pattern plate bolster, the i.e. context as target frame, for assisting The detection of target identifies.
5. the object detection method of multi-scale feature fusion in image according to claim 4, which is characterized in that it is described into The target context of row dimension self-adaption is built, and finally obtains the detection that a contextual information changes with target scale variation Model, i.e.,:Small scaled target will obtain the contextual information of bigger, and the contextual information of large scale target is less, to full Different demands of the target of sufficient different scale to contextual information.
6. the object detection method of multi-scale feature fusion in image according to claim 1, which is characterized in that it is described into The multiple dimensioned depth characteristic of row merges, and refers to:The characteristic pattern that CNN difference convolutional layers are exported, selects M layers and is merged, and is used for structure Make Analysis On Multi-scale Features pyramid.
7. the object detection method of multi-scale feature fusion in image according to claim 6, which is characterized in that it is described into The multiple dimensioned depth characteristic fusion of row, specially:In last layer of CNN networks in the convolutional layer selected for M, output it Characteristic pattern it is up-sampled using deconvolution, it is and upper after so that it is expanded to resolution sizes same as last layer characteristic pattern One layer of characteristic pattern does and is added pixel-by-pixel, obtains the Analysis On Multi-scale Features figure of fusion adjacent two layers;Again and so on, deconvolution expansion, It is merged with more last layer characteristic pattern, until the fusion for all M layers of characteristic pattern for completing to select.
8. according to the object detection method of multi-scale feature fusion in claim 1-7 any one of them images, feature exists In the non-maxima suppression based on soft-decision refers to:
The maximum detection block of fiducial probability is first elected, is calculated by other detection blocks and the maximum detection block of fiducial probability IOU then reduces its fiducial probability more than a certain threshold value;
After the maximum detection block of this fiducial probability is removed, then the maximum detection block of fiducial probability in remaining detection block is selected, and Remaining detection block and the maximum detection block of fiducial probability are calculated into IOU, then reduce its fiducial probability more than a certain threshold value;
By the above continuous iteration, the detection block after screening to the end is obtained.
9. the object detection system of multi-scale feature fusion in a kind of image, which is characterized in that including:
Image pyramid builds module:The scaling of different scale is carried out using picture to be detected, one image pyramid of construction is more Size measurement template builds module:The training image that the described image pyramid of module obtains is built based on image pyramid, profit The multiple scale detecting template of one group of most of sample size of covering is obtained with statistics clustering method;
Target context builds module:On the basis for the multiple scale detecting template that above-mentioned multiple scale detecting template structure module obtains On, carry out the target context structure of dimension self-adaption;
Multiple dimensioned depth characteristic melts module:According to target context build module as a result, carry out multiple dimensioned depth characteristic fusion, Obtain Analysis On Multi-scale Features figure;
Module of target detection:The Analysis On Multi-scale Features figure for melting module according to above-mentioned multiple dimensioned depth characteristic, carries out based on soft-decision Non-maxima suppression realizes the target detection of multi-scale feature fusion in image.
10. the object detection system of multi-scale feature fusion in a kind of image according to claim 9, which is characterized in that The multiple scale detecting template builds module, which is based on K-medoids clustering methods, and using Jie Kade distances as poly- Class evaluation index concentrates target to be clustered by different wide high level and the ratio of width to height training data, is formed in one group of K cluster The ratio of width to height of the heart, the target template as the most wide high proportions of covering.
11. the object detection system of multi-scale feature fusion in a kind of image according to claim 9, which is characterized in that The target context builds module, the receptive field which will each put on characteristic pattern that CNN network convolutional layers export, as Candidate target frame;The part that receptive field has more relative to pattern plate bolster, the i.e. context as target frame are used for the inspection of auxiliary mark Survey identification.
12. the object detection system of multi-scale feature fusion in a kind of image according to claim 9, which is characterized in that The multiple dimensioned depth characteristic melts module, and the characteristic pattern which exports CNN difference convolutional layers is selected M layers and merged, and uses In construction Analysis On Multi-scale Features pyramid.
13. the object detection method of multi-scale feature fusion in image according to claim 12, which is characterized in that described Multiple dimensioned depth characteristic melts module, in last layer of CNN networks in the convolutional layer which selects M, outputs it Characteristic pattern it is up-sampled using deconvolution, it is and upper after so that it is expanded to resolution sizes same as last layer characteristic pattern One layer of characteristic pattern does and is added pixel-by-pixel, obtains the Analysis On Multi-scale Features figure of fusion adjacent two layers;Again and so on, deconvolution expansion, It is merged with more last layer characteristic pattern, until the fusion for all M layers of characteristic pattern for completing to select.
14. according to the object detection system of multi-scale feature fusion in a kind of image of claim 9-13 any one of them, Be characterized in that, the module of target detection, first elect the maximum detection block of fiducial probability, by other detection blocks with set Believe that the detection block of maximum probability calculates IOU, then reduces its fiducial probability more than a certain threshold value;
After the maximum detection block of this fiducial probability is removed, then the maximum detection block of fiducial probability in remaining detection block is selected, and Remaining detection block and the maximum detection block of fiducial probability are calculated into IOU, then reduce its fiducial probability more than a certain threshold value;
By the above continuous iteration, the detection block after screening to the end is obtained.
CN201810065807.1A 2018-01-23 2018-01-23 The object detection method and system of multi-scale feature fusion in a kind of image Pending CN108460403A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810065807.1A CN108460403A (en) 2018-01-23 2018-01-23 The object detection method and system of multi-scale feature fusion in a kind of image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810065807.1A CN108460403A (en) 2018-01-23 2018-01-23 The object detection method and system of multi-scale feature fusion in a kind of image

Publications (1)

Publication Number Publication Date
CN108460403A true CN108460403A (en) 2018-08-28

Family

ID=63238616

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810065807.1A Pending CN108460403A (en) 2018-01-23 2018-01-23 The object detection method and system of multi-scale feature fusion in a kind of image

Country Status (1)

Country Link
CN (1) CN108460403A (en)

Cited By (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241902A (en) * 2018-08-30 2019-01-18 北京航空航天大学 A kind of landslide detection method based on multi-scale feature fusion
CN109255352A (en) * 2018-09-07 2019-01-22 北京旷视科技有限公司 Object detection method, apparatus and system
CN109255320A (en) * 2018-09-03 2019-01-22 电子科技大学 A kind of improved non-maxima suppression method
CN109308458A (en) * 2018-08-31 2019-02-05 电子科技大学 A method of small target deteection precision is promoted based on characteristic spectrum change of scale
CN109343701A (en) * 2018-09-03 2019-02-15 电子科技大学 A kind of intelligent human-machine interaction method based on dynamic hand gesture recognition
CN109363614A (en) * 2018-08-29 2019-02-22 合肥德易电子有限公司 Intelligent integral robot cavity mirror system with high definition video enhancing processing function
CN109472193A (en) * 2018-09-21 2019-03-15 北京飞搜科技有限公司 Method for detecting human face and device
CN109522958A (en) * 2018-11-16 2019-03-26 中山大学 Based on the depth convolutional neural networks object detection method merged across scale feature
CN109598290A (en) * 2018-11-22 2019-04-09 上海交通大学 A kind of image small target detecting method combined based on hierarchical detection
CN109615016A (en) * 2018-12-20 2019-04-12 北京理工大学 A kind of object detection method of the convolutional neural networks based on pyramid input gain
CN109635717A (en) * 2018-12-10 2019-04-16 天津工业大学 A kind of mining pedestrian detection method based on deep learning
CN109658412A (en) * 2018-11-30 2019-04-19 湖南视比特机器人有限公司 It is a kind of towards de-stacking sorting packing case quickly identify dividing method
CN109711427A (en) * 2018-11-19 2019-05-03 深圳市华尊科技股份有限公司 Object detection method and Related product
CN109815868A (en) * 2019-01-15 2019-05-28 腾讯科技(深圳)有限公司 A kind of image object detection method, device and storage medium
CN109816671A (en) * 2019-01-31 2019-05-28 深兰科技(上海)有限公司 A kind of object detection method, device and storage medium
CN109816036A (en) * 2019-01-31 2019-05-28 北京字节跳动网络技术有限公司 Image processing method and device
CN109886147A (en) * 2019-01-29 2019-06-14 电子科技大学 A kind of more attribute detection methods of vehicle based on the study of single network multiple-task
CN109978035A (en) * 2019-03-18 2019-07-05 西安电子科技大学 Pedestrian detection method based on improved k-means and loss function
CN110046550A (en) * 2019-03-14 2019-07-23 中山大学 Pedestrian's Attribute Recognition system and method based on multilayer feature study
CN110110599A (en) * 2019-04-03 2019-08-09 天津大学 A kind of Remote Sensing Target detection method based on multi-scale feature fusion
CN110189307A (en) * 2019-05-14 2019-08-30 慧影医疗科技(北京)有限公司 A kind of pulmonary nodule detection method and system based on multi-model fusion
CN110321864A (en) * 2019-07-09 2019-10-11 西北工业大学 Remote sensing images explanatory note generation method based on multiple dimensioned cutting mechanism
CN110503098A (en) * 2019-08-29 2019-11-26 西安电子科技大学 A kind of object detection method and equipment of quick real-time lightweight
CN110738208A (en) * 2019-10-08 2020-01-31 创新奇智(重庆)科技有限公司 efficient scale-normalized target detection training method
CN110796039A (en) * 2019-10-15 2020-02-14 北京达佳互联信息技术有限公司 Face flaw detection method and device, electronic equipment and storage medium
CN110796360A (en) * 2019-10-24 2020-02-14 吉林化工学院 Fixed traffic detection source multi-scale data fusion method
CN110909615A (en) * 2019-10-28 2020-03-24 西安交通大学 Target detection method based on multi-scale input mixed perception neural network
CN110991359A (en) * 2019-12-06 2020-04-10 重庆市地理信息和遥感应用中心(重庆市测绘产品质量检验测试中心) Satellite image target detection method based on multi-scale depth convolution neural network
CN111047655A (en) * 2020-01-10 2020-04-21 北京盛开互动科技有限公司 High-definition camera cloth defect detection method based on convolutional neural network
CN111127458A (en) * 2019-12-27 2020-05-08 深圳力维智联技术有限公司 Target detection method and device based on image pyramid and storage medium
CN111144248A (en) * 2019-12-16 2020-05-12 上海交通大学 People counting method, system and medium based on ST-FHCD network model
CN111191621A (en) * 2020-01-03 2020-05-22 北京同方软件有限公司 Rapid and accurate identification method for multi-scale target under large-focus monitoring scene
CN111241893A (en) * 2018-11-29 2020-06-05 阿里巴巴集团控股有限公司 Identification recognition method, device and system
CN111260615A (en) * 2020-01-13 2020-06-09 重庆交通大学 Laser and machine vision fusion-based method for detecting apparent diseases of unmanned aerial vehicle bridge
CN111368600A (en) * 2018-12-26 2020-07-03 北京眼神智能科技有限公司 Method and device for detecting and identifying remote sensing image target, readable storage medium and equipment
CN111582012A (en) * 2019-12-24 2020-08-25 珠海大横琴科技发展有限公司 Method and device for detecting small target ship
CN111738036A (en) * 2019-03-25 2020-10-02 北京四维图新科技股份有限公司 Image processing method, device, equipment and storage medium
CN111767934A (en) * 2019-10-31 2020-10-13 杭州海康威视数字技术股份有限公司 Image identification method and device and electronic equipment
CN111986255A (en) * 2020-09-07 2020-11-24 北京凌云光技术集团有限责任公司 Multi-scale anchor initialization method and device of image detection model
CN112418208A (en) * 2020-12-11 2021-02-26 华中科技大学 Tiny-YOLO v 3-based weld film character recognition method
CN112528976A (en) * 2021-02-09 2021-03-19 北京世纪好未来教育科技有限公司 Text detection model generation method and text detection method
CN112560732A (en) * 2020-12-22 2021-03-26 电子科技大学中山学院 Multi-scale feature extraction network and feature extraction method thereof
CN112580721A (en) * 2020-12-19 2021-03-30 北京联合大学 Target key point detection method based on multi-resolution feature fusion
CN112906677A (en) * 2021-05-06 2021-06-04 南京信息工程大学 Pedestrian target detection and re-identification method based on improved SSD (solid State disk) network
CN113222916A (en) * 2021-04-28 2021-08-06 北京百度网讯科技有限公司 Method, apparatus, device and medium for detecting image using target detection model
CN113221752A (en) * 2021-05-13 2021-08-06 北京惠朗时代科技有限公司 Multi-template matching-based multi-scale character accurate identification method
CN113313213A (en) * 2021-07-28 2021-08-27 中国航空油料集团有限公司 Data set processing method for accelerating training of target detection algorithm
CN113688709A (en) * 2021-08-17 2021-11-23 长江大学 Intelligent detection method, system, terminal and medium for wearing safety helmet
CN114049488A (en) * 2022-01-07 2022-02-15 济南和普威视光电技术有限公司 Multi-dimensional information fusion remote weak and small target detection method and terminal
CN114049627A (en) * 2022-01-11 2022-02-15 浙江华是科技股份有限公司 Ship board detection method, device and system capable of intelligently selecting scaling dimension
CN111191621B (en) * 2020-01-03 2024-06-28 北京同方软件有限公司 Rapid and accurate identification method for multi-scale target in large-focal-length monitoring scene

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107169421A (en) * 2017-04-20 2017-09-15 华南理工大学 A kind of car steering scene objects detection method based on depth convolutional neural networks
CN107609465A (en) * 2017-07-25 2018-01-19 北京联合大学 A kind of multi-dimension testing method for Face datection

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107169421A (en) * 2017-04-20 2017-09-15 华南理工大学 A kind of car steering scene objects detection method based on depth convolutional neural networks
CN107609465A (en) * 2017-07-25 2018-01-19 北京联合大学 A kind of multi-dimension testing method for Face datection

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
JOSEPH REDMON 等: "YOLO9000: Better, Faster, Stronger", 《2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *
NAVANEETH BODLA 等: "Soft-NMS – Improving Object Detection With One Line of Code", 《2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV)》 *
PEIYUN HU 等: "Finding Tiny Faces", 《2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *
TSUNG-YI LIN 等: "Feature Pyramid Networks for Object Detection", 《2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *

Cited By (77)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109363614A (en) * 2018-08-29 2019-02-22 合肥德易电子有限公司 Intelligent integral robot cavity mirror system with high definition video enhancing processing function
CN109363614B (en) * 2018-08-29 2020-09-01 合肥德易电子有限公司 Intelligent integrated robot cavity mirror system with high-definition video enhancement processing function
CN109241902A (en) * 2018-08-30 2019-01-18 北京航空航天大学 A kind of landslide detection method based on multi-scale feature fusion
CN109308458B (en) * 2018-08-31 2022-03-15 电子科技大学 Method for improving small target detection precision based on characteristic spectrum scale transformation
CN109308458A (en) * 2018-08-31 2019-02-05 电子科技大学 A method of small target deteection precision is promoted based on characteristic spectrum change of scale
CN109343701A (en) * 2018-09-03 2019-02-15 电子科技大学 A kind of intelligent human-machine interaction method based on dynamic hand gesture recognition
CN109255320B (en) * 2018-09-03 2020-09-25 电子科技大学 Improved non-maximum suppression method
CN109255320A (en) * 2018-09-03 2019-01-22 电子科技大学 A kind of improved non-maxima suppression method
CN109255352A (en) * 2018-09-07 2019-01-22 北京旷视科技有限公司 Object detection method, apparatus and system
CN109255352B (en) * 2018-09-07 2021-06-22 北京旷视科技有限公司 Target detection method, device and system
CN109472193A (en) * 2018-09-21 2019-03-15 北京飞搜科技有限公司 Method for detecting human face and device
CN109522958A (en) * 2018-11-16 2019-03-26 中山大学 Based on the depth convolutional neural networks object detection method merged across scale feature
CN109711427A (en) * 2018-11-19 2019-05-03 深圳市华尊科技股份有限公司 Object detection method and Related product
CN109598290A (en) * 2018-11-22 2019-04-09 上海交通大学 A kind of image small target detecting method combined based on hierarchical detection
CN111241893A (en) * 2018-11-29 2020-06-05 阿里巴巴集团控股有限公司 Identification recognition method, device and system
CN111241893B (en) * 2018-11-29 2023-06-16 阿里巴巴集团控股有限公司 Identification recognition method, device and system
CN109658412A (en) * 2018-11-30 2019-04-19 湖南视比特机器人有限公司 It is a kind of towards de-stacking sorting packing case quickly identify dividing method
CN109635717A (en) * 2018-12-10 2019-04-16 天津工业大学 A kind of mining pedestrian detection method based on deep learning
CN109615016B (en) * 2018-12-20 2021-06-22 北京理工大学 Target detection method of convolutional neural network based on pyramid input gain
CN109615016A (en) * 2018-12-20 2019-04-12 北京理工大学 A kind of object detection method of the convolutional neural networks based on pyramid input gain
CN111368600A (en) * 2018-12-26 2020-07-03 北京眼神智能科技有限公司 Method and device for detecting and identifying remote sensing image target, readable storage medium and equipment
CN111368600B (en) * 2018-12-26 2023-10-31 北京眼神智能科技有限公司 Remote sensing image target detection and identification method and device, readable storage medium and equipment
CN109815868B (en) * 2019-01-15 2022-02-01 腾讯科技(深圳)有限公司 Image target detection method and device and storage medium
CN109815868A (en) * 2019-01-15 2019-05-28 腾讯科技(深圳)有限公司 A kind of image object detection method, device and storage medium
CN109886147A (en) * 2019-01-29 2019-06-14 电子科技大学 A kind of more attribute detection methods of vehicle based on the study of single network multiple-task
CN109816671B (en) * 2019-01-31 2021-09-24 深兰科技(上海)有限公司 Target detection method, device and storage medium
CN109816036B (en) * 2019-01-31 2021-08-27 北京字节跳动网络技术有限公司 Image processing method and device
CN109816036A (en) * 2019-01-31 2019-05-28 北京字节跳动网络技术有限公司 Image processing method and device
CN109816671A (en) * 2019-01-31 2019-05-28 深兰科技(上海)有限公司 A kind of object detection method, device and storage medium
CN110046550A (en) * 2019-03-14 2019-07-23 中山大学 Pedestrian's Attribute Recognition system and method based on multilayer feature study
CN110046550B (en) * 2019-03-14 2021-07-13 中山大学 Pedestrian attribute identification system and method based on multilayer feature learning
CN109978035A (en) * 2019-03-18 2019-07-05 西安电子科技大学 Pedestrian detection method based on improved k-means and loss function
CN109978035B (en) * 2019-03-18 2021-04-02 西安电子科技大学 Pedestrian detection method based on improved k-means and loss function
CN111738036B (en) * 2019-03-25 2023-09-29 北京四维图新科技股份有限公司 Image processing method, device, equipment and storage medium
CN111738036A (en) * 2019-03-25 2020-10-02 北京四维图新科技股份有限公司 Image processing method, device, equipment and storage medium
CN110110599A (en) * 2019-04-03 2019-08-09 天津大学 A kind of Remote Sensing Target detection method based on multi-scale feature fusion
CN110110599B (en) * 2019-04-03 2023-05-09 天津大学 Remote sensing image target detection method based on multi-scale feature fusion
CN110189307A (en) * 2019-05-14 2019-08-30 慧影医疗科技(北京)有限公司 A kind of pulmonary nodule detection method and system based on multi-model fusion
CN110321864A (en) * 2019-07-09 2019-10-11 西北工业大学 Remote sensing images explanatory note generation method based on multiple dimensioned cutting mechanism
CN110503098A (en) * 2019-08-29 2019-11-26 西安电子科技大学 A kind of object detection method and equipment of quick real-time lightweight
CN110738208A (en) * 2019-10-08 2020-01-31 创新奇智(重庆)科技有限公司 efficient scale-normalized target detection training method
CN110796039A (en) * 2019-10-15 2020-02-14 北京达佳互联信息技术有限公司 Face flaw detection method and device, electronic equipment and storage medium
CN110796360A (en) * 2019-10-24 2020-02-14 吉林化工学院 Fixed traffic detection source multi-scale data fusion method
CN110909615A (en) * 2019-10-28 2020-03-24 西安交通大学 Target detection method based on multi-scale input mixed perception neural network
CN111767934B (en) * 2019-10-31 2023-11-03 杭州海康威视数字技术股份有限公司 Image recognition method and device and electronic equipment
CN111767934A (en) * 2019-10-31 2020-10-13 杭州海康威视数字技术股份有限公司 Image identification method and device and electronic equipment
CN110991359A (en) * 2019-12-06 2020-04-10 重庆市地理信息和遥感应用中心(重庆市测绘产品质量检验测试中心) Satellite image target detection method based on multi-scale depth convolution neural network
CN111144248A (en) * 2019-12-16 2020-05-12 上海交通大学 People counting method, system and medium based on ST-FHCD network model
CN111144248B (en) * 2019-12-16 2024-02-27 上海交通大学 People counting method, system and medium based on ST-FHCD network model
CN111582012A (en) * 2019-12-24 2020-08-25 珠海大横琴科技发展有限公司 Method and device for detecting small target ship
CN111127458A (en) * 2019-12-27 2020-05-08 深圳力维智联技术有限公司 Target detection method and device based on image pyramid and storage medium
CN111127458B (en) * 2019-12-27 2023-06-09 深圳力维智联技术有限公司 Image pyramid-based target detection method, device and storage medium
CN111191621B (en) * 2020-01-03 2024-06-28 北京同方软件有限公司 Rapid and accurate identification method for multi-scale target in large-focal-length monitoring scene
CN111191621A (en) * 2020-01-03 2020-05-22 北京同方软件有限公司 Rapid and accurate identification method for multi-scale target under large-focus monitoring scene
CN111047655B (en) * 2020-01-10 2024-05-14 北京盛开互动科技有限公司 High-definition camera cloth defect detection method based on convolutional neural network
CN111047655A (en) * 2020-01-10 2020-04-21 北京盛开互动科技有限公司 High-definition camera cloth defect detection method based on convolutional neural network
CN111260615A (en) * 2020-01-13 2020-06-09 重庆交通大学 Laser and machine vision fusion-based method for detecting apparent diseases of unmanned aerial vehicle bridge
CN111986255B (en) * 2020-09-07 2024-04-09 凌云光技术股份有限公司 Multi-scale anchor initializing method and device of image detection model
CN111986255A (en) * 2020-09-07 2020-11-24 北京凌云光技术集团有限责任公司 Multi-scale anchor initialization method and device of image detection model
CN112418208B (en) * 2020-12-11 2022-09-16 华中科技大学 Tiny-YOLO v 3-based weld film character recognition method
CN112418208A (en) * 2020-12-11 2021-02-26 华中科技大学 Tiny-YOLO v 3-based weld film character recognition method
CN112580721A (en) * 2020-12-19 2021-03-30 北京联合大学 Target key point detection method based on multi-resolution feature fusion
CN112580721B (en) * 2020-12-19 2023-10-24 北京联合大学 Target key point detection method based on multi-resolution feature fusion
CN112560732A (en) * 2020-12-22 2021-03-26 电子科技大学中山学院 Multi-scale feature extraction network and feature extraction method thereof
CN112560732B (en) * 2020-12-22 2023-07-04 电子科技大学中山学院 Feature extraction method of multi-scale feature extraction network
CN112528976A (en) * 2021-02-09 2021-03-19 北京世纪好未来教育科技有限公司 Text detection model generation method and text detection method
CN113222916A (en) * 2021-04-28 2021-08-06 北京百度网讯科技有限公司 Method, apparatus, device and medium for detecting image using target detection model
CN113222916B (en) * 2021-04-28 2023-08-18 北京百度网讯科技有限公司 Method, apparatus, device and medium for detecting image using object detection model
CN112906677A (en) * 2021-05-06 2021-06-04 南京信息工程大学 Pedestrian target detection and re-identification method based on improved SSD (solid State disk) network
CN113221752A (en) * 2021-05-13 2021-08-06 北京惠朗时代科技有限公司 Multi-template matching-based multi-scale character accurate identification method
CN113313213B (en) * 2021-07-28 2021-11-19 中国航空油料集团有限公司 Data set processing method for accelerating training of target detection algorithm
CN113313213A (en) * 2021-07-28 2021-08-27 中国航空油料集团有限公司 Data set processing method for accelerating training of target detection algorithm
CN113688709B (en) * 2021-08-17 2023-12-05 广东海洋大学 Intelligent detection method, system, terminal and medium for wearing safety helmet
CN113688709A (en) * 2021-08-17 2021-11-23 长江大学 Intelligent detection method, system, terminal and medium for wearing safety helmet
CN114049488A (en) * 2022-01-07 2022-02-15 济南和普威视光电技术有限公司 Multi-dimensional information fusion remote weak and small target detection method and terminal
CN114049627B (en) * 2022-01-11 2022-04-08 浙江华是科技股份有限公司 Ship board detection method, device and system capable of intelligently selecting scaling dimension
CN114049627A (en) * 2022-01-11 2022-02-15 浙江华是科技股份有限公司 Ship board detection method, device and system capable of intelligently selecting scaling dimension

Similar Documents

Publication Publication Date Title
CN108460403A (en) The object detection method and system of multi-scale feature fusion in a kind of image
Luo et al. Traffic sign recognition using a multi-task convolutional neural network
CN105574550B (en) A kind of vehicle identification method and device
CN103514456B (en) Image classification method and device based on compressed sensing multi-core learning
CN108304873A (en) Object detection method based on high-resolution optical satellite remote-sensing image and its system
CN110310264A (en) A kind of large scale object detection method, device based on DCNN
CN109241913A (en) In conjunction with the ship detection method and system of conspicuousness detection and deep learning
CN108416250A (en) Demographic method and device
CN109543632A (en) A kind of deep layer network pedestrian detection method based on the guidance of shallow-layer Fusion Features
CN114220035A (en) Rapid pest detection method based on improved YOLO V4
CN108960404B (en) Image-based crowd counting method and device
CN104166841A (en) Rapid detection identification method for specified pedestrian or vehicle in video monitoring network
CN111783589B (en) Complex scene crowd counting method based on scene classification and multi-scale feature fusion
CN108764096B (en) Pedestrian re-identification system and method
CN110263712A (en) A kind of coarse-fine pedestrian detection method based on region candidate
CN106778687A (en) Method for viewing points detecting based on local evaluation and global optimization
CN110991274B (en) Pedestrian tumbling detection method based on Gaussian mixture model and neural network
CN114758362B (en) Clothing changing pedestrian re-identification method based on semantic perception attention and visual shielding
CN113762009B (en) Crowd counting method based on multi-scale feature fusion and double-attention mechanism
CN112270331A (en) Improved billboard detection method based on YOLOV5
CN108681696A (en) Merge the infrared target robustness recognition methods of multiple features dimensionality reduction and transfer learning
CN113158943A (en) Cross-domain infrared target detection method
CN110008861A (en) A kind of recognition methods again of the pedestrian based on global and local feature learning
CN109086659A (en) A kind of Human bodys' response method and apparatus based on multimode road Fusion Features
Zang et al. Traffic lane detection using fully convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180828

RJ01 Rejection of invention patent application after publication