CN101630369B - Pedestrian detection method based on wavelet fractal characteristic - Google Patents

Pedestrian detection method based on wavelet fractal characteristic Download PDF

Info

Publication number
CN101630369B
CN101630369B CN2009101830757A CN200910183075A CN101630369B CN 101630369 B CN101630369 B CN 101630369B CN 2009101830757 A CN2009101830757 A CN 2009101830757A CN 200910183075 A CN200910183075 A CN 200910183075A CN 101630369 B CN101630369 B CN 101630369B
Authority
CN
China
Prior art keywords
fractal
epsiv
wavelet
subgraph
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2009101830757A
Other languages
Chinese (zh)
Other versions
CN101630369A (en
Inventor
李舜酩
沈峘
毛建国
柏芳超
缪小冬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Aeronautics and Astronautics
Original Assignee
Nanjing University of Aeronautics and Astronautics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Aeronautics and Astronautics filed Critical Nanjing University of Aeronautics and Astronautics
Priority to CN2009101830757A priority Critical patent/CN101630369B/en
Publication of CN101630369A publication Critical patent/CN101630369A/en
Application granted granted Critical
Publication of CN101630369B publication Critical patent/CN101630369B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

The invention discloses a pedestrian detection method based on wavelet fractal characteristic, which relates to the technology of intelligent vehicles in an intelligent traffic system. The pedestrian detection method comprises the following steps: (1) reading a training sample set and standardizing all sample images to the pixel size of 48*96; (2) carrying three times of two-dimensional wavelet transformation on the sample images, and getting six wavelet sub-graphs in the second layer and the third layer; (3) getting absolute values of the wavelet sub-graphs obtained in the step (2), and carrying out stretching and scaling so that the numeric area is uniformly mapped to 0 to 255; (4) respectively getting fractal dimension vectors of the wavelet sub-graphs treated in the step (3); (5) standardizing the fractal vectors corresponding to each sub-graph obtained in the step (4); (6) combining the fractal vectors standardized in the step (5), and obtaining the wavelet fractal characteristic vector of 6*(n-1) dimension; and (7) using the wavelet fractal characteristic obtained in the step (6) for training a soft-supporter vector machine, and using the obtained discriminant function for realizing pedestrian detection. The invention has the characteristics of concise characteristic expression form, strong characteristic distinguishing capability and higher pedestrian detection efficiency.

Description

A kind of pedestrian detection method based on wavelet fractal characteristic
Affiliated technical field
The present invention relates to the The intelligent vehicles technology in the intelligent transportation system, especially a kind of pedestrian detection method based on wavelet fractal characteristic.
Background technology
Target detection is the important content of machine vision research, and at video monitoring, the mobile robot is widely used in the fields such as intelligent vehicle.Because pedestrian's appearance, the combination of posture and intensity of illumination is ever-changing, cause pedestrian detection become sensation target detect in one of complicated problems the most.
Existing pedestrian detection method can be summed up as two main treatment steps, i.e. feature extraction and target detection on process.The main task of target detection step is the target signature that characteristic extraction step is constructed, and is used for the sorter training, thereby new test sample book is made differentiation.Therefore, feature extracting method is to the effect important influence of pedestrian detection.Feature extraction can be divided into two types of sparse expression and dense expression again according to the mode of feature representation.The sparse expression method extracts the point of interest in the target area through certain key feature extraction algorithm, forms the sparse features expression pattern of target.For example yardstick invariant features conversion (Scale-invariant feature transform, SIFT) and principal component analysis (PCA) (Principal Component Analysis, PCA).The target signature of using these feature representation modes to extract has the advantage that is concise in expression, but when describing complex patterns, the information that its proper vector comprised is unfavorable for the sorter training, thereby not good at the effect in pedestrian detection field.And dense expression is that point by point scanning is carried out in the given area, extracts the dense characteristic expression pattern of target area.Typically have Lis Hartel seek peace gradient orientation histogram (Histograms of Oriented Gradient, HOG) etc.What the target signature that these method for distilling extract comprised contains much information, and help the sorter training, but the proper vector dimension is too high.With the HOG method is example, and the image of 64 * 128 sizes will produce the proper vector of at least 4096 dimensions, if adopt Lis Hartel to levy obtaining more intrinsic dimensionality, causes the characteristic amount of redundancy big, and counting yield is low, generally also needs the cooperation of feature selection approach.
Summary of the invention
The object of the invention is to provide a kind of feature representation succinct, and information redundancy is little, the pedestrian detection method based on wavelet fractal characteristic that detection efficiency is high.
The technical scheme that the present invention adopted is:
(1), read in training sample set, sample set comprises complete pedestrian's image of sorter training and does not comprise pedestrian's natural scene image that all sample images are normalized to 48 * 96 pixel sizes, and wherein training sample image is a gray level image;
(2), (x y) carries out two-dimensional wavelet transformation 3 times, obtains 12 sub-graphs after 3 layers of wavelet decomposition, and the yardstick of each layer correspondence is respectively 2 * 2,4 * 4,8 * 8, gets its mesoscale and be 4 * 4,8 * 86 small echo subgraphs in two-layer, promptly to sample image f
{ f 2 H , f 2 V , f 2 D , f 3 H , f 3 V , f 3 D }
Wherein, f (x y) is the gradation of image value, (and x y) is pixel coordinate, and the subscript A of f, V, D, H are each subgraph that the 2-d wavelet branch solution subspace comprises, the corresponding LL subgraph of difference, LH subgraph, HL subgraph and HH subgraph, and the number of plies is decomposed in the subscript representative of f;
(3), step (2) is obtained carrying out suitable stretching and scale transformation again after each small echo subgraph takes absolute value respectively, make its span unification be mapped to 0~255;
(4), the small echo subgraph after step (3) handled asks for the fractal dimension vector respectively, makes yardstick ε=1,2,3 ..., n, n get the positive integer in 2~100, try to achieve D successively according to following formula 1, D 2..., D N-1, promptly fractal dimension is vectorial,
D k = 2 - ln ( V ϵ k - V ϵ k - 1 ) - ln ( V ϵ k + 1 - V ϵ k ) ln ϵ k - ln ϵ k + 1
V ϵ = Σ i , j ( u ϵ ( i , j ) - b ϵ ( i , j ) )
u ϵ ( i , j ) = max { u ϵ - 1 ( i , j ) , max | ( m , n ) - ( i , j ) ≤ 1 | u ϵ - 1 ( m , n ) }
b ϵ ( i , j ) = max { b ϵ - 1 ( i , j ) , max | ( m , n ) - ( i , j ) ≤ 1 | b ϵ - 1 ( m , n ) }
Wherein: D represents fractal dimension; K representes the subscript to dependent variable; ε kRepresent a value in the yardstick span; u εThe upper surface area of the fractal target when the expression yardstick is ε; b εThe lower surface area of the fractal target when the expression yardstick is ε; V εWhen the expression yardstick is ε, the volume that the upper and lower surface area of fractal target surrounds;
(5), the pairing fractal vector of each subgraph that step (4) is obtained is done standardization processing respectively;
(6), the fractal vector after the standardization that all subgraphs after step (5) standardization are corresponding makes up, and obtains one 6 * (n-1) dimensional vector, i.e. the wavelet fractal characteristic vector;
(7), wavelet fractal characteristic that step (6) is extracted is used to train soft support vector sorter, realizes the detection to unknown sample with the discriminant function after the training.
The present invention combines small echo and fractal theory; A kind of new pedestrian's target's feature-extraction method is proposed, be called wavelet fractal characteristic (Wavelet Fractal Signature, WFS); Be used to train the svm classifier device; This characteristic has succinct feature representation form, more excellent characteristic resolution characteristic, thus can improve the pedestrian detection effect.
The WFS characteristic comprises dense sampling and abstract two links:
(1) input picture is decomposed the small echo subpattern of nonautocorrelation under the different resolution at dense sampling element; Select wherein to help most portraying 6 sub-modes of pedestrian's characteristic; Comprised a large amount of luminance difference information; Directional information, thus the pedestrian detection problem is converted into the pattern classification problem of 6 dimension spaces, improved classifying quality.
(2) in abstract link; Through calculating the FRACTAL DIMENSION vector; Extract the characteristic difference of each subpattern under different scale; Can under considerably less intrinsic dimensionality (the maximum occurrences n of yardstick ε is, the proper vector of generation only has 48 dimensions), take out the characteristic of target under different resolution at 9 o'clock, have the advantage that is concise in expression.
(3) according to general knowledge, 2-d wavelet involved in the present invention decomposes, and link counting yielies such as FRACTAL DIMENSION calculating, L1 standardization are all very high, and the computing cost of SVM depends primarily on intrinsic dimensionality and support vector number.On the one hand, the WFS characteristic that provides only has 48 dimensions, is merely 1% of HOG characteristic; On the other hand, adopt the WFS characteristic when considerably less training sample, just can obtain higher detection efficient, significantly reduced the number of support vector, thereby reduced computing cost.
Therefore, the present invention had both had the big advantage of dense sample information amount, possessed the advantage of sparse sampling terseness again.When reducing counting yield, improved verification and measurement ratio.Another beneficial effect of the present invention is to facts have proved that when training sample was considerably less, the present invention program still can have very high verification and measurement ratio, can be applicable to the occasion that training sample is difficult to collect.
Description of drawings
Fig. 1 is a method flow diagram of the present invention.
Embodiment
Below in conjunction with accompanying drawing working of an invention is made and to be further specified.Fig. 1 is method flow diagram of the present invention, and is as shown in Figure 1, and this method comprises following 7 steps.
Step 101: read in training sample set, sample set comprises complete pedestrian's image of sorter training and does not comprise pedestrian's natural scene image that all sample images are normalized to 48 * 96 pixel sizes, and wherein training sample image is a gray level image;
Step 102, (x y) carries out two-dimensional wavelet transformation 3 times, obtains 3 layers of subgraph after the wavelet decomposition, and the yardstick of each layer correspondence is respectively 2 * 2,4 * 4,8 * 8 to sample image f.Remove yardstick and be all subgraphs of 2 * 2, and the f in other two kinds of yardsticks ASubgraph, remaining subgraph does
{ f 2 H , f 2 V , f 2 D , f 3 H , f 3 V , f 3 D } - - - ( 1 )
Wherein, f (x y) is the gradation of image value, (and x y) is pixel coordinate, and the subscript A of f, V, D, H are each subgraph that the 2-d wavelet branch solution subspace comprises, the corresponding LL subgraph of difference, LH subgraph, HL subgraph and HH subgraph, and the number of plies is decomposed in the subscript representative of f;
Step 103: carry out suitable stretching and scale transformation again after each the small echo subgraph in the formula (1) taken absolute value, make its span unification be mapped to 0~255;
Step 104: the small echo subgraph to after step 103 processing is asked for the fractal dimension vector respectively, is specially, and makes yardstick ε=1,2,3 ..., n tries to achieve D successively according to formula (2) 1, D 2..., D N-1, promptly fractal dimension is vectorial, and n gets the integer between 2~100.The value of n is directly proportional with the intrinsic dimensionality of generation, but its value is big more, and the characteristic information amount that comprises is big more, otherwise then more little.Obtain when n gets 9 to optimum through a large amount of experiments;
D k = 2 - ln ( V ϵ k - V ϵ k - 1 ) - ln ( V ϵ k + 1 - V ϵ k ) ln ϵ k - ln ϵ k + 1
V ϵ = Σ i , j ( u ϵ ( i , j ) - b ϵ ( i , j ) ) ( 2 )
u ϵ ( i , j ) = max { u ϵ - 1 ( i , j ) , max | ( m , n ) - ( i , j ) ≤ 1 | u ϵ - 1 ( m , n ) }
b ϵ ( i , j ) = max { b ϵ - 1 ( i , j ) , max | ( m , n ) - ( i , j ) ≤ 1 | b ϵ - 1 ( m , n ) }
Step 105: the pairing fractal vector of each subgraph to step 104 obtains is done standardization respectively, promptly uses formula (3) the corresponding FRACTAL DIMENSION vector of each subgraph that standardizes,
β i = α i | | α i | | 1 + λ - - - ( 3 )
Wherein, || || 1Be 1 norm, α iSubgraph FRACTAL DIMENSION vector before the expression standardization, β iStandardized vector after the expression standardization, λ is a very little positive number, for assurance formula (3) effective;
This step can also be used alternate manners such as the L2 norm fractal vector that standardizes, but the L1 norm has the highest counting yield.
Step 106: the fractal vector after the standardization that all subgraphs after step 105 standardization are corresponding makes up, and obtains one 6 * (n-1) dimensional vector, promptly final wavelet fractal characteristic vector;
Step 107: the wavelet fractal characteristic that step 106 is extracted is used to train soft support vector sorter.Specific algorithm is following:
(A) known training sample set
T={(x 1,y 1),…,(x l,y l)}∈(R n×{-1,1})
(B) select kernel function and punishment parameters C, construct and find the solution optimization problem
min α L ( α ) = Σ i = 1 l α i - 1 2 Σ i = 1 l Σ j = 1 l α i α j y i y j K ( x i , x j ) ( 4 )
s.t.?0≤α i?≤C,
Figure GSB00000648891900053
i=1,2.…,l.
Wherein,
Figure GSB00000648891900054
is kernel function.
(C) solve the optimum solution of step (B) Chinese style (4), note is made α *, obtain optimal classification lineoid (w with formula (5) *, b *),
Figure GSB00000648891900056
b * = y j - Σ i ∈ SV y i α i * K ( x i , x j )
(D) thus obtain the discriminant function f (x) that is used to classify,
f ( x ) = sgn ( ( w * · x ) + b * )
= sgn ( Σ i = 1 l α i * K ( x i , x ) + y i - Σ i = 1 l y i α i * K ( x i , x ) ) - - - ( 6 )
Wherein, the penalty factor C of SVM is the constant greater than zero, and its optimal value need be confirmed through test, get 2000 among the present invention.Kernel function can use polynomial kernel, two layers of neural network to examine and gaussian kernel, the first-selected gaussian kernel of the present invention, and σ gets 1.5, and other nuclear takes second place.
Utilize a window in by altimetric image, to slide with the same size of training sample image; Through obtaining discriminant function f (x) target in the window is classified; Judge whether this pattern is the pedestrian; Thereby the realization pedestrian detection, the WFS characteristic of x representative image wherein, pedestrian or non-pedestrian are represented in the output of discriminant function f (x).

Claims (3)

1. the pedestrian detection method based on wavelet fractal characteristic is characterized in that, this method may further comprise the steps:
(1), read in training sample set, sample set comprises complete pedestrian's image of sorter training and does not comprise pedestrian's natural scene image that all sample images are normalized to 48 * 96 pixel sizes, and wherein training sample image is a gray level image;
(2), (x y) carries out two-dimensional wavelet transformation 3 times, obtains 12 sub-graphs after 3 layers of wavelet decomposition, and the yardstick of each layer correspondence is respectively 2 * 2,4 * 4,8 * 8, gets its mesoscale and be 4 * 4,8 * 86 small echo subgraphs in two-layer, promptly to sample image f
{ f 2 H , f 2 V , f 2 D , f 3 H , f 3 V , f 3 D }
Wherein, f (x y) is the gradation of image value, (and x y) is pixel coordinate, and the subscript A of f, V, D, H are each subgraph that the 2-d wavelet branch solution subspace comprises, the corresponding LL subgraph of difference, LH subgraph, HL subgraph and HH subgraph, and the number of plies is decomposed in the subscript representative of f;
(3), step (2) is obtained carrying out suitable stretching and scale transformation again after each small echo subgraph takes absolute value respectively, make its span unification be mapped to 0~255;
(4), the small echo subgraph after step (3) handled asks for the fractal dimension vector respectively, makes yardstick ε=1,2,3 ..., n, n get the positive integer in 2~100, try to achieve D successively according to following formula 1, D 2..., D N-1, promptly fractal dimension is vectorial,
D k = 2 - ln ( V ϵ k - V ϵ k - 1 ) - ln ( V ϵ k + 1 - V ϵ k ) ln ϵ k - ln ϵ k + 1
V ϵ = Σ i , j ( u ϵ ( i , j ) - b ϵ ( i , j ) )
u ϵ ( i , j ) = max { u ϵ - 1 ( i , j ) , max | ( m , n ) - ( i , j ) ≤ 1 | u ϵ - 1 ( m , n ) }
b ϵ ( i , j ) = max { b ϵ - 1 ( i , j ) , max | ( m , n ) - ( i , j ) ≤ 1 | b ϵ - 1 ( m , n ) }
Wherein: D represents fractal dimension; K representes the subscript to dependent variable; ε kRepresent a value in the yardstick span; u εThe upper surface area of the fractal target when the expression yardstick is ε; b εThe lower surface area of the fractal target when the expression yardstick is ε; V εWhen the expression yardstick is ε, the volume that the upper and lower surface area of fractal target surrounds;
(5), the pairing fractal vector of each subgraph that step (4) is obtained is done standardization processing respectively;
(6), the fractal vector after the standardization that all subgraphs after step (5) standardization are corresponding makes up, and obtains one 6 * (n-1) dimensional vector, i.e. the wavelet fractal characteristic vector;
(7), the wavelet fractal characteristic that step (6) is extracted is used to train soft support vector sorter; The discriminant function that acquisition is used to classify; Utilize a window in by altimetric image, to slide again with the same size of training sample image; Discriminant function with obtaining after the training is classified to the target in the moving window, judges whether this pattern is the pedestrian, thereby realizes pedestrian detection.
2. a kind of pedestrian detection method based on wavelet fractal characteristic according to claim 1 is characterized in that: the maximum occurrences n of said step (4) mesoscale ε is 9.
3. a kind of pedestrian detection method based on wavelet fractal characteristic according to claim 1 is characterized in that: the employed normalized method of said step (5) is: with the corresponding FRACTAL DIMENSION vector of each subgraph of L1 norm standardization, promptly
β i = α i | | α i | | 1 + λ
Wherein, || || 1Be 1 norm, α iSubgraph FRACTAL DIMENSION vector before the expression standardization, β iStandardized vector after the expression standardization, λ is a very little positive number, and is effective in order to guarantee division.
CN2009101830757A 2009-07-30 2009-07-30 Pedestrian detection method based on wavelet fractal characteristic Expired - Fee Related CN101630369B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009101830757A CN101630369B (en) 2009-07-30 2009-07-30 Pedestrian detection method based on wavelet fractal characteristic

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009101830757A CN101630369B (en) 2009-07-30 2009-07-30 Pedestrian detection method based on wavelet fractal characteristic

Publications (2)

Publication Number Publication Date
CN101630369A CN101630369A (en) 2010-01-20
CN101630369B true CN101630369B (en) 2012-03-07

Family

ID=41575472

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009101830757A Expired - Fee Related CN101630369B (en) 2009-07-30 2009-07-30 Pedestrian detection method based on wavelet fractal characteristic

Country Status (1)

Country Link
CN (1) CN101630369B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103226028B (en) * 2013-05-02 2015-07-01 电子科技大学 Method for identifying and detecting disturbance signals of phase-sensitive optical time domain reflectometer
CN104200230B (en) * 2014-09-11 2018-04-27 哈尔滨工业大学 A kind of soil soil property recognition methods based on wavelet transformation and svm classifier
CN105654068B (en) * 2016-02-02 2019-03-26 中国科学院上海微***与信息技术研究所 A kind of target detection background estimating method based on fractal theory
CN110348459B (en) * 2019-06-28 2021-11-16 西安理工大学 Sonar image fractal feature extraction method based on multi-scale rapid carpet covering method
JP7267222B2 (en) * 2020-02-26 2023-05-01 三菱ロジスネクスト株式会社 Processing device, processing method, notification system and program

Also Published As

Publication number Publication date
CN101630369A (en) 2010-01-20

Similar Documents

Publication Publication Date Title
CN109840521B (en) Integrated license plate recognition method based on deep learning
CN102324047B (en) Hyper-spectral image ground object recognition method based on sparse kernel representation (SKR)
CN103971123B (en) Hyperspectral image classification method based on linear regression Fisher discrimination dictionary learning (LRFDDL)
CN109165623B (en) Rice disease spot detection method and system based on deep learning
CN105069811B (en) A kind of Multitemporal Remote Sensing Images change detecting method
Sirmacek et al. Urban area detection using local feature points and spatial voting
CN107992891B (en) Multispectral remote sensing image change detection method based on spectral vector analysis
CN109766858A (en) Three-dimensional convolution neural network hyperspectral image classification method combined with bilateral filtering
CN104657717B (en) A kind of pedestrian detection method based on layering nuclear sparse expression
CN111639587B (en) Hyperspectral image classification method based on multi-scale spectrum space convolution neural network
CN103514456A (en) Image classification method and device based on compressed sensing multi-core learning
CN108038501B (en) Hyperspectral image classification method based on multi-mode compression bilinear pooling
CN106295124A (en) Utilize the method that multiple image detecting technique comprehensively analyzes gene polyadenylation signal figure likelihood probability amount
CN103699874B (en) Crowd abnormal behavior identification method based on SURF (Speed-Up Robust Feature) stream and LLE (Locally Linear Embedding) sparse representation
CN103413151A (en) Hyperspectral image classification method based on image regular low-rank expression dimensionality reduction
CN104268556A (en) Hyperspectral image classification method based on nuclear low-rank representing graph and spatial constraint
CN107895139A (en) A kind of SAR image target recognition method based on multi-feature fusion
Jia et al. Flexible Gabor-based superpixel-level unsupervised LDA for hyperspectral image classification
CN101630369B (en) Pedestrian detection method based on wavelet fractal characteristic
CN105893971A (en) Traffic signal lamp recognition method based on Gabor and sparse representation
CN109492700A (en) A kind of Target under Complicated Background recognition methods based on multidimensional information fusion
CN111915583A (en) Vehicle and pedestrian detection method based on vehicle-mounted thermal infrared imager in complex scene
CN109241870B (en) Coal mine underground personnel identity identification method based on gait identification
CN111639697B (en) Hyperspectral image classification method based on non-repeated sampling and prototype network
CN115861619A (en) Airborne LiDAR (light detection and ranging) urban point cloud semantic segmentation method and system of recursive residual double-attention kernel point convolution network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120307

Termination date: 20140730

EXPY Termination of patent right or utility model