CN112215103B

CN112215103B - Vehicle pedestrian multi-category detection method and device based on improved ACF

Info

Publication number: CN112215103B
Application number: CN202011034733.9A
Authority: CN
Inventors: 石英; 黄紫旗; 谢长君; 张晖; 华捷
Original assignee: Wuhan University of Technology WUT
Current assignee: Wuhan University of Technology WUT
Priority date: 2020-09-27
Filing date: 2020-09-27
Publication date: 2024-02-23
Anticipated expiration: 2040-09-27
Also published as: CN112215103A

Abstract

The invention relates to a vehicle pedestrian multi-category detection method and device based on improved ACF, wherein the method comprises the following steps: acquiring a vehicle training sample and a pedestrian training sample, and preprocessing the vehicle training sample and the pedestrian training sample; extracting the preprocessed multi-view aggregation channel characteristics of the vehicle training sample and the context pixel aggregation channel characteristics of the pedestrian training sample by using a vehicle pedestrian detection frame, establishing a vehicle detector according to the multi-view aggregation channel characteristics, and establishing a pedestrian detector according to the context pixel aggregation channel characteristics; sharing the aggregation channel characteristics of the image to be detected to the vehicle detector and the pedestrian detector to obtain a vehicle detection result and a pedestrian detection result; and adopting a false detection rejection strategy based on road constraint to perform false detection rejection on the vehicle detection result and the pedestrian detection result. The invention solves the problems of single detection target, low detection precision and easy false detection existing in the prior art.

Description

Vehicle pedestrian multi-category detection method and device based on improved ACF

Technical Field

The invention relates to the technical field of unmanned visual analysis, in particular to a vehicle pedestrian multi-category detection method, device and storage medium based on improved ACF.

Background

With technological progress and improvement of living standard of people, the automobile conservation amount is increased dramatically, and various factors cause frequent traffic accidents. Unmanned efforts are directed to improving this problem, where vehicle and pedestrian detection technology is of paramount importance. The accuracy and instantaneity of the vehicle and pedestrian detection algorithm directly influence the safety performance of the unmanned vehicle.

The current mainstream vehicle and pedestrian detection algorithm has a deep learning detection algorithm and a statistical feature detection algorithm. The CNN feature training period in the deep learning is long, and the calculated amount is large. According to the difference of detection strategies, the statistical feature detection method can be subdivided into a DPM method and a decision tree method, and the method is rarely applied to an unmanned system due to the characteristics of high complexity and low running speed. In the decision tree method, the design of the feature descriptors is the key of the detection algorithm and is the most studied content at present, and the feature descriptors mainly comprise gradient, texture, color and fusion features thereof. The Haar features are mainly used for extracting texture information of a target, are widely applied to the field of vehicle detection, and HOG features and the like are used for capturing information such as outlines, shapes and the like of the target, are representative of gradient features and are usually used for detecting pedestrians. In addition, gray scale, RGB, LUV, and other color features can also be used to characterize the target. However, these features are generally only useful for detecting specific targets, and expression capabilities are limited in complex road scenarios. Aiming at the problems, firstly, integral channel characteristics (Integral Channel Features, ICF) are provided, gradient, color, texture and other characteristics are fused, then, in order to improve detection performance, aggregate channel characteristics (Aggregated Channel Features, ACF) are provided, and compared with the ICF, the ACF algorithm is different in that the characteristics are extracted by utilizing a pixel lookup table mode, and the detection performance is greatly improved compared with the ICF. Then, the LDCF introduces filtering operation on the basis of the ACF to strengthen the expression capacity of the LDCF, but also brings great calculated amount, and greatly reduces the instantaneity although the algorithm detection precision is further improved. Although the LDCF has a larger improvement than the ACF detection precision, the real-time performance is reduced, and the LDCF is difficult to be suitable for detecting the pedestrians of the lightweight vehicles.

Therefore, when the conventional vehicle and pedestrian detection method is applied to a road scene, the problems of single detection target, low detection precision and easy occurrence of false detection are existed.

Disclosure of Invention

In view of the foregoing, there is a need for a method, apparatus and storage medium for detecting pedestrian and vehicle based on an improved ACF, which are used for solving the problems of single detection target, low detection precision and easy occurrence of false detection when detecting vehicles and pedestrians at present.

In a first aspect, the present invention provides a vehicle pedestrian multi-category detection method based on an improved ACF, including the steps of:

acquiring a vehicle training sample and a pedestrian training sample, and preprocessing the vehicle training sample and the pedestrian training sample;

extracting multi-view aggregation channel characteristics of the preprocessed vehicle training samples by using a vehicle pedestrian detection frame, and establishing a vehicle detector according to the multi-view aggregation channel characteristics;

extracting the context pixel aggregation channel characteristics of the preprocessed pedestrian training sample by using a vehicle pedestrian detection frame, and establishing a pedestrian detector according to the context pixel aggregation channel characteristics;

acquiring a preprocessed image to be detected, and sharing the aggregation channel characteristics of the image to be detected to the vehicle detector and the pedestrian detector to obtain a vehicle detection result and a pedestrian detection result;

and adopting a false detection rejection strategy based on road constraint to perform false detection rejection on the vehicle detection result and the pedestrian detection result.

Preferably, in the method for detecting pedestrian in a vehicle based on the improved ACF, the method for preprocessing the vehicle training sample and the pedestrian training sample specifically includes:

scaling the vehicle training sample and the pedestrian training sample in the horizontal and vertical directions, and maintaining the normalization of the center positions of the targets in the vehicle training sample and the pedestrian training sample.

Preferably, in the method for detecting multiple categories of pedestrians in a vehicle based on an improved ACF, the step of extracting the characteristics of a multi-view aggregation channel of the preprocessed training samples of the vehicle by using a vehicle pedestrian detection frame, and establishing a vehicle detector according to the characteristics of the multi-view aggregation channel specifically includes:

and calculating a similar incidence matrix among all sample points in the preprocessed vehicle training sample by adopting a spectral clustering algorithm, obtaining feature vectors with multiple dimensions through matrix spectral decomposition, clustering the feature vectors with multiple dimensions by adopting a K-means algorithm to extract aggregation channel features of multiple visual angles, and training a vehicle detector with corresponding visual angles by utilizing the aggregation channel features of all visual angles.

Preferably, in the method for detecting pedestrian in a vehicle based on an improved ACF, the step of extracting the context pixel aggregation channel characteristics of the pedestrian training sample after the preprocessing by using a vehicle pedestrian detection frame, and establishing a pedestrian detector according to the context pixel aggregation channel characteristics specifically includes:

10 characteristic channels of the pre-processed pedestrian training sample are extracted, and the ten channels are processed by using 2×2 averaging pooling to obtain an aggregate channel F with n=2 _2×2 After the feature, carrying out 2×2 average pooling treatment on the aggregation channel feature F2×2 twice to obtain a region context pixel aggregation channel F _4×4 Features and F _8×8 Characterised in that said F _4×4 Features and F _8×8 Feature sampling to F _2×2 And the resolution ratio is combined to form 30 deformation-resistant context pixel aggregation channel characteristics with the same size, and a pedestrian detector is built according to the context pixel aggregation channel characteristics.

Preferably, in the method for detecting a pedestrian in a vehicle based on an improved ACF, the step of obtaining the preprocessed image to be detected, and sharing the aggregate channel features of the image to be detected to the vehicle detector and the pedestrian detector to obtain a vehicle detection result and a pedestrian detection result further includes:

and carrying out confidence score calibration on the vehicle detection result by adopting a parameterized Logistic regression calibration method.

Preferably, in the method for detecting pedestrian in vehicle based on improved ACF, the step of adopting a false detection rejection policy based on road constraint to perform false detection rejection on the vehicle detection result and the pedestrian detection result includes:

normalizing the height of the vehicle calibration frame of the vehicle training sample and the position coordinates of the lower edge of the vehicle calibration frame, and the height of the pedestrian calibration frame of the pedestrian training sample and the position coordinates of the lower edge of the pedestrian calibration frame;

training a first regression model between the height of the vehicle calibration frame and the position coordinate of the lower edge of the vehicle calibration frame and a second regression model between the height of the pedestrian calibration frame and the position coordinate of the lower edge of the pedestrian calibration frame by adopting a support vector machine;

calculating the height of a predicted vehicle calibration frame corresponding to the position of the lower edge of the vehicle detection result by adopting a first regression model, and calculating the height of a predicted pedestrian calibration frame corresponding to the position of the lower edge of the pedestrian in the pedestrian detection result by adopting a second regression model;

calculating a first error value of the height of the predicted vehicle calibration frame and the height of an actual vehicle calibration frame in the vehicle detection result, and a second error value of the height of the predicted pedestrian calibration frame and the height of the actual pedestrian calibration frame in the pedestrian detection result;

when the first error value is larger than a first threshold value, judging that the vehicle detection result is misdetected, otherwise, accepting the vehicle detection result; and when the second error value is larger than a second threshold value, judging that the pedestrian detection result is misdetected, otherwise, receiving the pedestrian detection result.

Preferably, in the improved ACF-based vehicle pedestrian multi-category detection method, the first regression model is:

H ₁ ＝f(Y ₁ )，

wherein H is ₁ Representing the height of a vehicle calibration frame, Y ₁ Representing the position coordinates of the lower edge of the vehicle calibration frame;

the first error value calculating method comprises the following steps:

wherein E is ₁ Represents a first error value, h ₁ Indicating the height of the actual vehicle calibration frame, h' ₁ Representing the height of the predicted vehicle calibration frame, abs represents the absolute value.

Preferably, in the improved ACF-based vehicle pedestrian multi-category detection method, the second regression model is:

H ₂ ＝f(Y ₂ )，

wherein H is ₂ Representing the height of the pedestrian calibration frame, Y ₂ Representing the position coordinates of the lower edge of the pedestrian calibration frame;

the calculation method of the second error value comprises the following steps:

wherein E is ₂ Represents a second error value, h ₂ Indicating the height of the actual vehicle calibration frame, h' ₂ Representing the height of the predicted vehicle calibration frame, abs represents the absolute value.

In a second aspect, the present invention also provides a vehicle pedestrian multi-category detection device based on the improved ACF, including: a processor and a memory;

the memory has stored thereon a computer readable program executable by the processor;

the processor, when executing the computer readable program, implements the steps in the improved ACF-based vehicle pedestrian multi-category detection method as described above.

In a third aspect, the present invention also provides a computer-readable storage medium storing one or more programs executable by one or more processors to implement the steps in the improved ACF-based vehicle pedestrian multi-category detection method as described above.

[ beneficial effects ]

In the vehicle pedestrian multi-category detection method, the device and the storage medium based on the improved ACF, the problem of single detection category of the Adaboost classifier in the ACF detection algorithm is solved, a multi-category detection frame is adopted, and the vehicle and the pedestrian are detected simultaneously; in order to solve the problem of low detection precision of vehicles and pedestrians, a multi-view vehicle detector and a context pixel pedestrian detector are adopted, so that the visual angle difference of a vehicle sample and deformation of the posture of the pedestrians during walking can be effectively captured, and the detection precision is improved; in order to overcome the false detection phenomenon in the vehicle pedestrian detection process, the false detection is effectively removed by utilizing the road prior information.

Drawings

FIG. 1 is a flowchart of a method for improved ACF-based pedestrian detection in a vehicle in accordance with a preferred embodiment of the present invention;

FIG. 2 is a flowchart illustrating the operation of a preferred embodiment of the vehicle pedestrian detection framework of the present invention;

FIG. 3 is a schematic diagram of a training process for a vehicle detector according to the present invention;

FIG. 4 is a schematic diagram of the training process of the pedestrian detector of the present invention;

FIG. 5 is a statistical graph of the relationship between the target height of the calibration frame and the coordinates of the lower edge thereof;

FIG. 6 is a schematic diagram of the operating environment of a preferred embodiment of the improved ACF-based pedestrian detection program.

Detailed Description

Preferred embodiments of the present invention will now be described in detail with reference to the accompanying drawings, which form a part hereof, and together with the description serve to explain the principles of the invention, and are not intended to limit the scope of the invention.

Referring to fig. 1, the method for detecting pedestrian in a vehicle based on an improved ACF according to the embodiment of the present invention includes the following steps:

s100, acquiring a vehicle training sample and a pedestrian training sample, and preprocessing the vehicle training sample and the pedestrian training sample.

In this embodiment, in order to realize detection of a vehicle and a pedestrian, training of a sample is required first, and in order to ensure detection accuracy, reinforcement and improvement of sample data are required first, and pretreatment is performed, specifically, in step S100, a method for preprocessing a sample specifically includes:

Specifically, the current ACF detection algorithm adopts a horizontal overturning method to strengthen the data set, and ignores the influence of the labeling error of the data set. Meanwhile, the image is standardized generally in the training process, the problem that the scaled image is easily misaligned in the target can be solved, and the detection precision is seriously affected. Therefore, the method and the device directly remove horizontal overturn, increase multi-scale data reinforcement, namely, scale by 1.1 times in the horizontal, vertical and other directions of the original training sample, and maintain the normalization of the center position of the target, and can reduce the sensitivity of the surrounding background of the labeling frame by utilizing the multi-scale reinforcement, thereby improving the classification robustness.

S200, extracting multi-view aggregation channel characteristics of the preprocessed vehicle training samples by using a vehicle pedestrian detection frame, and establishing a vehicle detector according to the multi-view aggregation channel characteristics;

and S300, extracting the context pixel aggregation channel characteristics of the preprocessed pedestrian training sample by using a vehicle pedestrian detection frame, and establishing a pedestrian detector according to the context pixel aggregation channel characteristics.

In this embodiment, since the conventional ACF algorithm has a single detection target, a multi-class pedestrian-vehicle detection frame is introduced, and feature extraction is performed on the vehicle sample and the pedestrian sample, so that ACF features detected by the vehicle detector and the pedestrian detector are shared, and then the ACF features are used for vehicle-pedestrian detection. Specifically, the invention provides a vehicle pedestrian detection framework based on feature sharing, so that a vehicle detector and a pedestrian detector share ACF features, the detection efficiency is improved, and multi-category detection is finished. The frame trains the pedestrian detector and the vehicle detector at the same time in the training stage, and enables the pedestrian detector and the vehicle detector to share ACF characteristics, so that the training efficiency can be remarkably improved. Meanwhile, the framework has universality, other detectors can be added, and other detection categories are easy to expand.

Further, the step S200 specifically includes:

Specifically, as shown in fig. 3, when training a Multi-view aggregation channel feature vehicle detector (Multi-view Aggregated Channel Features, mv-ACF), the invention firstly performs clustering processing on vehicle training samples, extracts features of each view sample, and trains a corresponding vehicle detector, and adopts an unsupervised learning K-means algorithm in consideration of higher dimension of the extracted ACF features. Aiming at the possible cluster degradation problem, the invention adopts a spectral clustering algorithm, calculates the similar incidence matrix among sample points, obtains the feature vector through matrix spectral decomposition, constructs a new feature space, and uses a K-means algorithm for clustering.

And verifying the effectiveness of a spectral clustering algorithm used by the method. And carrying out K-means clustering on the training samples through experiments, and clustering the training samples into 20 classes. The classified samples were trained and tested using an Mv-ACF detector and compared to the spectral clustering algorithm results as shown in table 1. Obviously, the AP precision of the spectral clustering algorithm under different levels is higher than that of the K-means clustering algorithm, and the spectral clustering algorithm achieves the expected effect.

In a preferred embodiment, the step S300 specifically includes:

Specifically, the posture of the pedestrian when walking can cause deformation, so that the difficulty of pedestrian detection is increased. To this end, the present invention proposes a context pixel aggregation channel feature (Context Pixel Aggregated Channel Features, CP-ACF). As shown in fig. 4, 10 feature channels are first extracted in the same manner as the ACF algorithm, and then the ten channels are processed by using 2×2 averaging pooling to obtain ACF features f2×2 with n=2, and then the 2×2 averaging pooling is performed twice to obtain region up and down Wen Xiangsu aggregate f4×4 and f8×8 features. And finally, up-sampling F4×4 and F8×8 to F2×2 resolution, and finally combining to form 30 deformation-resistant CP-ACF channel features with the same size so as to realize fusion of local and context features. When the soft cascade Adaboost is classified, the weak classifier can adaptively select local and contextual features of different regions in the CP-ACF channel, and the CP-ACF has stronger deformation resistance compared with the feature that the ACF can only select a fixed region. The AP accuracy of CP-ACF and ACF at different levels of the KITTI validation set is shown in the table below.

The invention respectively designs the vehicle detector and the pedestrian detector in the frame, fuses road information so as to improve detection precision, realizes feature sharing of the vehicle detector and the pedestrian detector, and improves algorithm instantaneity.

S400, acquiring preprocessed images to be detected, and sharing the aggregation channel characteristics of the images to be detected to the vehicle detector and the pedestrian detector to obtain a vehicle detection result and a pedestrian detection result.

Specifically, in the detection algorithm, the portion that generally takes the longest time is image feature extraction, and in the conventional vehicle pedestrian detection algorithm, vehicle detection and pedestrian detection are generally performed separately, that is, feature extraction is performed separately on images, which takes a long time. The vehicle pedestrian detection framework based on feature sharing is proposed, as shown in fig. 2, so that the vehicle detector and the pedestrian detector share ACF features, the detection efficiency is improved, and multi-category detection is completed.

Further, since the Mv-ACF and the CP-ACF use the same 10 original ACF feature channels, the former uses 2×2 average pooling to extract features, the latter uses three average pooling, 2×2, 4×4, and 8×8 average pooling to extract features, respectively, it can be seen that the latter uses feature channels including the former and the two feature pyramid construction modes are the same, so that the vehicle detector and the pedestrian detector can share the feature pyramid of the latter.

In a preferred embodiment, the step S400 further includes:

Specifically, each subclass of Mv-ACF detector adopts a data training model with different visual angles in the test process, and the detection result can comprise confidence scores with different distributions and detection frames with inconsistent geometric characteristics (such as length-width ratio). Direct merging can introduce noise, so that the subsequent NMS is unstable and the accuracy is reduced. According to the invention, a parameterized Logistic regression calibration method is introduced, and confidence score calibration is carried out on the detection result, so that the distribution of the detection result is more reasonable.

Specifically, let deti= { d _i1 ,d _i2 ,…,d _ij ,…,d _ir And r detection results of the ith subclass detector. Wherein d _ij ＝{R _ij ,c _ij The j-th detection result is represented by a detection frame R _ij Confidence score c _ij Composition is prepared. Set mDet _i ＝{md _i1 ,md _i2 ,…,md _ij ,…,md _ir And } is the calibrated result, where md _ij ＝{R _ij ,c′ _ij }. The purpose of the confidence score calibration is to use a calibration function g _i Make c' _ij ＝g _i (c _ij ). The parameterized Logistic regression calibration method is introduced, and the score is normalized, namely:

wherein the parameter A of the ith subclass detector _i And B _i Obtained by solving the regularized maximum interpretation problem:

substituting formula (1) into formula (2) to obtain

Wherein,

wherein r is ₊ And r _- Respectively the ith subclass for training parameters A _i And B _i Positive and negative samples of (a). y is _j Label representing jth sample, y _j = +1 as target, y _j = -1 represents background. Through the above process, the confidence score calibration of the vehicle subclass detector is completed.

S500, adopting a false detection rejection strategy based on road constraint to perform false detection rejection on the vehicle detection result and the pedestrian detection result.

In this embodiment, in order to avoid reducing the false detection rate, a step of false detection and rejection is further provided for the output result, and a false detection and rejection strategy based on road constraint is introduced, so that the false detection rate is reduced, and a detection algorithm is perfected, so that the method is suitable for detecting lightweight vehicles and pedestrians. Specifically, the step S500 specifically includes:

Wherein the first regression model is:

H ₁ ＝f(Y ₁ )，

the first error value calculating method comprises the following steps:

The second regression model is:

H ₂ ＝f(Y ₂ )，

the calculation method of the second error value comprises the following steps:

In other words, the invention firstly normalizes the heights H of 12186 pedestrians and 15891 vehicle calibration frames and the lower edge position coordinates Y of the calibration frames in the Caltech and KITTI training data set. The false detection phenomenon existing in the vehicle pedestrian detection process can be eliminated by utilizing the road prior information. In order to utilize the road prior information, firstly, statistics is carried out on the heights H of 12186 pedestrians and 15891 vehicle calibration frames in the normalized Caltech and KITTI training data set and the position coordinates Y of the lower edges of the calibration frames, and as the result is shown in fig. 5, a certain statistical relationship exists between the H and the Y. According to the relation, the invention provides a simple and efficient road constraint (Ground Plane Constraints, GPC) false detection rejection strategy, namely, taking the target which does not accord with the relation as false detection. The statistical relationship f may be determined using a regression model between H and Y, first by normalizing H and Y of the training samples, and then training the regression model W between H and Y using the SVM. And comparing the vehicle pedestrian calibration frame given in the detection result with the corresponding group trunk to obtain the vehicle pedestrian calibration frame closest to the real value. After training the model, regarding the detection frame { x, y, W, h } obtained after NMS, the lower edge position of the detection frame is y+h, then using the trained regression model W to calculate the corresponding h ', finally calculating the relative error between h and h', if the final error value is greater than the set threshold, considering the detection frame as false detection, otherwise, accepting the detection frame.

As shown in fig. 6, based on the above-mentioned method for detecting pedestrian in a vehicle based on the improved ACF, the present invention further provides a device for detecting pedestrian in a vehicle based on the improved ACF, where the device for detecting pedestrian in a vehicle based on the improved ACF may be a computing device such as a mobile terminal, a desktop computer, a notebook computer, a palm computer, and a server. The improved ACF-based vehicular pedestrian multi-category detection device includes a processor 10, a memory 20, and a display 30. Fig. 6 shows only a portion of the components of the improved ACF-based vehicle pedestrian multi-category detection device, but it is to be understood that not all of the illustrated components need be implemented, and that more or fewer components may alternatively be implemented.

The memory 20 may in some embodiments be an internal storage unit of the improved ACF-based vehicular pedestrian multi-class detection device, such as a hard disk or memory of the improved ACF-based vehicular pedestrian multi-class detection device. The memory 20 may also be an external storage device of the improved ACF-based vehicle pedestrian multi-class detection device in other embodiments, such as a plug-in hard disk, smart Media Card (SMC), secure Digital (SD) Card, flash Card (Flash Card) or the like, which is provided on the improved ACF-based vehicle pedestrian multi-class detection device. Further, the memory 20 may also include both an internal memory unit and an external memory device of the improved ACF-based vehicular pedestrian multi-class detection device. The memory 20 is used for storing application software and various types of data installed on the improved ACF-based vehicle pedestrian multi-class detection device, such as program code for installing the improved ACF-based vehicle pedestrian multi-class detection device. The memory 20 may also be used to temporarily store data that has been output or is to be output. In an embodiment, the memory 20 stores a modified ACF-based pedestrian multi-category detection program 40, and the modified ACF-based pedestrian multi-category detection program 40 is executable by the processor 10 to implement the modified ACF-based pedestrian multi-category detection method of the embodiments of the present application.

The processor 10 may in some embodiments be a central processing unit (Central Processing Unit, CPU), microprocessor or other data processing chip for executing program code or processing data stored in the memory 20, for example, for performing the improved ACF-based vehicle pedestrian multi-category detection method, etc.

The display 30 may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch, or the like in some embodiments. The display 30 is used for displaying information at the improved ACF-based vehicle pedestrian multi-category detection device and for displaying a visual user interface. The components 10-30 of the improved ACF-based vehicular pedestrian multi-class detection device communicate with each other over a system bus.

In an embodiment, the improved ACF-based vehicle pedestrian multi-category detection method described in the above embodiment is implemented when the processor 10 executes the improved ACF-based vehicle pedestrian multi-category detection program 40 in the memory 20, and since the improved ACF-based vehicle pedestrian multi-category detection method is described in detail above, the description thereof is omitted.

In summary, in the vehicle pedestrian multi-category detection method, device and storage medium based on the improved ACF provided by the invention, the problem of single detection category of the Adaboost classifier in the ACF detection algorithm is solved, and a multi-category detection frame is adopted to detect vehicles and pedestrians simultaneously; in order to solve the problem of low detection precision of vehicles and pedestrians, a multi-view vehicle detector and a context pixel pedestrian detector are adopted, so that the visual angle difference of a vehicle sample and deformation of the posture of the pedestrians during walking can be effectively captured, and the detection precision is improved; in order to overcome the false detection phenomenon in the vehicle pedestrian detection process, the false detection is effectively removed by utilizing the road prior information.

Of course, those skilled in the art will appreciate that implementing all or part of the above-described methods may be implemented by a computer program for instructing relevant hardware (e.g., a processor, a controller, etc.), where the program may be stored in a computer-readable storage medium, and where the program may include the steps of the above-described method embodiments when executed. The storage medium may be a memory, a magnetic disk, an optical disk, or the like.

The present invention is not limited to the above-mentioned embodiments, and any changes or substitutions that can be easily understood by those skilled in the art within the technical scope of the present invention are intended to be included in the scope of the present invention.

Claims

1. The vehicle pedestrian multi-category detection method based on the improved ACF is characterized by comprising the following steps of:

adopting a false detection rejection strategy based on road constraint to perform false detection rejection on the vehicle detection result and the pedestrian detection result;

the step of extracting the pretreated multi-view aggregation channel characteristics of the vehicle training sample by using the vehicle pedestrian detection frame and establishing a vehicle detector according to the multi-view aggregation channel characteristics specifically comprises the following steps:

calculating a similar incidence matrix among all sample points in the preprocessed vehicle training sample by adopting a spectral clustering algorithm, obtaining feature vectors with multiple dimensions through matrix spectral decomposition, clustering the feature vectors with multiple dimensions by adopting a K-means algorithm to extract aggregation channel features of multiple visual angles, and training a vehicle detector with corresponding visual angles by utilizing the aggregation channel features of all visual angles;

the step of extracting the pre-processed context pixel aggregation channel characteristics of the pedestrian training sample by using the vehicle pedestrian detection frame and establishing a pedestrian detector according to the context pixel aggregation channel characteristics specifically comprises the following steps:

2. The improved ACF-based vehicle pedestrian multi-class detection method of claim 1 wherein the method of preprocessing the vehicle training sample and pedestrian training sample is specifically:

3. The method for detecting pedestrian in vehicle according to claim 1, wherein the step of obtaining the preprocessed image to be detected and sharing the aggregate channel features of the image to be detected to the vehicle detector and the pedestrian detector to obtain the vehicle detection result and the pedestrian detection result further comprises:

4. The improved ACF-based vehicle pedestrian multi-category detection method of claim 1 wherein the step of false detection rejection of the vehicle detection result and pedestrian detection result using a road constraint-based false detection rejection strategy comprises:

5. The improved ACF-based vehicle pedestrian multi-class detection method of claim 4 wherein the first regression model is:

，

the first error value calculating method comprises the following steps:

，

wherein,representing a first error value,/>Indicating the height of the actual vehicle calibration frame, +.>Representing the height of the predicted vehicle calibration frame, abs represents the absolute value.

6. The improved ACF-based vehicle pedestrian multi-class detection method of claim 4 wherein the second regression model is:

，

the calculation method of the second error value comprises the following steps:

，

wherein,representing a second error value, +.>Indicating the height of the actual vehicle calibration frame, +.>Representing the height of the predicted vehicle calibration frame, abs represents the absolute value.

7. A vehicle pedestrian multi-category detection device based on an improved ACF, comprising: a processor and a memory;

the processor, when executing the computer readable program, implements the steps in the improved ACF-based vehicular pedestrian multi-category detection method as recited in any one of claims 1 to 6.

8. A computer-readable storage medium storing one or more programs executable by one or more processors to implement the steps in the improved ACF-based vehicle pedestrian multi-category detection method of any of claims 1 to 6.