CN108710967B - Expressway traffic accident severity prediction method based on data fusion and support vector machine - Google Patents

Expressway traffic accident severity prediction method based on data fusion and support vector machine Download PDF

Info

Publication number
CN108710967B
CN108710967B CN201810353803.3A CN201810353803A CN108710967B CN 108710967 B CN108710967 B CN 108710967B CN 201810353803 A CN201810353803 A CN 201810353803A CN 108710967 B CN108710967 B CN 108710967B
Authority
CN
China
Prior art keywords
accident
severity
traffic accident
support vector
vector machine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810353803.3A
Other languages
Chinese (zh)
Other versions
CN108710967A (en
Inventor
章晨
何杰
刘子洋
邢璐
赵池航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN201810353803.3A priority Critical patent/CN108710967B/en
Publication of CN108710967A publication Critical patent/CN108710967A/en
Application granted granted Critical
Publication of CN108710967B publication Critical patent/CN108710967B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a highway traffic accident severity prediction method based on data fusion and a support vector machine, which comprises the following steps: 1. collecting l variable factors such as road conditions, driver conditions, vehicle conditions and the like when m traffic accidents occur to form a sample set, and recording the severity value r of each traffic accidenti(ii) a 2. Carrying out dimensionality reduction and normalization on the variable factors of the m collected accident samples; 3. constructing a traffic accident severity prediction model by applying a support vector machine algorithm; 4. and (4) substituting the variable factor vector x of the accident to be predicted after dimensionality reduction into the traffic accident severity prediction model established in the step (3) to obtain a severity prediction result of the accident to be predicted. The method can accurately predict the severity of the highway accident.

Description

Expressway traffic accident severity prediction method based on data fusion and support vector machine
Technical Field
The invention belongs to the field of traffic accident analysis and prediction, and particularly relates to a highway traffic accident severity prediction method based on data fusion and a support vector machine.
Background
At present, the analysis of the severity of the accident at home and abroad mainly stays at the level of a single data source and a traditional statistical analysis method, the consideration of influencing factors is less, the analysis is often incomplete, and the model error is larger. As technology advances, data collection is becoming easier. A huge amount of data can be collected about factors that are relevant to traffic accidents, such as road geometry, coil data, weather conditions, road visibility, accident driver conditions, etc. How to control the severity of the accident within a certain range based on the analysis of mass data through a scientific method is an important point to be solved urgently.
Disclosure of Invention
The purpose of the invention is as follows: the invention aims to provide a method for predicting the severity of an expressway traffic accident, which can accurately predict the severity of the expressway accident.
The technical scheme is as follows: the invention adopts the following technical scheme:
the expressway traffic accident severity prediction method based on data fusion and a support vector machine comprises the following steps:
(1) collecting l variable factors such as road conditions, driver conditions and vehicle conditions when m traffic accidents occur, and forming a sample set S ═ (S)1,s2,…,sm) Wherein s isi=(f1i,f2i,…,fli)T,fhiThe h variable factor of the accident numbered i; recording the severity value, r, of each traffic accidentiThe severity value of the accident numbered i, h 1.. l, i 1.. m;
(2) reducing and normalizing the variable factors of the m collected accident samples, and setting a dimension-reduced sample si' for n dimensions, i.e. retaining n variable factors, n<l,si′=(f1i′,f2i′,…,fni′)T,fki' is a variable factor retained after dimensionality reduction, and k is 1.. n;
the formula of the normalization process is:
xki=(fki′-MinValue)/(MaxValue-MinValue)
wherein xkiAs a variable factor fki' normalized value, MinValue is { fk1′,fk2′,…,fkm', and MaxValue is { fk1′,fk2′,…,fkmThe maximum value in';
(3) applying a support vector machine algorithm to construct a traffic accident severity prediction model, wherein the model is as follows:
Figure GDA0002996683510000021
wherein, f (x) is the severity prediction result of the accident to be predicted, and x is the variable factor vector of the accident to be predicted after dimensionality reduction; x is the number ofjFor the j accident sample after dimension reduction normalization, yjThe severity value of the jth accident in the sample set is taken; alpha is alphajCorresponding pull for jth accident sampleThe Greenland multiplier, b is the intercept, and K (·) is the kernel function;
(4) and (3) reducing the dimension of the variable factor vector of the accident to be predicted according to the dimension reduction method in the step (2) to obtain a variable factor vector x of the accident to be predicted after dimension reduction, and substituting x into the traffic accident severity prediction model established in the step (3) to obtain a severity prediction result of the accident to be predicted.
The road conditions include a gradient direction, a flat curve direction, positive or negative represented by 0 or 1, respectively.
The driver condition comprises the age and the sex of the driver; wherein the sex of the driver is represented by 0 or 1 for male or female.
The vehicle condition includes an age of the accident vehicle.
And (3) reducing the dimension of the variable factor sample set S by adopting a principal component analysis method or an independent component analysis method in the step (2).
The kernel function K (-) is a Gaussian kernel function, and the expression is as follows:
Figure GDA0002996683510000022
where σ is a kernel width adjustment parameter.
Has the advantages that: compared with the prior art, the highway traffic accident severity prediction method based on data fusion and the support vector machine disclosed by the invention has the following advantages: 1. multiple data sources are considered instead of a single accident data source, the multiple data sources can enable the model to be established more accurately, and errors of training and testing are smaller; 2. a support vector machine method is used instead of the traditional statistical analysis method, the traditional statistical analysis method is low in operation speed when multivariable is processed, the nonlinear part is complex, and the logic is unclear. The method for predicting the severity of the highway traffic accident based on the data fusion and the support vector machine can well solve the problems.
Drawings
FIG. 1 is an overall control flow diagram of the present invention;
FIG. 2 is a graph of an incident month distribution in the incident data set;
fig. 3 is an age distribution diagram of an accident driver in a vehicle data set.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention are described below with reference to the accompanying drawings.
The data set adopted by the embodiment is a multi-source data set for accidents, weather, driver conditions and road conditions in 2011 to 2015 years in a certain city. As shown in fig. 1, the method for predicting the severity of a highway traffic accident based on data fusion and a support vector machine comprises the following steps:
(1) collecting l variable factors such as road conditions, driver conditions and vehicle conditions when m traffic accidents occur, and forming a sample set S ═ (S)1,s2,…,sm) Wherein s isi=(f1i,f2i,…,fli)T,fhiThe h variable factor of the accident numbered i; recording the severity value, r, of each traffic accidentiThe severity value of the accident numbered i, h 1.. l, i 1.. m;
the relevant information of accident occurrence is recorded in the original accident data set and the driver data set respectively, and the road relevant information is recorded in the road information database, and part of the original data is shown in fig. 2 and fig. 3. Firstly, according to records in an original accident data set, fusing a multi-element data set and a database, comprising the steps of data cleaning, classification parameter digitization, variable value conversion, merging sample variables by using parameters common among databases and the like, wherein the steps comprise the following steps:
(1-1) removing irrelevant elements such as 'accident number', 'road pile number', 'road number' and the like in the original accident data set from the sample variables; deleting samples containing '0' and 'unknown' in variables 'weather', 'light' and 'accident type';
(1-2) converting the variable 'gradient direction' and 'flat curve direction' from 'positive/negative' to '1/0'; in the embodiment, the accident severity is divided into 2 levels according to whether a person is injured, namely, the variable 'accident severity' in the original accident data set is binarized, wherein 1 is used for indicating an accident without injury, and 0 is used for indicating an accident with injury; converting the variable 'production year of the accident vehicle' into 'age of the accident vehicle', and concretely converting the formula:
Figure GDA0002996683510000031
wherein, VehyriIndicating the age of the accident vehicle in the accident occurrence year in the accident numbered i,
Figure GDA0002996683510000041
val (year) as the year of production of the accident vehiclei) The year of the accident with number i.
(1-3) adding the 'driver age' and 'driver sex' in the driver data set to the accident data set through the label 'accident number', and adding the road related variables to the accident data set through the label 'road number', so that the fusion of the multivariate data is realized.
In the present embodiment, the road condition includes a gradient direction, a flat curve direction, positive or negative represented by 0 or 1, respectively; the driver condition includes driver age, driver gender; wherein the sex of the driver is represented by 0 or 1 for male or female; the vehicle condition includes an age of the accident vehicle.
(2) Reducing and normalizing the variable factors of the m collected accident samples, and setting a dimension-reduced sample si' for n dimensions, i.e. retaining n variable factors, n<l,si′=(f1i′,f2i′,…,fni′)T,fki' is a variable factor retained after dimensionality reduction, and k is 1.. n;
the formula of the normalization process is:
xki=(fki′-MinValue)/(MaxValue-MinValue)
wherein xkiAs a variable factor fki' normalized value, MinValue is { fk1′,fk2′,…,fkm' } minimum value, MaxValue is-fk1′,fk2′,…,fkmThe maximum value in';
the variable factor sample set S may be subjected to dimensionality reduction using Principal Component Analysis (PCA) or Independent Component Analysis (ICA).
(3) Applying a support vector machine algorithm to construct a traffic accident severity prediction model, wherein the model is as follows:
Figure GDA0002996683510000042
wherein, f (x) is the severity prediction result of the accident to be predicted, and x is the variable factor vector of the accident to be predicted after dimensionality reduction; x is the number ofjFor the j accident sample after dimension reduction normalization, yjThe severity value of the jth accident in the sample set is taken; alpha is alphajA Lagrange multiplier corresponding to the jth accident sample, b is an intercept, and K (·) is a kernel function;
in this embodiment, the SPSS modeler is applied to establish a regression analysis model of the support vector machine, including establishing a kernel function, convergence accuracy, and the like. The kernel function K (·) adopts a Gaussian kernel function, and the expression is as follows:
Figure GDA0002996683510000043
where σ is a kernel width adjustment parameter.
(4) And (3) reducing the dimension of the variable factor vector of the accident to be predicted according to the dimension reduction method in the step (2) to obtain a variable factor vector x of the accident to be predicted after dimension reduction, and substituting x into the traffic accident severity prediction model established in the step (3) to obtain a severity prediction result of the accident to be predicted.
In the embodiment, the data of the highway traffic accident in 2015 is used as a test sample, and the method for predicting the severity of the traffic accident provided by the invention is verified, so that higher prediction accuracy is obtained.

Claims (7)

1. The expressway traffic accident severity prediction method based on data fusion and a support vector machine is characterized by comprising the following steps of:
(1) collecting l variable factors such as road conditions, driver conditions and vehicle conditions when m traffic accidents occur, and forming a sample set S ═ (S)1,s2,…,sm) Wherein s isi=(f1i,f2i,…,fli)T,fhiThe h variable factor of the accident numbered i; recording the severity value, r, of each traffic accidentiThe severity value of the accident numbered i, h 1.. l, i 1.. m;
(2) reducing and normalizing the variable factors of the m collected accident samples, and setting a dimension-reduced sample siIs n dimension, n<l,si′=(f1i′,f2i′,…,fni′)T,fki' is a variable factor retained after dimensionality reduction, and k is 1.. n;
the formula of the normalization process is:
xki=(fki′-MinValue)/(MaxValue-MinValue)
wherein xkiAs a variable factor fki' normalized value, MinValue is { fk1′,fk2′,…,fkm', and MaxValue is { fk1′,fk2′,…,fkmThe maximum value in';
(3) applying a support vector machine algorithm to construct a traffic accident severity prediction model, wherein the model is as follows:
Figure FDA0002996683500000011
wherein, f (x) is the severity prediction result of the accident to be predicted, and x is the variable factor vector of the accident to be predicted after dimensionality reduction; x is the number ofjFor the j accident sample after dimension reduction normalization, yjThe severity value of the jth accident in the sample set is taken; alpha is alphajA Lagrange multiplier corresponding to the jth accident sample, b is an intercept, and K (·) is a kernel function;
(4) and (3) reducing the dimension of the variable factor vector of the accident to be predicted according to the dimension reduction method in the step (2) to obtain a variable factor vector x of the accident to be predicted after dimension reduction, and substituting x into the traffic accident severity prediction model established in the step (3) to obtain a severity prediction result of the accident to be predicted.
2. The method of claim 1, wherein the road conditions include a slope direction, a flat curve direction, and positive or negative represented by 0 or 1, respectively.
3. The method for predicting the severity of an expressway traffic accident according to claim 1, wherein the driver's condition includes driver age, driver gender; wherein the sex of the driver is represented by 0 or 1 for male or female.
4. The method of claim 1, wherein the vehicle condition comprises an age of an accident vehicle.
5. The method for predicting the severity of an expressway traffic accident based on data fusion and a support vector machine according to claim 1, wherein the principal component analysis is adopted in the step (2) to reduce the dimension of the variable factor sample set S.
6. The method for predicting the severity of an expressway traffic accident based on data fusion and a support vector machine according to claim 1, wherein in the step (2), an independent component analysis method is adopted to perform dimension reduction on the variable factor sample set S.
7. The method for predicting the severity of an expressway traffic accident based on data fusion and a support vector machine according to claim 1, wherein the kernel function K (-) is a gaussian kernel function and the expression is:
Figure FDA0002996683500000021
where σ is a kernel width adjustment parameter.
CN201810353803.3A 2018-04-19 2018-04-19 Expressway traffic accident severity prediction method based on data fusion and support vector machine Active CN108710967B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810353803.3A CN108710967B (en) 2018-04-19 2018-04-19 Expressway traffic accident severity prediction method based on data fusion and support vector machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810353803.3A CN108710967B (en) 2018-04-19 2018-04-19 Expressway traffic accident severity prediction method based on data fusion and support vector machine

Publications (2)

Publication Number Publication Date
CN108710967A CN108710967A (en) 2018-10-26
CN108710967B true CN108710967B (en) 2021-07-27

Family

ID=63867339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810353803.3A Active CN108710967B (en) 2018-04-19 2018-04-19 Expressway traffic accident severity prediction method based on data fusion and support vector machine

Country Status (1)

Country Link
CN (1) CN108710967B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110110085A (en) * 2019-04-24 2019-08-09 中电海康集团有限公司 Traffic accident file classification method and system based on character level neural network and SVM
CN113112794A (en) * 2021-03-31 2021-07-13 四川省气象服务中心(四川省专业气象台 四川省气象影视中心) Traffic accident occurrence rate prediction method based on space-time meteorological grid
CN113506440B (en) * 2021-09-08 2021-11-30 四川国蓝中天环境科技集团有限公司 Traffic state estimation method for multi-source data fusion under Lagrange coordinate system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101271625A (en) * 2008-04-03 2008-09-24 东南大学 Method for detecting freeway traffic event by integration supporting vector machine
CN103646534A (en) * 2013-11-22 2014-03-19 江苏大学 A road real time traffic accident risk control method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101271625A (en) * 2008-04-03 2008-09-24 东南大学 Method for detecting freeway traffic event by integration supporting vector machine
CN103646534A (en) * 2013-11-22 2014-03-19 江苏大学 A road real time traffic accident risk control method

Also Published As

Publication number Publication date
CN108710967A (en) 2018-10-26

Similar Documents

Publication Publication Date Title
CN108665093B (en) Deep learning-based expressway traffic accident severity prediction method
CN108710967B (en) Expressway traffic accident severity prediction method based on data fusion and support vector machine
CN107330463B (en) Vehicle type identification method based on CNN multi-feature union and multi-kernel sparse representation
CN111160401B (en) Abnormal electricity utilization discriminating method based on mean shift and XGBoost
CN105374209B (en) A kind of urban area road network running status characteristics information extraction method
CN111126868B (en) Road traffic accident occurrence risk determination method and system
WO2024027027A1 (en) Method and system for recognizing lane changing intention of manually-driven vehicle
CN116631186B (en) Expressway traffic accident risk assessment method and system based on dangerous driving event data
CN111784017A (en) Road condition factor regression analysis-based road traffic accident quantity prediction method
CN114139624A (en) Method for mining time series data similarity information based on integrated model
CN113313357A (en) Traffic road safety evaluation method based on Gaussian process regression analysis
CN113779260A (en) Domain map entity and relationship combined extraction method and system based on pre-training model
CN108985941A (en) A kind of stock intelligent Forecasting of combination newsletter archive
CN104680185A (en) Hyperspectral image classification method based on boundary point reclassification
CN116468935A (en) Multi-core convolutional network-based stepwise classification and identification method for traffic signs
CN116738551B (en) Intelligent processing method for acquired data of BIM model
CN115271565B (en) DEA-based method, device and equipment for evaluating highway pavement maintenance measures
CN115774942A (en) Driving style identification model modeling and statistical method based on Internet of vehicles real vehicle data and SVM
CN111626508B (en) Track traffic vehicle-mounted data prediction method based on xgboost model
CN110083637B (en) Bridge disease rating data-oriented denoising method
CN113591780A (en) Method and system for identifying driving risk of driver
Xi et al. Classification and recognition model for the severity of road traffic accidents
CN111078882A (en) Text emotion measuring method and device
CN111554087B (en) Early warning method for highway blockage
CN117574982B (en) Pre-training model fine tuning method and device based on linear transformation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant