CN108053075A - A kind of scrap-car Forecasting Methodology and system - Google Patents

A kind of scrap-car Forecasting Methodology and system Download PDF

Info

Publication number
CN108053075A
CN108053075A CN201711445897.9A CN201711445897A CN108053075A CN 108053075 A CN108053075 A CN 108053075A CN 201711445897 A CN201711445897 A CN 201711445897A CN 108053075 A CN108053075 A CN 108053075A
Authority
CN
China
Prior art keywords
vehicle
preset time
time period
module
scrap
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711445897.9A
Other languages
Chinese (zh)
Other versions
CN108053075B (en
Inventor
黄智勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING SINOIOV VEHICLE NETWORK TECHNOLOGY Co Ltd
Original Assignee
BEIJING SINOIOV VEHICLE NETWORK TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING SINOIOV VEHICLE NETWORK TECHNOLOGY Co Ltd filed Critical BEIJING SINOIOV VEHICLE NETWORK TECHNOLOGY Co Ltd
Priority to CN201711445897.9A priority Critical patent/CN108053075B/en
Publication of CN108053075A publication Critical patent/CN108053075A/en
Application granted granted Critical
Publication of CN108053075B publication Critical patent/CN108053075B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Marketing (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Artificial Intelligence (AREA)
  • Primary Health Care (AREA)
  • Health & Medical Sciences (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Educational Administration (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Train Traffic Observation, Control, And Security (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses the Forecasting Methodologies and system of a kind of scrap-car, belong to intelligent transportation big data field.The described method includes:The characteristic information of each vehicle is extracted in the track data of each vehicle in the first preset time period;According to the characteristic information of each vehicle and default scrap-car standard generation training sample, and training sample is divided into training set and test set;Prediction model is trained according to training set, and prediction model is tested using test set;Use following scrap-car of the prediction model prediction after test.In the present invention, based on the historical trajectory data in each vehicle regular period, the training of prediction model is carried out so that the prediction model and prediction result finally obtained be all more scientific and reliability;It is combined simultaneously with machine learning, finds out operation rule of the vehicle before scrapping, solve hysteresis sex chromosome mosaicism of the prior art;For the manufacturer of each vehicle, marketing strategy also can be more accurately determined.

Description

A kind of scrap-car Forecasting Methodology and system
Technical field
The present invention relates to intelligent transportation big data field more particularly to the Forecasting Methodologies and system of a kind of scrap-car.
Background technology
With the fast development of social economy, either private car or stream carrier vehicle all flies with exponential speed Speed increases;Each vehicle manufacturers are also constantly producing and are selling various brands, the vehicle of various functions, all size.It is right For each vehicle manufacturers, timely understanding the demand of car purchaser can make it produce more popular vehicle;Meanwhile Scrap-car in the market can be timely predicted, so as to which demand of clearly marketing is also equal heavy for vehicle manufacturers It will.And the Forecasting Methodology of existing scrap-car, typically only consider the off-duty number of days of vehicle, and set using subjective experience It puts and scraps rule;However judge whether vehicle is scrapped by vehicle off-duty number of days, for vehicle manufacturers, often Through losing optimal marketing opportunity;Meanwhile rule is scrapped using subjective experience setting, it is excessively subjective to also result in prediction result, Without convincingness, and there are certain hysteresis sex chromosome mosaicisms.It can be seen that how accurate, the prediction scrap-car of science, to each vehicle Manufacturer for, be still a urgent problem.
The content of the invention
To solve the deficiencies in the prior art, the present invention provides a kind of Forecasting Methodology and system of scrap-car.
On the one hand, the present invention provides a kind of Forecasting Methodology of scrap-car, including:
Step S1:The characteristic information of each vehicle is extracted in the track data of each vehicle in the first preset time period;
Step S2:According to the characteristic information of each vehicle and default scrap-car standard generation training sample, and by institute It states training sample and is divided into training set and test set;
Step S3:Prediction model is trained according to the training set, and using the test set to the prediction model into Row test;
Step S4:Use following scrap-car of the prediction model prediction after test.
Optionally, before the step S1, further include:The track data of each vehicle in first preset time period is carried out pre- Processing;
Accordingly, the step S1, specially:The characteristic information of each vehicle is extracted in track data after the pre-treatment.
Optionally, the track data includes:A series of position data and travel speed;
The step S1, specifically includes:
First preset time period is divided into M the second preset time periods, and according to time order and function order by preceding n Second preset time period is as the 3rd preset time period, wherein n<M;
According to the position data and travel speed contained in the track data of each vehicle, it is default each second to count each vehicle Operation number of days, maintenance station in period stop number and stop duration;
When stopping number according to operation number of days of each vehicle in each second preset time period, maintenance station and stop It is long, determine that operation number of days variation tendency of each vehicle in the 3rd preset time period, maintenance station are stopped number and averagely stopped By duration.
According to operation number of days of each vehicle in each second preset time period and default stoppage in transit standard, each vehicle is determined Stoppage in transit situation in each second preset time period.
Optionally, the step S2, specifically includes:
Step S2-1:According to stoppage in transit situation of each vehicle in each second preset time period and default scrap-car mark Standard determines scrap-car and operation vehicle in each vehicle;
Step S2-2:By operation number of days variation tendency of each vehicle in scrap-car in the 3rd preset time period, Maintenance station stops number and averagely stops duration establishes correspondence and as positive sample with corresponding vehicle, obtains positive sample collection It closes;Operation number of days variation tendency of each vehicle in the 3rd preset time period in vehicle will be run, number is stopped in maintenance station And averagely stop duration and establish correspondence and as negative sample with corresponding vehicle, obtain negative sample set;
Step S2-3:Positive sample is chosen in the positive sample set according to the first preset ratio, and in the negative sample Negative sample is chosen in set, obtains the training sample of default quantity;
Step S2-4:The training sample of the default quantity is divided by training set and test according to the second preset ratio Collection.
Optionally, the step S3, specifically includes:
Step S3-1:The training set is trained to obtain prediction model according to random forests algorithm;
Step S3-2:The prediction model is assessed according to Receiver operating curve and adjusts model parameter, When the first area under Receiver operating curve meets predetermined threshold value, corresponding prediction model is exported;
Step S3-3:The prediction model of calculating output is on the test set under corresponding Receiver operating curve Second area when the second area meets the predetermined threshold value, performs step S4;Otherwise return to step S3-1;
Accordingly, the step S4, specially:Using current time as deadline, count forward the described 3rd it is default when Between in section the operation number of days variation tendency of each vehicle, maintenance station stop number and averagely stop duration, and using pre- after test It surveys the scrap-car in model prediction future and scraps probability.
On the other hand, the present invention provides a kind of forecasting system of scrap-car, including:
Extraction module, for extracting the feature of each vehicle letter in the track data of each vehicle in the first preset time period Breath;
Generation module, for the characteristic information of each vehicle extracted according to the extraction module and default scrap-car standard Generate training sample;
Division module, the training sample for the generation module to be generated are divided into training set and test set;
Training module, the training set for being divided according to the division module train prediction model;
Test module, for the prediction model trained using the test set of division module division to the training module It is tested;
Prediction module, for following scrap-car of the prediction model prediction after the test module is used to test.
Optionally, the system also includes:Preprocessing module;
The preprocessing module pre-processes for the track data to each vehicle in the first preset time period;
The extraction module, is specifically used for:Each vehicle is extracted in the pretreated track data of the preprocessing module Characteristic information.
Optionally, the track data includes:A series of position data and travel speed;
The extraction module, is specifically used for:
First preset time period is divided into M the second preset time periods, and according to time order and function order by preceding n Second preset time period is as the 3rd preset time period, wherein n<M;
According to the position data and travel speed contained in the track data of each vehicle, it is default each second to count each vehicle Operation number of days, maintenance station in period stop number and stop duration;
When stopping number according to operation number of days of each vehicle in each second preset time period, maintenance station and stop It is long, determine that operation number of days variation tendency of each vehicle in the 3rd preset time period, maintenance station are stopped number and averagely stopped By duration.
According to operation number of days of each vehicle in each second preset time period and default stoppage in transit standard, each vehicle is determined Stoppage in transit situation in each second preset time period.
Optionally, the generation module, specifically includes:Determination sub-module, sample generation submodule and selection submodule;
The determination sub-module, for according to each vehicle that the extraction module extracts in each second preset time period Stoppage in transit situation and default scrap-car standard determine scrap-car and operation vehicle in each vehicle;
The sample generates submodule, for each vehicle in the scrap-car that determines the determination sub-module described the Operation number of days variation tendency, maintenance station in three preset time periods stop number and averagely stop duration and established with corresponding vehicle Correspondence is simultaneously used as positive sample, obtains positive sample set;Each vehicle in operation vehicle that the determination sub-module determines is existed Operation number of days variation tendency, maintenance station in 3rd preset time period stop number and averagely stop duration and corresponding vehicle It establishes correspondence and as negative sample, obtains negative sample set;
The selection submodule, for the positive sample collection obtained according to the first preset ratio in sample generation submodule Positive sample is chosen in conjunction, and negative sample is chosen in the negative sample set obtained in sample generation submodule, obtains present count The training sample of amount;
The division module, is specifically used for:The default quantity for being obtained the selection submodule according to the second preset ratio Training sample be divided into training set and test set.
Optionally, the training module, specifically includes:Training submodule and assessment submodule;
The trained submodule, the training set for being divided according to random forests algorithm to the division module are trained Obtain prediction model;
The assessment submodule, for the prediction mould obtained according to Receiver operating curve to the trained submodule Type is assessed and adjusts model parameter, when the first area under Receiver operating curve meets predetermined threshold value, output Corresponding prediction model;
The test module, is specifically used for:The prediction model of the assessment submodule output is calculated in the division module Second area on the test set of division under corresponding Receiver operating curve, when the second area meets described preset During threshold value, the prediction module is triggered;
The prediction module, is specifically used for:Using current time as deadline, the 3rd preset time period is counted forward The operation number of days variation tendency of interior each vehicle, maintenance station stop number and averagely stop duration, and are surveyed using the test module Prediction model after examination predicts following scrap-car and scraps probability.
The advantage of the invention is that:
In the present invention, based on the historical trajectory data in each vehicle regular period, the generation of prediction model is carried out, The length and width of data has and significantly increases so that the prediction model and prediction result finally obtained it is all more scientific and Reliability;It is combined simultaneously with machine learning, operation rule of the vehicle before scrapping is found out according to random forests algorithm, so as to To prediction scrap-car and the prediction model of probability is scrapped, solves hysteresis sex chromosome mosaicism of the prior art;For each vehicle For manufacturer, marketing strategy also can be more accurately determined.
Description of the drawings
By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is common for this field Technical staff will be apparent understanding.Attached drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings:
Attached drawing 1 is a kind of scrap-car Forecasting Methodology flow chart provided by the invention;
Attached drawing 2 is a kind of scrap-car forecasting system module composition frame chart provided by the invention.
Specific embodiment
The illustrative embodiments of the disclosure are more fully described below with reference to accompanying drawings.Although this public affairs is shown in attached drawing The illustrative embodiments opened, it being understood, however, that may be realized in various forms the disclosure without the reality that should be illustrated here The mode of applying is limited.It is to be able to be best understood from the disclosure on the contrary, providing these embodiments, and can be by this public affairs The scope opened completely is communicated to those skilled in the art.
Embodiment one
According to the embodiment of the present invention, a kind of Forecasting Methodology of scrap-car is provided, as shown in Figure 1, including:
Step 101:The characteristic information of each vehicle is extracted in the track data of each vehicle in the first preset time period;
Preferably, in the present invention, each vehicle is equipped with mobile unit, and mobile unit is every prefixed time interval (such as 30 Second) report the track data of corresponding vehicle;Wherein, track data includes but not limited to:A series of position data and traveling speed Degree;
In the present embodiment, before step 101, further include:To the track data of each vehicle in the first preset time period into Row pretreatment;
Specifically, the track data of each vehicle in the first preset time period is proceeded as follows:
Wrong data in filter footprint data;
Correct the position data deviated in track data;
The benefit corrected according to time dimension in track data passes data;
Wherein, the wrong data in filter footprint data, such as location information mistake, travel speed mistake;Correct track The position data of offset is modified by the position data deviated in data specifically by the methods of Fourier filtering;
Accordingly, step 101, it is specially:The characteristic information of each vehicle is extracted in track data after the pre-treatment.
Further, in the present embodiment, step 101 specifically includes:
First preset time period is divided into M the second preset time periods, and according to time order and function order by preceding n second Preset time period is as the 3rd preset time period, wherein n<M;
According to the position data and travel speed contained in the track data of each vehicle, it is default each second to count each vehicle Operation number of days, maintenance station in period stop number and stop duration;
Number is stopped according to operation number of days of each vehicle in each second preset time period, maintenance station and stops duration, really Operation number of days variation tendency of the fixed each vehicle in the 3rd preset time period, maintenance station stop number and averagely stop duration.
According to operation number of days of each vehicle in each second preset time period and default stoppage in transit standard, determine each vehicle each Stoppage in transit situation in second preset time period.
Wherein, the first preset time period, the second preset time period, the 3rd preset time end can voluntarily be set according to demand It is fixed, for example, in the present embodiment, using the first preset time period as in January, 2016 in June, 2017, the second preset time period First preset time period is then divided into 12 by Shi Changwei mono- month, the 3rd preset time period to illustrate exemplified by first 12 months A month, the 3rd preset time period was in January, 2016 in December, 2016.
Wherein, according to the position data and travel speed contained in the track data of each vehicle, each vehicle is counted each Operation number of days, maintenance station in two preset time periods stop number and stop duration, are specially:
According to the position data, travel speed and corresponding time contained in the track data of each vehicle, arbitrary phase is determined The distance between adjacent two position datas calculate each vehicle in daily distance travelled, when each vehicle exists according to definite distance Daily distance travelled is more than preset travel fare register, using corresponding one day service time as vehicle, counts each vehicle and exists Operation number of days in each second preset time period;Wherein, preset travel mileage is preferably 5 kms;
The position data contained in the track data of each vehicle is matched with the position data of each maintenance station, and general At the corresponding maintenance station of successful position data, stop the conduct that duration is more than preset duration and once really stop, statistics is each It stops number and stops duration in maintenance station of the vehicle in each second preset time period;Wherein, preset duration, when being preferably 2 small.
Wherein, when stopping number according to operation number of days of each vehicle in each second preset time period, maintenance station and stop It is long, when determining that operation number of days variation tendency of each vehicle in the 3rd preset time period, maintenance station are stopped number and averagely stopped It is long, be specially:Using least square method to each vehicle in each second preset time period for being included in the 3rd preset time period Operation number of days is calculated, and obtains operation number of days variation tendency of each vehicle in the 3rd preset time period;It is and default to the 3rd Each vehicle maintenance station in each second preset time period included in period is stopped number and is summed it up, and it is pre- the 3rd to obtain each vehicle If number is stopped in the maintenance station in the period;And each vehicle in each second preset time period to being included in the 3rd preset time period Duration averaging is stopped in maintenance station, obtains maintenance station of each vehicle in the 3rd preset time period and averagely stops duration;
Wherein, stoppage in transit standard is preset, it is preferable that be specially in the present embodiment:When vehicle is in the second preset time period When operation number of days is zero, it is denoted as vehicle and stops transport in second preset time period;Accordingly, it is default each second according to each vehicle Operation number of days and default stoppage in transit standard in period determine stoppage in transit situation of each vehicle in each second preset time period, tool Body is:Judge whether operation number of days of each vehicle in each second preset time period is zero, and be zero corresponding by operation number of days Second preset time period is denoted as corresponding vehicle and stops transport.
Step 102:According to the characteristic information of each vehicle and default scrap-car standard generation training sample, and by generation Training sample is divided into training set and test set;
Preferably, in the present embodiment, presetting scrap-car standard is:The last one contained in 3rd preset time period Second preset time period is not stopped transport, and the second preset time that is adjacent, continuous, presetting quantity after the 3rd preset time period It stops transport in section;Wherein, default quantity is preferably 6;
For example, in the present embodiment, in December, 2016 is not stopped transport, and continuous 6 months after in December, 2016 The vehicle that (i.e. in January, 2017 to June) stops transport is as scrap-car.
In the present embodiment, step 102 specifically includes:
Step 102-1:According to stoppage in transit situation of each vehicle in each second preset time period and default scrap-car standard, Determine the scrap-car and operation vehicle in each vehicle;
Step 102-2:By operation number of days variation tendency of each vehicle in scrap-car in the 3rd preset time period, repair It stands to stop number and averagely stop duration and establishes correspondence and as positive sample with corresponding vehicle, obtain positive sample set; Operation number of days variation tendency of each vehicle in the 3rd preset time period in vehicle will be run, number is stopped in maintenance station and averagely stops Correspondence is established with corresponding vehicle and as negative sample, obtain negative sample set by duration;
Step 102-3:Positive sample is chosen in positive sample set according to the first preset ratio, and is selected in negative sample set Negative sample is taken, obtains the training sample of default quantity;
Preferably, in the present embodiment, the first preset ratio is 1:1, it is 10000 to preset quantity.
Step 102-4:The training sample of default quantity is divided by training set and test set according to the second preset ratio.
Preferably, in the present embodiment, the second preset ratio is 7:3.
It may be noted that ground, scrap-car standard is preset, it can according to demand and actual conditions sets itself.
Step 103:Prediction model is trained according to the training set, and using the test set to the prediction model into Row test;
In the present embodiment, step 103, specifically include:
Step 103-1:Training set is trained according to random forests algorithm to obtain prediction model;
Specifically, using Python instruments, the random forests algorithm in scikit-learn machine learning storehouse is called to instruction Practice collection to be trained to obtain prediction model.
Step 103-2:According to Receiver operating curve (Receiver OperatingCharacteristic Curve, abbreviation ROC curve) prediction model is assessed and adjusts model parameter, when under Receiver operating curve When one area meets predetermined threshold value, corresponding prediction model is exported;
Wherein, the area under Receiver operating curve, also known as AUC, specifically, when the first AUC meets default threshold During value, corresponding prediction model is exported;
Wherein, predetermined threshold value is preferably 0.8.
Step 103-3:Calculate the of the prediction model of output on test set under corresponding Receiver operating curve Two areas when second area meets the predetermined threshold value, perform step 104;Otherwise return to step 103-1;
In the present invention, prediction model is trained using random forests algorithm, it is made to have more interpretation, and is solved existing Sex chromosome mosaicism is lagged in technology;Prediction model is tested by test set simultaneously, has further ensured the standard of prediction model True property.
Step 104:Use following scrap-car of the prediction model prediction after test.
Specifically, using current time as deadline, the 3rd preset time period Nei Geche is counted forward according to preceding method Operation number of days variation tendency, maintenance station stopped number and averagely stops duration, and predicted using the prediction model after test Following scrap-car and scrap probability.
Embodiment two
According to the embodiment of the present invention, a kind of forecasting system of scrap-car is provided, as shown in Fig. 2, including:
Extraction module 201, for extracting the feature of each vehicle in the track data of each vehicle in the first preset time period Information;
Generation module 202, for the characteristic information of each vehicle extracted according to extraction module 201 and default scrap-car mark Quasi- generation training sample;
Division module 203, the training sample for generation module 202 to be generated are divided into training set and test set;
Training module 204, the training set for being divided according to division module 203 train prediction model;
Test module 205, for using division module 203 divide test set to the prediction model that training module is trained into Row test;
Prediction module 206, for following scrap-car of the prediction model prediction after test module 205 is used to test.
According to the embodiment of the present invention, which further includes:Preprocessing module;
Preprocessing module pre-processes for the track data to each vehicle in the first preset time period;
Accordingly, extraction module 201 are specifically used for:Each vehicle is extracted in the pretreated track data of preprocessing module Characteristic information.
According to the embodiment of the present invention, track data includes:A series of position data and travel speed;
Accordingly, extraction module 201 are specifically used for:
First preset time period is divided into M the second preset time periods, and according to time order and function order by preceding n Second preset time period is as the 3rd preset time period, wherein n<M;
According to the position data and travel speed contained in the track data of each vehicle, it is default each second to count each vehicle Operation number of days, maintenance station in period stop number and stop duration;
When stopping number according to operation number of days of each vehicle in each second preset time period, maintenance station and stop It is long, determine that operation number of days variation tendency of each vehicle in the 3rd preset time period, maintenance station are stopped number and averagely stopped By duration.
According to operation number of days of each vehicle in each second preset time period and default stoppage in transit standard, each vehicle is determined Stoppage in transit situation in each second preset time period.
In the present embodiment, the first preset time period, the second preset time period, the 3rd preset time end can be according to need Sets itself is sought, for example, in the present embodiment, being preset by January, 2016 in June, 2017, second of the first preset time period Period when a length of one month, the 3rd preset time period to illustrate exemplified by first 12 months, then by the first preset time period It is divided into 12 months, the 3rd preset time period is in January, 2016 in December, 2016.
According to the embodiment of the present invention, generation module 202 specifically include:Determination sub-module, sample generation submodule and Submodule is chosen, wherein:
Determination sub-module, for the stoppage in transit according to each vehicle that extraction module 201 extracts in each second preset time period Situation and default scrap-car standard determine scrap-car and operation vehicle in each vehicle;
Wherein, presetting scrap-car standard is preferably:Contain in 3rd preset time period the last one second it is default when Between section do not stop transport, and stop transport after the 3rd preset time period in the second preset time period of adjacent, continuous, default quantity; Wherein, default quantity is preferably 6;
For example, in the present embodiment, in December, 2016 is not stopped transport, and continuous 6 months after in December, 2016 The vehicle that (i.e. in January, 2017 to June) stops transport is as scrap-car.
Sample generates submodule, for each vehicle in the scrap-car that determines determination sub-module in the 3rd preset time period In operation number of days variation tendency, maintenance station stop number and averagely stop duration establish correspondence with corresponding vehicle and make For positive sample, positive sample set is obtained;Each vehicle is in the 3rd preset time period in the operation vehicle that determination sub-module is determined Operation number of days variation tendency, maintenance station stop number and averagely stop duration establish correspondence and conduct with corresponding vehicle Negative sample obtains negative sample set;
Submodule is chosen, for being chosen according to the first preset ratio in the obtained positive sample set of sample generation submodule Positive sample, and negative sample is chosen in the negative sample set obtained in sample generation submodule, obtain the training sample of default quantity;
Accordingly, division module 203 are specifically used for:The present count for being obtained selection submodule according to the second preset ratio The training sample of amount is divided into training set and test set.
Wherein, the first preset ratio is preferably 1:1;Second preset ratio is preferably 7:3.
According to the embodiment of the present invention, training module 204 specifically include:Training submodule and assessment submodule, In:
Training submodule, the training set for being divided according to random forests algorithm to division module 203 are trained to obtain Prediction model;
In the present embodiment, training submodule, is specifically used for:Using Python instruments, scikit-learn machines are called The training set that random forests algorithm in learning database divides division module 203 is trained to obtain prediction model.
Submodule is assessed, the prediction model for being obtained according to Receiver operating curve to training submodule is commented Estimate and adjust model parameter, when the first area under Receiver operating curve meets predetermined threshold value, output is corresponding pre- Survey model;Wherein, predetermined threshold value is preferably 0.8;
Test module 205, is specifically used for:Calculate what the prediction model that assessment submodule exports was divided in division module 203 Second area on test set under corresponding Receiver operating curve, when second area meets predetermined threshold value, triggering is pre- Survey module 206;
Accordingly, prediction module 206 are specifically used for:Using current time as deadline, the 3rd preset time is counted forward The operation number of days variation tendency of each vehicle, maintenance station are stopped number and averagely stop duration, and surveyed using test module 205 in section Prediction model after examination predicts following scrap-car and scraps probability.
In the present invention, based on the historical trajectory data in each vehicle regular period, the generation of prediction model is carried out, The length and width of data has and significantly increases so that the prediction model and prediction result finally obtained it is all more scientific and Reliability;It is combined simultaneously with machine learning, operation rule of the vehicle before scrapping is found out according to random forests algorithm, so as to To prediction scrap-car and the prediction model of probability is scrapped, solves hysteresis sex chromosome mosaicism of the prior art;For each vehicle For manufacturer, marketing strategy can be more accurately determined.
The foregoing is only a preferred embodiment of the present invention, but protection scope of the present invention be not limited thereto, Any one skilled in the art in the technical scope disclosed by the present invention, the change or replacement that can be readily occurred in, It should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with the protection model of the claim Subject to enclosing.

Claims (10)

1. a kind of Forecasting Methodology of scrap-car, which is characterized in that including:
Step S1:The characteristic information of each vehicle is extracted in the track data of each vehicle in the first preset time period;
Step S2:According to the characteristic information of each vehicle and default scrap-car standard generation training sample, and by the instruction Practice sample and be divided into training set and test set;
Step S3:Prediction model is trained according to the training set, and the prediction model is surveyed using the test set Examination;
Step S4:Use following scrap-car of the prediction model prediction after test.
2. according to the method described in claim 1, it is characterized in that, before the step S1, further include:To the first preset time The track data of each vehicle is pre-processed in section;
The step S1, specially:The characteristic information of each vehicle is extracted in track data after the pre-treatment.
3. according to the method described in claim 1, it is characterized in that, the track data includes:A series of position data and Travel speed;
The step S1, specifically includes:
First preset time period is divided into M the second preset time periods, and according to time order and function order by preceding n second Preset time period is as the 3rd preset time period, wherein n<M;
According to the position data and travel speed contained in the track data of each vehicle, each vehicle is counted in each second preset time Operation number of days, maintenance station in section stop number and stop duration;
Number is stopped according to operation number of days of each vehicle in each second preset time period, maintenance station and stops duration, really When operation number of days variation tendency of the fixed each vehicle in the 3rd preset time period, maintenance station are stopped number and are averagely stopped It is long.
According to operation number of days of each vehicle in each second preset time period and default stoppage in transit standard, determine each vehicle each Stoppage in transit situation in second preset time period.
4. according to the method described in claim 3, it is characterized in that, the step S2, specifically includes:
Step S2-1:According to stoppage in transit situation of each vehicle in each second preset time period and default scrap-car standard, Determine the scrap-car and operation vehicle in each vehicle;
Step S2-2:By operation number of days variation tendency of each vehicle in scrap-car in the 3rd preset time period, repair It stands to stop number and averagely stop duration and establishes correspondence and as positive sample with corresponding vehicle, obtain positive sample set; Operation number of days variation tendency of each vehicle in the 3rd preset time period in vehicle will be run, number is stopped in maintenance station and flat It stops duration and establishes correspondence and as negative sample with corresponding vehicle, obtain negative sample set;
Step S2-3:Positive sample is chosen in the positive sample set according to the first preset ratio, and in the negative sample set Middle selection negative sample obtains the training sample of default quantity;
Step S2-4:The training sample of the default quantity is divided by training set and test set according to the second preset ratio.
5. according to the method described in claim 4, it is characterized in that, the step S3, specifically includes:
Step S3-1:The training set is trained to obtain prediction model according to random forests algorithm;
Step S3-2:The prediction model is assessed according to Receiver operating curve and adjusts model parameter, when by When the first area under examination person's performance curve meets predetermined threshold value, corresponding prediction model is exported;
Step S3-3:Calculate second of the prediction model exported on the test set under corresponding Receiver operating curve Area when the second area meets the predetermined threshold value, performs step S4;Otherwise return to step S3-1;
The step S4, specially:Using current time as deadline, each vehicle in the 3rd preset time period is counted forward Operation number of days variation tendency, maintenance station stop number and averagely stop duration, and using the prediction model prediction after test not Come scrap-car and scrap probability.
6. a kind of forecasting system of scrap-car, which is characterized in that including:
Extraction module, for extracting the characteristic information of each vehicle in the track data of each vehicle in the first preset time period;
Generation module, for the characteristic information of each vehicle extracted according to the extraction module and the generation of default scrap-car standard Training sample;
Division module, the training sample for the generation module to be generated are divided into training set and test set;
Training module, the training set for being divided according to the division module train prediction model;
Test module, for being carried out using the test set of division module division to the prediction model that the training module is trained Test;
Prediction module, for following scrap-car of the prediction model prediction after the test module is used to test.
7. system according to claim 1, which is characterized in that further include:Preprocessing module;
The preprocessing module pre-processes for the track data to each vehicle in the first preset time period;
The extraction module, is specifically used for:The spy of each vehicle is extracted in the pretreated track data of the preprocessing module Reference ceases.
8. system according to claim 7, which is characterized in that the track data includes:A series of position data and Travel speed;
The extraction module, is specifically used for:
First preset time period is divided into M the second preset time periods, and according to time order and function order by preceding n second Preset time period is as the 3rd preset time period, wherein n<M;
According to the position data and travel speed contained in the track data of each vehicle, each vehicle is counted in each second preset time Operation number of days, maintenance station in section stop number and stop duration;
Number is stopped according to operation number of days of each vehicle in each second preset time period, maintenance station and stops duration, really When operation number of days variation tendency of the fixed each vehicle in the 3rd preset time period, maintenance station are stopped number and are averagely stopped It is long.
According to operation number of days of each vehicle in each second preset time period and default stoppage in transit standard, determine each vehicle each Stoppage in transit situation in second preset time period.
9. system according to claim 8, which is characterized in that the generation module specifically includes:Determination sub-module, sample This generation submodule and selection submodule;
The determination sub-module, for the stoppage in transit according to each vehicle that the extraction module extracts in each second preset time period Situation and default scrap-car standard determine scrap-car and operation vehicle in each vehicle;
The sample generates submodule, pre- the described 3rd for each vehicle in the scrap-car that determines the determination sub-module If the operation number of days variation tendency, maintenance station in the period stop number and averagely stop duration is corresponding with the foundation of corresponding vehicle Relation is simultaneously used as positive sample, obtains positive sample set;Each vehicle is described in the operation vehicle that the determination sub-module is determined Operation number of days variation tendency, maintenance station in 3rd preset time period stop number and averagely stop duration and built with corresponding vehicle Vertical correspondence is simultaneously used as negative sample, obtains negative sample set;
The selection submodule, for according to the first preset ratio in the obtained positive sample set of sample generation submodule Positive sample is chosen, and negative sample is chosen in the negative sample set obtained in sample generation submodule, obtains default quantity Training sample;
The division module, is specifically used for:According to the instruction for the default quantity that the second preset ratio obtains the selection submodule Practice sample and be divided into training set and test set.
10. system according to claim 9, which is characterized in that the training module specifically includes:Training submodule and Assess submodule;
The trained submodule, the training set for being divided according to random forests algorithm to the division module are trained to obtain Prediction model;
The assessment submodule, for according to Receiver operating curve to the prediction model that the trained submodule obtains into Row is assessed and adjusts model parameter, and when the first area under Receiver operating curve meets predetermined threshold value, output corresponds to Prediction model;
The test module, is specifically used for:The prediction model for calculating the assessment submodule output is divided in the division module Test set on second area under corresponding Receiver operating curve, when the second area meets the predetermined threshold value When, trigger the prediction module;
The prediction module, is specifically used for:Using current time as deadline, count each in the 3rd preset time period forward The operation number of days variation tendency of vehicle, maintenance station stop number and averagely stop duration, and after use test module test The following scrap-car of prediction model prediction and scrap probability.
CN201711445897.9A 2017-12-27 2017-12-27 Scrapped vehicle prediction method and system Active CN108053075B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711445897.9A CN108053075B (en) 2017-12-27 2017-12-27 Scrapped vehicle prediction method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711445897.9A CN108053075B (en) 2017-12-27 2017-12-27 Scrapped vehicle prediction method and system

Publications (2)

Publication Number Publication Date
CN108053075A true CN108053075A (en) 2018-05-18
CN108053075B CN108053075B (en) 2021-03-26

Family

ID=62128185

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711445897.9A Active CN108053075B (en) 2017-12-27 2017-12-27 Scrapped vehicle prediction method and system

Country Status (1)

Country Link
CN (1) CN108053075B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108810526A (en) * 2018-06-12 2018-11-13 广东惠禾科技发展有限公司 Camera intelligent fault prediction technique, device and camera management server
CN109102149A (en) * 2018-06-06 2018-12-28 北京市燃气集团有限责任公司 A kind of prediction technique of city gas buried pipeline third party breakage in installation risk
CN109345136A (en) * 2018-10-22 2019-02-15 中山大学 Vehicle survival curve model optimization method based on China's motor vehicle resignation system
CN109635965A (en) * 2018-12-24 2019-04-16 成都四方伟业软件股份有限公司 Bus scraps decision-making technique, device and readable storage medium storing program for executing
CN111144485A (en) * 2019-12-26 2020-05-12 北京中交兴路车联网科技有限公司 Vehicle accident judgment method and system based on xgboost classification algorithm
CN111860954A (en) * 2020-06-18 2020-10-30 上海钧正网络科技有限公司 Vehicle loss of contact prediction method and device, computer equipment and storage medium
CN111861073A (en) * 2019-04-30 2020-10-30 北京嘀嘀无限科技发展有限公司 Operation strategy prediction method and device, electronic equipment and readable storage medium
CN112529104A (en) * 2020-12-23 2021-03-19 东软睿驰汽车技术(沈阳)有限公司 Vehicle fault prediction model generation method, fault prediction method and device
CN113570469A (en) * 2021-07-27 2021-10-29 上海交通大学 Intelligent vehicle change prediction method for vehicle insurance users
CN114912675A (en) * 2022-05-06 2022-08-16 杉数科技(北京)有限公司 Prediction method and device for scrappage of aero-engine blade

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389974A (en) * 2015-11-19 2016-03-09 深圳市赛格导航科技股份有限公司 Vehicle tracking method and system based on vehicle historical driving data
CN105868865A (en) * 2016-04-13 2016-08-17 华北电力大学 Electric vehicle parc prediction method based on multivariate linear regression method and proportional substitution method
CN106127312A (en) * 2016-06-21 2016-11-16 中车青岛四方机车车辆股份有限公司 The life cycle management history information management method of EMUs and system
CN107025345A (en) * 2017-03-31 2017-08-08 柳州顺联科技有限公司 A kind of Forecasting Methodology of engineering machinery vehicle failure time

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389974A (en) * 2015-11-19 2016-03-09 深圳市赛格导航科技股份有限公司 Vehicle tracking method and system based on vehicle historical driving data
CN105868865A (en) * 2016-04-13 2016-08-17 华北电力大学 Electric vehicle parc prediction method based on multivariate linear regression method and proportional substitution method
CN106127312A (en) * 2016-06-21 2016-11-16 中车青岛四方机车车辆股份有限公司 The life cycle management history information management method of EMUs and system
CN107025345A (en) * 2017-03-31 2017-08-08 柳州顺联科技有限公司 A kind of Forecasting Methodology of engineering machinery vehicle failure time

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵鹏等: "基于灰色***理论的汽车报废量预测", 《工业技术经济》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109102149A (en) * 2018-06-06 2018-12-28 北京市燃气集团有限责任公司 A kind of prediction technique of city gas buried pipeline third party breakage in installation risk
CN108810526A (en) * 2018-06-12 2018-11-13 广东惠禾科技发展有限公司 Camera intelligent fault prediction technique, device and camera management server
CN109345136A (en) * 2018-10-22 2019-02-15 中山大学 Vehicle survival curve model optimization method based on China's motor vehicle resignation system
WO2020082683A1 (en) * 2018-10-22 2020-04-30 中山大学 Chinese motor vehicle registration system-based method for optimizing vehicle survival curve model
CN109635965A (en) * 2018-12-24 2019-04-16 成都四方伟业软件股份有限公司 Bus scraps decision-making technique, device and readable storage medium storing program for executing
CN111861073B (en) * 2019-04-30 2024-05-03 北京嘀嘀无限科技发展有限公司 Operation policy prediction method and device, electronic equipment and readable storage medium
CN111861073A (en) * 2019-04-30 2020-10-30 北京嘀嘀无限科技发展有限公司 Operation strategy prediction method and device, electronic equipment and readable storage medium
CN111144485A (en) * 2019-12-26 2020-05-12 北京中交兴路车联网科技有限公司 Vehicle accident judgment method and system based on xgboost classification algorithm
CN111860954A (en) * 2020-06-18 2020-10-30 上海钧正网络科技有限公司 Vehicle loss of contact prediction method and device, computer equipment and storage medium
CN111860954B (en) * 2020-06-18 2024-07-05 上海钧正网络科技有限公司 Vehicle loss-of-link prediction method, device, computer equipment and storage medium
CN112529104A (en) * 2020-12-23 2021-03-19 东软睿驰汽车技术(沈阳)有限公司 Vehicle fault prediction model generation method, fault prediction method and device
CN113570469A (en) * 2021-07-27 2021-10-29 上海交通大学 Intelligent vehicle change prediction method for vehicle insurance users
CN113570469B (en) * 2021-07-27 2024-05-28 上海交通大学 Intelligent vehicle change prediction method for vehicle insurance user
CN114912675A (en) * 2022-05-06 2022-08-16 杉数科技(北京)有限公司 Prediction method and device for scrappage of aero-engine blade

Also Published As

Publication number Publication date
CN108053075B (en) 2021-03-26

Similar Documents

Publication Publication Date Title
CN108053075A (en) A kind of scrap-car Forecasting Methodology and system
CN104899405B (en) Data predication method and system and alarm method and system
CN104134349B (en) A kind of public transport road conditions disposal system based on traffic multisource data fusion and method
CN101504717B (en) Characteristic area positioning method, car body color depth and color recognition method
CN104933860B (en) Bus traffic congestion delay time at stop Forecasting Methodology based on gps data
CN102881162B (en) Data processing and fusion method for large-scale traffic information
CN108345666A (en) A kind of vehicle abnormality track-detecting method based on time-space isolated point
CN102646332A (en) Traffic state estimation device and method based on data fusion
US20220097557A1 (en) Information provision service system for electric vehicle using vehicle sensors
CN109243173A (en) Track of vehicle analysis method and system based on road high definition bayonet data
CN106845768A (en) Bus hourage model building method based on survival analysis parameter distribution
CN107563566A (en) A kind of run time interval prediction method between bus station based on SVMs
CN109887292B (en) Vehicle type identification method and system
CN103606272B (en) A kind of bus rapid transit arrival time Forecasting Methodology based on the volume of the flow of passengers
CN106649709A (en) Vehicle stopping behavior mode prediction and assessment method based on data mining
CN108710637A (en) Taxi exception track real-time detection method based on time-space relationship
CN115691120A (en) Congestion identification method and system based on highway running water data
CN106157609A (en) The monitoring method of a kind of block status and device
CN101794510A (en) Data processing method and device of floating cars
CN109493449A (en) A kind of lorry loading method for estimating state based on lorry GPS track data and high speed transaction data
CN108197977A (en) A kind of recommendation method and device of vehicle brand
CN106127407A (en) Aircraft stroke scoring method based on multi-sensor information fusion and scoring system
CN107993441A (en) A kind of lorry often runs away the Forecasting Methodology and device of line
Li et al. Diagnosing and interpolating loop detector data errors with probe vehicle data
CN109727474A (en) A kind of bus based on fused data precise recognition method out of the station

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant