CN110348969A - Taxi based on deep learning and big data analysis seeks objective policy recommendation method - Google Patents

Taxi based on deep learning and big data analysis seeks objective policy recommendation method Download PDF

Info

Publication number
CN110348969A
CN110348969A CN201910641328.4A CN201910641328A CN110348969A CN 110348969 A CN110348969 A CN 110348969A CN 201910641328 A CN201910641328 A CN 201910641328A CN 110348969 A CN110348969 A CN 110348969A
Authority
CN
China
Prior art keywords
taxi
state
data
deep learning
big data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910641328.4A
Other languages
Chinese (zh)
Other versions
CN110348969B (en
Inventor
王桐
孙博
张乐君
李升波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Engineering University
Original Assignee
Harbin Engineering University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Engineering University filed Critical Harbin Engineering University
Priority to CN201910641328.4A priority Critical patent/CN110348969B/en
Publication of CN110348969A publication Critical patent/CN110348969A/en
Application granted granted Critical
Publication of CN110348969B publication Critical patent/CN110348969B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0645Rental transactions; Leasing transactions

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Accounting & Taxation (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Finance (AREA)
  • Software Systems (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Game Theory and Decision Science (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Traffic Control Systems (AREA)
  • Devices For Checking Fares Or Tickets At Control Points (AREA)

Abstract

The invention discloses the taxis based on deep learning and big data analysis to seek objective policy recommendation method, and the recommended method is the following steps are included: step 1: cleaning to taxi historical trajectory data;Step 2: taxi pickup point is extracted;Step 3: taxi hot spot is extracted;Step 4: passenger capacity prediction is carried out to hot spot;Step 5: taxi recommended models are proposed.Taxi caused by the present invention is mismatched for taxi in city and Customer information seeks objective difficult problem, it proposes one kind and the following passengers quantity is predicted using deep learning based on traffic big data, and with can the markov decision process of time-varying the technology of objective strategy of seeking is provided for taxi driver by Policy iteration, solve the problems, such as that existing matching mechanisms matching is difficult, increase the working efficiency of taxi, so that taxi income is more scientific, and passenger's chauffeur is also more easier.

Description

Taxi based on deep learning and big data analysis seeks objective policy recommendation method
Technical field
The invention belongs to taxis to seek objective distribution field, propose the taxi based on deep learning and big data analysis and seek Objective policy recommendation method.
Background technique
With the continuous improvement of national economic development level, motor vehicle occupies increasing ratio in urban transportation, And corresponding road area per capita is constantly in low-level state, exerts heavy pressures on to urban transportation.In addition, China is existing Having city road network is typically all that density is low, arterial highway spacing is excessive, branch is short, function is chaotic, belongs to the traffic system of low speed, difficult To adapt to the needs of Hyundai Motor traffic, the modern facilities of traffic control and management and traffic safety management are not able to satisfy reality Demand.The big or middle city in the whole nation is crowded in the prevalence of road, the phenomenon of congested with cars, traffic order confusion.Urban population swashs Increasing requires taxi trade to have to fast development, this has caused a series of problems.Taxi trade has obtained fastish Development, this has caused the problems such as some taxis match " hardly possible " with passenger, and many taxi drivers devote a tremendous amount of time searching Passenger, and passenger waits for quite a long time in certain places.Occur much for the uneven phenomenon of this matching Chauffeur software, this can also partially alleviate this problem.However, taxi driver drives after passenger getting off car there is no specific Direction, experienced taxi driver can tell the popular place of different periods;The insufficient driver of experience then can only be blindly It drives, finds passenger at random.For these experienced taxi drivers it is unconspicuous seek objective rule and be all hidden in be collected into Taxi historical data in.The present invention carries out mining analysis for these taxi historical datas, and it is unobvious to search out these Seek objective trend, to provide more reasonable carrying route for taxi driver.
Summary of the invention
The purpose of the present invention is to propose to the taxis based on deep learning and big data analysis to seek objective policy recommendation method, from The best angle for finding passenger's strategy is recommended to start with for the taxi of empty driving, the real time information and historical track number in conjunction with taxi According to the Generalization bounds for proposing carrying hot spot region, is provided for taxi driver and most preferably seek objective scheme, to solve existing matching machine The problem of system matching hardly possible, low efficiency.
The invention is realized by the following technical scheme: the taxi based on deep learning and big data analysis is sought objective strategy and is pushed away Recommend method, which is characterized in that the recommended method the following steps are included:
Step 1: taxi historical trajectory data is cleaned;
Step 2: taxi pickup point is extracted from the taxi historical trajectory data through over cleaning;
Step 3: taxi hot spot is extracted from the taxi pickup point extracted;
Step 4: passenger capacity prediction is carried out to taxi hot spot;
Step 5: taxi recommended models are proposed.
Further, in step 1, specifically, including: removal invalid data, removal nothing to historical trajectory data cleaning With field, go unless this city data, removal repeated data.
Further, include: in step 2
Step 2 one: taxi historical trajectory data is uploaded into Hadoop cluster;
Step 2 two: data are mapped to using the map function of Spark to go out by the historical trajectory data that load was cleaned Number of hiring a car is major key, other fields are the RDD object of value;
Step 2 three: the element with identical major key is pooled to one using the groupByKey method of Spark platform In set;
Step 2 four: it is ranked up using value of the sort method to element each in RDD according to time field;
Step 2 five: filter out taxi passenger carrying status by 0 change to 10000000 data.
Further, in step 3, hot spot is found using the DBSCAN clustering algorithm based on density, DBSCAN includes two A important parameter: minimum includes points minPts and sweep radius eps, and is respectively set minimum comprising points and scanning half Diameter.
Further, in step 4, specifically, being followed to the return value of the input addition itself of hidden layer neuron The output of ring neural network RNN, RNN hidden layer s and output layer o are as follows:
st=f (Uxt+Wst-1), (1)
ot=g (Vst), (2)
Wherein, t is the moment, and x is input layer, and s is hidden layer, and o is output layer, and matrix W is that the value of hidden layer last time is made For the weight of this input, U and V are weight matrix,
RNN is trained using error backpropagation algorithm, and the error amount of i-th layer of t moment is propagated in both directions:
One direction is to be transmitted to a layer network, this part is only related with weight matrix U;
Another direction is along timeline by being propagated recursively to initial time, this part is related with weight matrix W.
Further, Dropout layers are added on the basis of 3 layers of RNN.
Further, in step 5, specifically, making recommendation mould with improved Markovian decision process IMDP Type,
The Markovian decision process IMDP is expressed as five-tuple (S, A, a Psa,γ,Rsa), wherein S is state set It closes;A={ is waited, be moved to next hot spot }, is a series of actions that driver can do in pickup point;PsaFor transition probability square Battle array, is obtained by data mining, is indicated in state SiThe probability that movement a ∈ A is transferred to NextState is carried out in the case of ∈ S;γ ∈ (0,1) is discount factor;It is in state siReturn after taking behavior a ∈ A under ∈ S, after current hotspot makes a choice Cost be Cij=Eij, EijIt is vehicle from state siTo sjIdle time, correspondingly, obtained return is denoted asWherein, XjFor state sjPrediction get on the bus number,
Taxi is from a state s in state set S0Start, selects a movement a in behavior aggregate A0Reach next shape State s1, a movement a is equally selected in this case1Arrival state s2, this process is described as
Under the influence of the above process, defining Reward Program is
It is abbreviated as R (s0)+γR(s1)+γ2R(s2)+…
Our target is the mean value E [R (s for making global Reward Program by selection strategy0)+γR(s1)+γ2R(s2) + ...] reach maximum, herein, tactful π is the function that state is mapped to movement, i.e. π: S → A, therefore at a state s, The movement a=π (s) that we select,
When using strategy π, desired value of the accumulation return at state s is defined as state-value function:
Vπ(s)=E [R (s0)+γR(s1)+γ2R(s2)+…|s0=s, π] (3)
Turn to Bellman equation:
R (s) expression is returned at once in formula (4), i.e., the return that selection movement a can be obtained immediately at state s, and equation is right Side Section 2 is sum term, indicates return following under this selection,
Define optimum state-value function are as follows:
The form of the graceful equation of Bell are as follows:
Equally define optimal policy function are as follows:
From formula (4)~(7):
Following Utilization strategies iteratively solve IMDP, specific steps are as follows:
Step 1: initialization stateful V (s) and π (s), wherein be initialized as randomized policy;
Step 2: assessing current strategies with current V (s), the V (s) of each state is calculated, until V (s) Convergence, has just trained this state value function V (s);
Step 3: being improved with current strategies valuation functions V (s) obtained in the previous step, in each state s, to every A possible movement a calculates the expected value for reaching NextState after taking this to act, chooses the phase for making to reach NextState The maximum movement of cost function is hoped to carry out more new strategy π (s), then circulation step two and step 3 again, until V (s) and π (s) All convergences, finally obtain taxi recommended models.
The beneficial effects of the present invention are: taxi caused by the present invention is mismatched for taxi in city and Customer information Vehicle seeks objective difficult problem, propose it is a kind of the following passengers quantity is predicted using deep learning based on traffic big data, and with can when The markov decision process of change provides the technology for seeking objective strategy by Policy iteration into taxi driver, solves existing With the difficult problem of mechanism matching, the working efficiency of taxi is increased, so that taxi income is more scientific, and passenger's chauffeur It is more easier.
Detailed description of the invention
Fig. 1 is the method stream that the taxi of the invention based on deep learning and big data analysis seeks objective policy recommendation method Cheng Tu;
Fig. 2 is carrying hot spot thermodynamic chart of the visualization in Baidu map;
Fig. 3 is Recognition with Recurrent Neural Network RNN schematic diagram.
Specific embodiment
Technical solution in the embodiment of the present invention that following will be combined with the drawings in the embodiments of the present invention carries out clear, complete Ground description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Based on this Embodiment in invention, every other reality obtained by those of ordinary skill in the art without making creative efforts Example is applied, shall fall within the protection scope of the present invention.
Shown in referring to Fig.1, the invention is realized by the following technical scheme: the taxi based on deep learning and big data analysis Vehicle seeks objective policy recommendation method, the recommended method the following steps are included:
Step 1: taxi historical trajectory data is cleaned;
Step 2: taxi pickup point is extracted from the taxi historical trajectory data through over cleaning;
Step 3: taxi hot spot is extracted from the taxi pickup point extracted;
Step 4: passenger capacity prediction is carried out to taxi hot spot;
Step 5: taxi recommended models are proposed.
Specifically, the present invention taxi historical track is stored using big data processing platform Hadoop and Spark, It excavates, training set data is trained using the RNN in Python deep learning library Keras by Pycharm platform, and pass through survey Examination collection data carry out the accuracy of verification algorithm.During prediction, root-mean-square error RMSE and MRE couples of average relative error are utilized Prediction result, which is done, to be assessed and compares with traditional algorithms results such as SVM and BPNN.
The recommendation stage hires out car data to Beijing first with the library Numpy and the library Pandas of Python and does further It excavates and arranges, obtain state transition probability matrix corresponding with behavior and return matrix, be updated in IMDP model and imitated Really obtain Generalization bounds, comparison seeks the objective time using seeking in objective time and former data of obtaining of Generalization bounds.
In the preferred embodiment of this part, in step 1, specifically, including: removal nothing to historical trajectory data cleaning Effect data, removal hashed field are gone unless this city data, removal repeated data.
The partial results of data cleansing are referring to shown in table 1:
Table 1
Specifically, data cleansing (Data cleaning) is big data information excavating and last training pattern or carries out it His any beginning further applied.Data before data cleansing processing, referred to as initial data, wherein generally can include many heavy Multiple information, inessential or even wrong information.Therefore it is necessary to which initial data is further screened and is handled.Number It is not only to screen out not needing and analyze data the information to cut little ice according to cleaning, further includes to the mistake in data Corrigendum and the recovery of default value of information etc..Since the data in data warehouse are the collection for the data of a certain specific content It closes, these data extract from multiple operation systems and cover a series of historical informations.However this will appear and has The information of data be the problems such as incorrect and some data clash between each other.These are incorrect or have conflict Data be significant adverse to the research work of next step, these data are called " dirty data ".In general, can be according to certain rule " dirty data " is washed off, here it is data cleansings.Data cleansing mainly solves the problems, such as it is the undesirable data of filtering, no The data for meeting specification generally include incomplete data, the data of mistake, duplicate data these three types.
And data cleansing of the invention mainly includes following four partial content:
(1) it removes invalid data: including partial invalidity data in taxi historical track, and these data are to of the invention Research contents does not have any value;
(2) remove hashed field: the positioning device of taxi includes bulk information, such as taxi mode bit, message id etc. Field.But the present invention does not use, therefore removes;
(3) go unless this city data: taxi may go to non-this city place during business, and GPS positioning device Still there is record to this partial data.For convenience of the subsequent analysis of this paper, need to remove this partial data;
(4) repeated data is removed.
In the preferred embodiment of this part, include: in step 2
Step 2 one: taxi historical trajectory data is uploaded into Hadoop cluster;
Step 2 two: data are mapped to using the map function of Spark to go out by the historical trajectory data that load was cleaned Number of hiring a car is major key, other fields are the RDD object of value;
Step 2 three: the element with identical major key is pooled to one using the groupByKey method of Spark platform In set;
Step 2 four: it is ranked up using value of the sort method to element each in RDD according to time field;
Step 2 five: filter out taxi passenger carrying status by 0 change to 10000000 data.
Specifically, the original historical data of taxi only includes the anchor point of taxi and the various states of anchor point, In order to further analyze the trip mode of passenger, the present invention is firstly the need of the passenger point for extracting taxi.Due to taxi history Data volume is huge, and needs repeatedly to be ranked up taxi historical data, screen etc. operation during extracting passenger point, So the present invention analyzes taxi historical data using the storage capacity of Hadoop and the distributed computing function of Spark, The partial data that passenger point is extracted is as shown in table 2:
Table 2
Shown in referring to Fig.1, in the preferred embodiment of this part, in step 3, clustered using the DBSCAN based on density Algorithm finds hot spot, and DBSCAN includes two important parameters: minimum includes points minPts and sweep radius eps, and respectively Setting is minimum to include points and sweep radius.
Specifically, the present invention can analyze the carrying hot spot region in city in taxi point data,
In order to find the big region of carrying density, and it is this using distance as the cluster of standard and discomfort to be similar to K-means For finding the big region of carrying density, therefore the present invention finds hot spot using the DBSCAN clustering algorithm based on density. DBSCAN includes two important parameters: minimum includes points (minPts) and sweep radius (eps).Minimum includes this hair of counting It is bright to be set as 3000 according to the frequency of carrying time.For sweep radius, the present invention is averagely sought the objective time 8 with taxi and divided Clock is foundation, and assumes that the average speed per hour of cab driving is 20km/h, therefore it is about 3 public that taxi, which averagely seeks visitor's distance, In, to guarantee that driver can search out the hot spot within 3 kilometers in any position as far as possible, between two nearest hot spots of distance away from From should be maintained near 6 kilometers, therefore show that sweep radius eps should be 6 kilometers.Visualize the carrying hot spot in Baidu map Thermodynamic chart is as shown in Figure 2.
Referring to shown in Fig. 3, in the preferred embodiment of this part, in step 4, specifically, due to the uncertainty of passenger And in 00:00~06:00 period, since passenger is sparse and contingency is larger, in order to accurately predict that hot spot future multiplies Guest's number, common machine learning algorithm (such as: KNN, SVM etc.) tend not to predict that future gets on the bus number very accurately.And because Value for traditional full Connection Neural Network BPNN hidden layer is solely dependent upon input, generally requires to know when to time series forecasting Road previous state or preceding several states, for this problem, present invention proposition adds itself to the input of hidden layer neuron Return value obtain Recognition with Recurrent Neural Network RNN, which is most good at processing time series, and structure is as shown in Figure 3.
Referring to shown in Fig. 3, in the preferred embodiment of this part, the Recognition with Recurrent Neural Network RNN is inversely propagated using error Algorithm training.
Specifically, being different from fully-connected network, the output of RNN hidden layer s and output layer o are as follows:
st=f (Uxt+Wst-1), (1)
ot=g (Vst), (2)
Wherein, t is the moment, and x is input layer, and s is hidden layer, and o is output layer, and matrix W is that the value of hidden layer last time is made For the weight of this input, U and V are weight matrix.
Due to the presence of iterative relation it can be seen from formula (1), theoretically RNN can look forward any number of inputs, this It is exactly the reason of RNN is suitble to time series forecasting.RNN is equally trained using error backpropagation algorithm, i-th layer of t moment Error amount is propagated in both directions:
One direction is to be transmitted to a layer network, this part is only related with weight matrix U;
Another direction is along timeline by being propagated recursively to initial time, this part is related with weight matrix W.
In the preferred embodiment of this part, Dropout layers are added on the basis of 3 layers of RNN.
Specifically, Dropout layers are added on the basis of 3 layers of RNN in the training process to prevent network over-fitting The input neuron connection of certain percentage (rate=0.2) is disconnected when each undated parameter at random.
Referring to shown in Fig. 3, in the preferred embodiment of this part, in step 5, specifically, coming for taxi driver It says, the place of the upper taxi of passenger is called pickup point.When taxi empty, driver needs to find next bit passenger as early as possible.For This, driver should select a place as potential pickup point.So when there is the pickup point of many candidates, how driver is selected Could no-load ratio be reduced to the greatest extent to increase taxi income by selecting? the improved Markovian decision process of the present invention IMDP studies this problem to making better recommendation.
Traditional Markovian decision process MDP can be expressed as five-tuple (S, A, a Psa,γ,Rsa), wherein S is State set;A is behavior set (waiting or go to next hot spot in current hotspot);PsaFor transition probability matrix, indicate State SiThe probability that movement a ∈ A is transferred to NextState is carried out in the case of ∈ S;γ ∈ (0,1) is discount factor;It is in shape State siReturn after taking behavior a ∈ A under ∈ S.The core of MDP is to find optimal policy, and so-called strategy refers to state to movement Mapping, be expressed as π (x).If a strategy maximizes total prospective earnings, it is exactly optimal.
Recommend this problem specific to taxi, since the seating capacity of each hot spot changes over time, so each hot spot Carrying probability and taxi to seek the return of visitor be also time-varying, therefore traditional MDP can not provide strategy for taxi and push away It recommends.Each carrying hot spot is extended to 96 states (every 15 minutes states), each state pair by the IMDP that the present invention uses The seating capacity for answering current slot, by this method, the MDP of non-time-varying can become time-varying MDP, thus preferably for out It hires a car recommendation.
In IMDP, S is state set, quantity N=100*96;Driver indicates in a series of actions that pickup point can be done It { is waited, be moved to next hot spot } for A=, the selection of next hot spot is depended on being currently located hot spot;PsaPass through data mining It obtains;Take γ=0.9;Cost after current hotspot makes a choice is Cij=Eij, EijIt is vehicle from state siTo sjZero load Time, correspondingly, obtained return is denoted asXjFor state sjPrediction get on the bus number.
Taxi is from a state s in state set S0Start, selects a movement a in behavior aggregate A0Reach next shape State s1, a movement a is equally selected in this case1Arrival state s2, this process is described as
Under the influence of the above process, defining Reward Program is
It is abbreviated as R (s0)+γR(s1)+γ2R(s2)+…
Our target is the mean value E [R (s for making global Reward Program by selection strategy0)+γR(s1)+γ2R(s2) + ...] reach maximum.Herein, tactful π is the function that state is mapped to movement, i.e. π: S → A, therefore at a state s, The movement a=π (s) that we select.
When using strategy π, desired value of the accumulation return at state s is defined as state-value function:
Vπ(s)=E [R (s0)+γR(s1)+γ2R(s2)+…|s0=s, π] (3)
Turn to Bellman equation
R (s) expression is returned at once in formula (4), i.e., the return that selection movement a can be obtained immediately at state s, and equation is right Side Section 2 is sum term, indicates return following under this selection.
Defining optimum state-value function is
The form of the graceful equation of Bell is
Equally defining optimal policy function is
From formula (4)~(7):
Following Utilization strategies iteratively solve IMDP, and the method that Policy iteration solves MDP optimal policy is initial from one The strategy of change sets out, and first carries out Policy evaluation, then improvement strategy, assesses improved strategy, further improvement strategy, passes through Continuous iteration updates, until strategy is restrained.Specific steps are as follows:
Step 1: initialization stateful V (s) and π (s) (being initialized as randomized policy);
Step 2: assessing current strategies with current V (s), the V (s) of each state is calculated, until V (s) Convergence, has just trained this state value function V (s).
Step 3: being improved with current strategies valuation functions V (s) obtained in the previous step, in each state s, to every A possible movement a calculates the expected value for reaching NextState after taking this to act, chooses the phase for making to reach NextState The maximum movement of cost function is hoped to carry out more new strategy π (s), then circulation step two and step 3 again, until V (s) and π (s) All convergences.
The present invention stores Beijing's taxi historical track using big data processing platform Hadoop and Spark, It excavates, training set data is trained using the RNN in Python deep learning library Keras by Pycharm platform, and pass through survey Examination collection data carry out the accuracy of verification algorithm.During prediction, root-mean-square error (RMSE) and average relative error are utilized (MRE) prediction result is done and assesses and is compared with traditional algorithms results such as SVM and BPNN.
The recommendation stage hires out car data to Beijing first with the library Numpy and the library Pandas of Python and does further It excavates and arranges, obtain state transition probability matrix corresponding with behavior and return matrix, be updated in IMDP model and imitated Really obtain Generalization bounds, comparison seeks the objective time using seeking in objective time and former data of obtaining of Generalization bounds.

Claims (7)

1. the taxi based on deep learning and big data analysis seeks objective policy recommendation method, which is characterized in that the recommendation side Method the following steps are included:
Step 1: taxi historical trajectory data is cleaned;
Step 2: taxi pickup point is extracted from the taxi historical trajectory data through over cleaning;
Step 3: taxi hot spot is extracted from the taxi pickup point extracted;
Step 4: passenger capacity prediction is carried out to taxi hot spot;
Step 5: taxi recommended models are proposed.
2. the taxi according to claim 1 based on deep learning and big data analysis seeks objective policy recommendation method, Be characterized in that, in step 1, specifically, to historical trajectory data cleaning include: removal invalid data, removal hashed field, It goes unless this city data, removal repeated data.
3. the taxi according to claim 1 based on deep learning and big data analysis seeks objective policy recommendation method, It is characterized in that, includes: in step 2
Step 2 one: taxi historical trajectory data is uploaded into Hadoop cluster;
Step 2 two: data are mapped to using the map function of Spark with taxi by the historical trajectory data that load was cleaned Number is major key, other fields are the RDD object of value;
Step 2 three: the element with identical major key is pooled to a set using the groupByKey method of Spark platform In;
Step 2 four: it is ranked up using value of the sort method to element each in RDD according to time field;
Step 2 five: filter out taxi passenger carrying status by 0 change to 10000000 data.
4. the taxi according to claim 1 based on deep learning and big data analysis seeks objective policy recommendation method, It is characterized in that, in step 3, hot spot is found using the DBSCAN clustering algorithm based on density, DBSCAN includes two important Parameter: minimum includes points minPts and sweep radius eps, is respectively set minimum comprising points and sweep radius.
5. the taxi according to claim 1 based on deep learning and big data analysis seeks objective policy recommendation method, It is characterized in that, in step 4, passenger capacity prediction is carried out to taxi hot spot and has used Recognition with Recurrent Neural Network RNN, specifically , Recognition with Recurrent Neural Network RNN, RNN hidden layer s and output layer are obtained to the return value of the input addition itself of hidden layer neuron The output of o are as follows:
st=f (Uxt+Wst-1), (1)
ot=g (Vst), (2)
Wherein, t is the moment, and x is input layer, and s is hidden layer, and o is output layer, and matrix W is the value of hidden layer last time as this The weight of secondary input, U and V are weight matrix,
RNN is trained using error backpropagation algorithm, and the error amount of i-th layer of t moment is propagated in both directions:
One direction is to be transmitted to a layer network, this part is only related with weight matrix U;
Another direction is along timeline by being propagated recursively to initial time, this part is related with weight matrix W.
6. the taxi according to claim 5 based on deep learning and big data analysis seeks objective policy recommendation method, It is characterized in that, Dropout layers is added on the basis of 3 layers of RNN.
7. the taxi according to claim 1 based on deep learning and big data analysis seeks objective policy recommendation method, It is characterized in that, in step 5, specifically, taxi recommended models are made with Markovian decision process IMDP, specifically, institute It states Markovian decision process IMDP and is expressed as five-tuple (S, A, a Psa,γ,Rsa), wherein S is state set;A={ etc. Wait be moved to next hot spot }, it is a series of actions that driver can do in pickup point;PsaFor transition probability matrix, pass through data Excavation obtains, and indicates in state SiThe probability that movement a ∈ A is transferred to NextState is carried out in the case of ∈ S;γ ∈ (0,1) is folding Detain the factor;It is in state siReturn after taking behavior a ∈ A under ∈ S, the cost after current hotspot makes a choice are Cij =Eij, EijIt is vehicle from state siTo sjIdle time, obtained return is denoted asWherein, XjFor state sjPrediction get on the bus number,
Taxi is from a state s in state set S0Start, selects a movement a in behavior aggregate A0Reach NextState s1, Equally one movement a of selection in this case1Arrival state s2, this process is described as:
Under the influence of the above process, Reward Program is defined are as follows:
It writes a Chinese character in simplified form are as follows:
R(s0)+γR(s1)+γ2R(s2)+…
Using tactful π as a stateful corresponding strategy set, tactful π is the function that state is mapped to movement, i.e., π: S → A, at a state s, selection acts a=π (s),
When using strategy π, desired value of the accumulation return at state s is defined as state-value function:
Vπ(s)=E [R (s0)+γR(s1)+γ2R(s2)+…|s0=s, π] (3)
Turn to Bellman equation:
R (s) expression is returned at once in formula (4), i.e., the return that can obtain immediately of selection movement a state s at, equation right side the Binomial is sum term, indicates return following under this selection,
Define optimum state-value function are as follows:
The form of the graceful equation of Bell are as follows:
Equally define optimal policy function are as follows:
From formula (4)~(7):
Following Utilization strategies iteratively solve IMDP, specific steps are as follows:
Step 1: initialization stateful V (s) and π (s), wherein be initialized as randomized policy;
Step 2: assessing current strategies with current V (s), calculates the V (s) of each state, until V (s) is received It holds back, has just trained this state value function V (s);
Step 3: being improved with current strategies valuation functions V (s) obtained in the previous step, in each state s, to it is each can The movement a of energy calculates the expected value for reaching NextState after taking this to act, chooses the expectation valence for making to reach NextState The maximum movement of value function carrys out more new strategy π (s), then circulation step two and step 3 again, until V (s) and π (s) all Convergence, finally obtains taxi recommended models.
CN201910641328.4A 2019-07-16 2019-07-16 Taxi passenger searching strategy recommendation method based on deep learning and big data analysis Active CN110348969B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910641328.4A CN110348969B (en) 2019-07-16 2019-07-16 Taxi passenger searching strategy recommendation method based on deep learning and big data analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910641328.4A CN110348969B (en) 2019-07-16 2019-07-16 Taxi passenger searching strategy recommendation method based on deep learning and big data analysis

Publications (2)

Publication Number Publication Date
CN110348969A true CN110348969A (en) 2019-10-18
CN110348969B CN110348969B (en) 2022-04-08

Family

ID=68176487

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910641328.4A Active CN110348969B (en) 2019-07-16 2019-07-16 Taxi passenger searching strategy recommendation method based on deep learning and big data analysis

Country Status (1)

Country Link
CN (1) CN110348969B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111445054A (en) * 2019-11-05 2020-07-24 天津大学 Evacuation path recommendation method based on machine learning
CN111862579A (en) * 2020-06-10 2020-10-30 深圳大学 Taxi scheduling method and system based on deep reinforcement learning
CN113379159A (en) * 2021-07-05 2021-09-10 南京工程学院 Taxi driver passenger-searching route recommendation method based on gray model and Markov decision process
CN113837535A (en) * 2021-08-18 2021-12-24 中国核电工程有限公司 Method for backtracking serious accident process of nuclear power plant
CN114692955A (en) * 2022-03-10 2022-07-01 东南大学 Taxi path planning method based on Markov decision and queuing theory

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100106603A1 (en) * 2008-10-20 2010-04-29 Carnegie Mellon University System, method and device for predicting navigational decision-making behavior
CN107590175A (en) * 2017-07-31 2018-01-16 同济大学 A kind of taxi pickup point based on deep learning recommends method
CN107832882A (en) * 2017-11-03 2018-03-23 上海交通大学 A kind of taxi based on markov decision process seeks objective policy recommendation method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100106603A1 (en) * 2008-10-20 2010-04-29 Carnegie Mellon University System, method and device for predicting navigational decision-making behavior
CN107590175A (en) * 2017-07-31 2018-01-16 同济大学 A kind of taxi pickup point based on deep learning recommends method
CN107832882A (en) * 2017-11-03 2018-03-23 上海交通大学 A kind of taxi based on markov decision process seeks objective policy recommendation method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
孟祥武等: "基于用户轨迹数据的移动推荐***研究", 《软件学报》 *
薛佳瑶等: "基于卷积循环神经网络的城市区域车流量预测模型", 《信息工程大学学报》 *
鲍冠文: "出租车载客热点推荐方法研究及***实现", 《中国优秀硕士学位论文全文数据库》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111445054A (en) * 2019-11-05 2020-07-24 天津大学 Evacuation path recommendation method based on machine learning
CN111862579A (en) * 2020-06-10 2020-10-30 深圳大学 Taxi scheduling method and system based on deep reinforcement learning
CN113379159A (en) * 2021-07-05 2021-09-10 南京工程学院 Taxi driver passenger-searching route recommendation method based on gray model and Markov decision process
CN113379159B (en) * 2021-07-05 2024-01-02 南京工程学院 Taxi driver passenger searching route recommendation method based on gray model and Markov decision process
CN113837535A (en) * 2021-08-18 2021-12-24 中国核电工程有限公司 Method for backtracking serious accident process of nuclear power plant
CN113837535B (en) * 2021-08-18 2024-05-17 中国核电工程有限公司 Method for backtracking severe accident process of nuclear power plant
CN114692955A (en) * 2022-03-10 2022-07-01 东南大学 Taxi path planning method based on Markov decision and queuing theory

Also Published As

Publication number Publication date
CN110348969B (en) 2022-04-08

Similar Documents

Publication Publication Date Title
CN110348969A (en) Taxi based on deep learning and big data analysis seeks objective policy recommendation method
Guo et al. Learning to route with sparse trajectory sets
US20220374712A1 (en) Decision making for motion control
Huang et al. Large scale real-time ridesharing with service guarantee on road networks
CN110836675B (en) Decision tree-based automatic driving search decision method
Ionita et al. Where to park? predicting free parking spots in unmonitored city areas
CN111582559B (en) Arrival time estimation method and device
CN110490393B (en) Taxi passenger-searching route planning method, system and medium combining experience and direction
CN110598917B (en) Destination prediction method, system and storage medium based on path track
Li et al. Towards smart transportation system: A case study on the rebalancing problem of bike sharing system based on reinforcement learning
CN113312760B (en) Network online motor vehicle right-turn track planning method and device based on traffic simulation
CN113706220A (en) User portrait determination, user demand prediction method, and data processing system
EP2288872A1 (en) Method and device for calculating a navigation route to associated destinations
CN116050245A (en) Highway automatic driving commercial vehicle track prediction and decision method and system based on complex network theory
CN114839984A (en) Regular bus path planning method, device, equipment and storage medium
Lin et al. Insights into Travel Pattern Analysis and Demand Prediction: A Data-Driven Approach in Bike-Sharing Systems
CN112052405B (en) Passenger searching area recommendation method based on driver experience
JP2021190094A (en) Delivery plan creation method, device, system, and computer readable storge medium
Castagna et al. Demand-responsive rebalancing zone generation for reinforcement learning-based on-demand mobility
CN111882092A (en) Taxi vehicle searching method suitable for shared trip
Liu et al. Learning to route via theory-guided residual network
Buijs et al. Using neural nets to predict transportation mode choice: Amsterdam network change analysis
CN115565376A (en) Vehicle travel time prediction method and system fusing graph2vec and double-layer LSTM
CN115186969A (en) Multi-guide particle swarm multi-target carpooling problem solving optimization method with variable neighborhood search
Liu et al. Data-driven pick-up location recommendation for ride-hailing services

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant