CN115204535A - Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment - Google Patents

Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment Download PDF

Info

Publication number
CN115204535A
CN115204535A CN202211126215.9A CN202211126215A CN115204535A CN 115204535 A CN115204535 A CN 115204535A CN 202211126215 A CN202211126215 A CN 202211126215A CN 115204535 A CN115204535 A CN 115204535A
Authority
CN
China
Prior art keywords
time
input
sequence
attention
stage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211126215.9A
Other languages
Chinese (zh)
Other versions
CN115204535B (en
Inventor
卢华
陈晶
陈挚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei Xintong Communication Co ltd
Original Assignee
Hubei Xintong Communication Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei Xintong Communication Co ltd filed Critical Hubei Xintong Communication Co ltd
Priority to CN202211126215.9A priority Critical patent/CN115204535B/en
Publication of CN115204535A publication Critical patent/CN115204535A/en
Application granted granted Critical
Publication of CN115204535B publication Critical patent/CN115204535B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • G06Q10/087Inventory or stock management, e.g. order filling, procurement or balancing against orders

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Development Economics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Educational Administration (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Game Theory and Decision Science (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Finance (AREA)
  • Fuzzy Systems (AREA)
  • Accounting & Taxation (AREA)
  • Probability & Statistics with Applications (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a purchasing business volume prediction method based on a dynamic multivariate time sequence and electronic equipment. The invention solves the problem of incomprehensive manual feature extraction in purchasing business volume prediction, and the time convolution module provided extracts the nonlinear features of the static time sequence. The invention solves the problems of gradient disappearance and gradient explosion caused by overlong time interval when the long-period time sequence prediction is carried out by purchasing traffic prediction, and the proposed multi-stage attention module adaptively selects important characteristics to model the dependency relationship between the states of the long time interval. The invention solves the problem that a nonlinear module cannot process the prediction of the dynamic time sequence in the purchasing traffic prediction, and the proposed autoregressive module processes the dynamic time sequence with unfixed period.

Description

Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment
Technical Field
The invention belongs to the field of artificial intelligence in computer science, relates to a purchasing traffic prediction method and electronic equipment, and particularly relates to a purchasing traffic prediction method and electronic equipment based on a dynamic multivariate time sequence in robot-oriented process automation.
Background
In recent years, purchasing management and supply chain management play an extremely important role in the development and competition process of enterprises, a large amount of purchasing business volume prediction work exists in the traditional purchasing process, the purchasing business volume prediction is that the purchasing business volume of the next period is predicted based on large batch of purchasing data in the purchasing process, and the efficient and reliable purchasing business volume prediction can help the enterprises to make better purchasing decisions in the next period, so that the cost is saved, and the purchasing efficiency is improved. However, the purchasing data required for purchasing traffic prediction is often of various types, mostly time series data and long in duration, and part of data has no fixed period and belongs to dynamic multivariate time series data. Time series refers to a series of data collected at regular intervals, and multivariate time series refers to a multivariate time series, i.e., a plurality of variables recorded over time. The dynamic multivariate time series data refers to multivariate time series data with an unfixed period and dynamically changing sequence length.
The traditional purchasing business volume prediction method is to manually select features and train a full-connection neural network for prediction. However, the problem of incomplete feature extraction by manpower, the long-term dependence relationship of long-period time series cannot be modeled by a fully-connected neural network, and dynamic time series data cannot be processed, so that the search for purchasing traffic prediction of dynamic multivariate time series in robot process automation is one of the targets pursued by many enterprises.
The Robot Process Automation (RPA) is a new type artificial intelligent virtual process automation robot, which can simulate the operation of human on the computer interface by specific rules and automatically execute the corresponding process task according to the rules to replace or assist the human to complete the related computer operation. The demand for Robotic Process Automation (RPA) has increased rapidly in recent years by various industries, and it is estimated that as many as 90% of medium and large organizations have chosen Robotic Process Automation (RPA) solutions by the year 2020. Robot Process Automation (RPA) has been widely used in various fields, and in recent years, with the advent and development of corporate financial sharing centers, robot Process Automation (RPA) has also been increasingly used in the fields of procurement management and supply chain management. The Robot Process Automation (RPA) has the advantages of high working efficiency, high accuracy, high expandability, compliance and safety, capability of automatically simplifying tasks, capability of reducing purchasing management cost and capability of freeing up space for more strategic activities.
Disclosure of Invention
The invention provides a purchasing traffic prediction method and electronic equipment based on dynamic multivariate time sequence in robot process automation, which are oriented to the problems that the characteristics are not comprehensively extracted manually in the purchasing traffic prediction, when the purchasing traffic prediction is carried out for long-period time sequence prediction, the problems of gradient disappearance and gradient explosion caused by overlong time interval and the demand for the purchasing traffic prediction of the dynamic time sequence.
The method adopts the technical scheme that: a purchasing traffic prediction method based on dynamic multivariate time sequence inputs data obtained by a robot flow automatic data acquisition network into a robot flow automatic purchasing traffic prediction network to predict purchasing traffic;
the robot process automation data acquisition network extracts index data required by service purchase quantity according to the keyword table, arranges the data according to time, and obtains time sequence data of N indexes as an external sequence
Figure 100002_DEST_PATH_IMAGE001
Wherein N represents the number of indexes, and T is the length of the external sequence; each index is a variable, the firstnThe time series data of each index is the secondnSingle variable time series data
Figure 100002_DEST_PATH_IMAGE002
Is a firstnTime series data of the individual indices; the time sequence data of all indexes form dynamic multi-element time sequence data which are used as input variables X of the robot process automatic purchase business volume prediction network;
the robot process automatic purchasing business volume prediction network comprises a time convolution module, a multi-stage attention module, a full connection layer and an autoregressive module;
the time convolution module is characterized in that the time convolution module is composed of K convolution kernels, the size of each convolution kernel is F multiplied by M, F is the width of each convolution kernel, M is the depth of each convolution kernel, time sequence data of other N static indexes except the purchasing business volume index are used as an input external sequence X to be input into each convolution kernel, and finally an output characteristic sequence H with the size of P multiplied by K is obtained, wherein P is P
Figure 183407DEST_PATH_IMAGE003
Is also the length of the univariate time series;
the multi-stage attention module comprises a first stage attention layer, a second stage attention layer and a time attention layer which are connected in sequence; the output variable H passes through the first stage attention layer, an external sequence is selected in a self-adaptive mode for learning, the output external sequence of the first stage attention layer is connected with a target sequence corresponding to time through the second stage attention layer, finally the hidden states in the first two layers are combined through the time attention layer to learn the characteristics of longer time dependence, and the characteristics of longer time dependence are output
Figure 100002_DEST_PATH_IMAGE004
The autoregressive module takes the dynamic multivariate time sequence data as an input vector X and outputs the input vector X
Figure 897285DEST_PATH_IMAGE005
Figure 100002_DEST_PATH_IMAGE006
Wherein, the first and the second end of the pipe are connected with each other,
Figure 37542DEST_PATH_IMAGE007
in order to be a super-parameter,
Figure 100002_DEST_PATH_IMAGE008
is a learnable weight vector, X is an input vector,Uin order to input the weight matrix, the weight matrix is input,bin order to be a vector of the offset,
Figure 229489DEST_PATH_IMAGE009
is the noise at the time t, and,
Figure 100002_DEST_PATH_IMAGE010
is a variance, independent of time;
the robot process automation data acquisition network finally outputs the predicted value of the minimized purchasing business volume
Figure 902915DEST_PATH_IMAGE011
The technical scheme adopted by the electronic equipment is as follows: an electronic device for forecasting procurement traffic based on a dynamic multivariate time series, comprising:
one or more processors;
a storage device for storing one or more programs that, when executed by the one or more processors, cause the one or more processors to implement the dynamic multivariate time series-based procurement traffic prediction method.
Compared with the prior art, the invention has the advantages and positive effects mainly reflected in the following aspects:
(1) The invention provides a method for predicting purchasing traffic of a dynamic multi-element time sequence in robot process automation. The invention solves the problem of incomplete feature extraction in the traditional purchasing traffic prediction process, and the time convolution module provided by the invention extracts the nonlinear feature of the static time sequence.
(2) The invention solves the problems of gradient disappearance and gradient explosion caused by overlong time interval when long-period time sequence prediction is carried out by purchasing traffic prediction, and the proposed multi-stage attention module adaptively selects important characteristics to model the dependency relationship between states of long time interval.
(3) The invention solves the problem that the nonlinear module cannot process the prediction of the dynamic time sequence required in the purchasing business volume prediction, and the proposed autoregressive module processes the dynamic time sequence with unfixed period.
Drawings
FIG. 1 is a general framework diagram of a method according to an embodiment of the invention;
FIG. 2 is a schematic loss diagram of a Purchases data set model according to an embodiment of the present invention;
FIG. 3 is a NASDAQ100 data set model loss diagram of an embodiment of the present invention;
FIG. 4 is a schematic diagram of a model prediction result on a Purchases data set according to an embodiment of the present invention;
FIG. 5 is a graph illustrating the prediction of the model on the NASDAQ100 dataset according to an embodiment of the present invention;
FIG. 6 is a schematic diagram illustrating prediction effects of different time steps according to an embodiment of the present invention.
Detailed Description
For the purpose of facilitating understanding and implementing the invention by those of ordinary skill in the art, the invention is described in further detail below with reference to the accompanying drawings and examples, it being understood that the examples described herein are for purposes of illustration and explanation only and are not intended to be limiting.
Referring to fig. 1, the method for forecasting the purchasing traffic based on the dynamic multivariate time sequence provided by the present invention inputs the data obtained through the robot process automation data acquisition network into the robot process automation purchasing traffic forecasting network to forecast the purchasing traffic;
according to the robot process automation data acquisition network, index data required by service purchase quantity are extracted according to the keyword table, the data are sorted according to time, and time sequence data of N indexes are obtained and used as time sequence dataExternal sequences
Figure 100002_DEST_PATH_IMAGE012
Wherein N represents the number of indexes, and T is the length of the external sequence; each index is a variable, respectivelynThe time series data of each index isnSingle variable time series data
Figure 20651DEST_PATH_IMAGE013
Is as followsnTime series data of each index; and the time sequence data of all indexes form dynamic multi-element time sequence data which are used as input variables X of the robot process automatic purchase traffic prediction network.
The keyword table adopted in this embodiment is shown in table 1 below;
TABLE 1
Figure 100002_DEST_PATH_IMAGE014
The robot process automatic procurement business volume prediction network comprises a time convolution module, a multi-stage attention module, a full connection layer and an autoregressive module;
the time convolution module of this embodiment has K convolution kernels, and the size of each convolution kernel is F × M, where F is the width of the convolution kernel and M is the depth of the convolution kernel.
In this embodiment, a convolution neural module is first built, a convolution kernel of the convolution neural module is set, and the size of each convolution kernel is F × M by K convolution kernels, where F is the width of the convolution kernel, and M is the depth of the convolution kernel. Inputting the input variable X into K convolution kernels in sequence, inputting X into the secondkOutput variable obtained by convolution kernel
Figure 350001DEST_PATH_IMAGE015
Wherein W is k Weight parameter of k convolution kernel represents convolution operation, RELU is nonlinear activation function, and RELU function is defined as
Figure 100002_DEST_PATH_IMAGE016
b k Is a bias vector.
In this embodiment, after the time series data of N static indicators other than the index of the purchase traffic is input as the input external sequence X into the convolution kernel, the output feature sequence H with the size of P × K is finally obtained, where P ish k Is also the length of the univariate time series.
The multi-stage attention module of the present embodiment includes a first stage attention layer, a second stage attention layer, and a temporal attention layer connected in sequence; the output variable H passes through the first stage attention layer, an important external sequence is selected in a self-adaptive mode for learning, the output external sequence of the first stage attention layer is connected with a target sequence of corresponding time through the second stage attention layer, finally hidden states in the first two layers are combined through the time attention layer to learn characteristics of longer time dependence, and output is carried out
Figure 407956DEST_PATH_IMAGE017
In the first stage attention layer of this embodiment, the output signature sequence H of the previous step is used as the input sequence X of this step, and the memory cell state S at the time of the long short term memory network (LSTM) t-1 is determined according to the target sequence y without considering the target sequence y t-1 And hidden layer state h at time t-1 t-1 Constructing the kth input sequence X k Attention value at time t
Figure 100002_DEST_PATH_IMAGE018
Figure 139151DEST_PATH_IMAGE019
Wherein the content of the first and second substances,
Figure 100002_DEST_PATH_IMAGE020
is a parameter that needs to be learned by the user,
Figure 696297DEST_PATH_IMAGE021
in the form of a matrix of state weights,uis a hidden layerState h t-1 I.e., the number of hidden layer nodes of the long-short term memory network (LSTM),U m in order to input the weight matrix, the weight matrix is input,b m for the bias vector, tanh () is a nonlinear activation function,X k is as followskThe number of the input sequences is one,
Figure 100002_DEST_PATH_IMAGE022
according totInput attention of time of day
Figure 247364DEST_PATH_IMAGE023
Calculating attention weights
Figure 100002_DEST_PATH_IMAGE024
Calculating first stage attention output
Figure 843430DEST_PATH_IMAGE025
(ii) a Andthidden state at time of day
Figure 100002_DEST_PATH_IMAGE026
In which
Figure 662088DEST_PATH_IMAGE027
Is a long short term memory network LSTM (LSTM) used as an encoder.
The second stage attention layer of this embodiment constructs an input sequence of the second attention stage at time t, the secondkAn input sequence
Figure 100002_DEST_PATH_IMAGE028
Wherein isyThe sequence of the object is determined,
Figure 154250DEST_PATH_IMAGE029
representing a vertical concatenation of matrices;
computing intermediate states of the second stage attention tier
Figure 100002_DEST_PATH_IMAGE030
Figure 254930DEST_PATH_IMAGE031
Wherein
Figure 100002_DEST_PATH_IMAGE032
Is a parameter to be learned;
Figure 326791DEST_PATH_IMAGE033
in the form of a matrix of state weights,ufor hiding the layer state h t-1 I.e., the number of hidden layer nodes of the long-short term memory network (LSTM),U s in order to input the weight matrix, the weight matrix is input,b s for the bias vector, tanh () is a nonlinear activation function;
according to intermediate state
Figure 100002_DEST_PATH_IMAGE034
Computing second stage weights
Figure 534043DEST_PATH_IMAGE035
Figure 100002_DEST_PATH_IMAGE036
According to input attention
Figure 931527DEST_PATH_IMAGE037
And weight
Figure 100002_DEST_PATH_IMAGE038
Computing the second stage attention tier output
Figure 519503DEST_PATH_IMAGE039
Figure 100002_DEST_PATH_IMAGE040
Wherein
Figure 893590DEST_PATH_IMAGE041
In order to be a hyper-parameter,
Figure 100002_DEST_PATH_IMAGE042
for the purpose of the second stage attention weighting,
Figure 453885DEST_PATH_IMAGE043
is the second stage input sequence.
The first two phases focus more on the relationship between sequences in the time series, and for better prediction of the target value, it is necessary to learn the correlation to the time series data for a longer time interval. This phase is a temporal attention phase, and by means of the temporal attention layer, longer-time dependencies are learned based on the hidden states of the first two phases and the hidden state with the prediction target.
The temporal attention layer of the present embodiment first calculates the intermediate states
Figure 100002_DEST_PATH_IMAGE044
Figure 84586DEST_PATH_IMAGE045
Wherein, the first and the second end of the pipe are connected with each other,
Figure 100002_DEST_PATH_IMAGE046
in the form of a matrix of state weights,
Figure 97541DEST_PATH_IMAGE047
in order to input the weight matrix, the weight matrix is input,
Figure 100002_DEST_PATH_IMAGE048
for the bias vector, the hidden layer state of the long short term memory network (LSTM) as decoder at time t-1
Figure 278249DEST_PATH_IMAGE049
Figure 100002_DEST_PATH_IMAGE050
The long-short term memory network (LSTM) as an encoder in the first two stages is formed by transversely splicing the states of memory units at the time t-1,
Figure 693050DEST_PATH_IMAGE051
the memory unit state of the long-short term memory network (LSTM) at the current stage at the moment k,
Figure 100002_DEST_PATH_IMAGE052
output for the second attention stage;
according to intermediate state
Figure 494653DEST_PATH_IMAGE053
Calculating temporal attention weights
Figure 100002_DEST_PATH_IMAGE054
Figure 493439DEST_PATH_IMAGE055
Attention weighting according to time
Figure 100002_DEST_PATH_IMAGE056
Computing context vectorsC t
Figure 710794DEST_PATH_IMAGE057
According to context vectorC t Calculating
Figure 100002_DEST_PATH_IMAGE058
Figure 42418DEST_PATH_IMAGE059
Calculating the time attention stagetHidden layer states of long-short term memory networks (LSTM) as decoders at time of day
Figure 100002_DEST_PATH_IMAGE060
In which
Figure 218185DEST_PATH_IMAGE061
For long short term memory networks (LSTM) to be used as decoders,
Figure 100002_DEST_PATH_IMAGE062
in the form of a matrix of state weights,
Figure 441618DEST_PATH_IMAGE063
is a bias vector;
calculating final output of a multi-stage attention module
Figure 100002_DEST_PATH_IMAGE064
Figure 728243DEST_PATH_IMAGE065
Wherein, the first and the second end of the pipe are connected with each other,
Figure 100002_DEST_PATH_IMAGE066
in the form of a matrix of state weights,
Figure 648794DEST_PATH_IMAGE067
in order to be a vector of the offset,
Figure DEST_PATH_IMAGE068
is the hidden state of the decoder at time t,C t is a context vector.
The autoregressive module of this embodiment outputs the dynamic multivariate time series data as an input vector X
Figure DEST_PATH_IMAGE069
Figure DEST_PATH_IMAGE070
Wherein, the first and the second end of the pipe are connected with each other,
Figure DEST_PATH_IMAGE071
in order to be a hyper-parameter,
Figure DEST_PATH_IMAGE072
is a learnable weight vector, X is an input vector,Uin order to input the weight matrix, the weight matrix is input,bin order to be a vector of the offset,
Figure DEST_PATH_IMAGE073
is the noise at the time t, and,
Figure DEST_PATH_IMAGE074
is a variance, independent of time;
the robot process automation data acquisition network of the embodiment finally outputs the predicted value of the minimized purchasing traffic
Figure DEST_PATH_IMAGE075
. And finally, the robot process automation stores the prediction result to a corresponding position.
The robot process automation data acquisition network of the embodiment is a trained robot process automation data acquisition network; the training process adopts a small-batch random gradient descent method and an ADAM optimizer, and the objective function is the predicted value of the minimized purchasing trafficy t And true valueyMean square error between
Figure DEST_PATH_IMAGE076
The invention is further illustrated by the following experiments. To test the performance of the present invention, two data sets were used, as shown in table 2.
TABLE 2
Data set Test set Test set Description of data set
Purchases 40000 10000 Shopping data
NASDAQ
100 32000 8560 Stock price data
Shopping data sets (Purchases): shopping datasets (Purchases) are Kaggle-based "get valuable shopper" challenge datasets that contain the shopping history of thousands of individuals, with each user record containing over a year transactions, including many fields such as product name, chain of stores, quantity, and date of purchase. In the experiment, the data was processed over time to obtain a time series shopping data set (containing 50000 samples), of which 4000 were used as training set and the rest were used as test set.
Nasdak 100 dataset (NASDAQ 100): this data set collected the stock prices of 81 companies in the NASDAQ100 index as a driving time series. The NASDAQ100 index serves as the target sequence. In the experiment, the first 32000 samples were used as training set, and the last 8560 samples were used as test set.
In order to measure the effectiveness of the purchasing traffic prediction method based on the dynamic multivariate time sequence, two evaluation indexes are used in the experiment: root Mean Square Error (RMSE) and mean square error (MAE). Suppose that
Figure DEST_PATH_IMAGE077
Is the firstiAt a time of a sampletThe true value of the time of day,
Figure DEST_PATH_IMAGE078
is the firstiAt a time of a sampletThe predicted value of the time model, RMSE, is defined as:
Figure DEST_PATH_IMAGE079
MAE is defined as:
Figure DEST_PATH_IMAGE080
the smaller the two values are, the better the prediction effect is, wherein N is the number of samples. Parameters need to be set in the purchasing business volume prediction method based on the dynamic multivariate time sequence, such as time step length T of the time sequence. In order to reduce the experimental error, 10 experiments were repeated, and the average of 10 experiments was taken as the final result.
As shown in Table 3, the dynamic multivariate time series-based prediction method for the procurement traffic volume is about 0.3 in both RMSE and MAE on two data sets, and the effect is better than that of the traditional method (LSTM). The method has different prediction effects on different data sets because the data characteristics of different data sets are different.
TABLE 3 evaluation results
Figure DEST_PATH_IMAGE081
Please refer to fig. 2 and fig. 3, which are a purchasas data set model loss diagram and a NASDAQ100 data set model loss diagram, respectively; as can be seen from fig. 2 and 3, the dynamic multivariate time sequence-based purchasing traffic prediction method has a good training convergence effect on two different data sets, and proves that the method has good prediction performance on both periodic and non-periodic time sequences. As can be seen from fig. 2 and 3, the training loss of the model decreases with the increase of the number of training rounds, which proves the effectiveness of the model in predicting the time series.
In order to observe the model prediction effect, the prediction results of the model on two different data sets are plotted in fig. 4 and 5. As can be seen from fig. 4 and 5, when the training data is sufficient, the model prediction result is very close to the real result. In addition, as the training time is increased, the overlapping effect of the predicted value and the true value of the model on the two data sets is better and better. The solid line represents the predicted value and the dotted line represents the true value.
In order to further optimize the parameters of the model, the model performance at different time steps on different data sets was tested. The experimental result is shown in fig. 6, and it can be seen from fig. 6 that, on the Purchases data set, when the time step is 4, the performance of the model is the best; on the NASDAQ100 s dataset, the model performed best when the time step was 9.
The invention provides a purchasing business volume prediction method based on a dynamic multivariate time sequence in robot-oriented process automation. The invention solves the problem of incomprehensive manual feature extraction in purchasing traffic prediction, and the time convolution module provided by the invention extracts the nonlinear feature of a static time sequence. The invention solves the problems of gradient disappearance and gradient explosion caused by overlong time interval when long-period time sequence prediction is carried out by purchasing traffic prediction, and the proposed multi-stage attention module adaptively selects important characteristics to model the dependency relationship between states of long time interval. The invention solves the problem that a nonlinear module cannot process the prediction of the dynamic time sequence in the purchasing traffic prediction, and the proposed autoregressive module processes the dynamic time sequence with unfixed period.
The invention can provide a more accurate and comprehensive purchasing business volume prediction method for users in more fields of artificial intelligence, purchasing management and the like.
It should be understood that the above description of the preferred embodiments is given for clarity and not for any purpose of limitation, and that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (7)

1. A purchasing business volume prediction method based on dynamic multivariate time sequence is characterized in that: inputting data obtained through a robot process automation data acquisition network into a robot process automation purchasing traffic prediction network to predict purchasing traffic;
the robot process automation data acquisition network extracts index data required by service purchase quantity according to the keyword table, arranges the data according to time, and obtains time sequence data of N indexes as an external sequence
Figure DEST_PATH_IMAGE001
Wherein N represents the number of indexes, and T is the length of the external sequence; each index is a variable, respectivelynThe time series data of each index is the secondnSingle variable time series data
Figure DEST_PATH_IMAGE002
Is as followsnTime series data of the individual indices; the time sequence data of all indexes form dynamic multi-element time sequence data which are used as input variables X of the robot process automatic purchase business volume prediction network;
the robot process automatic procurement business volume prediction network comprises a time convolution module, a multi-stage attention module, a full connection layer and an autoregression module;
the time convolution module is characterized in that K convolution kernels are used, the size of each convolution kernel is F multiplied by M, wherein F is the width of each convolution kernel, M is the depth of each convolution kernel, time sequence data of other N static indexes except the purchasing traffic indexes are used as an input external sequence X to be input into each convolution kernel, and finally an output characteristic sequence H with the size of P multiplied by K is obtained, wherein P is P
Figure DEST_PATH_IMAGE003
Is also the length of the univariate time series;
the multi-stage attention module, comprising sequential connectionsA first stage attention layer, a second stage attention layer and a time attention layer are connected; the output variable H passes through the first-stage attention layer, an external sequence is selected in a self-adaptive mode to learn, the output external sequence of the first-stage attention layer is connected with a target sequence of corresponding time through the second-stage attention layer, finally hidden states in the first two layers are combined and learned to a feature which depends on the output variable H for a longer time through the time attention layer, and the feature which depends on the output variable H for a longer time is output
Figure DEST_PATH_IMAGE004
The autoregressive module takes the dynamic multivariate time sequence data as an input vector X and outputs the input vector X
Figure DEST_PATH_IMAGE005
Figure DEST_PATH_IMAGE006
Wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE007
in order to be a hyper-parameter,
Figure DEST_PATH_IMAGE008
is a learnable weight vector, X is an input vector,Uin order to input the weight matrix, the weight matrix is input,bin order to be a vector of the offset,
Figure DEST_PATH_IMAGE009
is the noise at the time of the t-time,
Figure DEST_PATH_IMAGE010
is a variance, independent of time;
the robot process automation data acquisition network finally outputs the predicted value of the minimized purchasing business volume
Figure DEST_PATH_IMAGE011
2. The method of claim 1, wherein the method for forecasting procurement traffic based on dynamic multivariate time series comprises: in the time convolution module, input variable X is input into K convolution kernels in sequence, and X is input into the second convolution kernelkOutput variable obtained by convolution kernel
Figure DEST_PATH_IMAGE012
WhereinW k Is as followskWeight parameter of convolution kernel represents convolution operation, RELU is nonlinear activation function, and RELU function is defined as
Figure DEST_PATH_IMAGE013
b k Is a bias vector.
3. The method for forecasting procurement traffic based on dynamic multivariate time series as claimed in claim 1, characterized in that: the attention layer of the first stage takes the output characteristic sequence H of the previous stage as the input sequence X of the first stage, does not consider the target sequence y, and is based on the memory unit state S at the moment of the long-short term memory network (LSTM) t-1 t-1 And hidden layer state h at time t-1 t-1 Constructing the kth input sequence X k Attention value at time t
Figure DEST_PATH_IMAGE014
Figure DEST_PATH_IMAGE015
Wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE016
is a parameter that needs to be learned,
Figure DEST_PATH_IMAGE017
is the status rightThe weight matrix is a matrix of the weight,ufor hiding the layer state h t-1 The dimension of (a), i.e. the number of hidden layer nodes of the long-short term memory network LSTM,U m in order to input the weight matrix, the weight matrix is input,b m in order to be a vector of the offset,
Figure DEST_PATH_IMAGE018
in order to be a non-linear activation function,X k is a firstkThe number of the input sequences is one,
Figure DEST_PATH_IMAGE019
according totInput attention of time
Figure DEST_PATH_IMAGE020
Calculating attention weights
Figure DEST_PATH_IMAGE021
Calculating first stage attention output
Figure DEST_PATH_IMAGE022
(ii) a Andthidden state at time of day
Figure DEST_PATH_IMAGE023
Wherein
Figure DEST_PATH_IMAGE024
Is a long short term memory network LSTM used as an encoder.
4. The method of claim 3, wherein the method comprises the steps of: the second stage attention layer is constructed in a second attention stagetInput sequence of moments, nokAn input sequence
Figure DEST_PATH_IMAGE025
Wherein isyThe sequence of the object is determined,
Figure DEST_PATH_IMAGE026
representing a vertical concatenation of matrices;
computing an intermediate state of the second stage attention tier
Figure DEST_PATH_IMAGE027
Figure DEST_PATH_IMAGE028
Wherein
Figure DEST_PATH_IMAGE029
Is a parameter to be learned;
Figure DEST_PATH_IMAGE030
in the form of a matrix of state weights,ufor hiding the layer state h t-1 The dimension of (a), i.e. the number of hidden layer nodes of the long-short term memory network LSTM,U s in order to input the weight matrix, the weight matrix is input,b s for the bias vector, tanh () is a nonlinear activation function;
according to the intermediate state
Figure DEST_PATH_IMAGE031
Computing second stage weights
Figure DEST_PATH_IMAGE032
Figure DEST_PATH_IMAGE033
According to input attention
Figure DEST_PATH_IMAGE034
And weight
Figure DEST_PATH_IMAGE035
Meter for measuringComputing second stage attention tier output
Figure DEST_PATH_IMAGE036
Figure DEST_PATH_IMAGE037
Wherein
Figure DEST_PATH_IMAGE038
In order to be a super-parameter,
Figure DEST_PATH_IMAGE039
for the purpose of the second stage attention weighting,
Figure DEST_PATH_IMAGE040
is the second stage input sequence.
5. The method of claim 4, wherein the method for forecasting procurement traffic based on dynamic multivariate time series comprises the following steps: the temporal attention layer, first computing intermediate states
Figure DEST_PATH_IMAGE041
Figure DEST_PATH_IMAGE042
Wherein, the first and the second end of the pipe are connected with each other,
Figure DEST_PATH_IMAGE043
in the form of a matrix of state weights,
Figure DEST_PATH_IMAGE044
in order to input the weight matrix, the weight matrix is input,
Figure DEST_PATH_IMAGE045
long and short term memory networks as decoders for offset vectorsHidden layer state of LSTM at time t-1
Figure DEST_PATH_IMAGE046
Figure DEST_PATH_IMAGE047
The long-short term memory network LSTM as an encoder in the first two stages is formed by transversely splicing the states of memory units of the long-short term memory network LSTM at the time t-1,
Figure DEST_PATH_IMAGE048
the memory unit state of the long-short term memory network LSTM at the moment k at the current stage,
Figure DEST_PATH_IMAGE049
output for the second attention stage;
according to the intermediate state
Figure DEST_PATH_IMAGE050
Calculating temporal attention weights
Figure DEST_PATH_IMAGE051
Figure DEST_PATH_IMAGE052
Attention weighting according to time
Figure DEST_PATH_IMAGE053
Computing context vectorsC t
Figure DEST_PATH_IMAGE054
According to context vectorC t Calculating
Figure DEST_PATH_IMAGE055
Figure DEST_PATH_IMAGE056
Calculating the time attention stagetHidden layer state of long-short term memory network LSTM as decoder at time of day
Figure DEST_PATH_IMAGE057
Wherein
Figure DEST_PATH_IMAGE058
For the long short term memory network LSTM to be used as a decoder,
Figure DEST_PATH_IMAGE059
in the form of a matrix of state weights,
Figure DEST_PATH_IMAGE060
is a bias vector;
calculating final output of a multi-stage attention module
Figure DEST_PATH_IMAGE061
Figure DEST_PATH_IMAGE062
Wherein, the first and the second end of the pipe are connected with each other,
Figure DEST_PATH_IMAGE063
in the form of a matrix of state weights,
Figure DEST_PATH_IMAGE064
in order to be a vector of the offset,
Figure DEST_PATH_IMAGE065
is the hidden state of the decoder at time t,C t is a context vector.
6. The method according to any one of claims 1-5The method for predicting the purchasing business volume of the dynamic multi-element time sequence is characterized by comprising the following steps: the robot process automatic data acquisition network is a trained robot process automatic data acquisition network; the training process adopts a small-batch random gradient descent method and an ADAM optimizer, and the objective function is the predicted value of the minimized purchasing trafficy t And true valueyMean square error between
Figure DEST_PATH_IMAGE066
7. An electronic device for forecasting procurement traffic based on a dynamic multivariate time series, comprising:
one or more processors;
a storage device for storing one or more programs that, when executed by the one or more processors, cause the one or more processors to implement the dynamic multivariate time series based procurement traffic prediction method of any of claims 1-6.
CN202211126215.9A 2022-09-16 2022-09-16 Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment Active CN115204535B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211126215.9A CN115204535B (en) 2022-09-16 2022-09-16 Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211126215.9A CN115204535B (en) 2022-09-16 2022-09-16 Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment

Publications (2)

Publication Number Publication Date
CN115204535A true CN115204535A (en) 2022-10-18
CN115204535B CN115204535B (en) 2022-12-06

Family

ID=83572324

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211126215.9A Active CN115204535B (en) 2022-09-16 2022-09-16 Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment

Country Status (1)

Country Link
CN (1) CN115204535B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090024491A1 (en) * 2007-07-18 2009-01-22 General Electric Company System and method to calculate procurement of assets
CN110163401A (en) * 2018-02-12 2019-08-23 腾讯科技(深圳)有限公司 Prediction technique, data predication method and the device of time series
WO2021139251A1 (en) * 2020-07-30 2021-07-15 平安科技(深圳)有限公司 Server system anomaly detection method and apparatus, computer device, and storage medium
US20210350225A1 (en) * 2020-05-08 2021-11-11 International Business Machines Corporation Determining multivariate time series data dependencies
WO2021257610A1 (en) * 2020-06-15 2021-12-23 Spartan Capital Intelligence, Llc Time series forecasting and visualization methods and systems
CN114493014A (en) * 2022-01-28 2022-05-13 湖南大学 Multivariate time series prediction method, multivariate time series prediction system, computer product and storage medium
CN114818515A (en) * 2022-06-24 2022-07-29 中国海洋大学 Multidimensional time sequence prediction method based on self-attention mechanism and graph convolution network
CN114881354A (en) * 2022-05-31 2022-08-09 支付宝(杭州)信息技术有限公司 Method and device for predicting multivariate time series

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090024491A1 (en) * 2007-07-18 2009-01-22 General Electric Company System and method to calculate procurement of assets
CN110163401A (en) * 2018-02-12 2019-08-23 腾讯科技(深圳)有限公司 Prediction technique, data predication method and the device of time series
US20210350225A1 (en) * 2020-05-08 2021-11-11 International Business Machines Corporation Determining multivariate time series data dependencies
WO2021257610A1 (en) * 2020-06-15 2021-12-23 Spartan Capital Intelligence, Llc Time series forecasting and visualization methods and systems
WO2021139251A1 (en) * 2020-07-30 2021-07-15 平安科技(深圳)有限公司 Server system anomaly detection method and apparatus, computer device, and storage medium
CN114493014A (en) * 2022-01-28 2022-05-13 湖南大学 Multivariate time series prediction method, multivariate time series prediction system, computer product and storage medium
CN114881354A (en) * 2022-05-31 2022-08-09 支付宝(杭州)信息技术有限公司 Method and device for predicting multivariate time series
CN114818515A (en) * 2022-06-24 2022-07-29 中国海洋大学 Multidimensional time sequence prediction method based on self-attention mechanism and graph convolution network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
杜圣东等: "一种基于序列到序列时空注意力学习的交通流预测模型", 《计算机研究与发展》 *

Also Published As

Publication number Publication date
CN115204535B (en) 2022-12-06

Similar Documents

Publication Publication Date Title
Guan et al. An adaptive neuro-fuzzy inference system based approach to real estate property assessment
CN111160968A (en) SKU-level commodity sales prediction method and device
Li et al. Multi-factor based stock price prediction using hybrid neural networks with attention mechanism
CN109191276B (en) P2P network lending institution risk assessment method based on reinforcement learning
CN110175689A (en) A kind of method of probabilistic forecasting, the method and device of model training
CN110264270A (en) A kind of behavior prediction method, apparatus, equipment and storage medium
CA3165582A1 (en) Data processing method and system based on similarity model
Adeyemo et al. Effects of normalization techniques on logistic regression in data science
Dat et al. Hybrid online model based multi seasonal decompose for short-term electricity load forecasting using ARIMA and online RNN
Zhou et al. Supply chain fraud prediction based on xgboost method
Goyal et al. A solution to forecast demand using long short-term memory recurrent neural networks for time series forecasting
Yarushev et al. Review of studies on time series forecasting based on hybrid methods, neural networks and multiple regression
Pardeshi et al. Stock market price prediction: a hybrid LSTM and sequential self-attention based approach
CN108765137A (en) A kind of credit demand prediction technique and system, storage medium
KR102409041B1 (en) portfolio asset allocation reinforcement learning method using actor critic model
CN115204535B (en) Purchasing business volume prediction method based on dynamic multivariate time sequence and electronic equipment
Pradhan et al. Application of neural network in prediction financial viability
CN107743071A (en) The enhancing method for expressing and device of a kind of network node
AmirAskari et al. A modified fuzzy relational model approach to prediction of foreign exchange rates
Sun et al. Short-term stock price forecasting based on an SVD-LSTM model
CN114092188A (en) Recommendation system algorithm of lightweight B2B E-commerce platform
Dendramis et al. Measuring the Default Risk of Small Business Loans: Improved Credit Risk Prediction using Deep Learning
Murthy et al. Model for Predicting Prospective Big-Mart Sales Based on Grid Search Optimization (GSO)
PhamToan et al. Improving forecasting model for fuzzy time series using the Self-updating clustering and Bi-directional Long Short Term Memory algorithm
CN117557318B (en) Management intelligent analysis method and system based on virtual shopping

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant