CN116308854A - Information cascading popularity prediction method and system based on probability diffusion - Google Patents

Information cascading popularity prediction method and system based on probability diffusion Download PDF

Info

Publication number
CN116308854A
CN116308854A CN202211577433.4A CN202211577433A CN116308854A CN 116308854 A CN116308854 A CN 116308854A CN 202211577433 A CN202211577433 A CN 202211577433A CN 116308854 A CN116308854 A CN 116308854A
Authority
CN
China
Prior art keywords
cascade
implicit
state
ode
implicit state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211577433.4A
Other languages
Chinese (zh)
Inventor
周帆
程章桃
肖哲
徐增
钟婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kashgar Electronic Information Industry Technology Research Institute
University of Electronic Science and Technology of China
CETC 54 Research Institute
Original Assignee
Kashgar Electronic Information Industry Technology Research Institute
University of Electronic Science and Technology of China
CETC 54 Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kashgar Electronic Information Industry Technology Research Institute, University of Electronic Science and Technology of China, CETC 54 Research Institute filed Critical Kashgar Electronic Information Industry Technology Research Institute
Publication of CN116308854A publication Critical patent/CN116308854A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • G06F17/13Differential equations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • Algebra (AREA)
  • General Business, Economics & Management (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • Primary Health Care (AREA)
  • Databases & Information Systems (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention belongs to the technical field of information propagation, and discloses an information cascade popularity prediction method and system based on probability diffusion, which are used for modeling time irregularity of cascade events and uncertainty of information propagation based on a neural ordinary differential equation and a diffusion probability model and carrying out cascade popularity prediction. Firstly, constructing information cascade data into a cascade social graph, a cascade graph and a cascade sequence, wherein the cascade graph and the cascade sequence are used for feature learning of downstream structure and sequence models; then, a gating mechanism based on a normal differential equation and time perception obtains cascading implicit characteristics; on the basis, from the perspective of space-time hidden variables, combining a conditional diffusion probability model and an implicit ordinary differential equation to obtain an implicit characteristic of uncertain cascade; finally, popularity prediction is performed by using the cascade implicit features and the cascade uncertain implicit features. The method can be used for continuous time state modeling and propagation uncertainty modeling of information cascading, and can be used for predicting popularity of the information cascading better.

Description

Information cascading popularity prediction method and system based on probability diffusion
Technical Field
The invention belongs to the technical field of information propagation, relates to information cascade popularity prediction, in particular to information diffusion (Information Diffusion) and popularity prediction (Popularity Prediction) in Deep Learning (Deep Learning), and relates to a method based on a neural ordinary differential equation (Neural Ordinary Differential Equations, NODEs) and a diffusion probability model (Diffusion Probabilistic Model, DPM).
Background
With the widespread use of social media, such as Twitter, newwave microblogs, etc., it has become one of the main sources of information for users to generate and disseminate information and guide users' daily decisions. The forwarding or sharing behavior of users facilitates rapid propagation of information on a social platform, facilitating generation of information cascades, such as: the initial information release (e.g., news, blogs) of the user and the sharing and forwarding of other users form an information cascade. The information cascade popularity prediction is to predict the scale of a forwarding user after a certain cascade (text pushing, microblog and the like) passes a specific period of time by observing the propagation evolution process (such as forwarding user and time) of the early stage of information. The accurate prediction of the popularity of the information cascade is beneficial to the rapid and effective propagation of the information, and can bring great economic and social effects, which also gets great attention in academia and industry. In the propagation of information, there are generally two important phenomena: the temporal irregularities of cascading events (e.g., users may forward or share news, tweets, etc. at any time) and the inherent uncertainty of information dissemination. The existing accurate prediction method of the information cascade popularity is mainly divided into three types: (1) probability generation model: modeling an information propagation process by using a time point process, such as a poisson process, a hopus process and the like, by using a strength function of a newly-appearing cascade event; (2) Based on the model of the characteristics, the method predicts the cascade popularity by exploring and designing the characteristics such as cascade structure, time, user attribute, information content and the like; (3) The deep learning model models cascading events mainly using simple event sequence models, such as a recurrent neural network (Recurrent Neural Network, RNN), long Short-Term Memory (LSTM), and gated loop units (Gated Recurrent Unit, GRU). However, the existing method ignores the time irregularity of cascading events and the uncertainty of information propagation in the actual information diffusion process, so that accurate prediction of the popularity of information cascading is difficult to realize, and the prediction effect is not ideal.
Disclosure of Invention
The invention aims at solving the technical problems in the prior art, and designs a novel information cascade popularity prediction method and system based on a neural ordinary differential equation and a diffusion probability model, which only use the structural information of a cascade social graph and a cascade graph and the diffusion information of the cascade, simulate the time irregularity of a cascade event and the uncertainty of information propagation in the information propagation process, and improve the accuracy of cascade popularity prediction.
The method is characterized in that a framework based on a neural ordinary differential equation and a diffusion probability model is constructed to model time irregularity of cascade events and uncertainty of information propagation, and cascade popularity prediction is carried out. Firstly, constructing collected cascade data into a cascade social graph, a cascade graph and a cascade sequence, wherein the cascade graph, the cascade graph and the cascade sequence are used for feature learning of a downstream structure and sequence model; a novel time-aware neural frequent differentiation module (Temporal Ordinary Differential Equations, T-ODE) is then designed to model the temporal irregularities of cascading events by generalizing the discrete feature states in the RNN to continuous-time dynamics defined by the ODE. Meanwhile, the input information and time interval information between cascade events are considered through a gating mechanism to update the implicit state of cascade, so that the learned cascade characteristics can better accord with the real information propagation process. On the basis, from the perspective of space-time hidden variables, a cascade uncertainty modeling module (DPM-ODE) combining a conditional diffusion probability model and an implicit ordinary differential equation (Latent Ordinary Differential Equations) is designed, and the module simultaneously considers the uncertainty of cascade evolution (time dependence of a cascade diagram) and the uncertainty of user space association (space structure of the cascade diagram). In this module, the conditional diffusion probability model approximates the posterior distribution score of the user structural features on the condition of continuous time cascading features, thereby reconstructing the cascading graph structural features to simulate the uncertainty of the user spatial correlation. The conditional probability diffusion model is trained through explicit cascading structure generation, and useful correlations between structural features can be observed. On the basis, the implicit ordinary differential equation defines a generating process with time according to the deterministic evolution of the initial cascade state, and simultaneously acquires cascade characteristics of propagation uncertainty. Through the design of two modules, the continuous time dynamic and propagation uncertainty representation of the cascade is combined and input into a fully connected layer for popularity prediction. The method realizes a more novel popularity modeling mode and improves the accuracy of cascade popularity prediction.
Based on the inventive thought, the invention provides an information cascade popularity prediction method based on probability diffusion, which comprises the following steps:
s1, constructing cascade data into a social graph, a cascade graph and a cascade sequence; then obtaining global structural features according to the structural attributes of the cascade global social graph; obtaining node similarity characteristics according to the cascade diagram; then, splicing the global structural features and the node similarity features to obtain a structural embedded representation of the user;
s2, acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure; then obtaining an ODE implicit state by using a first ODE solver according to the implicit state at the previous moment and the implicit state at the current moment; then, the implicit state and the ODE implicit state at the current moment are cascaded to obtain a cascaded implicit state; then obtaining an updated cascade implicit state according to the implicit state at the current moment and the cascade implicit state and outputting cascade implicit characteristics through full connection based on a time-aware gating mechanism;
s3, generating a target cascade structure embedded representation of a user based on the diffusion probability model by taking the cascade implicit characteristic obtained in the step S2 as a condition, and resampling; then, evolution is carried out on the resampling result in a probability space by using a second ODE solver to obtain an implicit characteristic with uncertain cascade;
S4, predicting the cascade popularity according to the cascade implicit characteristic obtained in the step S2 and the cascade uncertain implicit characteristic splicing result.
In step S1, the cascade data is constructed as a social graph
Figure SMS_1
Cascading diagram->
Figure SMS_2
And a cascading sequence. The social graph->
Figure SMS_3
Mainly consisting of all subscribers to the cascading sequence. The cascade diagram->
Figure SMS_4
Mainly consists of any one of the subscribers of the cascade sequence.
The cascaded global social graph is then learned using sparse matrix decomposition (sparsematrixfactor)
Figure SMS_5
Obtaining global structural feature E g (see C.Donnat, M.Zitnik, D.Hallac, and J.Leskovic, "Learning structural node embeddings via diffusion wavelets," in SIGKDD,2018, pp.1320-132); modeling a cascade Graph using Graph Wavelets model>
Figure SMS_6
Obtaining node similarity features E c (see J.Zhang, Y.Dong, Y.Wang, J.Tang, and M.Ding, "ProNE: fast and scalable network representation learning," in IJCAI, macao, china, aug.10-16,2019, pp.4278-4284); finally to E g And E is c And performing splicing operation to obtain the structural representation E of the user.
Because of the randomness of human response behavior, the forwarding of information may occur at any time, resulting in an irregular time sequence in the information cascade. The invention promotes the cascade state in the RNN to continuous time dynamic defined by ODE through step S2, and simultaneously considers input information and time interval information between cascade events through a gating mechanism.
Firstly, acquiring the implicit state of the current moment by utilizing an LSTM unit according to the implicit state of the previous moment and the embedded representation of the user structure so as to avoid the gradient disappearance or explosion phenomenon caused by using an ODE solver.
Given at time t i-1 Is a concatenation of implicit state pairs (c) i-1 ,h i-1 ) And user u's structure embedding
Figure SMS_7
First they are input into LSTM cell to generate new implicit stateState (c) i ,h′ i ):
Figure SMS_8
Wherein θ l Representing model parameters that can be learned in LSTM cells.
Then we will h i-1 Inputting the data to a first ODE solver based on Euler algorithm to obtain an ODE implicit state z i
z i =ODESolver(f ω ,h i-1 ,h′ i ,(t i-1 ,t i ))
Through the above operation, using a first ODE solver based on the Euler algorithm to evaluate the cascade hidden states between successive observations, an irregular time interval t is constructed i-1 And t i A cascade of continuous times in between.
To construct the actual time t i We will implicit state h 'at the current time' i And ODE implicit State z i As input, learn the true characteristics of the cascade; the GRU unit is used to update the implicit state of cascade, i.e. the implicit state of current time and the implicit state of ODE are cascade to obtain the implicit state h i . The above process is expressed as:
h″ i =GRUCell(θ g ,h′ i ,z i )
wherein,,
Figure SMS_9
is the ODE from time t i-1 By time t i Is a solution of h i Is the cascade implicit state after GRU unit update, θ g Representing model parameters that are learnable in the GRU unit;
finally, a time-aware gating mechanism (T-Gate) is designed to integrate the implicit representations of the first two steps to generate a continuous-time cascade of states.
The neural ordinary differential equation regards the parameter update in the neural network as a process of solving the ordinary differential equation, and from the point of view of the numerical method, the discrete layer of the neural network can be regarded as the Euler discretization of the differential equation:
Figure SMS_10
Figure SMS_11
wherein h (t) =h t . The neural network is composed of f ω (. Cndot.) parameterization, building a continuous dynamics of cascading implicit states. The parameter update process of NODE can be regarded as solving ODE numerically.
Given an implicit state h' i And h' i Updating concatenated implicit state h using time-aware gating mechanisms i
h i =v i ⊙h″ i +(1-v i )⊙h′ i ,
Wherein,,
Figure SMS_12
representing time gating; Δt (delta t) i Representing t i -t i-1 . Time gating->
Figure SMS_13
The help model determines the continuous time state that needs to be delivered.
Finally, we calculate the output state o through a full link layer 1 …o n }. Wherein,,
Figure SMS_14
and n represents the number of users evolving in the early stage. Typically, we use the last output state o n Representing a cascade implicit feature Z.
In step S3, the information propagation uncertainty exists not only in the evolution process of the cascade (time dependence of the cascade graph) but also in the spatial correlation between users (spatial structure of the cascade graph). The invention integrates a diffusion probability model and an implicit neural ordinary differential equation to model information diffusion uncertainty from the perspective of space-time hidden variables. The invention reconstructs cascading information cascading structural features by approximating posterior distribution scores of user structural features on the basis of a designed conditional diffusion probability model by taking continuous time cascading features as conditions. The conditional probability diffusion model is used for structure generation and useful correlations between structural features can be observed. On the basis, the implicit ordinary differential equation defines a generating process with time according to the deterministic evolution of the initial cascade state, and simultaneously acquires cascade characteristics of propagation uncertainty. Based on the above analysis, step S3 of the present invention comprises the following sub-steps:
s31 given information cascade diagram
Figure SMS_15
And social graph->
Figure SMS_16
And structural feature E, obtaining an initial data distribution of q (E 0 ),E 0 =E;
S32 based on the diffusion probability model, the forward process gradually adds noise to the data to reduce the prior q (E 0 ) Converting into Gaussian distribution which is easy to process; obtaining model distribution through the inverse process of the diffusion probability model; obtaining user structure embedded E by model distributed sampling ta
The uncertainty of the structure embedding is captured in a probabilistic manner using a diffusion probability model. The conditional diffusion probability model uses the cascade implicit characteristic Z learned in step S2 co Structure embedding for generating target concatenation using =z as condition
Figure SMS_17
Probability generation of the diffusion probability model is aimed at using model distribution +.>
Figure SMS_18
To estimate the true conditional data distribution +.>
Figure SMS_19
In this step, based onThe diffusion probability model, forward process gradually adds a priori q (E 0 ) Converting into Gaussian distribution which is easy to process; the forward process of each time step R e {0,1,2, …, R } is defined as a gaussian change:
Figure SMS_20
Figure SMS_21
in the formula, = r Controlling a process of adding gaussian noise to data for a fixed constant;
Figure SMS_22
the gaussian distribution is represented by the formula,
Figure SMS_23
mean, beta r I represents variance; inverse process simulation conditional distribution using conditional diffusion probability model
Figure SMS_24
Defining a conditional denoising function E θ :(E ta |Z co )→E ta It will Z co As input:
Figure SMS_25
Figure SMS_26
in the method, in the process of the invention,
Figure SMS_27
mean value of gaussian distribution; />
Figure SMS_28
Representing the variance of the gaussian distribution;
we set alpha r :=1-β r And
Figure SMS_29
distribution q (E) r |E 0 ) Expressed as:
Figure SMS_30
Figure SMS_31
the goal of (2) is to eliminate gaussian noise added during diffusion. By minimizing the variational negative log-likelihood, the parameter θ can be fitted to the data distribution q (E 0 ):
Figure SMS_32
The above formula can be simplified and effectively trained by random gradient descent:
Figure SMS_33
Wherein the noise
Figure SMS_34
θ Is the output of a noise prediction model integrating a residual neural network, a convolution neural network and an attention mechanism, and the input of the noise prediction model is represented by a user structure +.>
Figure SMS_35
The step of time r and the step of S2 are obtained to obtain a cascade implicit characteristic Z; wherein (1)>
Figure SMS_36
The sampling process may be defined as:
Figure SMS_37
wherein,,
Figure SMS_38
and->
Figure SMS_39
By sampling from r to 0 we obtain a sample E ta For modeling the evolution uncertainty described below.
S33 embedding E according to the user structure ta Acquiring an initial cascade state
Figure SMS_40
Then reconstructing an initial cascade state based on the variation self-encoder;
embedding E according to user structure ta According to the step S2 given above, the initial cascade state is obtained by inverse solution
Figure SMS_41
The extrapolation property of the ODE is then utilized to model the cascading evolution dynamics with uncertainty based on the variational self-encoder (VariationalAutoencoders, VAEs) framework. Assume that
Figure SMS_42
Is a true posterior distribution, we use model distribution based on the VAE framework +.>
Figure SMS_43
To approximate the true posterior distribution->
Figure SMS_44
Wherein phi is a parameter of the neural network, by using a linear transformation from +.>
Figure SMS_45
Derived mean->
Figure SMS_46
Sum of variances->
Figure SMS_47
Combining with the re-parameterization technique, re-sampling to obtain a reconstructed initial cascade shapeStatus- >
Figure SMS_48
Wherein ζ is sampled from the normal ethernet distribution.
S34, evolution is carried out on the reconstructed initial cascade state in a probability space by using a second ODE solver to obtain an implicit characteristic Z with uncertain cascade T
Figure SMS_49
Wherein f ξ (. Cndot.) is the ODE function of calculating the derivative, t n Indicating the last observation instant of the concatenated sequence. In this way we have generated a continuous evolutionary trajectory in which each point represents an implicit variable p (Z 'following a posterior distribution' t |Z′ 0 ,…,Z′ t-1 ). Finally, we optimize model parameters by maximizing the lower bound of Evidence (ELBO):
Figure SMS_50
in the method, in the process of the invention,
Figure SMS_51
expressed as +.>
Figure SMS_52
Is the desire under conditions; />
Figure SMS_53
Represents the KL divergence Kullback-Leibler divergence.
In step S4, the cascade popularity is obtained by using the multi-layer perceptron according to the cascade implicit characteristic and the cascade uncertain implicit characteristic splicing result obtained in step S2.
First, we combine the cascade implicit feature Z from step S2 with the cascade uncertain implicit feature Z from step S3 T Performing splicing operations, and then inputting them into Multi-layer perceptron (Multi-Layer Perceptrons, MLPs) for cascade popularityAnd (3) degree prediction:
Figure SMS_54
wherein Concat (-) represents the splice operation. In the training process, we target the Mean Square Log Error (MSLE), combining MSLE and ELBO to train the model. The final loss function is defined as:
Figure SMS_55
Where N is the total number of cascades, P k (t p ) Representing the number of users of the kth real forwarding cascade,
Figure SMS_56
representing popularity of the kth prediction.
The invention further provides an information cascading popularity prediction system based on probability diffusion, which comprises the following steps:
the structure embedding representation acquisition module is used for constructing cascade data into a social graph, a cascade graph and a cascade sequence; then obtaining global structural features according to the structural attributes of the cascade global social graph; obtaining node similarity characteristics according to the cascade diagram; then, splicing the global structural features and the node similarity features to obtain a structural embedded representation of the user;
time-aware neural ordinary differential module T-ODE: the implicit state acquisition module is used for acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure; then obtaining an ODE implicit state by using a first ODE solver according to the implicit state at the previous moment and the implicit state at the current moment; then, the implicit state and the ODE implicit state at the current moment are cascaded to obtain a cascaded implicit state; then obtaining an updated cascade implicit state according to the implicit state at the current moment and the cascade implicit state and outputting cascade implicit characteristics through full connection based on a time-aware gating mechanism;
Cascaded uncertainty modeling module DPM-ODE: generating a target cascade structure embedded representation of a user based on a diffusion probability model by taking cascade implicit characteristics obtained by a time-aware neural ordinary differential module T-ODE as conditions, and resampling; then, evolution is carried out on the resampling result in a probability space by using a second ODE solver to obtain an implicit characteristic with uncertain cascade;
and the cascade popularity prediction model is used for predicting cascade popularity according to the cascade implicit characteristics and the cascade uncertain implicit characteristic splicing result.
The neural ordinary differential module T-ODE of the time perception comprises:
the LSTM unit is used for acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure;
the first ODE solver obtains an ODE implicit state according to the implicit state at the previous moment and the implicit state at the current moment;
the GRU unit is used for cascading the implicit state at the current moment and the implicit state of the ODE to obtain a cascading implicit state;
the updating unit is used for updating the cascade implicit state based on a time-aware gating mechanism;
and the full connection layer is used for carrying out full connection processing on the splicing result of the global structural features and the node similarity features to obtain the structural embedded representation of the user.
The cascade uncertainty modeling module DPM-ODE comprises:
an initial data distribution acquisition unit for giving an information cascade graph
Figure SMS_57
And social graph->
Figure SMS_58
And structural feature E, obtaining an initial data distribution of q (E 0 ),E 0 =E;
Uncertainty-based user structure embedding generation unit, forward process gradually adds a priori q (E 0 ) Converting into Gaussian distribution which is easy to process; obtaining model distribution through the inverse process of the diffusion probability model; then pass throughSampling the model distribution to obtain the embedded E of the user structure ta
Variable self-encoder for obtaining user structure embedded E according to neural ordinary differential module T-ODE perceived by time ta Is of the initial cascade state of (1)
Figure SMS_59
Reconstructing an initial cascade state;
a second ODE solver for evolving the reconstructed initial cascade state in the probability space to obtain a cascade uncertainty implicit feature Z T
Figure SMS_60
Wherein f ξ (. Cndot.) is the ODE function that calculates the derivative.
The information cascade popularity prediction system based on probability diffusion uses a multi-layer perceptron as a cascade popularity prediction model.
Compared with the prior art, the invention has the following beneficial effects:
1. the method is based on a neural ordinary differential equation and a diffusion probability model, can be used for continuous time state modeling and propagation uncertainty modeling of information cascade, and can be used for predicting popularity of the information cascade better.
2. The method is extremely important for understanding the information evolution process in the social network and explaining the cascade epidemic reasons; for example, the method and the device for predicting the forwarding quantity of a certain microblog in a future period can be used for downstream tasks such as marketing design, rumor prediction and the like.
3. The invention provides a neural ordinary differential equation module (T-ODE) of time perception, which considers the influence of time factors in the information propagation process and is used for modeling the time irregularity of cascade events and capturing continuous time dynamics of time perception.
4. The invention provides a cascade uncertain modeling module (DPM-ODE) combining a conditional diffusion probability model and an implicit neural ordinary differential equation; the conditional diffusion probability model reconstructs the information cascade structure characteristics of cascade by taking the continuous time cascade characteristics as the conditions, and on the basis, an implicit ordinary differential equation defines the generation process along with time according to the deterministic evolution of the initial cascade state, and simultaneously acquires the cascade characteristics of propagation uncertainty.
Drawings
FIG. 1 is a schematic diagram of an information cascade depicting an information cascade diffusion process and popularity prediction tasks.
Fig. 2 is a schematic diagram of the information cascade your popularity prediction flow based on probability diffusion in the present invention.
Fig. 3 is a schematic diagram of a noise prediction model.
Interpretation of the terms
Information concatenation (Information Cascade): fig. 1 illustrates this process in one example: after a root node issues an information content, the information content browsed by the attention of the root node is shared or forwarded. The information content propagates through the social network via the forwarding behavior of the user and facilitates the generation of the information cascade. The information cascade popularity prediction task is a classical task, predicts the scale of a cascade (text push, microblog, etc.), and is potentially affected by the user after a period of observation. The theoretical basis can be referred to in the literature [ J.Cheng, L.Adamic, P.A.Dow, J.M.Kleinberg, and J.Leskovic.can cascades be predicted In Proc.of WWW,2014 ]
Neural ordinary differential equation (Neural Ordinary Differential Equations, NODEs): the ordinary differential equation parameterizes the derivatives of hidden states by using neural networks, rather than discrete sequences of hidden layers as used in traditional models (e.g., resNet and RNN), and has a balance between numerical accuracy and computation while saving significant memory costs. The theoretical basis can be referred to in the literature [ R.T.Chen, Y.Rubanova, J.Bettencourt, and D.Duvenaud. Neal ordinary differential equivalents. In NeurIPS,2018, pp.6572-6583 ]
Diffusion probability model (Diffusion Probabilistic Model, DPM): the diffusion probability model can flexibly model complex data through a Markov chain, and gradually converts the data distribution into a distribution which is easy to process by adding noise through a forward diffusion process. A back diffusion process is then defined to generate data in a generative manner. The theoretical basis can be referred to in the literature [ J.Sohl-Dickstein, E.Weiss, N.Maheswaranathan, and S.Ganguli.deep unsupervised learning using nonnequibrimthermodammals. In ICML,2015 ]
The log function used in the present invention is based on 10.
Detailed Description
The invention will be further described with reference to the accompanying drawings.
Example 1
As shown in fig. 2, the information cascade popularity prediction method based on probability diffusion provided in this embodiment includes the following steps:
s1, constructing cascade data into a social graph, a cascade graph and a cascade sequence; then obtaining global structural features according to the structural attributes of the cascade global social graph; obtaining node similarity characteristics according to the cascade diagram; then, splicing the global structural features and the node similarity features to obtain a structural embedded representation of the user;
S2, acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure; then obtaining an ODE implicit state by using a first ODE solver according to the implicit state at the previous moment and the implicit state at the current moment; then, the implicit state and the ODE implicit state at the current moment are cascaded to obtain a cascaded implicit state; then obtaining an updated cascade implicit state according to the implicit state at the current moment and the cascade implicit state and outputting cascade implicit characteristics through full connection based on a time-aware gating mechanism;
s3, generating a target cascade structure embedded representation of a user based on the diffusion probability model by taking the cascade implicit characteristic obtained in the step S2 as a condition, and resampling; then, evolution is carried out on the resampling result in a probability space by using a second ODE solver to obtain an implicit characteristic with uncertain cascade;
s4, predicting the cascade popularity according to the cascade implicit characteristic obtained in the step S2 and the cascade uncertain implicit characteristic splicing result.
The above steps S1 to S4 are explained in detail below.
Step S1 is that cascade data are built into a social graph
Figure SMS_61
Cascading diagram->
Figure SMS_62
And a cascading sequence. The social graph->
Figure SMS_63
Mainly consisting of all subscribers to the cascading sequence. The cascade diagram- >
Figure SMS_64
Mainly consists of any one of the subscribers of the cascade sequence. Then learn the cascaded global social graph using sparse matrix decomposition (sparsematrixfactor)>
Figure SMS_65
Obtaining global structural feature E g (see C.Donnat, M.Zitnik, D.Hallac, and J.Leskovic, "Learning structural node embeddings via diffusion wavelets," in SIGKDD,2018, pp.1320-132); modeling a cascade Graph using Graph Wavelets model>
Figure SMS_66
Obtaining node similarity features E c (see J.Zhang, Y.Dong, Y.Wang, J.Tang, and M.Ding, "ProNE: fast and scalable network representation learning," in IJCAI, macao, china, aug.10-16,2019, pp.4278-4284); finally to E g And E is c And performing splicing operation to obtain the structural representation E of the user.
Step S2 is to acquire the implicit state of the current time by using the LSTM unit according to the implicit state of the previous time and the embedded representation of the user structure. Given at time t i-1 Is a concatenation of implicit state pairs (c) i-1 ,h i-1 ) And user u's structure embedding
Figure SMS_67
First they are input into LSTM unit to generate newImplicit state (c) i ,h′ i ):
Figure SMS_68
Wherein θ l Representing model parameters that can be learned in LSTM cells.
Then we will h i-1 Inputting the data to a first ODE solver based on Euler algorithm to obtain an ODE implicit state z i
z i =ODESolver(f ω ,h i-1 ,h′ i ,(t i-1 ,t i ))
Updating the cascade implicit state by using GRU unit, i.e. cascade the implicit state at the current time and ODE implicit state to obtain cascade implicit state h i . The above process is expressed as:
h″ i =GRUCell(θ g ,h′ i ,z i )
wherein,,
Figure SMS_69
is the ODE from time t i-1 By time t i Is a solution of h i Is the cascade implicit state after GRU unit update, θ g Representing model parameters that can be learned in the GRU units.
Given an implicit state h' i And h' i Updating concatenated implicit state h using time-aware gating mechanisms i
h i =ν i ⊙h″ i +(1-ν i )⊙h′ i ,
Wherein,,
Figure SMS_70
representing time gating.
Finally, we calculate the output state o through a full link layer 1 …o n }. Wherein,,
Figure SMS_71
and n represents the early stage of the modelingNumber of users. Typically, we use the last output state o n Representing a cascade implicit feature Z.
The step S3 comprises the following sub-steps:
s31 given information cascade diagram
Figure SMS_72
And social graph->
Figure SMS_73
And structural feature E, obtaining an initial data distribution of q (E 0 ),E 0 =E;
S32 based on the diffusion probability model, the forward process gradually adds noise to the data to reduce the prior q (E 0 ) Converting into Gaussian distribution which is easy to process; obtaining model distribution through the inverse process of the diffusion probability model; obtaining user structure embedded E by model distributed sampling ta
Based on the diffusion probability model, the forward process gradually adds a priori q (E 0 ) Converting into Gaussian distribution which is easy to process; the forward process of each time step R e {0,1,2, …, R } is defined as a gaussian change:
Figure SMS_74
Figure SMS_75
wherein beta is r Controlling a process of adding gaussian noise to data for a fixed constant;
Figure SMS_76
the gaussian distribution is represented by the formula,
Figure SMS_77
mean, beta r I represents the variance.
Inverse process simulation conditional distribution using conditional diffusion probability model
Figure SMS_78
Defining a conditional denoising function E θ :(E ta |Z co )→E ta It will Z co As input:
Figure SMS_79
Figure SMS_80
in the method, in the process of the invention,
Figure SMS_81
mean value of gaussian distribution; />
Figure SMS_82
Representing the variance of the gaussian distribution.
We set alpha r :=1-β r And
Figure SMS_83
distribution q (E) r |E 0 ) Expressed as:
Figure SMS_84
Figure SMS_85
the goal of (2) is to eliminate gaussian noise added during diffusion. By minimizing the variational negative log-likelihood, the parameter θ can be fitted to the data distribution q (E 0 ):
Figure SMS_86
The above formula can be simplified and effectively trained by random gradient descent:
Figure SMS_87
wherein the noise
Figure SMS_88
θ Is the predicted output of the noise prediction model. The noise prediction model is shown in fig. 3 and includes an input layer, an attention layer, and an output layer. The input layer consists of a first convolution layer, a time step encoder, a first multi-layer perceptron (MLP) and a second convolution layer in parallel; the first convolution layer and the second convolution layer are both 1×1 convolution layers; the input of the first convolution is the user structural representation +. >
Figure SMS_89
I.e. < ->
Figure SMS_90
The time-step encoder input is a time-step r, which is used to learn the parameter matrix W from the time-step r r The corresponding row of the middle index is taken as output; the input of the first multi-layer perceptron is the cascade implicit characteristic Z obtained in the step S2; the outputs of the first convolution layer, the time step encoder and the first multi-layer perceptron are spliced in dimension to obtain a result which is used as the input of the second convolution layer; the attention layer is output and +_in a second convolution layer using a conventional structure (see Vaswani, ashish, et al, "Attention is all you need." InNIPS, 2017)>
Figure SMS_91
The sum is used as the input of the attention layer, and the output of the attention layer is used as the input of the output layer; taking the second multi-layer perceptron as an output layer, wherein the output E of the second multi-layer perceptron θ
The sampling process may be defined as:
Figure SMS_92
wherein,,
Figure SMS_93
and->
Figure SMS_94
By sampling from r to 0 we obtain a sample E ta For modeling the evolution uncertainty described below.
S33 embedding E according to the user structure ta Acquiring an initial cascade state
Figure SMS_95
The initial concatenation state is then reconstructed from the encoder based on the variance.
Embedding E according to user structure ta According to the step S2 given above, the initial cascade state is obtained by inverse solution
Figure SMS_96
The extrapolation property of the ODE is then utilized to model the cascading evolution dynamics with uncertainty based on the variational self-encoder (VariationalAutoencoders, VAEs) framework. Assume that
Figure SMS_97
Is a true posterior distribution, we use model distribution based on the VAE framework +.>
Figure SMS_98
To approximate the true posterior distribution->
Figure SMS_99
Wherein phi is a parameter of the neural network, by using a linear transformation from +.>
Figure SMS_100
Derived mean->
Figure SMS_101
Sum of variances->
Figure SMS_102
Resampling, combined with a repartitioning technique, results in a reconstructed initial cascade state +.>
Figure SMS_103
Wherein ζ is extracted from the distribution of the n-TaiAnd (5) sampling.
S34 an initial cascade state Z 'reconstructed in probability space using a second ODE solver' 0 Evolution is carried out to obtain implicit characteristic Z 'with uncertain cascade' T
Figure SMS_104
Wherein f ξ (. Cndot.) is the ODE function of calculating the derivative, t n Indicating the last observation instant of the concatenated sequence. In this way we have generated a continuous evolutionary trajectory in which each point represents an implicit variable p (Z 'following a posterior distribution' t |Z′ 0 ,…,Z′ t-1 ). Finally, we optimize model parameters by maximizing the lower bound of Evidence (ELBO):
Figure SMS_105
in the method, in the process of the invention,
Figure SMS_106
expressed as +.>
Figure SMS_107
Is the desire under conditions; />
Figure SMS_108
Represents the KL divergence Kullback-Leibler divergence.
Step S4, namely, the cascade implicit characteristic Z obtained in step S2 and the cascade uncertain implicit characteristic Z obtained in step S3 are combined T Performing a stitching operation, and then inputting them into a Multi-layer perceptron (Multi-Layer Perceptrons, MLPs) for cascade popularity prediction:
Figure SMS_109
wherein Concat (-) represents the splice operation.
In the training process, we target the Mean Square Log Error (MSLE), combining MSLE and ELBO to train the model. The final loss function is defined as:
Figure SMS_110
where N is the total number of cascades, P k (t p ) Representing the number of users of the kth real forwarding cascade,
Figure SMS_111
representing popularity of the kth prediction.
Example 2
The embodiment provides an information cascade popularity prediction system based on probability diffusion, which comprises the following components:
the structure embedding representation acquisition module is used for constructing cascade data into a social graph, a cascade graph and a cascade sequence; then obtaining global structural features according to the structural attributes of the cascade global social graph; obtaining node similarity characteristics according to the cascade diagram; then, splicing the global structural features and the node similarity features to obtain a structural embedded representation of the user;
time-aware neural ordinary differential module T-ODE: the implicit state acquisition module is used for acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure; then obtaining an ODE implicit state by using a first ODE solver according to the implicit state at the previous moment and the implicit state at the current moment; then, the implicit state and the ODE implicit state at the current moment are cascaded to obtain a cascaded implicit state; then obtaining an updated cascade implicit state according to the implicit state at the current moment and the cascade implicit state and outputting cascade implicit characteristics through full connection based on a time-aware gating mechanism;
Cascaded uncertainty modeling module DPM-ODE: generating a target cascade structure embedded representation of a user based on a diffusion probability model by taking cascade implicit characteristics obtained by a time-aware neural ordinary differential module T-ODE as conditions, and resampling; then, evolution is carried out on the resampling result in a probability space by using a second ODE solver to obtain an implicit characteristic with uncertain cascade;
and the cascade popularity prediction model is used for predicting cascade popularity according to the cascade implicit characteristics and the cascade uncertain implicit characteristic splicing result.
The neural ordinary differential module T-ODE of the time perception comprises:
the LSTM unit is used for acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure;
the first ODE solver obtains an ODE implicit state according to the implicit state at the previous moment and the implicit state at the current moment;
the GRU unit is used for cascading the implicit state at the current moment and the implicit state of the ODE to obtain a cascading implicit state;
the updating unit is used for updating the cascade implicit state based on a time-aware gating mechanism;
and the full connection layer is used for carrying out full connection processing on the splicing result of the global structural features and the node similarity features to obtain the structural embedded representation of the user.
The cascade uncertainty modeling module DPM-ODE comprises:
an initial data distribution acquisition unit for giving an information cascade graph
Figure SMS_112
And social graph->
Figure SMS_113
And structural feature E, obtaining an initial data distribution of q (E 0 ),E 0 =E;
Uncertainty-based user structure embedding generation unit, forward process gradually adds a priori q (E 0 ) Converting into Gaussian distribution which is easy to process; obtaining model distribution through the inverse process of the diffusion probability model; obtaining the embedded E of the user structure by distributed sampling of the model ta
Variable self-encoder for obtaining user structure embedded E according to neural ordinary differential module T-ODE perceived by time ta Is of the initial cascade state of (1)
Figure SMS_114
Reconstructing an initial cascade state;
a second ODE solver for evolving the reconstructed initial cascade state in the probability space to obtain a cascade uncertainty implicit feature Z T
Figure SMS_115
Wherein f ξ (. Cndot.) is the ODE function that calculates the derivative.
The cascade popularity prediction model is a multi-layer perceptron.
The training process of the information cascade popularity prediction system based on probability diffusion is as follows: training the information cascade popularity prediction system based on probability diffusion according to the steps S1-S4 by using training data and according to the loss function
Figure SMS_116
Acquiring a loss value, and optimizing system network parameters through random gradient descent (SGD); repeating the above process until the loss value becomes stable.
After the training of the information cascade popularity prediction system based on probability diffusion is completed, inputting the known information cascade into the system, and obtaining the prediction result of the information cascade popularity according to the steps S1-S4 given above.
Application example
The predictive effect of the probability diffusion based information cascade popularity prediction system (CasDO) provided by the examples on three different real datasets (Twitter, weibo, and APS), a first dataset source reference [ L.Weng, F.Menczer, and y. -y.ahn.visual prediction and communitystructure in social networks.scientific Reports, vol.3, no.1, pp.1-6,2013 ], a second dataset source reference [ Q.Cao, H.Shen, K.Cen, W.Ouyang, and x.cheng.deep hawkes: bridging the gap between prediction and understanding of information cascades.in CIKM,2017 ], a third dataset source reference [ https:// journ aps.org/dataseconds ]. The data ratio of the training set to the sample in the test set is 7:1.5:1.5.
Meanwhile, the information cascade popularity prediction system (CasDO) based on probability diffusion provided by the invention is compared with 5 different baseline models (Feature-Deep, deepHawkes, casCN, latentODE, casFlow), MSLE is used as an evaluation index (the smaller the value is, the better the prediction effect is), and the prediction result is shown in a table 1.
Table 1: effect of popularity predictions on application data sets
Figure SMS_117
The rest of the methods in the table are described as follows:
Feature-Deep: and extracting structural features and time features from the information cascade data, and inputting the structural features and the time features into a two-layer multi-layer perceptron for prediction. [ X.Xu, F.Zhou, K.Zhang, S.Liu, and G.Trajcevski.CasFlow: exploring hierarchical structures and propagation uncertainty for cascade prediction.InTKDE, pp.1-14,2021 ]
Deepfhawkes: the method integrates the deep neural network into the popularity prediction of the point-in process, and considers three main aspects of the Hox process, namely the influence of users, the self-excitation mechanism and the time attenuation. [ Q.Cao, H.Shen, K.Cen, W.Ouyang, and X.Cheng. DeepHawkes: bridging the gap between prediction and understanding of information cascades. In CIKM,2017 ]
CasCN: the method combines a recurrent neural network and a graph convolution network, and utilizes time and structure information to carry out cascading prediction. It captures the evolution process by sampling the sub-cascade diagram and using LSTM. [ X.Chen, F.Zhou, K.Zhang, G.Trajcevski, T.Zhong, and F.zhang. Information diffusion prediction via recurrent cascades con-version. InICDE, 2019 ]
LatentODE: it extends the discrete RNN to continuous time concealment dynamics defined by the ODE. It treats the potential representation as a time series variable in the RNN, being able to handle any time interval between observations. [ Y.Rubanova, R.T.Chen, and D.Duvenaud.Latent odes for irregularly sampled time series.In NeuroIPS, 2019 ]
CasFlow: it learns the local and global structures in the information cascade and uses the variation self-encoder and regularized streams to enhance the learned cascade representation. [ X.Xu, F.Zhou, K.Zhang, S.Liu, and G.Trajcevski.CasFlow: exploring hierarchical structures and propagation uncertainty for cascade prediction.InTKDE, pp.1-14,2021 ]
As can be seen from the experimental results in Table 1, the probability diffusion-based information cascade popularity prediction system (CasDO) provided by the invention can greatly improve the accuracy of popularity prediction compared with other baseline models.
Therefore, the information cascading popularity prediction system based on probability diffusion can model the time irregularity of cascading events and the uncertainty of information propagation in the information propagation process, and can also improve the accuracy of popularity prediction. Experiments on three real data sets demonstrate the superior performance of the present invention over the most advanced baseline model. The improvement of the performance of the proposal shows that the advantages of the neural ordinary differential equation are combined with the diffusion probability model, so that the information cascade diffusion process can be effectively simulated, and the information cascade popularity can be predicted more accurately.
In summary, the invention relates to a neural ordinary differential equation and a diffusion probability model, designs time irregularity of a time-aware neural ordinary differential equation modeling cascade event, and better simulates an information propagation process in the real world by promoting discrete cascade states of an information cascade to continuous time dynamics. The diffusion probability model and the implicit ordinary differential equation are then fused to model the uncertainty associated with the information cascade. The diffusion probability model learns uncertainty of user space association by reconstructing the structural features of the cascaded graph, and on the basis, an implicit ordinary differential equation is utilized to define a generating process along with time according to initial cascade state deterministic evolution, so that cascade features of propagation uncertainty are obtained. Finally, the aim of predicting the popularity of the information cascade is achieved by utilizing the continuous time characteristic and the propagation uncertainty characteristic of the cascade.
Those of ordinary skill in the art will recognize that the embodiments described herein are for the purpose of aiding the reader in understanding the principles of the present invention and should be understood that the scope of the invention is not limited to such specific statements and embodiments. Those of ordinary skill in the art can make various other specific modifications and combinations from the teachings of the present disclosure without departing from the spirit thereof, and such modifications and combinations remain within the scope of the present disclosure.

Claims (10)

1. The information cascade popularity prediction method based on probability diffusion is characterized by comprising the following steps of:
s1, constructing cascade data into a social graph, a cascade graph and a cascade sequence; then obtaining global structural features according to the structural attributes of the cascade global social graph; obtaining node similarity characteristics according to the cascade diagram; then, splicing the global structural features and the node similarity features to obtain a structural embedded representation of the user;
s2, acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure; then obtaining an ODE implicit state by using a first ODE solver according to the implicit state at the previous moment and the implicit state at the current moment; then, the implicit state and the ODE implicit state at the current moment are cascaded to obtain a cascaded implicit state; then obtaining an updated cascade implicit state according to the implicit state at the current moment and the cascade implicit state and outputting cascade implicit characteristics through full connection based on a time-aware gating mechanism;
S3, generating a target cascade structure embedded representation of a user based on the diffusion probability model by taking the cascade implicit characteristic obtained in the step S2 as a condition, and resampling; then, evolution is carried out on the resampling result in a probability space by using a second ODE solver to obtain an implicit characteristic with uncertain cascade;
s4, predicting the cascade popularity according to the cascade implicit characteristic obtained in the step S2 and the cascade uncertain implicit characteristic splicing result.
2. The method for predicting popularity of information cascade based on probability diffusion as claimed in claim 1, wherein in step S1, cascade data is constructed as social graph
Figure FDA0003989461630000011
Cascading diagram->
Figure FDA0003989461630000012
A cascading sequence; then learn the cascade global social graph +.>
Figure FDA0003989461630000013
Obtaining global structural feature E g The method comprises the steps of carrying out a first treatment on the surface of the Modeling a cascade graph using a graph wavelet model +.>
Figure FDA0003989461630000014
Obtaining node similarity features E c The method comprises the steps of carrying out a first treatment on the surface of the Finally to E g And E is c And performing splicing operation to obtain the structural representation E of the user.
3. The method for predicting the popularity of the information cascade based on the probability diffusion according to claim 1, wherein in the step S2, the implicit state of the current time is obtained by using the LSTM unit according to the implicit state of the previous time and the embedded representation of the user structure.
4. The method for predicting popularity of information cascade based on probability diffusion according to claim 1, wherein in step S2, the implicit state of the current time and the implicit state of ODE are cascaded by using a GRU unit to obtain the implicit state of cascade,
h″ i =GRUCell(θ g ,h′ i ,z i ),
wherein,,
Figure FDA0003989461630000015
is the ODE from time t i-1 By time t i Is a solution of h i Is the cascade implicit state after GRU unit update, θ g Representing model parameters that are learnable in the GRU unit;
given an implicit state h' i And h' i Updating concatenated implicit state h using time-aware gating mechanisms i
h i =ν i ⊙h″ i +(1-ν i )⊙h′ i
Wherein,,
Figure FDA0003989461630000021
representing time gating.
5. The method for predicting popularity of a cascade of information based on probability diffusion of claim 1, wherein step S3 comprises the sub-steps of:
s31 given information cascade diagram
Figure FDA0003989461630000022
And social graph->
Figure FDA0003989461630000023
And structural feature E, obtaining an initial data distribution of q (E 0 ),E 0 =E;
S32 based on the diffusion probability model, the forward process gradually adds noise to the data to reduce the prior q (E 0 ) Converting into Gaussian distribution which is easy to process; obtaining model distribution through the inverse process of the diffusion probability model; obtaining user structure embedded E by model distributed sampling ta
S33 embedding E according to the user structure ta Acquiring an initial cascade state
Figure FDA0003989461630000024
Then reconstructing an initial cascade state based on the variation self-encoder;
s34 makingEvolution of the reconstructed initial cascade state in the probability space by using a second ODE solver to obtain an implicit characteristic Z 'with uncertain cascade' T
Figure FDA0003989461630000025
Wherein f ξ (. Cndot.) is the ODE function that calculates the derivative.
6. The information cascade popularity prediction method based on probability diffusion according to claim 1, wherein in step S4, cascade popularity is obtained by using a multi-layer perceptron according to the concatenation result of the cascade implicit feature and the cascade uncertain implicit feature obtained in step S2.
7. An information cascade popularity prediction system based on probability diffusion, comprising:
the structure embedding representation acquisition module is used for constructing cascade data into a social graph, a cascade graph and a cascade sequence; then obtaining global structural features according to the structural attributes of the cascade global social graph; obtaining node similarity characteristics according to the cascade diagram; then, splicing the global structural features and the node similarity features to obtain a structural embedded representation of the user;
time-aware neural ordinary differential module T-ODE: the implicit state acquisition module is used for acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure; then obtaining an ODE implicit state by using a first ODE solver according to the implicit state at the previous moment and the implicit state at the current moment; then, the implicit state and the ODE implicit state at the current moment are cascaded to obtain a cascaded implicit state; then obtaining an updated cascade implicit state according to the implicit state at the current moment and the cascade implicit state and outputting cascade implicit characteristics through full connection based on a time-aware gating mechanism;
Cascaded uncertainty modeling module DPM-ODE: generating a target cascade structure embedded representation of a user based on a diffusion probability model by taking cascade implicit characteristics obtained by a time-aware neural ordinary differential module T-ODE as conditions, and resampling; then, evolution is carried out on the resampling result in a probability space by using a second ODE solver to obtain an implicit characteristic with uncertain cascade;
and the cascade popularity prediction model is used for predicting cascade popularity according to the cascade implicit characteristics and the cascade uncertain implicit characteristic splicing result.
8. The information cascade popularity prediction system based on probability diffusion of claim 7, wherein the time-aware neural ordinary differential module T-ODE comprises:
the LSTM unit is used for acquiring the implicit state of the current moment according to the implicit state of the previous moment and the embedded representation of the user structure;
the first ODE solver obtains an ODE implicit state according to the implicit state at the previous moment and the implicit state at the current moment;
the GRU unit is used for cascading the implicit state at the current moment and the implicit state of the ODE to obtain a cascading implicit state;
the updating unit is used for updating the cascade implicit state based on a time-aware gating mechanism;
And the full connection layer is used for carrying out full connection processing on the splicing result of the global structural features and the node similarity features to obtain the structural embedded representation of the user.
9. The information cascade popularity prediction system of claim 7, wherein the cascade uncertainty modeling module DPM-ODE comprises:
an initial data distribution acquisition unit for giving an information cascade graph
Figure FDA0003989461630000033
And social graph->
Figure FDA0003989461630000034
And structural feature E, obtaining an initial data distribution of q (E 0 ),E 0 =E;
Uncertainty-based user structure embedding generation unit, forward process gradually adds a priori q (E 0 ) Converting into Gaussian distribution which is easy to process; obtaining model distribution through the inverse process of the diffusion probability model; obtaining the embedded E of the user structure by distributed sampling of the model ta
Variable self-encoder for obtaining user structure embedded E according to neural ordinary differential module T-ODE perceived by time ta Is of the initial cascade state of (1)
Figure FDA0003989461630000031
Reconstructing an initial cascade state;
a second ODE solver for evolving the reconstructed initial cascade state in the probability space to obtain a cascade uncertainty implicit feature Z' T
Figure FDA0003989461630000032
Wherein f ξ (. Cndot.) is the ODE function that calculates the derivative.
10. The information cascade popularity prediction system based on probability diffusion according to claim 7, wherein a multi-layer perceptron is used as a cascade popularity prediction model.
CN202211577433.4A 2022-08-30 2022-12-09 Information cascading popularity prediction method and system based on probability diffusion Pending CN116308854A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2022110528920 2022-08-30
CN202211052892 2022-08-30

Publications (1)

Publication Number Publication Date
CN116308854A true CN116308854A (en) 2023-06-23

Family

ID=86836527

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211577433.4A Pending CN116308854A (en) 2022-08-30 2022-12-09 Information cascading popularity prediction method and system based on probability diffusion

Country Status (1)

Country Link
CN (1) CN116308854A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117076931A (en) * 2023-10-12 2023-11-17 北京科技大学 Time sequence data prediction method and system based on conditional diffusion model
CN117610717A (en) * 2023-11-13 2024-02-27 重庆大学 Information popularity prediction method based on double-variation cascade self-encoder

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117076931A (en) * 2023-10-12 2023-11-17 北京科技大学 Time sequence data prediction method and system based on conditional diffusion model
CN117076931B (en) * 2023-10-12 2024-01-12 北京科技大学 Time sequence data prediction method and system based on conditional diffusion model
CN117610717A (en) * 2023-11-13 2024-02-27 重庆大学 Information popularity prediction method based on double-variation cascade self-encoder

Similar Documents

Publication Publication Date Title
CN110929164B (en) Point-of-interest recommendation method based on user dynamic preference and attention mechanism
CN110928993A (en) User position prediction method and system based on deep cycle neural network
CN116308854A (en) Information cascading popularity prediction method and system based on probability diffusion
CN110336700B (en) Microblog popularity prediction method based on time and user forwarding sequence
CN111241421B (en) User forwarding behavior prediction method based on social context information
CN113536144B (en) Social network information propagation scale prediction method and device
CN114595396B (en) Federal learning-based sequence recommendation method and system
Konomi et al. Bayesian treed multivariate gaussian process with adaptive design: Application to a carbon capture unit
CN112182423A (en) Information propagation evolution trend prediction method based on attention mechanism
Bedi et al. CitEnergy: A BERT based model to analyse Citizens’ Energy-Tweets
Mohammad Nezhad et al. An artificial neural network meta-model for constrained simulation optimization
CN114928548A (en) Social network information propagation scale prediction method and device
CN115051929B (en) Network fault prediction method and device based on self-supervision target perception neural network
Xiao et al. User behavior prediction of social hotspots based on multimessage interaction and neural network
CN115660147A (en) Information propagation prediction method and system based on influence modeling between propagation paths and in propagation paths
Li et al. [Retracted] Emotion Analysis Model of Microblog Comment Text Based on CNN‐BiLSTM
Yu et al. Missing air pollution data recovery based on long-short term context encoder
CN117688255A (en) Recommendation method and system based on double social view contrast learning
CN116993185A (en) Time sequence prediction method, device, equipment and storage medium
CN115953215B (en) Search type recommendation method based on time and graph structure
Chen et al. A forecasting system of micro-blog public opinion based on artificial neural network
Roy et al. Air quality index forecasting using hybrid neural network model with LSTM on AQI sequences
CN114118530A (en) Prediction method and device based on multi-household power consumption prediction model
CN115495669A (en) Social media content popularity prediction method integrating multi-scale cascade and time sequence characteristics
CN117539963B (en) Dynamic analysis method and system for social network data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination