CN114821119B - Method and device for training graph neural network model aiming at graph data invariant features - Google Patents

Method and device for training graph neural network model aiming at graph data invariant features

Info

Publication number
CN114821119B
CN114821119B CN202210714507.8A
Authority
CN
China
Prior art keywords
invariant
vector
environmental noise
intrinsic feature
graph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210714507.8A
Other languages
Chinese (zh)
Other versions
CN114821119A (en)
Inventor
王翔 (Xiang Wang)
李思杭 (Sihang Li)
何向南 (Xiangnan He)
张岸 (An Zhang)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology of China USTC
Original Assignee
University of Science and Technology of China USTC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology of China USTC filed Critical University of Science and Technology of China USTC
Priority to CN202210714507.8A priority Critical patent/CN114821119B/en
Publication of CN114821119A publication Critical patent/CN114821119A/en
Application granted granted Critical
Publication of CN114821119B publication Critical patent/CN114821119B/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/217 Validation; Performance evaluation; Active pattern learning techniques
    • G06F18/2193 Validation; Performance evaluation; Active pattern learning techniques based on specific statistical tests
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a method and a device for training a graph neural network model for invariant features of graph data. The method comprises the following steps: performing feature extraction on the original graph by using an interpretation generator to obtain the probability distribution of an invariant intrinsic feature subgraph and the probability distribution of an environmental noise subgraph; sampling according to a preset sampling proportion to obtain two invariant intrinsic feature enhanced views and an environmental noise enhanced view; processing these views with a backbone graph neural network encoder to obtain a first invariant intrinsic feature graph vector, a second invariant intrinsic feature graph vector and an environmental noise graph vector; obtaining a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector and an environmental noise hidden space vector by using a multi-layer perceptron projection head; optimizing the backbone graph neural network encoder using a loss value; and iterating the feature extraction, sampling, processing and optimization operations until the loss value converges to a preset condition, so as to obtain the trained backbone graph neural network encoder.

Description

Method and device for training graph neural network model aiming at graph data invariant features
Technical Field
The invention relates to the field of graph data processing and artificial intelligence, and in particular to a method and a device for training a graph neural network model for invariant features of graph data, an electronic device, and a storage medium.
Background
Although deep learning has made great progress in many research fields in recent years, it remains a data-driven method, and in practical applications labeled training samples are often difficult to acquire and limited in number. In the field of graph neural networks, prior-art graph data enhancement and contrastive learning techniques likewise suffer from problems such as high training cost and poor model generalization ability.
Disclosure of Invention
In view of the above problems, the present invention provides a method and an apparatus for training a graph neural network model for graph data invariant features, an electronic device, and a storage medium, so as to solve at least one of the above problems.
According to a first aspect of the present invention, there is provided a training method of a graph neural network model for graph data invariant features, comprising:
performing feature extraction on an original graph by using an interpretation generator (RG) to obtain the probability distribution of an invariant intrinsic feature subgraph and the probability distribution of an environmental noise subgraph;
sampling the probability distribution of the invariant intrinsic feature subgraph according to a preset sampling proportion to obtain a first invariant intrinsic feature enhanced view and a second invariant intrinsic feature enhanced view, and sampling the probability distribution of the environmental noise subgraph according to the preset sampling proportion to obtain an environmental noise enhanced view;
processing the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view by using a backbone graph neural network encoder to obtain a first invariant intrinsic feature graph vector, a second invariant intrinsic feature graph vector and an environmental noise graph vector;
processing the first invariant intrinsic feature graph vector, the second invariant intrinsic feature graph vector and the environmental noise graph vector by using a Multi-Layer Perceptron (MLP) Projection Head (PH) to obtain a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector and an environmental noise hidden space vector;
inputting the first invariant intrinsic characteristic hidden space vector, the second invariant intrinsic characteristic hidden space vector and the environmental noise hidden space vector into a loss function to obtain a loss value, and optimizing a backbone graph neural network encoder according to the loss value;
and iterating the feature extraction operation, sampling operation, processing operations and optimization operation until the loss value converges to a preset condition, so as to obtain the trained backbone graph neural network encoder.
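Taken together, the steps above form one training iteration. The following is a minimal sketch of that loop, not the patented implementation: all component names (`rg`, `sampler`, `encoder`, `projector`, `loss_fn`) are hypothetical stand-ins, and the parameter update of the optimization step is omitted.

```python
def train_loop(rg, sampler, encoder, projector, loss_fn, graphs,
               tol=1e-4, max_iters=100):
    """One pass per iteration over the graph training data; stops once the
    summed loss changes by less than `tol` (a simple convergence criterion)."""
    prev = float("inf")
    total = 0.0
    for _ in range(max_iters):
        total = 0.0
        for g in graphs:
            p_inv, p_noise = rg(g)                          # subgraph distributions
            v1, v2 = sampler(g, p_inv), sampler(g, p_inv)   # two invariant views
            vn = sampler(g, p_noise)                        # environmental noise view
            z1, z2, zn = (projector(encoder(v)) for v in (v1, v2, vn))
            total += loss_fn(z1, z2, zn)                    # loss; update step omitted
        if abs(prev - total) < tol:                         # converged to preset condition
            break
        prev = total
    return total
```

With trivial stand-ins for each component, the loop runs to convergence after one stable pass.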
According to an embodiment of the invention, the interpretation generator comprises a graph neural network encoder and a multi-layer perceptron.
According to the embodiment of the invention, performing feature extraction on the original graph by using the interpretation generator to obtain the probability distribution of the invariant intrinsic feature subgraph and the probability distribution of the environmental noise subgraph comprises the following steps:
processing the original graph by using the graph neural network encoder to obtain vector expressions of all nodes of the original graph;
calculating the vector expression of each node in the original graph by using the multi-layer perceptron to obtain an importance score for each node in the original graph;
normalizing the importance score of each node in the original graph to obtain a sampling probability for each node in the original graph;
and obtaining the probability distribution of the invariant intrinsic feature subgraph and the probability distribution of the environmental noise subgraph according to the sampling probability of each node in the original graph.
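A concrete reading of the normalization step above: the patent does not name the normalization function, so the per-node sigmoid below is an assumption, chosen so that each node's invariant and environmental-noise probabilities are complementary.

```python
import math

def node_probs(scores):
    """Map per-node importance scores to sampling probabilities.

    Returns (p_inv, p_noise): p_inv[v] is the probability that node v belongs
    to the invariant intrinsic feature subgraph, and p_noise[v] = 1 - p_inv[v]
    the probability that it belongs to the environmental noise subgraph.
    Sigmoid normalization is an assumption, not specified by the patent.
    """
    p_inv = [1.0 / (1.0 + math.exp(-s)) for s in scores]
    p_noise = [1.0 - p for p in p_inv]
    return p_inv, p_noise
```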
According to the embodiment of the invention, the probability distribution of the invariant intrinsic feature subgraph is determined by formula (1):

$$P(G_s \mid G) = \prod_{v \in V_s} p_v \tag{1}$$

and the probability distribution of the environmental noise subgraph is determined by formula (2):

$$P(G_n \mid G) = \prod_{v \in V_n} (1 - p_v) \tag{2}$$

wherein $G_s$ represents the invariant intrinsic feature subgraph, $G_n$ represents the environmental noise subgraph, $v$ represents a node, $V$ represents the node set of the original graph $G$, $V_s$ represents the node set of the invariant intrinsic feature subgraph, $V_n = V \setminus V_s$ represents the node set of the environmental noise subgraph, and $p_v$ represents the sampling probability of node $v$ of the original graph $G$.
According to an embodiment of the present invention, the processing the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view, and the environmental noise enhanced view by using the backbone graph neural network encoder to obtain the first invariant intrinsic feature graph vector, the second invariant intrinsic feature graph vector, and the environmental noise graph vector includes:
respectively extracting vector expressions of each node in the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view by using the backbone graph neural network encoder;
performing point multiplication on the vector expression of each node in the first invariant intrinsic characteristic enhancement view and the probability distribution of the invariant intrinsic characteristic subgraph to obtain a first point multiplication result;
performing point multiplication on the vector expression of each node in the second invariant intrinsic characteristic enhancement view and the probability distribution of the invariant intrinsic characteristic subgraph to obtain a second point multiplication result;
carrying out point multiplication on the vector expression of each node in the environmental noise enhancement view and the probability distribution of the environmental noise subgraph to obtain an environmental noise point multiplication result;
and performing pooling operation on the first point multiplication result, the second point multiplication result and the environmental noise point multiplication result respectively to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector and an environmental noise map vector.
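The steps above amount to weighting each node embedding by its subgraph probability and pooling the weighted embeddings into a single graph vector. A sketch follows; mean pooling is an assumption, since the patent only says "pooling operation".

```python
import numpy as np

def graph_vector(node_embs, node_probs):
    """node_embs: (num_nodes, dim) array of node vector expressions.
    node_probs: per-node subgraph probabilities.
    The elementwise weighting is the 'point multiplication' of the steps
    above; mean pooling then yields the graph-level vector."""
    weighted = node_embs * np.asarray(node_probs)[:, None]
    return weighted.mean(axis=0)
```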
According to an embodiment of the present invention, the above-mentioned loss function includes a sufficiency loss function and an independence loss function;
wherein the loss function is determined by equation (3):

$$\min_{RG,\, f,\, g} \; \mathbb{E}_{G \in \mathbb{G}} \left[ \mathcal{L}_{su} + \lambda \, \mathcal{L}_{ind} \right] \tag{3}$$

wherein $\lambda$ is a hyper-parameter that balances the sufficiency loss function and the independence loss function, $RG$ represents the interpretation generator, $f$ represents the backbone graph neural network encoder, $g$ represents the multi-layer perceptron projection head, $\mathcal{L}_{su}$ represents the sufficiency loss function, $\mathcal{L}_{ind}$ represents the independence loss function, $G$ represents the original graph, $\mathbb{G}$ represents the collection of original graphs, and $\mathbb{E}$ represents the mathematical expectation.
According to an embodiment of the present invention, the sufficiency loss function is determined by equation (4):

$$\mathcal{L}_{su} = -\log \frac{\exp(z_1^{\top} z_2 / \tau)}{\exp(z_1^{\top} z_2 / \tau) + \sum_{z^{-} \in Z^{-}} \exp(z_1^{\top} z^{-} / \tau)} \tag{4}$$

and the independence loss function is determined by equation (5):

$$\mathcal{L}_{ind} = -\log \frac{\exp(z_1^{\top} z_2 / \tau)}{\exp(z_1^{\top} z_2 / \tau) + \sum_{z^{-} \in Z^{-}} \exp(z_1^{\top} z^{-} / \tau) + \sum_{z_n \in \mathcal{N}} \exp(z_1^{\top} z_n / \tau)} \tag{5}$$

wherein $z_1^{\top}$ represents the transpose of the first invariant intrinsic feature hidden space vector, $z_2$ represents the second invariant intrinsic feature hidden space vector, $z_n$ represents the environmental noise hidden space vector, $\tau$ represents the temperature hyper-parameter, $\mathcal{N}$ represents the set of environmental noise hidden space vectors generated from each original graph datum in the same batch of graph training data, and $Z^{-}$ represents the set of hidden space vectors $z^{-}$ obtained by passing the enhanced views of the other samples in the same batch of graph training data (excluding the original graph $G$) through the backbone graph neural network encoder and the multi-layer perceptron projection head.
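The loss computation described above can be sketched numerically as follows. The InfoNCE-style form is reconstructed from the symbol descriptions, so treat it as an illustration rather than the exact patented formula.

```python
import numpy as np

def info_nce(z1, z2, negatives, tau=0.5):
    """Contrastive term: pull z1 toward z2, push it away from `negatives`."""
    pos = np.exp(z1 @ z2 / tau)
    neg = sum(np.exp(z1 @ z / tau) for z in negatives)
    return -np.log(pos / (pos + neg))

def total_loss(z1, z2, z_noise, other_views, lam=0.1, tau=0.5):
    l_su = info_nce(z1, z2, other_views, tau)               # sufficiency term
    l_ind = info_nce(z1, z2, other_views + [z_noise], tau)  # independence term
    return l_su + lam * l_ind                               # combined loss
```

Adding the environmental noise vector to the negative set is what pushes the invariant representation apart from the noise representation; a larger negative set strictly increases the loss.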
According to a second aspect of the present invention, there is provided a training apparatus for a graph neural network model for graph data invariant features, comprising:
the feature extraction module is used for extracting features of the original graph by using the interpretation generator to obtain the probability distribution of the invariant intrinsic feature subgraph and the probability distribution of the environmental noise subgraph;
the sampling module is used for sampling the probability distribution of the invariant intrinsic characteristic subgraph according to a preset sampling proportion to obtain a first invariant intrinsic characteristic enhanced view and a second invariant intrinsic characteristic enhanced view, and sampling the probability distribution of the environmental noise subgraph according to the preset sampling proportion to obtain an environmental noise enhanced view;
the first processing module is used for processing the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view by using the backbone graph neural network encoder to obtain a first invariant intrinsic feature graph vector, a second invariant intrinsic feature graph vector and an environmental noise graph vector;
the second processing module is used for processing the first invariant intrinsic feature map vector, the second invariant intrinsic feature map vector and the environmental noise map vector by using the multi-layer perceptron projector head to obtain a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector and an environmental noise hidden space vector;
the optimization module is used for inputting the first invariant intrinsic characteristic hidden space vector, the second invariant intrinsic characteristic hidden space vector and the environmental noise hidden space vector into a loss function to obtain a loss value and optimizing the backbone graph neural network encoder according to the loss value;
and the iterative training module is used for iteratively performing feature extraction operation, sampling operation, processing operation and optimization operation until the loss value is converged to a preset condition to obtain the trained backbone graph neural network encoder.
According to a third aspect of the present invention, there is provided an electronic apparatus comprising:
one or more processors;
a storage device for storing one or more programs,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform a method of training a graph neural network model for graph data invariant features.
According to a fourth aspect of the present invention, there is provided a computer readable storage medium having stored thereon executable instructions that, when executed by a processor, cause the processor to perform a method of training a graph neural network model for graph data invariant features.
According to the method and the device for training a graph neural network model for invariant features of graph data provided by the invention, the interpretation generator captures the nodes containing important semantic information in the graph data, endowing the graph neural network model with the ability to interpret its input data and improving the performance of the backbone graph neural network encoder in the graph neural network model, thereby reducing the training cost of the model, improving the generalization ability of the model, and expanding the application scenarios of the trained model.
Drawings
FIG. 1 is a flow diagram of a method of training a graph neural network model for graph data invariant features, according to an embodiment of the present invention;
FIG. 2 is a flow chart for obtaining probability distribution of an invariant intrinsic feature sub-graph and probability distribution of an environmental noise sub-graph according to an embodiment of the present invention;
FIG. 3 is a flow chart of obtaining a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an ambient noise map vector according to an embodiment of the present invention;
FIG. 4 is a diagram of a comparative learning framework for graph data invariant features, according to an embodiment of the present invention;
FIG. 5 is a block diagram of a training apparatus for a graph neural network model for graph data invariant features, in accordance with an embodiment of the present invention;
FIG. 6 is a diagram of an MNIST visualization according to an embodiment of the present invention;
FIG. 7 is a diagram illustrating a MUTAG visualization according to an embodiment of the present invention;
FIG. 8 schematically illustrates a block diagram of an electronic device suitable for implementing a method of training a graph neural network model for graph data invariant features in accordance with an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings and specific embodiments.
At present, methods based on self-supervised contrastive learning pre-train on a large amount of unlabeled data and then fine-tune model parameters with a small amount of labeled data, which significantly improves model performance on downstream tasks; such methods are therefore widely applied in the field of graph neural networks.
In general, a graph contrastive learning framework can be divided into two parts. The first is a graph data enhancement module, which creates multiple enhanced views of the original graph data through various graph data enhancement methods. The second is a contrastive learning module, which pulls the hypersphere projections of the multiple enhanced views of the same sample as close together as possible while forcing the projections of the enhanced views of different samples to disperse as much as possible. Clearly, the graph data enhancement module plays an important role in characterizing the essential features of the graph data.
Current graph data enhancement techniques can be broadly divided into two categories. The first is methods based on a random mechanism. Graph data often contain rich structural information, and randomly sampling nodes, edges or attributes can change the semantic information of the graph data (for example, discarding the cyano group of a cyanide molecule changes the molecule from toxic to non-toxic), thereby misleading the subsequent contrastive learning module. The second is methods based on domain knowledge. Guided by expert knowledge, the most important substructures of the graph data are identified and preserved as much as possible during data enhancement, so that the essential features and semantic information of the graph data are maintained. However, expert knowledge is often expensive and in some scenarios difficult to obtain at all, and models pre-trained with domain-specific expert knowledge are difficult to generalize to new domains. Methods based on domain knowledge therefore remain very limited in application.
One of the purposes of the invention is to explore the influence of the data enhancement module on the performance of the pre-trained model in graph contrastive learning, and to design an automatic graph data enhancement method that does not depend on expert knowledge while retaining the essential features and semantic information of the original graph data, thereby improving the performance of existing graph contrastive learning frameworks.
In the field of interpretability research, the invariant intrinsic property (or invariant intrinsic feature) of a graph data training sample is generally defined as the part of the input features that determines its distinction from other samples. According to this definition, the invariant intrinsic feature subgraph $G_s$ of graph data $G$ should meet two conditions. The first is the sufficiency condition, i.e. $G_s$ retains the semantic information in the graph data $G$ that is relevant to the possible prediction result $y$: $P(y \mid G) = P(y \mid G_s)$, where $P(\cdot \mid \cdot)$ represents the conditional probability density function for a given input. The second is the independence condition: the complement $G_n$ of $G_s$ in the graph data $G$ is defined as the environmental noise, and $G_n$ should be independent of the possible prediction result $y$: $y \perp G_n$, where $\perp$ indicates probabilistic independence. The sufficiency and independence conditions together guarantee that $G_s$ captures all the features of the graph data $G$ that distinguish it from the remaining samples, while preventing $G_s$ from degenerating into capturing only a very small number of background features.
FIG. 1 is a flow chart of a method of training a graph neural network model for graph data invariant features, according to an embodiment of the present invention.
As shown in FIG. 1, the method includes operations S110 to S160.
In operation S110, feature extraction is performed on the original graph by using the interpretation generator to obtain the invariant intrinsic feature subgraph probability distribution and the environmental noise subgraph probability distribution.
The interpretation generator (RG) is essentially an intrinsic feature extraction neural network intended to extract the features of the original graph. The invariant intrinsic feature subgraph and the environmental noise subgraph are complementary with respect to the original graph, and so are their probability distributions.
In operation S120, the invariant intrinsic feature sub-graph probability distribution is sampled according to a preset sampling ratio to obtain a first invariant intrinsic feature enhanced view and a second invariant intrinsic feature enhanced view, and the environmental noise sub-graph probability distribution is sampled according to the preset sampling ratio to obtain an environmental noise enhanced view.
The first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view meet the requirements of sufficiency and independence.
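One plausible reading of the sampling operation above is to draw a fixed fraction of nodes without replacement, weighted by the subgraph probability distribution. The weighted-draw scheme below is an assumption, since the patent does not specify the sampling mechanism.

```python
import random

def sample_view(nodes, probs, ratio=0.7, rng=random):
    """Sample round(ratio * |V|) nodes without replacement, each draw
    weighted by the (renormalized) subgraph probabilities of the
    remaining nodes. `ratio` plays the role of the preset sampling
    proportion."""
    k = max(1, round(ratio * len(nodes)))
    remaining = list(zip(nodes, probs))
    view = []
    for _ in range(k):
        total = sum(p for _, p in remaining)
        r = rng.random() * total
        acc = 0.0
        for i, (node, p) in enumerate(remaining):
            acc += p
            if r <= acc:
                view.append(node)
                del remaining[i]
                break
    return view
```

Calling it twice on the invariant distribution yields the two invariant enhanced views; calling it once on the noise distribution yields the environmental noise enhanced view.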
In operation S130, the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view, and the environmental noise enhanced view are processed by using the backbone graph neural network encoder to obtain a first invariant intrinsic feature graph vector, a second invariant intrinsic feature graph vector, and an environmental noise graph vector.
In operation S140, the first invariant intrinsic feature graph vector, the second invariant intrinsic feature graph vector, and the environmental noise graph vector are processed by a Multi-Layer Perceptron (MLP) Projection Head (PH) to obtain a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector, and an environmental noise hidden space vector.
In operation S150, the first invariant intrinsic feature hidden space vector, the second invariant intrinsic feature hidden space vector, and the environmental noise hidden space vector are input into a loss function to obtain a loss value, and the backbone neural network encoder is optimized according to the loss value.
In operation S160, the feature extraction operation, the sampling operation, the processing operation, and the optimization operation are performed iteratively until the loss value converges to the preset condition, so as to obtain the trained backbone graph neural network encoder.
The preset conditions include, but are not limited to: the loss value converges to a fixed value, the loss value oscillates within a certain interval, or the decrease in the loss value reaches the expected effect.
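The preset conditions listed above can be checked with a small helper. The sliding-window test below is one simple instance (the window size and tolerance are assumptions) that covers both "converges to a fixed value" and "oscillates in a certain interval".

```python
def converged(losses, window=10, eps=1e-4):
    """True once the loss range over the last `window` iterations is below
    `eps`, i.e. the loss has settled to a value or a narrow oscillation band."""
    if len(losses) < window:
        return False
    recent = losses[-window:]
    return max(recent) - min(recent) < eps
```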
According to the method and the device for training a graph neural network model for invariant features of graph data provided by the invention, the interpretation generator captures the nodes containing important semantic information in the graph data, endowing the graph neural network model with the ability to interpret its input data and improving the performance of the backbone graph neural network encoder in the graph neural network model, thereby reducing the training cost of the model, improving the generalization ability of the model, and expanding the application scenarios of the trained model.
According to an embodiment of the invention, the interpretation generator comprises a neural network encoder and a multi-layer perceptron.
FIG. 2 is a flow chart for obtaining probability distribution of an invariant intrinsic feature sub-graph and probability distribution of an environmental noise sub-graph according to an embodiment of the present invention.
As shown in fig. 2, performing feature extraction on the original graph by using the interpretation generator to obtain the probability distribution of the invariant intrinsic feature subgraph and the probability distribution of the environmental noise subgraph includes operations S210 to S240.
In operation S210, the original graph is processed by the graph neural network encoder to obtain vector expressions of all nodes of the original graph.
In operation S220, the vector expression of each node in the original graph is calculated using the multi-layer perceptron to obtain an importance score for each node in the original graph.
In operation S230, the importance score of each node in the original graph is normalized to obtain a sampling probability for each node in the original graph.
In operation S240, according to the sampling probability of each node in the original graph, a probability distribution of the invariant intrinsic feature subgraph and a probability distribution of the environmental noise subgraph are obtained.
According to the embodiment of the invention, the probability distribution of the invariant intrinsic feature subgraph is determined by formula (1):

$$P(G_s \mid G) = \prod_{v \in V_s} p_v \tag{1}$$

and the probability distribution of the environmental noise subgraph is determined by formula (2):

$$P(G_n \mid G) = \prod_{v \in V_n} (1 - p_v) \tag{2}$$

wherein $G_s$ represents the invariant intrinsic feature subgraph, $G_n$ represents the environmental noise subgraph, $v$ represents a node, $V$ represents the node set of the original graph $G$, $V_s$ represents the node set of the invariant intrinsic feature subgraph, $V_n = V \setminus V_s$ represents the node set of the environmental noise subgraph, and $p_v$ represents the sampling probability of node $v$ of the original graph $G$.
Fig. 3 is a flowchart for obtaining a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an ambient noise map vector according to an embodiment of the present invention.
As shown in fig. 3, processing the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view by using the backbone graph neural network encoder to obtain the first invariant intrinsic feature graph vector, the second invariant intrinsic feature graph vector and the environmental noise graph vector includes operations S310 to S350.
In operation S310, vector expressions of each node in the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view are respectively extracted by using a backbone graph neural network encoder.
In operation S320, the vector expression of each node in the first invariant intrinsic feature enhancement view is point-multiplied with the probability distribution of the invariant intrinsic feature subgraph to obtain a first point-multiplied result.
In operation S330, the vector expression of each node in the second invariant intrinsic feature enhanced view is point-multiplied with the probability distribution of the invariant intrinsic feature subgraph to obtain a second point-multiplied result.
In operation S340, the vector expression of each node in the environmental noise enhanced view is point-multiplied with the environmental noise subgraph probability distribution to obtain an environmental noise point-multiplied result.
In operation S350, pooling operations are performed on the first point multiplication result, the second point multiplication result, and the ambient noise point multiplication result, respectively, to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an ambient noise map vector.
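The point-multiplication and pooling steps above can be sketched as follows. This is a minimal illustration with plain Python lists standing in for node embedding matrices; the function name is an assumption:

```python
def probability_weighted_pool(node_embeddings, node_probs):
    """Point-multiply each node's vector expression by its sampling probability
    (so gradients can later flow back to the interpretation generator), then
    mean-pool the weighted vectors into a single graph-level vector."""
    weighted = [[p * x for x in emb] for emb, p in zip(node_embeddings, node_probs)]
    n = len(weighted)
    return [sum(col) / n for col in zip(*weighted)]

# Two nodes with 2-dimensional expressions and sampling probabilities 1.0 and 0.5.
graph_vec = probability_weighted_pool([[1.0, 2.0], [3.0, 4.0]], [1.0, 0.5])
```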
According to an embodiment of the present invention, the above-mentioned loss function includes a sufficiency loss function and an independence loss function;

wherein the loss function is determined by equation (3):

$$\min_{g,\,f,\,h}\; \mathbb{E}_{G \in \mathcal{G}} \bigl[\, \mathcal{L}_{\mathrm{suff}} + \lambda\, \mathcal{L}_{\mathrm{ind}} \,\bigr] \qquad (3),$$

wherein $\lambda$ is a hyper-parameter that balances the sufficiency loss function and the independence loss function, $g$ represents the interpretation generator, $f$ represents the backbone graph neural network encoder, $h$ represents the multi-layer perceptron projector head, $\mathcal{L}_{\mathrm{suff}}$ represents the sufficiency loss function, $\mathcal{L}_{\mathrm{ind}}$ represents the independence loss function, $G$ represents the original graph, $\mathcal{G}$ represents the set of original graphs, and $\mathbb{E}$ represents the mathematical expectation.

The loss function takes both sufficiency and independence into account, and therefore better reflects the important semantic information of the graph data.
According to an embodiment of the present invention, the above-mentioned sufficiency loss function is determined by equation (4):

$$\mathcal{L}_{\mathrm{suff}} = -\log \frac{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr)}{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr) + \sum_{z^{-} \in Z^{-}} \exp\!\bigl(z_1^{\top} z^{-} / \tau\bigr)} \qquad (4),$$

wherein the independence loss function is determined by equation (5):

$$\mathcal{L}_{\mathrm{ind}} = -\log \frac{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr)}{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr) + \sum_{z_n \in Z_n} \exp\!\bigl(z_1^{\top} z_n / \tau\bigr)} \qquad (5),$$

wherein $z_1^{\top}$ represents the transpose of the first invariant intrinsic feature hidden space vector, $z_2$ represents the second invariant intrinsic feature hidden space vector, $z_n$ represents an ambient noise hidden space vector, $\tau$ represents a temperature hyper-parameter, $Z_n$ represents the set of hidden space vectors of the ambient noise subgraphs generated from each raw graph datum in the same batch of graph training data, and $Z^{-}$ is the set formed by the hidden space vectors $z^{-}$ obtained by passing the enhanced views of the other samples in the same batch of graph training data through the backbone graph neural network encoder and the multi-layer perceptron projector head.

The sufficiency loss function and the independence loss function are both improved on the basis of the InfoNCE function. Optimizing the sufficiency loss function $\mathcal{L}_{\mathrm{suff}}$ enables the intrinsic feature estimation network to extract the key information in the original graph and capture the nodes important to expressing the intrinsic features of $G$, which guarantees the sufficiency condition; optimizing the independence loss function $\mathcal{L}_{\mathrm{ind}}$ makes the intrinsic feature estimation network more stable, ignoring the influence of the ambient noise nodes other than the key information in the input data, which guarantees the independence condition.
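The InfoNCE-style objective underlying both losses can be sketched as a generic contrastive term that pulls the two invariant views together and pushes the anchor away from a set of negatives (other samples' views for the sufficiency loss, ambient-noise vectors for the independence loss, per the description above). The function name and the use of raw dot products on small lists are illustrative assumptions, not the patent's implementation:

```python
import math

def info_nce(anchor, positive, negatives, tau=0.5):
    """Generic InfoNCE term: -log( exp(a.p/tau) / (exp(a.p/tau) + sum exp(a.n/tau)) )."""
    dot = lambda a, b: sum(x * y for x, y in zip(a, b))
    pos = math.exp(dot(anchor, positive) / tau)
    neg = sum(math.exp(dot(anchor, n) / tau) for n in negatives)
    return -math.log(pos / (pos + neg))

# The loss is smaller when the anchor and positive agree than when they do not.
aligned = info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]], tau=1.0)
misaligned = info_nce([1.0, 0.0], [0.0, 1.0], [[0.0, 1.0]], tau=1.0)
```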
FIG. 4 is a comparative learning framework diagram for graph data invariant features according to an embodiment of the present invention.
The above-described method provided by the present invention is described in further detail below with reference to fig. 4.
The new graph contrastive learning framework shown in fig. 4 lets the graph data augmentation component generate enhanced views that satisfy the sufficiency and independence conditions.
First, the conditional probability distribution of $G_s$ given the original graph is approximated by the probability of each node in the original node set being sampled into $G_s$, i.e. as shown in equation (1):

$$P(G_s \mid G) \approx \prod_{v \in V_s} p(v \mid G) \qquad (1),$$

wherein $v$ is a node of the original graph $G$, and $p(v \mid G)$ represents the probability that node $v$ is sampled into $G_s$, reflecting the importance of node $v$ for expressing the intrinsic features of $G$. Similarly, $P(G_n \mid G)$ is approximated in the form shown in equation (2):

$$P(G_n \mid G) \approx \prod_{v \in V_n} \bigl(1 - p(v \mid G)\bigr) \qquad (2).$$
The above probability distribution function $p(v \mid G)$ is parameterized by the interpretation generator $g$, which comprises two sub-modules, a graph neural network encoder and a multi-layer perceptron, as shown in equations (6) to (8):

$$\{h_v\}_{v \in V} = \mathrm{GNN}(G) \qquad (6),$$

$$s_v = \mathrm{MLP}(h_v) \qquad (7),$$

$$p(v \mid G) = \sigma(s_v) \qquad (8),$$

wherein $\mathrm{GNN}$ is a graph neural network encoder that takes the original graph $G$ as input and outputs the expressions $h_v$ of all nodes of $G$; $\mathrm{MLP}$ is a simple multi-layer perceptron that derives the importance score $s_v$ of node $v$ from its expression $h_v$; and $\sigma$ normalizes the importance scores into sampling probabilities, i.e. the parameterization of $p(v \mid G)$. Based on $p(v \mid G)$, the invariant intrinsic feature subgraph $G_s$ and the ambient noise subgraph $G_n$ can be obtained by probabilistically sampling a fixed proportion (e.g., 80%) of the nodes and preserving the interconnections between them. The enhanced views obtained by sampling are input into the backbone graph neural network encoder $f$ (i.e., the pre-trained target model) to obtain their graph-level representations $z$, as shown in equation (9):

$$z = \mathrm{Pooling}\bigl(f(\tilde{G}) \odot p\bigr) \qquad (9),$$
wherein $\tilde{G}$ denotes an enhanced view and $p$ the corresponding probability vector. Unlike the general framework, before the pooling operation the node expressions extracted by the backbone graph neural network are dot-multiplied with the probability vector $p$ obtained by the interpretation generator, so that subsequent gradients can be passed back to the interpretation generator. The resulting graph-level vector $z$ is then input into a multi-layer perceptron projector head $h$ to obtain the final hidden space expression.

Finally, the hidden space expressions are input into the loss function to train the model. After pre-training is completed, only the backbone graph neural network encoder $f$ is retained and used as a feature encoder for downstream tasks. It should be noted that the pre-training framework is model-agnostic, and an appropriate backbone graph neural network encoder $f$ may be selected according to task requirements.
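The node-sampling step described above (keeping a fixed proportion, e.g. 80%, of nodes with likelihood driven by their importance) could be sketched as below. The weighted sampling-without-replacement trick and all names are assumptions for illustration, and the probabilities are assumed to lie in (0, 1]:

```python
import random

def sample_view(node_probs, keep_ratio=0.8, seed=0):
    """Pick int(|V| * keep_ratio) nodes without replacement, favoring nodes with
    high sampling probability (Efraimidis-Spirakis weighted-key sampling); the
    enhanced view is then the subgraph induced on the kept nodes."""
    rng = random.Random(seed)
    k = max(1, int(len(node_probs) * keep_ratio))
    # Each node gets key u**(1/w); the k largest keys are the sampled nodes.
    keyed = sorted(node_probs, key=lambda v: rng.random() ** (1.0 / node_probs[v]))
    return set(keyed[-k:])

probs = {"a": 0.9, "b": 0.8, "c": 0.7, "d": 0.2, "e": 0.1}
view = sample_view(probs)  # keeps 4 of the 5 nodes
```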
FIG. 5 is a block diagram of a training apparatus for a graph neural network model for graph data invariant features, according to an embodiment of the present invention.
As shown in fig. 5, the training apparatus 500 for the graph neural network model for the graph data invariant features includes a feature extraction module 510, a sampling module 520, a first processing module 530, a second processing module 540, an optimization module 550, and an iterative training module 560.
And the feature extraction module 510 is configured to perform feature extraction on the original image by using the interpretation generator to obtain probability distribution of invariant intrinsic feature subgraph and probability distribution of environmental noise subgraph.
The sampling module 520 is configured to sample the probability distribution of the invariant intrinsic feature subgraph according to a preset sampling ratio to obtain a first invariant intrinsic feature enhanced view and a second invariant intrinsic feature enhanced view, and sample the probability distribution of the environmental noise subgraph according to the preset sampling ratio to obtain an environmental noise enhanced view.
The first processing module 530 is configured to process the first invariant intrinsic feature enhancement view, the second invariant intrinsic feature enhancement view, and the ambient noise enhancement view by using the backbone neural network encoder to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an ambient noise map vector.
The second processing module 540 is configured to process the first invariant intrinsic feature map vector, the second invariant intrinsic feature map vector, and the environmental noise map vector by using the multi-layer perceptron projector to obtain a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector, and an environmental noise hidden space vector.
And an optimizing module 550, configured to input the first invariant intrinsic feature hidden space vector, the second invariant intrinsic feature hidden space vector, and the environmental noise hidden space vector into a loss function, obtain a loss value, and optimize the backbone graph neural network encoder according to the loss value.
And the iterative training module 560 is used for iteratively performing feature extraction operation, sampling operation, processing operation and optimization operation until the loss value is converged to a preset condition, so as to obtain the trained backbone graph neural network encoder.
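Tying the modules together, the iterative pre-training loop could look roughly like this. Every callable here (generator, sampler, encoder, projector, loss, optimizer step) is a stand-in assumption, not the patent's actual implementation:

```python
def pretrain(graphs, generate, sample, encode, project, loss_fn, step,
             max_iters=100, tol=1e-6):
    """Repeat: extract feature distributions -> sample enhanced views ->
    encode -> project -> compute loss -> optimize, until the loss converges."""
    prev = float("inf")
    for _ in range(max_iters):
        total = 0.0
        for g in graphs:
            p_inv, p_noise = generate(g)                 # feature extraction
            v1, v2 = sample(g, p_inv), sample(g, p_inv)  # two invariant views
            vn = sample(g, p_noise)                      # ambient noise view
            z1, z2, zn = (project(encode(v)) for v in (v1, v2, vn))
            total += loss_fn(z1, z2, zn)
        step(total)                                      # optimizer update
        if abs(prev - total) < tol:                      # convergence check
            return encode                                # keep only the backbone
        prev = total
    return encode
```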
The device provided by the present invention can improve the generalization capability of the graph neural network model and broaden its application scenarios; since no expert annotation is needed in the training process, the training cost is also greatly reduced.
Fig. 6 is a diagram illustrating MNIST visualization according to an embodiment of the present invention.
Fig. 7 is a schematic diagram of a MUTAG visualization result according to an embodiment of the present invention.
To better assist those skilled in the art to understand the present invention, the improvement and advantages of the above method provided by the present invention over the prior art are further described in detail by the specific embodiments in conjunction with fig. 6-7 and table 1.
Compared with existing graph contrastive learning frameworks, the general pre-training framework for graph neural network models targeting graph data invariant features, Invariant-feature Graph Contrastive Learning (GCL-IR), has the following advantages: 1) in the absence of expert knowledge, the interpretation generator can accurately capture nodes containing important semantic information, giving the model a certain ability to interpret its input data; 2) backbone models pre-trained under the GCL-IR framework perform significantly better on downstream tasks.
The advantages described above are demonstrated by exhaustive experiments on various data sets in various fields.
Table 1 shows data statistics of data sets used in the experiments
(1) Estimation of node importance. To verify the ability of the interpretation generator to capture nodes containing important semantic information without the constraint of expert knowledge, experiments were conducted on the MNIST-Superpixel dataset: all labels in the training set were ignored, pre-training was performed using the GCL-IR framework, and the interpretation generator's estimates of the importance of the nodes of part of the data were then visualized, as shown in FIG. 6. The three rows from top to bottom are the raw MNIST picture data, the MNIST superpixel graph data, and the interpretation generator's evaluation of node importance. The darker the color, the higher the score, and the more likely the node is to be retained in the enhanced view. The experimental results show that the interpretation generator pre-trained with GCL-IR can accurately capture nodes containing important semantic information.
Verification was then also performed on the real-world biochemical molecule dataset MUTAG. First, model pre-training was carried out on the ZINC-2M dataset containing 2,000,000 unlabeled biochemical molecules using the GCL-IR framework, and the node-importance estimation results for part of the MUTAG dataset were then visualized, as shown in FIG. 7. For six randomly selected molecular samples, two chemists marked the groups that give the chemical molecules their mutagenic properties; the results are shown as yellow marks in the first row. The interpretation generator from GCL-IR pre-training marked the same number of nodes with the highest importance scores; the results are shown as green marks in the second row. If the professional annotations are regarded as ground truth, the accuracy of the GCL-IR interpretation generator reaches 83.3%. The experimental results demonstrate the ability of the GCL-IR pre-trained interpretation generator to accurately capture important nodes on a real biochemical molecule dataset.
(2) Improved downstream-task performance of the backbone network. In the GCL-IR framework, a backbone graph neural network encoder was pre-trained with the ZINC-2M biochemical molecule dataset, and the pre-trained backbone network was then used for supervised learning on 8 different downstream biochemical molecule datasets. Using the same backbone graph neural network, GCL-IR was compared fairly with the current state-of-the-art pre-training frameworks AttrMasking, ContextPred, GraphCL, GraphLoG and AD-GCL. The experimental results are shown in Table 2. Compared with an untrained model, GCL-IR significantly improves the ROC-AUC of the model on the downstream classification tasks, and it also achieves the best results among the compared pre-training methods.
Table 2: ROC-AUC of GCL-IR backbone network transfer learning
GCL-IR is a model-agnostic general pre-training method for graph neural network models targeting the invariant features of graph data, and it can be applied to graph neural network models under various self-supervised pre-training-fine-tuning paradigms. The interpretation generator of GCL-IR can accurately capture important nodes, which increases the interpretability of the model; meanwhile, the enhanced views in GCL-IR that retain the intrinsic features guide the subsequent contrastive learning of the backbone graph neural network more accurately, so its feature expression ability on downstream tasks is significantly improved.
FIG. 8 schematically illustrates a block diagram of an electronic device suitable for implementing a method of training a graph neural network model for graph data invariant features in accordance with an embodiment of the present invention.
As shown in fig. 8, an electronic device 800 according to an embodiment of the present invention includes a processor 801 which can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 802 or a program loaded from a storage section 808 into a Random Access Memory (RAM) 803. The processor 801 may include, for example, a general purpose microprocessor (e.g., CPU), an instruction set processor and/or associated chipset, and/or a special purpose microprocessor (e.g., application Specific Integrated Circuit (ASIC)), among others. The processor 801 may also include onboard memory for caching purposes. The processor 801 may comprise a single processing unit or a plurality of processing units for performing the different actions of the method flows according to embodiments of the present invention.
In the RAM 803, various programs and data necessary for the operation of the electronic apparatus 800 are stored. The processor 801, the ROM 802, and the RAM 803 are connected to each other by a bus 804. The processor 801 performs various operations of the method flow according to the embodiment of the present invention by executing programs in the ROM 802 and/or the RAM 803. Note that the programs may also be stored in one or more memories other than the ROM 802 and the RAM 803. The processor 801 may also perform various operations of method flows according to embodiments of the present invention by executing programs stored in the one or more memories.
Electronic device 800 may also include an input/output (I/O) interface 805, the input/output (I/O) interface 805 also being connected to bus 804, according to an embodiment of the invention. Electronic device 800 may also include one or more of the following components connected to the I/O interface 805: an input portion 806 including a keyboard, a mouse, and the like; an output portion 807 including a display such as a cathode ray tube (CRT) or a liquid crystal display (LCD), a speaker, and the like; a storage portion 808 including a hard disk and the like; and a communication portion 809 including a network interface card such as a LAN card, a modem, or the like. The communication portion 809 performs communication processing via a network such as the internet. A drive 810 is also connected to the I/O interface 805 as necessary. A removable medium 811, such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, is mounted on the drive 810 as necessary, so that a computer program read therefrom is installed into the storage portion 808 as needed.
The present invention also provides a computer-readable storage medium, which may be contained in the apparatus/device/system described in the above embodiments; or may exist separately and not be assembled into the device/apparatus/system. The computer-readable storage medium carries one or more programs which, when executed, implement the method according to an embodiment of the present invention.
According to embodiments of the present invention, the computer readable storage medium may be a non-volatile computer readable storage medium, which may include, for example but is not limited to: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present invention, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. For example, according to embodiments of the present invention, a computer-readable storage medium may include the ROM 802 and/or the RAM 803 described above and/or one or more memories other than the ROM 802 and the RAM 803.
The above embodiments are provided to further explain the objects, technical solutions and advantages of the present invention in detail, and it should be understood that the above embodiments are only examples of the present invention and are not intended to limit the present invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (7)

1. A training method of a graph neural network model aiming at graph data invariant features comprises the following steps:
performing feature extraction on an original image by using an interpretation generator to obtain invariant intrinsic feature sub-image probability distribution and environmental noise sub-image probability distribution, wherein the interpretation generator comprises a graph neural network encoder and a multilayer perceptron;
sampling the probability distribution of the invariant intrinsic feature subgraph according to a preset sampling proportion to obtain a first invariant intrinsic feature enhanced view and a second invariant intrinsic feature enhanced view, and sampling the probability distribution of the environmental noise subgraph according to the preset sampling proportion to obtain an environmental noise enhanced view;
processing the first invariant intrinsic feature enhancement view, the second invariant intrinsic feature enhancement view and the environmental noise enhancement view by using a backbone map neural network encoder to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector and an environmental noise map vector;
processing the first invariant intrinsic feature map vector, the second invariant intrinsic feature map vector and the environmental noise map vector by using a multi-layer perceptron projector to obtain a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector and an environmental noise hidden space vector;
inputting the first invariant intrinsic feature hidden space vector, the second invariant intrinsic feature hidden space vector and the environmental noise hidden space vector into a loss function to obtain a loss value, and optimizing the backbone graph neural network encoder according to the loss value;
iteratively performing feature extraction operation, sampling operation, processing operation and optimization operation until the loss value is converged to a preset condition to obtain a trained backbone graph neural network encoder;
the method for extracting the features of the original image by using the interpretation generator to obtain the probability distribution of the invariant intrinsic feature subgraph and the probability distribution of the environmental noise subgraph comprises the following steps:
processing the original image by using the image neural network encoder to obtain vector expressions of all nodes of the original image;
calculating the vector expression of each node in the original image by using the multilayer perceptron to obtain the importance score of each node in the original image;
normalizing the importance score of each node in the original graph to obtain the sampling probability of each node in the original graph;
obtaining the probability distribution of the unchanged intrinsic characteristic subgraph and the probability distribution of the environmental noise subgraph according to the sampling probability of each node in the original graph;
wherein the processing the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view, and the environmental noise enhanced view by using the backbone graph neural network encoder to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an environmental noise map vector includes:
respectively extracting vector expressions of each node in the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view by utilizing the backbone graph neural network encoder;
performing point multiplication on the vector expression of each node in the first invariant intrinsic characteristic enhancement view and the probability distribution of the invariant intrinsic characteristic subgraph to obtain a first point multiplication result;
performing point multiplication on the vector expression of each node in the second invariant intrinsic characteristic enhancement view and the probability distribution of the invariant intrinsic characteristic subgraph to obtain a second point multiplication result;
performing point multiplication on the vector expression of each node in the environmental noise enhancement view and the probability distribution of the environmental noise subgraph to obtain an environmental noise point multiplication result;
and performing pooling operation on the first point multiplication result, the second point multiplication result and the environmental noise point multiplication result respectively to obtain the first invariant intrinsic feature map vector, the second invariant intrinsic feature map vector and the environmental noise map vector.
2. The method of claim 1, wherein the invariant intrinsic feature subgraph probability distribution is determined by equation (1):
$$P(G_s \mid G) \approx \prod_{v \in V_s} p(v \mid G) \qquad (1),$$

wherein the ambient noise sub-graph probability distribution is determined by equation (2):

$$P(G_n \mid G) \approx \prod_{v \in V_n} \bigl(1 - p(v \mid G)\bigr) \qquad (2),$$

wherein $G_s$ represents the invariant intrinsic feature subgraph, $G_n$ represents the ambient noise subgraph, $v$ represents a node, $V$ represents the node set of the original graph $G$, $V_s$ represents the node set of the invariant intrinsic feature subgraph, $V_n$ represents the node set of the ambient noise subgraph, and $p(v \mid G)$ represents the probability of node $v$ of the original graph $G$.
3. The method of claim 1, wherein the loss function comprises a sufficiency loss function and an independence loss function;
wherein the loss function is determined by equation (3):
$$\min_{g,\,f,\,h}\; \mathbb{E}_{G \in \mathcal{G}} \bigl[\, \mathcal{L}_{\mathrm{suff}} + \lambda\, \mathcal{L}_{\mathrm{ind}} \,\bigr] \qquad (3),$$

wherein $\lambda$ is a hyper-parameter that balances the sufficiency loss function and the independence loss function, $g$ represents the interpretation generator, $f$ represents the backbone graph neural network encoder, $h$ represents the multi-layer perceptron projector head, $\mathcal{L}_{\mathrm{suff}}$ represents the sufficiency loss function, $\mathcal{L}_{\mathrm{ind}}$ represents the independence loss function, $G$ represents the original graph, $\mathcal{G}$ represents the set of original graphs, and $\mathbb{E}$ represents the mathematical expectation.
4. The method of claim 3, wherein the sufficiency loss function is determined by equation (4):
$$\mathcal{L}_{\mathrm{suff}} = -\log \frac{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr)}{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr) + \sum_{z^{-} \in Z^{-}} \exp\!\bigl(z_1^{\top} z^{-} / \tau\bigr)} \qquad (4),$$

wherein the independence loss function is determined by equation (5):

$$\mathcal{L}_{\mathrm{ind}} = -\log \frac{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr)}{\exp\!\bigl(z_1^{\top} z_2 / \tau\bigr) + \sum_{z_n \in Z_n} \exp\!\bigl(z_1^{\top} z_n / \tau\bigr)} \qquad (5),$$

wherein $z_1^{\top}$ represents the transpose of the first invariant intrinsic feature hidden space vector, $z_2$ represents the second invariant intrinsic feature hidden space vector, $z_n$ represents an ambient noise hidden space vector, $\tau$ represents the temperature hyper-parameter, $Z_n$ represents the set of hidden space vectors of the ambient noise subgraphs generated from each raw graph datum in the same batch of graph training data, and $Z^{-}$ is the set formed by the hidden space vectors $z^{-}$ obtained by passing the enhanced views of the other samples in the same batch of graph training data through the backbone graph neural network encoder and the multi-layer perceptron projector head.
5. A training apparatus for a graph neural network model for graph data invariant features, comprising:
the characteristic extraction module is used for extracting the characteristics of the original image by using the interpretation generator to obtain the probability distribution of the unchanged intrinsic characteristic subgraph and the probability distribution of the environmental noise subgraph, wherein the interpretation generator comprises a graph neural network encoder and a multilayer perceptron;
the sampling module is used for sampling the probability distribution of the invariant intrinsic characteristic subgraph according to a preset sampling proportion to obtain a first invariant intrinsic characteristic enhanced view and a second invariant intrinsic characteristic enhanced view, and sampling the probability distribution of the environmental noise subgraph according to the preset sampling proportion to obtain an environmental noise enhanced view;
a first processing module, configured to process the first invariant intrinsic feature enhancement view, the second invariant intrinsic feature enhancement view, and the ambient noise enhancement view by using a backbone graph neural network encoder to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an ambient noise map vector;
the second processing module is used for processing the first invariant intrinsic feature map vector, the second invariant intrinsic feature map vector and the environmental noise map vector by using a multi-layer perceptron projector head to obtain a first invariant intrinsic feature hidden space vector, a second invariant intrinsic feature hidden space vector and an environmental noise hidden space vector;
the optimization module is used for inputting the first invariant intrinsic feature hidden space vector, the second invariant intrinsic feature hidden space vector and the environmental noise hidden space vector into a loss function to obtain a loss value, and optimizing the backbone graph neural network encoder according to the loss value;
an iterative training module for iteratively performing feature extraction operation, sampling operation, processing operation and optimization operation until the loss value converges to a preset condition to obtain a trained backbone graph neural network encoder;
The method for extracting the features of the original image by using the interpretation generator to obtain the probability distribution of the invariant intrinsic feature subgraph and the probability distribution of the environmental noise subgraph comprises the following steps:
processing the original image by using the image neural network encoder to obtain vector expressions of all nodes of the original image;
calculating the vector expression of each node in the original image by using the multilayer perceptron to obtain the importance score of each node in the original image;
normalizing the importance score of each node in the original graph to obtain the sampling probability of each node in the original graph;
obtaining the probability distribution of the unchanged intrinsic characteristic subgraph and the probability distribution of the environmental noise subgraph according to the sampling probability of each node in the original graph;
wherein the processing the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view, and the environmental noise enhanced view by using the backbone graph neural network encoder to obtain a first invariant intrinsic feature map vector, a second invariant intrinsic feature map vector, and an environmental noise map vector includes:
respectively extracting vector expressions of each node in the first invariant intrinsic feature enhanced view, the second invariant intrinsic feature enhanced view and the environmental noise enhanced view by utilizing the backbone graph neural network encoder;
performing point multiplication on the vector expression of each node in the first invariant intrinsic feature enhanced view and the probability distribution of the invariant intrinsic feature subgraph to obtain a first point multiplication result;
performing point multiplication on the vector expression of each node in the second invariant intrinsic feature enhanced view and the probability distribution of the invariant intrinsic feature subgraph to obtain a second point multiplication result;
performing point multiplication on the vector expression of each node in the environmental noise enhanced view and the probability distribution of the environmental noise subgraph to obtain an environmental noise point multiplication result;
and performing pooling operation on the first point multiplication result, the second point multiplication result and the environmental noise point multiplication result respectively to obtain the first invariant intrinsic feature map vector, the second invariant intrinsic feature map vector and the environmental noise map vector.
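The point-multiplication-then-pooling step can be sketched as below. This is a NumPy sketch under stated assumptions: the function name is hypothetical, and mean pooling is one plausible choice — the claim says only "pooling operation" without fixing the operator.

```python
import numpy as np

def graph_vector(node_embeddings, node_probs):
    """Weight each node's vector expression by its subgraph probability
    (the 'point multiplication' of the claim), then mean-pool over nodes
    to obtain a single graph-level vector."""
    # broadcast the per-node probability across the feature dimension
    weighted = node_embeddings * node_probs[:, None]   # (num_nodes, dim)
    # pooling: collapse the node dimension into one graph vector
    return weighted.mean(axis=0)                       # (dim,)
```

Applied three times — to the two invariant-feature enhanced views with the invariant-subgraph distribution, and to the noise enhanced view with the noise-subgraph distribution — this yields the first and second invariant intrinsic feature map vectors and the environmental noise map vector.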
6. An electronic device, comprising:
one or more processors;
a storage device for storing one or more programs,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the method of any of claims 1 to 4.
7. A computer readable storage medium having stored thereon executable instructions that, when executed by a processor, cause the processor to perform the method of any of claims 1 to 4.
CN202210714507.8A 2022-06-22 2022-06-22 Method and device for training graph neural network model aiming at graph data invariant features Active CN114821119B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210714507.8A CN114821119B (en) 2022-06-22 2022-06-22 Method and device for training graph neural network model aiming at graph data invariant features

Publications (2)

Publication Number Publication Date
CN114821119A CN114821119A (en) 2022-07-29
CN114821119B true CN114821119B (en) 2022-10-28

Family

ID=82521832

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210714507.8A Active CN114821119B (en) 2022-06-22 2022-06-22 Method and device for training graph neural network model aiming at graph data invariant features

Country Status (1)

Country Link
CN (1) CN114821119B (en)

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111428874B (en) * 2020-02-29 2024-07-12 平安科技(深圳)有限公司 Wind control method, electronic device and computer readable storage medium
US11354778B2 (en) * 2020-04-13 2022-06-07 Google Llc Systems and methods for contrastive learning of visual representations
CN111930992B (en) * 2020-08-14 2022-10-28 腾讯科技(深圳)有限公司 Neural network training method and device and electronic equipment
CN112862093B (en) * 2021-01-29 2024-01-12 北京邮电大学 Graphic neural network training method and device
CN113314188B (en) * 2021-06-16 2022-07-15 中国科学技术大学 Graph structure enhanced small sample learning method, system, equipment and storage medium
CN114020928A (en) * 2021-11-02 2022-02-08 上海交通大学 False news identification method based on heterogeneous graph comparison learning

Similar Documents

Publication Publication Date Title
Liang et al. Explaining the black-box model: A survey of local interpretation methods for deep neural networks
Eslami et al. Attend, infer, repeat: Fast scene understanding with generative models
Samek et al. Evaluating the visualization of what a deep neural network has learned
Wang et al. Rethinking the learning paradigm for dynamic facial expression recognition
CN107784288A (en) A kind of iteration positioning formula method for detecting human face based on deep neural network
CN115861462B (en) Training method and device for image generation model, electronic equipment and storage medium
Lin et al. Saliency detection via multi-scale global cues
US20210056353A1 (en) Joint representation learning from images and text
CN115661246A (en) Attitude estimation method based on self-supervision learning
CN115147607A (en) Anti-noise zero-sample image classification method based on convex optimization theory
CN113743474A (en) Digital picture classification method and system based on cooperative semi-supervised convolutional neural network
CN117253044A (en) Farmland remote sensing image segmentation method based on semi-supervised interactive learning
CN113807425B (en) Tissue pathology image classification method based on self-adaptive regular depth clustering
CN117690098A (en) Multi-label identification method based on dynamic graph convolution under open driving scene
CN114821119B (en) Method and device for training graph neural network model aiming at graph data invariant features
CN111950582A (en) Determining a perturbation mask for a classification model
CN116597275A (en) High-speed moving target recognition method based on data enhancement
CN115810109A (en) Tongue image feature extraction method, system and storage medium
CN111797732B (en) Video motion identification anti-attack method insensitive to sampling
CN114170485A (en) Deep learning interpretable method and apparatus, storage medium, and program product
CN113920291A (en) Error correction method and device based on picture recognition result, electronic equipment and medium
CN112861601A (en) Method for generating confrontation sample and related equipment
CN112446345A (en) Low-quality three-dimensional face recognition method, system, equipment and storage medium
Zhang et al. A modified image processing method for deblurring based on GAN networks
CN114998990B (en) Method and device for identifying safety behaviors of personnel on construction site

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant