CN114997340B

CN114997340B - Literature classification method based on graph neural network system

Info

Publication number: CN114997340B
Application number: CN202210915495.5A
Authority: CN
Inventors: 杨永鹏; 杨真真; 杨震
Original assignee: Nanjing University of Posts and Telecommunications
Current assignee: Nanjing University of Posts and Telecommunications
Priority date: 2022-08-01
Filing date: 2022-08-01
Publication date: 2022-10-14
Anticipated expiration: 2042-08-01
Also published as: CN114997340A

Abstract

The invention relates to a literature classification method based on a graph neural network system, which is characterized in that the graph neural network system based on a graph filtering kernel and a generalized non-convex norm is designed and divided into a first multilayer perceptron module, a linear rectification function module, a second multilayer perceptron module, a target graph neural network module and a normalized exponential function module according to a processing flow, wherein the target graph neural network module is constructed based on the graph filtering kernel and the generalized non-convex norm and comprises a graph filtering kernel term used for extracting useful information of a graph signal, a graph Laplace regularization term used for carrying out global graph smoothing processing on the graph signal and a generalized non-convex norm term used for carrying out local graph smoothing processing on the graph signal, and in application, the target graph neural network module is solved by adopting a prediction check descent and ascent algorithm; therefore, the graph neural network system forms a network to be trained, and the network is trained to obtain a document classification model, so that the classification precision of documents and the working efficiency of practical document application can be effectively improved.

Description

Literature classification method based on graph neural network system

Technical Field

The invention relates to a literature classification method based on a graph neural network system, and belongs to the technical field of machine learning of graph neural network classification.

Background

In recent years, machine learning techniques, such as neural networks, have attracted considerable attention because they have been able to successfully promote the development of fields such as pattern recognition and data mining. Classical neural network methods include convolutional neural networks, periodic neural networks, autoencoders and the like, but most of such neural network methods are used for processing regular data, and the effect of processing image data existing in non-Euclidean spaces is not good enough, and for such reasons, image neural network technology comes along. At present, the graph neural networks are roughly divided into two categories, one is a spectrum-based graph neural network such as a chebyshev model and the like, and the other is a space-based graph neural network such as a graph convolution neural network, a graph attention machine network and the like. However, the existing graph neural network has low classification precision in practical application and low practical application efficiency.

Disclosure of Invention

The invention aims to solve the technical problem of providing a document classification method based on a graph neural network system, which adopts a brand-new network structure design and can effectively improve the precision and efficiency of document classification in application.

The invention adopts the following technical scheme for solving the technical problems: the invention designs a literature classification method based on a graph neural network system, which is characterized in that a literature classification model is obtained through steps A to C based on sample literatures which respectively correspond to preset literature categories and graph signals of matrixes formed by characteristics of preset types respectively, and the literature to be classified is classified by applying the literature classification model according to step i;

a, constructing a target graph neural network module based on a graph filtering kernel and a generalized non-convex norm, wherein the target graph neural network module comprises a graph filtering kernel item used for extracting useful information of a graph signal, a graph Laplace regular item used for carrying out global graph smoothing on the graph signal, and a generalized non-convex norm item used for carrying out local graph smoothing on the graph signal, and then entering a step B;

b, constructing a network to be trained, wherein the input end of the network to be trained is formed by the input end of a first multilayer sensor module, the output end of the first multilayer sensor module is sequentially connected with a linear rectification function module, a second multilayer sensor module, a target graph neural network module and a normalization index function module in series, and the output end of the normalization index function module forms the output end of the network to be trained, and then entering the step C;

c, training a network to be trained according to the incidence relation among the sample documents based on the graph signals respectively corresponding to the sample documents and the document types respectively corresponding to the sample documents, and obtaining a document classification model taking the graph signals corresponding to the documents as input and the document types corresponding to the documents as output;

and i, obtaining a diagram signal formed by preset various types of characteristics corresponding to the document to be classified, and applying a document classification model to obtain the document category corresponding to the document to be classified.

As a preferred technical scheme of the invention: the target graph neural network module comprises a graph filtering kernel item for extracting useful information of graph signals

Graph Laplace regularization term for global graph smoothing of graph signals

And a generalized non-convex norm term for local graph smoothing of graph signals

And the optimization equation corresponding to the target graph neural network module is as follows:

wherein,

，

representing the map signal received by the target map neural network module,

representation map signal

A transfer function is performed that can be learned,

a representation of the parameters that can be learned,

representing target graph neural network modules for received graph signals

The output signal after a number of iterations,

represents the norm of Frobenius,

、

、

respectively, the super-parameters are the parameters,

indicating a balance factor greater than 0 and,

the trace function of the matrix is represented by,

，

a matrix of the laplacian of the graph is represented,

the unit matrix is represented by a matrix of units,

，

a graph adjacency matrix representing the regularization,

a matrix of degrees representing the graph is shown,

in order to map the adjacency matrix of the figure,

，

show each itemThe document corresponds to the first of the association relation graph between the vertexes

The number of the top points is equal to the number of the top points,

the second of which represents the association relationship graph between the corresponding vertexes of each document

The number of the top points is equal to the number of the top points,

the first in the graph representing the association relationship between the corresponding vertexes of each document

The degree of each of the vertices,

The degree of each of the vertices,

and

are respectively

To (1) a

A sum of

The output signal of the first and second switching circuits is,

is broad, non-convex, conformable, closedAnd a lower semi-continuous function of the function,

，

。

as a preferred technical scheme of the invention: the optimization equation corresponding to the target graph neural network module is solved by adopting a predictive check descending and ascending algorithm in the following mode;

first according to

Is a conjugate function of

And matrix variables to be solved

Supremum function, infimum function

Construction of

；

Then, the optimization equation corresponding to the target graph neural network module is updated as follows:

and finally solving the optimization equation corresponding to the updated target graph neural network module.

As a preferred technical scheme of the invention: based on the initialization

、

、

Solving an optimization equation corresponding to the updated graph neural network module according to the following steps;

step 1) updating by adopting a steepest descent method according to the following formula

By a value of (i), i.e. using

Represents;

wherein,

represents the solution process

The number of sub-iterations is,

which represents the first step-size factor,

the gradient operator is represented by a gradient operator,

representation solving

Of a minor iteration

A value of (d); then entering step 2);

step 2) solving by adopting the steepest ascent method according to the following formula

；

Wherein,

is a proximity operator which is a function of the proximity operator,

represents a second step size factor; then entering step 3);

step 3) according to the following formula:

updating variable updates

Then step 4) is entered;

step 4) judgment

Whether or not the number of iterations is not less than a predetermined maximum number of iterations

If yes, ending the iteration; otherwise to

Is updated by adding 1 and returns to step 1).

Compared with the prior art, the document classification method based on the graph neural network system has the following technical effects by adopting the technical scheme:

the invention designs a literature classification method based on a graph neural network system, designs a graph neural network system based on a graph filtering kernel and a generalized non-convex norm, and divides the graph neural network system into a first multilayer perceptron module, a linear rectification function module, a second multilayer perceptron module, a target graph neural network module and a normalized exponential function module according to a processing flow, wherein the target graph neural network module is constructed based on the graph filtering kernel and the generalized non-convex norm and comprises a graph filtering kernel term used for extracting useful information of a graph signal, a graph Laplace regularization term used for carrying out global graph smoothing processing on the graph signal and a generalized non-convex norm term used for carrying out local graph smoothing processing on the graph signal, and in application, the target graph neural network module is solved by adopting a prediction check descending and ascending algorithm; therefore, the graph neural network system forms a network to be trained, and the network is trained to obtain a document classification model, so that the classification precision of documents and the working efficiency of practical document application can be effectively improved.

Drawings

FIG. 1 is a system block diagram of a neural network system of the present design;

FIG. 2 is a flow chart of the solving of the neural network module of the target graph in the design of the present invention;

FIG. 3 is a graph comparing the effect of the document classification method based on the graph neural network system compared with other classical graph neural networks for classifying graph nodes.

Detailed Description

The following description will explain embodiments of the present invention in further detail with reference to the accompanying drawings.

The invention designs a literature classification method based on a graph neural network system, and in practical application, a literature classification model is obtained through steps A to C based on sample literatures which respectively correspond to preset literature categories and graph signals of matrixes formed by characteristics of various types, which are respectively corresponding to preset sample literatures.

step B, as shown in fig. 1, constructing a network to be trained, wherein the input end of the network to be trained is formed by the input end of a first multilayer perceptron (MLP) module, the output end of the first multilayer perceptron (MLP) module is sequentially connected in series with a linear rectification function module (ReLU), a second multilayer perceptron (MLP) module, a target graph neural network module and a normalization index function module (Softmax), the output end of the normalization index function module (Softmax) forms the output end of the network to be trained, and then the step C is carried out;

and (3) after the document classification model is obtained through the steps A to C, further applying the document classification model, and executing the step i to classify the document to be classified.

And i, acquiring a graph signal formed by preset various types of characteristics corresponding to the document to be classified, and applying a document classification model to acquire the document category corresponding to the document to be classified.

In practical application, the optimization equation corresponding to the designed target graph neural network module is shown as formula (1):

(1)

wherein,

and

for the graph filtering kernel, the invention will

And

are all provided with

，

Is about

Is set as the generalized non-convex function in the present invention

Then, the optimization model corresponding to equation (1) becomes the following optimization equation:

wherein, the design target graph neural network module comprises a graph filtering kernel term for extracting useful information of the graph signal

Graph Laplace regularization term for global graph smoothing of graph signals

And generalized non-convex norm term for local graph smoothing of graph signals

。

Wherein,

，

representing the map signal received by the target map neural network module,

show to the picture letterNumber (C)

A transfer function is performed that can be learned,

a representation of the parameters that can be learned,

representing target graph neural network modules for received graph signals

The output signal after a number of iterations,

represents the norm of Frobenius,

、

、

respectively, the parameters are the super-parameters,

indicating a balance factor greater than 0 and,

the trace function of the matrix is represented by,

，

a matrix of laplacian of the graph is represented,

the matrix of the unit is expressed by,

，

a graph adjacency matrix representing the regularization,

a matrix of degrees representing the graph is shown,

in order to be a graph of the adjacency matrix,

，

The number of the top points is equal to the number of the top points,

The number of the vertexes is equal to that of the vertex,

The degree of each of the vertices,

the first one in the graph representing the association relationship between the corresponding vertexes of each document

The degree of each of the vertices,

and

are respectively

To (1) a

A sum of

The output signal of the first and second switching circuits is,

is a generalized, non-convex, fitting, closed and lower semicontinuous function,

，

。

and in practical application, the optimization equation corresponding to the neural network module of the target graph is obtained because

Difficult to solve directly, the design of the invention converts it into

The conjugate function of the target graph neural network module is solved, namely, a prediction check descending and ascending algorithm is adopted, and the optimization equation corresponding to the target graph neural network module is solved in the following mode.

First according to

Is a conjugate function of

And the matrix variables to be solved for

Supremum function

Construction of

。

Then, the optimization equation corresponding to the neural network module of the target graph is updated as follows:

finally, as shown in FIG. 2, based on the initial

、

、

And solving the optimization equation corresponding to the updated graph neural network module according to the following steps 1) to 4).

By a value of (i), i.e. using

Representing;

wherein,

represents the solution process

The number of sub-iterations is,

which represents the first step-size factor,

the gradient operator is represented by a gradient operator,

representation solution

Of a minor iteration

A value of (d); then step 2) is entered.

；

Wherein,

is a proximity operator, which is a function of the proximity operator,

represents a second step size factor; then step 3) is entered.

Step 3) according to the following formula:

updating variable updates

Then step 4) is entered.

Step 4) judgment

Whether or not it is not less than a preset maximum number of iterations

If yes, ending the iteration; otherwise to

Is updated by adding 1 and returns to step 1).

The technical scheme includes that a graph neural network system based on a graph filtering kernel and a generalized non-convex norm is designed, and the graph neural network system based on the graph filtering kernel and the generalized non-convex norm is divided into a first multilayer perceptron module, a linear rectification function module (ReLU), a second multilayer perceptron module, a target graph neural network module and a normalized exponential function module (Softmax) according to a processing flow, wherein the target graph neural network module is constructed based on the graph filtering kernel and the generalized non-convex norm and comprises a graph filtering regular term used for extracting useful information of a graph signal, a graph Laplace term used for carrying out global graph smoothing on the graph signal and a generalized non-convex norm term used for carrying out local graph smoothing on the graph signal, and in application, the target graph neural network module is solved by adopting a prediction check descent and ascent algorithm; in this way, the graph neural network system forms a network to be trained, and trains the network to obtain a document classification model, and in practical application, as shown in fig. 3, the OurGNN is a new graph neural network system proposed by the present invention, and compared with graph node classification performed by other classical graph neural networks GCN (graph convolution neural network), GAT (graph attention neural network), SGC (simplified graph convolution network), APPNP (accelerated neural prediction personalized propagation), graphSAGE (graph sample aggregation neural network), the classification method provided by the present invention can effectively improve the classification accuracy of documents and the work efficiency of practical document application from the experimental effects of three document cited graph data, namely Cora, citeseer and Pubmed. Meanwhile, the application of the design method is expanded to the node classification problem of a document author relationship diagram (comprising a relationship diagram between two document authors, namely CS and Physics) and a purchase relationship diagram (comprising two purchase relationship diagrams, namely Computers and Photo), and the simulation effect of two groups of experiments expanded by the design method is further verified by the invention, so that the effectiveness of the score classification method in practical application is further verified.

The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention.

Claims

1. A literature classification method based on a graph neural network system is characterized in that: obtaining a document classification model through steps A to C based on sample documents which respectively correspond to preset document types and graph signals of matrixes formed by characteristics of the sample documents which respectively correspond to preset types of the documents, and classifying the documents to be classified by applying the document classification model according to the step i;

2. The method of claim 1, wherein the method comprises: the target graph neural network module comprises a graph filtering kernel item for extracting useful information of a graph signal

Graph Laplace regularization term for global graph smoothing of graph signals

wherein,

，

representing the map signal received by the target map neural network module,

representation map signal

A transfer function is performed that can be learned,

the representation of the learnable parameter is,

representing target graph neural network modules against received graph signals

The output signal after a number of iterations,

represents the Frobenius norm,

、

、

respectively, the super-parameters are the parameters,

indicating a balance factor greater than 0 and,

the trace function of the matrix is represented by,

，

a matrix of the laplacian of the graph is represented,

the unit matrix is represented by a matrix of units,

，

a graph adjacency matrix representing the regularization,

a matrix of degrees representing the graph is shown,

in order to map the adjacency matrix of the figure,

，

The number of the top points is equal to the number of the top points,

the first to the second of the association graph

The number of the top points is equal to the number of the top points,

The degree of each of the vertices is,

The degree of each of the vertices,

and

are respectively

To (1) a

An

The output signal of the first and second switching circuits is output,

，

。

3. the method of claim 2, wherein the method comprises: the optimization equation corresponding to the target graph neural network module is solved by adopting a predictive check descending and ascending algorithm in the following mode;

first according to