CN117251583A - Text enhanced knowledge graph representation learning method and system based on local graph structure - Google Patents

Text enhanced knowledge graph representation learning method and system based on local graph structure

Info

Publication number
CN117251583A
Authority
CN
China
Prior art keywords
vector
input
text
word
knowledge graph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311544608.6A
Other languages
Chinese (zh)
Other versions
CN117251583B (en)
Inventor
李志飞
蹇越
张龑
王时绘
张淼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei University
Original Assignee
Hubei University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei University filed Critical Hubei University
Priority to CN202311544608.6A priority Critical patent/CN117251583B/en
Publication of CN117251583A publication Critical patent/CN117251583A/en
Application granted granted Critical
Publication of CN117251583B publication Critical patent/CN117251583B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 - Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 - Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 - Ontology
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/25 - Fusion techniques
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 - Handling natural language data
    • G06F40/20 - Natural language analysis
    • G06F40/279 - Recognition of textual entities
    • G06F40/289 - Phrasal analysis, e.g. finite state techniques or chunking
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 - Handling natural language data
    • G06F40/30 - Semantic analysis
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 - Computing arrangements using knowledge-based models
    • G06N5/02 - Knowledge representation; Symbolic representation
    • G06N5/022 - Knowledge engineering; Knowledge acquisition
    • Y - GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 - TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D - CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00 - Reducing energy consumption in communication networks
    • Y02D30/70 - Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Databases & Information Systems (AREA)
  • Animal Behavior & Ethology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a text enhanced knowledge graph representation learning method based on a local graph structure, which comprises the following steps: S1: acquiring a knowledge graph of a local graph structure, and calculating a first input vector and a second input vector from the knowledge graph and an input triplet; S2: constructing a learning model, and training the first input vector and the second input vector through the learning model to obtain a first prediction vector and a second prediction vector; S3: constructing a scoring function from the first prediction vector and the second prediction vector, and calculating a loss function through the scoring function; S4: repeating steps S1-S3 until the loss function is smaller than a preset value, obtaining a trained learning model. The learning model constructed by the invention fuses text encoding with the local graph structure information of the knowledge graph, which helps to acquire contextualized and structured knowledge simultaneously; by integrating the u-hop neighbor entities into the input vector, the problem of insufficient structural information in text-based methods is solved.

Description

Text enhanced knowledge graph representation learning method and system based on local graph structure
Technical Field
The invention relates to the technical field of knowledge graphs, and in particular to a text enhanced knowledge graph representation learning method and system based on a local graph structure.
Background
In recent years, knowledge graphs (KGs) have been widely used in various natural language processing fields, and have demonstrated significant efficacy in search engines, question answering, recommendation systems, and other tasks. Various knowledge bases such as Freebase, WordNet, and NELL have been created for storing structured human knowledge. A knowledge graph is mainly composed of two kinds of elements, entities and relations, and knowledge is stored as triples of the form (subject entity, relation, object entity). Representation learning of the knowledge graph aims at mapping entities and relations to a continuous, low-dimensional vector space, which facilitates the computation of inference between entities.
In order to obtain a high-level representation of a knowledge graph, the idea of projecting the entities and relations inherent in the knowledge graph onto different points of a vector space has been proposed in recent years. Earlier knowledge graph representation learning methods are mainly graph embedding methods, which aim to obtain low-dimensional vector representations of the elements in the graph. However, such methods ignore the context and text information that could help enhance entity representation learning. This deficiency makes them particularly vulnerable to the challenges posed by graph incompleteness, and these limitations greatly impair their ability to generalize efficiently and to produce high-quality representations.
For this reason, many studies have employed text encoding methods to obtain representations. Text-based strategies use supplementary input signals and contextual representations of the natural language text associated with knowledge triples to enable missing-component prediction in knowledge graph representation learning. The text may contain textual attributes of the entities and relations, including their names or descriptions. With a pre-trained language model, the text encoder can generalize seamlessly to new graph elements and is not affected by the challenges posed by graph incompleteness. However, existing text-based knowledge graph representation learning methods still suffer from a lack of structural knowledge and an excessive reliance on semantic information, because the text encoder is insufficiently effective at acquiring structural insights. This affects the accuracy and performance of representation learning.
Disclosure of Invention
In order to solve the technical problems, the invention provides a text enhanced knowledge graph representation learning method based on a local graph structure, which comprises the following steps:
S1: obtaining a knowledge graph of a local graph structure, and obtaining a first input vector V_sr and a second input vector V_o through calculation from the knowledge graph and an input triplet;
S2: constructing a learning model, and training the first input vector V_sr and the second input vector V_o through the learning model to obtain a first prediction vector e_sr and a second prediction vector e_o;
S3: constructing a scoring function from the first prediction vector e_sr and the second prediction vector e_o, and calculating a loss function through the scoring function;
S4: repeating the steps S1-S3 until the loss function is smaller than a preset value, obtaining a trained learning model, and performing a link prediction task through the trained learning model.
Preferably, step S1 specifically includes:
S11: obtaining a knowledge graph of a local graph structure, consisting of the set of all entities, the set of all relationships and the set of correct triplets; obtaining an input triplet (s, r, o) from the correct triplet set, where s denotes the subject entity embedding, r the relation embedding, and o the object entity embedding;
S12: obtaining the hop neighbor list L_s and the text description T_s of the subject entity whose embedding is s, and obtaining the hop neighbor list L_o and the text description T_o of the object entity whose embedding is o;
S13: obtaining the u-hop neighbor entities L_s[u] of the subject entity from L_s, and combining s, r, L_s[u] and T_s to obtain the first input vector V_sr;
S14: obtaining the u-hop neighbor entities L_o[u] of the object entity from L_o, and combining o, L_o[u] and T_o to obtain the second input vector V_o.
Preferably:
the first input vector V_sr is calculated as:
V_sr = s + L_s[u] + r + T_s
the second input vector V_o is calculated as:
V_o = o + L_o[u] + T_o
Preferably, step S2 specifically includes:
S21: inputting the first input vector V_sr and the second input vector V_o into the learning model, performing word segmentation on V_sr and V_o to obtain word vectors, and calculating the attention, relative position distance and semantic information among the word vectors to obtain a first hidden layer vector and a second hidden layer vector;
S22: performing feature-extraction forward propagation on the first and second hidden layer vectors through the text encoder to obtain a first language model vector H_sr and a second language model vector H_o;
S23: performing vector processing on H_sr and H_o through pooling and regularization to obtain a first prediction vector e_sr and a second prediction vector e_o of uniform dimension.
Preferably, step S21 specifically includes:
S211: performing word segmentation on V_sr to obtain a first word vector set, where token denotes the word segmentation operation, n denotes the total number of first word vectors, and the n-th element is the n-th first word vector; performing word segmentation on V_o to obtain a second word vector set, where m denotes the total number of second word vectors and the m-th element is the m-th second word vector;
S212: calculating the attention components among the word vectors, where H denotes the semantic information hidden state associated with the word vectors, Q_s, K_s and V_s are the semantic vector matrices generated from H through the projection matrices W_qs, W_ks and W_vs respectively, and K_p and Q_p are the relative position vector matrices generated through the projection matrices W_kp and W_qp respectively;
S213: calculating the relative position distance, where a and b are labels of word vectors, D(a, b) denotes the relative position distance from label a to label b, and k denotes the maximum relative position distance;
S214: calculating the attention vector A_{a,b} from label a to label b as the sum of the semantics-to-semantics attention Q_s[a]·K_s[b]^T, the semantics-to-relative-position attention Q_s[a]·K_p[D(a,b)]^T, and the relative-position-to-semantics attention K_s[b]·Q_p[D(a,b)]^T, where Q_s[a] denotes row a of Q_s, K_s[b] denotes row b of K_s, K_p[D(a,b)] denotes row D(a,b) of K_p, and Q_p[D(a,b)] denotes row D(a,b) of Q_p;
S215: constructing the attention operation from the attention vectors among all labels, and applying it to all word vectors to obtain the first hidden layer vector and the second hidden layer vector.
Preferably, the scoring function in step S3 is constructed from: d_euc, the Euclidean distance between the first prediction vector and the second prediction vector; the cosine similarity of the first prediction vector and the second prediction vector; the reordering score obtained by reordering the predicted entities; and the Euclidean distance weight.
A text-enhanced knowledge-graph representation learning system based on a local graph structure, comprising:
the input vector acquisition module is used for acquiring a knowledge graph of the local graph structure and obtaining a first input vector V_sr and a second input vector V_o through calculation from the knowledge graph and an input triplet;
the prediction vector acquisition module is used for constructing a learning model and training the first input vector V_sr and the second input vector V_o through the learning model to obtain a first prediction vector e_sr and a second prediction vector e_o;
the loss function calculation module is used for constructing a scoring function from the first prediction vector e_sr and the second prediction vector e_o and calculating a loss function through the scoring function;
and the link prediction module is used for repeatedly training the learning model until the loss function is smaller than a preset value, obtaining a trained learning model, and performing a link prediction task through the trained learning model.
The invention has the following beneficial effects:
the learning model constructed by the invention fuses the text coding and the local diagram structure information of the knowledge graph, thereby being beneficial to simultaneously acquiring the contextualization and the structured knowledge; the problem of insufficient structural information in a text-based method is solved by integrating the u-hop neighbor entity into the input vector; a structure enhancement mechanism combining the relative position distance is provided to offset prediction deviation caused by semantic difference, so that the finally obtained trained learning model can perform more accurate link prediction task.
Drawings
FIG. 1 is a flow chart of a method according to an embodiment of the present invention;
the achievement of the objects, functional features and advantages of the present invention will be further described with reference to the accompanying drawings, in conjunction with the embodiments.
Detailed Description
It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
Referring to fig. 1, the invention provides a text enhanced knowledge graph representation learning method based on a local graph structure, which comprises the following steps:
S1: obtaining a knowledge graph of a local graph structure, and obtaining a first input vector V_sr and a second input vector V_o through calculation from the knowledge graph and an input triplet;
S2: constructing a learning model, and training the first input vector V_sr and the second input vector V_o through the learning model to obtain a first prediction vector e_sr and a second prediction vector e_o;
S3: constructing a scoring function from the first prediction vector e_sr and the second prediction vector e_o, and calculating a loss function through the scoring function;
S4: repeating the steps S1-S3 until the loss function is smaller than a preset value, obtaining a trained learning model, and performing a link prediction task through the trained learning model.
Further, the step S1 specifically includes:
S11: obtaining a knowledge graph of a local graph structure, consisting of the set of all entities, the set of all relationships and the set of correct triplets; obtaining an input triplet (s, r, o) from the correct triplet set, where s denotes the subject entity embedding, r the relation embedding, and o the object entity embedding;
S12: obtaining the hop neighbor list L_s and the text description T_s of the subject entity whose embedding is s, and obtaining the hop neighbor list L_o and the text description T_o of the object entity whose embedding is o;
S13: obtaining the u-hop neighbor entities L_s[u] of the subject entity from L_s, and combining s, r, L_s[u] and T_s to obtain the first input vector V_sr;
S14: obtaining the u-hop neighbor entities L_o[u] of the object entity from L_o, and combining o, L_o[u] and T_o to obtain the second input vector V_o.
Specifically, when obtaining the hop neighbor lists L_s and L_o, the subject entity s and the object entity o of the input triplet (s, r, o) are excluded from each other's neighbor lists; this is because, in a triplet (s, r, o), the subject entity s and the object entity o are 1-hop neighbors of each other, so merging the subject entity s into the u-hop neighbor entities L_o[u] of the object entity o (or vice versa) would risk label leakage in the prediction results.
Further:
the first input vector V_sr is calculated as:
V_sr = s + L_s[u] + r + T_s
the second input vector V_o is calculated as:
V_o = o + L_o[u] + T_o
specifically, when the hop count u of the u-hop neighbor entity is selected, u=1 is generally selected as the hop count, so that excessive neighbor nodes caused by excessive hop count selection can be avoided, and the acquired neighbor entity information dilutes the information of the original entity and causes the performance degradation of the model.
Further, a text encoder is adopted in step S2 and combined with a decoupled attention mechanism: the attention calculation of the original self-attention mechanism is decoupled into a semantic information part and a relative position information part, and the attention biases are calculated separately, so that the relative position information among all the labels is fused into the text encoder;
the step S2 specifically comprises the following steps:
S21: inputting the first input vector V_sr and the second input vector V_o into the learning model, performing word segmentation on V_sr and V_o to obtain word vectors, and calculating the attention, relative position distance and semantic information among the word vectors to obtain a first hidden layer vector and a second hidden layer vector;
S22: performing feature-extraction forward propagation on the first and second hidden layer vectors through the text encoder to obtain a first language model vector H_sr and a second language model vector H_o;
S23: performing vector processing on H_sr and H_o through pooling and regularization to obtain a first prediction vector e_sr and a second prediction vector e_o of uniform dimension.
Furthermore, the embodiment of the invention adopts an end-to-end text encoder architecture that integrates relative position information. At the input stage of the encoder, the semantic information among the original labels and the relative position information obtained from the relative position encoding are used, and five attention offset vector matrices, K_p, K_s, Q_p, Q_s and V_s, are obtained through linear transformation. These five attention biases are then combined through a multi-head attention mechanism, so that the relative position information and the semantic information are fused. Text encoding is then performed through the forward propagation, aggregation and regularization operations of the encoder; finally, the absolute position information among the labels is fused and the result is passed to the next encoder layer, which performs the same encoding operation, the whole framework consisting of N encoder layers. Through this computation, the invention strengthens the extraction of graph-structure knowledge by fusing relative position information and, via the decoupled attention mechanism inside the encoder, reduces the prediction bias caused by the semantic ambiguity that arises when only semantic information is used; this structure brings a clear improvement in link prediction performance.
The step S21 specifically includes:
S211: performing word segmentation on V_sr to obtain a first word vector set, where token denotes the word segmentation operation, n denotes the total number of first word vectors, and the n-th element is the n-th first word vector; performing word segmentation on V_o to obtain a second word vector set, where m denotes the total number of second word vectors and the m-th element is the m-th second word vector;
S212: calculating the attention components among the word vectors, where H denotes the semantic information hidden state associated with the word vectors, Q_s, K_s and V_s are the semantic vector matrices generated from H through the projection matrices W_qs, W_ks and W_vs respectively, and K_p and Q_p are the relative position vector matrices generated through the projection matrices W_kp and W_qp respectively;
S213: calculating the relative position distance, where a and b are labels of word vectors, D(a, b) denotes the relative position distance from label a to label b, and k denotes the maximum relative position distance;
S214: calculating the attention vector A_{a,b} from label a to label b as the sum of the semantics-to-semantics attention Q_s[a]·K_s[b]^T, the semantics-to-relative-position attention Q_s[a]·K_p[D(a,b)]^T, and the relative-position-to-semantics attention K_s[b]·Q_p[D(a,b)]^T, where Q_s[a] denotes row a of Q_s, K_s[b] denotes row b of K_s, K_p[D(a,b)] denotes row D(a,b) of K_p, and Q_p[D(a,b)] denotes row D(a,b) of Q_p;
S215: constructing the attention operation from the attention vectors among all labels, and applying it to all word vectors to obtain the first hidden layer vector and the second hidden layer vector.
furthermore, the invention adopts a double encoder structure, and utilizes the forward propagation, aggregation and regularization processes of the text encoder to extract information, and the first language model vector H in the step S22 sr And a second language model vector H o The generation process of (2) is as follows:
where the encoder represents the forward propagation and aggregation operations of the text encoder.
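Steps S212-S215 above can be sketched as follows in Python/PyTorch; the clipping used for D(a, b), the softmax scaling and all tensor names are assumptions made for the sake of a runnable illustration, since the patent's exact formulas are not reproduced in this text.

    import torch

    def relative_distance(n, k):
        # Assumed clipping of the signed offset a - b to the maximum relative
        # distance k, shifted into [0, 2k) so it can index rows of K_p / Q_p.
        pos = torch.arange(n)
        d = pos[:, None] - pos[None, :]
        return d.clamp(-k, k - 1) + k

    def decoupled_attention(H, P, W_qs, W_ks, W_vs, W_qp, W_kp, k):
        # H: (n, d) semantic hidden states; P: (2k, d) relative position encodings.
        Q_s, K_s, V_s = H @ W_qs, H @ W_ks, H @ W_vs      # semantic projections
        Q_p, K_p = P @ W_qp, P @ W_kp                     # relative position projections
        D = relative_distance(H.size(0), k)               # (n, n) distance indices
        # A[a, b] = Q_s[a]·K_s[b] + Q_s[a]·K_p[D(a,b)] + K_s[b]·Q_p[D(a,b)]
        A = (Q_s @ K_s.T
             + torch.einsum('ad,abd->ab', Q_s, K_p[D])
             + torch.einsum('bd,abd->ab', K_s, Q_p[D]))
        attn = torch.softmax(A / (H.size(-1) ** 0.5), dim=-1)  # standard scaling, assumed
        return attn @ V_s                                  # hidden layer vectors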
Furthermore, when performing vector processing on the language model vectors, average pooling is first applied to the two vectors to reduce the dimension while retaining the main features, and L2 regularization is then applied to ensure that the two prediction vectors have uniform dimensions; the first prediction vector e_sr and the second prediction vector e_o in step S23 are generated by applying these pooling and regularization operations to H_sr and H_o respectively.
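A minimal sketch of the pooling and regularization of step S23 follows, under the assumption that average pooling is taken over the token dimension; the function name is illustrative.

    import torch.nn.functional as F

    def to_prediction_vector(token_states):
        # token_states: (n_tokens, d) language-model vectors (H_sr or H_o).
        pooled = token_states.mean(dim=0)        # average pooling keeps the main features
        return F.normalize(pooled, p=2, dim=0)   # L2 regularization unifies the scale

    # e_sr = to_prediction_vector(H_sr); e_o = to_prediction_vector(H_o)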
Furthermore, in order to obtain more accurate triplet prediction results, the invention combines the Euclidean distance with the cosine similarity and introduces a reordering score, so that the similarity score between the two prediction vectors can be calculated accurately;
the scoring function in step S3 is constructed from: d_euc, the Euclidean distance between the first prediction vector and the second prediction vector; the cosine similarity of the first prediction vector and the second prediction vector; the reordering score obtained by reordering the predicted entities; and the Euclidean distance weight.
Specifically, in order to capture the spatial similarity of the two prediction vectors more accurately, a Euclidean distance score is introduced. Given the n-dimensional prediction vectors e_sr and e_o, whose projections along each dimension i are e_sr,i and e_o,i, the Euclidean distance is d_euc = sqrt( Σ_i (e_sr,i - e_o,i)^2 ).
in order to control the weights of Euclidean distances in the overall similarity score, we are in the scoring functionThe parameter controlling the Euclidean distance weight is also introduced>In addition, in order to enhance the local graph structure information and optimize the prediction performance, a direct reordering strategy is employed.This is because entities that are closer in distance are expected to have stronger relevance than entities that are farther apart;
the calculation formula of (2) is as follows:
the calculation formula of (2) is as follows:
wherein o is i Representing the calculated predicted guest entity if predicted guest entity o i U-hop neighbor list L of its corresponding subject entity s [u]In (c), then increasing a score assigned to the candidate entity. The scoring function can comprehensively calculate the scores of the two prediction vectors, and plays an important role in improving the prediction performance.
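The scoring strategy described above might be sketched as follows; since the exact expressions are not reproduced in this text, the sign convention, the weight name lam and the size of the reordering bonus are assumptions rather than the patent's literal formula.

    import torch
    import torch.nn.functional as F

    def score(e_sr, e_o, o_candidate, L_s_u, lam=0.1, reorder_bonus=0.1):
        # Cosine similarity of the two prediction vectors.
        cos_sim = F.cosine_similarity(e_sr, e_o, dim=0)
        # Euclidean distance term, weighted by lam; closer vectors score higher,
        # hence the minus sign (assumed convention).
        d_euc = torch.dist(e_sr, e_o, p=2)
        # Reordering bonus when the candidate object entity lies in the u-hop
        # neighbor list L_s[u] of the subject entity.
        s_reorder = reorder_bonus if o_candidate in L_s_u else 0.0
        return cos_sim - lam * d_euc + s_reorder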
Further, in step S3 the InfoNCE loss function is selected as the loss function; it involves an additional margin added during the calculation, a temperature factor that controls the influence of hard negative samples during the calculation, the scores of the negative sampling triplets, and D, the total number of negative sampling triplets;
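An InfoNCE-style loss consistent with the description above could look like the following sketch; the placement of the additive margin and the default values of margin and tau are assumptions.

    import torch

    def info_nce_loss(pos_score, neg_scores, margin=0.02, tau=0.05):
        # pos_score: scalar score of the correct triple; neg_scores: tensor of the
        # D negative-sampling triple scores. tau is the temperature controlling the
        # influence of hard negatives; margin is the additional interval.
        logits = torch.cat([(pos_score - margin).view(1), neg_scores]) / tau
        # Cross-entropy against the positive at index 0 is the InfoNCE loss.
        return -torch.log_softmax(logits, dim=0)[0]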
a text-enhanced knowledge-graph representation learning system based on a local graph structure, comprising:
the input vector acquisition module is used for acquiring a knowledge graph of the local graph structure and obtaining a first input vector V_sr and a second input vector V_o through calculation from the knowledge graph and an input triplet;
the prediction vector acquisition module is used for constructing a learning model and training the first input vector V_sr and the second input vector V_o through the learning model to obtain a first prediction vector e_sr and a second prediction vector e_o;
the loss function calculation module is used for constructing a scoring function from the first prediction vector e_sr and the second prediction vector e_o and calculating a loss function through the scoring function;
and the link prediction module is used for repeatedly training the learning model until the loss function is smaller than a preset value, obtaining a trained learning model, and performing a link prediction task through the trained learning model.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or system that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or system. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or system that comprises the element.
The foregoing embodiment numbers of the present invention are merely for the purpose of description and do not represent the advantages or disadvantages of the embodiments. In unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the terms first, second, third, etc. does not denote any order; these terms are to be interpreted merely as labels.
The foregoing description covers only preferred embodiments of the present invention and is not intended to limit the scope of the invention; any equivalent structure or equivalent process derived from the disclosure herein, whether used directly or indirectly in other related technical fields, likewise falls within the scope of protection of the invention.

Claims (7)

1. A text enhanced knowledge graph representation learning method based on a local graph structure, characterized by comprising the following steps:
S1: obtaining a knowledge graph of a local graph structure, and obtaining a first input vector V_sr and a second input vector V_o through calculation from the knowledge graph and an input triplet;
S2: constructing a learning model, and training the first input vector V_sr and the second input vector V_o through the learning model to obtain a first prediction vector e_sr and a second prediction vector e_o;
S3: constructing a scoring function from the first prediction vector e_sr and the second prediction vector e_o, and calculating a loss function through the scoring function;
S4: repeating the steps S1-S3 until the loss function is smaller than a preset value, obtaining a trained learning model, and performing a link prediction task through the trained learning model.
2. The text-enhanced knowledge graph representation learning method based on the local graph structure according to claim 1, wherein step S1 specifically comprises:
S11: obtaining a knowledge graph of a local graph structure, consisting of the set of all entities, the set of all relationships and the set of correct triplets; obtaining an input triplet (s, r, o) from the correct triplet set, where s denotes the subject entity embedding, r the relation embedding, and o the object entity embedding;
S12: obtaining the hop neighbor list L_s and the text description T_s of the subject entity whose embedding is s, and obtaining the hop neighbor list L_o and the text description T_o of the object entity whose embedding is o;
S13: obtaining the u-hop neighbor entities L_s[u] of the subject entity from L_s, and combining s, r, L_s[u] and T_s to obtain the first input vector V_sr;
S14: obtaining the u-hop neighbor entities L_o[u] of the object entity from L_o, and combining o, L_o[u] and T_o to obtain the second input vector V_o.
3. The text-enhanced knowledge-graph representation learning method based on the local graph structure according to claim 2, wherein:
the first input vector V_sr is calculated as:
V_sr = s + L_s[u] + r + T_s
the second input vector V_o is calculated as:
V_o = o + L_o[u] + T_o
4. the text-enhanced knowledge graph representation learning method based on the partial graph structure according to claim 1, wherein the step S2 is specifically:
s21: to the first input vector V sr And a second input vector V o Input learningModel of V sr And V o Word segmentation processing is carried out to obtain word vectors, and a first hidden layer vector is obtained through calculation of attention, relative position distance and semantic information among the word vectorsAnd a second hidden layer vector->
S22: by text encoder pairsAnd->Performing feature extraction forward propagation to obtain a first language model vector H sr And a second language model vector H o
S23: will H sr And H o Vector processing is carried out through pooling and regularization, and a first prediction vector e with unified dimension is obtained sr And a second predictive vector e o
5. The text-enhanced knowledge graph representation learning method based on the local graph structure according to claim 4, wherein step S21 specifically comprises:
S211: performing word segmentation on V_sr to obtain a first word vector set, where token denotes the word segmentation operation, n denotes the total number of first word vectors, and the n-th element is the n-th first word vector; performing word segmentation on V_o to obtain a second word vector set, where m denotes the total number of second word vectors and the m-th element is the m-th second word vector;
S212: calculating the attention components among the word vectors, where H denotes the semantic information hidden state associated with the word vectors, Q_s, K_s and V_s are the semantic vector matrices generated from H through the projection matrices W_qs, W_ks and W_vs respectively, and K_p and Q_p are the relative position vector matrices generated through the projection matrices W_kp and W_qp respectively;
S213: calculating the relative position distance, where a and b are labels of word vectors, D(a, b) denotes the relative position distance from label a to label b, and k denotes the maximum relative position distance;
S214: calculating the attention vector A_{a,b} from label a to label b as the sum of the semantics-to-semantics attention Q_s[a]·K_s[b]^T, the semantics-to-relative-position attention Q_s[a]·K_p[D(a,b)]^T, and the relative-position-to-semantics attention K_s[b]·Q_p[D(a,b)]^T, where Q_s[a] denotes row a of Q_s, K_s[b] denotes row b of K_s, K_p[D(a,b)] denotes row D(a,b) of K_p, and Q_p[D(a,b)] denotes row D(a,b) of Q_p;
S215: constructing the attention operation from the attention vectors among all labels, and applying it to all word vectors to obtain the first hidden layer vector and the second hidden layer vector.
6. the text-enhanced knowledge graph representation learning method based on the partial graph structure according to claim 1, wherein the scoring function in step S3The expression of (2) is:
wherein d edu Representing the euclidean distance of the first predictive vector from the second predictive vector,representing cosine similarity of the first predictive vector and the second predictive vector,/and the like>Representing a reorder fraction obtained by reordering the predicted entities,/->Is the Euclidean distance weight.
7. A text-enhanced knowledge graph representation learning system based on a local graph structure, comprising:
the input vector acquisition module is used for acquiring a knowledge graph of the local graph structure and obtaining a first input vector V_sr and a second input vector V_o through calculation from the knowledge graph and an input triplet;
the prediction vector acquisition module is used for constructing a learning model and training the first input vector V_sr and the second input vector V_o through the learning model to obtain a first prediction vector e_sr and a second prediction vector e_o;
the loss function calculation module is used for constructing a scoring function from the first prediction vector e_sr and the second prediction vector e_o and calculating a loss function through the scoring function;
and the link prediction module is used for repeatedly training the learning model until the loss function is smaller than a preset value, obtaining a trained learning model, and performing a link prediction task through the trained learning model.
CN202311544608.6A 2023-11-20 2023-11-20 Text enhanced knowledge graph representation learning method and system based on local graph structure Active CN117251583B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311544608.6A CN117251583B (en) 2023-11-20 2023-11-20 Text enhanced knowledge graph representation learning method and system based on local graph structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311544608.6A CN117251583B (en) 2023-11-20 2023-11-20 Text enhanced knowledge graph representation learning method and system based on local graph structure

Publications (2)

Publication Number Publication Date
CN117251583A true CN117251583A (en) 2023-12-19
CN117251583B CN117251583B (en) 2024-01-26

Family

ID=89135505

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311544608.6A Active CN117251583B (en) 2023-11-20 2023-11-20 Text enhanced knowledge graph representation learning method and system based on local graph structure

Country Status (1)

Country Link
CN (1) CN117251583B (en)


Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334219A (en) * 2019-07-12 2019-10-15 电子科技大学 The knowledge mapping for incorporating text semantic feature based on attention mechanism indicates learning method
CN111191044A (en) * 2019-12-25 2020-05-22 湖北大学 Knowledge extraction and fusion method based on big data
WO2022011681A1 (en) * 2020-07-17 2022-01-20 国防科技大学 Method for fusing knowledge graph based on iterative completion
CN112035672A (en) * 2020-07-23 2020-12-04 深圳技术大学 Knowledge graph complementing method, device, equipment and storage medium
US20220164683A1 (en) * 2020-11-25 2022-05-26 Fmr Llc Generating a domain-specific knowledge graph from unstructured computer text
CN113051408A (en) * 2021-03-30 2021-06-29 电子科技大学 Sparse knowledge graph reasoning method based on information enhancement
CN116186216A (en) * 2022-12-30 2023-05-30 福州大学 Question generation method and system based on knowledge enhancement and double-graph interaction
CN116415661A (en) * 2023-03-20 2023-07-11 上海大学 Generalized relation prediction method integrating subgraph enhancement and relation semantics and application

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LI Z: "Learning knowledge graph embedding with heterogeneous relation attention networks", IEEE *
LI Zhifei et al.: "Survey of knowledge graph reasoning based on representation learning", Computer Science (计算机科学) *
HUANG Peixin; ZHAO Xiang; FANG Yang; ZHU Huiming; XIAO Weidong: "End-to-end joint extraction of knowledge triples incorporating adversarial training", Journal of Computer Research and Development (计算机研究与发展), no. 12 *

Also Published As

Publication number Publication date
CN117251583B (en) 2024-01-26

Similar Documents

Publication Publication Date Title
CN111985369B (en) Course field multi-modal document classification method based on cross-modal attention convolution neural network
CN110781683B (en) Entity relation joint extraction method
CN110516095B (en) Semantic migration-based weak supervision deep hash social image retrieval method and system
CN111666406B (en) Short text classification prediction method based on word and label combination of self-attention
CN113313232B (en) Functional brain network classification method based on pre-training and graph neural network
CN112818676A (en) Medical entity relationship joint extraction method
CN112015868A (en) Question-answering method based on knowledge graph completion
Ling et al. Solving optimization problems through fully convolutional networks: An application to the traveling salesman problem
CN114648635B (en) Multi-label image classification method fusing strong correlation among labels
CN113297369A (en) Intelligent question-answering system based on knowledge graph subgraph retrieval
CN112766507B (en) Complex problem knowledge base question-answering method based on embedded and candidate sub-graph pruning
CN113239153B (en) Text and image mutual retrieval method based on example masking
CN116932722A (en) Cross-modal data fusion-based medical visual question-answering method and system
CN114612767A (en) Scene graph-based image understanding and expressing method, system and storage medium
CN113065409A (en) Unsupervised pedestrian re-identification method based on camera distribution difference alignment constraint
CN111259938B (en) Manifold learning and gradient lifting model-based image multi-label classification method
CN116187349A (en) Visual question-answering method based on scene graph relation information enhancement
CN113642602B (en) Multi-label image classification method based on global and local label relation
CN117954081A (en) Intelligent medical inquiry method and system based on graph transducer
CN113240033A (en) Visual relation detection method and device based on scene graph high-order semantic structure
CN117251583B (en) Text enhanced knowledge graph representation learning method and system based on local graph structure
CN116681128A (en) Neural network model training method and device with noisy multi-label data
CN116310545A (en) Cross-domain tongue image classification method based on depth layering optimal transmission
CN114741487B (en) Image-text retrieval method and system based on image-text semantic embedding
CN115496991A (en) Reference expression understanding method based on multi-scale cross-modal feature fusion

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant