CN115618097A - Entity alignment method for prior data insufficient multi-social media platform knowledge graph - Google Patents
Entity alignment method for prior data insufficient multi-social media platform knowledge graph Download PDFInfo
- Publication number
- CN115618097A CN115618097A CN202211075622.1A CN202211075622A CN115618097A CN 115618097 A CN115618097 A CN 115618097A CN 202211075622 A CN202211075622 A CN 202211075622A CN 115618097 A CN115618097 A CN 115618097A
- Authority
- CN
- China
- Prior art keywords
- entity
- similarity
- knowledge graph
- entities
- alignment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 239000011159 matrix material Substances 0.000 claims description 32
- 238000004364 calculation method Methods 0.000 claims description 11
- 238000005070 sampling Methods 0.000 claims description 11
- 238000012549 training Methods 0.000 claims description 10
- 230000004927 fusion Effects 0.000 claims description 8
- 230000008569 process Effects 0.000 claims description 8
- 230000003993 interaction Effects 0.000 claims description 4
- 230000011218 segmentation Effects 0.000 claims description 4
- 230000009467 reduction Effects 0.000 claims description 2
- 230000008485 antagonism Effects 0.000 claims 2
- 230000007246 mechanism Effects 0.000 abstract description 8
- 230000000694 effects Effects 0.000 abstract description 5
- 238000012804 iterative process Methods 0.000 abstract description 4
- 238000009825 accumulation Methods 0.000 abstract description 3
- 239000013589 supplement Substances 0.000 abstract description 2
- 239000013598 vector Substances 0.000 description 15
- 230000006870 function Effects 0.000 description 8
- 230000003042 antagnostic effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000002950 deficient Effects 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008450 motivation Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 238000007500 overflow downdraw method Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- Computing Systems (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Health & Medical Sciences (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to an entity alignment method for a knowledge graph of a multi-social media platform with insufficient prior data, which utilizes knowledge graph information to supplement the prior information of the multi-social media platform, so that the accuracy of entity alignment is improved; the invention adds the entities which are possible to be aligned into the prior data set in the iterative process by introducing an iterative mechanism, and simultaneously introduces an alignment judgment mechanism to prevent the continuous accumulation of errors in the iterative process; the invention also introduces an embedding distribution alignment mechanism, restricts the shape of the unmarked entity in the embedding space based on the countermeasure network, and enables the entity source embedding and the target embedding distribution to be approximately isomorphic on the premise of not needing more prior data. The invention provides an entity alignment model architecture, which can achieve better effect only by a small amount of prior information and effectively reduce the dependence on data labels.
Description
Technical Field
The invention relates to the field of machine learning and the field of natural language processing, in particular to an entity alignment method for a knowledge graph of multiple social media platforms with insufficient prior data.
Background
With the continuous development of the internet in recent years, the activities of users on social media platforms are more and more active. The user can pay attention to activities and hot news of the large V user on Facebook, and can add friend chatting and sharing trends on Twitter, and a knowledge graph formed by social media data of the user comprises various contents such as the user, events, subjects and the like. Due to the different main functions of different social media platforms, the data of the user on different social media has no obvious relevance. The process of judging that different entities in the knowledge graph constructed by the social media platform point to things in the same real world is called entity alignment, and the realization of the entity alignment has important significance for applications such as user portrayal, recommendation systems and the like.
The study of entity alignment based on knowledge graph embedding receives more and more attention, and the entity is embedded into a low-dimensional feature space by a representation learning method based on the assumption that equivalent entities in different knowledge graphs have similar neighborhood structures, and the vector similarity is calculated to find potential aligned entity pairs. Meanwhile, in order to solve the problem of data sparseness, many studies have been made to enhance the performance of entity alignment using additional information.
Entity alignment based on knowledge graph embedding relies on annotated prior information, but data annotation is typically costly and requires a significant amount of time and money. The accuracy of the current semi-supervised entity alignment algorithm cannot be guaranteed under the condition of a small amount of labeled data. Therefore, it becomes more important to maintain a high accuracy while reducing the dependency on the a priori information.
Disclosure of Invention
Technical problem to be solved
Aiming at the problem that the accuracy of the entity alignment result of the knowledge graph of the multiple social media platforms is low under the condition of insufficient prior data, the invention provides an entity alignment method for the knowledge graph of the multiple social media platforms under the condition of insufficient prior data, provides a model for introducing an iteration strategy and an embedding distribution alignment strategy, adds a newly discovered possible aligned entity in the training process into a training set, realizes the dynamic increase of the prior data, and simultaneously restrains the shape of an unmarked entity in an embedding space based on an antagonistic network. The method can supplement the information of the original data, thereby ensuring the accuracy of entity alignment under the condition of insufficient prior data.
Technical scheme
An entity alignment method for a priori data insufficient multi-social media platform knowledge graph is characterized by comprising the following steps:
s1: constructing a knowledge graph according to attributes, data and interaction of a user on a plurality of social media platforms;
s2: obtaining a similarity matrix of three dimensions from the structure, the semantics and the character string according to the knowledge graph, and performing feature fusion to obtain initial entity similarity;
s3: iteratively adding the possibly aligned entities with high entity similarity into a training data set in an automatic marking mode to be used as prior data, and realizing dynamic increase of the prior data;
s4: frequency sampling of unlabeled entities, aligning the embedded distribution of source and target knowledge-maps in generating a countermeasure network, may enable a reduction in the distance between alignable entities;
s5: and recalculating the similarity of the structure, and performing feature fusion to obtain the result of entity alignment.
The further technical scheme of the invention is as follows: in the knowledge graph constructed in the S1, the central words of the users, the attributes and the comment content after word segmentation are all used as entity nodes, and the knowledge graph is constructed by the user and the user, the user and the attributes and the user and the comment central words in a triple < h, r and t >.
The further technical scheme of the invention is as follows: the initial entity similarity calculation of the S2 comprises three similarity matrixes obtained based on the knowledge graph of the S1: and fusing the similarity matrixes of the three dimensions to obtain a unified entity similarity matrix.
The further technical scheme of the invention is as follows: in S3, the iteration strategy specifically includes:
s31: according to the S2 entity initial similarity, for any entity x, when the similarity between the entity y and the entity x is larger than a threshold value, adding the entity y into a candidate alignment entity of the entity x;
s32: constructing a bipartite graph by using all x and y which meet a condition, wherein nodes represent entities, and edges represent probabilities that the nodes can be aligned;
s33: searching the edges with the maximum probability value and no intersection in the bipartite graph to obtain a pair of entities which are one-in-one and most likely to be aligned, and marking alignment labels on the pairs of entities;
s34: in the iteration process, an alignment judgment method is used for preventing the marked entity from being repeatedly marked or becoming unmarked;
s35: automatically tagged pairs of entities are added to pairs of seed entities, the two knowledgemaps are connected using the entities, and the own knowledgemap is enriched with information from another entity.
The further technical scheme of the invention is as follows: in S4, the embedding distribution strategy specifically includes:
s41: frequency sampling is carried out on all entity pairs;
s42: an indicator defining the GAN represents from which knowledge-graph the entity came;
s43: judging domain features among classified entities by a GAN (generic identifier);
s44: the least square generates an antagonistic network LSGAN as an antagonistic loss, which selects the least square loss as a discriminator, and the antagonistic module adopts a 0-1 coding method.
The further technical scheme of the invention is as follows: in S5, re-fusing the similarity matrices of the three dimensions to obtain a uniform entity similarity matrix specifically includes:
s51: calculating the similarity of the obtained sums to obtain a similarity matrix embedded with the structural features;
s52: and setting the same weight of each dimension, and obtaining a uniform entity similarity matrix by weighted average.
Advantageous effects
According to the entity alignment method for the knowledge graph of the multi-social-media platform with insufficient prior data, the prior information of the multi-social-media platform is supplemented by the knowledge graph information, so that the accuracy of entity alignment is improved; the invention adds the entities which are possible to be aligned into the prior data set in the iterative process by introducing an iterative mechanism, and simultaneously introduces an alignment judgment mechanism to prevent the continuous accumulation of errors in the iterative process; the invention also introduces an embedding distribution alignment mechanism, restricts the shape of the unmarked entity in the embedding space based on the countermeasure network, and enables the entity source embedding and the target embedding distribution to be approximately isomorphic on the premise of not needing more prior data. The invention provides an entity alignment model architecture, which can achieve better effect only by a small amount of prior information and effectively reduce the dependence on data labels.
Drawings
The drawings are only for purposes of illustrating particular embodiments and are not to be construed as limiting the invention, wherein like reference numerals are used to designate like parts throughout.
FIG. 1 is an example diagram of a knowledge graph of an entity alignment method for a priori data-deficient multi-social media platform knowledge graph;
FIG. 2 is an embedded distributed alignment motivation diagram for an entity alignment method for a priori data insufficient multi-social media platform knowledge graph;
FIG. 3 is a model flow diagram of an entity alignment method for a priori data-deficient multi-social media platform knowledge graph.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
An entity alignment method for a priori data insufficient multi-social media platform knowledge graph comprises the following steps:
s1, constructing a knowledge graph according to attributes, data and interaction of a user on a plurality of social media platforms;
s2, obtaining a similarity matrix of three dimensions from the structure, the semantics and the character string according to the knowledge graph, and performing feature fusion to obtain initial entity similarity;
s3, iteratively adding entities which are possibly aligned into the training data set in an automatic marking mode to realize dynamic increase of prior data;
s4, performing frequency sampling on the unmarked entities, and aligning the embedded distribution of the source knowledge graph and the target knowledge graph in the generation of the countermeasure network, so that the distance between the entities capable of being aligned is reduced;
and S5, recalculating the similarity of the structure, and performing feature fusion to further obtain the result of entity alignment.
The technical solution of the present invention is described in detail below:
s1, constructing a knowledge graph according to attributes, data and interaction of a user on a plurality of social media platforms;
in the constructed knowledge graph, the central words of the users, the attributes and the comment content after word segmentation are all used as entity nodes, and the knowledge graph is constructed among the users, the attributes and the comments in a triple < h, r and t > mode. Specifically, a knowledge graph is constructed by taking a user ID, a user nickname, a user comment subject and a user comment keyword as entity nodes, wherein the nodes are < user, friend, user >, < user, nickname, user nickname >, < user, discussion, user comment subject >, < user, comment, user comment keyword >, < user comment keyword >, comment connection and user comment keyword >.
S2, obtaining a similarity matrix of three dimensions from the structure, the semantics and the character string according to the knowledge graph, and performing feature fusion to obtain initial entity similarity;
the specific process for obtaining the initial entity similarity comprises the following steps:
constructing a dual relation graph, wherein the relation in the graph is represented as the weight of the two relations, the weight is represented as the similarity value of the two relations, and the weight calculation formula is as follows:
wherein r is i Representing a certain relationship in the knowledge-graph, r j Representing a certain relationship in the knowledge-graph, H i Representing relationships r in a knowledge graph i Connected set of head nodes, H j Representing relationships r in a knowledge graph j Connected set of head nodes, H represents the relation r i And r j Similarity value of connected head nodes, T i Representing relationships r in a knowledge graph i Connected tail node set, H j Representing relationships r in a knowledge graph j Connected tail node set, T representing the relationship r i And r j A similarity value of the connected tail nodes;
applying a graph attention mechanism to iterate to obtain node representations e of the dual relationship graph and the original graph;
and calculating the similarity among the vectors by using a cosine similarity algorithm to obtain a structural similarity matrix, wherein the cosine similarity calculation formula is as follows:
wherein e i Vector representation representing node i in the knowledge-graph, e j Representing vector representation of a node j in the knowledge graph, and sim represents structural similarity;
performing Word segmentation on comment corpora of all users in original data, and inputting the comment corpora into Word2vec, fastText and GLOVE pre-training Word vector models to train models;
inputting entities in the knowledge graph into three word vector models to obtain vector representation;
calculating a similarity matrix between vectors by using a cosine similarity algorithm to obtain a semantic similarity matrix;
measuring the difference between the two character strings by adopting a Levenshtein distance;
calculating the Levenshtein ratio represents the similarity between entity names, and the calculation formula is as follows:
wherein e i Representing entity name vector representations in a knowledge graph, e j Representing entity name vector representations in the knowledge graph, lev representing a Levenshtein distance between two character string vector representations, and r representing similarity between entity names;
assuming that each dimension has equal importance to obtain a uniform entity similarity matrix, the calculation formula is as follows:
wherein S k Representing a string similarity matrix, a semantic similarity matrix, a structural similarity matrix, mean being a mean function, std being a function of calculating the standard deviation,is a normalized matrix representing the similarity of character strings, a normalized matrix of semantic similarity, a normalized matrix of structural similarity, S * Representing a similarity average matrix;
s3, iteratively adding entities which are possibly aligned into a training data set in an automatic marking mode to realize dynamic increase of prior data;
the automatic marking and adding of the entity to the training data set specifically comprises:
s31: according to the initial entity similarity calculated in the S2, for any entity x, when the similarity between the entity y and the entity x is larger than a threshold value, adding the entity y into a candidate alignment entity of the entity x; for example, in a ground t-round iteration:
π(y|x;Θ t )=σ(sim(x,y))
wherein x represents any entity in the knowledge graph, y represents any entity except x in the knowledge graph, sim represents the similarity between the entity x and the entity y obtained according to the similarity matrix fusing the multidimensional characteristics in S1, and theta t The representation is obtained by calculation according to the multidimensional characteristics of the t-th round, and pi represents the probability of distributing labels;
Y x '={y|y∈Y'andπ(y|x;Θ t )>γ 1 }
wherein psi t To representIndicating function, X 'represents entity set, Y' x Representing candidate aligned entities of entity x, max representing a maximum function;
s32: constructing a bipartite graph by using all x, y satisfying conditions, wherein nodes represent entities, and edges represent probabilities that the nodes can be aligned;
s33: searching the edges with the maximum probability values and without intersection in the bipartite graph to obtain a pair of entities which are one-to-one and most likely to be aligned, and marking alignment labels on the pair of entities;
s34: in the iteration process, an alignment judgment method is used for preventing the marked entity from being repeatedly marked or becoming unmarked;
when the entities conflict, the model expects to obtain an entity pair with higher possibility, and the calculation formula of the similarity difference of the entity pair is as follows:
wherein x represents any entity, y represents one candidate aligned entity of the entity x, y' represents another candidate aligned entity of the entity x, pi represents the above assigned tag probability in S21,representing differences in entity pair similarity;
s35: the automatically labeled entity pairs are iteratively added to the seed entity pair, the two knowledge-graphs are connected using the entities, and the knowledge-graph itself is enriched with information from another entity.
S4, performing frequency sampling on the unmarked entities, and aligning the embedded distribution of the source knowledge graph and the target knowledge graph in the generation of the countermeasure network, so that the distance between the entities capable of being aligned is reduced;
the method comprises the following specific steps:
s41: frequency sampling is carried out on all entity pairs;
the calculation formula of the frequency sampling vector is as follows:
wherein f is hi Representing the number of entities i as head entities in all triples, f ti Representing the number of entities i as tail entities in all triples, P i Representing the frequency sampling vector of the entity i, and E representing the number of nodes in the knowledge graph;
s42: an indicator defining the GAN represents from which knowledge-graph the entity came;
s43: judging domain features among classified entities by a GAN (generic area network) discriminator;
the definition of the optimal discriminator is as follows:
wherein KG s Representing source knowledge-graph, KG t Representing a target knowledge-graph, D * Is the best discriminator;
s44: the least square generation countermeasure network (LSGAN) is used as a countermeasure loss, the least square loss is used as a discriminator, and a 0-1 coding method is adopted by a countermeasure module.
D and G are defined as follows:
where δ represents the tag smoothing value, e s Representing the inputs to generate the countermeasure network module based on the S2 structural feature embedding module described above, e t Representing inputs to generate a countermeasure network module based on the S2 structural feature embedding module, D representing a discriminator indicating from which knowledge graph the entity came, G representing a generator from which the countermeasure network module was trained, λ 1 Representing the ratio of loss functions, P, used to correct for between different building blocks t Representing a target knowledge-graph KG t Sampling frequency vector of (P) s Representing a target knowledge-graph KG s The sampling frequency vector of (a);
s5, recalculating the similarity of the structures, and performing feature fusion;
the similarity calculation formula for the structural feature embedding is as follows:
d(Ge s ,e t )=||Ge s -e t ||
wherein e s Representing the inputs to generate the countermeasure network module based on the S2 structural feature embedding module described above, e t Representing the input of the generation countermeasure network module based on the S2 structural feature embedding module, G representing a generator which is well trained by the generation countermeasure network module, and d representing the similarity of the structural feature embedding;
and (3) calculating the final similarity of the entities by using the same fusion method in the S1:
the calculation formula is as follows:
wherein S k Representing a string similarity matrix, a semantic similarity matrix, a structural similarity matrix, mean being a mean function, std being a function of calculating the standard deviation,is a normalized matrix representing the similarity of character strings, a normalized matrix of semantic similarity, a normalized matrix of structural similarity, S * Representing a similarity average matrix;
and finally, obtaining an entity alignment result according to the similarity matrix between the entities.
The invention provides an entity alignment method for a knowledge graph of multiple social media platforms with insufficient prior data. Since users and user data of multiple social media platforms are not directly related, it takes a lot of time and money to label samples when making entity alignments. The invention provides an entity alignment method introducing an iteration strategy and an embedded distribution alignment strategy. The iteration strategy realizes the dynamic increase of prior data by adding a newly discovered possible alignment entity in the training process into a training set, and simultaneously introduces an alignment judgment mechanism to prevent the continuous accumulation of errors in the iteration process; the embedding distribution alignment strategy is based on that the shape of the unmarked entity in the embedding space is constrained by the countermeasure network, namely the shape of the unmarked entity is required to be similar as much as possible, so as to ensure the accuracy of entity alignment under the condition of insufficient prior data. The invention utilizes knowledge map information, introduces an iteration strategy and an embedded distribution alignment strategy, makes up for insufficient prior data of the original user, and ensures that the accuracy of entity alignment is higher.
While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications or substitutions can be easily made by those skilled in the art within the technical scope of the present disclosure.
Claims (6)
1. An entity alignment method for a priori data insufficient multi-social media platform knowledge graph is characterized by comprising the following steps:
s1: constructing a knowledge graph according to attributes, data and interaction of a user on a plurality of social media platforms;
s2: obtaining a similarity matrix of three dimensions from the structure, the semantics and the character string according to the knowledge graph, and performing feature fusion to obtain initial entity similarity;
s3: the entity with high entity similarity and possible alignment is added into the training data set in an automatic marking mode in an iterative mode and used as prior data, and dynamic increase of the prior data is achieved;
s4: frequency sampling of unlabeled entities, aligning the embedded distribution of source and target knowledge-maps in generating a countermeasure network, may enable a reduction in the distance between alignable entities;
s5: and recalculating the similarity of the structure, and performing feature fusion to obtain the result of entity alignment.
2. The entity alignment method for the a priori data insufficient multi-social media platform knowledge graph according to claim 1, wherein: in the knowledge graph constructed in the S1, the central words of the users, the attributes and the comment content after word segmentation are all used as entity nodes, and the knowledge graph is constructed by the user and the user, the user and the attributes and the user and the comment central words in a triple < h, r and t >.
3. The entity alignment method for the a priori data insufficient multi-social media platform knowledge graph according to claim 2, wherein the entity alignment method comprises the following steps: the initial entity similarity calculation of the S2 comprises three similarity matrixes obtained based on the knowledge graph of the S1: and fusing the similarity matrixes of the three dimensions to obtain a unified entity similarity matrix.
4. The entity alignment method for the priori data insufficient multi-social media platform knowledge graph according to claim 3, wherein in S3, the iteration strategy specifically comprises:
s31: according to the S2 entity initial similarity, for any entity x, when the similarity between the entity y and the entity x is larger than a threshold value, adding the entity y into a candidate alignment entity of the entity x;
s32: constructing a bipartite graph by using all x, y satisfying conditions, wherein nodes represent entities, and edges represent probabilities that the nodes can be aligned;
s33: searching the edges with the maximum probability value and no intersection in the bipartite graph to obtain a pair of entities which are one-in-one and most likely to be aligned, and marking alignment labels on the pairs of entities;
s34: in the iteration process, an alignment judgment method is used for preventing the marked entity from being repeatedly marked or becoming unmarked;
s35: automatically tagged pairs of entities are added to pairs of seed entities, the two knowledgemaps are connected using the entities, and the own knowledgemap is enriched with information from another entity.
5. The entity alignment method for the a priori data insufficient multi-social media platform knowledge graph according to claim 4, wherein the entity alignment method comprises the following steps: in S4, the embedding distribution strategy specifically includes:
s41: frequency sampling is carried out on all entity pairs;
s42: an indicator defining GAN represents from which knowledge-graph an entity is;
s43: judging domain features among classified entities by a GAN (generic identifier);
s44: the least square generates the confrontation network LSGAN as the antagonism loss, which selects the least square loss as the discriminator, and the antagonism module adopts the 0-1 coding method.
6. The entity alignment method for a priori data insufficient multi-social media platform knowledge graph according to claim 5, wherein: in S5, re-fusing the similarity matrices of the three dimensions to obtain a uniform entity similarity matrix specifically includes:
s51: calculating the similarity of the obtained sums to obtain a similarity matrix embedded with the structural features;
s52: and setting the same weight of each dimension, and obtaining a uniform entity similarity matrix by weighted average.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211075622.1A CN115618097A (en) | 2022-09-05 | 2022-09-05 | Entity alignment method for prior data insufficient multi-social media platform knowledge graph |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211075622.1A CN115618097A (en) | 2022-09-05 | 2022-09-05 | Entity alignment method for prior data insufficient multi-social media platform knowledge graph |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115618097A true CN115618097A (en) | 2023-01-17 |
Family
ID=84858508
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211075622.1A Pending CN115618097A (en) | 2022-09-05 | 2022-09-05 | Entity alignment method for prior data insufficient multi-social media platform knowledge graph |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115618097A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116610820A (en) * | 2023-07-21 | 2023-08-18 | 智慧眼科技股份有限公司 | Knowledge graph entity alignment method, device, equipment and storage medium |
CN117370583A (en) * | 2023-12-08 | 2024-01-09 | 湘江实验室 | Knowledge-graph entity alignment method and system based on generation of countermeasure network |
-
2022
- 2022-09-05 CN CN202211075622.1A patent/CN115618097A/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116610820A (en) * | 2023-07-21 | 2023-08-18 | 智慧眼科技股份有限公司 | Knowledge graph entity alignment method, device, equipment and storage medium |
CN116610820B (en) * | 2023-07-21 | 2023-10-20 | 智慧眼科技股份有限公司 | Knowledge graph entity alignment method, device, equipment and storage medium |
CN117370583A (en) * | 2023-12-08 | 2024-01-09 | 湘江实验室 | Knowledge-graph entity alignment method and system based on generation of countermeasure network |
CN117370583B (en) * | 2023-12-08 | 2024-03-19 | 湘江实验室 | Knowledge-graph entity alignment method and system based on generation of countermeasure network |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112131404B (en) | Entity alignment method in four-risk one-gold domain knowledge graph | |
Yan et al. | Image classification by cross-media active learning with privileged information | |
CN111753024B (en) | Multi-source heterogeneous data entity alignment method oriented to public safety field | |
CN115618097A (en) | Entity alignment method for prior data insufficient multi-social media platform knowledge graph | |
CN112215837B (en) | Multi-attribute image semantic analysis method and device | |
WO2021139247A1 (en) | Construction method, apparatus and device for medical domain knowledge map, and storage medium | |
CN114090783A (en) | Heterogeneous knowledge graph fusion method and system | |
CN112988917B (en) | Entity alignment method based on multiple entity contexts | |
CN107077487A (en) | Personal photo is tagged using depth network | |
WO2024099037A1 (en) | Data processing method and apparatus, entity linking method and apparatus, and computer device | |
CN112883289B (en) | PMF recommendation method based on social trust and tag semantic similarity | |
WO2022262320A1 (en) | Information completion method for knowledge graph-based power distribution network cim model, and system | |
CN110245349A (en) | A kind of syntax dependency parsing method, apparatus and a kind of electronic equipment | |
Lin et al. | Deep unsupervised hashing with latent semantic components | |
CN115438197A (en) | Method and system for complementing relationship of matter knowledge map based on double-layer heterogeneous graph | |
CN113158798A (en) | Short video classification method based on multi-mode feature complete representation | |
WO2023168810A1 (en) | Method and apparatus for predicting properties of drug molecule, storage medium, and computer device | |
Zhou et al. | Rank2vec: learning node embeddings with local structure and global ranking | |
CN115438274A (en) | False news identification method based on heterogeneous graph convolutional network | |
Ou et al. | Cross-modal generation and pair correlation alignment hashing | |
CN116386895B (en) | Epidemic public opinion entity identification method and device based on heterogeneous graph neural network | |
Perdana et al. | Instance-based deep transfer learning on cross-domain image captioning | |
Li et al. | Person text-image matching via text-feature interpretability embedding and external attack node implantation | |
CN114691880A (en) | Knowledge graph construction method and device and electronic equipment | |
Peng et al. | TH-SLP: Web service link prediction based on topic-aware heterogeneous graph neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |