CN113051410A - Scientific research cooperative group discovery method based on density clustering - Google Patents

Scientific research cooperative group discovery method based on density clustering Download PDF

Info

Publication number
CN113051410A
CN113051410A CN201911380838.7A CN201911380838A CN113051410A CN 113051410 A CN113051410 A CN 113051410A CN 201911380838 A CN201911380838 A CN 201911380838A CN 113051410 A CN113051410 A CN 113051410A
Authority
CN
China
Prior art keywords
expert
experts
core
cooperation
dist
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911380838.7A
Other languages
Chinese (zh)
Inventor
陈盛之
李千目
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Science and Technology
Original Assignee
Nanjing University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Science and Technology filed Critical Nanjing University of Science and Technology
Priority to CN201911380838.7A priority Critical patent/CN113051410A/en
Publication of CN113051410A publication Critical patent/CN113051410A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/101Collaborative creation, e.g. joint development of products or services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Databases & Information Systems (AREA)
  • Strategic Management (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Economics (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Library & Information Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a scientific research cooperative group discovery method based on density clustering. The method comprises the following steps: importing batch thesis data, and taking the batch thesis data as a training set; preprocessing the thesis data, and extracting expert cooperation information, namely an expert name set in the same thesis; constructing an expert cooperation relationship network by utilizing expert cooperation information, and calculating a shortest path between an expert and the expert by utilizing a Dijkstra algorithm; constructing expert cooperation of each expert, belonging to the field, by utilizing the shortest path between the experts, wherein belonging to the field represents the radius of the field; determining core experts by utilizing the cooperation of the experts belonging to the number of the experts in the field, and constructing a core expert set; and obtaining all the cooperative community clusters in the expert cooperative network according to the core expert set. The invention can quickly find out expert cooperation teams and core experts in a given paper set, and provides effective support for subsequent literature measurement.

Description

Scientific research cooperative group discovery method based on density clustering
Technical Field
The invention relates to the field of discovery of cooperative groups, in particular to a scientific research cooperative group discovery method based on density clustering.
Background
Scientific research cooperation refers to the work form of individual and individual, individual and group, and group of scientific research personnel collaborating with each other according to plans to complete the same scientific research task. Scientific research is a complex and arduous group labor, and the interplay between people in scientific research activities directly influences the completion of scientific research cooperation and scientific research plans. Scientific research has been in a relatively decentralized, disorganized state over a considerable period of past history, and is a "free study" conducted by individuals, and has been largely completed by those "masters" and "inventors".
With the progress of modern scientific research, the importance of scientific research cooperation is being recognized by more and more people. The scientific research cooperation is a need for overcoming scientific difficulty and promoting scientific and technological progress, and how to organize and coordinate the scientific research cooperation at present is an important subject faced by managers. There are many cross-domain collaborations in the scientific research field, and the collaboration is often embodied in the form of a paper. Tracking and mastering the cooperative team relationship of experts is particularly important in the scientific education promotion. However, the discovery of the scientific research cooperation team at present mainly depends on a community discovery related algorithm or manual marking, and the algorithm is high in complexity, low in accuracy and long in time consumption.
Disclosure of Invention
The invention aims to provide a scientific research cooperative group discovery method capable of quickly and accurately finding out expert cooperative groups and core experts in a given paper set, and the core of the scientific research groups and the cores of the scientific research groups are automatically excavated by utilizing the characteristic of good adaptivity of density clustering.
The technical solution for realizing the purpose of the invention is as follows: a scientific research cooperative group discovery method based on density clustering comprises the following steps:
step 1, importing batch thesis data, and taking the batch thesis data as a training set;
step 2, preprocessing the thesis data, and extracting expert cooperation information, namely an expert name set in the same thesis;
step 3, constructing an expert cooperation relationship network by utilizing expert cooperation information, and calculating the shortest path between the experts by utilizing a Dijkstra algorithm;
step 4, establishing expert cooperation of each expert in the domain by using the shortest path between the experts, wherein the expert cooperation belongs to the domain, and the radius of the domain is represented by the belonging to the domain;
step 5, determining core experts by utilizing the cooperation of the experts belonging to the number of the experts in the field, and constructing a core expert set;
and 6, acquiring all the cooperative community clusters in the expert cooperative network according to the core expert set.
Further, the expert cooperation information is used to construct an expert cooperation relationship network in step 3, and a Dijkstra algorithm is used to calculate the shortest path between the experts, specifically as follows:
step 3.1, according to the list information of the authors of the paper, the authors of the paper are taken as nodes, the authors of the paper are connected together by using edges, the reciprocal of the number of the paper is taken as the weight of the edges, the undirected weighted graph G (V, E) of the expert cooperation information is constructed, and the weight w of the edges between the node i and the node jijObtained by the following formula:
Figure BDA0002342188800000021
v represents a point set of the undirected weighted graph of the expert cooperation information, E represents an edge set of the undirected weighted graph of the expert cooperation information, and Count represents the number of cooperation papers among experts;
step 3.2, calculating the shortest path between the expert and the expert in the undirected weighted graph G of the expert cooperation information by utilizing a Dijkstra algorithm, and specifically comprising the following steps:
step 3.2.1, input expert cooperation information undirected weighted graph G (V, E), input target expert name as source point V0
Step 3.2.2, representing undirected weighted graph arcs [ m ] by adjacency matrix arcs][n]Representing edges<vm,vn>If there is no edge<vm,vn>Then arcs [ m ]][n]Infinity, where m, n ∈ { m | vm∈V};
Step 3.2.3, set S record the peak of the shortest path, make set S initial value as { v }0};
Step 3.2.4, set the array dist [ 2 ]]Recording from the source point v0To the other respectiveA vertex viCurrent shortest path length of, dist i]The initial value is arcs [0 ]][i]Where i ∈ { i | v [ ]i∈V};
Step 3.2.5, selecting V from vertex set V-SjSatisfies dist [ j]=Min{dist[i]|vi∈V-S},vjI.e. a currently determined one of the variables v0The end point of the starting shortest path is that the set S is S ═ S { U { v }j};
Step 3.2.6, modify the Slave Source Point v0To each vertex V in the set V-SkShortest path length of (2): if dist [ j ]]+arcs[j][k]<dist[k]Let dist [ k ]]=dist[j]+arcs[j][k];
Step 3.2.7, repeating the steps 3.2.5-3.2.6 until the set V-S is an empty set;
step 3.2.8, output array dist [ ]]Wherein the expert points viAnd a target expert source point v0Has a shortest distance of dist [ i ]]。
Further, the step 4 of constructing the expert cooperation e field of each expert by using the shortest path between the experts specifically includes the following steps:
step 4.1, inputting a preset value epsilon and the expert cooperative relationship network generated in the step 3;
and 4.2, traversing the expert set, and selecting all experts with the distance not more than the e from the expert i as the field of the e of the expert for each expert i.
Further, in the step 5, the core experts are determined by using the cooperation of the experts belonging to the number of experts in the field, and a core expert set is constructed, specifically as follows:
step 5.1, inputting a preset value MinPts and the field belonging to the expert generated in the step 4;
step 5.2, calculating the total number of experts belonging to the field of the experts, and if the value is greater than a preset value MinPts, regarding the experts as core experts and putting the experts into a core expert set;
and 5.3, repeating the step 5.2 until all experts are traversed, and outputting a core expert set.
Further, the step 6 of obtaining all the collaboration group clusters in the expert collaboration network according to the core expert set specifically includes:
step 6.1, arbitrarily selecting a core expert in the core expert set, finding out all experts with reachable density and generating a cooperative group cluster of the experts; the density is defined by the expert as follows:
for expert i and expert j, if there is an expert sequence P1,P2,…,PnIn which P is1=i,PnJ, and Pm+1Is PmThe expert j is called as the density reachable expert of the expert i, wherein the density reachable expert is defined as follows:
if the expert j belongs to the field of the expert i, the expert j is a direct expert with the density of the expert i;
6.2, removing the experts with the reachable density found in the step 6.1 from the rest of the core experts;
step 6.3, repeating the step 6.1-6.2 from the updated core expert set until all the core experts are traversed or removed;
and 6.4, outputting the cooperative group cluster of the experts.
Compared with the prior art, the invention has the remarkable advantages that: (1) by utilizing the characteristic of good adaptability of density clustering, the centers of scientific research teams and scientific research teams can be automatically excavated, the method is simple, and the efficiency and the accuracy are high; (2) the expert cooperation team and the core experts in a given paper set can be quickly found, and effective support is provided for subsequent literature measurement.
Drawings
FIG. 1 is a schematic flow chart of the scientific research cooperative group discovery method based on density clustering.
Detailed Description
A scientific research cooperative group discovery method based on density clustering comprises the following steps:
step 1, importing batch thesis data, and taking the batch thesis data as a training set;
step 2, preprocessing the thesis data, and extracting expert cooperation information, namely an expert name set in the same thesis;
step 3, constructing an expert cooperation relationship network by utilizing expert cooperation information, and calculating the shortest path between the experts by utilizing a Dijkstra algorithm;
step 4, establishing expert cooperation of each expert in the domain by using the shortest path between the experts, wherein the expert cooperation belongs to the domain, and the radius of the domain is represented by the belonging to the domain;
step 5, determining core experts by utilizing the cooperation of the experts belonging to the number of the experts in the field, and constructing a core expert set;
and 6, acquiring all the cooperative community clusters in the expert cooperative network according to the core expert set.
Further, the expert cooperation information is used to construct an expert cooperation relationship network in step 3, and a Dijkstra algorithm is used to calculate the shortest path between the experts, specifically as follows:
step 3.1, according to the list information of the authors of the paper, the authors of the paper are taken as nodes, the authors of the paper are connected together by using edges, the reciprocal of the number of the paper is taken as the weight of the edges, the undirected weighted graph G (V, E) of the expert cooperation information is constructed, and the weight w of the edges between the node i and the node jijObtained by the following formula:
Figure BDA0002342188800000041
v represents a point set of the undirected weighted graph of the expert cooperation information, E represents an edge set of the undirected weighted graph of the expert cooperation information, and Count represents the number of cooperation papers among experts;
step 3.2, calculating the shortest path between the expert and the expert in the undirected weighted graph G of the expert cooperation information by utilizing a Dijkstra algorithm, and specifically comprising the following steps:
step 3.2.1, input expert cooperation information undirected weighted graph G (V, E), input target expert name as source point V0
Step 3.2.2, representing undirected weighted graph arcs [ m ] by adjacency matrix arcs][n]Indicating side < v >m,vn>If there is no edge<vm,vn>Then arcs [ m ]][n]Infinity, where m, n ∈ { m | vm∈V};
Step 3.2.3, set S record the peak of the shortest path, make set S initial value as { v }0};
Step 3.2.4, set the array dist [ 2 ]]Recording from the source point v0To other respective vertex viCurrent shortest path length of, dist i]The initial value is arcs [0 ]][i]Where i ∈ { i | v [ ]i∈V};
Step 3.2.5, selecting V from vertex set V-SjSatisfies dist [ j]=Min{dist[i]|vi∈V-S},vjI.e. a currently determined one of the variables v0The end point of the starting shortest path is that the set S is S ═ S { U { v }j};
Step 3.2.6, modify the Slave Source Point v0To each vertex V in the set V-SkShortest path length of (2): if dist [ j ]]+arcs[j][k]<dist[k]Let dist [ k ]]=dist[j]+arcs[j][k];
Step 3.2.7, repeating the steps 3.2.5-3.2.6 until the set V-S is an empty set;
step 3.2.8, output array dist [ ]]Wherein the expert points viAnd a target expert source point v0Has a shortest distance of dist [ i ]]。
Further, the step 4 of constructing the expert cooperation e field of each expert by using the shortest path between the experts specifically includes the following steps:
step 4.1, inputting a preset value epsilon and the expert cooperative relationship network generated in the step 3;
and 4.2, traversing the expert set, and selecting all experts with the distance not more than the e from the expert i as the field of the e of the expert for each expert i.
Further, in the step 5, the core experts are determined by using the cooperation of the experts belonging to the number of experts in the field, and a core expert set is constructed, specifically as follows:
step 5.1, inputting a preset value MinPts and the field belonging to the expert generated in the step 4;
step 5.2, calculating the total number of experts belonging to the field of the experts, and if the value is greater than a preset value MinPts, regarding the experts as core experts and putting the experts into a core expert set;
and 5.3, repeating the step 5.2 until all experts are traversed, and outputting a core expert set.
Further, the step 6 of obtaining all the collaboration group clusters in the expert collaboration network according to the core expert set specifically includes:
step 6.1, arbitrarily selecting a core expert in the core expert set, finding out all experts with reachable density and generating a cooperative group cluster of the experts; the density is defined by the expert as follows:
for expert i and expert j, if there is an expert sequence P1,P2,…,PnIn which P is1=i,PnJ, and Pm+1Is PmThe expert j is called as the density reachable expert of the expert i, wherein the density reachable expert is defined as follows:
if the expert j belongs to the field of the expert i, the expert j is a direct expert with the density of the expert i;
6.2, removing the experts with the reachable density found in the step 6.1 from the rest of the core experts;
step 6.3, repeating the step 6.1-6.2 from the updated core expert set until all the core experts are traversed or removed;
and 6.4, outputting the cooperative group cluster of the experts.
The invention is further described with reference to the following figures and detailed description.
Examples
With reference to fig. 1, the invention relates to a scientific research cooperative group discovery method based on density clustering, which comprises the following steps:
step 1, importing batch thesis data;
step 2, preprocessing the thesis data and extracting expert cooperation information;
step 3, constructing an expert cooperation relationship network by utilizing the expert cooperation information, and calculating the shortest path between the experts by utilizing a Dijkstra algorithm, wherein the method specifically comprises the following steps:
step 3.1, according to the list information of the authors of the thesis, the authors are used as nodes,connecting the authors of the co-written papers together by using edges, taking the reciprocal of the number of the co-written papers as the weight of the edges, constructing an undirected weighted graph G (V, E) of the expert cooperation information, and constructing the weight w of the edges between the nodes i and jijObtained by the following formula:
Figure BDA0002342188800000061
wherein V represents the point set of the undirected weighted graph of the expert cooperation information, E represents the edge set of the undirected weighted graph of the expert cooperation information, and Count represents the number of cooperative papers among experts.
Step 3.2, calculating the shortest path between the experts in the cooperative network G by utilizing a Dijkstra algorithm, and specifically comprising the following steps:
step 3.2.1, input expert cooperation information undirected weighted graph G (V, E), input target expert name as source point V0
Step 3.2.2, representing undirected weighted graph arcs [ m ] by adjacency matrix arcs][n]Representing edges<vm,vn>If there is no edge<vm,vn>Then arcs [ m ]][n]Infinity, where m, n ∈ { m | vm∈V};
Step 3.2.3, set S record the peak of the shortest path, make set S initial value as { v }0};
Step 3.2.4, set the array dist [ 2 ]]Recording from the source point v0To other respective vertex viCurrent shortest path length of, dist i]The initial value is arcs [0 ]][i]Where i ∈ { i | v [ ]i∈V};
Step 3.2.5, selecting V from vertex set V-SjSatisfies dist [ j]=Min{dist[i]|vi∈V-S},vjI.e. a currently determined one of the variables v0An end point of a starting shortest path; let set S ═ U { v } vj};
Step 3.2.6, modify the Slave Source Point v0To each vertex V in the set V-SkShortest path length of (2): if dist [ j ]]+arcs[j][k]<dist[k]Let dist [ k ]]=dist[j]+arcs[j][k];
Step 3.2.7, repeat step 3.2.5 and step 3.2.6 until the set V-S is an empty set;
step 3.2.8, output array dist [ ]]Wherein expert viAnd a target expert v0Has a shortest distance of dist [ i ]]。
Step 4, establishing expert cooperation of each expert in the field by using the shortest path between the experts, wherein the expert cooperation belongs to the field as follows:
step 4.1, inputting a preset value epsilon and the expert cooperative relationship network generated in the step 3;
step 4.2, traversing the expert set, and for each expert i, selecting all experts with the distance from the expert i not more than the belonged to the field as belonged to the field of the expert;
and 5, determining core experts by utilizing the cooperation of the experts belonging to the number of the experts in the field, and constructing a core expert set, wherein the method specifically comprises the following steps:
step 5.1, inputting a preset value MinPts and the field belonging to the expert generated in the step 4;
step 5.2, calculating the total number of experts belonging to the field of the experts, and if the value is greater than a preset value MinPts, regarding the experts as core experts and putting the experts into a core expert set;
and 5.3, repeating the step 5.2 until all experts are traversed, and outputting a core expert set.
Step 6, obtaining all the cooperative community clusters in the expert cooperative network according to the core expert set, and specifically comprising the following steps:
step 6.1, arbitrarily selecting a core expert in the core expert set, finding out all experts with reachable density and generating a cooperative group cluster of the experts; the density is defined by the expert as follows:
for expert i and expert j, if there is an expert sequence P1,P2,…,PnIn which P is1=i,PnJ, and Pm+1Is PmThe expert j is called as the density reachable expert of the expert i, wherein the density reachable expert is defined as follows:
if the expert j belongs to the field of the expert i, the expert j is a direct expert with the density of the expert i;
6.2, removing the experts with the reachable density found in the step 6.1 from the rest of the core experts;
step 6.3, repeating the steps 6.1-6.2 from the updated core expert set until all the core experts are traversed or removed;
and 6.4, outputting the cooperative group cluster of the experts.
The method can automatically dig out the scientific research team and the core of the scientific research team by utilizing the characteristic of good adaptability of density clustering, and has the advantages of simple method, high efficiency and high accuracy; the expert cooperation team and the core experts in a given paper set can be quickly found, and effective support is provided for subsequent literature measurement.

Claims (5)

1. A scientific research cooperative group discovery method based on density clustering is characterized by comprising the following steps:
step 1, importing batch thesis data, and taking the batch thesis data as a training set;
step 2, preprocessing the thesis data, and extracting expert cooperation information, namely an expert name set in the same thesis;
step 3, constructing an expert cooperation relationship network by utilizing expert cooperation information, and calculating the shortest path between the experts by utilizing a Dijkstra algorithm;
step 4, establishing expert cooperation of each expert in the domain by using the shortest path between the experts, wherein the expert cooperation belongs to the domain, and the radius of the domain is represented by the belonging to the domain;
step 5, determining core experts by utilizing the cooperation of the experts belonging to the number of the experts in the field, and constructing a core expert set;
and 6, acquiring all the cooperative community clusters in the expert cooperative network according to the core expert set.
2. The scientific research cooperative community finding method based on density clustering as claimed in claim 1, wherein the expert cooperation relationship network is constructed by using expert cooperation information in step 3, and the shortest path between an expert and an expert is calculated by using Dijkstra algorithm, specifically as follows:
step 3.1, according to the list information of the authors of the paper, the authors are used as nodes, the authors writing the paper together are connected by using edges, the reciprocal of the number of the written papers together is used as the weight of the edges, an undirected weighted graph G (V, E) of the expert cooperation information is constructed, and the weight w of the edges between the node i and the node j is calculatedijObtained by the following formula:
Figure FDA0002342188790000011
v represents a point set of the undirected weighted graph of the expert cooperation information, E represents an edge set of the undirected weighted graph of the expert cooperation information, and Count represents the number of cooperation papers among experts;
step 3.2, calculating the shortest path between the expert and the expert in the undirected weighted graph G of the expert cooperation information by utilizing a Dijkstra algorithm, and specifically comprising the following steps:
step 3.2.1, input expert cooperation information undirected weighted graph G ═ V, E, input target expert name as source point V0
Step 3.2.2, representing undirected weighted graph arcs [ m ] by adjacency matrix arcs][n]Representing edges<vm,vn>If there is no edge<vm,vn>Then arcs [ m ]][n]Infinity, where m, n ∈ { m | vm∈V};
Step 3.2.3, set S record the peak of the shortest path, make set S initial value as { v }0};
Step 3.2.4, set the array dist [ 2 ]]Recording from the source point v0To other respective vertex viCurrent shortest path length of, dist i]The initial value is arcs [0 ]][i]Where i ∈ { i | v [ ]i∈V};
Step 3.2.5, selecting V from vertex set V-SjSatisfies dist [ j]=Min{dist[i]|vi∈V-S},vjI.e. a currently determined one of the variables v0The end point of the starting shortest path is that the set S is S ═ S { U { v }j};
Step 3.2.6, modify the Slave Source Point v0To each vertex V in the set V-SkShortest path length of (2): if dist [ j ]]+arcs[j][k]<dist[k]Let dist [ k ]]=dist[j]+arcs[j][k];
Step 3.2.7, repeating the steps 3.2.5-3.2.6 until the set V-S is an empty set;
step 3.2.8, output array dist [ ]]Wherein the expert points viAnd a target expert source point v0Has a shortest distance of dist [ i ]]。
3. The scientific research cooperative community finding method based on density clustering as claimed in claim 1, wherein the expert cooperation e field of each expert is constructed by using the shortest path between the experts in step 4, and the method is as follows:
step 4.1, inputting a preset value epsilon and the expert cooperative relationship network generated in the step 3;
and 4.2, traversing the expert set, and selecting all experts with the distance not more than the e from the expert i as the field of the e of the expert for each expert i.
4. The scientific research cooperative community finding method based on density clustering as claimed in claim 1, wherein the core experts are determined by the number of experts in the expert cooperation E field in step 5, and a core expert set is constructed, specifically as follows:
step 5.1, inputting a preset value MinPts and the field belonging to the expert generated in the step 4;
step 5.2, calculating the total number of experts belonging to the field of the experts, and if the value is greater than a preset value MinPts, regarding the experts as core experts and putting the experts into a core expert set;
and 5.3, repeating the step 5.2 until all experts are traversed, and outputting a core expert set.
5. The scientific research cooperative community finding method based on density clustering as claimed in claim 1, wherein all cooperative community clusters in the expert cooperative network are obtained according to the core expert set in step 6, specifically as follows:
step 6.1, arbitrarily selecting a core expert in the core expert set, finding out all experts with reachable density and generating a cooperative group cluster of the experts; the density is defined by the expert as follows:
for expert i and expert j, if there is an expert sequence P1,P2,…,PnIn which P is1=i,PnJ, and Pm+1Is PmThe expert j is called as the density reachable expert of the expert i, wherein the density reachable expert is defined as follows:
if the expert j belongs to the field of the expert i, the expert j is a direct expert with the density of the expert i;
6.2, removing the experts with the reachable density found in the step 6.1 from the rest of the core experts;
step 6.3, repeating the step 6.1-6.2 from the updated core expert set until all the core experts are traversed or removed;
and 6.4, outputting the cooperative group cluster of the experts.
CN201911380838.7A 2019-12-27 2019-12-27 Scientific research cooperative group discovery method based on density clustering Pending CN113051410A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911380838.7A CN113051410A (en) 2019-12-27 2019-12-27 Scientific research cooperative group discovery method based on density clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911380838.7A CN113051410A (en) 2019-12-27 2019-12-27 Scientific research cooperative group discovery method based on density clustering

Publications (1)

Publication Number Publication Date
CN113051410A true CN113051410A (en) 2021-06-29

Family

ID=76506906

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911380838.7A Pending CN113051410A (en) 2019-12-27 2019-12-27 Scientific research cooperative group discovery method based on density clustering

Country Status (1)

Country Link
CN (1) CN113051410A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115630141A (en) * 2022-11-11 2023-01-20 杭州电子科技大学 Scientific and technological expert retrieval method based on community query and high-dimensional vector retrieval

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105825672A (en) * 2016-04-11 2016-08-03 中山大学 City guidance area extraction method based on floating car data
CN109214462A (en) * 2018-09-25 2019-01-15 东北大学 A kind of spatial data on-line talking method based on distributed incremental DBSCAN algorithm
CN109308315A (en) * 2018-10-19 2019-02-05 南京理工大学 A kind of collaborative recommendation method based on specialist field similarity and incidence relation
CN109492098A (en) * 2018-10-24 2019-03-19 北京工业大学 Target corpus base construction method based on Active Learning and semantic density

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105825672A (en) * 2016-04-11 2016-08-03 中山大学 City guidance area extraction method based on floating car data
CN109214462A (en) * 2018-09-25 2019-01-15 东北大学 A kind of spatial data on-line talking method based on distributed incremental DBSCAN algorithm
CN109308315A (en) * 2018-10-19 2019-02-05 南京理工大学 A kind of collaborative recommendation method based on specialist field similarity and incidence relation
CN109492098A (en) * 2018-10-24 2019-03-19 北京工业大学 Target corpus base construction method based on Active Learning and semantic density

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张占峰等: "MapReduce框架下常用聚类算法比较研究", 《河北省科学院学报》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115630141A (en) * 2022-11-11 2023-01-20 杭州电子科技大学 Scientific and technological expert retrieval method based on community query and high-dimensional vector retrieval
CN115630141B (en) * 2022-11-11 2023-04-25 杭州电子科技大学 Scientific and technological expert retrieval method based on community query and high-dimensional vector retrieval

Similar Documents

Publication Publication Date Title
CN109697233A (en) A kind of knowledge mapping system building method
CN105677648B (en) A kind of Combo discovering method and system based on label propagation algorithm
US20090193341A1 (en) Visualization of meeting invitee status as a method of collaboration
CN104317789A (en) Method for building passenger social network
CN112990385B (en) Active crowdsourcing image learning method based on semi-supervised variational self-encoder
CN107391542A (en) A kind of open source software community expert recommendation method based on document knowledge collection of illustrative plates
CN110263236A (en) Social network user multi-tag classification method based on dynamic multi-view learning model
CN107122786A (en) A kind of mass-rent learning method and device
CN109800354A (en) A kind of resume modification intension recognizing method and system based on the storage of block chain
CN113051410A (en) Scientific research cooperative group discovery method based on density clustering
CN113361928B (en) Crowd-sourced task recommendation method based on heterogram attention network
CN107067727B (en) A kind of road traffic service level evaluation method based on fuzzy KNN characteristic matching
CN109885797B (en) Relational network construction method based on multi-identity space mapping
CN107818328A (en) With reference to the deficiency of data similitude depicting method of local message
CN117217634B (en) Enterprise cooperation community discovery method based on complex network
CN117744882A (en) Enterprise invoice quantity prediction method, device, equipment and storage medium
CN111414719B (en) Method and device for extracting peripheral features of subway station and estimating traffic demand
CN111353625A (en) Method, device, equipment and storage medium for predicting dot part quantity
CN110991803B (en) Enterprise actual controller supporting distributed computing and control path computing method
CN113570118A (en) Workshop scheduling and analyzing method based on scheduling rule
Smolander et al. Cell-connectivity-guided trajectory inference from single-cell data
CN117236465A (en) Information entropy-based federal decision tree information measurement method
CN114240539B (en) Commodity recommendation method based on Tucker decomposition and knowledge graph
JP2010079325A (en) Model construction method, construction system, and program for construction
CN115374940A (en) Risk label determination method and device based on knowledge graph

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210629