CN112651228A - Method for identifying crowdsourcing design effective participants in internet technology community - Google Patents

Method for identifying crowdsourcing design effective participants in internet technology community Download PDF

Info

Publication number
CN112651228A
CN112651228A CN202011531547.6A CN202011531547A CN112651228A CN 112651228 A CN112651228 A CN 112651228A CN 202011531547 A CN202011531547 A CN 202011531547A CN 112651228 A CN112651228 A CN 112651228A
Authority
CN
China
Prior art keywords
user
design
article
relevance
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011531547.6A
Other languages
Chinese (zh)
Inventor
梁若愚
郭伟
张凌浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangnan University
Original Assignee
Jiangnan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangnan University filed Critical Jiangnan University
Priority to CN202011531547.6A priority Critical patent/CN112651228A/en
Publication of CN112651228A publication Critical patent/CN112651228A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/338Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Economics (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Computing Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of internet technology, in particular to a method for identifying crowdsourcing design effective participants in an internet technology community, which can eliminate useless information, accurately screen effective users with design innovation capacity, recruit high-quality design participants and simultaneously have a perfect evaluation index system for the crowdsourcing design effective participants; the method comprises the following steps: s1, acquiring contribution degree information of technical community users; s2, acquiring each article issued by the user and corresponding evaluation parameters according to the technical community user information; s3, designing relevance according to article content statistics issued by a user; s4, calculating the weight values of the user contribution degree information, the article evaluation parameters and the article content design relevance through a coefficient of variation method; and S5, counting the characteristic value of each user according to the contribution degree information of each user, the article evaluation parameters and the relevance degree of the comparison sequence of article content design relevance and the reference sequence.

Description

Method for identifying crowdsourcing design effective participants in internet technology community
Technical Field
The invention relates to the technical field of internet technology, in particular to a method for identifying crowdsourcing design effective participants in an internet technology community.
Background
Online crowdsourcing is a new business model that is created in 2006 and aims to create economic value by distributing complex work to the public through the internet in a collaborative manner after the complex work is disassembled. The crowdsourcing design mode is a method of subpackaging part or all of design work to organizations or individuals with corresponding capabilities and resources through an internet platform, and factors such as the scale of the overall design work, a task decomposition mode, capability requirements, participating crowd categories/roles, participant management and motivation, a task package sending mode, acceptance criteria/modes, cost and the like need to be comprehensively considered. The premise for implementing the crowdsourcing design is to locate and recruit user individuals with design-related capabilities, and the internet of the individuals is aggregated by various technical communities such as station cool, CSDN forum, Chinese mechanical community, UI China and the like.
In recent years, more and more enterprises begin to conduct product development activities by means of a crowdsourcing design mode to cope with the trend of increasingly diversified market demands, however, in the concrete implementation process of the mode, the inventor finds that at least the following problems exist in the prior art:
(1) professional crowdsourcing design service platforms operated by enterprises/third parties are usually low in popularity and weak in public influence, and design participants with sufficient quality are difficult to recruit;
(2) the internet technology community comprises a large number of invalid users such as advertisement publishers and water filling persons, and crowdsourcing design activity organizers lack strategies and methods capable of accurately identifying valid users with design innovation capacity;
(3) there is a lack of an evaluation index system for the active participants of crowd-sourced designs.
Disclosure of Invention
In order to solve the technical problems, the invention provides a method for identifying crowdsourcing design effective participants in an internet technology community, which can eliminate useless information, accurately identify effective users with design innovation capacity, recruit high-quality design participants and simultaneously has a perfect evaluation index system for the crowdsourcing design effective participants.
The invention discloses a method for identifying crowdsourcing design effective participants in an internet technology community, which comprises the following steps of:
s1, acquiring contribution degree information of technical community users;
s2, acquiring each article issued by the user and corresponding evaluation parameters according to the technical community user information;
s3, designing relevance according to article content statistics issued by a user;
s4, calculating the weight values of the user contribution degree information, the article evaluation parameters and the article content design relevance through a coefficient of variation method;
s5, according to the contribution degree information of each user, the article evaluation parameters and the relevance degree of the comparison sequence of article content design relevance and the reference sequence, counting the characteristic value of each user;
and S6, selecting the users with the characteristic values exceeding the preset relevance threshold value as the effective participants of the crowd-sourced design.
In the method for identifying crowdsourced design effective participants in the internet technology community, in the step S4, the user points, the post forwarding number, the post comment number, the product structure/function/appearance related words in the text content contributed by the user, the technology related words, the design related words, the effective length of the contributed content, and the contribution content timeliness are used as the indexes for evaluating the effectiveness of the participants, and the importance weight of each evaluation index is calculated by using a coefficient of variation method.
The invention discloses a method for identifying crowdsourcing design effective participants in an internet technology community, which comprises the following steps of:
the user integral, the post forwarding number and the post comment number are statistical data directly taken from an internet technology community;
"product structure/function/appearance related vocabulary in user-contributed text content", "technology related vocabulary", "design related vocabulary" are the number of corresponding vocabulary appearing in the article issued by the user;
"effective length of contribution content":
Figure BDA0002852244100000031
wherein N isa、Nb、Nc、NdRespectively representing the vocabulary number, the product structure/function/appearance related vocabulary number, the technology related vocabulary number and the design related vocabulary number in the article issued by the user;
"contribution content timeliness": an article published within the last two months is denoted 2 and one published two months ago is denoted 1.
The invention discloses a method for identifying crowdsourcing design effective participants in an internet technology community, which comprises the following steps of:
Figure BDA0002852244100000032
wherein i is 1,2, …, n; viIs the coefficient of variation of the i index, i.e. the coefficient of standard deviation; sigmaiIs the standard deviation of the i index;
Figure BDA0002852244100000033
is the average of the i index.
The invention discloses a method for identifying crowdsourcing design effective participants in an internet technology community, which comprises the following steps of:
Figure BDA0002852244100000034
the invention discloses a method for identifying crowdsourcing design effective participants in an internet technology community.
According to the method for identifying the active participants of the crowdsourcing design in the Internet technology community, the threshold value can be modified according to the requirements of the crowdsourcing design.
Compared with the prior art, the invention has the beneficial effects that: useless information can be eliminated, effective users with design innovation capability can be accurately screened, high-quality design participants are recruited, and meanwhile, the evaluation index system for crowdsourcing design effective participants is complete.
Drawings
FIG. 1 is a logic flow diagram of the present invention.
Detailed Description
The following detailed description of embodiments of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.
The method comprises the steps of obtaining grade information of technical community users through a data mining technology (the invention is not embodied), wherein the grade information comprises points, contribution values, grades and the like, contribution contents such as postings, postbacks and other texts, and comment numbers, forwarding numbers and the like below the postings; the method comprises the following steps of taking user points, post forwarding numbers, post comment numbers, product structure/function/appearance related words in text contents contributed by users, technology related words, design related words, effective length of contribution contents and contribution content timeliness as indexes for evaluating the effectiveness of participants, wherein the statistical calculation method of each index comprises the following steps:
the user integral, the post forwarding number and the post comment number are statistical data directly taken from an internet technology community;
"product structure/function/appearance related vocabulary in user-contributed text content", "technology related vocabulary", "design related vocabulary" are the number of corresponding vocabulary appearing in the article issued by the user;
"effective length of contribution content":
Figure BDA0002852244100000041
wherein N isa、Nb、Nc、NdRespectively representing the vocabulary number, the product structure/function/appearance related vocabulary number, the technology related vocabulary number and the design related vocabulary number in the article issued by the user;
"contribution content timeliness": an article published within the last two months is denoted 2 and one published two months ago is denoted 1.
Calculating the importance weight of each evaluation index by using a coefficient of variation method, and the method comprises the following steps:
1) the coefficient of variation formula of each index is as follows:
Figure BDA0002852244100000042
in the formula ViIs the coefficient of variation of the i-th index, also known as the coefficient of standard deviation; sigmaiIs the standard deviation of the i index;
Figure BDA0002852244100000043
is the average of the i index.
2) The weight of each index is as follows:
Figure BDA0002852244100000051
selecting the optimal values of all indexes in the research sample to form a reference sequence; and calculating the association degree between the candidate user index sequence and the reference sequence by utilizing gray association analysis, wherein the gray association analysis calculation is specifically as follows:
step 1, carrying out dimensionless processing on original data:
Xk(i) and y (i) respectively represent comparison sequences and reference sequences, k is 1,2, …, m, i is 1,2, …, n, m represents the number of users to be sorted, and n represents the number of indexes; the original data is normalized by using an averaging method, and the formula is as follows:
Figure BDA0002852244100000052
Figure BDA0002852244100000053
in the formula: x is the number ofkiThe quantized value of the ith index of the kth target user; x is the number ofiThe average value of the ith index in the user set to be ranked is obtained; y isiIs the quantized value of the ith index in the reference sequence; it should be noted that, in the subsequent calculation process, some users whose index values are the same as the corresponding index values in the reference sequence need to be removed from the data normalized by the averaging method, so as to avoid generating invalid results;
step 2, calculating a grey correlation coefficient;
the calculation formula of the gray correlation coefficient is as follows:
Figure BDA0002852244100000054
in the formula:
Figure BDA0002852244100000055
Δ3=|Y'(i)-X'k(i) i, where M ═ 1, 2.., M }, N ═ 1, 2.., N }; zeta is the resolution factor at [0,1 ]]The interval value is usually 0.5 ζ;
step 3, calculating grey correlation;
the grey correlation is calculated by adopting a weighting method, and the formula is as follows:
Figure BDA0002852244100000061
in the formula: w is aiIs an index weighted value; sorting according to the relevance of each user to obtain user validity sorting; the relevance value is higher than a preset threshold value, namely crowdsourcingCounting the active participants.
The above description is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, several modifications and variations can be made without departing from the technical principle of the present invention, and these modifications and variations should also be regarded as the protection scope of the present invention.

Claims (7)

1. A method for identifying crowdsourced design active participants in an Internet technology community is characterized by comprising the following steps:
s1, acquiring contribution degree information of technical community users;
s2, acquiring each article issued by the user and corresponding evaluation parameters according to the technical community user information;
s3, designing relevance according to article content statistics issued by a user;
s4, calculating the weight values of the user contribution degree information, the article evaluation parameters and the article content design relevance through a coefficient of variation method;
s5, according to the contribution degree information of each user, the article evaluation parameters and the relevance degree of the comparison sequence of article content design relevance and the reference sequence, counting the characteristic value of each user;
and S6, selecting the users with the characteristic values exceeding the preset relevance threshold value as the effective participants of the crowd-sourced design.
2. The method as claimed in claim 1, wherein in S4, the "user score", "post forwarding number", "post comment number", "product structure/function/appearance related vocabulary in text content contributed by user", "technology related vocabulary", "design related vocabulary", "effective length of contribution content", "contribution content timeliness" are used as indicators for evaluating the effectiveness of participants, and the importance weight of each evaluation indicator is calculated by using a coefficient of variation method.
3. The method of claim 2, wherein the statistical calculation of the metrics comprises:
the user integral, the post forwarding number and the post comment number are statistical data directly taken from an internet technology community;
"product structure/function/appearance related vocabulary in user-contributed text content", "technology related vocabulary", "design related vocabulary" are the number of corresponding vocabulary appearing in the article issued by the user;
"effective length of contribution content":
Figure FDA0002852244090000011
wherein N isa、Nb、Nc、NdRespectively representing the vocabulary number, the product structure/function/appearance related vocabulary number, the technology related vocabulary number and the design related vocabulary number in the article issued by the user;
"contribution content timeliness": an article published within the last two months is denoted 2 and one published two months ago is denoted 1.
4. The method of claim 3, wherein the coefficient of variation formula of each index is as follows:
Figure FDA0002852244090000021
wherein i is 1,2, …, n; viIs the coefficient of variation of the i index, i.e. the coefficient of standard deviation; sigmaiIs the standard deviation of the i index;
Figure FDA0002852244090000022
is the average of the i index.
5. The method of claim 4, wherein the weight of each index is calculated by the formula:
Figure FDA0002852244090000023
6. the method as claimed in claim 5, wherein the method selects the optimal value of each index in the research sample to form a reference sequence, and calculates the association degree between the candidate user index sequence and the reference sequence by using gray association analysis, and if the association degree is higher than a preset threshold value, it is the valid participant of the crowd-sourced design.
7. The method of claim 1, wherein the threshold is modified according to requirements of a crowdsourcing design.
CN202011531547.6A 2020-12-22 2020-12-22 Method for identifying crowdsourcing design effective participants in internet technology community Pending CN112651228A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011531547.6A CN112651228A (en) 2020-12-22 2020-12-22 Method for identifying crowdsourcing design effective participants in internet technology community

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011531547.6A CN112651228A (en) 2020-12-22 2020-12-22 Method for identifying crowdsourcing design effective participants in internet technology community

Publications (1)

Publication Number Publication Date
CN112651228A true CN112651228A (en) 2021-04-13

Family

ID=75359181

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011531547.6A Pending CN112651228A (en) 2020-12-22 2020-12-22 Method for identifying crowdsourcing design effective participants in internet technology community

Country Status (1)

Country Link
CN (1) CN112651228A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103810169A (en) * 2012-11-06 2014-05-21 腾讯科技(深圳)有限公司 Method and device for detecting community domain experts
CN107958317A (en) * 2016-10-17 2018-04-24 腾讯科技(深圳)有限公司 A kind of method and apparatus that crowdsourcing participant is chosen in crowdsourcing project
CN109886581A (en) * 2019-02-25 2019-06-14 天津工业大学 Participant's selection method based on the quality of data in a kind of mobile gunz perception task
CN110297990A (en) * 2019-05-23 2019-10-01 东南大学 The associated detecting method and system of crowdsourcing marketing microblogging and waterborne troops

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103810169A (en) * 2012-11-06 2014-05-21 腾讯科技(深圳)有限公司 Method and device for detecting community domain experts
CN107958317A (en) * 2016-10-17 2018-04-24 腾讯科技(深圳)有限公司 A kind of method and apparatus that crowdsourcing participant is chosen in crowdsourcing project
CN109886581A (en) * 2019-02-25 2019-06-14 天津工业大学 Participant's selection method based on the quality of data in a kind of mobile gunz perception task
CN110297990A (en) * 2019-05-23 2019-10-01 东南大学 The associated detecting method and system of crowdsourcing marketing microblogging and waterborne troops

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
方鹏程: "用户贡献内容质量评价研究", 中国硕士学位论文全文数据库 信息科技辑, vol. 978, no. 10, pages 12 *
梁若愚: "在线开放创新社区环境下的领先用户识别方法研究", 机械设计, vol. 36, no. 10, pages 3 *

Similar Documents

Publication Publication Date Title
Shriver et al. Social ties and user-generated content: Evidence from an online social network
CN111984873B (en) Service recommendation system and method
CN108322317A (en) A kind of account identification correlating method and server
Guan et al. Early detection of COVID-19 outbreaks using human mobility data
CN106408325A (en) User consumption behavior prediction analysis method based on user payment information and system
CN109949174A (en) A kind of isomery social network user entity anchor chain connects recognition methods
CN106021622A (en) Information quantitative analysis system based on big data
Peng et al. Predictive analytics for University of Virginia football recruiting
Shi et al. A generalized count model on customers' purchases in O2O market
Dong et al. The impacts of live chat on service–product purchase: Evidence from a large online outsourcing platform
CN110633410A (en) Information processing method and device, storage medium, and electronic device
CN103744958B (en) A kind of Web page classification method based on Distributed Calculation
CN108509588B (en) Lawyer evaluation method and recommendation method based on big data
CN113592197A (en) Household service recommendation system and method
CN113821827A (en) Joint modeling method and device for protecting multi-party data privacy
CN111221915B (en) Online learning resource quality analysis method based on CWK-means
CN117172795A (en) Intelligent technical service fee online consultation system
CN112836137A (en) Person network support degree calculation system and method, terminal, device, and storage medium
CN112651228A (en) Method for identifying crowdsourcing design effective participants in internet technology community
CN110197196A (en) Question processing method, device, electronic equipment and storage medium
CN108805458A (en) A kind of enterprise technology Competitiveness Assessment method and device
CN113793024A (en) Bidding method and evaluation system based on behavioral portrait of bidding participant
CN109559169B (en) Method for identifying sharp users based on online user scoring
CN113918558A (en) Supplier close relation identification method based on community discovery and association rule analysis
TWI722327B (en) Audio-visual content and user interaction sequence analysis system and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination