KR101772199B1 - System for knowledge verification based on crowdsourcing - Google Patents
System for knowledge verification based on crowdsourcing Download PDFInfo
- Publication number
- KR101772199B1 KR101772199B1 KR1020150152389A KR20150152389A KR101772199B1 KR 101772199 B1 KR101772199 B1 KR 101772199B1 KR 1020150152389 A KR1020150152389 A KR 1020150152389A KR 20150152389 A KR20150152389 A KR 20150152389A KR 101772199 B1 KR101772199 B1 KR 101772199B1
- Authority
- KR
- South Korea
- Prior art keywords
- data
- knowledge
- reliability
- completion
- evaluation
- Prior art date
Links
- 238000012795 verification Methods 0.000 title abstract description 18
- 238000011156 evaluation Methods 0.000 claims abstract description 66
- 230000005540 biological transmission Effects 0.000 claims abstract description 9
- 238000000926 separation method Methods 0.000 claims description 4
- 238000012358 sourcing Methods 0.000 abstract description 22
- 238000000034 method Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 238000012946 outsourcing Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Images
Classifications
-
- G06F17/30303—
-
- G06F17/30289—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
-
- G06Q50/30—
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Primary Health Care (AREA)
- Computing Systems (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Health & Medical Sciences (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Operations Research (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention relates to a crowd sourcing-based knowledge verification system capable of determining the reliability of knowledge data based on crowd sourcing and building a knowledge database accordingly.
Specifically, the knowledge data is extracted through an uncertain knowledge database in which uncertain knowledge data collected through a knowledge collection unit is stored and an answer knowledge database in which validated correct knowledge data is stored, and an evaluation set is generated to generate an evaluation set in which the knowledge data is combined part; An evaluation set transmission unit for transmitting the evaluation set to an evaluation set providing server; A completion set collection unit for collecting a completion set from the evaluation set providing server; A completion set determiner for determining reliability of the completion set; And a data determination unit that transmits the completion set to the correct answer knowledge database when the reliability of the completion set transmitted from the completion set verification unit satisfies a set reference value.
Thus, the present invention provides a system whereby the participant can evaluate the evaluation set via the evaluation set provision server and determine the evaluated completion set to store the verified data in the correct knowledge database.
Description
The present invention relates to a crowd sourcing-based knowledge verification system capable of determining the reliability of knowledge data based on crowd sourcing and building a knowledge database accordingly.
Specifically, the knowledge data is extracted through an uncertain knowledge database in which uncertain knowledge data collected through a knowledge collection unit is stored and an answer knowledge database in which validated correct knowledge data is stored, and an evaluation set is generated to generate an evaluation set in which the knowledge data is combined part; An evaluation set transmission unit for transmitting the evaluation set to an evaluation set providing server; A completion set collection unit for collecting a completion set from the evaluation set providing server; A completion set determiner for determining reliability of the completion set; And a data determination unit that transmits the completion set to the correct answer knowledge database when the reliability of the completion set transmitted from the completion set verification unit satisfies a set reference value.
Thus, the present invention provides a system whereby the participant can evaluate the evaluation set through the evaluation set provision server and determine the evaluated completion set to store the verified data in the correct knowledge database.
Generally, a knowledge service creates knowledge data through natural language processing of various documents and information existing on the Internet, and builds a knowledge database by storing such knowledge data as a knowledge database. Based on this knowledge database, the user can receive the necessary knowledge service.
Recently, knowledge of various fields has been explosively increased, and the amount of knowledge data accumulated in a knowledge database is increasing. In particular, as users who use knowledge services seek various knowledge and desire to share non-professional knowledge such as their own experience, wisdom, know-how, etc., Knowledge data is generated, and websites based on the generated knowledge data are attracting attention.
However, since it is difficult to judge the reliability of the generated knowledge data, the user can not confirm whether or not the knowledge is correct. Therefore, in order to solve such a problem, Japanese Patent Application Laid-Open No. 10-2010-0066642 discloses a knowledge information providing system and method (hereinafter referred to as "
The above technique can judge the accuracy and reliability of the knowledge (question) through the expert, but it is difficult for the expert to judge a vast amount of knowledge, and even if the expert sends the wrong answer to the questioner, Therefore, there is a concern that the system is data-ized because it is satisfied with the wrong answer.
Therefore, it is necessary to incorporate crowd sourcing so that a vast amount of knowledge can be judged.
Crowd sourcing is a combination of crowd and outsourcing, which means engaging the public in some of the activities of the company, such as production and services. Through crowd sourcing, many users can judge the reliability of knowledge data. The reliability of the knowledge data can be verified.
In the meantime, as described above, the reliability is determined through the participation of the user and the determined knowledge data can be accumulated in the knowledge database. In the registration patent publication No. 10-0756382, Prior art 2 ').
In the
However, since the
Therefore, it is necessary to develop a system that can grasp the accuracy and reliability of the knowledge generated by the user, and can verify the knowledge generated in the system without expert judgment.
Furthermore, it is required to develop a technology that can verify knowledge through crowd sourcing and store the proven knowledge in a knowledge database.
It is an object of the present invention to provide a crowd sourcing-based knowledge verification system that can verify and verify the reliability of unverified knowledge data.
Another object of the present invention is to provide a crowd sourcing-based knowledge verification system capable of quickly determining a vast amount of knowledge data by using crowd sourcing and constructing it as a knowledge database.
According to an aspect of the present invention, there is provided a crowd sourcing-based knowledge verification system,
A non-deterministic knowledge database storing uncertain knowledge data collected through a knowledge collection unit; A correct answer knowledge database stored with verified correct answer knowledge data; An evaluation set generation unit for generating an evaluation set in which the undetermined knowledge data and the correct answer knowledge data are combined; An evaluation set transmission unit for transmitting the evaluation set to an evaluation set providing server; A completion set collection unit for collecting a completion set from the evaluation set providing server; A completion set determiner for determining reliability of the completion set; And a data determination unit that transmits the completion set to the correct answer knowledge database when the reliability of the completion set transmitted from the completion set verification unit satisfies a set reference value.
The crowd sourcing-based knowledge verification system according to the present invention has a remarkable effect of judging the reliability of the unverified knowledge data and verifying the reliability thereof.
Further, the present invention has a remarkable effect of rapidly performing knowledge data determination and knowledge database construction using crow sourcing.
Further, the present invention can remarkably determine the reliability of the knowledge data as compared with the expert group, and has a remarkable effect with high accuracy in the reliability calculation.
1 shows a configuration of a crowd sourcing-based knowledge verification system according to the present invention.
Figure 2 shows knowledge data according to an exemplary embodiment of the present invention.
Figure 3 shows an evaluation set and a completion set in accordance with an exemplary embodiment of the present invention.
4 shows a first embodiment of the evaluation set verifying section of the present invention.
5 shows a second embodiment of the evaluation set verifying section of the present invention.
FIG. 6 illustrates an example of the degree of difficulty of the difficulty determination module according to an exemplary embodiment of the present invention.
The terms and words used in the present specification and claims should not be construed as limited to ordinary or dictionary meanings and the inventor can properly define the concept of the term to describe its invention in the best possible way And should be construed in accordance with the principles and meanings and concepts consistent with the technical idea of the present invention.
Therefore, the embodiments described in the present specification and the configurations shown in the drawings are merely the most preferred embodiments of the present invention and are not intended to represent all of the technical ideas of the present invention. Therefore, various equivalents And variations are possible.
Before describing the present invention with reference to the accompanying drawings, it should be noted that the present invention is not described or specifically described with respect to a known configuration that can be easily added by a person skilled in the art, Let the sound be revealed.
The present invention relates to a crowd sourcing-based knowledge verification system capable of determining the reliability of knowledge data based on crowd sourcing and building a knowledge database accordingly.
Such a crowd sourcing-based knowledge verification system is illustrated in FIGS. 1 to 6 of the accompanying drawings.
Figure 1 illustrates the configuration of a crowd sourcing-based knowledge validation system of the present invention, Figure 2 illustrates knowledge data in accordance with an exemplary embodiment of the present invention, Figure 3 depicts an exemplary set of evaluation sets, Figure 4 shows a first embodiment of the evaluation set verifier of the present invention, Figure 5 shows a second embodiment of the evaluation set verifier of the present invention, Figure 6 shows an example of the present invention An example of determining the degree of difficulty of the difficulty level determination module according to the embodiment is shown.
The knowledge database is divided into an uncertain knowledge database (10) and a correct knowledge database (20).
The
The
A wiki is a website that allows users to easily add, edit, and delete content through a simple markup language using a web browser on a device with Internet access.
The correct
As shown in FIG. 2 (b) of the accompanying drawings, the correct
The evaluation
The crowd sourcing-based
The evaluation set
The evaluation
At this time, the
When all (M + N) pieces of knowledge data have been evaluated, the evaluation set 113 evaluated by the plurality of participants in the evaluation set providing
3B, the completion set 131 determines whether (M + N) pieces of knowledge data have been judged to be true or false by the participant through the selection
The completion set 131 as described above is collected in the completion
The completion set
The completion set
The first data is knowledge data extracted from the correct
The second data is knowledge data extracted from the
This is because the reliability of the first data can be determined through the
The first
Since the first data is the correct
Accordingly, the reliability of the first data can be evaluated through CA and IA of the first data, and a method of evaluating the reliability is as follows.
In this case, M means the number of first data, CA (Correct Answer) means the number of knowledge data processed as correct answer, and IA (Incorrect Answer) means the number of knowledge data processed as wrong answer.
Therefore, the reliability of the first data is as follows.
Here, S denotes the reliability of the first data.
The second
This is because it is possible to estimate through the first data evaluated by the participant how much the participant has knowledge verification ability and to estimate the reliability through Equation (2) so that the second data is also equal to the reliability of the first data .
Therefore, the reliability value of the second data, that is, the
At this time, the same
However, even if the reliability of the first data calculated through Equation (2) is estimated to be the same as the reliability of the second data, there is a concern that the reliability of the first data and the second data may be determined through the differences therebetween.
For example, the evaluation set
Accordingly, the completion set
The degree-of-
At this time, the sum of the degree of difficulty of the first data is equal to the sum of the degree of difficulty of the corrected first data and the sum of the degree of difficulty of the erroneously processed first data. Therefore, the reliability value for the second data is as follows.
Here, S denotes the reliability value of the second data,
Represents the sum of difficulty levels of the first data processed correctly, Represents the degree of difficulty of the first data.Therefore, an accurate reliability value can be calculated by evaluating the reliability in consideration of the degree of difficulty.
However, consideration should be given to cases in which the number of evaluations of arbitrarily selected i-th knowledge is smaller than Z and large cases. At this time, Z can be arbitrarily set by the manager as " the number of times of selection as a criterion for ensuring validity of the uncertain knowledge data ". In other words, the reliability is less accurate as the knowledge data is less frequently evaluated by the participant.
Therefore, if the number of evaluations (hereinafter referred to as C) for the i-th knowledge is less than Z, it is impossible to evaluate the degree of difficulty of knowledge, and the intermediate value of 0.5 is determined as the initial difficulty value. In this case, the difficulty values to be judged are expressed by Equations (4) and (5).
At this time, W represents the degree of difficulty, A represents the number of times judged to be true for the specific knowledge, and B represents the number of times judged to be false for the specific knowledge.
The
Advantageously, the correct
The reference value includes a first reference value based on the reliability and a second reference value based on the number of participants participating in the evaluation, and is a basis for determining whether the second data can be stored in the
The first and second reference values may be set in advance by the administrator, and accordingly, the first reference value is a reference reliability value determined by the manager, and the second reference value is the number of evaluation participants determined by the administrator.
At this time, the reliability value of the second data moving to the correct
On the other hand, if the reference value is not satisfied, it is transmitted to the evaluation set
According to the present invention, the reliability of the unverified knowledge data can be determined and verified through the above-described configuration and the embodiments, and it is possible to quickly perform the knowledge data determination and knowledge database construction using the crowd sourcing do.
Further, the present invention can remarkably determine the reliability of the knowledge data as compared with the expert group, and has a remarkable effect with high accuracy in the reliability calculation.
1 to 6 describe only the main points of the present invention, and the present invention is not limited to the configurations of Figs. 1 to 6, as various designs can be made within the technical scope thereof. It is self-evident.
10: Undecided knowledge database 11: Knowledge collection unit
20: Correct answer database 30: Evaluation set providing server
100: Crowd sourcing-based knowledge verification system
110: Evaluation set generation unit 111: Undecided knowledge data
112: Correct answer knowledge data 113: Evaluation set
113a: Selection window 120: Evaluation set transmission unit
130: Completed set collecting part 131: Completed set
131a: Selection result display column 140: Completion set determination unit
141: Completion set separation module 142: First reliability determination module
143: second reliability determination module 144: difficulty determination module
145: third reliability determination module 150:
Claims (7)
An evaluation set transmission unit for transmitting the evaluation set to an evaluation set providing server;
A completion set collection unit for collecting a completion set from the evaluation set providing server;
A completion set determiner for determining reliability of the completion set; And
And transmits the completion set to the correct answer knowledge database when the reliability of the completion set transmitted from the completion set determination unit satisfies the set reference value and transmits the completion set to the evaluation set transmission unit if the reliability is not satisfied ≪ / RTI >
The evaluation set providing server includes:
Providing the evaluation set to the participants, sending the completed set evaluated from the participants to the completed set collector,
Wherein the completion set determination unit includes:
And a completion set separation module that separates the complete set into first data and second data,
Wherein the first data is extracted from the correct answer knowledge database,
Wherein the second data is extracted from the uncertain knowledge database,
Wherein the completion set determination unit includes:
A first reliability determination module that determines reliability of the first data;
A difficulty level determination module for determining a difficulty level of the first data; And
And a third reliability determination module that determines reliability of the second data using the difficulty of the first data,
The reference value,
As a criterion for judging whether or not the second data can be stored in the correct knowledge database 20,
A first reference value based on the reliability; And a second reference value based on the number of participants who evaluated the evaluation set,
The data determination unit determines that the reliability of the second data among the completed sets transmitted to the correct answer knowledge database 20 has a range of '(first reference value) <(reliability of second data) 1, And transmits to the correct answer knowledge database (20) only when the number of participating participants is larger than the second reference value.
(The mathematical formulas used in the above system are as follows.
(Where M is the number of first data, CA is the number of knowledge data processed as the correct answer, and IA is the number of knowledge data processed as an incorrect answer)
(S is the reliability of the first data)
(S is the reliability of the second data, The sum of difficulty levels of the correct first data, Is the sum of the degree of difficulty of the first data)
(Where W is the difficulty level, A is the number of times judged to be true for the specific knowledge, B is the number of times judged to be false for the specific knowledge, Z is the number of selection which is the criterion for ensuring validity for the undecided knowledge data, ≪ / RTI >
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150152389A KR101772199B1 (en) | 2015-10-30 | 2015-10-30 | System for knowledge verification based on crowdsourcing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150152389A KR101772199B1 (en) | 2015-10-30 | 2015-10-30 | System for knowledge verification based on crowdsourcing |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20170050617A KR20170050617A (en) | 2017-05-11 |
KR101772199B1 true KR101772199B1 (en) | 2017-09-01 |
Family
ID=58741594
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020150152389A KR101772199B1 (en) | 2015-10-30 | 2015-10-30 | System for knowledge verification based on crowdsourcing |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR101772199B1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102385294B1 (en) * | 2021-10-15 | 2022-04-11 | 셀렉트스타 주식회사 | Methods, systems and computer readable media for automatically selecting sample tasks and obtaining correct answers for worker competency tests without expert response through crowdsourcing |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109783631B (en) * | 2019-02-02 | 2022-05-17 | 北京百度网讯科技有限公司 | Community question-answer data verification method and device, computer equipment and storage medium |
KR102254653B1 (en) * | 2019-03-05 | 2021-05-21 | 주식회사 엔비티 | Data labeling system using legacy service and method thereof |
KR102138574B1 (en) * | 2019-03-12 | 2020-08-12 | 주식회사 크라우드웍스 | Method for evaluating inspectors who perform checks on work tasks based on crowd sourcing and apparatus thereof |
KR102138576B1 (en) * | 2019-03-25 | 2020-07-29 | 주식회사 크라우드웍스 | Method for assigning work to collecting data based on crowdsourcing and apparatus thereof |
KR102155746B1 (en) * | 2019-12-09 | 2020-09-17 | 주식회사 크라우드웍스 | Method for verifying inspectors by sample inspection between inspectors |
KR102155750B1 (en) * | 2020-02-17 | 2020-09-14 | 주식회사 크라우드웍스 | Method for managing inspection quality using returned work result of crowdsourcing based project for artificial intelligence training data generation |
KR102480575B1 (en) * | 2020-02-19 | 2022-12-23 | 주식회사 에이모 | Method and apparatus for cloud outsorcing task distribution by using artificial intelligent |
KR102481583B1 (en) * | 2020-02-19 | 2022-12-27 | 주식회사 에이모 | Method and apparatus for forming inspection criteria data with cloud outsorcing work by using artificial intelligent |
KR102405298B1 (en) * | 2020-02-19 | 2022-06-07 | 주식회사 에이모 | Apparatus and method for cloud outsorcing task management by using artificial intelligent |
CN111339231B (en) * | 2020-02-25 | 2024-04-09 | 合肥四维图新科技有限公司 | Crowd-sourced update result processing method and device |
KR102155791B1 (en) * | 2020-03-17 | 2020-09-15 | 주식회사 크라우드웍스 | Method for secondarily inspecting for case estimated as negative inspection of crowdsourcing based project for artificial intelligence training data generation |
KR102159574B1 (en) * | 2020-05-07 | 2020-09-24 | 주식회사 크라우드웍스 | Method for estimating and managing the accuracy of work results of crowdsourcing based projects for artificial intelligence training data generation |
KR102183812B1 (en) * | 2020-06-02 | 2020-11-30 | 주식회사 크라우드웍스 | Method for paying inspection fee based on hourly wage for each inspector using the verification work results of crowdsourcing based projects for artificial intelligence training data generation |
KR102232874B1 (en) * | 2020-06-11 | 2021-03-29 | 주식회사 크라우드웍스 | Method for evaluating new inspector using verification work result of crowdsourcing based project for artificial intelligence training data generation |
-
2015
- 2015-10-30 KR KR1020150152389A patent/KR101772199B1/en active IP Right Grant
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102385294B1 (en) * | 2021-10-15 | 2022-04-11 | 셀렉트스타 주식회사 | Methods, systems and computer readable media for automatically selecting sample tasks and obtaining correct answers for worker competency tests without expert response through crowdsourcing |
WO2023063499A1 (en) * | 2021-10-15 | 2023-04-20 | 셀렉트스타 주식회사 | Automated method, system, and computer readable medium for selecting sample task for testing ability of worker and obtaining correct answer therefor through crowdsourcing without answer from expert |
KR20230054239A (en) * | 2021-10-15 | 2023-04-24 | 셀렉트스타 주식회사 | Methods, systems and computer readable media for automatically selecting sample tasks and obtaining correct answers for worker competency tests without expert response through crowdsourcing |
KR102632055B1 (en) | 2021-10-15 | 2024-02-01 | 셀렉트스타 주식회사 | Methods, systems and computer readable media for automatically selecting sample tasks and obtaining correct answers for worker competency tests without expert response through crowdsourcing |
Also Published As
Publication number | Publication date |
---|---|
KR20170050617A (en) | 2017-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101772199B1 (en) | System for knowledge verification based on crowdsourcing | |
US10432643B2 (en) | System and method for validating users using social network information | |
Nakano et al. | Analysis of cyber aggression and cyber-bullying in social networking | |
PH12020551830A1 (en) | Computerized systems and methods for determining authenticity using micro expressions | |
US9418567B1 (en) | Selecting questions for a challenge-response test | |
Choi et al. | Team integration and owner satisfaction: Comparing integrated project delivery with construction management at risk in health care projects | |
US9984427B2 (en) | Data ingestion module for event detection and increased situational awareness | |
US9864855B2 (en) | Verification data processing method and device and storage medium | |
JP2017153078A5 (en) | Artificial intelligence learning system, response relay system, and artificial intelligence learning method | |
McCord et al. | Empirical demonstration that expected utility decision analysis is not operational | |
CN104899195B (en) | Recommendation method and device for personalized education resources | |
US11226991B2 (en) | Interest tag determining method, computer device, and storage medium | |
CN107291775B (en) | Method and device for generating repairing linguistic data of error sample | |
US10057155B2 (en) | Method and apparatus for determining automatic scanning action | |
CN111709244A (en) | Deep learning method for identifying causal relationship of contradictory dispute events | |
Maiti et al. | Prioritizing non-functional requirements in agile software engineering | |
CN104852916A (en) | Social engineering-based webpage verification code recognition method and system | |
Starbird | Crowdwork, crisis and convergence: how the connected crowd organizes information during mass disruption events | |
CN110390109B (en) | Method and device for analyzing association relation among multiple group chat messages | |
CN113723477A (en) | Cross-feature federal abnormal data detection method based on isolated forest | |
US20140136534A1 (en) | Similarity calculating method and apparatus | |
CN106897743B (en) | Mobile attendance anti-cheating big data detection method based on Bayesian model | |
KR101894060B1 (en) | Advertisement providing server using chatbot | |
CN103853933B (en) | Android digital forensics-oriented user behavior analysis method and system | |
CN104462279A (en) | Method and device for acquiring feature information of analysis object |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
AMND | Amendment | ||
E601 | Decision to refuse application | ||
AMND | Amendment | ||
X701 | Decision to grant (after re-examination) | ||
GRNT | Written decision to grant |