CN110287385A - A kind of corpus data acquisition method, system and storage medium - Google Patents
A kind of corpus data acquisition method, system and storage medium Download PDFInfo
- Publication number
- CN110287385A CN110287385A CN201910526963.8A CN201910526963A CN110287385A CN 110287385 A CN110287385 A CN 110287385A CN 201910526963 A CN201910526963 A CN 201910526963A CN 110287385 A CN110287385 A CN 110287385A
- Authority
- CN
- China
- Prior art keywords
- user
- answer
- question
- information
- data acquisition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 230000005284 excitation Effects 0.000 claims abstract description 13
- 230000004044 response Effects 0.000 claims abstract description 5
- 238000004590 computer program Methods 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000010408 sweeping Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 238000012550 audit Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9032—Query formulation
- G06F16/90332—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a kind of corpus data acquisition method, system and storage mediums, which comprises receives the input information of the first user, wherein the input information is as obtained by the external equipment input for being loaded with instant communication software;In response to the input information, multiple question and answer topics are extracted from question and answer exam pool, and are at least one another first user of the first user On-line matching, meanwhile, the question and answer topic is sent on the external equipment of each first user;The answer information of each first user is received, and the answer information is verified and stored;On the external equipment for sending each first user of the preset Electron Excitation to after being verified;Its effect is: question and answer problem data is distributed to two or more users, if passing through verifying for the answer information of each user of the same topic, then think that the corpus data is effective, each participating user can obtain certain excitation, to effectively increase the validity of data acquisition.
Description
Technical field
The present invention relates to field of computer technology, and in particular to a kind of corpus data acquisition method, system and storage medium.
Background technique
Currently in order to compile corpus data, it is main there are two ways to: 1. crowdsourcing companies are by the collection of data and whole
Reason task is distributed away, and user is required to provide clean data with compensation;2. the data that social software or website are collected are carried out
Cleaning or customer service handmarking or audit, obtain clean data.Problem of the existing technology: 1. cannot guarantee number
According to clean degree, for example there are dirty datas;2. it is relatively low using customer service manual examination and verification or labeling effciency, and it is easy error.
Summary of the invention
A kind of can be improved that be to provide of the embodiment of the present invention acquires a kind of corpus data acquisition method of data validity, is
System and storage medium.
In a first aspect, a kind of corpus data acquisition method provided in an embodiment of the present invention, which comprises
Receive the input information of the first user, wherein the input information is by being loaded with the outer of instant communication software
Portion's equipment input gained, the external equipment includes at least one of: smart phone and PC;
In response to the input information, multiple question and answer topics are extracted from question and answer exam pool, and are existed for first user
At least one another first user of lines matching, meanwhile, the question and answer topic is sent on the external equipment of each first user;
The answer information of each first user is received, and the answer information is verified and stored;
On the external equipment for sending each first user of the preset Electron Excitation to after being verified.
In one possible implementation, the method also includes:
The online lazy weight of first user is then first user when can not match to first user
At least one second user is matched, using the second user as another first user, wherein the second user is visitor
Take personnel.
In one possible implementation, the method also includes:
The question and answer information of third user input is received, and the question and answer information is stored in the question and answer exam pool, wherein
The third user is the user actively putd question to.
In one possible implementation, the answer information is stored, is specifically included:
The answer information includes topic answer and the question-and-answer problem purpose sentence of same meaning;
Classification storage is carried out according to the different type of the answer information.
Second aspect, a kind of corpus data acquisition system provided in an embodiment of the present invention, including client and cloud;
The client is used to obtain the input information of the first user and externally sends, wherein the input information is logical
Cross the external equipment input gained for being loaded with instant communication software;
The cloud is used to receive and respond to the input information, and multiple question and answer topics are extracted from question and answer exam pool,
And be at least one another first user of the first user On-line matching, meanwhile, the question and answer topic is sent to each first
On the external equipment of user;
The cloud is also used to receive the answer information of each first user, and the answer information is verified and deposited
Storage;On the external equipment for sending each first user of the preset Electron Excitation to after being verified.
In one possible implementation, the cloud is also used to the online lazy weight of first user, can not
To first user matching when, then match at least one second user for first user, using the second user as
Another first user, wherein the second user is contact staff.
In one possible implementation, the cloud is also used to receive the question and answer information of third user input, and will
The question and answer information is stored in the question and answer exam pool, wherein the third user is the user actively putd question to.
In one possible implementation, described that the answer information is stored, it specifically includes:
The answer information includes topic answer and the question-and-answer problem purpose sentence of same meaning;
Classification storage is carried out according to the different type of the answer information.
The third aspect, a kind of computer readable storage medium, computer storage medium are stored with computer program, computer
Program includes program instruction, which makes the step of processor execution first aspect the method when being executed by a processor
Suddenly.
By adopting the above technical scheme, have the advantage that a kind of corpus data acquisition method proposed by the present invention, system and
Storage medium, the embodiment of the present invention is by being distributed to two or more users for question and answer problem data, if for the same topic
The answer information of each user of purpose passes through verifying, then it is assumed that the corpus data is effective, and each participating user can obtain certain
Excitation, to effectively increase the validity of data acquisition.
Detailed description of the invention
Fig. 1 is a kind of method flow diagram of corpus data acquisition method provided in an embodiment of the present invention;
Fig. 2 is the application scenarios schematic diagram that the first user participates in corpus data acquisition in the embodiment of the present invention;
Fig. 3 is the application scenarios schematic diagram that second user participates in corpus data acquisition in the embodiment of the present invention;
Fig. 4 is the application scenarios schematic diagram that third user participates in corpus data acquisition in the embodiment of the present invention;
Fig. 5 is the structural schematic diagram that each user participates in corpus data acquisition in the embodiment of the present invention.
Specific embodiment
In order to keep the technical problem to be solved in the present invention, technical solution and advantage clearer, below in conjunction with attached drawing and
Specific embodiment is described in detail, and the following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention..
Shown in referring to Fig.1, the embodiment of the invention provides a kind of corpus data acquisition methods, which comprises
S101 receives the input information of the first user, wherein the input information is by being loaded with instant communication software
External equipment input gained, the external equipment includes at least one of: smart phone and PC.
Specifically, the instant communication software includes the social communication software or hard of the softwares such as wechat, QQ or independent research
The input form of part equipment, the input information includes manual form and automatic form, and manual form is defeated using traditional text
Enter or speech recognition input, scanning input etc. can be used in the input forms such as video, audio and picture, automatic form;With reference to Fig. 2
Shown, the present embodiment is illustrated in such a way that participant is scanned the two-dimensional code by smart phone, the external equipment
It can be special corpus transacter, the first user here is a group in participant, the class of other participants
Type will be described subsequent.
S102 extracts multiple question and answer topics in response to the input information from question and answer exam pool, and uses for described first
At least one another first user of family On-line matching, meanwhile, the question and answer topic is sent to the external equipment of each first user
On.
Specifically, cloud executes above-mentioned steps, and the cloud can be regarded as server end, and cloud is entered information triggering
Afterwards, choose at least one question and answer topic from exam pool, the present embodiment by extract it is multiple for be illustrated, and On-line matching is at least
One the first user of others, that is, class of answering a question participant, it is different former that strange land priority principle, gender can be used in matched principle
Then with professional person's principle etc., the information such as region, gender in matching principle obtained after being authorized by user or user from
The modes such as oneself input obtain, and details are not described herein, can reduce the probability practised fraud between each user in this way.
S103, receives the answer information of each first user, and the answer information is verified and stored.
Specifically, cloud executes above-mentioned steps, and the standard of verifying is 2 users or 2 or more user for same
It is required that reply it is identical or consistent, then pass through verifying, it is believed that the corpus data of this acquisition is effective, and is stored;Wherein,
The answer information is stored, is specifically included:
The answer information includes topic answer and the question-and-answer problem purpose sentence of same meaning;
Classification storage is carried out according to the different type of the answer information;Answer is divided into two classes i.e. in the present embodiment, the
One kind is answer (i.e. topic answer);Second class is the sentence of same meaning (i.e. the sentence of same meaning of topic answer);When participant participates in, question and answer
Topic has corresponding prompt, that is to say, that further includes prompt information in the question and answer topic;In order to make it easy to understand, below into
Row is for example, for example, will answer data is divided into 2 classes: the first kind is answers, and for a corpus data A, the reply of user is
Answer to A, is set as B, it may be assumed that B is the answer of A, such as: A=" what is your name? ", " I is Lee to B=.";Second class is
The sentence of same meaning, for a corpus data A, the reply of user is the sentence equivalent in meaning with A, is set as A ', it may be assumed that A ' is the synonymous of A
Sentence, such as: A=" what is your name? ", A '=" who are you? ";
The answer information is verified, further includes:
The geographical location of each first user is verified;I.e. when carrying out matching user, priority match relevance as far as possible
Lesser 2 or multiple users, relevance judge according to the geographical location of user, for example, the geographical location phase of 2 users
Away from less than 100 meters, it is believed that the relevance of this 2 users is larger;
The external equipment of each first user is verified;For example, judging that 2 users join from the same hardware device simultaneously
With answer, then it is assumed that the relevance of this 2 users is larger, for example, the MAC Address etc. of equipment is judged;
The social software of each first user is verified with the presence or absence of association;For example, 2 users are deposited by social software
In association (wechat good friend, QQ friends etc.), then it is assumed that the relevance of this 2 users is larger.Cloud executes above-mentioned steps, if association
Property it is larger, then be not verified, using above-mentioned step, improve the validity that user answers, reduce between each user mutually
The case where cheating.
S104, on the external equipment for sending each first user of the preset Electron Excitation to after being verified.
Specifically, the Electron Excitation is integral, red packet or other virtual objects, will not enumerate, passes through herein
This mode, the conscientiously degree for improving everybody participation and answering a question, to improve the validity of data acquisition.
Fig. 2 gives the application scenarios that first user carries out corpus data acquisition, and detailed process refers to foregoing description
With it is as shown in the figure.
Through the above scheme, question and answer problem data is distributed to two or more users, if for the same topic
The answer information of each user passes through verifying, then it is assumed that and the corpus data is effective, and each participating user can obtain certain excitation,
To effectively increase the validity of data acquisition.
In other embodiments, it is contemplated that actual applicable cases, it is understood that there may be can not be same for first user matching
When the user group of sample, the method also includes:
The online lazy weight of first user is then first user when can not match to first user
At least one second user is matched, using the second user as another first user, wherein the second user is visitor
Take personnel.
Specifically, in order not to destroy the proof rule of setting, as shown in figure 3, using contact staff as the use of a type
Family group substitutes matched another first user described previously, is equivalent to the user's second being responsible for playing the part of in Fig. 2, conscientiously answers every
One problem, follow-up process with it is described previously identical, details are not described herein.
In other embodiments, the method also includes:
The question and answer information of third user input is received, and the question and answer information is stored in the question and answer exam pool, wherein
The third user is the user actively putd question to.
Specifically, in use, refering to what is shown in Fig. 4, the user that active is putd question to passes through as another participant's type
The third user and cloud carry out the collection of corpus data in a manner of puing question to or chat, by way of role transforming,
First is that the exam pool of itself is enriched, in application, the first user or second user can be sent to these data, to obtain
It answers;Second is that obtaining the focus from the angle of participant to corpus data, more personalized and specific aim.
With reference to Fig. 5, participant is divided into three groups by this programme, is set as the first user, second user, third user;The
One user is participated in by way of scanning or wechat public platform, and the user's first and user's second of the first user is by sweeping live two-dimensional code
Participation is answered a question (question-and-answer problem purpose quantity is 1~10), if the two is all consistent for the answer of each topic, both sides will
It receives awards, does not otherwise reward, but will record corresponding data volume;
Second user (can not be matched to user's second) when there is demand in time and be responsible for playing the part of user's second, conscientiously as customer service
Answer each problem;
Third user is as experience user, that is, the beneficiary of data, therefore third user may need use of paying,
It can be participated in by sweeping two dimensional code, be putd question to public platform and perhaps chat these data clouds and can be sent to the first user or the
Two users, to be answered.
Based on the same inventive concept of above-mentioned acquisition method, the embodiment of the invention also provides a kind of acquisitions of corpus data to be
System, including client and cloud.
The client is used to obtain the input information of the first user and externally sends, wherein the input information is logical
Cross the external equipment input gained for being loaded with instant communication software.
Specifically, the quantity of the client is multiple, and the instant communication software includes the softwares such as wechat, QQ or autonomous
The input form of the social communication software or hardware device of research and development, the input information includes manual form and automatic form, hand
Speech recognition can be used using input forms, automatic forms such as traditional text input or video, audio and pictures in dynamic form
Input, scanning input etc.;The external equipment is also possible to special corpus transacter, and the first user here is to participate in
A group in person.
The cloud is used to receive and respond to the input information, and multiple question and answer topics are extracted from question and answer exam pool,
And be at least one another first user of the first user On-line matching, meanwhile, the question and answer topic is sent to each first
On the external equipment of user.
Specifically, the cloud can be regarded as server end, after cloud is entered information triggering, choose at least from exam pool
One question and answer topic, the present embodiment by extract it is multiple for be illustrated, and the first user that On-line matching is at least one other,
Strange land priority principle, gender distinct principle and professional person's principle etc. can be used in class of answering a question participant, matched principle,
The information such as region, gender in matching principle obtain after being authorized by user or the modes such as user oneself input obtain,
This is repeated no more, and can reduce the probability practised fraud between each user in this way.
The cloud is also used to receive the answer information of each first user, and the answer information is verified and deposited
Storage;On the external equipment for sending each first user of the preset Electron Excitation to after being verified.
Specifically, cloud executes above-mentioned steps, and the standard of verifying is 2 users or 2 or more user for same
It is required that reply it is identical or consistent, then pass through verifying, it is believed that the corpus data of this acquisition is effective, and is stored;Wherein,
The answer information is stored, is specifically included:
The answer information includes topic answer and the question-and-answer problem purpose sentence of same meaning;
Classification storage is carried out according to the different type of the answer information;Answer is divided into two classes i.e. in the present embodiment, the
One kind is answer (i.e. topic answer);Second class is the sentence of same meaning (i.e. the sentence of same meaning of topic answer);
When participant participates in, question and answer topic has corresponding prompt, that is to say, that further includes mentioning in the question and answer topic
Show information;In order to make it easy to understand, being exemplified below;For example, will answer data is divided into 2 classes: the first kind is answer, for
One corpus data A, the reply of user are the answers to A, are set as B, it may be assumed that B is the answer of A, such as: " you are any name to A=
Word? ", " I is Lee to B=.";Second class is the sentence of same meaning, and for a corpus data A, the reply of user is equivalent in meaning with A
Sentence, be set as A ', it may be assumed that A ' is the sentence of same meaning of A, such as: A=" what is your name? ", A '=" who are you? ";
The answer information is verified, further includes:
The geographical location of each first user is verified;I.e. when carrying out matching user, priority match relevance as far as possible
Lesser 2 or multiple users, relevance judge according to the geographical location of user, for example, the geographical location phase of 2 users
Away from less than 100 meters, it is believed that the relevance of this 2 users is larger;
The external equipment of each first user is verified;For example, judging that 2 users join from the same hardware device simultaneously
With answer, then it is assumed that the relevance of this 2 users is larger, for example, the MAC Address etc. of equipment is judged;
The social software of each first user is verified with the presence or absence of association;For example, 2 users are deposited by social software
In association (wechat good friend, QQ friends etc.), then it is assumed that the relevance of this 2 users is larger.Cloud executes above-mentioned steps, if association
Property it is larger, then be not verified, using above-mentioned step, improve the validity that user answers, reduce between each user mutually
The case where cheating;
The Electron Excitation is integral, red packet or other virtual objects, will not enumerate herein, passes through this side
Formula, the conscientiously degree for improving everybody participation and answering a question, to improve the validity of data acquisition;Pass through the language that will be collected
Material data (answer data i.e. described above) is stored and is added in exam pool, and then the data of exam pool send out collection answer,
These answer informations increase to exam pool again, form dynamic circulation.
By above system, question and answer problem data is distributed to two or more users, if for the same topic
The answer information of each user passes through verifying, then it is assumed that and the corpus data is effective, and each participating user can obtain certain excitation,
To effectively increase the validity of data acquisition.
In other embodiments, the cloud is also used to the online lazy weight of first user, can not be to described
When one user matches, then at least one second user is matched for first user, using the second user as described another
First user, wherein the second user is contact staff.
Specifically, it is contemplated that actual applicable cases, it is understood that there may be same user can not be matched for first user
When group, in order not to destroy the proof rule of setting, using contact staff as the user group of a type, substitute described previously
Matched another first user, is equivalent to the user's second being responsible for playing the part of in Fig. 2, conscientiously answer each problem, follow-up process with
Described previously identical, details are not described herein.
In other embodiments, the cloud is also used to receive the question and answer information of third user input, and by the question and answer
Information is stored in the question and answer exam pool, wherein the third user is the user actively putd question to.
Specifically, user active putd question to as another participant's type, by the third user and cloud with
The mode putd question to or chatted carries out the collection of corpus data, by way of role transforming, first is that the exam pool of itself is enriched,
In application, the first user or second user can be sent to these data, to be answered;Second is that obtaining from participation
The angle of person is to the focus of corpus data, more personalized and specific aim.
The embodiment of the invention also provides a kind of computer readable storage medium, computer storage medium is stored with computer
Program, computer program include program instruction, which makes processor execute first aspect institute when being executed by a processor
The step of stating method.
The computer readable storage medium can use hard disk or memory.The computer readable storage medium can also be with
It is External memory equipment, such as plug-in type hard disk, intelligent memory card (Smart Media Card, SMC), secure digital
(Secure Digital, SD) card, flash card (Flash Card) etc..Further, the computer readable storage medium is also
Can both including the terminal memory and also including External memory equipment.The computer readable storage medium of the present embodiment, holds
Row method as described in the examples, details are not described herein.
Those of ordinary skill in the art may be aware that system module described in conjunction with the examples disclosed in this document and
Method and step can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and soft
The interchangeability of part generally describes each exemplary composition and step according to function in the above description.These function
It can be implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Professional skill
Art personnel can use different methods to achieve the described function each specific application, but this realization should not be recognized
It is beyond the scope of this invention.
Finally, it should be noted that foregoing description is only a specific embodiment of the invention, but protection scope of the present invention
It is not limited thereto, anyone skilled in the art in the technical scope disclosed by the present invention, can readily occur in
Change or replacement, should be covered by the protection scope of the present invention.
Claims (9)
1. a kind of corpus data acquisition method, which is characterized in that the described method includes:
Receive the input information of the first user, wherein the input information is set by being loaded with the outside of instant communication software
Standby input gained, the external equipment includes at least one of: smart phone and PC;
In response to the input information, multiple question and answer topics are extracted from question and answer exam pool, and are online of first user
With at least one another first user, meanwhile, the question and answer topic is sent on the external equipment of each first user;
The answer information of each first user is received, and the answer information is verified and stored;
On the external equipment for sending each first user of the preset Electron Excitation to after being verified.
2. a kind of corpus data acquisition method according to claim 1, which is characterized in that the method also includes:
The online lazy weight of first user is then first user matching when can not match to first user
At least one second user, using the second user as another first user, wherein the second user is customer service people
Member.
3. a kind of corpus data acquisition method according to claim 1, which is characterized in that the method also includes:
The question and answer information of third user input is received, and the question and answer information is stored in the question and answer exam pool, wherein is described
Third user is the user actively putd question to.
4. a kind of corpus data acquisition method according to claim 1, which is characterized in that deposited to the answer information
Storage, specifically includes:
The answer information includes topic answer and the question-and-answer problem purpose sentence of same meaning;
Classification storage is carried out according to the different type of the answer information.
5. a kind of corpus data acquisition system, which is characterized in that including client and cloud;
The client is used to receive the input information of the first user, wherein the input information is by being loaded with Instant Messenger
Believe that the external equipment of software inputs gained;
The cloud is used in response to the input information, extracts multiple question and answer topics from question and answer exam pool, and is described the
At least one another first user of one user's On-line matching, meanwhile, the question and answer topic is sent to the outside of each first user
In equipment;
The cloud is also used to receive the answer information of each first user, and the answer information is verified and stored;Hair
On the external equipment for sending each first user of the preset Electron Excitation to after being verified.
6. a kind of corpus data acquisition system according to claim 5, which is characterized in that the cloud is also used to described
The online lazy weight of one user, when can not be matched to first user, then for first user match at least one the
Two users, using the second user as another first user, wherein the second user is contact staff.
7. a kind of corpus data acquisition system according to claim 5, which is characterized in that the cloud is also used to receive
The question and answer information of three users input, and the question and answer information is stored in the question and answer exam pool, wherein the third user is
The user actively putd question to.
8. a kind of corpus data acquisition system according to claim 5, which is characterized in that it is described to the answer information into
Row storage, specifically includes:
The answer information includes topic answer and the question-and-answer problem purpose sentence of same meaning;
Classification storage is carried out according to the different type of the answer information.
9. a kind of computer readable storage medium, computer storage medium are stored with computer program, which is characterized in that computer
Program includes program instruction, which require processor perform claim described in any one of 1-4
The step of method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910526963.8A CN110287385A (en) | 2019-06-18 | 2019-06-18 | A kind of corpus data acquisition method, system and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910526963.8A CN110287385A (en) | 2019-06-18 | 2019-06-18 | A kind of corpus data acquisition method, system and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110287385A true CN110287385A (en) | 2019-09-27 |
Family
ID=68003941
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910526963.8A Pending CN110287385A (en) | 2019-06-18 | 2019-06-18 | A kind of corpus data acquisition method, system and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110287385A (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102148856A (en) * | 2010-12-30 | 2011-08-10 | 百度在线网络技术(北京)有限公司 | Knowledge information interaction service method, platform and site |
CN103106267A (en) * | 2013-02-02 | 2013-05-15 | 浙江大学 | Information collection method based on microblog crowdsourcing question-answering system |
CN105664488A (en) * | 2014-11-20 | 2016-06-15 | 博雅网络游戏开发(深圳)有限公司 | Card game control method and system |
US20160196299A1 (en) * | 2015-01-03 | 2016-07-07 | International Business Machines Corporation | Determining Answer Stability in a Question Answering System |
CN105991399A (en) * | 2015-02-05 | 2016-10-05 | 天脉聚源(北京)科技有限公司 | Method and system for realizing questioning over network |
CN106485570A (en) * | 2016-09-20 | 2017-03-08 | 网易(杭州)网络有限公司 | A kind of information processing method and device |
CN107783970A (en) * | 2016-08-25 | 2018-03-09 | 武汉聚蜗网络科技有限公司 | A kind of expert's question answering system and its operating method |
CN107899245A (en) * | 2017-12-11 | 2018-04-13 | 武汉卓讯互动信息科技有限公司 | A kind of anti-cheating method, device and system |
CN108197202A (en) * | 2017-12-28 | 2018-06-22 | 百度在线网络技术(北京)有限公司 | Data verification method, device, server and the storage medium of crowdsourcing task |
CN108961002A (en) * | 2018-07-05 | 2018-12-07 | 厦门微芽互娱科技有限公司 | Intelligence spells group's method, medium, terminal device and system |
CN109525480A (en) * | 2018-09-14 | 2019-03-26 | 广东神马搜索科技有限公司 | Customer problem collection system and method |
CN109783631A (en) * | 2019-02-02 | 2019-05-21 | 北京百度网讯科技有限公司 | Method of calibration, device, computer equipment and the storage medium of community's question and answer data |
-
2019
- 2019-06-18 CN CN201910526963.8A patent/CN110287385A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102148856A (en) * | 2010-12-30 | 2011-08-10 | 百度在线网络技术(北京)有限公司 | Knowledge information interaction service method, platform and site |
CN103106267A (en) * | 2013-02-02 | 2013-05-15 | 浙江大学 | Information collection method based on microblog crowdsourcing question-answering system |
CN105664488A (en) * | 2014-11-20 | 2016-06-15 | 博雅网络游戏开发(深圳)有限公司 | Card game control method and system |
US20160196299A1 (en) * | 2015-01-03 | 2016-07-07 | International Business Machines Corporation | Determining Answer Stability in a Question Answering System |
CN105991399A (en) * | 2015-02-05 | 2016-10-05 | 天脉聚源(北京)科技有限公司 | Method and system for realizing questioning over network |
CN107783970A (en) * | 2016-08-25 | 2018-03-09 | 武汉聚蜗网络科技有限公司 | A kind of expert's question answering system and its operating method |
CN106485570A (en) * | 2016-09-20 | 2017-03-08 | 网易(杭州)网络有限公司 | A kind of information processing method and device |
CN107899245A (en) * | 2017-12-11 | 2018-04-13 | 武汉卓讯互动信息科技有限公司 | A kind of anti-cheating method, device and system |
CN108197202A (en) * | 2017-12-28 | 2018-06-22 | 百度在线网络技术(北京)有限公司 | Data verification method, device, server and the storage medium of crowdsourcing task |
CN108961002A (en) * | 2018-07-05 | 2018-12-07 | 厦门微芽互娱科技有限公司 | Intelligence spells group's method, medium, terminal device and system |
CN109525480A (en) * | 2018-09-14 | 2019-03-26 | 广东神马搜索科技有限公司 | Customer problem collection system and method |
CN109783631A (en) * | 2019-02-02 | 2019-05-21 | 北京百度网讯科技有限公司 | Method of calibration, device, computer equipment and the storage medium of community's question and answer data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Udupa | Enterprise Hindutva and social media in urban India | |
Ackland | Web social science: Concepts, data and tools for social scientists in the digital age | |
Crowe | Leadership in the open: A new paradigm in emergency management | |
Hellström | Crowdsourcing as a tool for political participation?-the case of Ugandawatch | |
Postoaca | The anonymous elect: Market research through online access panels | |
Tan et al. | Virtually boyfriends: the ‘social factory’and affective labor of male virtual lovers in China | |
CN110287385A (en) | A kind of corpus data acquisition method, system and storage medium | |
CN109146737B (en) | Intelligent interaction method and device based on examination platform | |
Falcão et al. | Researching hard-to-reach populations: lessons learned from dispersed migrant communities | |
Hood et al. | What happens when animals tweet? A case study at Brookfield Zoo | |
Bor | Democratic deliberation on social network sites: A study of digital deliberative discourse in the 2012 election | |
Ridho et al. | The Urgency of Understanding Digital Literacy In The Flow of Digitalization of Communication And Information | |
Smith et al. | The Global Society of Peace Engineers—advocating for the profession | |
Mazzarotto | Dating in the digital age: A research experiment | |
Kolonin et al. | Reputation system for online communities | |
Muaka | The Role of Social Media in Facilitating Diplomatic Engagements in East Africa. A Comparative Study of Kenya and Rwanda. | |
Wang et al. | Differences in chatting behavior between two kakaogroup communities composed of female immigrants | |
Tung et al. | Social Media and Privacy: Comparing US and Japanese College Students’ Use of Facebook and Twitter | |
Ferreira | Brand perceptions of consumer activists using Twitter: an exploratory study | |
Konieczy | The impact of modern information and communication technologies on social movements | |
Wang | New Media, Public Participation and the Government in China: from the BBS to the Weibo age. | |
Zhang | Digital Lifeworld and Communicative Interaction: Conceptualizing the Transformative Potentials of Social Networking in the Public Sphere | |
Martin | Incivility Online: Exploring the Local Newspaper Journalist-Audience Relationship | |
Inyang | INFLUENCE OF SOCIAL MEDIA ON AIRTEL PRACTICE OF PUBLIC RELATIONS | |
Erlandson | Social Media and Social Networking |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190927 |