CN116189216A - Method and device for rapidly inputting topics based on structured data - Google Patents

Method and device for rapidly inputting topics based on structured data Download PDF

Info

Publication number
CN116189216A
CN116189216A CN202310148865.1A CN202310148865A CN116189216A CN 116189216 A CN116189216 A CN 116189216A CN 202310148865 A CN202310148865 A CN 202310148865A CN 116189216 A CN116189216 A CN 116189216A
Authority
CN
China
Prior art keywords
question
content
topic
line
questions
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310148865.1A
Other languages
Chinese (zh)
Inventor
马鸿图
王佳静
金庆文
张震
陈健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Whaty Technology Development Co ltd
Original Assignee
Beijing Whaty Technology Development Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Whaty Technology Development Co ltd filed Critical Beijing Whaty Technology Development Co ltd
Priority to CN202310148865.1A priority Critical patent/CN116189216A/en
Publication of CN116189216A publication Critical patent/CN116189216A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/414Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/11File system administration, e.g. details of archiving or snapshots
    • G06F16/116Details of conversion of file system types or formats
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Educational Technology (AREA)
  • Educational Administration (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Graphics (AREA)
  • Data Mining & Analysis (AREA)
  • Geometry (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a device for rapidly inputting topics based on structured data, which belong to the technical field of computer processing, and the method comprises the following steps: determining a question to be input; sequentially identifying the related content of each question to be input according to a preset rule, wherein the related content of each question at least comprises a question stem, a question type to which the question belongs and a knowledge point to which the question belongs; the identified topics are stored as file content in a format readable by the topic library system. The embodiment of the invention can improve the question recording efficiency, save the question recording time of a user and lighten the burden of the user.

Description

Method and device for rapidly inputting topics based on structured data
Technical Field
The invention relates to the technical field of computer processing, in particular to a method and a device for rapidly inputting topics based on structured data.
Background
As the first step of the online learning examination, the input of the questions in the question bank not only affects the flow and progress of the whole examination, but also relates to the efficiency and quality of the follow-up examination. The following problems are common in the current big question recording tools:
the operation flow is complicated, so that the learning cost is high;
the strict fault tolerance rate of the format specification is low, so that the problem recording tool is not good and the efficiency is low.
Therefore, a fast recording method for improving the recording efficiency and saving the recording time of the user is needed.
Disclosure of Invention
The embodiment of the invention provides a method and a device for rapidly inputting questions based on structured data, which can improve the efficiency of recording questions, save the time of recording questions for a user and lighten the burden of the user. The technical proposal is as follows:
in a first aspect, an embodiment of the present invention provides a method for rapidly inputting a topic based on structured data, including:
determining a question to be input;
sequentially identifying the related content of each question to be input according to a preset rule, wherein the related content of each question at least comprises a question stem, a question type to which the question belongs and a knowledge point to which the question belongs;
the identified topics are stored as file content in a format readable by the topic library system.
Optionally, the related content of the title further includes: at least one of options and option content in the questions, answers in the questions, answer analysis in the questions, question attributes to which the questions belong and question labels contained in the questions.
Optionally, identifying the stem in the topic includes:
reading the content of the first row in the paragraph;
if the first line is started by a number plus a specific punctuation mark and the Nth line is ended by a line feed symbol, the content before the line feed symbol after the specific punctuation mark is determined to be the stem content in the title, and N is a positive integer.
Optionally, after determining that the content before the line feed symbol is the stem content in the topic after the specific punctuation mark, the method further includes:
continuing to read the next row of content;
if the next line of content that continues to be read is free of content, or begins with a number or letter or with a first particular field, it is determined that the stem content in the identified topic has ended.
Optionally, identifying the topic type to which the topic belongs includes:
after the stem in the question is identified, reading the text content of the last section of the stem;
and determining the topic type to which the topic belongs according to the topic type identification text in the previous text content.
Optionally, the method further comprises:
and if the text content of the previous section does not contain the topic type identification text, determining the topic type of the previous topic as the topic type of the current topic.
Optionally, identifying the knowledge point to which the topic belongs includes:
after the related field of the knowledge point is added with a specific punctuation mark, the next text content after reading the next line feed is the content before the beginning of the fourth specific field, or the next text content after reading the line feed is the next question, and the content before the next question is determined to be the knowledge point to which the question belongs;
wherein if a plurality of knowledge points exist, the plurality of knowledge points are separated by a preset punctuation mark.
Optionally, identifying options and option content in the title includes:
reading the text content of the next section of the stem;
starting with any letter plus a specific punctuation mark in the next text content until a line feed is read, and identifying the next text content as a section of content; repeating the steps until the identified content is free of content, or the identified content starts with a first specific field, and determining that the option in the identified title is ended; if the identified section of content contains a plurality of text contents which start by any letter plus the specific punctuation mark and spaces are arranged between the text contents and the letters, the section of content is divided into a plurality of options by using space separators, and the first option is not defaulted;
and for each identified option, adding any letter in each option with the beginning of the specific punctuation mark, and determining the text content before the next option as the option content of the current option.
Optionally, identifying an answer in the question includes:
after the options and the option contents in the topics are identified, reading the next text content after the line feed symbol;
and adding a specific punctuation mark to the relevant field of the answer in the next text content after the line-changing symbol, and determining the content before the next text content after the line-changing symbol is read to be the next question before the next text content after the line-changing symbol is read to be the second specific field, and determining the content before the next question as the answer in the question.
Optionally, identifying answer parsing in the questions includes:
after the answers in the questions are identified, reading the text content of the next section after the line feed symbol;
and adding a specific punctuation mark to the relevant field of answer analysis in the next text content after the line-changing symbol, and determining that the next text content after the line-changing symbol is read is the content before the beginning of a third specific field or the next text content after the line-changing symbol is read as the next question, and determining that the content before the next question is the answer analysis in the questions.
Optionally, identifying the topic attribute to which the topic belongs includes:
and after the related field of the title attribute starts, determining that the next text content after the next line-feed is read is the content before the start of the fifth specific field or the next text content after the line-feed is read is the next title, and determining that the content before the next title is the title attribute to which the title belongs.
Optionally, identifying the topic tag contained in the topic includes:
and after the beginning of the label related field, the next text content after the next line-feeding is read is the content before the beginning of the sixth specific field, or the next text content after the line-feeding is read is the next question, and the content before the next question is determined as the question label contained in the question.
In a second aspect, an embodiment of the present invention provides a rapid entry device for a topic based on structured data, including:
the determining module is used for determining the questions to be recorded;
the identification module is used for sequentially identifying the related content of each question to be input according to a preset rule, wherein the related content of the questions at least comprises a question stem, a question type to which the question belongs and a knowledge point to which the question belongs;
and the storage module is used for storing the identified topics as file contents in a format readable by the topic library system.
In a third aspect, an embodiment of the present invention provides an electronic device, including: a processor, a memory and a program stored on the memory and executable on the processor, which when executed by the processor implements the steps of the structured data based rapid entry method as provided in the first aspect.
In a fourth aspect, an embodiment of the present invention provides a readable storage medium having stored thereon a program which, when executed by a processor, implements the steps of the structured data based method of rapid entry of topics as provided in the first aspect.
The technical scheme of the invention has the beneficial effects that:
the method and the device for rapidly inputting the topics based on the structured data provided by the embodiment of the invention comprise the steps of determining the topics to be input; sequentially identifying the related content of each question to be input according to a preset rule, wherein the related content of each question at least comprises a question stem, a question type to which the question belongs and a knowledge point to which the question belongs; the identified topics are stored as file content in a format readable by the topic library system. The method and the device for rapidly inputting the questions based on the structured data provided by the embodiment of the invention are based on the form of online learning and question brushing examination, can improve the question-recording efficiency, save the question-recording time of a user, lighten the burden of the user, establish brand-new question-recording logic by adopting AI intelligent recognition, perform structured extraction, analysis and understanding on texts by using a machine learning technology, automatically recognize the contents such as question types, knowledge points and the like of the questions to which the questions belong, and can check and adjust the question formats in real time so as to realize batch rapid input of various question types.
The technical scheme of the invention greatly reduces the workload of inputting the questions by the user, reduces the complexity of setting the contents of the questions, can more efficiently and accurately identify and input the questions more comprehensively, has strong format inclusion of identification, wide coverage and good user experience.
Drawings
FIG. 1 is a flow chart of a method for rapidly inputting topics based on structured data, disclosed in an embodiment of the invention;
FIG. 2 is a flow chart of another method for rapid entry of topics based on structured data according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a rapid entry device for topics based on structured data according to an embodiment of the present invention.
Detailed Description
In order to make the technical problems, technical solutions and advantages to be solved more apparent, the following detailed description will be given with reference to the accompanying drawings and specific embodiments. In the following description, specific details such as specific configurations and components are provided merely to facilitate a thorough understanding of embodiments of the invention. It will therefore be apparent to those skilled in the art that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the invention. In addition, descriptions of well-known functions and constructions are omitted for clarity and conciseness.
It should be appreciated that reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
In various embodiments of the present invention, it should be understood that the sequence numbers of the following processes do not mean the order of execution, and the order of execution of the processes should be determined by the functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present invention.
One of the achievement objects of the present invention at least comprises: the quick entry scheme of the on-line question bank based on the structured data accurate identification is provided for the on-line question bank creator, and the batch quick entry of the questions is ensured. In view of this, an embodiment of the present invention provides a method for quickly inputting a question based on structured data, as shown in fig. 1, where the method includes:
step 101, determining the title to be entered.
In practical application, usually, a teacher prepares questions to be input into a question bank system in advance, and specific questions are generally arranged in a word document mode.
Step 102, for each question to be input, sequentially identifying related contents of the questions according to a preset rule, wherein the related contents of the questions at least comprise a question stem, a question type to which the questions belong and a knowledge point to which the questions belong.
In practical application, the stem, the question type and the knowledge point are the contents which are necessary for a question.
Further preferably, the related content of the questions may further include at least one of options and option content in the questions, answers in the questions, answer analysis in the questions, question attributes to which the questions belong, and question labels contained in the questions.
In practical application, a topic in the topic library system is generally composed of the following 8 large structured data: stem, question type, options content, answer resolution, knowledge points, question attributes, and question labels. The stem, the question type and the knowledge point are the contents which are necessary for a question, and other data can be available or not.
Step 103, storing the identified topics as file contents in a format readable by a topic library system.
Based on the form of online learning and question brushing examination, in order to improve the question recording efficiency, save the question recording time of a user and lighten the burden of the user, the embodiment of the invention provides a question quick input method of an online question bank based on the accurate identification of structured data.
The technical scheme of the invention greatly reduces the workload of inputting the questions by the user, reduces the complexity of setting the contents of the questions, can more efficiently and accurately identify and input the questions more comprehensively, has strong format inclusion of identification, wide coverage and good user experience.
As a preferred implementation scheme of the embodiment of the invention, aiming at step 102 of the embodiment of the invention, the applicant specifically uses 8 large structured data, which specifically comprises a question stem, a question type to which the question belongs, options and option contents in the question, answers in the question, answer analysis in the question, knowledge points to which the question belongs, question attributes to which the question belongs and a question label contained in the question, as examples, and describes a specific implementation method for sequentially identifying the relevant contents of the questions according to preset rules for each question to be input in the step 102 of the embodiment of the invention. As shown in fig. 2, the method includes:
step 201, identifying a stem in a question.
First, the embodiment of the invention identifies the stem in the question.
Specifically, the content of the first row in the paragraph is read; if the first line is started by a number plus a specific punctuation mark and the Nth line is ended by a line feed symbol, the content before the line feed symbol after the specific punctuation mark is determined to be the stem content in the title, and N is a positive integer.
Where a specific punctuation mark is, for example, "" or ", in practical applications, the content of the first line in the reading paragraph starts with a number +", or "," until a line ends with a line-feed, then all the content between these default to the stem content.
In order to further improve the accuracy, the method of the invention can further comprise the following steps: after determining that the content preceding the line feed is the stem content in the title after determining a specific punctuation mark (e.g. ",") the next line of content is selected to be read continuously; if the next line of content which is continuously read is free of content, or starts with numbers or letters or with a first specific field, the content of the stem in the identification title is determined to be ended, otherwise, the reading is continuously performed.
The first specific field is, for example: answer, correct answer, reference answer, answer parsing, knowledge points, question attributes such as difficulty, labels, question labels, and the like.
Step 202, identify the question type to which the question belongs.
After the question stem in the question is identified, the question type to which the question belongs can be selected to be continuously identified.
Specifically, after the stem in the question is identified, the embodiment of the invention reads the text content of the last section of the stem, and determines the question type to which the question belongs according to the question type identification text in the text content of the last section.
The title type identification text comprises, for example: single item selection questions, multiple item selection questions, gap filling questions, judgment questions, gap filling questions, completion type gap filling questions, simple answer questions and compound questions.
In practical application, after the embodiment of the invention separates the question stem, a text content of the last section of the question stem is read, and whether the text content contains the following eight main questions or not is judged: the method comprises the steps of selecting a single item, selecting a plurality of items, filling blank, judging the item, selecting blank, finishing blank, answering a simple question and a compound question, identifying characters according to the question type, and identifying and determining the question type. Wherein:
aiming at the single item selection question type, the text content is as follows: single selection, single selection questions and the like are all identified as 'single selection questions'; aiming at a plurality of choice question types, the text contents are as follows: multiple choices, multiple choice questions, etc., all can be identified as "multiple choice questions"; aiming at the complete filling problem type, the text content is as follows: the shape, the gap, and the like are all identified as a 'gap-filling question'; for other question types, the word of the question of the text content can be successfully identified without writing.
Further optionally, the method further comprises:
and if the text content of the previous section does not contain the topic type identification text, determining the topic type of the previous topic as the topic type of the current topic.
In practical application, if a question is not marked with a question type, the question type of the previous question (i.e. the last question) is read, and the question type of the previous question is determined as the question type to which the current question belongs. For example, assuming that the second question does not have a explicit question type, the question type to which the first question belongs is read and the question type of the first question is determined as the question type of the second question.
Step 203, identify the options in the title and the content of the options.
After the question type of the question is identified, the option and the option content in the question can be selected to be continuously identified.
Specifically, after the question type of the question is identified, the embodiment of the invention continues to read the text content of the next section of the question stem; starting with any letter plus a specific punctuation mark in the next text content until a line feed is read, and identifying the next text content as a section of content; repeating the steps until the identified content is free of content, or the identified content starts with a first specific field, and determining that the option in the identified title is ended; if the identified section of content contains a plurality of text contents which start by any letter plus the specific punctuation mark and spaces are arranged between the text contents and the letters, the section of content is divided into a plurality of options by using space separators, and the first option is not defaulted;
and for each identified option, adding any letter in each option with the beginning of the specific punctuation mark, and determining the text content before the next option as the option content of the current option.
In practice, specific punctuation marks are, for example, "" or "". In the embodiment of the invention, in the text content of the next section of the read stem, the text content is identified as a section of content from the beginning of any letter plus ", or". Until the line-feed character is read. If the section of content contains a plurality of text contents beginning with any letter plus ", or", "and space exists between the text contents and the letters, dividing the section of content into a plurality of options by using a space separator, and if the section of content does not contain the text contents, defaulting to the first option, namely only identifying one option;
and then, continuously reading the content of the new section according to the steps until the new section has no content or starts with a first specific field, such as ' answer, correct answer, reference answer, answer analysis, knowledge point, question attribute such as difficulty, label, question label ', and the like ', and if the new section has no content, the default option identification is finished, otherwise, the reading is continued.
And for each identified option, reading the content of the option, and reading the content after the beginning of any letter plus ", or". Until the next option is read, wherein the content between the options is the content corresponding to the option.
In the embodiment of the invention, the question options are judged to support correct/incorrect, true/False, yes/no, T/F, true/False, v/x.
Step 204, identify answers in the questions.
After the options and the contents of the options in the questions are identified, the answer in the questions can be selected to be continuously identified.
Specifically, after the options and the option contents in the questions are identified, the embodiment of the invention continues to read the next text content after the line feed symbol; and adding a specific punctuation mark to the relevant field of the answer in the next text content after the line-changing symbol, and determining that the next text content after the line-changing symbol is read is the content before the beginning of the first specific field or the next text content after the line-changing symbol is read as the next question, and determining that the content before the next question is the answer in the question.
The answer related fields are fields such as answer, correct answer, reference answer, etc., and the specific punctuation marks are ": in the embodiment of the invention, in the next text content after the line-feed symbol, the fields + ", which are read, are started by the fields of answer, correct answer, reference answer and the like: the "post content" until the next text content after the line feed is read starts with a second specific field (such as the fields of question attribute, label, question name, etc. of answer parsing, knowledge point, difficulty, etc.), and the content between these is the answer in the question. Or, in the next text content after the line-feed, these fields + ", are read starting with these fields of" answer, correct answer, reference answer ", etc.: and (3) the content after the next text content after the line feed is read is the next question, and the content before the next question defaults to the answer in the question.
Specifically, in the practical application of the invention, aiming at the questions with the question types of ' blank filling questions and simple answering questions ', after the question stems are separated, the text content of the next section after line changing symbols is read, the fields of ' answer, correct answer and reference answer ' are used for beginning, and the fields of + ' are read: the beginning content is started by the fields of question attribute, label, question name and the like after the line-feed character is read until the next text content is read, namely the answer analysis, knowledge point, difficulty and the like, or the next text content after the line-feed character is identified as the next question, the default answer content is ended, otherwise, the reading is continued;
for objective questions, the questions must contain "answer, correct answer, reference answer" fields, and subjective questions may not be filled, if not, the answer is not recognized;
for the filling questions, the multi-space answer is used for the 'or'; "to identify the correct answer for the different blank, if the symbol of" || "is read, the symbol is identified to connect multiple synonym answers.
Step 205, identify answer parsing in the questions.
After the answers in the questions are identified, the answer parsing in the questions can be selected to continue to be identified.
Specifically, after the answers in the questions are recognized, the embodiment of the invention continues to read the text content of the next section after the line feed symbol; and adding a specific punctuation mark to the relevant field of answer analysis in the next text content after the line-changing symbol, and determining that the next text content after the line-changing symbol is read is the content before the beginning of a third specific field or the next text content after the line-changing symbol is read as the next question, and determining that the content before the next question is the answer analysis in the question.
In the embodiment of the present invention, the relevant fields of answer analysis are, for example, fields of "answer analysis, analysis", etc., and specific punctuation marks are ": the third specific field is, for example, a field such as "knowledge point, question attribute such as difficulty, label, question label, and question name". The embodiment of the invention uses the ' answer analysis ' field + ': the beginning content is started by the fields of 'knowledge points, question attributes such as difficulty, labels, question names' and the like until the next text content after the line-feed is read, or the next text content after the line-feed is identified as the next question, the default answer analysis is finished.
It should be noted that, in the embodiment of the present invention, the answer analysis is not necessary to fill in the item, and the answer analysis is not recognized if not filled in.
Step 206, identifying knowledge points to which the title belongs.
After the answer parsing in the questions is identified, knowledge points to which the questions belong can be selected to continue to be identified.
Specifically, after the answer analysis in the question is identified, the embodiment of the invention starts the relevant field of the knowledge point plus a specific punctuation mark until the next text content after the next line-feed is read is the content before the start of the fourth specific field, or the next text content after the line-feed is read is the next question, and the content before the next question is determined as the knowledge point to which the question belongs.
Wherein the knowledge point related fields are fields such as "knowledge point", and the specific punctuation marks are for example ": the fourth specific field is, for example, a field such as "question attribute such as difficulty", a label, a question label, and a question name ".
It should be noted that, in the embodiment of the present invention, a question may belong to a plurality of knowledge points, and a plurality of different knowledge points are numbered with a "or"; "to identify. Wherein if multiple knowledge points exist, the multiple knowledge points can be separated by preset punctuation marks (such as 'or' ").
In step 207, the topic attribute to which the topic belongs is identified.
After the knowledge points to which the topics belong are identified, the topic attributes to which the topics belong can be selected to be continuously identified.
Specifically, after the knowledge point to which the topic belongs is identified, the topic attribute is determined as the topic attribute to which the topic belongs after the knowledge point to which the topic belongs starts with the topic attribute related field and until the next text content after the next line-feed is read is the content before the beginning of the fifth specific field or the next text content after the line-feed is read is the next topic.
The related field of the question attribute is a field of "question attribute such as difficulty" which is already built in the question bank system in advance, and the fifth specific field is a field of "tag, question label, question name", etc. In the embodiment of the invention, the title attribute identification starts with the built field of the title attribute such as difficulty in the title library system until the next text content after the line feed is read, starts with the fields such as label, title label and title name, or identifies the next text content after the line feed as the next title, then the default title attribute content is ended, otherwise, the reading is continued.
In the embodiment of the invention, the question attribute can be customized in the system, and the identification can be performed when the question is recorded after the question attribute is set.
In step 208, the topic tag contained in the topic is identified.
After identifying the topic attribute to which the topic belongs, the topic label contained in the topic can be selected to be continuously identified.
Specifically, after identifying the attribute of the title to which the title belongs, the embodiment of the invention starts with the relevant field of the tag until the next text content after the next line-feed is read is the content before the beginning of the sixth specific field, or the next text content after the line-feed is read is the next title, and the content before the next title is determined as the title tag contained in the title.
The label-related field is, for example, a field such as "label, title label", and the sixth specific field is, for example, a field such as "title name". In the embodiment of the invention, the title tag contained in the title identifies the content beginning with the fields of the tag, the title tag and the like until the next text content after the line-feed is read begins with the field of the title name, or the next text content after the line-feed is identified as the next title, and the default title contained in the title tag is ended.
In the embodiment of the invention, the topic label can be customized in the system, and the topic can be identified when the topic is recorded after the label is set.
It should be noted that, the implementation sequence of steps 201 to 208 in the above embodiment is not limited in the embodiment of the present invention. In practical application, some steps can flexibly select the sequence of execution according to practical requirements.
In the following, the applicant makes simple schematic illustration about an application scenario of the rapid entry method of topics based on structured data provided by the embodiment of the present invention.
In practical application, the rapid batch entry of questions is the first step of online examination of students, and the application links and the application flow in the field of online education examination of the embodiment of the invention are specifically described below.
S1, preparation of a title: the teacher prepares the questions to be input into the system in advance, and generally arranges the specific questions in a word document mode;
s2, online input: the rapid and batch accurate entry of the questions based on the structured data is realized by using the rapid entry method of the questions based on the structured data;
s3, managing a question bank: after the questions are input into the corresponding knowledge points, a teacher builds a brand-new and systematic knowledge directory tree according to the requirements, and distributes different question library directory rights for different roles, so that management and flexible application of a question resource system are realized, and meanwhile, co-building and sharing of the question libraries can be realized;
s4, composing a test paper: according to the task library catalog authority distributed by the teacher, the teacher can rapidly and conveniently extract the task from the task library catalog with the editable authority, and supports the combination of the following two types of test papers to meet different examination requirements: common test paper-manually selecting required questions to form test paper, and the test paper structure and the questions are fixedly used by students in examination; random examination paper-setting the question setting range and rule of the examination paper, and randomly drawing questions according to the rule when students take examination, so as to realize thousands of people;
s5, examination application: the teacher sets examination parameters (such as examination names, examination time, examination times and the like), selects the established examination papers, adds reference students, and can take the corresponding examination after issuing the examination;
s6, reading the test paper: after taking an examination, the teacher reviews the answered examination paper, and after completing the review, the student can check the score and detailed examination paper condition.
The embodiment of the invention also provides a rapid entry device of the title based on the structured data, as shown in fig. 3, comprising:
a determining module 100, configured to determine a question to be entered;
the identifying module 200 is configured to identify, for each question to be entered, related content of the question in turn according to a preset rule, where the related content of the question includes at least a stem, a question type to which the question belongs, and a knowledge point to which the question belongs;
further preferably, the related content of the title may further include: at least one of options and option content in the questions, answers in the questions, answer analysis in the questions, question attributes to which the questions belong and question labels contained in the questions.
The storage module 300 is configured to store the identified topic as a format file content readable by the topic library system.
As an optional implementation manner of the embodiment of the present invention, when identifying the stem in the question, the identification module 200 is specifically configured to read the content of the first line in the paragraph; if the first line is started by a number plus a specific punctuation mark and the Nth line is ended by a line feed symbol, the content before the line feed symbol after the specific punctuation mark is determined to be the stem content in the title, and N is a positive integer.
And, further optionally, the identification module 200 is further configured to: continuing to read a row of content; if the line of content that continues to be read is free of content, or begins with a number or letter or with a first particular field, it is determined that the stem content in the identified topic has ended.
As an alternative implementation manner of the embodiment of the present invention, when the type of the question to which the question belongs is identified, the identification module 200 is specifically configured to read the text content of the last segment of the question stem after the question stem in the question is identified; and determining the topic type to which the topic belongs according to the topic type identification text in the previous text content.
And, further optionally, the identification module 200 is further configured to: and if the text content of the previous section does not contain the topic type identification text, determining the topic type of the previous topic as the topic type of the current topic.
As an optional implementation manner of the embodiment of the present invention, when identifying options and option contents in a question, the identification module 200 is specifically configured to read a text content of a next segment of a question stem; starting with any letter plus a specific punctuation mark in the next text content until a line feed is read, and identifying the next text content as a section of content; repeating the steps until the identified content is free of content, or the identified content starts with a first specific field, and determining that the option in the identified title is ended; if the identified section of content contains a plurality of text contents which start by any letter plus the specific punctuation mark and spaces are arranged between the text contents and the letters, the section of content is divided into a plurality of options by using space separators, and the first option is not defaulted; and for each identified option, adding any letter in each option with the beginning of the specific punctuation mark, and determining the text content before the next option as the option content of the current option.
As an optional implementation manner of the embodiment of the present invention, when identifying an answer in a question, the identification module 200 is specifically configured to, after identifying an option and an option content in the question, read a next text content after a line feed; and adding a specific punctuation mark to the relevant field of the answer in the next text content after the line-changing symbol, and determining the content before the next text content after the line-changing symbol is read to be the next question before the next text content after the line-changing symbol is read to be the first specific field, and determining the content before the next question as the answer in the question.
As an optional implementation manner of the embodiment of the present invention, when the answer in the question is identified for analysis, the identification module 200 is specifically configured to read the text content of the next segment after the line-feed symbol after the answer in the question is identified; and adding a specific punctuation mark to the relevant field of answer analysis in the next text content after the line-changing symbol, and determining that the next text content after the line-changing symbol is read is the content before the beginning of a third specific field or the next text content after the line-changing symbol is read as the next question, and determining that the content before the next question is the answer analysis in the questions.
As an optional implementation manner of this embodiment of the present invention, when identifying a knowledge point to which a question belongs, the identification module 200 is specifically configured to begin with a knowledge point related field plus a specific punctuation mark until the next text content after reading the next line-changing symbol is the content before beginning with a fourth specific field, or the next text content after reading the line-changing symbol is the next question, and determine the content before the next question as the knowledge point to which the question belongs;
wherein if a plurality of knowledge points exist, the plurality of knowledge points are separated by a preset punctuation mark.
As an optional implementation manner of this embodiment of the present invention, when identifying a question attribute to which a question belongs, the identifying module 200 is specifically configured to determine, after starting with a relevant field of the question attribute, a next text content after reading a next line-feed is a content before starting with a fifth specific field, or a next text content after reading a line-feed is a next question, and determine, as a question attribute to which the question belongs, a content before the next question.
As an alternative implementation manner of this embodiment of the present invention, when identifying a topic tag included in a topic, the identification module 200 is specifically configured to determine, after starting with a tag related field, a next text content after reading a next line-feed is a content before starting with a sixth specific field, or a next text content after reading a line-feed is a next topic, and determine, as a topic tag included in the topic, a content before the next topic.
It should be noted that, the rapid entry device for the title based on the structured data is a device corresponding to the rapid entry method for the title based on the structured data in the foregoing embodiment, and all implementation means in the foregoing method embodiments are applicable to the embodiment of the rapid entry device for the title based on the structured data, so that the same technical effects can be achieved.
In addition, an embodiment of the present invention further provides an electronic device, including: a processor, a memory and a program stored on the memory and executable on the processor, which when executed by the processor, performs the steps of the structured data based rapid entry method as described above.
And, an embodiment of the present invention further provides a readable storage medium having stored thereon a program which, when executed by a processor, implements the steps of the structured data based rapid entry method as provided in the first aspect.
While the foregoing is directed to the preferred embodiments of the present invention, it will be appreciated by those skilled in the art that various modifications and adaptations can be made without departing from the principles of the present invention, and such modifications and adaptations are intended to be comprehended within the scope of the present invention.

Claims (13)

1. The title rapid entry method based on the structured data is characterized by comprising the following steps of:
determining a question to be input;
sequentially identifying the related content of each question to be input according to a preset rule, wherein the related content of each question at least comprises a question stem, a question type to which the question belongs and a knowledge point to which the question belongs;
the identified topics are stored as file content in a format readable by the topic library system.
2. The method of claim 1, wherein the related content of the title further comprises: at least one of options and option content in the questions, answers in the questions, answer analysis in the questions, question attributes to which the questions belong and question labels contained in the questions.
3. The method of claim 1, wherein identifying the stem in the topic comprises:
reading the content of the first row in the paragraph;
if the first line is started by a number plus a specific punctuation mark and the Nth line is ended by a line feed symbol, the content before the line feed symbol after the specific punctuation mark is determined to be the stem content in the title, and N is a positive integer.
4. The method of claim 3, wherein after determining that the content preceding the line feed is stem content in a topic after determining the particular punctuation mark, the method further comprises:
continuing to read the next row of content;
if the next line of content that continues to be read is free of content, or begins with a number or letter or with a first particular field, it is determined that the stem content in the identified topic has ended.
5. The method of claim 1, wherein identifying the question type to which the question belongs comprises:
after the stem in the question is identified, reading the text content of the last section of the stem;
and determining the topic type to which the topic belongs according to the topic type identification text in the previous text content.
6. The method of claim 5, wherein the method further comprises:
and if the text content of the previous section does not contain the topic type identification text, determining the topic type of the previous topic as the topic type of the current topic.
7. The method of claim 1, wherein identifying knowledge points to which the topic belongs comprises:
after the related field of the knowledge point is added with a specific punctuation mark, the next text content after reading the next line feed is the content before the beginning of the fourth specific field, or the next text content after reading the line feed is the next question, and the content before the next question is determined to be the knowledge point to which the question belongs;
wherein if a plurality of knowledge points exist, the plurality of knowledge points are separated by a preset punctuation mark.
8. The method of claim 2, wherein identifying options and option content in the title comprises:
reading the text content of the next section of the stem;
starting with any letter plus a specific punctuation mark in the next text content until a line feed is read, and identifying the next text content as a section of content; repeating the steps until the identified content is free of content, or the identified content starts with a first specific field, and determining that the option in the identified title is ended; if the identified section of content contains a plurality of text contents which start by any letter plus the specific punctuation mark and spaces are arranged between the text contents and the letters, the section of content is divided into a plurality of options by using space separators, and the first option is not defaulted;
and for each identified option, adding any letter in each option with the beginning of the specific punctuation mark, and determining the text content before the next option as the option content of the current option.
9. The method of claim 2, wherein identifying answers in the questions comprises:
after the options and the option contents in the topics are identified, reading the next text content after the line feed symbol;
and adding a specific punctuation mark to the relevant field of the answer in the next text content after the line-changing symbol, and determining the content before the next text content after the line-changing symbol is read to be the next question before the next text content after the line-changing symbol is read to be the second specific field, and determining the content before the next question as the answer in the question.
10. The method of claim 2, wherein identifying answer resolution in a question comprises:
after the answers in the questions are identified, reading the text content of the next section after the line feed symbol;
and adding a specific punctuation mark to the relevant field of answer analysis in the next text content after the line-changing symbol, and determining that the next text content after the line-changing symbol is read is the content before the beginning of a third specific field or the next text content after the line-changing symbol is read as the next question, and determining that the content before the next question is the answer analysis in the questions.
11. The method of claim 2, wherein identifying the topic attribute to which the topic belongs comprises:
and after the related field of the title attribute starts, determining that the next text content after the next line-feed is read is the content before the start of the fifth specific field or the next text content after the line-feed is read is the next title, and determining that the content before the next title is the title attribute to which the title belongs.
12. The method of claim 2, wherein identifying the topic tag contained in the topic comprises:
and after the beginning of the label related field, the next text content after the next line-feeding is read is the content before the beginning of the sixth specific field, or the next text content after the line-feeding is read is the next question, and the content before the next question is determined as the question label contained in the question.
13. The utility model provides a quick entry device of title based on structured data which characterized in that includes:
the determining module is used for determining the questions to be recorded;
the identification module is used for sequentially identifying the related content of each question to be input according to a preset rule, wherein the related content of the questions at least comprises a question stem, a question type to which the question belongs and a knowledge point to which the question belongs;
and the storage module is used for storing the identified topics as file contents in a format readable by the topic library system.
CN202310148865.1A 2023-02-14 2023-02-14 Method and device for rapidly inputting topics based on structured data Pending CN116189216A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310148865.1A CN116189216A (en) 2023-02-14 2023-02-14 Method and device for rapidly inputting topics based on structured data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310148865.1A CN116189216A (en) 2023-02-14 2023-02-14 Method and device for rapidly inputting topics based on structured data

Publications (1)

Publication Number Publication Date
CN116189216A true CN116189216A (en) 2023-05-30

Family

ID=86432377

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310148865.1A Pending CN116189216A (en) 2023-02-14 2023-02-14 Method and device for rapidly inputting topics based on structured data

Country Status (1)

Country Link
CN (1) CN116189216A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116662305A (en) * 2023-06-06 2023-08-29 森纵艾数(北京)科技有限公司 Question bank management method, system, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106354740A (en) * 2016-05-04 2017-01-25 上海秦镜网络科技有限公司 Electronic examination paper inputting method
CN109947836A (en) * 2019-03-21 2019-06-28 江西风向标教育科技有限公司 English paper structural method and device
CN112364830A (en) * 2020-11-30 2021-02-12 长沙冉星信息科技有限公司 Method for inputting user examination questionnaire based on word document
CN112381099A (en) * 2020-11-24 2021-02-19 中教云智数字科技有限公司 Question recording system based on digital education resources

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106354740A (en) * 2016-05-04 2017-01-25 上海秦镜网络科技有限公司 Electronic examination paper inputting method
CN109947836A (en) * 2019-03-21 2019-06-28 江西风向标教育科技有限公司 English paper structural method and device
CN112381099A (en) * 2020-11-24 2021-02-19 中教云智数字科技有限公司 Question recording system based on digital education resources
CN112364830A (en) * 2020-11-30 2021-02-12 长沙冉星信息科技有限公司 Method for inputting user examination questionnaire based on word document

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116662305A (en) * 2023-06-06 2023-08-29 森纵艾数(北京)科技有限公司 Question bank management method, system, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
US6685482B2 (en) Method and system for creating and evaluating quizzes
JP2006092473A (en) Answering support system and apparatus, and answering support program
CN108647211A (en) A kind of method for pushing of children for learning content
CN111209728B (en) Automatic labeling and inputting method for test questions
CN102074134A (en) New word test system for judging proficiency level of new word according to reply time and method thereof
Khabib Introducing artificial intelligence (AI)-based digital writing assistants for teachers in writing scientific articles
CN107766301A (en) Text handling method, system, medium and computing device
CN113204581A (en) Topic recommendation method, device and equipment based on big data and storage medium
CN116189216A (en) Method and device for rapidly inputting topics based on structured data
CN110750632B (en) Improved Chinese ALICE intelligent question-answering method and system
CN111507075B (en) Method and device for converting data format
CN112989783B (en) Intelligent winding device and method
CN110569401A (en) paper marking method and device, computer equipment and storage medium
CN115358232A (en) Method and system for analyzing answers of test questions
CN112364830A (en) Method for inputting user examination questionnaire based on word document
CN102929612A (en) Method for importing txt test library in interactive teaching platform
CN112164262A (en) Intelligent paper reading tutoring system
CN113569112A (en) Tutoring strategy providing method, system, device and medium based on question
CN115618822A (en) Test question input method, system and computer storage medium
CN115640791A (en) Test question labeling method, system and computer storage medium
CN117520527B (en) Method, system, electronic device and readable storage medium for generating answering dialogue data
CN111627275B (en) Processing method and device for language representation information test, storage medium and terminal
CN112749268B (en) FAQ system ordering method, device and system based on mixing strategy
Nishihara et al. A web-based application for writing novels
CN111858844B (en) System and method for determining matching accuracy of topic text paragraph relative to reference text paragraph

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination