CN107291694A - A kind of automatic method and apparatus, storage medium and terminal for reading and appraising composition - Google Patents

A kind of automatic method and apparatus, storage medium and terminal for reading and appraising composition Download PDF

Info

Publication number
CN107291694A
CN107291694A CN201710498079.9A CN201710498079A CN107291694A CN 107291694 A CN107291694 A CN 107291694A CN 201710498079 A CN201710498079 A CN 201710498079A CN 107291694 A CN107291694 A CN 107291694A
Authority
CN
China
Prior art keywords
composition
word
module
language analysis
topic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710498079.9A
Other languages
Chinese (zh)
Other versions
CN107291694B (en
Inventor
张帆
马楠
陈冬晓
刘志山
阎鹏
邓澍军
郭常圳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Ape Power Technology Co.,Ltd.
Original Assignee
Beijing Chalk Future Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Chalk Future Technology Co Ltd filed Critical Beijing Chalk Future Technology Co Ltd
Priority to CN201710498079.9A priority Critical patent/CN107291694B/en
Publication of CN107291694A publication Critical patent/CN107291694A/en
Application granted granted Critical
Publication of CN107291694B publication Critical patent/CN107291694B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Business, Economics & Management (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of automatic method for reading and appraising composition, including:A1, loaded targets composition and its corresponding compostion topic;A2, the text subject for extracting target composition and the compostion topic topic theme;A3, the text subject is compared with the topic theme, and according to the result of comparison weighting generation fraction.Automatic method and apparatus, storage medium and the terminal for reading and appraising composition that the present invention is provided, can be write a composition with loaded targets and carry out language analysis automatically, extract theme, thus realize automatically, objectively read and appraise composition, save manpower, improve efficiency.The invention also discloses a kind of automatic device for reading and appraising composition, and storage medium and terminal.

Description

A kind of automatic method and apparatus, storage medium and terminal for reading and appraising composition
Technical field
The present invention relates to technical field of data processing, more particularly to a kind of automatic method and apparatus for reading and appraising composition, storage Medium and terminal.
Background technology
Language teaching is a critically important discipline of education in China's educational system, and the Chinese can be learnt by language teaching Word, comprehension China history culture of extensive knowledge and profound scholarship.The training particularly write a composition, is the best inspection to student written ability.
During existing composition is read and appraised, read and appraised mostly by the subjectivity of teacher, human factor is more.Sometimes when teacher will When reading and appraising substantial amounts of theme, inevitably slip up, and then can not objectively reflect the writing ability of student.
Accordingly, it would be desirable to which a kind of can automatically, objectively read and appraise the method and system of composition, to save manpower, effect is improved Rate, so as to solve technical problem present in prior art.
The content of the invention
In view of this, the invention provides a kind of automatic method and apparatus, storage medium and terminal for reading and appraising composition, with Solve technical problem present in prior art.
The invention discloses a kind of automatic method for reading and appraising composition, including:
A1, loaded targets composition and its corresponding compostion topic;
A2, the text subject for extracting target composition and the compostion topic topic theme;
A3, the text subject is compared with the topic theme, and according to the weighting generation point of the result of comparison Number.
In the schematical embodiment of the present invention, the text subject for extracting target composition includes:
A21, by target composition be cut into multiple words;
A22, progress language analysis of being write a composition to the target that cutting is multiple words;
A23, the text subject for extracting according to the result of language analysis target composition;
The topic theme for extracting compostion topic includes:
A24, the compostion topic that target is write a composition is cut into multiple words;
A25, to cutting for multiple words the compostion topic carry out language analysis;
A26, the topic theme for extracting according to the result of language analysis the compostion topic.
In the schematical embodiment of the present invention, the language analysis is carried out to target composition or compostion topic Including:
B1, the part of speech to each word in each sentence are labeled, wherein, the part of speech include noun, verb, The combination of one or more of adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia;
B2, the name type of the noun word recognized in each sentence and mark, wherein, the name type includes people The combination of one or more of name, place name, mechanism name;
B3, the subject to the predicate in each sentence are labeled;
Relation between b4, the word in each sentence, is labeled to the grammer of the sentence;
B5, identification participle word and part of speech between semantic association relation and mark.
In the schematical embodiment of the present invention, step a23 includes:
A231, write a composition according to target in annotation results, recognize word similarity, the word is clustered, so The text subject that the target is write a composition once is extracted according to the result of the cluster afterwards;
The text rule that a232, basis prestore, second extraction is carried out to the text subject that target is write a composition;
The text subject that a233, basis are extracted twice, weighting obtains the final text subject of the target composition.
In the schematical embodiment of the present invention, after step a2, in addition to:
A4, the rhetorical devices of the identification target composition;
A5, generation fraction of being write a composition according to the rhetorical devices to the target, and perform step a6;
After step a3, in addition to:
A6, the step a3 fractions generated and step the a5 fraction generated is weighted, obtains the target composition most Whole fraction.
The embodiment of the invention also discloses a kind of automatic device for reading and appraising composition, including:
Load-on module, loaded targets composition and its corresponding compostion topic;
Subject distillation module, extracts the text subject of target composition and the topic theme of the compostion topic;
Theme comparing module, the text subject is compared with the topic theme, and is added according to the result of comparison Power generation fraction.
In the schematical embodiment of the present invention, the subject distillation module includes:
Target composition is cut into multiple words by composition word segmentation module, the composition word segmentation module;
First language analysis module, the first language analysis module is to the target of the cutting for multiple words Composition carries out language analysis;
Text subject extraction module, the text subject extraction module extracts the target according to the result of language analysis and made The text subject of text;
The subject distillation module also includes:
The compostion topic that target is write a composition is cut into multiple by topic word segmentation module, the topic word segmentation module Word;
Second language analysis module, the second language analysis module is to the compostion topic of the cutting for multiple words Carry out language analysis;
Topic subject distillation module, the topic subject distillation module extracts the theme according to the result of language analysis Purpose topic theme.
The present invention a schematical embodiment in, the first language analysis module target is write a composition into Row language analysis, including:
The first language analysis module is labeled to the part of speech of each word in each sentence, wherein, institute's predicate Property include noun, verb, adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia in one Individual or multiple combination;
The first language analysis module recognizes the name type and mark of the noun word in each sentence, wherein, institute Stating name type includes the combination of one or more of name, place name, mechanism name;
The first language analysis module is labeled to the subject of the predicate in each sentence;
Relation between word of the first language analysis module in each sentence, is carried out to the grammer of the sentence Mark;
The first language analysis module recognizes the semantic association relation between participle word and part of speech and marked;
The second language analysis module carries out language analysis to the compostion topic, including:
The second language analysis module is labeled to the part of speech of each word in each sentence, wherein, institute's predicate Property include noun, verb, adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia in one Individual or multiple combination;
The second language analysis module recognizes the name type and mark of the noun word in each sentence, wherein, institute Stating name type includes the combination of one or more of name, place name, mechanism name;
The second language analysis module is labeled to the subject of the predicate in each sentence;
Relation between word of the second language analysis module in each sentence, is carried out to the grammer of the sentence Mark;
The second language analysis module recognizes the semantic association relation between participle word and part of speech and marked.
In the schematical embodiment of the present invention, the text subject extraction module includes:
First subject distillation module, the first subject distillation module write a composition according to target in annotation results, recognize word The similarity of language, is clustered to the word, the text subject then write a composition according to the result of the cluster to the target Once extracted;
Second theme extraction module, the second theme extraction module is write a composition according to the text rule prestored to target Text subject carries out second extraction;
Weighting block, according to the text subject extracted twice, weighting obtains the final text master of the target composition Topic.
In the schematical embodiment of the present invention, after the subject distillation module, in addition to:
Rhetorical devices identification module, the rhetorical devices identification module recognizes the rhetorical devices of the target composition;
Rhetorical devices scoring modules, the rhetorical devices scoring modules are write a composition according to the rhetorical devices to the target Generate fraction;
Final score generation module, the final score generation module respectively with the rhetorical devices scoring modules and The theme comparing module connection, the fraction and the rhetorical devices scoring modules that the theme comparing module is generated is given birth to Into fraction be weighted, obtain the target and write a composition final fraction.
The invention also discloses a kind of storage medium, be stored with computer instruction, real when the computer instruction is performed The method for now reading and appraising composition automatically as described above.
The invention also discloses a kind of terminal, including processor and memory, be stored with calculating in the memory Machine is instructed;In the application program launching, the processor reads the computer instruction and realized as described above automatic The method for reading and appraising composition.
Automatic method and apparatus, storage medium and the terminal for reading and appraising composition that the present invention is provided, can be write a composition with loaded targets And carry out language analysis automatically, extract theme so that realize it is automatic, objectively read and appraise composition, save manpower, improve effect Rate.
Brief description of the drawings
Fig. 1 is the automatic method flow schematic diagram for reading and appraising composition of one embodiment of the invention;
Fig. 2 is the automatic method flow for reading and appraising the text subject that target composition is extracted in composition of one embodiment of the invention Schematic diagram;
Fig. 3 is the automatic method flow for reading and appraising progress language analysis of being write a composition in composition to target of one embodiment of the invention Schematic diagram;
Fig. 4 be one embodiment of the invention the automatic method for reading and appraising composition in step a23 detail flowchart;
Fig. 5 is the stream for the topic theme that compostion topic is extracted in the automatic method for reading and appraising composition of one embodiment of the invention Journey schematic diagram;
Fig. 6 is another detail flowchart of the automatic method for reading and appraising composition of one embodiment of the invention;
Fig. 7 is the schematic diagram one of the automatic device for reading and appraising composition of the embodiment of the present invention;
Fig. 8 is the schematic diagram two of the automatic device for reading and appraising composition of the embodiment of the present invention;
Fig. 9 is the schematic diagram three of the automatic device for reading and appraising composition of the embodiment of the present invention;
Figure 10 is the schematic diagram four of the automatic device for reading and appraising composition of the embodiment of the present invention;
Figure 11 is the schematic diagram five of the automatic device for reading and appraising composition of the embodiment of the present invention.
Embodiment
The embodiment to the present invention is described below in conjunction with the accompanying drawings.
In order to solve the artificial technological deficiency for reading and appraising composition present in prior art, the embodiment of the invention discloses one The automatic method for reading and appraising composition is planted, can automatically, objectively read and appraise composition, so as to save manpower, efficiency is improved.
To achieve these goals, referring to Fig. 1, the artificial method for reading and appraising composition disclosed in the embodiment of the present invention include with Lower step a1~a3:
A1, loaded targets composition and its corresponding compostion topic.
In loading, compostion topic is loaded respectively and target composition is analyzed.It should be noted that actually making With in scene, such as in once taking an examination, compostion topic only one of which, corresponding target composition has a lot.
A2, the text subject for extracting target composition and the compostion topic topic theme.
More at large, referring to Fig. 2, extracting the text subject of target composition includes step a21~a23:
A21, by target composition be cut into multiple words;
, wherein it is desired to explain, in Chinese, word is the semantic most basic unit of carrying.Cutting word is also this The basis that inventive method can be realized.
For example:" I thinks that the value of life can be realized by posteriori effort by being in the bright achievement in poor and humble family.", this The word segmentation result of technology is:" I/think/,/be in poor and humble family// bright/effort/into/just/can be with/by/day after tomorrow/carry out/reality Existing/life// value/.If " word segmentation result be " ... be in poor and humble family// bright/achievement/can with/pass through/... " because " into Just " it is also a common word, it is likely that such word segmentation result occur.
How to avoid ambiguity during cutting word is a problem.The present invention is with the middle school's language composition largely accumulated Language material and corresponding composition theme material, using based on dictionary with word frequency statisticses are combined by the way of, the plan of involvement knowledge understanding The cutting of word slightly is carried out to the article of input, to ensure the accuracy of word segmentation.
A22, progress language analysis of being write a composition to the target that cutting is multiple words.
More at large, in one embodiment of the invention, referring to Fig. 3, carrying out language analysis to target composition is Analysis to word, and the analysis to relation between word, specifically include step b1~b5:
B1, the part of speech to each word in each sentence are labeled, wherein, the part of speech include noun, verb, The combination of one or more of adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia;
B2, the name type of the noun word recognized in each sentence and mark, wherein, the name type includes people The combination of one or more of name, place name, mechanism name.
For example:" beam so-and-so【Name】Birth people of brother and sister five in common people family, family, can only but lean on that poor work of father Provide to strive to keep body and soul together.Determine to go to coke-oven plant since his primary school, in family【Mechanism name】Material is got to go home to process.He daily will The material for carrying tens kilogram weights by that small and weak body sets out, chicken sleeps rear return home, so toward returning home and factory before crowing Start very early in the morning and go on until late at night, year after year.But his study was never put down, famous Britain is admitted to excellent achievement【Place name】 Bristol Polytechnics【Mechanism name】.”
B3, the subject to the predicate in each sentence are labeled.
Predicate, for describing or judging the lexical item of relation between objectifiability, feature or object, such as the time, place, Personage etc., help is provided for follow-up feature extraction.Such as "Yes" in " cat is animal " one is exactly a predicate, and " cat " is object;" 3 " being more than " being more than in 2 " were a predicates.
Relation between b4, the word in each sentence, is labeled to the grammer of the sentence.
For example:Subject-predicate in sentence, dynamic guest, the grammatical item such as side by side are recognized, and analyzes the relation between each composition.
B5, identification participle word and part of speech between semantic association relation and mark.
By the incidence relation, the semantic information of deep layer can be directly obtained across the constraint of sentence top layer syntactic structure.
A23, the text subject for extracting according to the result of language analysis target composition.
More specifically, referring to Fig. 4, step a23 includes:
A231, write a composition according to target in annotation results, recognize word similarity, the word is clustered, so The text subject that the target is write a composition once is extracted according to the result of the cluster afterwards;
The text rule that a232, basis prestore, second extraction is carried out to the text subject that target is write a composition;
The text subject that a233, basis are extracted twice, weighting obtains the final text subject of the target composition.
It can be seen that, the once extraction for the text subject write a composition to target is different with the dimension of second extraction:
Once extract be to target write a composition internal extraction, by composition self-information, by measure composition in word it Between similarity, build text subject using the method for cluster, and the significance level according to different text subjects in article, Extract keyword therein.So just the text subject of article can be found to a certain extent, and extract and text subject Related keyword, improves coverage of the keyword to text subject.
Second extraction is the model using build up outside of writing a composition, and finds the rule in a large amount of texts (when referring to training automatically The related language material of the substantial amounts of middle school student's composition of standby), find text subject so as to automatic, and text subject is related Word is found out.
More at large, referring to Fig. 5, extracting the topic theme of compostion topic includes step a24~a26:
A24, the compostion topic that target is write a composition is cut into multiple words;
For the word segmentation of compostion topic, the word segmentation write a composition with target is substantially identical, just repeats no more herein.
A25, to cutting for multiple words the compostion topic carry out language analysis.
It is substantially identical with the foregoing language analysis write a composition to target for the language analysis of compostion topic, herein just no longer Repeat.
A26, the topic theme for extracting according to the result of language analysis the compostion topic.
For the extraction of the topic theme of compostion topic, the substantive phase of extraction with the foregoing text subject write a composition to target Together, just repeat no more herein.
Illustrate to illustrate below.
For example, a compostion topic has three students, item rainbow is diligent conscientiously, try to learn, interim, final examination achievement Annual level is optimal.Ni guest shows typically in the study, is the activist of students' union, the walking intermediate frequency organized in students' union Frequency is shown up, and shows talent, is elected as student leader in students' union's change-session election contest finally.Bright not good into family financial situation, he is in effort Study, while complete every learning tasks, the time after school is used in do small business on, over a year, not only will to family Red cent, also posts 3000 yuans subsidy family expenses to family.
So, the present embodiment can be extracted to the topic theme of the topic:Rainbow student makes great efforts, it is diligent, into good performance It is elegant;Ni guest student is positive, talented;It is bright into student it is with family in straitened circumstances, self-reliance.
When corresponding target is write a composition, such as, for student Xiang Hong, the text subject once extracted is:Diligently (70 points), (60 points) in good standing, student make great efforts (50 points);The text subject of second extraction is:(30 points), XXX (20 points) are made great efforts in study, Diligently (5 points), after the result weighting extracted twice, for Xiang Hong, the standards of grading of text subject are:Student's effort (80 points), Diligently (75 points), (60 points) in good standing.
Corresponding to the text subject of other students, extracting method is similar, and the present embodiment just will not enumerate.
A3, the text subject is compared with the topic theme, and according to the weighting generation point of the result of comparison Number.
Above-mentioned flow terminates.
Alternatively, in scoring, it is contemplated that the rhetorical devices of composition can increase the literary grace of article, can also add pair The discrimination of rhetorical devices, is used as the bonus point for improving literary grace of writing a composition.
So, referring to Fig. 6, after step a2, in addition to:
A4, the rhetorical devices of the identification target composition.
In the present embodiment, rhetorical devices include conventional parallelism, metaphor etc..
For example, the identification process of parallelism rhetorical devices is as follows:
A41, extraction candidate parallelism sentence.Target composition is after the pretreatment such as participle, subordinate sentence, part-of-speech tagging, from article In release, including the parallelism such as comma, branch, fullstop, paragraph;
A42, trimness are examined.Consider from the length of parallelism and the neat degree of three parallelism short sentences.
A43, parallelism mark are examined.Parallelism mark is that have identical company in a distinguishing feature of parallelism sentence, i.e., three parallelisms sentence Continuous character string.
A44, metaphor mark.Often along with Figures of Speech gimmick in parallelism sentence, Figures of Speech can more show student's Composition elegance.
A5, generation fraction of being write a composition according to the rhetorical devices to the target, and perform step a6;
After step a3, in addition to:
A6, the step a3 fractions generated and step the a5 fraction generated is weighted, obtains the target composition most Whole fraction.
The automatic method for reading and appraising composition that the present invention is provided, can be write a composition with loaded targets and carry out language analysis automatically, carry Take theme, thus realize it is automatic, objectively read and appraise composition, save manpower, improve efficiency.
The exemplary scheme of following automatic devices for reading and appraising composition for the present embodiment.It should be noted that this is commented automatically The technical scheme for readding the technical scheme and the above-mentioned automatic method for reading and appraising composition of the device of composition belongs to same design, comments automatically The detail content that the technical scheme of the device of composition is not described in detail is read, the above-mentioned automatic method for reading and appraising composition is may refer to Technical scheme description.
The embodiment of the present invention additionally provides a kind of automatic device for reading and appraising composition, referring to Fig. 7, including:
Load-on module 11, loaded targets composition and its corresponding compostion topic;
Subject distillation module 12, extracts the text subject of target composition and the topic theme of the compostion topic;
Theme comparing module 13, the text subject is compared with the topic theme, and according to the result of comparison Weighting generation fraction.
Alternatively, in one embodiment of the invention, referring to Fig. 8, subject distillation module 12 includes:
Target composition is cut into multiple words by composition word segmentation module 121, the composition word segmentation module 121;
First language analysis module 122, the first language analysis module 122 is to institute of the cutting for multiple words State target composition and carry out language analysis;
Text subject extraction module 123, the text subject extraction module 123 is extracted according to the result of language analysis should The text subject of target composition;
Alternatively, in one embodiment of the invention, referring to Fig. 9, subject distillation module 12 also includes:
Topic word segmentation module 124, the compostion topic cutting that the topic word segmentation module 124 writes a composition target Into multiple words;
Second language analysis module 125, the second language analysis module 125 is to the work of the cutting for multiple words Literary topic carries out language analysis;
Topic subject distillation module 126, the topic subject distillation module 126 is extracted according to the result of language analysis should The topic theme of compostion topic.
Alternatively, first language analysis module 122 carries out language analysis to target composition, including:
First language analysis module 122 is labeled to the part of speech of each word in each sentence, wherein, institute's predicate Property include noun, verb, adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia in one Individual or multiple combination;
First language analysis module 122 recognizes the name type and mark of the noun word in each sentence, wherein, name Type is claimed to include the combination of one or more of name, place name, mechanism name;
First language analysis module 122 is labeled to the subject of the predicate in each sentence;
Relation between word of the first language analysis module 122 in each sentence, rower is entered to the grammer of the sentence Note;
First language analysis module 122 recognizes the semantic association relation between participle word and part of speech and marked.
125 pairs of the second language analysis module compostion topic carries out language analysis, including:
Second language analysis module 125 is labeled to the part of speech of each word in each sentence, wherein, institute's predicate Property include noun, verb, adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia in one Individual or multiple combination;
Second language analysis module 125 recognizes the name type and mark of the noun word in each sentence, wherein, institute Stating name type includes the combination of one or more of name, place name, mechanism name;
Second language analysis module 125 is labeled to the subject of the predicate in each sentence;
Relation between word of the second language analysis module 125 in each sentence, rower is entered to the grammer of the sentence Note;
Second language analysis module 125 recognizes the semantic association relation between participle word and part of speech and marked.
Alternatively, referring to Figure 10, text subject extraction module 123 includes:
First subject distillation module 1231, the first subject distillation module 1231 write a composition according to target in mark knot Really, the similarity of word is recognized, the word is clustered, then the target write a composition according to the result of the cluster Text subject once extracted;
Second theme extraction module 1232, the second theme extraction module 1232 is according to the text rule prestored, to mesh The text subject for being denoted as text carries out second extraction;
Weighting block 1233, according to the text subject extracted twice, weighting obtains the final text of the target composition Theme.
In addition, referring to Figure 11, after the subject distillation module 12, in addition to:
Rhetorical devices identification module 14, the rhetorical devices identification module 14 recognizes the rhetorical devices of the target composition;
Rhetorical devices scoring modules 15, the rhetorical devices scoring modules 15 are according to the rhetorical devices to the target Composition generation fraction;
Final score generation module 16, the final score generation module 16 respectively with the rhetorical devices scoring modules 15 and the theme comparing module 13 connect, the fraction and the rhetorical devices that the theme comparing module 13 is generated The fraction that scoring modules 15 are generated is weighted, and obtains the final fraction of the target composition.
The embodiment of the invention also discloses a kind of terminal, including processor and memory, stored in the memory There is computer instruction;In the application program launching, the processor reads the computer instruction and realized as described above The automatic method for reading and appraising composition.
It should be noted that the terminal can be desktop PC, notebook, palm PC and cloud server Deng computing device.It will be appreciated by persons skilled in the art that terminal is for receiving data and export structure after being handled Equipment.The example above is not the restriction to terminal, is that, in some occasions, terminal can also include input-output equipment, net Network access device, bus etc..
The processor can be CPU (Central Processing Unit, CPU), can also be it His general processor, digital signal processor (Digital Signal Processor, DSP), application specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) or other PLDs, discrete gate or transistor logic device Part, discrete hardware components etc..General processor can be microprocessor or the processor can also be any conventional processing Device etc., the processor is the control centre of the terminal, utilizes various interfaces and each portion of the whole terminal of connection Point.
The memory mainly includes storing program area and storage data field, wherein, storing program area can store operation system Application program (such as sound-playing function, image player function etc.) needed for system, at least one function etc.;Storage data field It can store and created data (such as voice data, phone directory etc.) etc. are used according to mobile phone.In addition, memory can be wrapped High-speed random access memory is included, nonvolatile memory, such as hard disk, internal memory, plug-in type hard disk, intelligence can also be included Storage card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card), at least one disk memory, flush memory device or other volatile solid-state parts.
The embodiment of the invention also discloses a kind of storage medium, be stored with computer instruction, and the computer instruction is held The method as described above for reading and appraising composition automatically is realized during row.
The computer instruction include computer program code, the computer program code can for source code form, Object identification code form, executable file or some intermediate forms etc..The computer-readable medium can include:It can carry Any entity or device, recording medium, USB flash disk, mobile hard disk, magnetic disc, CD, the computer of the computer program code are deposited Reservoir, read-only storage (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium etc..It should be noted that computer-readable Jie The content that matter is included can carry out appropriate increase and decrease according to legislation in jurisdiction and the requirement of patent practice, such as at certain A little jurisdictions, according to legislation and patent practice, computer-readable medium does not include electric carrier signal and telecommunication signal.
The preferred embodiments of the disclosure and embodiment are explained in detail above in conjunction with accompanying drawing, but this hair It is bright to be not limited to the above-described embodiment and examples, can also be not in the knowledge that those skilled in the art possess Made a variety of changes on the premise of departing from present inventive concept.

Claims (12)

1. a kind of automatic method for reading and appraising composition, it is characterised in that including:
A1, loaded targets composition and its corresponding compostion topic;
A2, the text subject for extracting target composition and the compostion topic topic theme;
A3, the text subject is compared with the topic theme, and according to the result of comparison weighting generation fraction.
2. the automatic method for reading and appraising composition according to claim 1, it is characterised in that the text of the extraction target composition Theme includes:
A21, by target composition be cut into multiple words;
A22, progress language analysis of being write a composition to the target that cutting is multiple words;
A23, the text subject for extracting according to the result of language analysis target composition;
The topic theme for extracting compostion topic includes:
A24, the compostion topic that target is write a composition is cut into multiple words;
A25, to cutting for multiple words the compostion topic carry out language analysis;
A26, the topic theme for extracting according to the result of language analysis the compostion topic.
3. the automatic method for reading and appraising composition according to claim 2, it is characterised in that enter to target composition or compostion topic The row language analysis includes:
B1, the part of speech to each word in each sentence are labeled, wherein, the part of speech includes noun, verb, described The combination of one or more of word, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia;
B2, the name type of noun word in each sentence of identification and mark, wherein, the name type include name, The combination of one or more of name, mechanism name;
B3, the subject to the predicate in each sentence are labeled;
Relation between b4, the word in each sentence, is labeled to the grammer of the sentence;
B5, identification participle word and part of speech between semantic association relation and mark.
4. the automatic method for reading and appraising composition according to claim 3, it is characterised in that step a23 includes:
A231, write a composition according to target in annotation results, recognize word similarity, the word is clustered, Ran Hougen The text subject that the target is write a composition once is extracted according to the result of the cluster;
The text rule that a232, basis prestore, second extraction is carried out to the text subject that target is write a composition;
The text subject that a233, basis are extracted twice, weighting obtains the final text subject of the target composition.
5. the automatic method for reading and appraising composition according to claim 1, it is characterised in that after step a2, in addition to:
A4, the rhetorical devices of the identification target composition;
A5, generation fraction of being write a composition according to the rhetorical devices to the target, and perform step a6;
After step a3, in addition to:
A6, the step a3 fractions generated and step the a5 fraction generated is weighted, obtains the target composition final Fraction.
6. a kind of automatic device for reading and appraising composition, it is characterised in that including:
Load-on module, loaded targets composition and its corresponding compostion topic;
Subject distillation module, extracts the text subject of target composition and the topic theme of the compostion topic;
Theme comparing module, the text subject is compared with the topic theme, and weights life according to the result of comparison Component number.
7. the automatic device for reading and appraising composition according to claim 6, it is characterised in that the subject distillation module includes:
Target composition is cut into multiple words by composition word segmentation module, the composition word segmentation module;
First language analysis module, the first language analysis module to cutting for multiple words the target write a composition into Row language analysis;
Text subject extraction module, the text subject extraction module extracts the text of target composition according to the result of language analysis This theme;
The subject distillation module also includes:
The compostion topic that target is write a composition is cut into multiple words by topic word segmentation module, the topic word segmentation module;
Second language analysis module, the second language analysis module carries out language to cutting for the compostion topic of multiple words Speech analysis;
Topic subject distillation module, the topic subject distillation module extracts the topic of the compostion topic according to the result of language analysis Mesh theme.
8. the automatic device for reading and appraising composition according to claim 7, it is characterised in that the first language analysis module pair The target composition carries out language analysis, including:
The first language analysis module is labeled to the part of speech of each word in each sentence, wherein, the part of speech bag Include one or many in noun, verb, adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia Individual combination;
The first language analysis module recognizes the name type and mark of the noun word in each sentence, wherein, the name Type is claimed to include the combination of one or more of name, place name, mechanism name;
The first language analysis module is labeled to the subject of the predicate in each sentence;
Relation between word of the first language analysis module in each sentence, is labeled to the grammer of the sentence;
The first language analysis module recognizes the semantic association relation between participle word and part of speech and marked;
The second language analysis module carries out language analysis to the compostion topic, including:
The second language analysis module is labeled to the part of speech of each word in each sentence, wherein, the part of speech bag Include one or many in noun, verb, adjective, pronoun, number, measure word, adverbial word, preposition, conjunction, auxiliary word, interjection, onomatopoeia Individual combination;
The second language analysis module recognizes the name type and mark of the noun word in each sentence, wherein, the name Type is claimed to include the combination of one or more of name, place name, mechanism name;
The second language analysis module is labeled to the subject of the predicate in each sentence;
Relation between word of the second language analysis module in each sentence, is labeled to the grammer of the sentence;
The second language analysis module recognizes the semantic association relation between participle word and part of speech and marked.
9. the automatic device for reading and appraising composition according to claim 8, it is characterised in that the text subject extraction module bag Include:
First subject distillation module, the first subject distillation module write a composition according to target in annotation results, identification word Similarity, is clustered to the word, and the text subject then write a composition according to the result of the cluster to the target is carried out Once extract;
Second theme extraction module, the second theme extraction module is according to the text rule prestored, the text write a composition to target Theme carries out second extraction;
Weighting block, according to the text subject extracted twice, weighting obtains the final text subject of the target composition.
10. the automatic device for reading and appraising composition according to claim 6, it is characterised in that after the subject distillation module, Also include:
Rhetorical devices identification module, the rhetorical devices identification module recognizes the rhetorical devices of the target composition;
Rhetorical devices scoring modules, the rhetorical devices scoring modules are according to the rhetorical devices to target composition generation point Number;
Final score generation module, the final score generation module respectively with the rhetorical devices scoring modules and the master Inscribe the fraction of comparing module connection, the fraction that the theme comparing module is generated and rhetorical devices scoring modules generation It is weighted, obtains the final fraction of the target composition.
11. a kind of storage medium, it is characterised in that be stored with computer instruction, is realized such as when the computer instruction is performed The automatic method for reading and appraising composition described in claim any one of 1-5.
12. a kind of terminal, it is characterised in that including processor and memory, the computer that is stored with the memory refers to Order;
In the application program launching, the processor reads the computer instruction and realized as claim 1-5 is any The automatic method for reading and appraising composition described in.
CN201710498079.9A 2017-06-27 2017-06-27 Method and device for automatically reviewing composition, storage medium and terminal Active CN107291694B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710498079.9A CN107291694B (en) 2017-06-27 2017-06-27 Method and device for automatically reviewing composition, storage medium and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710498079.9A CN107291694B (en) 2017-06-27 2017-06-27 Method and device for automatically reviewing composition, storage medium and terminal

Publications (2)

Publication Number Publication Date
CN107291694A true CN107291694A (en) 2017-10-24
CN107291694B CN107291694B (en) 2021-04-13

Family

ID=60098366

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710498079.9A Active CN107291694B (en) 2017-06-27 2017-06-27 Method and device for automatically reviewing composition, storage medium and terminal

Country Status (1)

Country Link
CN (1) CN107291694B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107784109A (en) * 2017-10-31 2018-03-09 浠绘旦 A kind of appraisal procedure and system of network novel commercial value
CN109190108A (en) * 2018-07-20 2019-01-11 北京理琪教育科技有限公司 Language composition corrects method and system automatically
CN109227564A (en) * 2018-10-19 2019-01-18 南京工业大学 Automatic correcting robot for paper work
CN109614624A (en) * 2018-12-12 2019-04-12 广东小天才科技有限公司 English sentence recognition method and electronic equipment
CN109903616A (en) * 2019-04-08 2019-06-18 西安培华学院 A kind of interactive system and method for Aided English Teaching
CN110222344A (en) * 2019-06-17 2019-09-10 上海元趣信息技术有限公司 A kind of composition factor analysis algorithm taught for pupil's composition
CN110264792A (en) * 2019-06-17 2019-09-20 上海元趣信息技术有限公司 One kind is for pupil's composition intelligent tutoring system
CN110414006A (en) * 2019-07-31 2019-11-05 京东方科技集团股份有限公司 Theme mask method, device, electronic equipment and the storage medium of text
CN110598202A (en) * 2019-06-20 2019-12-20 华中师范大学 Method for automatically identifying primary school Chinese composition ranking sentences
CN112528628A (en) * 2020-12-18 2021-03-19 北京一起教育科技有限责任公司 Text processing method and device and electronic equipment
CN112528799A (en) * 2020-12-02 2021-03-19 广州宏途教育网络科技有限公司 Teaching live broadcast method and device, computer equipment and storage medium
CN112527968A (en) * 2020-12-22 2021-03-19 大唐融合通信股份有限公司 Composition review method and system based on neural network

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102279844A (en) * 2011-08-31 2011-12-14 中国科学院自动化研究所 Method and system for automatically testing Chinese composition
CN102779220A (en) * 2011-05-10 2012-11-14 李德霞 English test paper scoring system
CN105183712A (en) * 2015-08-27 2015-12-23 北京时代焦点国际教育咨询有限责任公司 Method and apparatus for scoring English composition
CN106126613A (en) * 2016-06-22 2016-11-16 苏州大学 One composition of digressing from the subject determines method and device
CN106372056A (en) * 2016-08-25 2017-02-01 久远谦长(北京)技术服务有限公司 Natural language-based topic and keyword extraction method and system
CN106502981A (en) * 2016-10-09 2017-03-15 广西师范大学 Automatically analyzed and decision method based on the Figures of Speech sentence of part of speech, syntax and dictionary
US20170140659A1 (en) * 2015-11-14 2017-05-18 The King Abdulaziz City For Science And Technology Method and system for automatically scoring an essay using plurality of linguistic levels

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779220A (en) * 2011-05-10 2012-11-14 李德霞 English test paper scoring system
CN102279844A (en) * 2011-08-31 2011-12-14 中国科学院自动化研究所 Method and system for automatically testing Chinese composition
CN105183712A (en) * 2015-08-27 2015-12-23 北京时代焦点国际教育咨询有限责任公司 Method and apparatus for scoring English composition
US20170140659A1 (en) * 2015-11-14 2017-05-18 The King Abdulaziz City For Science And Technology Method and system for automatically scoring an essay using plurality of linguistic levels
CN106126613A (en) * 2016-06-22 2016-11-16 苏州大学 One composition of digressing from the subject determines method and device
CN106372056A (en) * 2016-08-25 2017-02-01 久远谦长(北京)技术服务有限公司 Natural language-based topic and keyword extraction method and system
CN106502981A (en) * 2016-10-09 2017-03-15 广西师范大学 Automatically analyzed and decision method based on the Figures of Speech sentence of part of speech, syntax and dictionary

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
YEN-YU CHEN 等: "An Unsupervised Automated Essay Scoring System", 《IEEE INTELLIGENT SYSTEMS》 *
刘明杨 等: "基于文采特征的高考作文自动评分", 《智能计算机与应用》 *
巩捷甫: "面向语文作文自动评阅的修辞手法识别***的设计与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
张云涛 等: "基于综合方法的文本主题句的自动抽取", 《上海交通大学学报》 *
柯育强: "大学英语作文自动评分***中文本聚类的应用", 《电子技术与软件工程》 *
蔡黎 等: "少数民族汉语考试作文自动评分的特征提取研究", 《第五届全国青年计算语言学研讨会》 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107784109A (en) * 2017-10-31 2018-03-09 浠绘旦 A kind of appraisal procedure and system of network novel commercial value
CN109190108A (en) * 2018-07-20 2019-01-11 北京理琪教育科技有限公司 Language composition corrects method and system automatically
CN109227564A (en) * 2018-10-19 2019-01-18 南京工业大学 Automatic correcting robot for paper work
CN109614624A (en) * 2018-12-12 2019-04-12 广东小天才科技有限公司 English sentence recognition method and electronic equipment
CN109903616A (en) * 2019-04-08 2019-06-18 西安培华学院 A kind of interactive system and method for Aided English Teaching
CN110264792A (en) * 2019-06-17 2019-09-20 上海元趣信息技术有限公司 One kind is for pupil's composition intelligent tutoring system
CN110222344A (en) * 2019-06-17 2019-09-10 上海元趣信息技术有限公司 A kind of composition factor analysis algorithm taught for pupil's composition
CN110264792B (en) * 2019-06-17 2021-11-09 上海元趣信息技术有限公司 Intelligent tutoring system for composition of pupils
CN110222344B (en) * 2019-06-17 2022-09-23 上海元趣信息技术有限公司 Composition element analysis algorithm for composition tutoring of pupils
CN110598202A (en) * 2019-06-20 2019-12-20 华中师范大学 Method for automatically identifying primary school Chinese composition ranking sentences
CN110414006A (en) * 2019-07-31 2019-11-05 京东方科技集团股份有限公司 Theme mask method, device, electronic equipment and the storage medium of text
CN112528799A (en) * 2020-12-02 2021-03-19 广州宏途教育网络科技有限公司 Teaching live broadcast method and device, computer equipment and storage medium
CN112528628A (en) * 2020-12-18 2021-03-19 北京一起教育科技有限责任公司 Text processing method and device and electronic equipment
CN112528628B (en) * 2020-12-18 2024-02-02 北京一起教育科技有限责任公司 Text processing method and device and electronic equipment
CN112527968A (en) * 2020-12-22 2021-03-19 大唐融合通信股份有限公司 Composition review method and system based on neural network

Also Published As

Publication number Publication date
CN107291694B (en) 2021-04-13

Similar Documents

Publication Publication Date Title
CN107291694A (en) A kind of automatic method and apparatus, storage medium and terminal for reading and appraising composition
Blanchard et al. TOEFL11: A corpus of non‐native English
Biber et al. Corpus linguistics: Investigating language structure and use
Villalon et al. Concept map mining: A definition and a framework for its evaluation
Reynolds Insights from Russian second language readability classification: complexity-dependent training requirements, and feature evaluation of multiple categories
Volodina et al. SweLL on the rise: Swedish learner language corpus for European reference level studies
CN106649819B (en) Method and device for extracting entity words and hypernyms
Adebara et al. Towards afrocentric NLP for African languages: Where we are and where we can go
JPWO2005057524A1 (en) Evaluation scoring device for writing essay
CN111311459B (en) Interactive question-setting method and system for international Chinese teaching
Burchardt et al. The German Language in the Digital Age
CN111914532A (en) Chinese composition scoring method
CN108280065A (en) A kind of foreign language text evaluation method and device
Gugliotta et al. Tarc: Incrementally and semi-automatically collecting a tunisian arabish corpus
Siewert et al. LSDC-a comprehensive dataset for low Saxon dialect classification
El Kah et al. Application of Arabic language processing in language learning
Volodina On two SweLL learner corpora–SweLL-pilot and SweLL-gold
Rybka The linguistic encoding of landscape in Lokono
Haokip Socio-Linguistic Situation in North-East India
CN112989068B (en) Knowledge graph construction method for Tang poetry knowledge and Tang poetry knowledge question-answering system
Graedler NEST-a corpus in the brooding box
Khandait et al. Automatic question generation through word vector synchronization using lamma
CN109800419A (en) A kind of game sessions lines generation method and system
Medrano Toward a Khipu Transcription" Insistence": a Corpus-Based Study of the Textos Andinos
Yan The construction and application of the corpus system in primary school English classroom: taking Hainan Province as an example

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Units F01-03 and 05-10 on the 6th floor of No.1 Building, No.8 Courtyard, Guangshun South Street, Chaoyang District, Beijing

Applicant after: Beijing Ape Power Future Technology Co.,Ltd.

Address before: Room A116, Floor 2, 88 Xiangshan Road, Haidian District, Beijing

Applicant before: BEIJING FENBI WEILAI TECHNOLOGY CO.,LTD.

CB02 Change of applicant information
TA01 Transfer of patent application right

Effective date of registration: 20200506

Address after: 100102 unit F01, 5th floor and unit 04, F01, 6th floor, building 1, yard 8, Guangshun South Street, Chaoyang District, Beijing

Applicant after: Beijing ape force Education Technology Co.,Ltd.

Address before: Units F01-03 and 05-10 on the 6th floor of No.1 Building, No.8 Courtyard, Guangshun South Street, Chaoyang District, Beijing

Applicant before: Beijing Ape Power Future Technology Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder

Address after: 100102 unit F01, 5th floor, building 1, yard 8, Guangshun South Street, Chaoyang District, Beijing

Patentee after: Beijing Ape Power Technology Co.,Ltd.

Address before: 100102 unit F01, 5th floor, building 1, yard 8, Guangshun South Street, Chaoyang District, Beijing

Patentee before: Beijing ape force Education Technology Co.,Ltd.

CP01 Change in the name or title of a patent holder