CN103151042B - Full-automatic oral evaluation management and points-scoring system and methods of marking thereof - Google Patents

Full-automatic oral evaluation management and points-scoring system and methods of marking thereof

Info

Publication number
CN103151042B
Authority
CN
China
Prior art keywords
feature
examinee
model
answer
recognition result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310034371.7A
Other languages
Chinese (zh)
Other versions
CN103151042A (en)
Inventor
王岚 (Wang Lan)
宋阳 (Song Yang)
陈蒙 (Chen Meng)
金晓虎 (Jin Xiaohu)
李阳 (Li Yang)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Institute of Advanced Technology of CAS
Original Assignee
Shenzhen Institute of Advanced Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Institute of Advanced Technology of CAS filed Critical Shenzhen Institute of Advanced Technology of CAS
Priority to CN201310034371.7A
Publication of CN103151042A
Application granted
Publication of CN103151042B
Legal status: Active (granted)

Landscapes

  • Machine Translation (AREA)

Abstract

The present invention relates to the field of computer-assisted instruction (CAI) and provides a fully automatic oral evaluation management and scoring system. The system comprises a client, a login server and a network server connected in sequence. An examinee's answer is the result of either a read-aloud question or a spontaneous speaking question. The login server further comprises a scoring apparatus consisting of a recognition module and a scoring module: the recognition module comprises an acoustic sub-module, a language sub-module and a recognition sub-module, and the scoring module comprises a feature-extraction sub-module and a scoring sub-module. The invention also provides a fully automatic oral evaluation management and scoring method. The system and method control the examinees' preparation time and answering time entirely by computer, without manual intervention and more accurately than manual timing, which further guarantees the fairness of the evaluation.

Description

Full-automatic oral evaluation management and points-scoring system and methods of marking thereof
Technical field
The present invention relates to the field of computer-assisted instruction (CAI), and in particular to a fully automatic oral evaluation management and scoring system and a scoring method thereof.
Background technology
With the spread of English teaching in China, the problem of "mute English" has become increasingly serious, so the teaching and evaluation of spoken English receive more and more attention. Oral English evaluation is already carried out in several Chinese provinces and cities; in Guangdong Province, spoken English has been brought into the compulsory scope of the college entrance examination. Because of the particular form and content of oral evaluation, the scoring of spoken English is still mainly done by human raters, and manual scoring has several serious shortcomings:
Face-to-face oral evaluation between teacher and student lacks fairness and accuracy. Different teachers can hardly give identical scores to the same examinee, and even the same teacher can hardly apply identical standards to different examinees, all of which greatly reduces the fairness of the evaluation. The present system establishes a single, commonly accepted standard for automatic machine scoring, avoiding the divergence of standards caused by human factors and guaranteeing the fairness of the evaluation.
Manual scoring of oral evaluation is very inefficient. The present invention designs a fully automatic oral evaluation management and scoring system in which machine scoring replaces manual scoring: from the distribution of test papers, to the start and end of the test, to the storage of the examinees' answer audio, to the automatic scoring itself, everything is completed automatically by machine, so scoring efficiency is greatly improved.
The organization and administration of traditional oral evaluation are very complicated. Traditional person-to-person oral evaluation takes the form of interviews and requires a large number of professionally trained oral examiners; moreover, each session can assess only a few students, so evaluating a whole grade, or even every English learner in a school, is time-consuming and labour-intensive and carries the risk of question leakage. The fully automatic system designed in the present invention can give an entire grade, or even an entire school, a fair, secure and efficient spoken-English evaluation at the same time, saving considerable manpower and material resources and avoiding the leakage risk that comes with evaluating in batches.
Traditional oral evaluation requires manual control of the start and end times of each test; every change of question, even every sub-question, has to be re-timed, which is very cumbersome.
Summary of the invention
To solve the above technical problems, the present invention provides a fully automatic oral evaluation management and scoring system that needs no manual intervention, times more accurately, and further guarantees the fairness of the evaluation. The system comprises a client, a login server and a network server connected in sequence. The network server is mainly responsible for collecting and organizing evaluation results and for distributing test papers; the login server is mainly responsible for automatic machine scoring; and the client is mainly responsible for administering the test. Test papers are distributed from the network server to the clients through the login server, and the examinees' answers are uploaded from the clients to the network server through the login server. An examinee's answer is the result of either a read-aloud question or a spontaneous speaking question. The login server further comprises a scoring apparatus consisting of a recognition module and a scoring module. The recognition module comprises an acoustic sub-module, a language sub-module and a recognition sub-module: the acoustic sub-module extracts acoustic features from the examinee's answer audio and obtains an acoustic model; the language sub-module obtains a language model from the question information and training text; and the recognition sub-module decodes the examinee's answer audio with the acoustic model and the language model to obtain a recognition result. The scoring module comprises a feature-extraction sub-module and a scoring sub-module: the feature-extraction sub-module extracts comprehensive features from the recognition result, where the comprehensive features include pronunciation-accuracy features, fluency features and text-semantic similarity features for the oral assessment; the scoring sub-module trains a scoring model on the comprehensive features and then scores the recognition result according to the scoring model.
Preferably, the pronunciation-accuracy features are obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Force-align the recognition result with the correct text and compute the forced-alignment score of each phoneme;
Build a monophone decoding model, decode each phoneme, and compute the maximum-likelihood score of each phoneme;
Use the difference between the forced-alignment score and the maximum-likelihood score as the pronunciation-accuracy feature.
Preferably, the fluency features comprise a rate-of-speech feature and a short-pause duration feature, the rate-of-speech feature being obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Count, from the recognition result, the number of frames corresponding to each phoneme;
Take the ratio of the total number of phonemes to the total duration of all phonemes as the rate-of-speech feature;
The short-pause duration feature is obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Count, from the recognition result, the number of frames corresponding to each phoneme and the total number of frames of the audio;
Take the ratio of the total duration of all short pauses to the total pronunciation duration as the short-pause duration feature.
Preferably, the text-semantic similarity features comprise a semantic-relatedness feature and a syntactic-structure similarity feature.
Preferably, the semantic-relatedness feature is obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Compute the semantic similarity score between each word in the recognition result and each word in the reference answer;
Compute the semantic similarity score between each word in the recognition result and each sentence in the reference answer;
Take the maximum or the mean of the semantic similarity scores between a word in the recognition result and the words of a sentence in the reference answer as the similarity score between that word and that sentence;
Compute the similarity score between the examinee's answer and the reference answer.
Preferably, the syntactic-structure similarity feature is obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Build a syntax sequence vector for each sentence of the recognition result;
Compute the syntactic-structure similarity score between each sentence of the recognition result and each sentence of the reference answer, and take the maximum of these scores as the syntactic-structure similarity score of that sentence;
Compute the syntactic-structure similarity feature between the examinee's answer and the reference answer as the weighted average of the per-sentence syntactic-structure similarity scores of the recognition result.
Preferably, the network server comprises a scheduling module for scheduling evaluation information between the login servers and the network server. A networked evaluation mode is adopted, so that communication among the login servers of examination sites in different locations is centrally scheduled and managed by the scheduling module of the network server.
Preferably, the system comprises three roles with different privileges: examinee, teacher and administrator. Examinees are mainly responsible for taking the test and answering; teachers are mainly responsible for creating test papers, publishing tests, managing tests, reviewing results and scoring, where scoring combines system scoring with teacher scoring; administrators are mainly responsible for test management and for controlling the timing of the test software.
Preferably, for role-play questions the reference answer already provided for each sub-question can be used as the reference text for the text-semantic similarity analysis and then expanded into further reference answers; for retelling questions the transcript of the audio content can be used as the reference text for the text-semantic similarity analysis and then expanded into further reference answers.
The present invention further provides a fully automatic oral evaluation management and scoring method, comprising the following steps:
A0. Select a number of examinees, run the procedure of steps A1 to A5 for them, and train an automatic scoring model on the combination of their comprehensive features and teacher scores to form the scoring model;
A1. Collect the examinee's answer audio;
A2. Extract acoustic features from the examinee's answer audio to obtain an acoustic model, and obtain a language model from the question information and training text;
A3. Decode the examinee's answer audio with the established acoustic model and language model to obtain a recognition result;
A4. Extract comprehensive features from the recognition result, the comprehensive features comprising pronunciation-accuracy features, fluency features and text-semantic similarity features for the oral assessment;
A5. Perform automatic scoring with the scoring model formed from the comprehensive features.
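For illustration only (not part of the patent text), the following is a minimal Python sketch of how steps A1 to A5 might be wired together; the helper functions decode() and extract_features(), the weights and all numeric values are hypothetical stand-ins for the real recognizer, feature extractors and trained scoring model.

```python
# Hypothetical stand-ins for the recognizer and feature extractors; the real
# system (A2/A3) would use an HMM acoustic model and an N-gram language model.
from typing import Dict, List

def decode(audio_path: str) -> Dict:
    """A3 stand-in: return a recognition result with per-phoneme frame counts."""
    return {"words": ["hello", "world"], "phone_frames": {"HH": 5, "AH": 7}}

def extract_features(recognition: Dict, reference_answers: List[str]) -> List[float]:
    """A4 stand-in: pronunciation-accuracy, fluency and text-similarity features."""
    return [0.7, 2.8, 0.12, 0.6, 0.5]

def auto_score(features: List[float], weights: List[float], bias: float) -> float:
    """A5: apply a linear scoring model previously trained in step A0."""
    return bias + sum(w * f for w, f in zip(weights, features))

if __name__ == "__main__":
    rec = decode("examinee_001.wav")                      # A1 audio -> A3 result
    feats = extract_features(rec, ["a possible reference answer"])
    # Illustrative weights; in the method they come from training on teacher scores.
    print(auto_score(feats, weights=[20, 5, -30, 25, 15], bias=10))
```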
The fully automatic oral evaluation management and scoring system and method of the present invention add text-semantic similarity features to the traditional read-aloud oral assessment, so that spontaneous speaking can also be assessed. A dedicated cloud-based evaluation architecture ensures that evaluations run efficiently, makes full use of the resources of the whole system, greatly improves the efficiency of organizing oral evaluations and saves considerable manpower and material resources. Oral evaluation can be organized on a large scale, meeting the demand for simultaneous oral evaluation across provinces and cities or even nationwide, and the forms of oral evaluation become more varied, so the examinee's spoken-language ability is assessed more comprehensively and fairly.
Accompanying drawing explanation
Fig. 1 is a flowchart of the fully automatic oral evaluation management and scoring method of one embodiment of the present invention;
Fig. 2 is an architecture diagram of the fully automatic oral evaluation management and scoring system of one embodiment of the present invention;
Fig. 3 is a flowchart of semantic-relatedness feature extraction in one embodiment of the present invention;
Fig. 4 is a flowchart of semantic-relatedness feature extraction in another embodiment of the present invention;
Fig. 5 is a flowchart of syntactic-structure similarity feature extraction in one embodiment of the present invention.
Embodiment
The present invention is described in further detail below with reference to the accompanying drawings and specific embodiments.
Embodiment:
The present invention provides a fully automatic oral evaluation management and scoring system comprising a client, a login server and a network server connected in sequence. The network server is mainly responsible for collecting and organizing evaluation results and for distributing test papers; the login server is mainly responsible for automatic machine scoring; and the client is mainly responsible for administering the test. Test papers are distributed from the network server to the clients through the login server, and the examinees' answers are uploaded from the clients to the network server through the login server. An examinee's answer is the result of either a read-aloud question or a spontaneous speaking question. The login server further comprises a scoring apparatus consisting of a recognition module and a scoring module. The recognition module comprises an acoustic sub-module, a language sub-module and a recognition sub-module: the acoustic sub-module extracts acoustic features from the examinee's answer audio and obtains an acoustic model; the language sub-module obtains a language model from the question information and training text; and the recognition sub-module decodes the examinee's answer audio with the acoustic model and the language model to obtain a recognition result. The scoring module comprises a feature-extraction sub-module and a scoring sub-module: the feature-extraction sub-module extracts comprehensive features from the recognition result, where the comprehensive features include pronunciation-accuracy features, fluency features and text-semantic similarity features for the oral assessment; the scoring sub-module trains a scoring model on the comprehensive features and then scores the recognition result according to the scoring model.
Fig. 1 is a flowchart of the fully automatic oral evaluation management and scoring method of one embodiment of the present invention, which comprises the following steps:
A0. Select a number of examinees, run the procedure of steps A1 to A5 for them, and train an automatic scoring model on the combination of their comprehensive features and teacher scores to form the scoring model;
A1. Collect the examinee's answer audio;
A2. Extract acoustic features from the examinee's answer audio to obtain an acoustic model, and obtain a language model from the question information and training text;
A3. Decode the examinee's answer audio with the established acoustic model and language model to obtain a recognition result;
A4. Extract comprehensive features from the recognition result;
A5. Perform automatic scoring with the scoring model formed from the comprehensive features.
In the above embodiment, the language model uses one or more of question-related text, written-style related text and colloquial-style text. Question-related text is the text formed from the question and the reference answer for a given question, where the reference answer may or may not be shown to the examinee during the test; written-style related text is the spoken text an examinee produces when following the reference answer; colloquial-style text is the spoken text an examinee produces completely freely, without reference to any reference answer. Teacher scoring means that professional teachers are engaged to score the examinee's answer audio with double marking: two markers score each answer and the mean of their scores is the examinee's final result; if the two markers' scores differ by more than 3 points, a third marking is used.
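As an illustration of the double-marking rule just described, here is a small Python sketch; the handling of the third mark (averaging the two closest scores) is an assumption, since the text only states that a third marking is used when the first two marks differ by more than 3 points.

```python
from statistics import mean
from typing import Optional

def final_teacher_score(score_a: float, score_b: float,
                        third: Optional[float] = None,
                        max_gap: float = 3.0) -> float:
    """Double marking: average two marks; if they differ by more than max_gap
    points, a third marking is required.  Averaging the two closest marks
    afterwards is an assumption made only for this sketch."""
    if abs(score_a - score_b) <= max_gap:
        return mean([score_a, score_b])
    if third is None:
        raise ValueError("marks differ by more than 3 points: third marking needed")
    pairs = [(score_a, score_b), (score_a, third), (score_b, third)]
    closest = min(pairs, key=lambda p: abs(p[0] - p[1]))
    return mean(closest)

print(final_teacher_score(86, 88))        # 87.0
print(final_teacher_score(80, 90, 88))    # mean of the two closest marks -> 89.0
```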
Fig. 2 is an architecture diagram of the fully automatic oral evaluation management and scoring system of one embodiment of the present invention, which is mainly divided into the following three parts:
Client: a personal computer, tablet computer or high-end smartphone can all serve as the client of the present invention; the client must have independent audio/video playback capability and an independent audio input device. The client comprises an evaluation module and a publishing module. The evaluation module is used by the examinee to take the oral evaluation, covering question delivery, test taking and answer submission, and it processes the examinee's answer audio and transmits it to the login server; for this purpose the evaluation module is also provided with a first processing module for format conversion and feature extraction of the answer audio. After the test ends, the examinee's results are published on the client by the publishing module.
Login server: the login server is a high-performance computer installed at the examination site. It comprises a communication module, an acquisition module and a scoring module. The communication module receives and forwards evaluation information between the network server and the clients, provides test papers to the clients at the specified time and controls the test timing; the acquisition module collects the examinees' answer audio from the clients; the scoring module recognizes, decodes and scores the examinees' answers, and after scoring is finished the results are fed back to the clients in time through the communication module. Depending on the size of the examination site and the computational load, the login server may take the form of a cluster of several high-performance computers to speed up decoding and scoring.
Network server: the network server is mainly responsible for centrally controlling the login servers distributed across the examination sites. It comprises a scheduling module, an analysis module and a query module. The scheduling module schedules evaluation information between the login servers and the network server; a networked evaluation mode is adopted, so that communication among the login servers of examination sites in different locations is centrally scheduled and managed by the scheduling module, which, through the unified test software, controls when each login server starts and ends its test. This makes it easy to organize oral evaluation on a large scale: for each evaluation there is no need to prepare a test paper for every examinee in advance; the network server distributes the test paper to each login server, and each login server in turn distributes it to its clients, greatly reducing the risk of question leakage. The analysis module performs centralized analysis of the examinees' answers and their scores after the test ends, producing statistics such as each examinee's total score, item scores and rank; the query module allows teachers and students to query information such as total score, item scores and rank at any time.
The system comprises three roles with different privileges: examinee, teacher and administrator. Examinees are mainly responsible for taking the test and answering; teachers are mainly responsible for creating test papers, publishing tests, managing tests, reviewing results and scoring, where scoring combines system scoring with teacher scoring; administrators are mainly responsible for test management, timing control of the test software and maintenance of the overall evaluation system.
In the present embodiment, the recognition module of the scoring apparatus uses a decoder based on large-vocabulary continuous speech recognition; the acoustic model is based on hidden Markov models (HMM) and the language model is an N-gram language model. Multi-pass decoding is used: because the content of the examinee's answer is unknown, the process consists of a direct first-pass decode, unsupervised adaptation based on maximum likelihood linear regression (MLLR), and a second-pass decode. Further multi-pass decoding techniques can also be used to improve recognition accuracy. After decoding, the scoring module extracts the required comprehensive features from the recognition result, trains the scoring model, and finally performs automatic machine scoring of new audio according to the scoring model.
The core algorithm of the fully automatic oral evaluation management and scoring system of this embodiment uses machine learning methods based on pattern recognition, namely one of linear regression, case-based similarity assessment, association-rule learning, neural networks or support vector machines.
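For illustration, a minimal sketch of training the scoring model with one of the learners listed above (support-vector regression); the feature layout, the toy data and the hyper-parameters are assumptions and are not taken from the patent.

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.svm import SVR

# Rows: calibration examinees scored by teachers (step A0); columns: the
# comprehensive features (pronunciation accuracy, rate of speech, pause ratio,
# semantic similarity, syntactic similarity).  Values are illustrative only.
X = np.array([
    [0.82, 3.2, 0.08, 0.71, 0.64],
    [0.55, 2.1, 0.22, 0.38, 0.30],
    [0.73, 2.9, 0.12, 0.60, 0.52],
    [0.40, 1.8, 0.30, 0.25, 0.20],
])
teacher_scores = np.array([88.0, 56.0, 75.0, 42.0])

model = SVR(kernel="rbf", C=10.0).fit(X, teacher_scores)

# Correlation between machine and teacher scores is a common sanity check.
r, _ = pearsonr(model.predict(X), teacher_scores)
print(f"machine-teacher correlation on the calibration set: {r:.2f}")
```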
The pronunciation-accuracy features in the above embodiment are obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Force-align the recognition result with the correct text and compute the forced-alignment score of each phoneme;
Build a monophone decoding model, decode each phoneme, and compute the maximum-likelihood score of each phoneme;
Use the difference between the forced-alignment score and the maximum-likelihood score as the pronunciation-accuracy feature.
A specific embodiment of the pronunciation-accuracy feature:
Let O denote the acoustic features of the utterance, p the pronounced phoneme, and Q the set of all phonemes. Given the acoustic features O, the posterior probability of phoneme p is:
PP = P(p|O) = P(O|p)P(p) / Σ_{q∈Q} P(O|q)P(q)
For convenience of computation the following assumptions are made: all phonemes occur with equal probability, i.e. P(p) = P(q), and the summation in the denominator is replaced by its maximum term, which yields the definition of the goodness of pronunciation (GOP):
GOP(p) = log( P(O|p) / max_{q∈Q} P(O|q) )
Feature extraction: the pronunciation-accuracy feature is therefore obtained as
GOP = S_FA − S_PL
where S_FA is the forced-alignment score and S_PL is the maximum-likelihood (phone-loop) score.
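The following Python sketch illustrates how the GOP feature above could be computed once per-phoneme log-likelihoods from forced alignment (S_FA) and from a free phone-loop decode (S_PL) are available; normalising by the phoneme's frame count and the toy numbers are assumptions made only for illustration.

```python
from typing import Dict, List

def gop(fa_loglik: float, pl_loglik: float, n_frames: int) -> float:
    """GOP(p) = log P(O|p) - max_q log P(O|q), i.e. S_FA - S_PL, here divided
    by the phoneme duration in frames (the normalisation is an assumption)."""
    return (fa_loglik - pl_loglik) / max(n_frames, 1)

def pronunciation_accuracy(phones: List[Dict]) -> float:
    """Average GOP over all aligned phonemes of one answer."""
    return sum(gop(p["fa"], p["pl"], p["frames"]) for p in phones) / len(phones)

# Illustrative per-phoneme scores (not real recognizer output).
answer = [
    {"phone": "HH", "fa": -42.0, "pl": -40.5, "frames": 6},
    {"phone": "AH", "fa": -55.0, "pl": -54.8, "frames": 8},
]
print(pronunciation_accuracy(answer))
```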
The fluency features in the above embodiment comprise a rate-of-speech feature and a short-pause duration feature, the rate-of-speech feature being obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Count, from the recognition result, the number of frames corresponding to each phoneme;
Take the ratio of the total number of phonemes to the total duration of all phonemes as the rate-of-speech feature;
The short-pause duration feature is obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Count, from the recognition result, the number of frames corresponding to each phoneme and the total number of frames of the audio;
Take the ratio of the total duration of all short pauses to the total pronunciation duration as the short-pause duration feature.
A specific embodiment of the rate-of-speech feature:
In second-language learning the rate of speech (ROS) characterizes speaking fluency and spoken proficiency well, especially for beginners. Let T_S denote the total duration of all phonemes and N_P the total number of phonemes; then:
ROS = N_P / T_S
A specific embodiment of the short-pause feature:
The short pauses between words also reflect pronunciation fluency: in general, the longer the pause time, the poorer the fluency. Let D_SP denote the proportion of the total duration taken up by pauses, T the total pronunciation duration, T_i the duration of the i-th short pause, and N_SP the total number of short pauses; then:
D_SP = ( Σ_{i=1}^{N_SP} T_i ) / T
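A small Python sketch of the two fluency features, computed from a frame-level phoneme/pause alignment; the 10 ms frame length and the toy alignment values are assumptions made only for illustration.

```python
FRAME_SEC = 0.01  # assume 10 ms analysis frames

def rate_of_speech(phone_frames):
    """ROS = N_P / T_S: number of phonemes over total phoneme duration (seconds)."""
    return len(phone_frames) / (sum(phone_frames) * FRAME_SEC)

def short_pause_ratio(pause_frames, total_frames):
    """D_SP = sum of short-pause durations over total pronunciation duration."""
    return sum(pause_frames) / total_frames

# Illustrative alignment: 9 phonemes and two short pauses in a 200-frame answer.
phones = [6, 8, 5, 7, 9, 6, 5, 8, 7]
pauses = [12, 18]
print(rate_of_speech(phones), short_pause_ratio(pauses, total_frames=200))
```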
The text-semantic similarity features in the above embodiment comprise a semantic-relatedness feature and a syntactic-structure similarity feature.
The semantic-relatedness feature is obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Compute the semantic similarity score between each word in the recognition result and each word in the reference answer;
Compute the semantic similarity score between each word in the recognition result and each sentence in the reference answer;
Take the maximum of the semantic similarity scores between a word in the recognition result and the words of a sentence in the reference answer as the similarity score between that word and that sentence;
Compute the similarity score between the examinee's answer and the reference answer.
The semantic-relatedness feature can also be obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Compute the semantic similarity score between each word in the recognition result and each word in the reference answer;
Compute the semantic similarity score between each word in the recognition result and each sentence in the reference answer;
Take the mean of the semantic similarity scores between a word in the recognition result and the words of a sentence in the reference answer as the similarity score between that word and that sentence;
Compute the similarity score between the examinee's answer and the reference answer.
The syntactic-structure similarity feature is obtained by the following steps:
Decode and recognize the examinee's answer audio to obtain a recognition result;
Build a syntax sequence vector for each sentence of the recognition result;
Compute the syntactic-structure similarity score between each sentence of the recognition result and each sentence of the reference answer, and take the maximum of these scores as the syntactic-structure similarity score of that sentence;
Compute the syntactic-structure similarity feature between the examinee's answer and the reference answer as the weighted average of the per-sentence syntactic-structure similarity scores of the recognition result.
A specific embodiment of the semantic-relatedness feature:
Fig. 3 is a flowchart of semantic-relatedness feature extraction in one embodiment of the present invention. Semantic relatedness largely reflects the semantic relationship between the examinee's answer and the reference answer: in general, the higher the semantic relatedness of two texts, the closer the examinee's answer is to the reference answer. The semantic similarity between two words can be computed with WordNet, a broad-coverage English lexical-semantic network in which nouns, verbs, adjectives and adverbs are each organized into a network of synonym sets; each synonym set represents one basic semantic concept, and the sets are connected by various relations (a polysemous word appears in one synonym set for each of its senses). The method of extended semantic (gloss) overlap is used to compute the semantic similarity between two words. For example, to compute the semantic similarity of the English words "drawing" and "decal", we first obtain the glosses of the two words from WordNet:
drawing: paper that is specially prepared for use in drafting.
decal: the art of transferring designs from specially prepared paper to a wood or glass or metal surface.
Three words overlap between the two glosses: "paper" is a single word and scores 1 point, while "specially prepared" is a two-word phrase and scores 4 points, so the semantic similarity between "drawing" and "decal" is 5 points. This defines the following scoring mechanism:
1) A single word that appears in both glosses scores one point; an overlapping n-word phrase scores n² points, because the probability of an n-word phrase occurring is far lower than the probability of n single words occurring. In WordNet each word usually has several senses.
Let ω and ω' denote the two words, S(ω) the set of all senses of word ω in WordNet, and c_i the i-th sense of ω; when computing the similarity of two words, the maximum similarity score over all pairs of senses is taken:
SCORE_lesk(ω, ω') = max_{c_i ∈ S(ω), c_j ∈ S(ω')} rel(c_i, c_j)
Once the word-to-word semantic similarities are available, the similarity between a word and a sentence can be computed by averaging the word-to-word similarity scores over the words of the sentence. With |U| denoting the number of words in sentence U:
SCORE_W(ω, U) = (1/|U|) Σ_{ω_i ∈ U} SCORE_lesk(ω, ω_i)
Finally the similarity score between the examinee's answer U and the reference answer P can be computed. With uniq(U) denoting the number of unique words in the examinee's answer:
SCORE_U(U, P) = ( Σ_{ω_i ∈ U} SCORE_W(ω_i, P) ) / uniq(U)
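To illustrate the gloss-overlap scoring described above, here is a self-contained Python sketch; glosses are passed in as plain strings (in practice they would come from WordNet for every sense pair), and the greedy longest-phrase-first matching order is an implementation assumption.

```python
def gloss_overlap(gloss_a: str, gloss_b: str) -> int:
    """Score contiguous word overlaps between two glosses: an overlapping
    n-word phrase scores n*n points, as in the drawing/decal example."""
    a, b = gloss_a.lower().split(), gloss_b.lower().split()
    score, used = 0, set()          # 'used' marks positions of gloss_a already credited
    for n in range(min(len(a), len(b)), 0, -1):   # longest phrases first
        for i in range(len(a) - n + 1):
            if any(k in used for k in range(i, i + n)):
                continue
            phrase = a[i:i + n]
            for j in range(len(b) - n + 1):
                if b[j:j + n] == phrase:
                    score += n * n
                    used.update(range(i, i + n))
                    break
    return score

drawing = "paper that is specially prepared for use in drafting"
decal = ("the art of transferring designs from specially prepared paper "
         "to a wood or glass or metal surface")
print(gloss_overlap(drawing, decal))  # "specially prepared" (4) + "paper" (1) = 5
```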
Fig. 4 is a flowchart of semantic-relatedness feature extraction in another embodiment of the present invention. As above, semantic relatedness largely reflects the semantic relationship between the examinee's answer and the reference answer: the higher the semantic relatedness, the closer the examinee's answer is to the reference answer. Let ω and ω' denote two words, S(ω) the set of all senses of word ω in WordNet, and c_i the i-th sense of ω; when computing the similarity of two words, the maximum similarity score over all pairs of senses is taken.
Once the word-to-word semantic similarities are available, the similarity between a word and a sentence can be computed. In this embodiment the maximum (rather than the mean) of the word-to-word similarity scores over the words of sentence U is taken as the similarity between the word and the sentence:
SCORE_W(ω, U) = max_{ω_i ∈ U} SCORE_lesk(ω, ω_i)
Finally the similarity score between the examinee's answer U and the reference answer P is computed as before.
A specific embodiment of the syntactic-structure similarity feature:
Fig. 5 is a flowchart of syntactic-structure similarity feature extraction in one embodiment of the present invention.
Consider the following two sentences:
T1 = ω1 ω2 ω3 ω4 ω5 ω6 ω7 ω8 ω9
T2 = ω1 ω2 ω3 ω9 ω5 ω6 ω7 ω8 ω4
Sentences T1 and T2 contain exactly the same words, so a simple semantic similarity analysis of the words they contain would conclude that T1 and T2 express exactly the same meaning, even though their word order differs. A scheme for syntactic-structure similarity analysis based on word order is therefore proposed here. Each word of T1 is renumbered; for example, the number of "dog" in T1 might be 4 and the number of "over" might be 6. Defining the syntax sequence vectors of sentences T1 and T2 as r1 and r2, we have:
r1 = (1 2 3 4 5 6 7 8 9)
r2 = (1 2 3 9 5 6 7 8 4)
The syntactic-structure similarity of T1 and T2 can then be computed by the following formula:
S_r = 1 − ||r1 − r2|| / ||r1 + r2||
In practice the words contained in T1 and T2 are not necessarily identical, so a word-correspondence rule between the two sentences must be defined:
If a word ω_i of T2 is also contained in T1, its number in T2 is the position at which that word first occurs in T1. Otherwise, the word in T1 that is semantically closest to ω_i is sought;
If the similarity between ω_i and that closest word exceeds a preset threshold, the number of that word in T1 is used as the number of ω_i in T2;
If both searches fail, the number of ω_i is set to 0.
According to the above rules the syntactic-structure similarity feature can be extracted.
Using the formula above, the syntactic-structure similarity between each sentence of the recognition result and each sentence of the reference answer is obtained, and the maximum over the reference sentences is taken as the syntactic-structure similarity of that recognized sentence.
The syntactic-structure similarity S_REC between the examinee's recognized answer and the reference answer can finally be expressed as:
S_REC = (1/M) Σ_{T1 ∈ REC} max_{T2 ∈ REF} S_r(T1, T2)
where M is the number of sentences in the examinee's recognition result, REC denotes the recognition result and REF denotes the reference answer.
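The word-order ("syntax vector") similarity above can be sketched in Python as follows; the semantic back-off to a near-synonym (the second rule) is omitted here, unmatched words are simply numbered 0, and padding vectors of unequal length with zeros is an assumption.

```python
import math
from typing import List

def syntax_vector(sentence: List[str], reference: List[str]) -> List[int]:
    """Number each word by its first position (1-based) in the reference
    sentence, or 0 when it does not occur there (semantic back-off omitted)."""
    return [reference.index(w) + 1 if w in reference else 0 for w in sentence]

def s_r(sentence: List[str], reference: List[str]) -> float:
    """S_r = 1 - ||r1 - r2|| / ||r1 + r2||."""
    r1 = list(range(1, len(reference) + 1))
    r2 = syntax_vector(sentence, reference)
    n = max(len(r1), len(r2))
    r1 += [0] * (n - len(r1)); r2 += [0] * (n - len(r2))   # pad (assumption)
    diff = math.sqrt(sum((a - b) ** 2 for a, b in zip(r1, r2)))
    total = math.sqrt(sum((a + b) ** 2 for a, b in zip(r1, r2)))
    return 1.0 - diff / total

def s_rec(recognized: List[List[str]], references: List[List[str]]) -> float:
    """S_REC: mean over recognized sentences of the best match to any reference."""
    return sum(max(s_r(s, ref) for ref in references) for s in recognized) / len(recognized)

t1 = "my dog chased a cat over that fence today".split()   # reference sentence T1
t2 = "my dog chased today cat over that fence a".split()   # words 4 and 9 swapped
print(round(s_r(t2, t1), 3))                               # about 0.786
print(s_rec([t2], [t1]))
```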
After the above comprehensive features have been obtained, the feature weights need to be adjusted so that the scoring model performs best; the feature weights can be optimized with an expectation-maximization algorithm.
In any of the above embodiments, for subjective questions that have no fixed reference answer, an answer-expansion scheme based on text-semantic similarity analysis is proposed: for role-play questions, the reference answer already provided for each sub-question is used as the reference text for the text-semantic similarity analysis and is then expanded into further reference answers; for retelling questions, the transcript of the audio content is used as the reference text and is likewise expanded into further reference answers. Since examinees do not necessarily answer exactly according to the reference answer, forced alignment cannot be used in this case. The decoding process consists of a first-pass decode, speaker adaptation and second-pass recognition; after the second-pass recognition is finished, the comprehensive features are extracted from the recognition result. The comprehensive features and the teacher scores are then used to train the scoring model, and the resulting scoring model is used for automatic scoring.
The fully automatic oral evaluation management and scoring system of the present invention does not require the examinee's answer text to be known in advance: the examinee can speak spontaneously on the question content, whereas previously the question-related text of the answer had to be known before scoring. Compared with traditional read-aloud oral assessment, the present invention can assess not only read-aloud questions but also spontaneous speaking questions, so the scoring is more comprehensive and fair; the examinee's pronunciation accuracy and fluency can be assessed on spontaneous speech, which better reflects the examinee's real spoken-language ability. The examinee's answer text is no longer restricted and automatically scored question types are no longer limited to read-aloud questions, so the examinee's understanding, use and expression of the language can be assessed in spontaneous speech. Adding the semantic-relatedness feature and the syntactic-structure similarity feature to the comprehensive features makes it possible to assess both the semantic relevance and the syntactic level of the examinee's speech. A dedicated cloud-based evaluation architecture ensures that evaluations run efficiently, makes full use of the resources of the whole system, greatly improves the efficiency of organizing oral evaluations and saves considerable manpower and material resources. Oral evaluation can be organized on a large scale, meeting the demand for simultaneous oral evaluation across provinces and cities or even nationwide, and the forms of oral evaluation become more varied, so the examinee's spoken-language ability is assessed more comprehensively and fairly.
It will be understood that a person of ordinary skill in the art can make various corresponding changes and modifications according to the technical concept of the present invention, and all such changes and modifications fall within the protection scope of the claims of the present invention.

Claims (5)

1. A fully automatic oral evaluation management and scoring system, comprising a client, a login server and a network server connected in sequence, wherein the network server is mainly responsible for collecting and organizing evaluation results and for distributing test papers, the login server is mainly responsible for automatic machine scoring, and the client is mainly responsible for administering the test; test papers are distributed from the network server to the clients through the login server, and the examinees' answers are uploaded from the clients to the network server through the login server, an examinee's answer being the result of either a read-aloud question or a spontaneous speaking question; characterized in that: the login server further comprises a scoring apparatus, the scoring apparatus comprising a recognition module and a scoring module; the recognition module comprises an acoustic sub-module, a language sub-module and a recognition sub-module, wherein the acoustic sub-module extracts acoustic features from the examinee's answer audio and obtains an acoustic model, the language sub-module obtains a language model from the question information and training text, and the recognition sub-module decodes the examinee's answer audio with the acoustic model and the language model to obtain a recognition result; the scoring module comprises a feature-extraction sub-module and a scoring sub-module, the feature-extraction sub-module being used to extract comprehensive features from the recognition result, the comprehensive features comprising pronunciation-accuracy features, fluency features and text-semantic similarity features for the oral assessment, and the text-semantic similarity features comprising a semantic-relatedness feature and a syntactic-structure similarity feature;
the feature-extraction sub-module extracts the semantic-relatedness feature by: computing the semantic similarity score between each word in the recognition result and each word in the reference answer; computing the semantic similarity score between each word in the recognition result and each sentence in the reference answer; taking the maximum or the mean of the semantic similarity scores between a word in the recognition result and the words of a sentence in the reference answer as the similarity score between that word and that sentence; and computing the similarity score between the examinee's answer and the reference answer;
the feature-extraction sub-module extracts the syntactic-structure similarity feature by: building a syntax sequence vector for each sentence of the recognition result; computing the syntactic-structure similarity score between each sentence of the recognition result and each sentence of the reference answer, and taking the maximum of these scores as the syntactic-structure similarity score of that sentence; and computing the syntactic-structure similarity feature between the examinee's answer and the reference answer as the weighted average of the per-sentence syntactic-structure similarity scores of the recognition result;
the recognition module uses a decoder based on large-vocabulary continuous speech recognition, the acoustic model is based on hidden Markov models (HMM), the language model is an N-gram language model, and multi-pass decoding is used, the multi-pass decoding comprising a direct first-pass decode, unsupervised adaptation based on maximum likelihood linear regression, and a second-pass decode; the scoring sub-module trains a scoring model on the comprehensive features and scores the recognition result according to the scoring model.
2. The fully automatic oral evaluation management and scoring system of claim 1, characterized in that the feature-extraction sub-module is further used to extract the pronunciation-accuracy features by:
force-aligning the recognition result with the correct text and computing the forced-alignment score of each phoneme;
building a monophone decoding model, decoding each phoneme, and computing the maximum-likelihood score of each phoneme;
using the difference between the forced-alignment score and the maximum-likelihood score as the pronunciation-accuracy feature.
3. The fully automatic oral evaluation management and scoring system of claim 1, characterized in that the network server comprises a scheduling module for scheduling evaluation information between the login servers and the network server; a networked evaluation mode is adopted, so that communication among the login servers of examination sites in different locations is centrally scheduled and managed by the scheduling module of the network server.
4. The fully automatic oral evaluation management and scoring system of claim 1, characterized in that the system comprises three roles with different privileges: examinee, teacher and administrator, wherein examinees are mainly responsible for taking the test and answering; teachers are mainly responsible for creating test papers, publishing tests, managing tests, reviewing results and scoring, scoring combining system scoring with teacher scoring; and administrators are mainly responsible for test management and for controlling the timing of the test software.
5. A fully automatic oral evaluation management and scoring method, characterized in that it comprises the following steps:
A0. Select a number of examinees, run the procedure of steps A1 to A5 for them, and train an automatic scoring model on the combination of their comprehensive features and teacher scores to form the scoring model;
A1. Collect the examinee's answer audio;
A2. Extract acoustic features from the examinee's answer audio to obtain an acoustic model, and obtain a language model from the question information and training text;
A3. Decode the examinee's answer audio with the established acoustic model and language model to obtain a recognition result;
A4. Extract comprehensive features from the recognition result, the comprehensive features comprising pronunciation-accuracy features, fluency features and text-semantic similarity features for the oral assessment, and the text-semantic similarity features comprising a semantic-relatedness feature and a syntactic-structure similarity feature;
obtaining the semantic-relatedness feature comprises: decoding and recognizing the examinee's answer audio to obtain a recognition result; computing the semantic similarity score between each word in the recognition result and each word in the reference answer; computing the semantic similarity score between each word in the recognition result and each sentence in the reference answer; taking the maximum or the mean of the semantic similarity scores between a word in the recognition result and the words of a sentence in the reference answer as the similarity score between that word and that sentence; and computing the similarity score between the examinee's answer and the reference answer;
obtaining the syntactic-structure similarity feature comprises: decoding and recognizing the examinee's answer audio to obtain a recognition result; building a syntax sequence vector for each sentence of the recognition result; computing the syntactic-structure similarity score between each sentence of the recognition result and each sentence of the reference answer, and taking the maximum of these scores as the syntactic-structure similarity score of that sentence; and computing the syntactic-structure similarity feature between the examinee's answer and the reference answer as the weighted average of the per-sentence syntactic-structure similarity scores of the recognition result;
the decoding is performed with a decoder based on large-vocabulary continuous speech recognition, the acoustic model is based on hidden Markov models (HMM), the language model is an N-gram language model, and multi-pass decoding is used, the multi-pass decoding comprising a direct first-pass decode, unsupervised adaptation based on maximum likelihood linear regression, and a second-pass decode;
A5. Perform automatic scoring with the scoring model formed from the comprehensive features.
CN201310034371.7A 2013-01-23 2013-01-29 Full-automatic oral evaluation management and points-scoring system and methods of marking thereof Active CN103151042B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310034371.7A CN103151042B (en) 2013-01-23 2013-01-29 Full-automatic oral evaluation management and points-scoring system and methods of marking thereof

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN2013100276765 2013-01-23
CN201310027676.5 2013-01-23
CN201310027676 2013-01-23
CN201310034371.7A CN103151042B (en) 2013-01-23 2013-01-29 Full-automatic oral evaluation management and points-scoring system and methods of marking thereof

Publications (2)

Publication Number Publication Date
CN103151042A CN103151042A (en) 2013-06-12
CN103151042B true CN103151042B (en) 2016-02-24

Family

ID=48549064

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310034371.7A Active CN103151042B (en) 2013-01-23 2013-01-29 Full-automatic oral evaluation management and points-scoring system and methods of marking thereof

Country Status (1)

Country Link
CN (1) CN103151042B (en)


Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104347071B (en) * 2013-08-02 2020-02-07 科大讯飞股份有限公司 Method and system for generating reference answers of spoken language test
CN103761975B (en) * 2014-01-07 2017-05-17 苏州驰声信息科技有限公司 Method and device for oral evaluation
CN103985391A (en) * 2014-04-16 2014-08-13 柳超 Phonetic-level low power consumption spoken language evaluation and defect diagnosis method without standard pronunciation
CN103985392A (en) * 2014-04-16 2014-08-13 柳超 Phoneme-level low-power consumption spoken language assessment and defect diagnosis method
CN104505089B (en) * 2014-12-17 2018-05-18 福建网龙计算机网络信息技术有限公司 Spoken error correction method and equipment
CN106156905A (en) * 2015-03-25 2016-11-23 肖圣林 Mobile interchange learning platform
CN104810017B (en) * 2015-04-08 2018-07-17 广东外语外贸大学 Oral evaluation method and system based on semantic analysis
CN105632488A (en) * 2016-02-23 2016-06-01 深圳市海云天教育测评有限公司 Voice evaluation method and device
CN108010513B (en) * 2016-10-28 2021-05-14 北京回龙观医院 Voice processing method and device
CN108154735A (en) * 2016-12-06 2018-06-12 爱天教育科技(北京)有限公司 Oral English Practice assessment method and device
CN106847263B (en) * 2017-01-13 2020-09-11 科大讯飞股份有限公司 Speech level evaluation method, device and system
CN106652622B (en) * 2017-02-07 2019-09-17 广东小天才科技有限公司 Text training method and device
CN107316255A (en) * 2017-04-07 2017-11-03 苏州清睿教育科技股份有限公司 A kind of efficient competition method competed online that shuttles
CN109686383B (en) * 2017-10-18 2021-03-23 腾讯科技(深圳)有限公司 Voice analysis method, device and storage medium
CN109785698B (en) * 2017-11-13 2021-11-23 上海流利说信息技术有限公司 Method, device, electronic equipment and medium for oral language level evaluation
CN108090047B (en) * 2018-01-10 2022-05-24 华南师范大学 Text similarity determination method and equipment
CN108363687A (en) * 2018-01-16 2018-08-03 深圳市脑洞科技有限公司 Subjective item scores and its construction method, electronic equipment and the storage medium of model
CN108428382A (en) * 2018-02-14 2018-08-21 广东外语外贸大学 It is a kind of spoken to repeat methods of marking and system
CN108447504A (en) * 2018-03-20 2018-08-24 校宝在线(杭州)科技股份有限公司 A kind of oral evaluation method based on brain wave
CN108766415B (en) * 2018-05-22 2020-11-24 清华大学 Voice evaluation method
CN108831212B (en) * 2018-06-28 2020-10-23 深圳语易教育科技有限公司 Auxiliary device and method for oral teaching
AU2019316649A1 (en) * 2018-08-10 2021-03-11 Actively Learn, Inc. Short answer grade prediction
CN109036464B (en) * 2018-09-17 2022-02-22 腾讯科技(深圳)有限公司 Pronunciation error detection method, apparatus, device and storage medium
CN109256152A (en) * 2018-11-08 2019-01-22 上海起作业信息科技有限公司 Speech assessment method and device, electronic equipment, storage medium
CN109523194B (en) * 2018-12-14 2021-06-01 深圳市小柠檬教育科技有限公司 Chinese reading ability evaluation method and device and readable storage medium
CN109410984B (en) * 2018-12-20 2022-12-27 广东小天才科技有限公司 Reading scoring method and electronic equipment
CN109829155B (en) * 2019-01-18 2024-03-22 平安科技(深圳)有限公司 Keyword determination method, automatic scoring method, device, equipment and medium
CN109858877A (en) * 2019-01-22 2019-06-07 北京五岳鑫信息技术股份有限公司 A kind of test questions input method and device
CN109545243B (en) * 2019-01-23 2022-09-02 北京猎户星空科技有限公司 Pronunciation quality evaluation method, pronunciation quality evaluation device, electronic equipment and storage medium
CN110085256B (en) * 2019-03-21 2021-11-19 视联动力信息技术股份有限公司 Information processing method and device
CN109830230B (en) * 2019-03-27 2020-09-01 深圳平安综合金融服务有限公司上海分公司 Data labeling method and device based on self-learning algorithm
CN110164422A (en) * 2019-04-03 2019-08-23 苏州驰声信息科技有限公司 A kind of the various dimensions appraisal procedure and device of speaking test
CN109979256A (en) * 2019-04-24 2019-07-05 上海乂学教育科技有限公司 English pronunciation teaching system
CN110491369A (en) * 2019-07-24 2019-11-22 北京大米科技有限公司 Appraisal procedure, device, storage medium and the electronic equipment of spoken grade
CN110570838B (en) * 2019-08-02 2022-06-07 北京葡萄智学科技有限公司 Voice stream processing method and device
CN110797010A (en) * 2019-10-31 2020-02-14 腾讯科技(深圳)有限公司 Question-answer scoring method, device, equipment and storage medium based on artificial intelligence
CN111047932A (en) * 2020-01-02 2020-04-21 上海健坤教育科技有限公司 Voice interactive teaching system
CN111383495A (en) * 2020-03-04 2020-07-07 苏州驰声信息科技有限公司 In-class explanation system, method, device and medium for spoken language teaching
CN111599234A (en) * 2020-05-19 2020-08-28 黑龙江工业学院 Automatic English spoken language scoring system based on voice recognition
CN112767932A (en) * 2020-12-11 2021-05-07 北京百家科技集团有限公司 Voice evaluation system, method, device, equipment and computer readable storage medium
CN112951207B (en) * 2021-02-10 2022-01-07 网易有道信息技术(北京)有限公司 Spoken language evaluation method and device and related product
CN112992184B (en) * 2021-04-20 2021-09-10 北京世纪好未来教育科技有限公司 Pronunciation evaluation method and device, electronic equipment and storage medium
CN113626768A (en) * 2021-08-12 2021-11-09 上海好学网络科技有限公司 Multi-score multi-intelligent-score joint-scoring method for spoken language examination
CN115497455B (en) * 2022-11-21 2023-05-05 山东山大鸥玛软件股份有限公司 Intelligent evaluating method, system and device for oral English examination voice
CN118035711B (en) * 2024-04-11 2024-06-14 国家计算机网络与信息安全管理中心江苏分中心 Evaluation method and system for large model content security capability


Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8775184B2 (en) * 2009-01-16 2014-07-08 International Business Machines Corporation Evaluating spoken skills

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101246685A (en) * 2008-03-17 2008-08-20 清华大学 Pronunciation quality evaluation method of computer auxiliary language learning system
CN101551947A (en) * 2008-06-11 2009-10-07 俞凯 Computer system for assisting spoken language learning
CN101393103A (en) * 2008-10-31 2009-03-25 中国科学院力学研究所 Hydrate microscopic seepage experimental apparatus
CN101740024A (en) * 2008-11-19 2010-06-16 中国科学院自动化研究所 Method for automatic evaluation based on generalized fluent spoken language fluency
CN102034475A (en) * 2010-12-08 2011-04-27 中国科学院自动化研究所 Method for interactively scoring open short conversation by using computer
CN102509483A (en) * 2011-10-31 2012-06-20 苏州思必驰信息科技有限公司 Distributive automatic grading system for spoken language test and method thereof

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106782603A (en) * 2016-12-22 2017-05-31 上海语知义信息技术有限公司 Intelligent sound evaluating method and system
CN106782603B (en) * 2016-12-22 2020-08-11 云知声(上海)智能科技有限公司 Intelligent voice evaluation method and system

Also Published As

Publication number Publication date
CN103151042A (en) 2013-06-12

Similar Documents

Publication Publication Date Title
CN103151042B (en) Full-automatic oral evaluation management and points-scoring system and methods of marking thereof
CN105741831B (en) A kind of oral evaluation method and system based on syntactic analysis
CN101740024B (en) Method for automatic evaluation of spoken language fluency based on generalized fluency
CN101739867B (en) Method for scoring interpretation quality by using computer
CN109785698B (en) Method, device, electronic equipment and medium for oral language level evaluation
CN103065626B (en) Automatic grading method and automatic grading equipment for read questions in test of spoken English
CN101751919B (en) Spoken Chinese stress automatic detection method
CN103559892B (en) Oral evaluation method and system
CN103594087B (en) Improve the method and system of oral evaluation performance
CN105845134A (en) Spoken language evaluation method through freely read topics and spoken language evaluation system thereof
CN103559894A (en) Method and system for evaluating spoken language
CN104464423A (en) Calibration optimization method and system for speaking test evaluation
CN109697988B (en) Voice evaluation method and device
Delais-Roussarie et al. Learner corpora and prosody: From the COREIL corpus to principles on data collection and corpus design
CN109658918A (en) A kind of intelligence Oral English Practice repetition topic methods of marking and system
KR100995847B1 (en) Language training method and system based sound analysis on internet
CN110598041A (en) FlACS real-time analysis method and device
CN113205729A (en) Foreign student-oriented speech evaluation method, device and system
CN110010123A (en) English phonetic word pronunciation learning evaluation system and method
Wang et al. LAIX Corpus of Chinese Learner English: Towards a Benchmark for L2 English ASR.
Zechner et al. Using automatic speech recognition to assess the reading proficiency of a diverse sample of middle school students
CN115116474A (en) Spoken language scoring model training method, scoring method, device and electronic equipment
CN114241835A (en) Student spoken language quality evaluation method and device
Jing et al. The speech evaluation method of English phoneme mobile learning system
CN111599234A (en) Automatic English spoken language scoring system based on voice recognition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant