CN1979638A - Method for correcting error of voice identification result - Google Patents

Method for correcting error of voice identification result Download PDF

Info

Publication number
CN1979638A
CN1979638A CNA2005101274476A CN200510127447A CN1979638A CN 1979638 A CN1979638 A CN 1979638A CN A2005101274476 A CNA2005101274476 A CN A2005101274476A CN 200510127447 A CN200510127447 A CN 200510127447A CN 1979638 A CN1979638 A CN 1979638A
Authority
CN
China
Prior art keywords
error correction
masterplate
degree
fragment
recognition result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2005101274476A
Other languages
Chinese (zh)
Inventor
王晓瑞
江杰
王士进
丁鹏
徐波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CNA2005101274476A priority Critical patent/CN1979638A/en
Publication of CN1979638A publication Critical patent/CN1979638A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

The invention relates to the voice recognition technical field, especially relating to a voice recognition result correcting method, namely a method for correct the recognition result by error correction knowledge library, and the most basic characteristics of the method comprises: 1. using continuous language fragments in corpus as error correcting template, and using the corpus to build error correction template library; 2. indexing for the error correction template library and using the searching technique to fast find error correction templates; 3. according to error correction modes, using creditability to cut recognition result into short recognition fragments and submitting creditable parts in the recognition fragments to an error correction template system for fast find, and obtaining error correction template candidates highly related with the recognition fragments; and 4. using acoustic confusion matrix to select templates close to acoustic characteristics of the recognition fragments from the error correction template candidates to make substitution error correction.

Description

A kind of method for correcting error of voice identification result
Technical field
The present invention relates to the speech recognition technology field, particularly a kind of method for correcting error of voice identification result.
Background technology
Present most of speech recognition system adopts the N unit syntax (Ngram) language model, because there is a kind of incomplete independence assumption in this model, suppose that promptly current speech only relies on this speech N-1 speech before, its limitation shows the uncertain inference of its just preceding N-1 speech, causes recognition result skimble-skamble sentence or fragment often to occur.
Summary of the invention
The present invention proposes a kind of method for correcting error of voice identification result, can utilize variable length error correction masterplate, recognition result is carried out error correction according to degree of confidence and acoustics degree of obscuring.The present invention can be used for large vocabulary continuous speech recognition system.The present invention mainly contains following feature: the one, and as the error correction masterplate, utilize corpus to set up the error correction template library with the connected speech fragment in the corpus; The 2nd, the error correction template library is set up index, use Fast search technique that the error correction template library is searched fast; The 3rd, according to error correction mode, utilize degree of confidence that recognition result is cut into short identification fragment, and will discern the error correction masterplate system that the believable part in the fragment submits to and search fast, obtain and discern the high error correction masterplate candidate of fragment correlativity; The 4th, utilize acoustics degree of obscuring matrix from error correction masterplate candidate, to select and discern the close masterplate of fragment acoustic feature and replace error correction.
Technical scheme
A kind of method for correcting error of voice identification result may further comprise the steps:
1) recognition system is discerned computing and confidence calculations to the input voice, is had
The recognition result of degree of confidence;
2), recognition result is cut into little identification fragment according to the height of degree of confidence according to error correction mode;
3) resulting identification fragment is input to error correction masterplate searching system, obtains and discern the high error correction masterplate candidate list of fragment correlativity;
4) calculate the acoustics degree of obscuring of discerning error correction masterplate in fragment and the candidate list, select the wherein the highest masterplate of acoustics similarity, when the similarity degree of identification segment and this error correction masterplate during greater than a reliable thresholding, use error correction masterplate replaces this recognition result fragment;
5) fragment after the error correction is merged, obtain the recognition result after the error correction.
Described method for correcting error of voice identification result also comprises, the input voice are being carried out confidence calculations when discerning computing, obtains having the step of the recognition result of degree of confidence.
Described a kind of method for correcting error of voice identification result, also comprise, when recognition result being cut into little identification fragment according to the height of putting letter, the long word of degree of confidence thresholding CM-threshold and system's error correction template at first is set counts max-var-length, think when degree of confidence is higher than CM-threshold that recognition result is reliable, reliable number of words must not be greater than max-var-length in the identification fragment after the cutting.
Described method for correcting error of voice identification result, also comprise, with the recognition result piecemeal, the stroke that continuous degree of confidence is higher or lower than CM-threshold is the step of a module, is about to recognition result and divides one or more (A, x into, B) pattern constitutes, wherein A, B are the module that degree of confidence is higher than CM-threshold, and x is the module that degree of confidence is lower than CM-threshold, and maximum one of A and B are empty module.
Described method for correcting error of voice identification result, also comprise, for low confidence module x all in the recognition result, if the length of A is the part of max-var-length with length adjacent with x among the A then more than or equal to max-var-length, be made as sub-A, form fragment (sub-A with x, x), sub-A is used for searching for the error correction template library, the unfixed step of the length of sub-A.
Described method for correcting error of voice identification result, also comprise, for low confidence module x all in the recognition result, if A length is less than max-var-length, then with part sub-B adjacent among the B with x, with A, x form fragment (A, x, sub-B), A and sub-B are used for searching for the step of error correction template library, maximum one of A and sub-B can be empty module, the length of A and sub-B and must not be wherein greater than max-var-length, and the length of A and sub-B is fixing.
Described method for correcting error of voice identification result also comprises, recognition result is cut into fragment after, the believable part of each fragment is searched in the error correction template library fast, obtain the step of the high error correction masterplate of one or more and identification fragment correlativity.
Described method for correcting error of voice identification result also comprises, error correction masterplate searching system comprises two parts, and first is the foundation of error correction masterplate index, and second portion is the search of error correction masterplate.
Described method for correcting error of voice identification result, also comprise, wherein the ultimate principle of first is, the language fragments of all continuous numbers of words between 6 to 12 in the corpus as the error correction masterplate, at first from corpus, extract all error correction masterplates, adopt inverted file the error correction template library to be set up index then,, need compress inverted file in order to reduce the size of inverted file as index structure.
Described method for correcting error of voice identification result, also comprise, wherein the ultimate principle of second portion is, at first believable part is converted to boolean queries during inquiry, in index database, search for fast, have the characteristics of sequential, locality at voice identification result, when being converted to boolean queries, need to add the sequential requirement of believable part and the locality requirement between speech and speech.
Described method for correcting error of voice identification result, also comprise, all results that search is returned for the error correction masterplate use the error correction masterplate and the acoustics degree of obscuring of identification segment to select the step of optimum masterplate, for each masterplate T in identification Segment A and the error correction masterplate candidate list i, calculate A and T iDegree of obscuring C (A, T i), as maximal value maxC (A, T wherein i) when surpassing a reliable thresholding, we use this error correction masterplate to replace the identification fragment, if maxC (A, T i) less than this thresholding, then keep the identification fragment.
Described method for correcting error of voice identification result, also comprise, the error correction masterplate comprises that with the calculating of the acoustics degree of obscuring of identification segment three parts constitute, first is the statistics of the female identity confusion situation of Chinese phonetic, second portion is that the posterior probability of the female identity confusion degree of Chinese phonetic is calculated, and third part is the fuzzy whole matching of identification segment and error correction masterplate.
Described method for correcting error of voice identification result, also comprise, the ultimate principle of first is, speech database is discerned, and obtain the situation of obscuring between situation and all simple or compound vowel of a Chinese syllable obscured between all initial consonants in the following manner: suppose can not produce between the sound mother and obscure, if its recognition result of one of them sample is pinyin string C 1' V 1' C 2' V 2' ... C m' V m', this recognition result and correct C 1V 1C 2V 2C nV nCarry out dynamically making that to whole it can be to last pinyin string maximum, it is right so just to obtain a large amount of pinyin string, i.e. (C 1', C 1), (V 1', V 1) ... (C m', C n) (V m', V n), add up the right occurrence number of these pinyin string, obtain the total sample number of each initial consonant and it is identified as the number of times of other each initial consonants, and obtain the total sample number of each simple or compound vowel of a Chinese syllable and it is identified as the number of times of other each simple or compound vowel of a Chinese syllable.
Described method for correcting error of voice identification result also comprises, wherein the ultimate principle of second portion is, according to the statistics of first, at first calculates the probability that each initial consonant is identified as other initial consonants, C iBe confused into C jBlur level, its computing formula is:
P ( C j | C i ) = Σ ( C i , C j ) | C i | ∑ (C wherein i, C j) be C iKnown
Wei C jSum, | C i| be C iTotal sample number,
When recognition result is C iThe time, correct result should be C jPosterior probability:
P ~ ( C j | C i ) = P ( C i | C j ) P ( C j ) Σ k P ( C i | C k ) P ( C k )
Wherein P ( C j ) = | C j | Σ | C i | , ∑ | C i| represent the total sample number of all initial consonants, the computing method and the initial consonant of simple or compound vowel of a Chinese syllable are similar.
Described method for correcting error of voice identification result also comprises, wherein the 3rd ultimate principle is, the pinyin string of establishing the identification Segment A is C 1' V 1' C 2' V 2' ... C m' V m' establish i masterplate T in the error correction masterplate candidate list iPinyin string be C 1V 1C 2V 2C nV n, then define A and T iAcoustics degree of obscuring C (A, T i) be: find an alignment (1, i 1), (2, i 2) ... (k, i k) ... (m, i m), this alignment makes P ~ ( T i | A ) = Π P ~ ( C k | C i k ) P ~ ( V k | V i k ) Obtain maximal value, defining this maximal value is A and T iAcoustics degree of obscuring.
Described method for correcting error of voice identification result also comprises, when practical application, calculates after at first posterior probability being taken the logarithm, and problem is converted into feasible
Log P ~ ( T i | A ) = ΣLog P ~ ( C k | C i k ) + ΣLog P ~ ( V k | V i k )
Obtain maximal value, use this maximal value as A and T this moment iLogarithm acoustics degree of obscuring.
Embodiment
The present invention mainly contains three modules, and the one, utilize the cutting of degree of confidence to recognition result, the 2nd, the acquisition of error correction masterplate candidate list, the 3rd, the calculating of identification segment and error correction masterplate acoustics degree of obscuring.Described in detail below.
Utilize the cutting of degree of confidence to recognition result.The long word of degree of confidence thresholding CM-threshold and system's error correction template at first is set counts max-var-length, the recognition result that degree of confidence is higher than CM-threshold is thought reliable, then recognition result is carried out cutting, and step is as follows:
1. with the recognition result piecemeal, the stroke that continuous degree of confidence is higher or lower than CM-threshold is a module, divide recognition result into one or more (A, x, B) structure constitutes, wherein A, B are the module that degree of confidence is higher than CM-threshold, and x is the module that degree of confidence is lower than CM-threshold, and maximum one of A and B are empty module.
2. for low confidence module x all in the recognition result
A) if the length of A more than or equal to max-var-length, is the part of max-var-length with length adjacent with x among the A then, be made as sub-A, with x form fragment (sub-A, x), sub-A is used for searching for the error correction template library, the length of sub-A is fixing;
B) if A length less than max-var-length, with part sub-B adjacent with x among the B, (sub-B), A and sub-B are used for searching for the error correction template library for A, x, and maximum one of A and sub-B can be empty module to form fragment with A, x.The length of A and sub-B and must not be greater than max-var-length wherein, the length of A and sub-B is fixing.
The acquisition of error correction masterplate candidate list.After recognition result is cut into fragment, the believable part in each fragment is submitted to error correction masterplate searching system, obtains and discern the high error correction masterplate candidate of fragment correlativity.Error correction masterplate searching system comprises two parts, and first is the foundation of error correction masterplate index, and second portion is the quick search to the error correction template library.
Wherein the ultimate principle of first is, the language fragments of all continuous numbers of words between 6 to 12 in the corpus as the error correction masterplate, at first from corpus, extract all error correction masterplates, adopt inverted file the error correction template library to be set up index then as index structure.In order to reduce the size of inverted file, need compress inverted file.
Wherein the ultimate principle of second portion is, at first the believable part in the fragment is converted to boolean queries, retrieves fast in index database.Have sequential, locality characteristics at voice identification result, when being converted to boolean queries, need to add the sequential requirement of believable part in the fragment and the locality requirement between speech and speech.
The calculating of acoustics degree of obscuring.The error correction masterplate often obtains one or more candidates, at this moment uses the error correction masterplate and the acoustics degree of obscuring of identification segment to select optimum masterplate.For each masterplate T in identification Segment A and the error correction masterplate candidate list i, calculate A and T iDegree of obscuring C (A, T i), as maximal value maxC (A, T wherein i) when surpassing a reliable thresholding, we use this error correction masterplate to replace the identification fragment, if maxC (A, T i) less than this thresholding, then keep the identification fragment.
The calculating of degree of obscuring comprises that three parts constitute, and first is the statistics of the female identity confusion situation of Chinese phonetic, and second portion is that the posterior probability of the female identity confusion degree of Chinese phonetic is calculated, and third part is the fuzzy whole matching of identification segment and error correction masterplate.
Wherein the ultimate principle of first is, speech database is discerned, and obtain the situation of obscuring between situation and all simple or compound vowel of a Chinese syllable obscured between all initial consonants in the following manner: suppose can not produce between the sound mother and obscure, if its recognition result of one of them sample is pinyin string C 1' V 1' C 2' V 2' ... C m' V m', this recognition result and correct C 1V 1C 2V 2C nV nCarry out dynamically making that to whole it can be to last pinyin string maximum, it is right so just to obtain a large amount of pinyin string, i.e. (C 1', C 1), (V 1', V 1) ... (C m', C n) (V m', V n), add up the right occurrence number of these pinyin string, obtain the total sample number of each initial consonant and it is identified as the number of times of other each initial consonants, and obtain the total sample number of each simple or compound vowel of a Chinese syllable and it is identified as the number of times of other each simple or compound vowel of a Chinese syllable.
Wherein the ultimate principle of second portion is, according to the statistics of first, at first calculates the probability that each initial consonant is identified as other initial consonants, C iBe confused into C jBlur level, its computing formula is:
P ( C j | C i ) = Σ ( C i , C j ) | C i |
∑ (C wherein i, C j) expression C iBe identified as C jSum, | C i| expression C iTotal sample number,
When recognition result is C iThe time correct result should be C jPosterior probability:
P ~ ( C j | C i ) = P ( C i | C j ) P ( C j ) Σ k P ( C i | C k ) P ( C k )
Wherein P ( C j ) = | C j | Σ | C i | , ∑ | C i| represent the total sample number of all initial consonants.
Wherein the 3rd ultimate principle is, the pinyin string of establishing the identification Segment A is C 1' V 1' C 2' V 2' ... C m' V m', establish i masterplate T in the error correction masterplate candidate list iPinyin string be C 1V 1C 2V 2C nV n, then define A and T iAcoustics degree of obscuring C (A, T i) be: find an alignment (1, i 1), (2, i 2) ... (k, i k) ... (m, i m), this alignment makes
P ~ ( T i | A ) = Π P ~ ( C k | C i k ) P ~ ( V k | V i k ) - - - ( 1 )
Obtain maximal value, defining this maximal value is A and T iAcoustics degree of obscuring.
When practical application, calculate after at first posterior probability being taken the logarithm, problem is converted into feasible
Log P ~ ( T i | A ) = Σ Log P ~ ( C k | C i k ) + ΣLog P ~ ( V k | V i k )
Obtain maximal value, use this maximal value as A and T this moment iLogarithm acoustics degree of obscuring.

Claims (16)

1. method for correcting error of voice identification result may further comprise the steps:
1) recognition system is discerned computing and confidence calculations to the input voice, obtains having the recognition result of degree of confidence;
2), recognition result is cut into little identification fragment according to the height of degree of confidence according to error correction mode;
3) resulting identification fragment is input to error correction masterplate searching system, obtains and discern the high error correction masterplate candidate list of fragment correlativity;
4) calculate the acoustics degree of obscuring of discerning error correction masterplate in fragment and the candidate list, select the wherein the highest masterplate of acoustics similarity, when the similarity degree of identification segment and this error correction masterplate during greater than a reliable thresholding, use error correction masterplate replaces this recognition result fragment;
5) fragment after the error correction is merged, obtain the recognition result after the error correction.
2. method for correcting error of voice identification result according to claim 1 is characterized in that, also comprises, the input voice are being carried out confidence calculations when discerning computing, obtains having the step of the recognition result of degree of confidence.
3. a kind of method for correcting error of voice identification result according to claim 1, it is characterized in that, also comprise, when recognition result being cut into little identification fragment according to the height of putting letter, the long word of degree of confidence thresholding CM-threshold and system's error correction template at first is set counts max-var-length, think when degree of confidence is higher than CM-threshold that recognition result is reliable, reliable number of words must not be greater than max-var-length in the identification fragment after the cutting.
4. according to claim 1 or 3 described method for correcting error of voice identification result, it is characterized in that, comprise that also with the recognition result piecemeal, the stroke that continuous degree of confidence is higher or lower than CM-threshold is the step of a module, be about to recognition result and divide one or more (A into, x, B) pattern constitutes, and wherein A, B are the module that degree of confidence is higher than CM-threshold, x is the module that degree of confidence is lower than CM-threshold, and maximum one of A and B are empty module.
5. according to claim 1 or 3 or 4 described method for correcting error of voice identification result, it is characterized in that, also comprise, for low confidence module x all in the recognition result, if the length of A is more than or equal to max-var-length, be the part of max-var-length then with length adjacent among the A with x, be made as sub-A, with x form fragment (sub-A, x), sub-A is used for searching for the error correction template library, the unfixed step of the length of sub-A.
6. according to claim 1 or 3 or 4 described method for correcting error of voice identification result, it is characterized in that, also comprise, for low confidence module x all in the recognition result, if A length is less than max-var-length, then with part sub-B adjacent among the B with x, form fragment (A with A, x, x, sub-B), A and sub-B are used for searching for the step of error correction template library, and maximum one of A and sub-B can be empty module, the length of A and sub-B and must not be greater than max-var-length wherein, the length of A and sub-B is fixing.
7. according to claim 1 or 5 or 6 described method for correcting error of voice identification result, it is characterized in that, also comprise, after recognition result is cut into fragment, the believable part of each fragment is searched in the error correction template library fast, obtain step one or more and the error correction masterplate that identification fragment correlativity is high.
8. according to claim 1 or 7 described method for correcting error of voice identification result, it is characterized in that also comprise, error correction masterplate searching system comprises two parts, first is the foundation of error correction masterplate index, and second portion is the search of error correction masterplate.
9. according to claim 1 or 8 described method for correcting error of voice identification result, it is characterized in that, also comprise, wherein the ultimate principle of first is, the language fragments of all continuous numbers of words between 6 to 12 in the corpus at first extracted all error correction masterplates as the error correction masterplate from corpus, adopt inverted file the error correction template library to be set up index then as index structure, in order to reduce the size of inverted file, need compress inverted file.
10. according to claim 1 or 8 described method for correcting error of voice identification result, it is characterized in that, also comprise, wherein the ultimate principle of second portion is, at first believable part is converted to boolean queries during inquiry, in index database, searches for fast, have the characteristics of sequential, locality at voice identification result, when being converted to boolean queries, need to add to the sequential requirement of believable part and the locality requirement between speech and speech.
11. according to claim 1 or 7 described method for correcting error of voice identification result, it is characterized in that, also comprise, all results that search is returned for the error correction masterplate, use the error correction masterplate and the acoustics degree of obscuring of identification segment to select the step of optimum masterplate, for each masterplate T in identification Segment A and the error correction masterplate candidate list i, calculate A and T iDegree of obscuring C (A, T i), as maximal value maxC (A, T wherein i) when surpassing a reliable thresholding, we use this error correction masterplate to replace the identification fragment, if maxC (A, T i) less than this thresholding, then keep the identification fragment.
12. according to claim 1 or 11 described method for correcting error of voice identification result, it is characterized in that, also comprise, the error correction masterplate comprises that with the calculating of the acoustics degree of obscuring of identification segment three parts constitute, first is the statistics of the female identity confusion situation of Chinese phonetic, second portion is that the posterior probability of the female identity confusion degree of Chinese phonetic is calculated, and third part is the fuzzy whole matching of identification segment and error correction masterplate.
13. according to claim 1 or 12 described method for correcting error of voice identification result, it is characterized in that, also comprise, the ultimate principle of first is, speech database is discerned, and obtain the situation of obscuring between situation and all simple or compound vowel of a Chinese syllable obscured between all initial consonants in the following manner: suppose can not produce between the sound mother and obscure, if its recognition result of one of them sample is pinyin string C 1' V 1' C 2' V 2' ... C m' V m', this recognition result and correct C 1V 1C 2V 2C nV nCarry out dynamically making that to whole it can be to last pinyin string maximum, it is right so just to obtain a large amount of pinyin string, i.e. (C 1', C 1), (V 1', V 1) ... (C m', C n) (V m', V n), add up the right occurrence number of these pinyin string, obtain the total sample number of each initial consonant and it is identified as the number of times of other each initial consonants, and obtain the total sample number of each simple or compound vowel of a Chinese syllable and it is identified as the number of times of other each simple or compound vowel of a Chinese syllable.
14., it is characterized in that also comprise, wherein the ultimate principle of second portion is according to claim 1 or 12 or 13 described method for correcting error of voice identification result,, calculate the probability that each initial consonant is identified as other initial consonants, C earlier according to the statistics of first iBe confused into C jBlur level, its computing formula is:
P ( C j | C i ) = Σ ( C i , C j ) | C i |
∑ (C wherein i, C j) be C iBe identified as C jSum, | C i| be C iTotal sample number is when recognition result is C iThe time, correct result should be C jPosterior probability:
P ~ ( C j | C i ) = P ( C i | C j ) P ( C j ) Σ k P ( C i | C k ) P ( C k )
Wherein P ( C j ) = | C j | Σ | C i | , ∑ | C i| represent the total sample number of all initial consonants, the computing method and the initial consonant of simple or compound vowel of a Chinese syllable are similar.
15., it is characterized in that also comprise, wherein the 3rd ultimate principle is according to claim 1 or 12 described method for correcting error of voice identification result, the pinyin string of establishing the identification Segment A is C 1' V 1' C 2' V 2' ... C m' V m' establish i masterplate T in the error correction masterplate candidate list iPinyin string be C 1V 1C 2V 2C nV n, then define A and T iAcoustics degree of obscuring C (A, T i) be: find an alignment (1, i 1), (2, i 2) ... (k, i k) ... (m, i m), this alignment makes P ~ ( T i | A ) = Π P ~ ( C k | C i k ) P ~ ( V k | V i k ) Obtain maximal value, defining this maximal value is A and T iAcoustics degree of obscuring.
16., it is characterized in that according to claim 1 or 12 or 15 described method for correcting error of voice identification result, also comprise, when practical application, calculate after at first posterior probability being taken the logarithm, problem is converted into feasible
Log P ~ ( T i | A ) = ΣLog P ~ ( C k | C i k ) + ΣLog P ~ ( V k | V i k )
Obtain maximal value, use this maximal value as A and T this moment iLogarithm acoustics degree of obscuring.
CNA2005101274476A 2005-12-02 2005-12-02 Method for correcting error of voice identification result Pending CN1979638A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2005101274476A CN1979638A (en) 2005-12-02 2005-12-02 Method for correcting error of voice identification result

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2005101274476A CN1979638A (en) 2005-12-02 2005-12-02 Method for correcting error of voice identification result

Publications (1)

Publication Number Publication Date
CN1979638A true CN1979638A (en) 2007-06-13

Family

ID=38130774

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2005101274476A Pending CN1979638A (en) 2005-12-02 2005-12-02 Method for correcting error of voice identification result

Country Status (1)

Country Link
CN (1) CN1979638A (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101894549A (en) * 2010-06-24 2010-11-24 中国科学院声学研究所 Method for fast calculating confidence level in speech recognition application field
CN101452701B (en) * 2007-12-05 2011-09-07 株式会社东芝 Confidence degree estimation method and device based on inverse model
CN102831177A (en) * 2012-07-31 2012-12-19 聚熵信息技术(上海)有限公司 Statement error correction method and system
CN103000176A (en) * 2012-12-28 2013-03-27 安徽科大讯飞信息科技股份有限公司 Speech recognition method and system
CN103021412A (en) * 2012-12-28 2013-04-03 安徽科大讯飞信息科技股份有限公司 Voice recognition method and system
CN105244029A (en) * 2015-08-28 2016-01-13 科大讯飞股份有限公司 Voice recognition post-processing method and system
CN105632499A (en) * 2014-10-31 2016-06-01 株式会社东芝 Method and device for optimizing voice recognition result
CN106340293A (en) * 2015-07-06 2017-01-18 无锡天脉聚源传媒科技有限公司 Method and device for adjusting audio data recognition result
CN106534548A (en) * 2016-11-17 2017-03-22 科大讯飞股份有限公司 Voice error correction method and device
CN106683677A (en) * 2015-11-06 2017-05-17 阿里巴巴集团控股有限公司 Method and device for recognizing voice
CN106710592A (en) * 2016-12-29 2017-05-24 北京奇虎科技有限公司 Speech recognition error correction method and speech recognition error correction device used for intelligent hardware equipment
CN106875943A (en) * 2017-01-22 2017-06-20 上海云信留客信息科技有限公司 A kind of speech recognition system for big data analysis
CN107045496A (en) * 2017-04-19 2017-08-15 畅捷通信息技术股份有限公司 The error correction method and error correction device of text after speech recognition
CN108052498A (en) * 2010-01-05 2018-05-18 谷歌有限责任公司 The words grade of phonetic entry is corrected
CN109920432A (en) * 2019-03-05 2019-06-21 百度在线网络技术(北京)有限公司 A kind of audio recognition method, device, equipment and storage medium
CN109922371A (en) * 2019-03-11 2019-06-21 青岛海信电器股份有限公司 Natural language processing method, equipment and storage medium
CN110428822A (en) * 2019-08-05 2019-11-08 重庆电子工程职业学院 A kind of speech recognition error correction method and interactive system
CN110797026A (en) * 2019-09-17 2020-02-14 腾讯科技(深圳)有限公司 Voice recognition method, device and storage medium
CN111192586A (en) * 2020-01-08 2020-05-22 北京松果电子有限公司 Voice recognition method and device, electronic equipment and storage medium
CN112784581A (en) * 2020-11-20 2021-05-11 网易(杭州)网络有限公司 Text error correction method, device, medium and electronic equipment
CN113407694A (en) * 2018-07-19 2021-09-17 深圳追一科技有限公司 Customer service robot knowledge base ambiguity detection method, device and related equipment
CN114530145A (en) * 2020-11-23 2022-05-24 中移互联网有限公司 Speech recognition result error correction method and device, and computer readable storage medium

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101452701B (en) * 2007-12-05 2011-09-07 株式会社东芝 Confidence degree estimation method and device based on inverse model
CN108052498A (en) * 2010-01-05 2018-05-18 谷歌有限责任公司 The words grade of phonetic entry is corrected
CN101894549A (en) * 2010-06-24 2010-11-24 中国科学院声学研究所 Method for fast calculating confidence level in speech recognition application field
CN102831177B (en) * 2012-07-31 2015-09-02 聚熵信息技术(上海)有限公司 Statement error correction and system thereof
CN102831177A (en) * 2012-07-31 2012-12-19 聚熵信息技术(上海)有限公司 Statement error correction method and system
CN103021412B (en) * 2012-12-28 2014-12-10 安徽科大讯飞信息科技股份有限公司 Voice recognition method and system
CN103021412A (en) * 2012-12-28 2013-04-03 安徽科大讯飞信息科技股份有限公司 Voice recognition method and system
CN103000176B (en) * 2012-12-28 2014-12-10 安徽科大讯飞信息科技股份有限公司 Speech recognition method and system
CN103000176A (en) * 2012-12-28 2013-03-27 安徽科大讯飞信息科技股份有限公司 Speech recognition method and system
CN105632499B (en) * 2014-10-31 2019-12-10 株式会社东芝 Method and apparatus for optimizing speech recognition results
CN105632499A (en) * 2014-10-31 2016-06-01 株式会社东芝 Method and device for optimizing voice recognition result
CN106340293A (en) * 2015-07-06 2017-01-18 无锡天脉聚源传媒科技有限公司 Method and device for adjusting audio data recognition result
CN106340293B (en) * 2015-07-06 2019-11-29 无锡天脉聚源传媒科技有限公司 A kind of method of adjustment and device of audio data recognition result
CN105244029B (en) * 2015-08-28 2019-02-26 安徽科大讯飞医疗信息技术有限公司 Voice recognition post-processing method and system
CN105244029A (en) * 2015-08-28 2016-01-13 科大讯飞股份有限公司 Voice recognition post-processing method and system
CN106683677A (en) * 2015-11-06 2017-05-17 阿里巴巴集团控股有限公司 Method and device for recognizing voice
US11664020B2 (en) 2015-11-06 2023-05-30 Alibaba Group Holding Limited Speech recognition method and apparatus
CN106534548A (en) * 2016-11-17 2017-03-22 科大讯飞股份有限公司 Voice error correction method and device
CN106534548B (en) * 2016-11-17 2020-06-12 科大讯飞股份有限公司 Voice error correction method and device
CN106710592A (en) * 2016-12-29 2017-05-24 北京奇虎科技有限公司 Speech recognition error correction method and speech recognition error correction device used for intelligent hardware equipment
CN106875943A (en) * 2017-01-22 2017-06-20 上海云信留客信息科技有限公司 A kind of speech recognition system for big data analysis
CN107045496A (en) * 2017-04-19 2017-08-15 畅捷通信息技术股份有限公司 The error correction method and error correction device of text after speech recognition
CN107045496B (en) * 2017-04-19 2021-01-05 畅捷通信息技术股份有限公司 Error correction method and error correction device for text after voice recognition
CN113407694A (en) * 2018-07-19 2021-09-17 深圳追一科技有限公司 Customer service robot knowledge base ambiguity detection method, device and related equipment
CN109920432A (en) * 2019-03-05 2019-06-21 百度在线网络技术(北京)有限公司 A kind of audio recognition method, device, equipment and storage medium
US11264034B2 (en) 2019-03-05 2022-03-01 Baidu Online Network Technology (Beijing) Co., Ltd Voice identification method, device, apparatus, and storage medium
CN109922371B (en) * 2019-03-11 2021-07-09 海信视像科技股份有限公司 Natural language processing method, apparatus and storage medium
CN109922371A (en) * 2019-03-11 2019-06-21 青岛海信电器股份有限公司 Natural language processing method, equipment and storage medium
CN110428822A (en) * 2019-08-05 2019-11-08 重庆电子工程职业学院 A kind of speech recognition error correction method and interactive system
CN110428822B (en) * 2019-08-05 2022-05-03 重庆电子工程职业学院 Voice recognition error correction method and man-machine conversation system
CN110797026A (en) * 2019-09-17 2020-02-14 腾讯科技(深圳)有限公司 Voice recognition method, device and storage medium
CN111192586A (en) * 2020-01-08 2020-05-22 北京松果电子有限公司 Voice recognition method and device, electronic equipment and storage medium
CN112784581A (en) * 2020-11-20 2021-05-11 网易(杭州)网络有限公司 Text error correction method, device, medium and electronic equipment
CN112784581B (en) * 2020-11-20 2024-02-13 网易(杭州)网络有限公司 Text error correction method, device, medium and electronic equipment
CN114530145A (en) * 2020-11-23 2022-05-24 中移互联网有限公司 Speech recognition result error correction method and device, and computer readable storage medium
CN114530145B (en) * 2020-11-23 2023-08-15 中移互联网有限公司 Speech recognition result error correction method and device and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN1979638A (en) Method for correcting error of voice identification result
US10073673B2 (en) Method and system for robust tagging of named entities in the presence of source or translation errors
US8131539B2 (en) Search-based word segmentation method and device for language without word boundary tag
JP5449521B2 (en) Search device and search program
CN101510222B (en) Multilayer index voice document searching method
US8606559B2 (en) Method and apparatus for detecting errors in machine translation using parallel corpus
CN101878476B (en) Machine translation for query expansion
CN109710929A (en) A kind of bearing calibration, device, computer equipment and the storage medium of speech recognition text
CN105957518A (en) Mongolian large vocabulary continuous speech recognition method
CN105068997B (en) The construction method and device of parallel corpora
CN104166462A (en) Input method and system for characters
Sak et al. Morpholexical and discriminative language models for Turkish automatic speech recognition
CN110942767B (en) Recognition labeling and optimization method and device for ASR language model
CN111832299A (en) Chinese word segmentation system
CN115617955B (en) Hierarchical prediction model training method, punctuation symbol recovery method and device
Nguyen et al. Improving vietnamese named entity recognition from speech using word capitalization and punctuation recovery models
JPWO2010044123A1 (en) Search device, search index creation device, and search system
CN111160014A (en) Intelligent word segmentation method
KR20230009564A (en) Learning data correction method and apparatus thereof using ensemble score
Fusayasu et al. Word-error correction of continuous speech recognition based on normalized relevance distance
Byambakhishig et al. Error correction of automatic speech recognition based on normalized web distance.
Sen et al. Bangla natural language processing: A comprehensive review of classical machine learning and deep learning based methods
Liang et al. An efficient error correction interface for speech recognition on mobile touchscreen devices
Dinarelli Spoken language understanding: from spoken utterances to semantic structures
JP2003308094A (en) Method for correcting recognition error place in speech recognition

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20070613