JPH01237597A

JPH01237597A - Voice recognizing and correcting device

Info

Publication number: JPH01237597A
Application number: JP63064599A
Authority: JP
Inventors: Toru Sanada; 真田　徹; Akihiro Kimura; 晋太木村
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1988-03-17
Filing date: 1988-03-17
Publication date: 1989-09-22

Abstract

PURPOSE: To facilitate recognition and correction by storing the first input voice, determining, when a voice containing only the part to be corrected is subsequently re-input, which part of the first input voice it corresponds to, and finalizing the recognition of the input voice when the displayed next candidate matches the input voice. CONSTITUTION: An input voice is analyzed by an acoustic analyzing part 11, and the resulting acoustic parameter time series is stored in a sentence voice spotting part 13. A sentence voice recognizing part 12 then compares the acoustic parameter time series output from the acoustic analyzing part 11 with a dictionary to convert the voice into a character sentence, and the conversion result is displayed on a display part 14. If an erroneously recognized part is found, the user re-inputs only that part by voice, and the character sentence that is the next candidate is displayed on the display part. The user is thereby released from the complicated operation of indicating the start and end of the erroneously recognized part with a cursor on the display part in order to correct a recognition error.

Description

[Detailed Description of the Invention] [Summary] The invention relates to a continuous speech recognition device that recognizes speech uttered continuously without pauses between words, and in particular to a speech recognition correction device that corrects recognition errors. Its object is to reduce the burden that cursor key operation places on the user by allowing a recognition error to be corrected simply by uttering the erroneous part again. The device comprises: an acoustic analysis section that analyzes the components of input speech; a speech recognition section that recognizes the first input speech output from the acoustic analysis section on the basis of a first candidate; means for displaying the output of the speech recognition section; a speech spotting section that stores the first input speech output from the acoustic analysis section and, when speech containing only the part to be corrected is subsequently re-input, determines which part of the first input speech it corresponds to and outputs the next candidate for that part to the speech recognition section; and means for finalizing the recognition of the input speech when the next candidate displayed by the display means matches the input speech.

[Industrial Field of Application]

The present invention relates to a continuous speech recognition device that recognizes speech uttered continuously without pauses between words, and more particularly to a speech recognition correction device that corrects recognition errors.

[Prior Art]

Conventionally, as shown in Fig. 5, speech is first input to an acoustic analysis section 1, which analyzes it and converts it into an acoustic parameter time series; that is, the speech waveform data is divided into analysis frames of a predetermined length, and the frequency characteristics of the speech power in each analysis frame are arranged in time order. Using this acoustic parameter time series, a sentence speech recognition section 2 converts the input speech into a character sentence and thereby recognizes it. Recognition is performed by comparing the acoustic parameter time series of the input speech with the acoustic parameter time series stored in a dictionary by continuous DP matching. The recognition result is displayed on a display section 4. If the displayed character sentence contains a recognition error, the user indicates the location of the error with a cursor by means of a cursor instruction section 3. The part indicated by the cursor on the display section 4 is replaced with the next-candidate sentence, and the user judges whether the newly displayed character sentence is a correct recognition result.
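As a rough illustration of the acoustic parameter time series described above, the following Python sketch splits a waveform into fixed-length analysis frames and pools the power spectrum of each frame into a few frequency bands. The function names, frame length, hop size, number of bands, and the use of a naive DFT are all assumptions made for illustration; the patent does not specify them.

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split a waveform into fixed-length analysis frames (a trailing partial frame is dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def band_powers(frame, n_bands):
    """Power spectrum of one frame via a naive DFT, pooled into n_bands equal-width bands."""
    n = len(frame)
    half = n // 2
    spectrum = []
    for k in range(half):
        coeff = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(coeff) ** 2 / n)
    width = half // n_bands
    return [sum(spectrum[b * width:(b + 1) * width]) for b in range(n_bands)]


def acoustic_parameter_series(samples, frame_len=64, hop=32, n_bands=4):
    """Time-ordered list of per-frame band powers (hypothetical parameter values)."""
    return [band_powers(f, n_bands) for f in frame_signal(samples, frame_len, hop)]
```

Each element of the returned list corresponds to one analysis frame, so the list as a whole plays the role of the time series that the later matching steps compare.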

This cursor-based correction is repeated until the recognition result the user wants is displayed; it is then output as the recognition result to the sentence speech recognition section 2.

[Problem to Be Solved by the Invention]

The conventional technique shown in Fig. 5 requires the user to indicate the location of a recognition error manually with a cursor. The user must identify the start and end of the misrecognized part in the character sentence displayed on the display section 4 and indicate them with the cursor, so the cursor movement operation is cumbersome. Consequently, although speech recognition should reduce the use of cursor keys and pointing devices such as a mouse as far as possible, the conventional technique imposes the burden of cursor key operation on the user.

An object of the present invention is to solve the above problem of the prior art and to enable the user to correct speech recognition results without moving a cursor.

[Means for Solving the Problem]

Fig. 1 is a block diagram of the principle of the present invention. Input speech is analyzed by an acoustic analysis section 11, and the resulting acoustic parameter time series is stored in a sentence speech spotting section 13. As in the conventional example, a sentence speech recognition section 12 uses the acoustic parameter time series output from the acoustic analysis section 11, compares it with a dictionary to convert the speech into a character sentence, and displays the conversion result on a display section 14. If the user sees from the displayed result that a recognition error has occurred, the user re-inputs only the erroneous part by voice. The acoustic analysis section 11 converts this re-input speech into an acoustic parameter time series in the same way. The sentence speech spotting section 13 then searches the previously stored acoustic parameter time series of the entire input speech for the part that matches, for example in speech power, the acoustic parameter time series of the re-input corrected part. In this way the misrecognized part of the entire input speech can be indicated by voice. The indicated part is passed to the sentence speech recognition section 12, and the first candidate for that part is replaced with the next-candidate character sentence. The next candidate is displayed on the display section 14, and the user judges whether the recognition is correct. This is repeated until the result the user wants is displayed on the display section 14, which is then output as the recognition result.

[Operation]

According to the present invention, after the recognition result for the entire input speech signal is displayed on the display section, a misrecognized part can be corrected by re-inputting only that part by voice, which causes the next-candidate character sentence to be displayed on the display section. The user is therefore freed entirely from the cumbersome operation of indicating the start and end of the misrecognized part with a cursor on the display section in order to correct the misrecognition.

[Embodiment]

Fig. 2 is a block diagram of one embodiment of the present invention.

Input speech is first fed to a BPF group 21, a bank of bandpass filters with many different pass bands, which produces the time series of acoustic parameters of the input speech. The acoustic parameter time series output by the BPF group 21 is stored in a storage circuit 27. It is also input to a continuous DP matching circuit 22, which performs DP matching against the acoustic parameter time series in a word dictionary 23. From the matching result, a word lattice organizing circuit 24 generates a word lattice, that is, an arrangement holding several word candidates for each position. The word lattice, with its word candidates ranked, is stored in a word lattice storage circuit 29 together with the start time and end time of each word. The first-ranked sentence candidate is displayed on a CRT 25 and stored in a storage circuit 26. When the user views the CRT 25 and presses a confirm key, thereby confirming that the recognition is correct, the storage circuit 26 outputs its contents as the recognition result.
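The word lattice described above can be pictured as a list of ranked word candidates, each carrying its start and end time. The Python sketch below is one possible representation, not the circuit layout of the embodiment: the class name, field names, romanized words, and millisecond values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class WordCandidate:
    word: str
    rank: int      # 1 is the best candidate for its time span
    start_ms: int  # start time of the word in the input speech
    end_ms: int    # end time of the word in the input speech


def first_ranked_sentence(lattice):
    """Join the rank-1 candidates in time order, as shown to the user."""
    top = sorted((c for c in lattice if c.rank == 1), key=lambda c: c.start_ms)
    return " ".join(c.word for c in top)


# Hypothetical lattice loosely modelled on the elephant example in the text.
lattice = [
    WordCandidate("zou no", 1, 0, 500),
    WordCandidate("tana wa takai", 1, 500, 1000),
    WordCandidate("hana wa nagai", 2, 500, 1000),
    WordCandidate("nai", 3, 700, 1000),
]
print(first_ranked_sentence(lattice))  # zou no tana wa takai
```

Keeping the times alongside each candidate is what later allows a time interval found by spotting to be mapped back to specific lattice entries.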

If the character sentence displayed on the CRT 25 is wrong, the user presses a correction key and re-inputs only the erroneous part by voice. When the correction key is pressed, a correction key control circuit 30 causes the filter-output acoustic parameter time series of the entire input sentence, stored in the storage circuit 27, to be input to a continuous DP matching circuit 28.

The re-input speech, which corresponds only to the erroneous part, is converted into an acoustic parameter time series by the BPF group 21 and sent to the continuous DP matching circuit 28. The continuous DP matching circuit 28 searches the stored acoustic parameter time series of the entire input sentence for the part that best matches the acoustic parameter time series of the re-input speech. The start and end times of the part found, for example from 500 ms to 1 sec after the beginning of the input speech, are sent to a start/end changing circuit 31. The start/end changing circuit 31 subtracts a predetermined threshold from the start time and adds a predetermined threshold to the end time; that is, it widens the interval of the corrected part slightly so that the candidate search can be performed accurately. The changed start and end times are sent to a search circuit 32.
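The spotting and interval-widening steps above can be sketched in Python as follows. This is a minimal illustration, not the circuit of the embodiment: it uses a subsequence form of dynamic time warping as a stand-in for continuous DP matching, treats each frame as a single number such as speech power, and the names `spot` and `widen_interval` and the 50 ms margin are assumptions.

```python
def spot(stored, query, dist=lambda a, b: abs(a - b)):
    """Return (start, end) frame indices, inclusive, of the span in `stored`
    that best matches `query` under subsequence dynamic time warping."""
    n, m = len(query), len(stored)
    inf = float("inf")
    # cost[i][j]: best cumulative distance aligning query[0..i] to a span ending at stored[j]
    cost = [[inf] * m for _ in range(n)]
    # begin[i][j]: start frame of the span that achieves cost[i][j]
    begin = [[0] * m for _ in range(n)]
    for j in range(m):
        cost[0][j] = dist(query[0], stored[j])  # the match may start anywhere
        begin[0][j] = j
    for i in range(1, n):
        for j in range(m):
            best, b = cost[i - 1][j], begin[i - 1][j]
            if j > 0:
                if cost[i - 1][j - 1] < best:
                    best, b = cost[i - 1][j - 1], begin[i - 1][j - 1]
                if cost[i][j - 1] < best:
                    best, b = cost[i][j - 1], begin[i][j - 1]
            cost[i][j] = best + dist(query[i], stored[j])
            begin[i][j] = b
    end = min(range(m), key=lambda j: cost[n - 1][j])  # the match may end anywhere
    return begin[n - 1][end], end


def widen_interval(start_ms, end_ms, margin_ms):
    """Subtract a margin from the start and add it to the end (start clamped at 0)."""
    return max(0, start_ms - margin_ms), end_ms + margin_ms


stored_power = [0, 0, 1, 5, 9, 5, 1, 0, 0, 3, 3, 0]  # whole utterance (made-up values)
reinput_power = [5, 9, 5]                             # re-uttered part (made-up values)
print(spot(stored_power, reinput_power))  # (3, 5)
print(widen_interval(500, 1000, 50))      # (450, 1050)
```

Widening the interval makes the later lattice search tolerant of small boundary errors in the spotted span, which is the purpose the text gives for the start/end changing circuit 31.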

The search circuit 32 reads the word lattice from the word lattice storage circuit 29 and finds the word that is ranked first and lies within the start and end times sent from the start/end changing circuit 31.

In other words, it finds the current first-ranked candidate word, which is the one displayed as the misrecognized word.
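The search step can be illustrated with a short Python sketch that filters a lattice for first-ranked entries lying inside the widened interval. The dictionary keys, romanized words, and times are hypothetical; only the selection rule (rank 1 and inside the start/end range) comes from the text.

```python
def find_first_ranked_in_span(lattice, start_ms, end_ms):
    """Return the rank-1 lattice entries whose time span lies inside [start_ms, end_ms]."""
    return [entry for entry in lattice
            if entry["rank"] == 1
            and start_ms <= entry["start_ms"]
            and entry["end_ms"] <= end_ms]


# Hypothetical lattice entries: word, rank, and start/end times in milliseconds.
lattice = [
    {"word": "zou no", "rank": 1, "start_ms": 0, "end_ms": 500},
    {"word": "tana wa takai", "rank": 1, "start_ms": 500, "end_ms": 1000},
    {"word": "hana wa nagai", "rank": 2, "start_ms": 500, "end_ms": 1000},
]
hits = find_first_ranked_in_span(lattice, 450, 1050)
print([h["word"] for h in hits])  # ['tana wa takai']
```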

Next, a first-rank to lowest-rank-plus-one rewriting circuit 33 rewrites the rank of the word found, currently first, to one below the lowest-ranked candidate, that is, to the lowest rank plus one. A sorting circuit 34 then sorts the word lattice by rank, so that the word previously ranked second moves up to first, the following ranks each move up by one, and the word candidate just written at the lowest rank plus one moves up to the lowest rank.
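The rank rewriting and sorting just described amount to rotating the rejected first candidate to the bottom of the list. A minimal Python sketch, assuming the candidates for one corrected span are held as (rank, word) pairs with hypothetical romanized words:

```python
def demote_first_candidate(candidates):
    """candidates: list of (rank, word) pairs for one corrected span.
    Rewrite rank 1 to lowest + 1, sort by rank, then renumber from 1."""
    lowest = max(rank for rank, _ in candidates)
    rewritten = [(lowest + 1 if rank == 1 else rank, word) for rank, word in candidates]
    rewritten.sort(key=lambda pair: pair[0])
    return [(new_rank, word) for new_rank, (_, word) in enumerate(rewritten, start=1)]


ranked = [(1, "tana wa takai"), (2, "hana wa nagai"), (3, "nai")]
print(demote_first_candidate(ranked))
# [(1, 'hana wa nagai'), (2, 'nai'), (3, 'tana wa takai')]
```

Applying the function again would promote the next candidate in turn, which mirrors repeated presses of the correction key.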

The resulting word lattice, with its candidate words arranged from first to lowest rank, is sent back to the word lattice storage circuit 29.

The word lattice organizing circuit 24 refers to the word lattice storage circuit 29 and outputs the new first-ranked candidate to the CRT 25 and the storage circuit 26.

The user then judges whether the new first-ranked candidate correctly represents the speech input and, if it does, presses the confirm key, whereupon the storage circuit 26 outputs it as the recognition result.

If it is not correct, the user presses the correction key again and re-inputs the erroneous part by voice, and the processing described above is repeated.

For example, suppose "An elephant's trunk is long" (象の鼻は長い) is input by voice. A word lattice such as the one shown in Fig. 3 is then stored in the word lattice organizing circuit 24. The first input, "An elephant's trunk is long", has the speech power shown in the figure. For it, "The elephant's shelf is high" (象の棚は高い) is the first candidate, "trunk is long" (鼻は長い) is the second candidate, and "nai" (ない) is the third candidate. The first candidate, "The elephant's shelf is high", is output to the CRT 25. In this case the user presses the correction key and re-inputs by voice only the erroneous part, "trunk is long". Then, as shown in Fig. 4, the continuous DP matching circuit 28 matches the re-input speech against the originally input speech.

This reveals which part of the originally input speech was wrong and has been re-input. The part where the speech power waveform of the first input and that of the re-input are most similar is extracted; that is, spotting is performed. Next, the start/end changing circuit 31, the search circuit 32, the first-rank to lowest-rank-plus-one rewriting circuit 33, and the sorting circuit 34 reorder the ranks only for the part that matched the re-input speech. The first-ranked entry of the originally generated word lattice, "shelf is high", drops to the lowest rank, and "trunk is long", previously second, moves up to first. As a result, the word lattice organizing circuit 24 outputs "An elephant's trunk is long" to the CRT 25 and the storage circuit 26. Since this is the correct input, pressing the confirm key causes the storage circuit 26 to output "An elephant's trunk is long" as the recognition result.

[Effect of the Invention]

According to the present invention, a speech recognition error can be corrected simply by having the user re-input only the erroneous part by voice. There is therefore no need to move a cursor when correcting speech input, and recognition correction can be performed very easily.

[Brief Description of the Drawings]

Fig. 1 is a block diagram of the principle of the speech recognition correction device of the present invention; Fig. 2 is a block diagram of one embodiment of the present invention; Fig. 3 shows the structure of the word lattice in the word lattice storage circuit of that embodiment; Fig. 4 illustrates the sequence in which word recognition is corrected in the embodiment shown in Fig. 2; and Fig. 5 is a block diagram of a conventional speech recognition correction device. 11: acoustic analysis section; 12: sentence speech recognition section; 13: sentence speech spotting section; 14: display section.

Claims

[Claims]
1) A speech recognition correction device comprising: an acoustic analysis section (11) that analyzes the components of input speech; a speech recognition section (12) that recognizes the first input speech output from the acoustic analysis section (11) on the basis of a first candidate; means (14) for displaying the output of the speech recognition section (12); a sentence speech spotting section (13) that stores the first input speech output from the acoustic analysis section (11) and, when speech containing only a part to be corrected is subsequently re-input, determines which part of the first input speech it corresponds to and outputs the next candidate for that part to the speech recognition section (12); and means for finalizing the recognition of the input speech when the next candidate displayed by the display means (14) matches the input speech.
2) The speech recognition correction device according to claim 1, wherein the speech spotting section (13) includes: a storage circuit (27) that stores the input speech output from the acoustic analysis section (11); a correction control circuit (30) that receives a signal for correcting an error when part of the previously displayed input speech is erroneous; and a continuous DP matching circuit (28) that receives the outputs of the storage circuit (27) and the correction control circuit (30), compares the subsequently input speech containing only the part to be corrected with the entire input speech stored in the storage circuit (27), and extracts the part to be corrected.
3) The speech recognition correction device according to claim 2, wherein the speech spotting section (13) further includes: a start/end changing circuit (31) that receives the output of the continuous DP matching circuit (28) and changes the start and end of the corrected part of the input speech; and a lattice rank exchange circuit that exchanges the candidates of the word lattice.
4) The speech recognition correction device according to claim 3, wherein the lattice rank exchange circuit comprises: a search circuit (32) that receives the output of the start/end changing circuit (31) and searches for the word lattice entries falling within the range of that start and end; a candidate rank rewriting circuit (33) that moves the first-ranked word candidate falling between the start and end to below the lowest rank; and a sorting circuit (34) that receives the output of the candidate rank rewriting circuit (33), sorts the candidate ranks, and outputs the new first-ranked candidate.
5) A speech recognition correction method comprising: recognizing and displaying a speech input; re-inputting only a part to be corrected by voice; comparing the first speech input with the re-input speech to identify the part to be corrected; and displaying the next candidate for recognition of that part.