JPH07306694A

JPH07306694A - Sound input device

Info

Publication number: JPH07306694A
Application number: JP6119724A
Authority: JP
Inventors: Satoshi Tsukada; 聡塚田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1994-05-10
Filing date: 1994-05-10
Publication date: 1995-11-21

Abstract

PURPOSE:To provide a sound input device which can save a memory by equipping it with a function to select optimum input level. CONSTITUTION:In each input circuit 1', input sound signals V are amplified with different amplification factors, and A/D conversion is performed, and the signals from the start of input to time T are stored as front sound signals F in a front memory 30'. A gain selection control circuit 2' selects a signal that it judges to be on optimum level out of front sound selection signals F1-Fn, and informs it by a gain selection signal S. A selection signal 4 selects one out of sound digital signals D1-Dn by the selection signal S, and stores the signal from after passage of time T to the finish of sound input as a rear sound signal B in a rear memory 5. A feature abstracting part 3' selects one out of front sound signals F1-Fn, and adds a rear sound signal stored in the rear memory 5 to the rear of this selected front memory signal so as to abstract features.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、最適な入力レベルを
選択する機能を備えた音声入力装置に関するものであ
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice input device having a function of selecting an optimum input level.

【０００２】[0002]

【従来の技術】一般に、音声入力装置における入力音声
レベルは、発声者の特質や発声時の環境などの種々の要
因により変動する。これに対して、従来の音声入力装置
では、入力音声レベルに対して最適な利得（増幅率）で
入力を行うために、例えば、異なる増幅率の増幅回路に
音声を入力し、入力終了後に最適な入力レベルの音声を
選択する方法が採用されている。従来の音声入力装置の
要部の構成を図２（Ａ）に示す。同図において、１（１
−１〜１−ｎ）は入力回路、２はゲイン選択制御回路、
３は特徴抽出部である。入力回路１は、図２（Ｂ）に示
すように、増幅回路１０，Ａ／Ｄ変換回路２０，メモリ
３０を備えている。入力回路１−１〜１−ｎにおいてそ
の増幅回路１０の増幅率はそれぞれ異なっている。2. Description of the Related Art Generally, an input voice level in a voice input device varies depending on various factors such as the characteristics of a speaker and the environment at the time of vocalization. On the other hand, in a conventional voice input device, in order to perform input with an optimum gain (amplification factor) for the input voice level, for example, voice is input to an amplifier circuit with a different amplification factor, and the optimum input is made after the input is completed. A method of selecting a voice with a different input level is adopted. FIG. 2A shows the configuration of the main part of a conventional voice input device. In the figure, 1 (1
-1 to 1-n) are input circuits, 2 is a gain selection control circuit,
3 is a feature extraction unit. As shown in FIG. 2B, the input circuit 1 includes an amplification circuit 10, an A / D conversion circuit 20, and a memory 30. The amplification factors of the amplifier circuits 10 in the input circuits 1-1 to 1-n are different from each other.

【０００３】この音声入力装置において、入力音声信号
Ｖは、入力回路１−１〜１−ｎへ与えられる。入力回路
１−１〜１−ｎでは、その増幅回路１０によって、入力
音声信号Ｖを予め定められた増幅率で増幅し、増幅音声
信号Ａとする。そして、Ａ／Ｄ変換回路２０によって、
増幅回路１０からの増幅音声信号Ａをアナログ／ディジ
タル変換し、音声ディジタル信号Ｄとする。そして、こ
の音声ディジタル信号Ｄをメモリ３０へ記憶する。この
場合、メモリ３０に記憶される音声ディジタル信号Ｄ
は、音声入力が開始されてから終了するまでの入力音声
の全区間にわたる。ゲイン選択制御回路２は、入力回路
１−１〜１−ｎのメモリ３０に記憶された音声ディジタ
ル信号Ｄ１〜Ｄｎの中から最適な入力レベルと判断され
る信号を選択し、どの信号を選択したかをゲイン選択信
号Ｓによって特徴抽出部３へ通知する。特徴抽出部３
は、入力回路１−１〜１−ｎのメモリ３０に記憶されて
いる音声ディジタル信号Ｄ１〜Ｄｎの中から、ゲイン選
択信号Ｓで指定される音声ディジタル信号を最適入力レ
ベルの音声ディジタル信号として選択し、この選択した
音声ディジタル信号から特徴抽出を行う（特公平１−３
６６４０号、特開昭６２−２７２３００号、特開昭６３
−３１６０９７号参照）。In this voice input device, the input voice signal V is given to the input circuits 1-1 to 1-n. In each of the input circuits 1-1 to 1-n, the amplifier circuit 10 amplifies the input audio signal V at a predetermined amplification factor to obtain an amplified audio signal A. Then, by the A / D conversion circuit 20,
The amplified audio signal A from the amplifier circuit 10 is analog-to-digital converted into an audio digital signal D. Then, the audio digital signal D is stored in the memory 30. In this case, the audio digital signal D stored in the memory 30
Over the entire section of the input voice from the start to the end of voice input. The gain selection control circuit 2 selects a signal judged to be an optimum input level from the audio digital signals D1 to Dn stored in the memory 30 of the input circuits 1-1 to 1-n, and selects which signal. This is notified to the feature extraction unit 3 by the gain selection signal S. Feature extraction unit 3
Selects the audio digital signal designated by the gain selection signal S as the audio digital signal of the optimum input level from the audio digital signals D1 to Dn stored in the memory 30 of the input circuits 1-1 to 1-n. Then, feature extraction is performed from the selected audio digital signal (Japanese Patent Publication No. 1-3.
6640, JP-A-62-272300, JP-A-63.
316097).

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、このよ
うな従来の音声入力装置によると、入力音声の全区間に
わたる音声ディジタル信号Ｄをメモリ３０に記憶させ、
このメモリ３０に記憶された入力回路１−１〜１−ｎの
音声ディジタル信号Ｄ１〜Ｄｎに基づいて、そのうちの
一つを最適入力レベルの音声ディジタル信号として選択
するようにしているため、メモリを過大に必要とすると
いう問題があった。However, according to such a conventional voice input device, the voice digital signal D over the entire section of the input voice is stored in the memory 30,
Based on the audio digital signals D1 to Dn of the input circuits 1-1 to 1-n stored in the memory 30, one of them is selected as the audio digital signal of the optimum input level. There was a problem of needing too much.

【０００５】本発明はこのような課題を解決するために
なされたもので、その目的とするところは、メモリを節
約することのできる音声入力装置を提供することにあ
る。The present invention has been made to solve the above problems, and an object of the present invention is to provide a voice input device capable of saving memory.

【０００６】[0006]

【課題を解決するための手段】このような目的を達成す
るために、その第１発明（請求項１に係る発明）は、入
力音声信号を第１〜第Ｎの増幅手段によってそれぞれ異
なる増幅率で増幅し、この第１〜第Ｎの増幅手段により
増幅された入力音声信号を第１〜第ＮのＡ／Ｄ変換手段
によってディジタル信号に変換し音声ディジタル信号と
して出力し、この第１〜第ＮのＡ／Ｄ変換手段の出力す
る音声ディジタル信号の入力開始から予め定められた時
間Ｔまでの間の信号を前部音声信号として第１〜第Ｎの
前部メモリ手段に記憶ささ、この第１〜第Ｎの前部メモ
リ手段に記憶された前部音声信号に基づいて第１〜第Ｎ
のＡ／Ｄ変換手段の出力する音声ディジタル信号のうち
何れか一つを最適入力レベルの音声ディジタル信号と定
めるようにしたものである。また、その第２発明（請求
項２に係る発明）は、入力音声信号を第１〜第Ｎの増幅
手段によってそれぞれ異なる増幅率で増幅し、この第１
〜第Ｎの増幅手段により増幅された入力音声信号を第１
〜第ＮのＡ／Ｄ変換手段によってディジタル信号に変換
し音声ディジタル信号として出力し、この第１〜第Ｎの
Ａ／Ｄ変換手段の出力する音声ディジタル信号の入力開
始から予め定められた時間Ｔまでの間の信号を前部音声
信号として第１〜第Ｎの前部メモリ手段に記憶させ、こ
の第１〜第Ｎの前部メモリ手段に記憶された前部音声信
号から最適な入力レベルの信号を選択しどの信号を選択
したかを通知手段より通知するものとし、この通知手段
からの通知に基づいて第１〜第ＮのＡ／Ｄ変換手段の出
力する音声ディジタル信号のうち何れか一つを選択し、
この選択した音声ディジタル信号の上記時間Ｔの経過後
から音声入力が終了するまでを後部音声信号として選択
手段より出力するものとし、この選択手段より出力され
る後部音声信号を後部メモリ手段に記憶させ、上記通知
手段からの通知に基づいて第１〜第Ｎの前部メモリ手段
の記憶されている前部音声信号のうち何れか一つを選択
し、この選択した前部音声信号の後に後部メモリ手段に
記憶されている後部音声信号を加えて最適入力レベルの
音声ディジタル信号とするようにしたものである。ま
た、その第３発明（請求項３に係る発明）は、入力音声
信号を増幅する増幅回路と、この増幅回路により増幅さ
れた入力音声信号をディジタル信号に変換し音声ディジ
タル信号として出力するＡ／Ｄ変換回路と、このＡ／Ｄ
変換回路の出力する音声ディジタル信号の入力開始から
予め定められた時間Ｔまでの間の信号を前部音声信号と
して記憶する前部メモリとによって第１〜第Ｎの入力回
路を構成し（各入力回路の増幅回路の増幅率はそれぞれ
異なる）、この第１〜第Ｎの入力回路の前部メモリに記
憶された前部音声信号から最適な入力レベルの信号を選
択しどの信号を選択したかをゲイン選択信号としてゲイ
ン選択制御回路より出力するものとし、このゲイン選択
制御回路から出力されるゲイン選択信号に基づいて第１
〜第Ｎの入力回路のＡ／Ｄ変換回路の出力する音声ディ
ジタル信号のうち何れか一つを選択し、この選択した音
声ディジタル信号の上記時間Ｔの経過後から音声入力が
終了するまでを後部音声信号として選択回路より出力す
るものとし、この選択回路の出力する後部音声信号を後
部メモリに記憶させ、ゲイン選択制御回路からのゲイン
選択信号に基づいて第１〜第Ｎの入力回路の前部メモリ
に記憶されている前部音声信号のうち何れか一つを選択
し、この選択した前部メモリ信号の後に後部メモリの記
憶している後部音声信号を加えて最適入力レベルの音声
ディジタル信号とし、この最適入力レベルの音声ディジ
タル信号から特徴抽出を行うようにしたものである。In order to achieve such an object, the first invention (the invention according to claim 1) of the present invention is that the input audio signal has different amplification factors depending on the first to Nth amplification means. The input audio signal amplified by the first to Nth amplifying means is converted into a digital signal by the first to Nth A / D converting means and output as a digital audio signal. A signal from the start of input of the audio digital signal output from the N A / D conversion means to a predetermined time T is stored as a front audio signal in the first to Nth front memory means, and 1st to Nth based on the front audio signals stored in the 1st to Nth front memory means
One of the audio digital signals output from the A / D conversion means is determined as the audio digital signal of the optimum input level. The second invention (the invention according to claim 2) amplifies the input audio signal by different amplification factors by the first to Nth amplifying means, and
~ The first input voice signal amplified by the Nth amplifying means
A predetermined time T from the start of the input of the audio digital signal output from the first to Nth A / D converting means, which is converted into a digital signal by the Nth A / D converting means and output as a digital audio signal. The signals between 1 to N are stored in the first to Nth front memory means as front voice signals, and the optimum input level of the front voice signals stored in the first to Nth front memory means is stored. A signal is selected, and which signal is selected is notified from the notifying means, and one of the audio digital signals output from the first to Nth A / D converting means based on the notification from the notifying means. Choose one,
After the time T of the selected audio digital signal has elapsed and until the audio input is completed, the selecting means outputs the rear audio signal, and the rear audio signal output from the selecting means is stored in the rear memory means. , Any one of the front audio signals stored in the first to Nth front memory means is selected based on the notification from the notification means, and the rear memory is selected after the selected front audio signal. The rear audio signal stored in the means is added to obtain an audio digital signal having an optimum input level. A third invention (the invention according to claim 3) is an A / A circuit for amplifying an input audio signal, and an A / A which outputs an audio digital signal by converting the input audio signal amplified by the amplifier circuit into a digital signal. D conversion circuit and this A / D
The first to Nth input circuits are configured by a front memory that stores a signal from the start of input of the audio digital signal output from the conversion circuit to a predetermined time T as a front audio signal (each input The amplification factor of each of the amplifier circuits of the circuit is different), and a signal having an optimum input level is selected from the front audio signals stored in the front memories of the first to Nth input circuits, and which signal is selected. It is assumed that the gain selection signal is output from the gain selection control circuit, and the first selection is performed based on the gain selection signal output from the gain selection control circuit.
-Selecting any one of the audio digital signals output from the A / D conversion circuit of the Nth input circuit, the rear part from the lapse of the time T of the selected audio digital signal to the end of the audio input. The audio signal is output from the selection circuit, the rear audio signal output from the selection circuit is stored in the rear memory, and the front parts of the first to Nth input circuits are stored based on the gain selection signal from the gain selection control circuit. Select one of the front audio signals stored in the memory and add the selected rear memory signal to the rear audio signal stored in the rear memory to obtain the audio digital signal of the optimum input level. The feature extraction is performed from the audio digital signal of the optimum input level.

【０００７】[0007]

【作用】したがってこの発明によれば、その第１発明で
は、第１〜第ＮのＡ／Ｄ変換手段の出力する音声ディジ
タル信号の入力開始から時間Ｔまでの間の信号が前部音
声信号として第１〜第Ｎの前部メモリ手段に記憶され、
これら記憶された前部音声信号に基づいて最適入力レベ
ルの音声ディジタル信号が定められる。また、その第２
発明では、第１〜第ＮのＡ／Ｄ変換手段の出力する音声
ディジタル信号の入力開始から時間Ｔまでの間の信号が
前部音声信号として第１〜第Ｎの前部メモリ手段に記憶
され、これら記憶された前部音声信号から最適な入力レ
ベルの信号が選択され、どの信号を選択したかが通知さ
れ、この通知に基づいて第１〜第ＮのＡ／Ｄ変換手段の
出力する音声ディジタル信号のうち何れか一つが選択さ
れ、この選択された信号の上記時間Ｔの経過後から音声
入力が終了するまでが後部音声信号として後部メモリ手
段に記憶され、選択された前部メモリ手段の記憶する前
部音声信号の後に後部メモリ手段の記憶する後部音声信
号が加えられて最適入力レベルの音声ディジタル信号と
される。また、その第３発明では、第１〜第Ｎの入力回
路のＡ／Ｄ変換回路の出力する音声ディジタル信号の入
力開始から時間Ｔまでの間の信号が前部音声信号として
第１〜第Ｎの入力回路の前部メモリに記憶され、これら
記憶された前部音声信号から最適な入力レベルの信号が
選択され、どの信号を選択したかがゲイン選択信号とし
て出力され、このゲイン選択信号に基づいて第１〜第Ｎ
の入力回路のＡ／Ｄ変換回路の出力する音声ディジタル
信号のうち何れか一つが選択され、この選択された音声
ディジタル信号の上記時間Ｔの経過後から音声入力が終
了するまでが後部音声信号として後部メモリに記憶さ
れ、選択された前部メモリの記憶する前部音声信号の後
に後部メモリの記憶する後部音声信号が加えられてて最
適入力レベルの音声ディジタル信号とされ、この最適入
力レベルの音声ディジタル信号から特徴抽出が行われ
る。According to the present invention, therefore, in the first invention, the signal from the start of input of the audio digital signals output from the first to Nth A / D conversion means to time T is the front audio signal. Stored in first to Nth front memory means,
An audio digital signal having an optimum input level is determined based on these stored front audio signals. Also, the second
In the invention, the signals from the start of the input of the audio digital signals output by the first to Nth A / D conversion means to the time T are stored in the first to Nth front memory means as the front audio signals. , A signal having an optimum input level is selected from the stored front audio signals, and which signal is selected is notified, and based on this notification, the audio output from the first to Nth A / D conversion means is output. One of the digital signals is selected, and after the time T of the selected signal elapses until the voice input is completed, it is stored in the rear memory means as a rear voice signal, and the selected front memory means is stored. After the front audio signal to be stored, the rear audio signal stored in the rear memory means is added to obtain the audio digital signal of the optimum input level. In the third aspect of the invention, a signal from the start of input of the audio digital signal output from the A / D conversion circuit of the first to Nth input circuits to time T is the first to Nth audio signals. Is stored in the front memory of the input circuit of, the signal of the optimum input level is selected from these stored front audio signals, and which signal is selected is output as a gain selection signal. 1st to Nth
Any one of the audio digital signals output from the A / D conversion circuit of the input circuit is selected, and after the time T of the selected audio digital signal elapses until the audio input is completed is the rear audio signal. The audio signal of the optimum input level is obtained by adding the rear audio signal stored in the rear memory to the audio signal of the optimum input level after adding the rear audio signal stored in the rear memory to the audio signal of the optimum input level stored in the rear memory. Feature extraction is performed from the digital signal.

【０００８】[0008]

【実施例】以下、本発明を実施例に基づき詳細に説明す
る。図１（Ａ）はこの発明の一実施例を示す音声入力装
置の要部の構成図である。同図において、１’（１’−
１〜１’−ｎ）は入力回路、２’はゲイン選択制御回
路、３’は特徴抽出部、４は選択回路、５は後部メモリ
である。入力回路１’は、図１（Ｂ）に示すように、増
幅回路１０，Ａ／Ｄ変換回路２０，前部メモリ３０’を
備えている。入力回路１’−１〜１’−ｎにおいてその
増幅回路１０の増幅率はそれぞれ異なっている。EXAMPLES The present invention will now be described in detail based on examples. FIG. 1A is a configuration diagram of a main part of a voice input device showing an embodiment of the present invention. In the figure, 1 '(1'-
1 to 1'-n) is an input circuit, 2'is a gain selection control circuit, 3'is a feature extraction unit, 4 is a selection circuit, and 5 is a rear memory. As shown in FIG. 1B, the input circuit 1'includes an amplifier circuit 10, an A / D conversion circuit 20, and a front memory 30 '. The amplification factors of the amplifier circuit 10 in the input circuits 1'-1 to 1'-n are different from each other.

【０００９】この音声入力装置において、入力音声信号
Ｖは、入力回路１’−１〜１’−ｎへ与えられる。入力
回路１’−１〜１’−ｎでは、その増幅回路１０によっ
て、入力音声信号Ｖを予め定められた増幅率で増幅し、
増幅音声信号Ａとする。そして、Ａ／Ｄ変換回路２０に
よって、帯域制限を行った後、増幅回路１０からの増幅
音声信号Ａをアナログ／ディジタル変換し、音声ディジ
タル信号Ｄとする。この音声ディジタル信号Ｄは、入力
開始から予め定められた時間Ｔまでの間の信号が、前部
音声信号Ｆとして前部メモリ３０’へ記憶される。In this voice input device, the input voice signal V is given to the input circuits 1'-1 to 1'-n. In the input circuits 1′-1 to 1′-n, the amplifier circuit 10 amplifies the input audio signal V at a predetermined amplification factor,
The amplified audio signal A is used. Then, after band limitation is performed by the A / D conversion circuit 20, the amplified audio signal A from the amplifier circuit 10 is subjected to analog / digital conversion to be an audio digital signal D. As the audio digital signal D, a signal from the start of input to a predetermined time T is stored as a front audio signal F in the front memory 30 '.

【００１０】ここで、入力開始の判定（始端検出）は、
音声入力装置が発声者に発声を促すために指示を出すタ
イミング、あるいは、発声者が発声を開始する前に発声
者自身がスイッチ等で音声入力装置に入力開始を指示す
るタイミングなどを用いることができる。また、時間Ｔ
は、後述の最適入力レベルの音声ディジタル信号の決定
において、入力開始から時間Ｔの間の音声ディジタル信
号に基づくため、入力音声全体として最適なレベルかど
うかを決定できるだけの長さとしておく必要がある。時
間Ｔの一例としては、１音節分の音声が入力できるだけ
の長さなどがある。この時間Ｔは、入力開始を検出して
からのタイマ（図示せず）でのカウント時間として定め
る。Here, the input start determination (start edge detection) is
It is possible to use the timing at which the voice input device gives an instruction to the speaker to prompt the user to speak, or the timing at which the speaker itself instructs the voice input device to start input by a switch etc. before the speaker starts speaking. it can. Also, time T
Is based on the voice digital signal from the start of input to the time T in the determination of the voice digital signal of the optimum input level, which will be described later, and therefore it must be long enough to determine whether it is the optimum level for the entire input voice. . An example of the time T is a length such that one syllable voice can be input. This time T is set as a count time by a timer (not shown) after the input start is detected.

【００１１】ゲイン選択制御回路２’は、入力回路１’
−１〜１’−ｎの前部メモリ３０’に記憶された前部音
声信号Ｆ１〜Ｆｎの中から最適な入力レベルと判断され
る信号をどれか１つ選択し、どの信号を選択したかをゲ
イン選択信号Ｓによって選択回路４へ通知する。ここ
で、最適な入力レベルを選択するとは、後述の特徴抽出
部３’において特徴抽出を最適な状態で行い誤差の少な
い特徴データが得られるようにすることである。最適な
入力レベルを選択する方法の一例として、それぞれの増
幅率による音声パワーのピーク値が所定の閾値（例え
ば、±５Ｖ）を越えないでかつ最大のものを選択するよ
うな方法をとることが考えられる。このとき、予め入力
開始から時間Ｔの間の音声パワーと入力音声の全区間で
の最適入力レベルとの関係を調べておくことにより、所
定の閾値を設定すれば、入力開始から時間Ｔの間の音声
パワーのピーク値に基づいて、入力音声全体での最適な
増幅率となっているものを選択することができる。The gain selection control circuit 2'includes an input circuit 1 '.
-1 to 1'-n, which one of the signals judged as the optimum input level is selected from the front audio signals F1 to Fn stored in the front memory 30 ', and which signal is selected Is notified to the selection circuit 4 by the gain selection signal S. Here, selecting the optimum input level is to perform feature extraction in a feature extraction unit 3'described later so as to obtain feature data with a small error. As an example of a method of selecting an optimum input level, a method of selecting a maximum value so that the peak value of audio power due to each amplification factor does not exceed a predetermined threshold value (for example, ± 5 V) Conceivable. At this time, if a predetermined threshold value is set by checking the relationship between the voice power from the input start and the time T and the optimum input level in all the sections of the input voice in advance, the period from the input start to the time T Based on the peak value of the voice power of, it is possible to select the one that has the optimum amplification factor for the entire input voice.

【００１２】選択回路４は、ゲイン選択制御回路２’か
らのゲイン選択信号Ｓにより、入力回路１’−１〜１’
−ｎのＡ／Ｄ変換回路２０の出力する音声ディジタル信
号Ｄ１〜Ｄｎのうち何れか一つを選択し、すなわち音声
ディジタル信号Ｄ１〜Ｄｎのうち何れか一つを最適入力
レベルの音声ディジタル信号と定め、この選択した音声
ディジタル信号の時間Ｔの経過後から音声入力が終了す
るまでを後部音声信号Ｂとして出力する。選択回路４の
出力する後部音声信号Ｂは後部メモリ５に記憶される。
選択回路４での音声ディジタル信号の選択方法として次
のような，の方式が考えられる。ゲイン選択制御回路２’で選択された信号を出力する
入力回路からの音声ディジタル信号を選択する。ゲイン選択制御回路２’で選択された信号に基づいて
予め作成された対応テーブル（前音声と後音声との関
係，用途に応じた対応テーブル）から最適な入力回路か
らの音声ディジタル信号を選択する。The selection circuit 4 is responsive to the gain selection signal S from the gain selection control circuit 2'for input circuits 1'-1 to 1 '.
Select any one of the audio digital signals D1 to Dn output from the -n A / D conversion circuit 20, that is, select any one of the audio digital signals D1 to Dn as the audio digital signal of the optimum input level. After the lapse of the time T of the selected audio digital signal until the end of the audio input, the audio signal B is output as the rear audio signal B. The rear audio signal B output from the selection circuit 4 is stored in the rear memory 5.
The following method can be considered as a method of selecting the audio digital signal in the selection circuit 4. The audio digital signal from the input circuit that outputs the signal selected by the gain selection control circuit 2'is selected. The optimum audio digital signal from the input circuit is selected from the correspondence table (the correspondence table according to the relationship between the front voice and the rear voice and the application) created in advance based on the signal selected by the gain selection control circuit 2 '. .

【００１３】一方、ゲイン選択制御回路２’の出力する
ゲイン選択信号Ｓは、特徴抽出部３’へも与えられる。
特徴抽出部３’は、ゲイン選択制御回路２’からのゲイ
ン選択信号Ｓにより、入力回路１’−１〜１’−ｎの前
部メモリ３０’の記憶している前部音声信号Ｆ１〜Ｆｎ
のうち何れか一つを選択し、この選択した前部メモリ信
号の後に後部メモリ５の記憶している後部音声信号を加
えて最適入力レベルの音声ディジタル信号とし、この最
適入力レベルの音声ディジタル信号から特徴抽出を行
う。これにより、本実施例によれば、入力開始から時間
Ｔの経過後は最適入力レベルの音声ディジタル信号のみ
を記憶すればよく、入力回路１’−１〜１’−ｎのＡ／
Ｄ変換回路２０からの音声ディジタル信号の各々を入力
音声の全区間にわたり記憶する方法に比べ、メモリを大
幅に節約することができるようになる。なお、特徴抽出
の方法の一例としては、文献「音声認識（新美康永著、
共立出版発行）」の３８ページ〜５２ページに記載のケ
プストラム分析や線形予測分析などを用いることができ
る。On the other hand, the gain selection signal S output from the gain selection control circuit 2'is also given to the feature extraction section 3 '.
The feature extraction unit 3 ′ uses the gain selection signal S from the gain selection control circuit 2 ′ to output the front audio signals F1 to Fn stored in the front memory 30 ′ of the input circuits 1′-1 to 1′-n.
One of the selected front memory signals is added to the rear audio signal stored in the rear memory 5 after the selected front memory signal to obtain an optimum input level audio digital signal. Feature extraction from. Thus, according to the present embodiment, after the time T has elapsed from the start of input, only the audio digital signal of the optimum input level needs to be stored, and A / A of the input circuits 1'-1 to 1'-n is stored.
Compared with the method of storing each of the audio digital signals from the D conversion circuit 20 over the entire section of the input audio, the memory can be saved significantly. In addition, as an example of the feature extraction method, there is a document “Voice recognition (by Yasunaga Niimi,
Kyoritsu Publishing ”), pages 38 to 52, and the cepstrum analysis and linear prediction analysis can be used.

【００１４】[0014]

【発明の効果】以上説明したことから明らかなように本
発明によれば、その第１発明では、第１〜第ＮのＡ／Ｄ
変換手段の出力する音声ディジタル信号の入力開始から
時間Ｔまでの間の信号が前部音声信号として第１〜第Ｎ
の前部メモリ手段に記憶され、これら記憶された前部音
声信号に基づいて最適入力レベルの音声ディジタル信号
が定められるので、時間Ｔの経過後は最適入力レベルの
音声ディジタル信号のみを記憶するようになすことによ
り、第１〜第ＮのＡ／Ｄ変換手段からの音声ディジタル
信号の各々を入力音声の全区間にわたり記憶する方法に
比べ、メモリを大幅に節約することが可能となる。ま
た、その第２発明では、第１〜第ＮのＡ／Ｄ変換手段の
出力する音声ディジタル信号の入力開始から時間Ｔまで
の間の信号が前部音声信号として第１〜第Ｎの前部メモ
リ手段に記憶され、これら記憶された前部音声信号から
最適な入力レベルの信号が選択され、どの信号を選択し
たかが通知され、この通知に基づいて第１〜第ＮのＡ／
Ｄ変換手段の出力する音声ディジタル信号のうち何れか
一つが選択され、この選択された信号の上記時間Ｔの経
過後から音声入力が終了するまでが後部音声信号として
後部メモリ手段に記憶され、選択された前部メモリ手段
の記憶する前部音声信号の後に後部メモリ手段の記憶す
る後部音声信号が加えられて最適入力レベルの音声ディ
ジタル信号とされるので、第１〜第ＮのＡ／Ｄ変換手段
からの音声ディジタル信号の各々を入力音声の全区間に
わたり記憶する方法に比べ、メモリを大幅に節約するこ
とができる。また、その第３発明では、第１〜第Ｎの入
力回路のＡ／Ｄ変換回路の出力する音声ディジタル信号
の入力開始から時間Ｔまでの間の信号が前部音声信号と
して第１〜第Ｎの入力回路の前部メモリに記憶され、こ
れら記憶された前部音声信号から最適な入力レベルの信
号が選択され、どの信号を選択したかがゲイン選択信号
として出力され、このゲイン選択信号に基づいて第１〜
第Ｎの入力回路のＡ／Ｄ変換回路の出力する音声ディジ
タル信号のうち何れか一つが選択され、この選択された
音声ディジタル信号の上記時間Ｔの経過後から音声入力
が終了するまでが後部音声信号として後部メモリに記憶
され、選択された前部メモリの記憶する前部音声信号の
後に後部メモリの記憶する後部音声信号が加えられてて
最適入力レベルの音声ディジタル信号とされ、この最適
入力レベルの音声ディジタル信号から特徴抽出が行われ
るので、第１〜第ＮのＡ／Ｄ変換回路からの音声ディジ
タル信号の各々を入力音声の全区間にわたり記憶する方
法に比べメモリを大幅に節約したうえ、特徴抽出を行う
ことができる。As is apparent from the above description, according to the present invention, in the first invention, the first to Nth A / Ds are used.
The signals from the start of the input of the audio digital signal output by the converting means to the time T are the first to Nth audio signals as front audio signals.
The audio digital signal of the optimum input level is stored in the front memory means and the audio digital signal of the optimum input level is determined based on these stored front audio signals. Therefore, after the time T, only the audio digital signal of the optimum input level is stored. By doing so, it is possible to significantly save the memory as compared with the method of storing each of the audio digital signals from the first to Nth A / D converting means over the entire section of the input audio. In the second aspect of the invention, the signals from the start of input of the audio digital signals output from the first to Nth A / D conversion means to the time T are the first audio signals to the first to Nth audio signals. A signal having an optimum input level is selected from the front audio signals stored in the memory means, and which signal is selected is notified. Based on this notification, the first to Nth A /
Any one of the audio digital signals output by the D conversion means is selected, and after the elapse of the time T of the selected signal until the audio input is completed, it is stored in the rear memory means as the rear audio signal and selected. The rear audio signal stored in the rear memory means is added to the front audio signal stored in the front memory means to be an audio digital signal of the optimum input level. Therefore, the first to Nth A / D conversions are performed. Memory can be saved significantly compared to a method in which each of the audio digital signals from the means is stored over the entire duration of the input audio. In the third aspect of the invention, a signal from the start of input of the audio digital signal output from the A / D conversion circuit of the first to Nth input circuits to time T is the first to Nth audio signals. Is stored in the front memory of the input circuit of, the signal of the optimum input level is selected from these stored front audio signals, and which signal is selected is output as a gain selection signal. 1st
Any one of the audio digital signals output from the A / D conversion circuit of the Nth input circuit is selected, and the rear audio is output after the time T of the selected audio digital signal elapses until the audio input is completed. The signal is stored in the rear memory as a signal, and the rear audio signal stored in the rear memory is added after the front audio signal stored in the selected front memory to obtain an audio digital signal of the optimum input level. Since the feature extraction is performed from the audio digital signal of, the memory is significantly saved as compared with the method of storing each of the audio digital signals from the first to Nth A / D conversion circuits over the entire section of the input audio. Feature extraction can be performed.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明に係る音声入力装置の要部の構成を示
す図である。FIG. 1 is a diagram showing a configuration of a main part of a voice input device according to the present invention.

【図２】従来の音声入力装置の要部の構成を示す図で
ある。FIG. 2 is a diagram showing a configuration of a main part of a conventional voice input device.

【符号の説明】[Explanation of symbols]

１’（１’−１〜１’−ｎ）…入力回路、２’…ゲイン
選択制御回路、３’…特徴抽出部、４…選択回路、５…
後部メモリ、１０…増幅回路、２０…Ａ／Ｄ変換回路、
３０’…前部メモリ。1 '(1'-1 to 1'-n) ... Input circuit, 2' ... Gain selection control circuit, 3 '... Feature extraction unit, 4 ... Selection circuit, 5 ...
Rear memory, 10 ... Amplifying circuit, 20 ... A / D converting circuit,
30 '... front memory.

Claims

【特許請求の範囲】[Claims]

【請求項１】入力音声信号をそれぞれ異なる増幅率で
増幅する第１〜第Ｎの増幅手段と、この第１〜第Ｎの増幅手段により増幅された入力音声信
号をディジタル信号に変換し音声ディジタル信号として
出力する第１〜第ＮのＡ／Ｄ変換手段と、この第１〜第ＮのＡ／Ｄ変換手段の出力する音声ディジ
タル信号の入力開始から予め定められた時間Ｔまでの間
の信号を前部音声信号として記憶する第１〜第Ｎの前部
メモリ手段と、この第１〜第Ｎの前部メモリ手段に記憶された前部音声
信号に基づいて前記第１〜第ＮのＡ／Ｄ変換手段の出力
する音声ディジタル信号のうち何れか一つを最適入力レ
ベルの音声ディジタル信号と定める手段とを備えたこと
を特徴とする音声入力装置。1. A first to Nth amplifying means for amplifying an input sound signal with different amplification factors respectively, and an input sound signal amplified by the first to Nth amplifying means is converted into a digital signal and a sound digital signal. First to Nth A / D conversion means for outputting as signals, and signals from the start of input of the audio digital signals output by the first to Nth A / D conversion means to a predetermined time T Based on the front audio signals stored in the first to N-th front memory means, and the first to N-th A A voice input device comprising means for determining any one of the voice digital signals output by the D / D conversion means as a voice digital signal of an optimum input level.

【請求項２】入力音声信号をそれぞれ異なる増幅率で
増幅する第１〜第Ｎの増幅手段と、この第１〜第Ｎの増幅手段により増幅された入力音声信
号をディジタル信号に変換し音声ディジタル信号として
出力する第１〜第ＮのＡ／Ｄ変換手段と、この第１〜第ＮのＡ／Ｄ変換手段の出力する音声ディジ
タル信号の入力開始から予め定められた時間Ｔまでの間
の信号を前部音声信号として記憶する第１〜第Ｎの前部
メモリ手段と、この第１〜第Ｎの前部メモリ手段に記憶された前部音声
信号から最適な入力レベルの信号を選択しどの信号を選
択したかを通知する通知手段と、この通知手段からの通知に基づいて前記第１〜第ＮのＡ
／Ｄ変換手段の出力する音声ディジタル信号のうち何れ
か一つを選択し、この選択した音声ディジタル信号の前
記時間Ｔの経過後から音声入力が終了するまでを後部音
声信号として出力する選択手段と、この選択手段の出力する後部音声信号を記憶する後部メ
モリ手段と、前記通知手段からの通知に基づいて前記第１〜第Ｎの前
部メモリ手段の記憶している前部音声信号のうち何れか
一つを選択し、この選択した前部音声信号の後に前記後
部メモリ手段の記憶している後部音声信号を加えて最適
入力レベルの音声ディジタル信号とする手段とを備えた
ことを特徴とする音声入力装置。2. A first to Nth amplifying means for amplifying an input audio signal with different amplification factors respectively, and an input audio signal amplified by the first to Nth amplifying means is converted into a digital signal and an audio digital signal. First to Nth A / D conversion means for outputting as signals, and signals from the start of input of the audio digital signals output by the first to Nth A / D conversion means to a predetermined time T Are stored as front audio signals, and a signal having an optimum input level is selected from the front audio signals stored in the first to Nth front memory means. Notifying means for notifying whether the signal is selected, and the first to Nth A's based on the notification from the notifying means.
Selecting means for selecting any one of the audio digital signals output from the D / D conversion means, and outputting as a rear audio signal from the time T of the selected audio digital signal to the end of the audio input. Which of the rear memory means for storing the rear audio signal output by the selecting means and the front audio signal stored in the first to Nth front memory means based on the notification from the notification means Means for selecting one of them and adding the rear audio signal stored in the rear memory means to the selected front audio signal to obtain an audio digital signal of an optimum input level. Voice input device.

【請求項３】入力音声信号を増幅する増幅回路と、こ
の増幅回路により増幅された入力音声信号をディジタル
信号に変換し音声ディジタル信号として出力するＡ／Ｄ
変換回路と、このＡ／Ｄ変換回路の出力する音声ディジ
タル信号の入力開始から予め定められた時間Ｔまでの間
の信号を前部音声信号として記憶する前部メモリとを備
え、前記増幅回路の増幅率がそれぞれ異なる第１〜第Ｎ
の入力回路と、この第１〜第Ｎの入力回路の前部メモリに記憶された前
部音声信号から最適な入力レベルの信号を選択しどの信
号を選択したかをゲイン選択信号として出力するゲイン
選択制御回路と、このゲイン選択制御回路から出力されるゲイン選択信号
に基づいて前記第１〜第Ｎの入力回路のＡ／Ｄ変換回路
の出力する音声ディジタル信号のうち何れか一つを選択
し、この選択した音声ディジタル信号の前記時間Ｔの経
過後から音声入力が終了するまでを後部音声信号として
出力する選択回路と、この選択回路の出力する後部音声信号を記憶する後部メ
モリと、前記ゲイン選択制御回路からのゲイン選択信号に基づい
て前記第１〜第Ｎの入力回路の前部メモリの記憶してい
る前部音声信号のうち何れか一つを選択し、この選択し
た前部メモリ信号の後に前記後部メモリの記憶している
後部音声信号を加えて最適入力レベルの音声ディジタル
信号とし、この最適入力レベルの音声ディジタル信号か
ら特徴抽出を行う特徴抽出部とを備えたことを特徴とす
る音声入力装置。3. An amplifier circuit for amplifying an input audio signal, and an A / D for converting the input audio signal amplified by this amplifier circuit into a digital signal and outputting it as an audio digital signal.
The amplifier circuit includes a conversion circuit and a front memory for storing a signal from the start of input of the audio digital signal output from the A / D conversion circuit to a predetermined time T as a front audio signal. 1st to Nth with different amplification factors
Of the input circuit and the front audio signal stored in the front memories of the first to Nth input circuits, and a gain for outputting a signal having an optimum input level selected as a gain selection signal. A selection control circuit and one of the audio digital signals output from the A / D conversion circuits of the first to Nth input circuits are selected based on the gain selection signal output from the gain selection control circuit. A selection circuit that outputs a rear audio signal from the time T of the selected audio digital signal until the end of audio input, a rear memory that stores the rear audio signal output by the selection circuit, and the gain Based on the gain selection signal from the selection control circuit, one of the front audio signals stored in the front memory of the first to Nth input circuits is selected, and the selected front audio signal is selected. A rear voice signal stored in the rear memory after the memory signal is added to form a voice digital signal having an optimum input level, and a feature extraction unit for performing feature extraction from the voice digital signal having the optimum input level is provided. And a voice input device.