JP4779748B2

JP4779748B2 - Voice input / output device for vehicle and program for voice input / output device

Info

Publication number: JP4779748B2
Application number: JP2006086196A
Authority: JP
Inventors: 利文加藤
Original assignee: Denso Corp
Current assignee: Denso Corp
Priority date: 2006-03-27
Filing date: 2006-03-27
Publication date: 2011-09-28
Anticipated expiration: 2026-03-27
Also published as: JP2007266754A

Description

本発明は、車両用音声入出力装置および音声入出力装置用プログラムに関する。 The present invention relates to a vehicle voice input / output device and a program for a voice input / output device.

従来、車両用ナビゲーション装置、車両用ハンズフリー装置等、音声入出力機能を備えた装置が車両に多く搭載されている。このような車両用音声入出力装置のための技術として、複数の乗員のうち特定の乗員だけが聞こえるように音声を出力する技術が提案されている。例えば、特許文献１には、複数のスピーカから互いに位相がずれた音声を出力させることで、運転席の乗員の位置でのみ聞こえるような音声出力を実現する技術が開示されている。 Conventionally, many devices having a voice input / output function, such as a vehicle navigation device and a vehicle hands-free device, are mounted on a vehicle. As a technique for such a voice input / output device for a vehicle, a technique for outputting a voice so that only a specific passenger among a plurality of passengers can be heard has been proposed. For example, Patent Document 1 discloses a technology that realizes sound output that can be heard only at the position of an occupant in a driver's seat by outputting sounds that are out of phase with each other from a plurality of speakers.

また、車両用のものではないが、特許文献２には、複数のスピーカ間の位相のずれを変化させることで、出力した音声が聞こえる位置を変化させる技術が開示されている。
特開２００４−１１２５２０８号公報特開平８−２２１０８１号公報 Further, although not for a vehicle, Patent Document 2 discloses a technique for changing a position where an output sound can be heard by changing a phase shift between a plurality of speakers.
JP 2004-1125208 A JP-A-8-2221081

また、車室内に複数の乗員がいる場合、それらの乗員のうち発話を行った者のみに聞こえるように音を出力したい場合もある。例えば、特定の乗員がハンズフリー装置を用いて通話しているとき、通話相手の発言を他の乗員に聞かれたくない場合がある。また、ナビゲーション装置に対して音声コマンドを発した特定の乗員だけに、そのコマンドに対する応答音声を聞かせたい場合がある。 In addition, when there are a plurality of occupants in the passenger compartment, there are cases where it is desired to output a sound so that only the person who speaks among the occupants can hear. For example, when a specific occupant is making a call using a hands-free device, there is a case in which other occupants do not want to hear the other party's speech. In some cases, only a specific occupant who has issued a voice command to the navigation device wants to hear a response voice to the command.

本発明は上記点に鑑み、車内の発話者の位置に向けてスピーカの指向性を変化させる技術を提供することを目的とする。 An object of this invention is to provide the technique which changes the directivity of a speaker toward the position of the speaker in a vehicle in view of the said point.

上記目的を達成するための請求項１に記載の発明は、車両に搭載される車両用音声入出力装置であって、姿勢に応じた方向に指向性を有する指向性スピーカと、当該指向性スピーカの姿勢を変える駆動部と、当該車両の車室内の複数位置に配置された複数のマイクと、制御部とを備える。そして、当該制御部は、当該複数のマイクが受けた音声に基づいて、当該指向性スピーカから話者の頭部への方向（以下、目標方向という）を繰り返し推定し、話者本人が発話したか否かを判定し、話者本人が発話していると判定したことに基づいて、目標方向の変化に指向性スピーカの指向性を追従させるよう、駆動部を制御するものであり、さらに、制御部は、話者本人が発話したか否かの判定を、推定した目標方向の変化が基準量以下であるか否かに基づいて行うとともに、制御部が推定する目標方向に基づいて基準量を変化させる。 The invention according to claim 1 for achieving the above object, a vehicular audio input output apparatus mounted on a vehicle, the directional loudspeaker having directivity in a direction corresponding to the posture, the directivity A drive unit that changes the attitude of the speaker, a plurality of microphones arranged at a plurality of positions in the vehicle interior of the vehicle, and a control unit are provided. And the said control part repeatedly estimates the direction (henceforth a target direction) from the said directional speaker to a speaker's head based on the audio | voice which the said several microphone received, and the speaker himself uttered Based on the determination that the speaker himself is speaking, the drive unit is controlled so that the directivity of the directional speaker follows the change in the target direction . The control unit determines whether or not the speaker himself has spoken based on whether or not the estimated change in the target direction is equal to or less than the reference amount, and based on the target direction estimated by the control unit To change.

このようになっているので、指向性スピーカが駆動部によって姿勢を変えられることで、その指向性の方向が、発話者の推定された頭部の方向を向く。したがって、発話者の位置に向けてスピーカの指向性を変化させることができる。
特に、話者本人が発話していると判定したことに基づき、発話者本人の発話であることを確認してから指向性スピーカの姿勢を変化させることで、当初の発話者以外の乗員が言葉を発したときに、または車内外で音が発生したときに、指向性スピーカが誤ってその方向に指向性を向けてしまうような事態の発生頻度を低減できる。これにより、指向性スピーカによる特定の一人の発話者への追従性の正確度がより高まる。
話者本人が発話したか否かの判定は、推定した目標方向の変化が基準量以下であるか否かに基づいて行うことができる。この場合、制御部は、制御部が推定する当該目標方向に基づいて当該基準量を変化させることで、車室内の位置毎の特性に応じたよりきめ細かい本人確認が可能となる。 Thus, the orientation of the directional speaker can be changed by the drive unit, so that the direction of the directivity faces the direction of the head estimated by the speaker. Therefore, the directivity of the speaker can be changed toward the position of the speaker.
In particular, if it is determined that the speaker is speaking, the directional speaker's posture is changed after confirming that the speaker is speaking, so that the passenger other than the original speaker can speak When sound is generated or when sound is generated inside or outside the vehicle, the frequency of occurrence of a situation in which the directional speaker erroneously directs directivity in that direction can be reduced. Thereby, the accuracy of the followability to a specific one speaker by the directional speaker is further increased.
The determination as to whether or not the speaker himself has spoken can be made based on whether or not the estimated change in the target direction is equal to or less than a reference amount. In this case, the control unit changes the reference amount based on the target direction estimated by the control unit, thereby enabling finer identity verification according to the characteristics for each position in the vehicle interior.

また、請求項２に記載のように、当該複数のマイクは、当該車室内の天井部に取り付けられる第１のマイク群と、当該車室内の天井部以外の部分（以下、別部分という）に取り付けられる第２のマイク群とを備え、当該第１のマイク群のそれぞれは、当該天井部の一列に並ばない３箇所に配置され、当該第２のマイク群のぞれぞれは、当該別部分の一列に並ばない３箇所に配置され、当該制御部は、当該複数のマイクのそれぞれ受けた音声間の時間遅れに基づいて、当該話者の頭部位置を推定し、推定した頭部位置に基づいて、当該目標方向を推定するようになっていてもよい。 In addition, as described in claim 2, the plurality of microphones includes a first microphone group attached to a ceiling portion in the vehicle interior and a portion other than the ceiling portion in the vehicle interior (hereinafter referred to as another portion). Each of the first microphone groups is arranged in three places that are not lined up in a row of the ceiling portion, and each of the second microphone groups is the separate microphone group. Arranged at three locations that are not aligned in a row, the control unit estimates the head position of the speaker based on the time delay between the sounds received by the plurality of microphones, and the estimated head position Based on the above, the target direction may be estimated.

このように、天井部とそれ以外の部分とにあるマイク群を用いて頭部位置を推定することで、目標方向を精度よく推定できるようになる。 Thus, it is possible to estimate the target direction with high accuracy by estimating the head position using the microphone groups on the ceiling and other parts.

また、請求項３に記載のように、当該制御部は、ユーザによる所定の入力操作があったことに基づいて、当該話者本人が発話したか否かに関わらず、当該目標方向の変化に当該指向性スピーカの指向性を追従させるよう、当該駆動部を制御するようになっていてもよい。このようになっていることで、指向性スピーカの指向性を向けるべき対象の乗員を切り替えることが可能となる。 In addition, as described in claim 3 , the control unit changes the target direction based on whether or not the speaker himself / herself speaks based on a predetermined input operation by the user. The drive unit may be controlled to follow the directivity of the directional speaker. In this way, it becomes possible to switch the target occupant to which the directivity of the directional speaker should be directed.

以下、本発明の一実施形態について説明する。図１に、本実施形態に係るハンズフリー装置１（車両用音声入出力装置の一例に相当する）のハードウェア構成を示す。ハンズフリー装置１は、制御部１０、インターフェース１１、前部マイク群１２、天井マイク群１３、駆動部１４、狭指向性スピーカ１５、広指向性スピーカ１６、および操作スイッチ群１７を有している。 Hereinafter, an embodiment of the present invention will be described. FIG. 1 shows a hardware configuration of a hands-free device 1 (corresponding to an example of a vehicle voice input / output device) according to the present embodiment. The hands-free device 1 includes a control unit 10, an interface 11, a front microphone group 12, a ceiling microphone group 13, a drive unit 14, a narrow directional speaker 15, a wide directional speaker 16, and an operation switch group 17. .

制御部１０は、図示しないＣＰＵ、ＲＡＭ、ＲＯＭ、Ｉ／Ｏ等を備えている。このＣＰＵは、ＲＯＭ中の後述する各種プログラムを読み出して実行し、その実行の際には、ＲＡＭ中の記憶領域を用いると共に、インターフェース１１、前部マイク群１２、天井マイク群１３、駆動部１４、狭指向性スピーカ１５、広指向性スピーカ１６、および操作スイッチ群１７のそれぞれと制御信号のやりとりを、Ｉ／Ｏを介して行う。なお、ＲＯＭには、プログラム以外にも、車室内の空間の座標における、前部マイク群１２の取り付け位置座標、天井マイク群１３の取り付け位置座標、および狭指向性スピーカ１５の取り付け位置座標の情報が記録されている。以下、ＣＰＵがＲＯＭ中のプログラムを実行する処理を、制御部１０が実行する処理として記載する。 The control unit 10 includes a CPU, RAM, ROM, I / O and the like (not shown). The CPU reads and executes various programs described later in the ROM, and uses the storage area in the RAM for executing the program, and also includes the interface 11, the front microphone group 12, the ceiling microphone group 13, and the drive unit 14. Control signals are exchanged with each of the narrow directional speaker 15, the wide directional speaker 16, and the operation switch group 17 via the I / O. In addition to the program, the ROM stores information on the mounting position coordinates of the front microphone group 12, the mounting position coordinates of the ceiling microphone group 13, and the mounting position coordinates of the narrow directivity speaker 15 in the coordinates of the space in the vehicle interior. Is recorded. Hereinafter, the process in which the CPU executes the program in the ROM will be described as the process executed by the control unit 10.

インターフェース１１は、制御部１０と図示しない携帯電話機とを有線または無線により接続する装置である。無線によって接続する場合、インターフェース１１は例えばＢｌｕｅｔｏｏｔｈ（登録商標）用通信機器であってもよい。 The interface 11 is a device that connects the control unit 10 and a mobile phone (not shown) by wire or wirelessly. When connecting wirelessly, the interface 11 may be, for example, a Bluetooth (registered trademark) communication device.

前部マイク群１２は、複数のマイクユニットを有している。図２に、車室内前部のダッシュボード２１における前部マイク群１２の配置を示す。この図に示す通り、前部マイク群１２の各マイクユニット１２ａ〜ｃは、ダッシュボード２１の中央正面に、一直線でない配置、より詳しくは正三角形の配置で同一面上に並べられている。マイクユニット１２ａ〜ｃの相互間の距離は、例えば１０センチメートルである。 The front microphone group 12 has a plurality of microphone units. FIG. 2 shows the arrangement of the front microphone group 12 on the dashboard 21 in the front part of the vehicle interior. As shown in this figure, the microphone units 12a to 12c of the front microphone group 12 are arranged on the same plane in the center front surface of the dashboard 21 in a non-straight arrangement, more specifically, an equilateral triangle arrangement. The distance between the microphone units 12a to 12c is, for example, 10 centimeters.

天井マイク群１３は、複数のマイクユニットを有している。図３に、車室内の天井２２における天井マイク群１３の配置を示す。この図に示す通り、天井マイク群１３の各マイクユニット１３ａ〜ｃは、天井２２の中央部に、一直線でない配置、より詳しくは正三角形の配置で同一面上に並べられている。マイクユニット１３ａ〜ｃの相互間の距離は、例えば１０センチメートルである。 The ceiling microphone group 13 has a plurality of microphone units. FIG. 3 shows an arrangement of the ceiling microphone group 13 on the ceiling 22 in the vehicle interior. As shown in this figure, the microphone units 13a to 13c of the ceiling microphone group 13 are arranged on the same plane in the central portion of the ceiling 22 in a non-straight arrangement, more specifically, an equilateral triangle arrangement. The distance between the microphone units 13a to 13c is, for example, 10 centimeters.

駆動部１４は、天井２２の直下に取り付けられ、狭指向性スピーカ１５の姿勢を機械的に変化させる装置である。図４に、この駆動部１４の構造の一例および駆動部１４と狭指向性スピーカ１５の機械的接続関係の一例を示す。この図に示す通り、駆動部１４は、第１モータ１４１、回転部１４２、第２モータ１４３、および固定部１４４を有している。第１モータ１４１は、天井２２の中央部に固定され、制御部１０からの制御に基づいて、車両の上下方向を軸として回転部１４２を回転させる。第２モータ１４３は、回転部１４２に固定されている。固定部１４４は、その一端が第２モータ１４３に接続されている。この第２モータ１４３は、制御部１０からの制御に基づいて、車両の水平方向を軸として固定部１４４を回転させる。 The drive unit 14 is a device that is attached directly below the ceiling 22 and mechanically changes the attitude of the narrow directional speaker 15. FIG. 4 shows an example of the structure of the drive unit 14 and an example of a mechanical connection relationship between the drive unit 14 and the narrow directivity speaker 15. As shown in this figure, the drive unit 14 includes a first motor 141, a rotating unit 142, a second motor 143, and a fixed unit 144. The first motor 141 is fixed to the center of the ceiling 22 and rotates the rotating unit 142 around the vertical direction of the vehicle based on the control from the control unit 10. The second motor 143 is fixed to the rotating unit 142. One end of the fixing portion 144 is connected to the second motor 143. The second motor 143 rotates the fixing unit 144 around the horizontal direction of the vehicle based on the control from the control unit 10.

狭指向性スピーカ１５は、固定部１の端部のうち、第２モータ１４３と反対側の端部に固定された、狭い指向性を有するスピーカである。狭指向性スピーカ１５としては、例えば、超音波を搬送波とし、音声を変調波とする音波を狭い方向（例えばπ／５００ステラジアン）に出力する超狭指向性スピーカを用いる。この狭指向性スピーカ１５は、その本体に対する指向性の向きが固定されている。したがって、狭指向性スピーカ１５は、姿勢が変わると、その姿勢の変化に伴って指向性の向きも変化する。 The narrow directivity speaker 15 is a speaker having a narrow directivity that is fixed to an end portion of the fixed portion 1 on the side opposite to the second motor 143. As the narrow directivity speaker 15, for example, an ultra narrow directivity speaker that outputs a sound wave having an ultrasonic wave as a carrier wave and a sound as a modulated wave in a narrow direction (for example, π / 500 steradians) is used. The directionality of the directivity with respect to the main body of the narrow directivity speaker 15 is fixed. Therefore, when the orientation of the narrow directivity speaker 15 changes, the orientation of the directivity also changes as the orientation changes.

このような駆動部１４および狭指向性スピーカ１５の構造により、車両の上から下への向きの軸を極軸とする極座標表示で表現すると、第２モータ１４３が狭指向性スピーカ１５の向きの緯度（θ）を変化させ、第１モータ１４１が狭指向性スピーカ１５の向きの経度（φ）を変化させることができる。したがって制御部１０は、狭指向性スピーカ１５の指向性の向きを、天井２２の下側の任意の方向に向けるよう、駆動部１４を介して制御することができる。 With such a structure of the drive unit 14 and the narrow directional speaker 15, the second motor 143 has the orientation of the narrow directional speaker 15 when expressed in polar coordinate display with the axis from the top to the bottom of the vehicle as the polar axis. By changing the latitude (θ), the first motor 141 can change the longitude (φ) of the direction of the narrow directivity speaker 15. Therefore, the control unit 10 can control the directivity direction of the narrow directivity speaker 15 via the drive unit 14 so as to be directed to an arbitrary direction below the ceiling 22.

広指向性スピーカ１６は、車内のどの席に座っている乗員にも聞こえるように音声を出力する、指向性のない、あるいは広い指向性を有するスピーカである。 The wide directivity speaker 16 is a speaker that outputs sound so that it can be heard by a passenger sitting in any seat in the vehicle, and has no directivity or wide directivity.

操作部１７は、複数のメカニカルスイッチ等のユーザが操作可能な機器から成り、そのユーザの操作に応じた信号を制御部１０に出力する。この操作スイッチ群１７は、後述する秘話制御に用いられるためのボタンとして、全員ボタンおよび秘話ボタンを有している。 The operation unit 17 includes devices that can be operated by a user, such as a plurality of mechanical switches, and outputs a signal corresponding to the operation of the user to the control unit 10. The operation switch group 17 has an all-members button and a secret button as buttons used for secret control described later.

図５に、制御部１０がプログラムを実行することで実現する機能の構成を示す。制御部１０は、この図に示すようなハンズフリー制御、ビームフォーミング制御、および秘話制御を、電話通話開始を契機に、並列的に実行するようになっている。制御部１０は、電話通話の開始を、インターフェース１１を介した携帯電話機からの着信通知、またはユーザによる操作スイッチ群１７に対する電話呼び出し操作があったことに基づいて検出し、それに基づいて当該携帯電話機と制御信号をやり取りすることで通話回線を開き、上記制御を開始する。 FIG. 5 shows a configuration of functions realized by the control unit 10 executing a program. The control unit 10 performs hands-free control, beam forming control, and secret talk control as shown in this figure in parallel when the telephone call starts. The control unit 10 detects the start of the telephone call based on the incoming call notification from the mobile phone via the interface 11 or the telephone call operation on the operation switch group 17 by the user, and based on that, the mobile phone The control line is opened by exchanging control signals with and the above control is started.

ハンズフリー制御は、インターフェース１１に接続された携帯電話機と音声信号をやり取りすることで、乗員が携帯電話機から手を離したまま通話できるようにするための制御である。具体的には、制御部１０は、ハンズフリー制御のためのプログラムを実行することで、主に以下の（Ａ）、（Ｂ）の処理を実行する。
（Ａ）インターフェース１１を介して当該携帯電話機から受け取った通話相手の音声信号を狭指向性スピーカ１５および広指向性スピーカ１６のいずれか一方または両方に出力する。
（Ｂ）また、前部マイク群１２および天井マイク群１３のいずれかまたは両方から受けた乗員の発話音声の信号を、通話相手宛ての送信用の音声信号として、インターフェース１１を介して当該携帯電話機に出力する。 The hands-free control is a control for allowing a passenger to talk while leaving his / her hand away from the mobile phone by exchanging voice signals with the mobile phone connected to the interface 11. Specifically, the control unit 10 mainly executes the following processes (A) and (B) by executing a program for hands-free control.
(A) The other party's voice signal received from the mobile phone via the interface 11 is output to one or both of the narrow directional speaker 15 and the wide directional speaker 16.
(B) In addition, the cellular phone is connected via the interface 11 with a signal of the utterance voice of the occupant received from either or both of the front microphone group 12 and the ceiling microphone group 13 as a voice signal for transmission to the other party. Output to.

ビームフォーミング制御は、前部マイク群１２および天井マイク群１３の各マイクユニットのいずれかまたは両方から受けた音声信号のそれぞれに対して異なる遅延処理を施し、それら遅延処理の結果を合成することで、車室内の特定の位置からの音に対してのみ強い感度を有する音声信号を生成するための制御である。制御部１０は、各マイクユニットからの信号をどの程度遅延させるかについては、秘話制御中の後述する話者の口の位置算出の結果に基づいて決定する。 The beam forming control is performed by performing different delay processing on each of the audio signals received from either or both of the microphone units of the front microphone group 12 and the ceiling microphone group 13 and synthesizing the results of the delay processing. This is control for generating an audio signal having a strong sensitivity only to sound from a specific position in the vehicle interior. The control unit 10 determines how much the signal from each microphone unit is delayed based on the result of calculation of the position of the mouth of the speaker, which will be described later, during secret speech control.

秘話制御は、前部マイク群１２および天井マイク群１３から受けた乗員の発話音声に基づいて通話者の耳位置（または頭部位置）を推定し、その耳位置に狭指向性スピーカ１５の指向性を向ける制御である。この秘話制御のために制御部１０がＲＯＭから読み出して実行する秘話制御用プログラム１００のフローチャートを示す。以下、このフローチャートに沿って制御部１０の秘話制御の作動を説明する。 In the secret talk control, the ear position (or head position) of the caller is estimated based on the uttered voice of the occupant received from the front microphone group 12 and the ceiling microphone group 13, and the direction of the narrow directional speaker 15 is directed to the ear position. It is a control that directs sex. The flowchart of the secret talk control program 100 which the control part 10 reads from ROM for execution for this secret talk control is shown. Hereinafter, the operation of the secret story control of the control unit 10 will be described along this flowchart.

この秘話制御用プログラム１００は、大別して３つの部分を有している。第１の部分は、ステップ１０５、１１０、１１５から成る通話者特定部、第２の部分はステップ１２０、１２５、１３０、１３５、１４０、１４５から成る通話者追従部、第３の部分はステップ１５０、１５５、１６０から成る全員通話制御部である。 The secret story control program 100 is roughly divided into three parts. The first part is a caller identification unit comprising steps 105, 110 and 115, the second part is a caller tracking unit comprising steps 120, 125, 130, 135, 140 and 145, and the third part is step 150. 155, 160 is a call control unit for all members.

制御部１０は、秘話制御用プログラム１００の実行において、まずこの通話者特定部のステップ１０５において、乗員の発話があるまで待ち、発話があると、続いてステップ１１０を実行する。発話があるか否かは、前部マイク群１２、天井マイク群１３からの受信音量が所定の基準値を超えたか否かに基づいて判定する。また、車室内外の雑音による誤認識を提言するために、発話があるか否かは、前部マイク群１２、天井マイク群１３からの受信音声の信号を周波数解析し、人の声が強い周波数帯における強度が基準より高いか否かで判定するようになっていてもよい。 In the execution of the secret talk control program 100, the control unit 10 first waits until the occupant speaks in step 105 of the caller identification unit, and if there is a talk, subsequently executes step 110. Whether or not there is an utterance is determined based on whether or not the reception volume from the front microphone group 12 and the ceiling microphone group 13 exceeds a predetermined reference value. In addition, in order to propose misrecognition due to noise inside and outside the vehicle interior, whether or not there is a speech is determined by frequency analysis of the signals of the received speech from the front microphone group 12 and the ceiling microphone group 13 and a strong human voice. The determination may be made based on whether or not the intensity in the frequency band is higher than the reference.

ステップ１１０では、話者の耳位置を推定する。具体的には、まず話者の口の位置を算出し、その口の位置より基準長さ上方の位置を、話者の耳の位置として算出する。この基準長さは、例えば、人の口の位置の高さから耳の位置の高さまでの平均距離（例えば数センチ）である。話者の口の位置の算出は、以下のようにして行う。 In step 110, the ear position of the speaker is estimated. Specifically, first, the position of the mouth of the speaker is calculated, and a position that is a reference length higher than the position of the mouth is calculated as the position of the speaker's ear. The reference length is, for example, an average distance (for example, several centimeters) from the height of the person's mouth position to the height of the ear position. The speaker's mouth position is calculated as follows.

まず、前部マイク群１２におけるマイクユニット１２ａ〜ｃからの受信音声信号間の遅延時間を算出し、その遅延時間に基づいて、前部マイク群１２の位置から見た発話者の口の位置の向きを算出する。それぞれの遅延時間は、前部マイク群１２ａ〜ｃからの信号のうちから２つの信号を抽出するすべての組み合わせについて、その組み合わせの信号同士の相関関数を算出することで実現できる。このような遅延時間の算出および遅延時間に基づく発話者の方向の算出方法は、例えば特開２００１−２３６０９２号公報に詳述されている。上述した通り、マイクユニット１２ａ〜ｃは同一直線上に並んでいないので、前部マイク群１２の位置から見た発話者の口方向の上下角および左右角を算出することができる。 First, the delay time between the reception voice signals from the microphone units 12a to 12c in the front microphone group 12 is calculated, and the position of the mouth of the speaker viewed from the position of the front microphone group 12 is calculated based on the delay time. Calculate the orientation. Each delay time can be realized by calculating a correlation function between signals of the combinations for all combinations of extracting two signals from the signals from the front microphone groups 12a to 12c. Such calculation of the delay time and the method of calculating the direction of the speaker based on the delay time are described in detail in, for example, Japanese Patent Application Laid-Open No. 2001-236092. As described above, since the microphone units 12a to 12c are not arranged on the same straight line, the vertical and horizontal angles in the mouth direction of the speaker viewed from the position of the front microphone group 12 can be calculated.

続いて、天井マイク群１３におけるマイクユニット１３ａ〜ｃからの受信音声信号間の遅延時間を算出し、その遅延時間に基づいて、天井マイク群１３の位置から見た発話者の口の位置の向きを、前部マイク群１２の場合と同様に算出する。上述した通り、マイクユニット１３ａ〜ｃは同一直線上に並んでいないので、天井マイク群１３の位置から見た発話者の口方向の前後角および左右角を算出することができる。 Subsequently, the delay time between the received audio signals from the microphone units 13a to 13c in the ceiling microphone group 13 is calculated, and the direction of the position of the mouth of the speaker viewed from the position of the ceiling microphone group 13 based on the delay time. Is calculated in the same manner as in the case of the front microphone group 12. As described above, since the microphone units 13 a to 13 c are not arranged on the same straight line, it is possible to calculate the front-rear and left-right angles of the speaker's mouth viewed from the position of the ceiling microphone group 13.

続いて、このようにして算出した前部マイク群１２からの口の方向および天井マイク群１３からの口の方向の交差する点を、口の位置として算出する。なお、この算出は、ＲＯＭ中の前部マイク群１２および天井マイク群１３の取り付け位置座標を利用して行う。このようにして、車室内の発話者の口の位置が算出される。 Subsequently, the intersection of the mouth direction from the front microphone group 12 and the mouth direction from the ceiling microphone group 13 calculated in this way is calculated as the mouth position. This calculation is performed using the attachment position coordinates of the front microphone group 12 and the ceiling microphone group 13 in the ROM. In this way, the position of the mouth of the speaker in the passenger compartment is calculated.

なお、このステップ１１０において算出した発話者の耳位置は、基準耳位置として、ＲＡＭに記録する。 Note that the ear position of the speaker calculated in step 110 is recorded in the RAM as a reference ear position.

続いてステップ１１５では、ステップ１１０で算出した発話者の耳の位置に基づいて、狭指向性スピーカ１５のスピーカの指向性を向けるべき方向を算出し、その算出した方向に狭指向性スピーカ１５の実際の指向性が向くよう、駆動部１４を制御する。具体的には、ＲＯＭ中の狭指向性スピーカ１５の取り付け位置座標から、算出した耳の位置座標への方向を、指向性を向けるべき方向とする。そして、駆動部１４の第１モータ１４１および第２モータ１４３を制御することで、その方向に狭指向性スピーカ１５の実際の指向性を向ける。なおこの際、ハンズフリー制御における通話相手の音声の出力先として、狭指向性スピーカ１５のみを用いるようにする。 Subsequently, in step 115, a direction in which the directivity of the speaker of the narrow directivity speaker 15 is to be directed is calculated based on the position of the speaker's ear calculated in step 110, and the direction of the narrow directivity speaker 15 is calculated in the calculated direction. The drive unit 14 is controlled so that the actual directivity is suitable. Specifically, the direction from the attachment position coordinate of the narrow directivity speaker 15 in the ROM to the calculated ear position coordinate is a direction in which directivity should be directed. Then, by controlling the first motor 141 and the second motor 143 of the drive unit 14, the actual directivity of the narrow directivity speaker 15 is directed in that direction. At this time, only the narrow directivity speaker 15 is used as the output destination of the voice of the other party in the hands-free control.

このようにすることで、通話相手からの音声は、ステップ１０５で検出された発話の主のみに聞こえるようになる。結果として、この発話者は、電話通話を行う者、すなわち通話者として特定されたことになる。 In this way, the voice from the other party can be heard only by the main utterance detected in step 105. As a result, this speaker is identified as a person who makes a telephone call, that is, a caller.

続いて制御部１０は、通話者追従部のステップ１２０で、「車内で発話があり、かつ、通話相手からの音声が狭指向性スピーカ１５から出力されていない」という条件が満たされているか否かを判定し、満たされていれば続いてステップ１２５を実行し、満たされていなければ続いてステップ１４０を実行する。車内で発話があるか否かは、ステップ１０５における場合と同様にして判定する。通話相手からの音声が狭指向性スピーカ１５から出力されているか否かは、インターフェース１１から通話相手の音声信号を受けているか否かに基づいて判定する。 Subsequently, the control unit 10 determines whether or not the condition that “there is an utterance in the vehicle and the voice from the other party is not output from the narrow directivity speaker 15” is satisfied in Step 120 of the caller following unit. If it is satisfied, step 125 is executed. If it is not satisfied, step 140 is executed. Whether or not there is an utterance in the vehicle is determined in the same manner as in step 105. Whether or not the voice from the other party is output from the narrow directivity speaker 15 is determined based on whether or not the other party's voice signal is received from the interface 11.

ステップ１２５では、ステップ１２０で検出した発話の主の耳位置を、ステップ１１０と同じ方法で算出する。 In step 125, the main ear position of the utterance detected in step 120 is calculated by the same method as in step 110.

続いてステップ１３０では、ステップ１２０で検出した発話の主が、通話者特定部で特定した通話者であるか否かを判定する。具体的には、ステップ１２５で算出した耳位置と、ＲＡＭ中に基準耳位置として記憶されている位置とを比較する。そして、それら２つの位置間の距離が基準距離より小さい場合は、発話の主が通話者であると判定し、基準距離より大きい場合は、発話の主が通話者でないと判定する。ここで、基準距離は、あらかじめ決められた一定値（例えば２０センチ）であってもよいし、一定の範囲内でランダムに決まる値であってもよいし、各種条件に基づいて変動する値であってもよい。 Subsequently, in step 130, it is determined whether or not the main utterance detected in step 120 is the caller specified by the caller specifying unit. Specifically, the ear position calculated in step 125 is compared with the position stored as the reference ear position in the RAM. When the distance between the two positions is smaller than the reference distance, it is determined that the main speaker is the speaker, and when the distance is larger than the reference distance, it is determined that the main speaker is not the speaker. Here, the reference distance may be a predetermined constant value (for example, 20 centimeters), may be a value determined randomly within a certain range, or may be a value that varies based on various conditions. There may be.

例えば、基準耳位置が、車室内のどの座席に属するものかに基づいて、基準距離を変化させてもよい。例えば、基準耳位置が運転席に属する場合よりも、基準耳位置が助手席に属する場合の方が基準距離が大きくなってもよい。また、基準耳位置が助手席に属する場合よりも、基準耳位置が後部座席に属する場合の方が基準距離が大きくなってもよい。これは、運転席、助手席、後部座席の順に、乗員の動きやすさが増大するという性質を利用した設定である。なお、「耳位置が座席に属する」とは、耳位置が当該座席の上方の位置に存在することをいう。 For example, the reference distance may be changed based on which seat in the vehicle interior the reference ear position belongs to. For example, the reference distance may be larger when the reference ear position belongs to the passenger seat than when the reference ear position belongs to the driver seat. Further, the reference distance may be larger when the reference ear position belongs to the rear seat than when the reference ear position belongs to the passenger seat. This is a setting using the property that the occupant's mobility increases in the order of the driver's seat, the passenger seat, and the rear seat. Note that “the ear position belongs to the seat” means that the ear position exists at a position above the seat.

また、ハンズフリー装置１が、自車両が走行中か否かを検出できる場合、基準距離は、走行中であるときよりも、走行中でないときの方が大きくなっていてもよい。これは、走行中よりも停止中の方が乗員が動きやすいという性質を利用した設定である。自車両が走行中か否かについては、車両に取り付けられた車速センサに基づいて検出してもよいし、車両のシフトレバーがドライビングポジションにあるかパーキングポジションにあるかに基づいて判定してもよい。 Further, when the hands-free device 1 can detect whether or not the host vehicle is traveling, the reference distance may be larger when the vehicle is not traveling than when the vehicle is traveling. This is a setting that utilizes the property that the occupant can move more easily when stopped than when traveling. Whether or not the host vehicle is traveling may be detected based on a vehicle speed sensor attached to the vehicle, or may be determined based on whether the shift lever of the vehicle is in the driving position or the parking position. Good.

なお、この耳位置のずれが基準よりも大きいか否かを判定することは、狭指向性スピーカ１５から耳位置への方向のずれが基準よりも大きいが否かを判定することでもある。 Note that determining whether or not the deviation in the ear position is larger than the reference is also determining whether or not the deviation in the direction from the narrow directivity speaker 15 to the ear position is larger than the reference.

このステップ１３０で発話の主が通話者であると判定した場合、続いてステップ１３５を実行し、通話者でないと判定した場合、再度ステップ１２０を実行する。ステップ１３５では、ステップ１２５で算出した耳位置に基づいて、ステップ１１５と同じ方法で、当該耳位置に狭指向性スピーカ１５の指向性を向ける制御を行う。ステップ１３５の後、制御部１０は再度ステップ１２０を実行する。 If it is determined in step 130 that the main speaker is the caller, then step 135 is executed. If it is determined that the speaker is not a caller, step 120 is executed again. In step 135, based on the ear position calculated in step 125, the directivity of the narrow directivity speaker 15 is controlled to the ear position by the same method as in step 115. After step 135, the control unit 10 executes step 120 again.

ステップ１４０では、通話者をリセットする旨の乗員による操作があったか否かを判定する。具体的には、操作スイッチ群１７の秘話ボタンが押されたか否かを判定する。当該操作があった場合、続いて通話者特定部のステップ１０５を再度実行し、ない場合、続いてステップ１４５を実行する。 In step 140, it is determined whether or not there has been an operation by the passenger to reset the caller. Specifically, it is determined whether or not the secret button of the operation switch group 17 has been pressed. If there is such an operation, then step 105 of the caller identification unit is executed again, and if not, step 145 is executed subsequently.

ステップ１４５では、秘話を解除する旨の乗員による操作があったか否かを判定する。具体的には、操作スイッチ群１７の全員ボタンが押されたか否かを判定する。当該操作があった場合、続いて全員通話制御部のステップ１５０を実行し、ない場合、再度ステップ１２０を実行する。 In step 145, it is determined whether or not there has been an operation by the occupant to cancel the secret story. Specifically, it is determined whether or not the everyone button of the operation switch group 17 has been pressed. If there is such an operation, then step 150 of the all-call controller is executed. If not, step 120 is executed again.

ステップ１５０では、広指向性スピーカ１６を、ハンズフリー通話用のスピーカとして使用するよう設定する。これによって、通話相手の音声が広指向性スピーカ１６を通じて乗員全員に聞こえるようになる。 In step 150, the wide directional speaker 16 is set to be used as a speaker for hands-free calling. As a result, the voice of the other party can be heard by all the occupants through the wide directional speaker 16.

続いてステップ１５５では、秘話を再開する旨の乗員による操作があったか否かを判定する。具体的には、操作スイッチ群１７の秘話ボタンが押されたか否かを判定する。当該操作があった場合、続いてステップ１６０を実行し、ない場合再度ステップ１５５を実行する。 Subsequently, in step 155, it is determined whether or not there has been an operation by the passenger to resume the secret story. Specifically, it is determined whether or not the secret button of the operation switch group 17 has been pressed. If there is such an operation, then step 160 is executed, and if not, step 155 is executed again.

ステップ１６０では、広指向性スピーカ１６を、ハンズフリー通話用のスピーカとして使用しないよう設定する。これによって、通話相手の音声は、狭指向性スピーカ１５のみから出力される状態に戻る。ステップ１６０の後、再度通話者特定部のステップ１０５を実行する。 In step 160, the wide directional speaker 16 is set not to be used as a speaker for hands-free calling. As a result, the voice of the other party is returned to the state where it is output only from the narrow directional speaker 15. After step 160, step 105 of the caller identification unit is executed again.

以上のような秘話制御用プログラム１００を制御部１０が実行することで、ハンズフリー装置１は、ハンズフリー制御の開始後に最初に発話を行った乗員（ステップ１１０参照）の耳位置を算出し（ステップ１１０参照）、その耳位置に狭指向性スピーカ１５の指向性を向ける（ステップ１１５参照）ことで、その乗員を通話者として特定する。 When the control unit 10 executes the secret story control program 100 as described above, the hands-free device 1 calculates the ear position of the occupant (see step 110) who first spoke after starting the hands-free control ( In step 110, the directivity of the narrow directivity speaker 15 is directed to the ear position (see step 115), so that the occupant is specified as a caller.

その後、通話者のリセット（ステップ１４０参照）や秘話の解除の乗員操作（ステップ１４５）がない限り、通話者の発話のみに対応して（ステップ１２０、１２５、１３０参照）、耳位置に狭指向性スピーカ１５の指向性を追従させる（ステップ１３５参照）。このとき、通話相手からの音声が車室内に出力されている場合（ステップ１２０参照）や、通話者以外が発話した場合（ステップ１３０参照）は、それらの音に対応して狭指向性スピーカ１５の指向性の向きを変化させることがない。 After that, unless there is a reset of the caller (see step 140) or an occupant operation for canceling the secret story (step 145), only the caller's utterance is handled (see steps 120, 125, 130), and narrowly directed to the ear position. The directivity of the directional speaker 15 is made to follow (see step 135). At this time, when the voice from the other party is output in the passenger compartment (see step 120) or when a voice other than the talker speaks (see step 130), the narrow directivity speaker 15 corresponds to those sounds. Does not change the direction of the directivity.

また、通話者をリセットする操作があると（ステップ１４０参照）、その後最初に発話した（ステップ１０５参照）乗員を新たな通話者として特定し（ステップ１１０、１１５参照）、その後その新たな通話者に狭指向性スピーカ１５の指向性を追従させる（ステップ１２０、１２５、１３０、１３５参照）。 When there is an operation to reset the caller (see step 140), the occupant who spoke first (see step 105) is identified as a new caller (see steps 110 and 115), and then the new caller Is made to follow the directivity of the narrow directivity speaker 15 (see steps 120, 125, 130, and 135).

また、秘話解除の操作があると（ステップ１４５参照）、広指向性スピーカ１６をも出力用のスピーカとして用いることで（ステップ１６０参照）、全乗員に通話相手の音声が聞こえるようにする。このとき、秘話再開の操作があると（ステップ１５５参照）、広指向性スピーカ１６の使用を停止し（ステップ１６０参照）、その後最初に発話した（ステップ１０５参照）乗員を新たな通話者として特定し（ステップ１１０、１１５参照）、その後その新たな通話者に狭指向性スピーカ１５の指向性を追従させる（ステップ１２０、１２５、１３０、１３５参照）。 Further, when there is an operation for canceling the secret story (see step 145), the wide directional speaker 16 is also used as an output speaker (see step 160), so that all passengers can hear the voice of the other party. At this time, if there is an operation for resuming the secret story (see step 155), use of the wide directivity speaker 16 is stopped (see step 160), and then the first utterance (see step 105) is specified as a new caller. Then (see steps 110 and 115), the new speaker is made to follow the directivity of the narrow directivity speaker 15 (see steps 120, 125, 130, and 135).

このように、狭指向性スピーカ１５が制御部１０の制御によって姿勢を変えることで、その指向性の方向が、発話者の推定された頭部の方向に追従する。したがって、通話者の変化する耳位置に応じてスピーカの指向性が変化する。 Thus, the directionality of the directivity follows the direction of the head estimated by the speaker when the narrow directivity speaker 15 changes its posture under the control of the control unit 10. Therefore, the directivity of the speaker changes according to the ear position where the caller changes.

また、電話通話が始まった直後、最初に発話した乗員が通話者として特定され、その通話者に狭指向性スピーカ１５の指向性が追従する。したがって、通話毎に入れ替わる通話者に応じてスピーカの指向性が変化する。 In addition, immediately after the telephone call starts, the first occupant who speaks is specified as the caller, and the directivity of the narrow directivity speaker 15 follows the caller. Therefore, the directivity of the speaker changes according to the caller who changes every call.

このようになっていることで、通話者以外には煩わしい通話相手からの音声が、通話者のみに聞こえるようになるので、通話者以外の乗車環境が向上する。また、通話者以外の乗員には聞かれたくない通話相手からの音声が、通話者にのみ聞こえるようになるので、通話の秘匿性が高まる。 In this way, since the voice from the other party who is troublesome to the person other than the caller can be heard only by the caller, the boarding environment other than the caller is improved. In addition, since the voice from the other party who does not want to be heard by passengers other than the caller can be heard only by the caller, the confidentiality of the call is improved.

また、特定された通話者の発話であることを確認してから指向性スピーカ１５の姿勢を変化させることで、通話者以外の乗員が言葉を発したときに、または車内外で音が発生したときに、指向性スピーカが誤ってその方向に指向性を向けてしまうような事態の発生頻度を低減できる。これにより、指向性スピーカによる特定の一人の発話者への追従性の正確度がより高まる。 Also, by confirming the utterance of the specified caller and then changing the posture of the directional speaker 15, a sound was generated when an occupant other than the caller spoke a word or inside or outside the car. Sometimes, the frequency of occurrence of a situation in which the directional speaker erroneously directs the directivity in that direction can be reduced. Thereby, the accuracy of the followability to a specific one speaker by the directional speaker is further increased.

また、通話者が発話したか否かの判定を、推定した話者の耳位置の基準耳位置からの変化が基準量以下であるか否かによって行うことで、指向性の制御のために推定された耳位置を、本人確認のためにも利用することができる。なお、基準耳位置は、ステップ１１０が実行される度に、すなわち、基準耳位置は、秘話の開始、秘話の再開、秘話における通話者のリセットのうちいずれかがある度に更新される。したがって、基準耳位置は、特定された通話者の初期の耳位置となる。また、基準量は、推定された耳位置がどの座席に属しているものかに基づいて基準量を変化させる。したがって、車室内の座席毎の特性に応じたよりきめ細かい通話者確認が可能となる。 In addition, it is estimated for directivity control by determining whether or not the speaker has spoken based on whether or not a change in the estimated ear position of the speaker from the reference ear position is below a reference amount. The obtained ear position can also be used for identity verification. Note that the reference ear position is updated every time step 110 is executed, that is, the reference ear position is updated every time any one of the start of the secret story, the restart of the secret story, and the reset of the caller in the secret story. Therefore, the reference ear position is the ear position of the identified caller. Further, the reference amount is changed based on which seat the estimated ear position belongs to. Therefore, more detailed caller confirmation according to the characteristics of each seat in the vehicle interior becomes possible.

また、乗員による通話者リセットの入力操作があったことに基づいて、それまでの通話者が発話したか否かに関わらず、その操作の直後に発話を行った乗員の耳位置の変化に狭指向性スピーカ１５の指向性を追従させるので、狭指向性スピーカ１５の指向性を向けるべき対象の乗員を通話中に切り替えることが可能となる。 Also, based on the operator's input operation for resetting the caller, regardless of whether the previous caller spoke or not, the change is limited to the change in the ear position of the rider who spoke immediately after that operation. Since the directivity of the directional speaker 15 is made to follow, it becomes possible to switch the target occupant to which the directivity of the narrow directional speaker 15 should be directed during a call.

また、天井とダッシュボードという離れた位置にあるマイク群１２、１３を用いて話者の頭部位置および耳位置を推定することで、これらの位置を精度よく推定できる。 Further, by estimating the speaker's head position and ear position using the microphone groups 12 and 13 located at a distance from the ceiling and the dashboard, these positions can be estimated with high accuracy.

また、耳位置検出のための前部マイク群１２、１３を、ハンズフリー通話機能のために兼用できる。 Further, the front microphone groups 12 and 13 for detecting the ear position can be used for the hands-free call function.

また、狭指向性スピーカ１５が天井の中央部に取り付けられているので、障害物に邪魔されることがほとんどなく、どの座席の乗員の耳に対しても指向性を向けることができる。
（他の実施形態）
以上、本発明の実施形態について説明したが、本発明の範囲は、上記実施形態のみに限定されるものではなく、本発明の各発明特定事項の機能を実現し得る種々の形態を包含するものである。 Moreover, since the narrow directivity speaker 15 is attached to the center part of the ceiling, it is hardly obstructed by an obstacle, and directivity can be directed to the occupant's ear of any seat.
(Other embodiments)
As mentioned above, although embodiment of this invention was described, the scope of the present invention is not limited only to the said embodiment, The various form which can implement | achieve the function of each invention specific matter of this invention is included. It is.

例えば、前部マイク群１２の取り付け位置座標、天井マイク群１３の取り付け位置座標、および狭指向性スピーカ１５の取り付け位置座標の情報は、ハンズフリー装置１の車両の装着時に、取り付け者によって、書き換え可能な長期記憶メモリ（例えばフラッシュメモリ、バックアップＲＡＭ）に記録されるようになっていてもよい。 For example, the information of the attachment position coordinates of the front microphone group 12, the attachment position coordinates of the ceiling microphone group 13, and the attachment position coordinates of the narrow directivity speaker 15 is rewritten by the installer when the hands-free device 1 is mounted on the vehicle. It may be recorded in a possible long-term storage memory (for example, flash memory, backup RAM).

また、ステップ１３０の通話者か否かの判定においては、ステップ１２５で算出した耳位置の属する座席（運転席、助手席、後部右座席、後部中央座席、後部左座席）と、ＲＡＭ中に基準耳位置として記憶されている位置の属する座席とが同じであるときに、通話者の発話であると判定するようになっていてもよい。 In the determination of whether or not the person is a caller in step 130, the seat to which the ear position calculated in step 125 belongs (driver's seat, front passenger seat, rear right seat, rear center seat, rear left seat) and the reference in the RAM. When the seat to which the position stored as the ear position belongs is the same, it may be determined that the caller is speaking.

また、ステップ１３０の通話者か否かの判定においては、ステップ１２０で検出した音声の周波数解析を行って、周波数特性が通話者のものより基準以上変化した場合に、通話者以外の乗員が発話していると判定してもよい。この場合、通話者の周波数特性を記憶するため、ステップ１０５で検出した音声、およびステップ１３０で通話者のものであると特定した音声を、ＲＡＭに順次上書き記録するようになっていてもよい。 Further, in the determination of whether or not the caller is a caller in step 130, when the frequency analysis of the voice detected in step 120 is performed and the frequency characteristic changes more than the reference from that of the caller, a passenger other than the caller speaks. You may determine that you are doing. In this case, in order to store the frequency characteristics of the caller, the voice detected at step 105 and the voice identified as that of the caller at step 130 may be sequentially overwritten and recorded in the RAM.

また、ステップ１３５では、ステップ１２５で算出した耳位置を、新たな基準耳位置としてＲＡＭに上書きしてもよい。この場合、基準位置は通話者の発話がある度に更新されることになる。 In step 135, the ear position calculated in step 125 may be overwritten in the RAM as a new reference ear position. In this case, the reference position is updated every time the caller speaks.

また、制御部１０は、通話者追従部のステップ１２０では、車内で発話があるか否かだけを判定するようになっていてもよい。これは、狭指向性スピーカ１５から通話相手の音声が出力されていても、その音声を前部マイク群１２および天井マイク群１３が検出する可能性は低いと考えられるからである。 Moreover, the control part 10 may determine only whether there exists an utterance in a vehicle in step 120 of a caller tracking part. This is because even if the other party's voice is output from the narrow directional speaker 15, it is considered that the possibility that the front microphone group 12 and the ceiling microphone group 13 detect the voice is low.

また、乗員が操作スイッチ群１７を操作して秘話制御を行う旨を入力した場合にのみ、秘話制御用プログラム１００が実行されるようになっていてもよい。 Also, the secret story control program 100 may be executed only when the occupant inputs that the secret control is performed by operating the operation switch group 17.

また、前部マイク群１２は、天井部以外であれば、前部以外の位置にあってもよい。 Further, the front microphone group 12 may be at a position other than the front part as long as it is other than the ceiling part.

なお、天井マイク群１３の位置と狭指向性スピーカ１５の位置は、ほぼ同じであってもよい。その場合、前部マイク群１２は必ずしも必要でない。この場合には、制御部１０は、ステップ１１０および１２５では、話者の耳位置でなく、前部マイク群１２から話者の耳位置への方向を算出すればよい。そして、ステップ１１５および１３５では、算出されたこの方向に、狭指向性スピーカ１５の指向性を向けるよう制御すればよい。 The position of the ceiling microphone group 13 and the position of the narrow directivity speaker 15 may be substantially the same. In that case, the front microphone group 12 is not necessarily required. In this case, in Steps 110 and 125, the control unit 10 may calculate the direction from the front microphone group 12 to the speaker's ear position, not the speaker's ear position. In steps 115 and 135, control may be performed so that the directivity of the narrow directivity speaker 15 is directed to the calculated direction.

また、本発明に用いられる複数のマイクの配置は、例えば、同一面内にない４つのマイクから構成されていてもよい。 In addition, the arrangement of the plurality of microphones used in the present invention may be composed of, for example, four microphones that are not in the same plane.

また、制御部１０が各種プログラムを実行して実現する機能のそれぞれは、当該機能を実現する専用のハードウェアによって実現されていてもよい。 Further, each of the functions realized by the control unit 10 executing various programs may be realized by dedicated hardware that realizes the functions.

また、本発明の音声入出力装置は、ハンズフリー装置に限らず、車両用ナビゲーション装置等、どのような車両用音声入出力装置として実現されていてもよい。 The voice input / output device of the present invention is not limited to the hands-free device, and may be realized as any type of vehicle voice input / output device such as a vehicle navigation device.

本発明の実施形態に係るハンズフリー装置１の電気的接続構成を示すブロック図である。It is a block diagram which shows the electrical connection structure of the hands-free apparatus 1 which concerns on embodiment of this invention. 前部マイク群１２のダッシュボード２１上の取り付け位置を示す図である。It is a figure which shows the attachment position on the dashboard 21 of the front microphone group 12. FIG. 天井マイク群１３の天井２２における取り付け位置を示す図である。It is a figure which shows the attachment position in the ceiling 22 of the ceiling microphone group 13. FIG. 駆動部１４および狭指向性スピーカ１５の機械的接続構成を示す図である。It is a figure which shows the mechanical connection structure of the drive part 14 and the narrow directivity speaker 15. FIG. 制御部１０が実現する機能の構成を概念的に示すブロック図である。FIG. 3 is a block diagram conceptually showing a configuration of functions realized by a control unit 10. 制御部１０のＣＰＵが実行する秘話制御用プログラム１００のフローチャートである。4 is a flowchart of a secret story control program 100 executed by the CPU of the control unit 10.

符号の説明Explanation of symbols

１…ハンズフリー装置、１０…制御部、１１…インターフェース、
１２…前部マイク群、１３…天井マイク群、１４…駆動部、１５…狭指向性スピーカ、
１６…広指向性スピーカ、１７…操作スイッチ群、２１…ダッシュボード、
２２…天井、１００…秘話制御用プログラム、１４１…第１モータ、１４２…回転部、
１４３…第２モータ、１４４…固定部。 DESCRIPTION OF SYMBOLS 1 ... Hands free apparatus, 10 ... Control part, 11 ... Interface,
12 ... Front microphone group, 13 ... Ceiling microphone group, 14 ... Drive unit, 15 ... Narrow directional speaker,
16 ... Wide directional speaker, 17 ... Operation switch group, 21 ... Dashboard,
22 ... Ceiling, 100 ... Program for secret story control, 141 ... First motor, 142 ... Rotating part,
143 ... 2nd motor, 144 ... fixed part.

Claims

車両に搭載される車両用音声入出力装置であって、
姿勢に応じた方向に指向性を有する指向性スピーカと、
前記指向性スピーカの姿勢を変える駆動部と、
前記車両の車室内の複数位置に配置された複数のマイクと、
制御部と、を備え、
前記制御部は、前記複数のマイクが受けた音声に基づいて、前記指向性スピーカから話者の頭部への方向（以下、目標方向という）を繰り返し推定し、前記話者本人が発話したか否かを判定し、前記話者本人が発話していると判定したことに基づいて、前記目標方向の変化に前記指向性スピーカの指向性を追従させるよう、前記駆動部を制御するものであり、
さらに、前記制御部は、前記話者本人が発話したか否かの判定を、推定した前記目標方向の変化が基準量以下であるか否かに基づいて行うとともに、前記制御部が推定する前記目標方向に基づいて前記基準量を変化させることを特徴とする車両用音声入出力装置。 The vehicle audio input output device mounted on a vehicle,
A directional speaker having directivity in a direction according to the posture ;
A drive unit for changing the posture of the directional speaker;
A plurality of microphones arranged at a plurality of positions in the vehicle interior of the vehicle;
A control unit,
The control unit repeatedly estimates a direction from the directional speaker to the speaker's head (hereinafter referred to as a target direction) based on the voices received by the plurality of microphones , and whether the speaker himself spoke And controlling the drive unit so that the directivity of the directional speaker follows the change in the target direction based on determining whether or not the speaker himself is speaking . ,
Further, the control unit determines whether or not the speaker himself has spoken based on whether or not the estimated change in the target direction is equal to or less than a reference amount, and the control unit estimates An audio input / output device for a vehicle, wherein the reference amount is changed based on a target direction .

前記複数のマイクは、前記車室内の天井部に取り付けられる第１のマイク群と、前記車室内の天井部以外の部分（以下、別部分という）に取り付けられる第２のマイク群とを備え、
前記第１のマイク群のそれぞれは、前記天井部の一列に並ばない３箇所に配置され、
前記第２のマイク群のぞれぞれは、前記別部分の一列に並ばない３箇所に配置され、
前記制御部は、前記複数のマイクのそれぞれ受けた音声間の時間遅れに基づいて、前記話者の頭部位置を推定し、推定した頭部位置に基づいて、前記目標方向を推定することを特徴とする請求項１に記載の車両用音声入出力装置。 The plurality of microphones includes a first microphone group attached to a ceiling portion in the vehicle interior, and a second microphone group attached to a portion other than the ceiling portion in the vehicle interior (hereinafter referred to as another portion),
Each of the first microphone groups is arranged in three places that are not lined up in a row of the ceiling part,
Each of the second microphone groups is arranged in three places that are not arranged in a row in the different part,
The control unit estimates the head position of the speaker based on a time delay between sounds received by the plurality of microphones, and estimates the target direction based on the estimated head position. The audio input / output device for a vehicle according to claim 1, wherein

前記制御部は、ユーザによる所定の入力操作があったことに基づいて、前記話者本人が発話したか否かに関わらず、前記目標方向の変化に前記指向性スピーカの指向性を追従させるよう、前記駆動部を制御することを特徴とする請求項１又は２に記載の車両用音声入出力装置。 The control unit causes the directivity of the directional speaker to follow the change in the target direction regardless of whether or not the speaker himself speaks based on a predetermined input operation by the user. The vehicle voice input / output device according to claim 1 , wherein the drive unit is controlled.