JP2020077933A

JP2020077933A - Hands-free speech device and method for controlling hands-free speech device

Info

Publication number: JP2020077933A
Application number: JP2018208906A
Authority: JP
Inventors: 明子太田; Akiko Ota
Original assignee: Clarion Co Ltd
Current assignee: Faurecia Clarion Electronics Co Ltd
Priority date: 2018-11-06
Filing date: 2018-11-06
Publication date: 2020-05-21

Abstract

PROBLEM TO BE SOLVED: To allow for hands-free speech that is easier to hear.
SOLUTION: A speech device 10 determines the age of a first speaker X with an age determination unit 32 on the basis of a voice signal SX of the first speaker X, and determines the age of a second speaker Y on the basis of a voice signal SY of the second speaker Y. An equalizer unit 33 determines an equalizer characteristic on the basis of the ages of the first speaker X and the second speaker Y and, on the basis of the determined equalizer characteristic, adjusts the frequency characteristics of the voice signal SY to be output in the vehicle and the voice signal SX to be output to an external telephone terminal 51Y via a communication unit 22.
SELECTED DRAWING: Figure 3

Description

The present invention relates to a hands-free call device and a method for controlling a hands-free call device.

Among communication terminal devices such as mobile phones, one has been disclosed that captures an image of the user's face to determine the user's age (age group) and, based on the result, automatically corrects the frequency-gain characteristic applied to an input voice signal (see, for example, Patent Document 1). Hands-free call devices that are mounted on a vehicle or the like and enable a hands-free call between a driver in the vehicle cabin and an outside party are also known.

Patent Document 1: JP 2009-171189 A

Hearing and vocal ability change with age: high-pitched sounds become harder to hear and harder to utter. The conventional configuration corrects the voice of the voice terminal's user according to that user's own age, so it cannot apply a correction matched to the age of the other speaker.
An object of the present invention is therefore to enable a hands-free call that is easier to hear.

To achieve the above object, a hands-free call device that is mounted on a vehicle, outputs an input voice signal of a first speaker to an external telephone terminal via a communication unit, and receives a voice signal of a second speaker from the external telephone terminal via the communication unit and outputs it into the vehicle, comprises: an age determination unit that determines the age of the first speaker based on the voice signal of the first speaker and determines the age of the second speaker based on the voice signal of the second speaker; and an equalizer unit that determines an equalizer characteristic based on the ages of the first speaker and the second speaker and, based on the determined equalizer characteristic, adjusts the frequency characteristic of the voice signal of the first speaker and/or the voice signal of the second speaker.

In the above configuration, the equalizer unit determines an equalizer characteristic that compensates for the age-related change in vocalization on the speaking side and also compensates for the age-related change in hearing on the listening side.

In the above configuration, the device includes a noise cut unit that determines a noise cut characteristic based on the background noise on the speaking side, of the first speaker and the second speaker, and on the age of the listening side, and that, based on the determined noise cut characteristic, performs noise cutting on the voice signal output into the vehicle and/or the voice signal output to the communication unit.

In the above configuration, the noise cut unit performs a noise cut in which more of a predetermined noise is added as the noise cut gain is raised, and sets a noise cut characteristic in which the noise cut gain is lower the older the listening side is.

In the above configuration, the equalizer unit performs the process of adjusting the frequency characteristic when the age of at least one of the first speaker and the second speaker falls in an age range in which at least one of vocal ability and hearing is below a predetermined level, and cancels the process when neither speaker's age falls in that age range.

In the above configuration, the age determination unit determines the ages of the first speaker and the second speaker at time intervals, and the equalizer unit determines an equalizer characteristic for each determination result of the age determination unit and adjusts the frequency characteristic of the voice signal based on the determined equalizer characteristic.

In the above configuration, the noise cut characteristic includes a vehicle noise cut characteristic that suppresses noise peculiar to the vehicle, and the noise cut unit varies, according to the age, the gain of the noise cut characteristic other than the vehicle noise cut characteristic.

Further, in a method for controlling a hands-free call device that is mounted on a vehicle, outputs an input voice signal of a first speaker to an external telephone terminal via a communication unit, and receives a voice signal of a second speaker from the external telephone terminal via the communication unit and outputs it into the vehicle, an age determination unit determines the age of the first speaker based on the voice signal of the first speaker and determines the age of the second speaker based on the voice signal of the second speaker, and an equalizer unit determines an equalizer characteristic based on the ages of the first speaker and the second speaker and, based on the determined equalizer characteristic, adjusts the frequency characteristic of the voice signal of the first speaker and/or the voice signal of the second speaker.

In the above method, the equalizer unit determines an equalizer characteristic that compensates for the age-related change in vocalization on the speaking side and also compensates for the age-related change in hearing on the listening side.

In the above method, a noise cut unit determines a noise cut characteristic based on the background noise on the speaking side, of the first speaker and the second speaker, and on the age of the listening side, and, based on the determined noise cut characteristic, performs noise cutting on the voice signal output into the vehicle and/or the voice signal output to the communication unit.

In the above method, the noise cut unit performs a noise cut in which more of a predetermined noise is added as the noise cut gain is raised, and sets a noise cut characteristic in which the noise cut gain is lower the older the listening side is.

In the above method, the equalizer unit performs the process of adjusting the frequency characteristic when the age of at least one of the first speaker and the second speaker falls in an age range in which at least one of vocal ability and hearing is below a predetermined level, and cancels the process when neither speaker's age falls in that age range.

In the above method, the age determination unit determines the age of each speaker at time intervals, and the equalizer unit determines an equalizer characteristic for each determination result of the age determination unit and adjusts the frequency characteristic of the voice signal based on the determined equalizer characteristic.

In the above method, the noise cut characteristic includes a vehicle noise cut characteristic that suppresses noise peculiar to the vehicle, and the noise cut unit varies, according to the age, the gain of the noise cut characteristic other than the vehicle noise cut characteristic.

The present invention enables a hands-free call that is easier to hear.

FIG. 1 is a diagram showing an in-vehicle hands-free call device according to an embodiment of the present invention.
FIG. 2 is a block diagram showing the configuration of the hands-free engine, together with its peripheral configuration, for speech from the first speaker to the second speaker.
FIG. 3 is a block diagram showing the configuration of the hands-free engine, together with its peripheral configuration, for speech from the second speaker to the first speaker.
FIG. 4 is a flowchart showing the operation of the voice analysis unit and the age determination unit of the hands-free engine.
FIG. 5 is a diagram for explaining the operation of the voice analysis unit.
FIG. 6 is a diagram for explaining the operation of the voice analysis unit.
FIG. 7 is a diagram illustrating equalizer curves for the case where the first speaker is 59 years old or younger.
FIG. 8 is a diagram illustrating equalizer curves for the case where the first speaker is 60 to 69 years old.
FIG. 9 is a diagram illustrating noise curves.

Embodiments of the present invention are described below with reference to the drawings.
FIG. 1 is a diagram showing an in-vehicle hands-free call device according to an embodiment of the present invention.
This in-vehicle hands-free call device 10 (hereinafter, call device 10) is one of the on-board devices mounted on a vehicle such as an automobile, and is used by a first speaker X, an occupant of the vehicle (corresponding to the near-end speaker), to make a so-called hands-free call. The other party to the call (the far-end speaker) is a second speaker Y.

As shown in FIG. 1, the call device 10 receives a voice signal SX corresponding to the voice of the first speaker X via a microphone 11 arranged in the vehicle, applies predetermined voice processing to the voice signal SX in a hands-free engine 21, and outputs the processed voice signal SX' to a communication unit 22.
The communication unit 22 has a function of communicating wirelessly with any telephone terminal, such as a telephone terminal 51X located in the vehicle and a telephone terminal 51Y used by the second speaker Y. For example, a communication module that performs short-range wireless communication conforming to the Bluetooth (registered trademark) standard is used as the communication unit 22. A communication module that performs short-range wireless communication other than Bluetooth may also be used.

By communicating wirelessly with the telephone terminal 51X in the vehicle through the communication unit 22, the call device 10 can communicate with the external telephone terminal 51Y via a mobile communication network 55. During a hands-free call, the voice signal SX' of the first speaker X is transmitted from the call device 10 to the external telephone terminal 51Y, and the voice signal SY of the second speaker Y is transmitted from the external telephone terminal 51Y to the call device 10.

The telephone terminals 51X and 51Y have a function of communicating with any telephone terminal such as a mobile phone or a landline phone. In addition to a built-in speaker, a built-in microphone, and a communication unit for telephone communication, they include a communication module for short-range wireless communication with the call device 10. For example, the telephone terminal 51X is a mobile phone owned by the first speaker X, and the telephone terminal 51Y is a mobile phone or landline phone owned by the second speaker Y. The connection between the telephone terminal 51X and the call device 10 is not limited to a wireless connection and may be a wired connection.

The voice signal SY of the second speaker Y undergoes predetermined voice processing in the hands-free engine 21 in the call device 10 and is then output to a speaker 12 arranged in the vehicle. The speaker 12 emits the voice of the second speaker Y into the vehicle.
The microphone 11 and the speaker 12 may be either those already installed in the vehicle or those provided in the call device 10. The microphone 11 and the speaker 12 are not limited to a wired connection to the call device 10 and may be connected wirelessly.
In the external telephone terminal 51Y, the built-in speaker emits the voice corresponding to the voice signal SX' of the first speaker X, and the built-in microphone acquires the voice signal SY of the second speaker Y, which is transmitted to the call device 10.

The hands-free engine 21 is a processing unit that performs signal processing on the input voice signals SX and SY, and is implemented by, for example, a DSP (Digital Signal Processor). The hands-free engine 21 includes a voice analysis unit 31 that analyzes the voice signals SX and SY, an age determination unit 32 that determines the ages of the first speaker X and the second speaker Y based on the voice signals SX and SY, an equalizer unit 33 that performs equalizer processing to adjust the frequency characteristics of the voice signals SX and SY, and a noise cut unit 34 that performs noise cut processing on the voice signals SX and SY.

The hands-free engine 21 is described in detail below.
FIG. 2 is a block diagram showing the configuration of the hands-free engine 21, together with its peripheral configuration, for speech from the first speaker X to the second speaker Y.
As shown in FIG. 2, the voice analysis unit 31 has a first speaker voice analysis unit 31A and a second speaker voice analysis unit 31B. The first speaker voice analysis unit 31A analyzes the voice signal SX of the first speaker X, and the second speaker voice analysis unit 31B analyzes the voice signal SY of the second speaker Y.
Each voice analysis includes a process of identifying, from the voice signal SX or SY, acoustic characteristics that reflect the age of the speaker X or Y, and a process of identifying the background noise contained in the voice signal SX or SY.

The age determination unit 32 has a first speaker age determination unit 32A and a second speaker age determination unit 32B. The first speaker age determination unit 32A determines the age of the first speaker X based on the analysis result of the first speaker voice analysis unit 31A. The second speaker age determination unit 32B determines the age of the second speaker Y based on the analysis result of the second speaker voice analysis unit 31B.

The equalizer unit 33 has an equalizer characteristic determination unit 33A and an equalizer 33B. The equalizer characteristic determination unit 33A determines the equalizer characteristic for the voice signal SX based on the ages of the first speaker X and the second speaker Y. The equalizer 33B adjusts the frequency characteristic of the voice signal SX based on the determined equalizer characteristic. A known configuration for adjusting the frequency characteristic of a voice signal can be used for the equalizer 33B.
In general, the frequency characteristic of a voice signal is adjusted in order to absorb differences in vehicle characteristics. In this configuration, the high-frequency gain and the like are raised more the older the first speaker X and the second speaker Y are, which makes the voice easier to hear.

The noise cut unit 34 has a noise cut characteristic determination unit 34A and a noise canceller 34B. The noise cut characteristic determination unit 34A determines the noise cut characteristic based on the background noise on the speaking side (the first speaker X in FIG. 2), identified by the first speaker voice analysis unit 31A, and on the age of the listening side (the second speaker Y in FIG. 2).
The noise canceller 34B removes or suppresses noise in the voice signal SX based on the determined noise cut characteristic. A known configuration that digitally suppresses background noise whose power spectrum is nearly constant over time, such as fan noise and road noise (for example, a configuration using spectral subtraction), is applied to the noise canceller 34B.

Here, the noise cut characteristic determination unit 34A shifts the noise curve, which represents the noise cut characteristic, so that the gain suits the noise level identified by the voice analysis unit 31. In addition, because high-frequency noise becomes harder to hear as the listener ages, it corrects the noise curve so that high-frequency noise is suppressed less.
Because the noise canceller 34B performs the noise cut by digital processing, the higher the noise cut gain, the more artificial noise (for example, musical noise) is added.
In this configuration, for elderly listeners who have difficulty hearing high-pitched sounds, the noise cut characteristic reduces the amount of noise cut in the high-frequency components and the like, which limits the artificial noise that is added.
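The age-dependent noise cut described above can be pictured as spectral subtraction with a per-band subtraction gain. The following Python sketch is only an illustration under stated assumptions: the function names, the 2000 Hz band boundary, the gain values 0.4 and 0.7, and the spectral floor are all hypothetical and are not taken from the noise curves of FIG. 9.

```python
from typing import List, Sequence


def noise_cut_gain(listener_age: int, is_high_band: bool) -> float:
    """Subtraction gain for one band (all values are assumptions).

    Low bands always use the full gain. High bands use a smaller gain for
    older listeners, so less processing (and less musical noise) is applied.
    """
    if not is_high_band:
        return 1.0
    if listener_age >= 70:
        return 0.4
    if listener_age >= 60:
        return 0.7
    return 1.0


def spectral_subtract(
    power: Sequence[float],
    noise_power: Sequence[float],
    band_hz: Sequence[float],
    listener_age: int,
    high_band_from_hz: float = 2000.0,  # hypothetical band boundary
    floor: float = 0.05,                # hypothetical spectral floor
) -> List[float]:
    """Subtract the scaled noise estimate from each band's power."""
    cleaned = []
    for p, n, f in zip(power, noise_power, band_hz):
        gain = noise_cut_gain(listener_age, f >= high_band_from_hz)
        # Keep a small fraction of the input so a band never drops to zero.
        cleaned.append(max(p - gain * n, floor * p))
    return cleaned
```

With these assumed values, a 75-year-old listener receives the full noise cut in the low band and a reduced cut above 2000 Hz, while a 30-year-old listener receives the full cut in both bands.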

In this way, the voice signal SX of the first speaker X is noise-cut by the noise cut unit 34 and its frequency characteristic is adjusted by the equalizer unit 33, producing the voice signal SX'. The voice signal SX' is transmitted to the telephone terminal 51Y of the second speaker Y via the communication unit 22 and the telephone terminal 51X, and the second speaker Y hears the voice corresponding to the voice signal SX'.

FIG. 3 is a block diagram showing the configuration of the hands-free engine 21, together with its peripheral configuration, for speech from the second speaker Y to the first speaker X. FIG. 3 differs from FIG. 2 in that the equalizer unit 33 and the noise cut unit 34 adjust the frequency characteristic of, and cut noise from, the voice signal SY of the second speaker Y; description of the parts common to FIG. 2 is omitted.
In the equalizer unit 33, the equalizer characteristic determination unit 33A determines the equalizer characteristic for the voice signal SY based on the determination result of the age determination unit 32 (the ages of the first speaker X and the second speaker Y). The equalizer 33B then adjusts the frequency characteristic of the voice signal SY based on the determined equalizer characteristic.

In the noise cut unit 34, the noise cut characteristic determination unit 34A determines the noise cut characteristic based on the background noise on the speaking side (the second speaker Y), identified by the second speaker voice analysis unit 31B, and on the age of the listening side (the first speaker X), determined by the age determination unit 32. The noise canceller 34B then removes or suppresses noise in the voice signal SY based on the determined noise cut characteristic.
In this way, the voice signal SY of the second speaker Y is noise-cut and its frequency characteristic is adjusted, producing the voice signal SY'. The voice signal SY' is output to the speaker 12, and the first speaker X hears the voice corresponding to the voice signal SY'.

FIG. 4 is a flowchart showing the operation of the voice analysis unit 31 and the age determination unit 32 of the hands-free engine 21. Because the voice analysis unit 31 and the age determination unit 32 process the voice signals SX and SY in the same way, the following description takes the processing of the voice signal SX of the first speaker X as an example.
In step S1, the voice analysis unit 31 uses the first speaker voice analysis unit 31A to perform frequency analysis of the voice signal SX, identifies the fundamental frequency, power, and amplitude variation of the voice signal SX as acoustic characteristics that reflect age, and converts each identified value into a score (referred to as the first analysis process).

The fundamental frequency is the lowest of the formant frequencies, the 0th formant frequency (sometimes called the first formant frequency). It is obtained by frequency-analyzing the input voice signal SX and finding the lowest frequency at which a peak appears. The fundamental frequency tends to fall with age.
As shown in FIG. 5, the acquired fundamental frequency is scored using first data T1, which describes the relationship between fundamental frequency and score, and is thereby converted into a determination value in which a lower score indicates a higher likelihood that the speaker is elderly.
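The scoring of a measured value against data such as the first data T1 can be pictured as a threshold table lookup. The following Python sketch assumes a simple step table; the names `T1_THRESHOLDS_HZ` and `T1_SCORES` and all of their values are invented for illustration and are not the contents of FIG. 5.

```python
from bisect import bisect_right

# Hypothetical stand-in for the first data T1. The thresholds (Hz) and the
# scores are illustrative assumptions, not values taken from FIG. 5.
T1_THRESHOLDS_HZ = [100.0, 120.0, 140.0, 160.0]
T1_SCORES = [1, 2, 3, 4, 5]  # one more entry than there are thresholds


def score_fundamental_frequency(f0_hz: float) -> int:
    """Map a fundamental frequency to a score.

    A lower score indicates a higher likelihood that the speaker is elderly,
    because the fundamental frequency tends to fall with age.
    """
    return T1_SCORES[bisect_right(T1_THRESHOLDS_HZ, f0_hz)]
```

The same lookup shape would serve for the other scored items, each with its own table.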

The power is obtained by summing the power spectrum over the 0th to k-th formant frequencies (k is an integer that covers the band in use, for example 10). The power tends to fall with age.
As shown in FIG. 5, the obtained power is scored using second data T2, which describes the relationship between the summed value and score, and is thereby converted into a determination value in which a lower score indicates a higher likelihood that the speaker is elderly.

The amplitude variation is obtained by dividing the standard deviation of the maximum amplitude values of the frequency-analyzed waveform by their mean. The amplitude variation tends to increase with age.
As shown in FIG. 5, the amplitude variation is scored using third data T3, which describes the relationship between the amplitude variation value and score, and is thereby converted into a determination value in which a lower score indicates a higher likelihood that the speaker is elderly. This completes the first analysis process of step S1.
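The amplitude variation defined above, a standard deviation divided by a mean, is the quantity commonly called the coefficient of variation. A minimal Python sketch follows; the function name is hypothetical, and the use of the population standard deviation is an assumption, since the text does not say whether the population or sample form is intended.

```python
from statistics import mean, pstdev
from typing import Sequence


def amplitude_variation(peak_amplitudes: Sequence[float]) -> float:
    """Standard deviation of the peak amplitudes divided by their mean.

    Uses the population standard deviation (an assumption; the source does
    not specify which form is intended).
    """
    if len(peak_amplitudes) < 2:
        raise ValueError("need at least two peak amplitudes")
    average = mean(peak_amplitudes)
    if average == 0:
        raise ValueError("mean amplitude is zero")
    return pstdev(peak_amplitudes) / average
```

Because the result is a ratio, it does not depend on the overall signal level, so a loud and a quiet recording of the same voice yield the same value.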

In step S2, the voice analysis unit 31 uses the first speaker voice analysis unit 31A to perform syllable analysis of the voice signal SX, identifies the speech speed and silent time of the voice signal SX as acoustic characteristics that reflect age, and converts each identified value into a score (referred to as the second analysis process).
The speech speed is obtained as the number of morae per unit time. For example, チョコレート ("chocolate") has five morae, チョ, コ, レ, ー, and ト, and かっぱ ("kappa") has three morae, か, っ, and ぱ. The speech speed tends to fall with age.
As shown in FIG. 6, the speech speed is scored using fourth data T4, which describes the relationship between speech speed and score, and is thereby converted into a determination value in which a lower score indicates a higher likelihood that the speaker is elderly.
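The mora counts in the two examples above can be reproduced with a short Python sketch. It assumes the utterance is already available as a kana transcription, which is a simplification: the device described here works on the voice signal itself. The function names and the choice of seconds as the unit of time are assumptions.

```python
# Small kana that merge with the preceding kana into a single mora.
# The small tsu (っ/ッ) and the long-vowel mark (ー) are deliberately
# absent from this set: each of them counts as a mora of its own.
SMALL_KANA = set("ゃゅょぁぃぅぇぉゎャュョァィゥェォヮ")


def count_morae(kana: str) -> int:
    """Count morae in a string written purely in hiragana or katakana."""
    return sum(1 for ch in kana if ch not in SMALL_KANA and not ch.isspace())


def speech_speed(kana: str, duration_s: float) -> float:
    """Morae per second for an utterance of known duration."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    return count_morae(kana) / duration_s
```

For instance, `count_morae("チョコレート")` returns 5 and `count_morae("かっぱ")` returns 3, matching the counts given in the text.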

The silent time is obtained by measuring the silence between arbitrary syllables. For example, 0.6 seconds is taken as the maximum; any longer silence is regarded as an intentional pause in speech and is not used. The silent time tends to increase with age.
As shown in FIG. 6, the silent time is scored using fifth data T5, which describes the relationship between silent time and score, and is thereby converted into a determination value in which a lower score indicates a higher likelihood that the speaker is elderly. This completes the second analysis process of step S2.
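The 0.6-second cap on silent time can be sketched as a filter applied before the pauses are summarized. In the following Python sketch, averaging the remaining pauses is an assumption (the text only says that longer silences are not used), and the names `MAX_PAUSE_S` and `mean_silent_time` are hypothetical.

```python
from statistics import mean
from typing import Optional, Sequence

MAX_PAUSE_S = 0.6  # longer silences are treated as intentional pauses


def mean_silent_time(pauses_s: Sequence[float]) -> Optional[float]:
    """Average the pauses between syllables, ignoring those above the cap.

    Returns None when no usable pause remains.
    """
    kept = [p for p in pauses_s if 0.0 <= p <= MAX_PAUSE_S]
    return mean(kept) if kept else None
```

Discarding long silences keeps a speaker who simply stops talking, for example to listen, from being scored as older.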

In step S3, the age determination unit 32 determines the age of the first speaker X based on the determination values obtained by the first and second analysis processes. Specifically, it sums the determination values and, as shown in FIG. 6, determines the age using age determination data T6, which describes the relationship between the item total, that is, the sum of the determination values, and age.
The age determination process of steps S1 to S3 is executed repeatedly at time intervals. This allows the age of the new speaker to be identified promptly if either the first speaker X or the second speaker Y is replaced by another person during the call.
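Step S3 amounts to summing the five scores and looking the total up in a table. The following Python sketch uses a hypothetical stand-in for the age determination data T6; the thresholds, the three age bands, and the item names are assumptions chosen only to make the example concrete.

```python
from bisect import bisect_right
from typing import Mapping

# Hypothetical stand-in for the age determination data T6. A lower item
# total maps to an older age band; the thresholds are assumptions.
T6_TOTAL_THRESHOLDS = [10, 15]
T6_AGE_BANDS = ["70 or older", "60 to 69", "59 or younger"]


def determine_age_band(scores: Mapping[str, int]) -> str:
    """Sum the per-item scores and map the total to an age band."""
    total = sum(scores.values())
    return T6_AGE_BANDS[bisect_right(T6_TOTAL_THRESHOLDS, total)]


# Example input with the five items named in the text (scores are made up).
example_scores = {
    "fundamental_frequency": 2,
    "power": 3,
    "amplitude_variation": 2,
    "speech_speed": 3,
    "silent_time": 2,
}
```

Calling `determine_age_band` again on fresh scores at each interval would model the repeated execution of steps S1 to S3, so a change of speaker shows up at the next evaluation.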

Although FIG. 6 illustrates the case where table-format data is used as the age determination data T6, the present invention is not limited to this. For example, the age may be determined by creating a radar chart in which the axes of the items corresponding to the judgment values are arranged radially from the center in a regular polygon, and comparing the created radar chart with preset age determination charts.
Further, although the case has been described where the age is determined based on five kinds of judgment values, namely the fundamental frequency, power, amplitude fluctuation, utterance speed and silent time, the number and kinds of judgment values may be changed as appropriate.

The equalizer unit 33 stores a database for determining the equalizer characteristic from the combination of the ages of the first speaker X and the second speaker Y. The equalizer characteristic determination unit 33A determines the equalizer characteristic by referring to the database based on the ages of the first speaker X and the second speaker Y determined by the age determination unit 32.
This database describes in advance data specifying equalizer characteristics that compensate for the change in vocalization according to the age of the speaking side and also compensate for the change in hearing according to the age of the receiving side. The method is not limited to using a database; an arithmetic expression or the like that determines the equalizer characteristic from the ages of the first speaker X and the second speaker Y may be used instead.

In addition, the equalizer characteristic determination unit 33A determines the equalizer characteristic for each determination result of the age determination unit 32. The equalizer characteristic can thereby be determined based on the latest ages of the first speaker X and the second speaker Y.
Further, the equalizer characteristic determination unit 33A determines the equalizer characteristic and performs sound quality correction with the equalizer 33B only when the age of at least one of the first speaker X and the second speaker Y falls within an age range in which at least one of vocal ability and hearing is below a predetermined level (60 years or older in this configuration). In other words, when both the first speaker X and the second speaker Y are under 60, a call is regarded as sufficiently intelligible without equalizer correction, and the sound quality correction by the equalizer is canceled.
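The 60-year gating and the option of using an arithmetic expression instead of a database can be sketched together. The formula below (1.5 dB of high-band boost per decade over 50, summed over both speakers) is entirely hypothetical and is not taken from the patent; only the threshold of 60 and the two boosted bands come from the text.

```python
EQ_AGE_THRESHOLD = 60  # from the text: correct only if at least one speaker is 60 or older


def decades_over_50(age: int) -> int:
    """Number of full decades above age 50 (hypothetical helper)."""
    return max(0, (age - 50) // 10)


def decide_equalizer(speaker_age: int, listener_age: int):
    """Return boost values in dB, or None when equalizer correction is canceled."""
    if speaker_age < EQ_AGE_THRESHOLD and listener_age < EQ_AGE_THRESHOLD:
        return None  # both under 60: the call is left uncorrected
    # Hypothetical expression: the boost grows with the ages of both parties.
    high_boost = 1.5 * (decades_over_50(speaker_age) + decades_over_50(listener_age))
    return {"mid_1000hz_db": high_boost / 2, "high_2000_4000hz_db": high_boost}


print(decide_equalizer(45, 59))  # correction canceled
print(decide_equalizer(72, 65))  # both bands boosted
```

A database-backed variant would replace the expression with a lookup keyed on the pair of age bands, at the cost of storing one entry per combination.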

FIG. 7 illustrates the curve of the equalizer characteristic (equalizer curve) when the first speaker X is 59 years old or younger. FIG. 8 illustrates the equalizer curve when the first speaker X is 60 to 69 years old.
As illustrated in FIGS. 6 and 7, the equalizer curve boosts the high-frequency region of the human voice (2000 Hz to 4000 Hz, as an example) and also boosts the frequency band normally contained in the human voice (a range centered on 1000 Hz, as an example), with the gain increasing as the ages of the first speaker X and the second speaker Y increase.
This equalizer curve is an equalizer characteristic that can compensate both for the age-related change in vocalization on the speaking side and for the age-related decline in hearing on the receiving side.

The equalizer characteristics applied to the voice signal SX of the first speaker X and the voice signal SY of the second speaker Y may be different from each other or may be a common characteristic.
When different equalizer characteristics are used, the characteristic applied to the voice signal SX of the first speaker X can be a more accurate one that compensates for the change in vocalization according to the age of the first speaker X, who is the speaking side, and for the decline in hearing according to the age of the second speaker Y, who is the receiving side. The equalizer characteristic therefore accurately reflects the vocal ability of the speaking side and the hearing of the receiving side, and ease of listening for the second speaker Y can be improved effectively.
Likewise, the characteristic applied to the voice signal SY of the second speaker Y can be a more accurate one that compensates for the change in vocalization according to the age of the second speaker Y, who is the speaking side, and for the decline in hearing according to the age of the first speaker X, who is the receiving side, so that ease of listening for the first speaker X can be improved effectively.

In contrast, when a common equalizer characteristic is used, the characteristic may be determined by considering the ages of both parties without distinguishing between the speaking side and the receiving side. Even in this case, compared with determining the equalizer characteristic from the age of only one of the two sides, it becomes easier to compensate for the change in vocalization on the speaking side and the decline in hearing on the receiving side, and ease of listening on the receiving side can be improved. Further, compared with using different equalizer characteristics, a common characteristic reduces the processing needed to determine the equalizer characteristic and thus the processing load required of the equalizer unit 33.

The noise cut unit 34 stores a database that determines the noise cut characteristic based on the background noise on the speaking side and the age of the receiving side, or an arithmetic expression that determines the noise cut characteristic from the background noise and the age of the receiving side. The noise cut characteristic determination unit 34A then determines the noise cut characteristic by referring to the database or the arithmetic expression based on the background noise on the speaking side identified by the voice analysis unit 31 and the age of the receiving side determined by the age determination unit 32.
Further, the noise cut characteristic determination unit 34A determines the noise cut characteristic for each determination result of the age determination unit 32. The noise cut characteristic can thereby be determined based on the latest age of the receiving side.

FIG. 9 illustrates the curve of the noise characteristic (noise curve).
As illustrated in FIG. 9, the noise curve is a noise cut characteristic capable of suppressing the background noise on the speaking side, in which the noise cut gain (suppression level) is set lower the older the receiving side is.
More specifically, the noise curve includes a vehicle noise cut characteristic fa (below 1000 Hz in the example of FIG. 9) that suppresses vehicle-specific noise (for example, road noise). The noise cut unit 34 lowers the gain of the noise cut characteristic fb (2000 Hz to 4000 Hz in the example of FIG. 9), which is the part of the noise cut characteristic other than the vehicle noise cut characteristic fa, as the age of the receiving side increases.

Because the noise cut characteristic uses a lower suppression level the older the receiving side is, the noise cut of high-frequency components is reduced for elderly listeners, who have difficulty hearing those components, and the amount of artificial noise added as a side effect of the noise cut is reduced accordingly. In the example of FIG. 9, when the listener is 90 years old or older, the suppression level in the range of 2000 Hz to 4000 Hz is lowered to 0 to 2 dB, so the addition of artificial noise caused by the noise cut is almost entirely avoided.
Moreover, since the gain of the vehicle noise cut characteristic fa is not affected by the age of the receiving side, vehicle-specific noise can be suppressed effectively.
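A minimal sketch of such an age-dependent noise cut characteristic follows. The band edges (below 1000 Hz for fa, 2000 Hz to 4000 Hz for fb) and the 2 dB ceiling for listeners aged 90 or older come from the text; every other suppression level, the behavior outside the two bands, and the name `suppression_db` are hypothetical.

```python
VEHICLE_BAND_MAX_HZ = 1000         # fa: vehicle noise band (below 1000 Hz)
HIGH_BAND_HZ = (2000, 4000)        # fb: age-dependent band
FA_SUPPRESSION_DB = 12.0           # hypothetical fixed level for fa

# Hypothetical fb levels: (minimum listener age, suppression in dB).
FB_SUPPRESSION_BY_AGE_DB = [(90, 2.0), (80, 4.0), (70, 6.0), (60, 8.0), (0, 10.0)]


def suppression_db(freq_hz: float, listener_age: int) -> float:
    """Suppression level applied at freq_hz for a listener of the given age."""
    if freq_hz < VEHICLE_BAND_MAX_HZ:
        return FA_SUPPRESSION_DB  # fa is independent of the listener's age
    low, high = HIGH_BAND_HZ
    if low <= freq_hz <= high:
        for min_age, level in FB_SUPPRESSION_BY_AGE_DB:
            if listener_age >= min_age:
                return level
    return 0.0  # assumption: no suppression outside the two described bands


print(suppression_db(500, 30), suppression_db(500, 95))
print(suppression_db(3000, 40), suppression_db(3000, 95))
```

Keeping fa fixed while only fb varies mirrors the stated design: road noise is always suppressed, while the trade-off between noise removal and artificial noise is tuned to the listener's hearing.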

As described above, the communication device 10 determines, with the age determination unit 32, the age of the first speaker X based on the voice signal SX of the first speaker X and the age of the second speaker Y based on the voice signal SY of the second speaker Y, determines, with the equalizer unit 33, the equalizer characteristic based on the ages of the first speaker X and the second speaker Y, and adjusts the frequency characteristics of the voice signals SX and SY based on the determined equalizer characteristic. The frequency characteristics of the voice signals SX and SY can thereby be adjusted in consideration of the ages of both speakers X and Y, enabling a hands-free call that is easy to hear regardless of age.
Moreover, since the equalizer unit 33 determines an equalizer characteristic that compensates for the change in vocalization according to the age of the speaking side and for the change in hearing according to the age of the receiving side, the frequency characteristics of the voice signals SX and SY can be adjusted in consideration of the vocal ability of the speaking side and the hearing of the receiving side, which makes it easier to achieve a hands-free call that is effectively easy to hear.

Further, the communication device 10 determines, with the noise cut unit 34, the noise cut characteristic based on the background noise on the speaking side and the age of the receiving side among the first speaker X and the second speaker Y, and performs noise cut on the voice signals SX and SY based on the determined noise cut characteristic. Noise cut can therefore take into account hearing, which is affected by age, enabling a hands-free call that is easier to hear.

Further, the noise cut unit 34 performs noise cut of a type in which more of a predetermined noise is added as the noise cut gain is raised, and it uses a noise cut characteristic with a lower noise cut gain the older the receiving side is. For elderly listeners, who have difficulty hearing high-frequency noise, the cutting of high-frequency noise is therefore reduced, and the sound quality degradation caused by artificial noise is suppressed. Adjusting the balance between the amount of noise cut and the amount of artificial noise added in this way enables a hands-free call that is easier to hear.
Further, the noise cut characteristic includes the vehicle noise cut characteristic fa, which suppresses vehicle-specific noise, and the noise cut unit 34 varies with age only the gain of the part of the noise cut characteristic other than the vehicle noise cut characteristic fa. The balance between the amount of noise cut and the amount of artificial noise added can therefore be adjusted appropriately while vehicle-specific noise is suppressed effectively.

Further, the equalizer unit 33 performs the processing of adjusting the frequency characteristic when the age of at least one of the first speaker X and the second speaker Y falls within an age range in which at least one of vocal ability and hearing is below a predetermined level, and cancels that processing when neither speaker's age falls within that range. Accordingly, when the vocal ability and hearing of the speakers can be regarded as sufficiently high judging from their ages, the call proceeds with natural voice whose frequency characteristic has not been adjusted.

Further, the age determination unit 32 determines the ages of both speakers X and Y at time intervals, and the equalizer unit 33 determines the equalizer characteristic for each determination result of the age determination unit 32 and adjusts the frequency characteristics of the voice signals SX and SY based on the determined equalizer characteristic. Accordingly, if either speaker is replaced by another person during a call, the frequency characteristic of the voice can be adjusted in consideration of that person's age, and an easy-to-hear hands-free call can be continued.

In this configuration the frequency characteristics of both voice signals SX and SY are adjusted, but the present invention is not limited to this, and the frequency characteristic of only one of the voice signals SX and SY may be adjusted. In other words, adjusting the frequency characteristic of the voice signal SX and/or the voice signal SY based on the ages of the first speaker X and the second speaker Y enables a hands-free call that is easy to hear for at least one of the first speaker X and the second speaker Y.
Likewise, the present invention is not limited to performing noise cut on both voice signals SX and SY; noise cut may be performed on only one of them.

The above embodiment merely illustrates one mode of carrying out the present invention and can be modified and applied as desired without departing from the spirit of the present invention.
For example, the components of the communication device 10 described above may be divided or combined. Each component can be implemented in any manner, such as through cooperation of hardware and software. In the flowchart as well, the processing corresponding to each step may be divided or combined.
Further, although the case has been described where the communication device 10 performs both equalizer processing and noise cut processing, the noise cut processing may be omitted. When the noise cut processing is omitted, the equalizer processing should be performed so that sufficient ease of listening is ensured.

Further, although the case has been described where the present invention is applied to the communication device 10 shown in FIGS. 1 to 3 and to the method for controlling the communication device 10, the present invention is not limited to this. For example, the present invention may be applied to a hands-free communication device with a built-in telephone module that takes the place of the telephone terminal 51X, and it is applicable to hands-free communication devices of various known configurations.

10 In-vehicle hands-free communication device
11 Microphone
12 Speaker
21 Hands-free engine
22 Communication unit
31 Voice analysis unit
31A First speaker voice analysis unit
31B Second speaker voice analysis unit
32 Age determination unit
32A First speaker age determination unit
32B Second speaker age determination unit
33 Equalizer unit
33A Equalizer characteristic determination unit
33B Equalizer
34 Noise cut unit
34A Noise cut characteristic determination unit
34B Noise canceller

Claims

A hands-free communication device that is mounted on a vehicle, outputs an input voice signal of a first speaker to an external telephone terminal via a communication unit, and receives a voice signal of a second speaker from the external telephone terminal via the communication unit and outputs it into the vehicle, the hands-free communication device comprising:
an age determination unit that determines the age of the first speaker based on the voice signal of the first speaker and determines the age of the second speaker based on the voice signal of the second speaker; and
an equalizer unit that determines an equalizer characteristic based on the ages of the first speaker and the second speaker and, based on the determined equalizer characteristic, adjusts the frequency characteristic of the voice signal of the first speaker and/or the voice signal of the second speaker.

The hands-free communication device according to claim 1, wherein the equalizer unit determines an equalizer characteristic that compensates for a change in vocalization according to the age of the speaking side and compensates for a change in hearing according to the age of the receiving side.

The hands-free communication device according to claim 1 or 2, further comprising a noise cut unit that determines a noise cut characteristic based on the background noise on the speaking side and the age of the receiving side among the first speaker and the second speaker and, based on the determined noise cut characteristic, performs noise cut on the voice signal output into the vehicle and/or the voice signal output to the communication unit.

The hands-free communication device according to claim 3, wherein the noise cut unit performs noise cut in which more of a predetermined noise is added as the noise cut gain is raised, and
uses a noise cut characteristic with a lower noise cut gain the older the receiving side is.

The hands-free communication device according to any one of claims 1 to 4, wherein the equalizer unit performs the processing of adjusting the frequency characteristic when the age of at least one of the first speaker and the second speaker falls within an age range in which at least one of vocal ability and hearing is below a predetermined level, and cancels the processing when the ages of both speakers are not within the age range.

The hands-free communication device according to any one of claims 1 to 5, wherein the age determination unit determines the ages of the first speaker and the second speaker at time intervals, and
the equalizer unit determines the equalizer characteristic for each determination result of the age determination unit and adjusts the frequency characteristic of the voice signal based on the determined equalizer characteristic.

The hands-free communication device according to claim 3 or 4, wherein the noise cut characteristic includes a vehicle noise cut characteristic that suppresses vehicle-specific noise, and
the noise cut unit varies, according to the age, the gain of the part of the noise cut characteristic other than the vehicle noise cut characteristic.

A method for controlling a hands-free communication device that is mounted on a vehicle, outputs an input voice signal of a first speaker to an external telephone terminal via a communication unit, and receives a voice signal of a second speaker from the external telephone terminal via the communication unit and outputs it into the vehicle, the method comprising:
determining, with an age determination unit, the age of the first speaker based on the voice signal of the first speaker and the age of the second speaker based on the voice signal of the second speaker; and
determining, with an equalizer unit, an equalizer characteristic based on the ages of the first speaker and the second speaker and, based on the determined equalizer characteristic, adjusting the frequency characteristic of the voice signal of the first speaker and/or the voice signal of the second speaker.

The method for controlling a hands-free communication device according to claim 8, wherein the equalizer unit determines an equalizer characteristic that compensates for a change in vocalization according to the age of the speaking side and compensates for a change in hearing according to the age of the receiving side.

The method for controlling a hands-free communication device according to claim 8 or 9, wherein a noise cut unit determines a noise cut characteristic based on the background noise on the speaking side and the age of the receiving side among the first speaker and the second speaker and, based on the determined noise cut characteristic, performs noise cut on the voice signal output into the vehicle and/or the voice signal output to the communication unit.

The method for controlling a hands-free communication device according to claim 10, wherein the noise cut unit performs noise cut in which more of a predetermined noise is added as the noise cut gain is raised, and
uses a noise cut characteristic with a lower noise cut gain the older the receiving side is.

The method for controlling a hands-free communication device according to any one of claims 8 to 11, wherein the equalizer unit performs the processing of adjusting the frequency characteristic when the age of at least one of the first speaker and the second speaker falls within an age range in which at least one of vocal ability and hearing is below a predetermined level, and cancels the processing when the ages of both speakers are not within the age range.

The method for controlling a hands-free communication device according to any one of claims 8 to 12, wherein the age determination unit determines the age of each speaker at time intervals, and
the equalizer unit determines the equalizer characteristic for each determination result of the age determination unit and adjusts the frequency characteristic of the voice signal based on the determined equalizer characteristic.

The method for controlling a hands-free communication device according to claim 10 or 11, wherein the noise cut characteristic includes a vehicle noise cut characteristic that suppresses vehicle-specific noise, and
the noise cut unit varies, according to the age, the gain of the part of the noise cut characteristic other than the vehicle noise cut characteristic.