WO2020202444A1 - Physical condition detection system - Google Patents

Physical condition detection system

Info

Publication number
WO2020202444A1
WO2020202444A1 (PCT/JP2019/014526)
Authority
WO
WIPO (PCT)
Prior art keywords
physical condition
baby
voice data
voice
detection system
Application number
PCT/JP2019/014526
Other languages
French (fr)
Japanese (ja)
Inventor
伴之 服部 (Tomoyuki Hattori)
Original Assignee
株式会社ファーストアセント (First Ascent Co., Ltd.)
Application filed by 株式会社ファーストアセント
Priority to PCT/JP2019/014526
Publication of WO2020202444A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03: characterised by the type of extracted parameters
    • G10L25/15: the extracted parameters being formant information
    • G10L25/48: specially adapted for particular use
    • G10L25/51: for comparison or discrimination
    • G10L25/66: for extracting parameters related to health condition
    • G10L25/90: Pitch determination of speech signals
    • G16: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H: HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00: ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance

Definitions

  • the present invention relates to a physical condition detection system.
  • Patent Document 1 discloses an infant health monitoring system that processes measurement data to determine whether there is a problem with the health of the infant.
  • the measurement data collected by the infant monitoring device includes infant movement, temperature, position and awakening.
  • The infant monitoring system disclosed in Patent Document 1 determines infant health problems, such as being in a state of SIDS (Sudden Infant Death Syndrome), epilepsy, a disturbed sleep pattern, fever, or stress, based on the measurement data and environmental sensor data collected by the infant monitoring device.
  • However, the health problems of infants are not limited to these, and various other illnesses are possible. The infant monitoring system disclosed in Patent Document 1 therefore has the problem that it is difficult to detect that an infant is affected by such various illnesses.
  • The physical condition detection system includes: a voice data acquisition unit that acquires first voice data representing a baby's voice; a frequency component detection unit that detects a plurality of frequency components included in the first voice data; a voice parameter extraction unit that extracts a plurality of parameters based on the plurality of frequency components included in the first voice data; and a physical condition determination unit that, when second voice data representing the baby's voice is acquired by the voice data acquisition unit, determines whether or not the baby is in a state of poor physical condition based on the acquired second voice data and the plurality of parameters.
  • Preferably, the frequency component detection unit further detects the plurality of frequency components included in the second voice data, the voice parameter extraction unit further extracts the plurality of parameters based on those frequency components, and the physical condition determination unit performs the determination when at least a part of the plurality of parameters based on the plurality of frequency components included in the second voice data satisfies a predetermined condition.
  • Preferably, the at least a part of the parameters is the fundamental frequency of the vocalization included in the second voice data, and the predetermined condition is that the fundamental frequency is 300 Hz or more and 800 Hz or less.
  • Preferably, the physical condition determination unit makes the determination based on the similarity between the plurality of parameters based on the plurality of frequency components included in the first voice data and the plurality of parameters based on the plurality of frequency components included in the second voice data.
  • Preferably, the plurality of parameters include the number of occurrences of the utterance per unit time and the duration of one utterance.
  • Preferably, the state of poor physical condition includes at least one of a state in which the baby is sick and a state in which the baby is becoming sick.
  • Preferably, the system further includes a message output device that, when the physical condition determination unit determines that the baby is in poor physical condition, outputs a message to that effect.
  • Preferably, the system further includes an emotion estimation unit that, when the second voice data is acquired by the voice data acquisition unit, estimates the baby's emotions based on the acquired second voice data and the plurality of parameters.
  • FIG. 1 is a diagram showing a configuration of a physical condition detection system according to an embodiment of the present invention.
  • FIG. 2 is a diagram illustrating how voice data representing the utterance of an infant is acquired.
  • FIG. 3 is a diagram illustrating recorded data in which a plurality of frequency components included in voice data representing a baby's utterance and values of a plurality of parameters based on the plurality of frequency components are recorded.
  • FIG. 4 is a diagram showing an example of parameter recording processing executed by running a computer program on the voice detection device according to the embodiment.
  • FIG. 5 is a diagram illustrating a case where voice data representing the vocalization of an infant is acquired and the infant is determined to be in a poor physical condition.
  • FIG. 6 is a diagram showing an example of a physical condition determination process executed by running a computer program on the voice detection device according to the embodiment.
  • FIG. 7 is a diagram illustrating product supply modes of the computer program executed by the voice detection device according to the embodiment.
  • FIG. 8 is a diagram illustrating recorded data used in the physical condition determination process executed by the physical condition detection system in the second modification of the embodiment of the present invention.
  • FIG. 9 is a diagram showing a configuration of a voice detection device included in the physical condition detection system according to the fifth modification of the embodiment of the present invention.
  • FIG. 10 is a diagram showing a configuration of a physical condition detection system according to a modification 7 of the embodiment of the present invention.
  • FIG. 1 is a diagram showing a configuration of a physical condition detection system 2 according to an embodiment of the present invention.
  • the physical condition detection system 2 includes a voice detection device 5 and an electronic device 10.
  • the voice detection device 5 and the electronic device 10 are connected to each other via wireless communication such as Bluetooth (registered trademark).
  • the electronic device 10 is, for example, a mobile terminal such as a smartphone, a tablet computer, or a personal computer, and may be a dedicated terminal or a general-purpose terminal.
  • the voice detection device 5 includes a microphone 3, a storage 4, a processor 6, a memory 7, and a communication module 8.
  • The microphone 3 converts sound, including human utterances and environmental sounds, into voice data as an electric signal.
  • The processor 6 functions logically as a voice data acquisition unit 61, a frequency component detection unit 62, a voice parameter extraction unit 63, and a physical condition determination unit 64 by executing a computer program stored in the memory 7.
  • The voice data acquisition unit 61 acquires voice data representing the voice of an infant via the microphone 3, together with voice data representing environmental sounds in the installation environment of the voice detection device 5 and/or voice data representing the voice of an adult.
  • the acquired voice data includes a plurality of frequency components.
  • The frequency component detection unit 62 detects a plurality of frequency components included in voice data of sound that includes the utterance of the baby. Infant vocalizations include crying and babbling. From those detected frequency components, a fundamental frequency, a first formant frequency, and a second formant frequency can be obtained. It is generally known that the fundamental frequency determines the pitch, and that the first formant frequency and the second formant frequency determine the timbre.
  • the voice parameter extraction unit 63 extracts a plurality of parameters based on a plurality of frequency components detected by the frequency component detection unit 62 included in the voice data acquired by the voice data acquisition unit 61.
  • the plurality of extracted parameters are the number of infant utterances per unit time, the duration of one infant utterance, and the fundamental frequency of the infant utterance, as will be described later using FIG. 3 (b).
  • When the audio waveform is approximated by a combination of a plurality of sine waves, the frequency of the sine wave with the lowest frequency among them is called the fundamental frequency.
  • The first formant frequency and the second formant frequency are obtained, in ascending order from the fundamental frequency, as the frequencies corresponding to amplitude peaks among the sine waves whose frequencies are integral multiples of the fundamental frequency. The fundamental frequency, the first formant frequency, and the second formant frequency included in the plurality of frequency components detected by the frequency component detection unit 62 in the voice data acquired by the voice data acquisition unit 61 can therefore be extracted as the above-mentioned parameters by the voice parameter extraction unit 63.
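The document does not specify an extraction algorithm, so the peak-picking it describes can only be sketched. The following is a minimal FFT-based routine under assumptions: the function name `estimate_frequencies` and the synthetic test signal are illustrative, the lowest significant spectral peak is taken as the fundamental, and the next two peaks stand in for the first and second formant frequencies in the text's simplified sense (production formant tracking typically uses LPC instead).

```python
import numpy as np

def estimate_frequencies(samples, rate, fmin=100.0, n_peaks=3):
    """Return up to n_peaks spectral peak frequencies in ascending order:
    the lowest significant peak as the fundamental, the next two standing
    in for the first and second "formant" frequencies of the text."""
    windowed = samples * np.hanning(len(samples))
    spectrum = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(samples), d=1.0 / rate)
    threshold = 0.1 * spectrum.max()   # ignore leakage and the noise floor
    peaks = [freqs[i] for i in range(1, len(spectrum) - 1)
             if freqs[i] >= fmin
             and spectrum[i] > threshold
             and spectrum[i] >= spectrum[i - 1]
             and spectrum[i] >= spectrum[i + 1]]
    return peaks[:n_peaks]

# Synthetic one-second "cry" frame: 400 Hz fundamental plus two partials.
rate = 16000
t = np.arange(rate) / rate
frame = (np.sin(2 * np.pi * 400 * t)
         + 0.6 * np.sin(2 * np.pi * 1200 * t)
         + 0.4 * np.sin(2 * np.pi * 2400 * t))
print([round(f) for f in estimate_frequencies(frame, rate)])  # → [400, 1200, 2400]
```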
  • The values of the plurality of parameters extracted by the voice parameter extraction unit 63 from the voice data acquired by the voice data acquisition unit 61 are recorded in the storage 4 by the voice parameter extraction unit 63 as recorded data, which is referred to when the physical condition determination unit 64 determines whether or not the baby is in a state of poor physical condition.
  • This recorded data may be recorded in a storage (not shown) outside the voice detection device 5 instead of the storage 4 inside the voice detection device 5.
  • When the voice data acquisition unit 61 acquires new voice data representing the baby's utterance, the physical condition determination unit 64 determines whether or not the baby is in a state of poor physical condition based on the acquired new voice data and the values of the plurality of parameters recorded in the storage 4.
  • the frequency component detection unit 62 detects a plurality of frequency components included in the new voice data.
  • the voice parameter extraction unit 63 extracts the above-mentioned plurality of parameters based on the plurality of frequency components detected by the frequency component detection unit 62 included in the new voice data.
  • When at least a part of the extracted parameters satisfies a predetermined condition, for example when the fundamental frequency of the vocalization is 300 Hz or more and 800 Hz or less, the physical condition determination unit 64 determines whether or not the baby is in a state of poor physical condition.
  • This is because, when the fundamental frequency of the utterance is below 300 Hz or above 800 Hz, it is highly likely that the sound represented by the new voice data is not an infant's vocalization but, for example, an adult's utterance or an environmental sound such as a household noise.
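The 300-800 Hz gate described above reduces to a one-line check; the function name is an illustrative assumption:

```python
def is_likely_infant_vocalization(fundamental_hz):
    """Predetermined condition from the text: treat a segment as an infant
    vocalization only when its fundamental frequency is 300-800 Hz
    (function name is illustrative)."""
    return fundamental_hz is not None and 300.0 <= fundamental_hz <= 800.0

print(is_likely_infant_vocalization(380.0))   # infant cry range → True
print(is_likely_infant_vocalization(150.0))   # likely an adult voice → False
print(is_likely_infant_vocalization(1200.0))  # likely environmental sound → False
```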
  • Specifically, as will be described later, the physical condition determination unit 64 calculates the similarity between the plurality of parameters recorded in the storage 4 and the plurality of parameters extracted based on the plurality of frequency components included in the new voice data. Based on the calculated similarity, the physical condition determination unit 64 determines whether or not the baby is in a state of poor physical condition. When it determines that the baby is in poor physical condition, the physical condition determination unit 64 sends a notification signal to that effect to the electronic device 10 by wireless communication via the communication module 8.
  • the electronic device 10 includes a processor 11, a memory 12, an input interface 13, a message output device 14, and a communication module 15.
  • The processor 11 functions logically as an input control unit 111 that controls the operation of the input interface 13 and a message control unit 112 that controls the message output device 14 by executing the computer program stored in the memory 12.
  • the input interface 13 is, for example, a touch panel.
  • the period information described later is input to the input interface 13 as screen input data by the user.
  • The message output device 14 is, for example, a display and/or a touch panel, and has the functions of outputting a message notifying a user, such as the baby's caregiver, that the baby is in poor physical condition, and of displaying an input screen for the period information described later. The message control unit 112 receives the notification signal indicating that the baby is in poor physical condition from the voice detection device 5 by wireless communication via the communication module 15.
  • FIG. 2 is a diagram illustrating how voice data representing the utterance of the baby 1 is acquired.
  • the voice detection device 5 is installed in the vicinity of the baby 1 so that voice data representing the utterance of the baby 1 can be acquired.
  • Since the voice data 50 acquired by the voice detection device 5 is an electric signal, it can be represented by a waveform as illustrated in FIG. 2B, with time on the horizontal axis and amplitude on the vertical axis.
  • the voice data 50 represented by the waveform includes a plurality of frequency components.
  • the above-mentioned plurality of parameters extracted based on the plurality of frequency components included in the voice data representing the vocalization of the baby 1 are different depending on whether the baby 1 is in a poor physical condition or not.
  • When the baby 1 is in poor physical condition, its vocalization is more likely to contain noise than in normal times. Therefore, by recording the values of the plurality of parameters in the storage 4 as recorded data as described above, it is possible to determine whether or not the baby 1 is in a state of poor physical condition based on the recorded values of the plurality of parameters and the value of at least one parameter extracted from newly acquired voice data representing the utterance of the baby 1.
  • The parameter recording process by which the recorded data is recorded will now be described.
  • When the voice data 50 representing the utterance of the baby 1 illustrated in FIG. 2B is acquired, the input screen illustrated in FIG. 2C is displayed on the message output device 14 of the electronic device 10.
  • Through this input screen, the caregiver of the baby 1 can be asked about the period during which the baby 1 was sick.
  • the message 141 and the period information 142 are displayed on the input screen illustrated in FIG. 2C. Specifically, as the message 141, the message "Please input the period of illness of the child" is displayed with respect to the voice data 50 exemplified in FIG. 2 (b).
  • Period information 142 indicating the beginning and end of the period during which the baby 1 was sick is input by the caregiver of the baby 1. In this example, the period information 142 indicates that the period during which the baby 1 was sick began at 18:00 on March 5, 2019 and ended at 9:00 on March 6, 2019. The period information 142 input by the caregiver of the baby 1 is transmitted to the voice detection device 5 via the communication module 15 under the control of the input control unit 111 of the processor 11 of the electronic device 10.
  • The period during which the baby 1 is sick, indicated by the period information 142, is shown as the period T1 in the voice data 50 illustrated in FIG. 2(b). During the period T1, in which the baby 1 is sick, a waveform different from that of the normal state, in which the baby 1 is not sick, is considered to be observed in the acquired voice data. Since the period T1 is a period in which the baby 1 is sick, the voice parameter extraction unit 63 of the processor 6 included in the voice detection device 5 specifies the period T1 as a poor physical condition period, based on the period information 142 acquired from the electronic device 10 via the communication module 8. In the example shown in FIG. 2B, the period T1 is the period from 18:00 on March 5, 2019 to 9:00 on March 6, 2019.
  • When the baby 1 changes from a state of not being sick to a state of being sick, it is considered that the baby 1 is already becoming sick during the transitional period.
  • Therefore, the voice parameter extraction unit 63 of the processor 6 included in the voice detection device 5 may also specify the period T2, which precedes the period T1, as a poor physical condition period in which the baby 1 is in a state of poor physical condition. In the example shown in FIG. 2B, the period T2 is the period from 15:00 on March 5, 2019 to 18:00 on March 5, 2019.
  • The voice parameter extraction unit 63 specifies, for example, the three-hour time period immediately before the period T1 as the period T2.
  • Alternatively, the period T2 may be specified by the voice parameter extraction unit 63 as a period before the period T1 in which a waveform different from normal times is observed in the voice data 50.
  • the processor 6 included in the voice detection device 5 may specify the period T3, which is a combination of the period T1 and the period T2 illustrated in FIG. 2B, as the period of poor physical condition in which the baby 1 is in a state of poor physical condition.
  • the period T3 is the period from 15:00 on March 5, 2019 to 9:00 on March 6, 2019.
  • the frequency component detection unit 62 of the processor 6 included in the voice detection device 5 detects a plurality of frequency components included in the voice data 50 of the voice including the utterance of the baby 1.
  • In the following, it is assumed that a plurality of frequency components are detected by the frequency component detection unit 62, and that the fundamental frequency, the first formant frequency, and the second formant frequency of the vocalization of the infant 1 are obtained from those frequency components every second.
  • the plurality of frequency components may be obtained every predetermined time, such as every 5 seconds or every 10 seconds instead of every second.
  • The voice parameter extraction unit 63 extracts a plurality of parameters based on the detected plurality of frequency components.
  • FIG. 3 is a diagram illustrating the plurality of frequency components included in the voice data 50 representing the utterance of the baby 1 and the recorded data 41 in which the values of the plurality of parameters based on those frequency components are recorded.
  • the recorded data 41 is recorded in the storage 4 for each baby by the voice parameter extraction unit 63.
  • FIG. 3A shows time-series data 31 including, for each second, the fundamental frequency, the first formant frequency, and the second formant frequency of the infant 1's utterance detected by the frequency component detection unit 62.
  • the time series data 31 is recorded in the storage 4.
  • The time-series data 31 shown in FIG. 3A corresponds to the voice data 50 in the poor physical condition period T1 or T3 of the baby 1 shown in FIG. 2B, and is generated by adding, every second, a record containing the fundamental frequency and the first and second formant frequencies of the vocalization of the infant 1 at each time, based on the voice data 50.
  • A poor physical condition period display flag is associated with each per-second record composed of those frequency components.
  • the poor physical condition period display flag is associated with each record by the voice parameter extraction unit 63 after the time series data 31 is recorded in the storage 4 by the frequency component detection unit 62.
  • In records at times when the infant 1 did not utter, none of the fundamental frequency, the first formant frequency, or the second formant frequency is detected, so the value "NULL" is set.
  • In records at times included in the poor physical condition period, the value "1" is set as the poor physical condition period display flag, and in records at times outside that period, the value "0" is set.
  • This may save processing by the processor 6.
  • "Null” is logically used as each frequency component value of the record at the time when the baby 1 does not utter. Be done.
  • The number of vocalizations of the infant 1 per unit time and the duration of one vocalization of the infant 1 are calculated by the voice parameter extraction unit 63 for each reference time, taking each second as a reference time and looking back over the period of the past one hour from that reference time. They are calculated as follows. For example, if the reference time is 19:31:44 on March 5, 2019, they are calculated based on whether or not a value of the fundamental frequency of the vocalization of the infant 1 exists in each of the 3600 records of the time-series data 31 covering the past hour, that is, from 18:31:45 on March 5, 2019 to 19:31:44 on March 5, 2019.
  • For example, if a value of the fundamental frequency exists in two consecutive records, the duration of that one utterance is calculated as 2 seconds.
  • Suppose that the number of utterances per hour of the infant 1 is 5, and that the durations of these 5 utterances are 120 seconds, 100 seconds, 80 seconds, 110 seconds, and 100 seconds, respectively. Then 102 seconds, the average of the five utterance durations, is calculated as the value of the utterance duration (seconds) per utterance of the infant 1.
  • Note that when the fundamental frequency disappears at a given time, it cannot necessarily be said that the utterance of the baby 1 ended after k seconds, because the baby 1 may resume vocalizing after pausing for a few seconds to take a breath. Therefore, if the time during which the baby 1 is not uttering is less than a predetermined number of seconds, for example less than 10 seconds, it may be interpreted that the utterance of the baby 1 does not end at k seconds but continues beyond k seconds.
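The counting rule above, per-second frames in which silent gaps shorter than a threshold (for example 10 seconds) are treated as breaths inside one continuing utterance, can be sketched as follows; the function name and window layout are assumptions:

```python
def utterance_stats(f0_by_second, max_gap=10):
    """Return (number of utterances, average duration in seconds) for a
    window of per-second fundamental-frequency values (None = silence).
    Silences shorter than max_gap seconds are treated as breaths within
    a single continuing utterance, as described in the text."""
    durations = []
    start = None        # second at which the current utterance began
    last_voiced = None  # most recent second with a fundamental frequency
    for i, f0 in enumerate(f0_by_second):
        if f0 is None:
            continue
        if start is None:
            start = i
        elif i - last_voiced - 1 >= max_gap:
            # the silent gap was long enough: the previous utterance ended
            durations.append(last_voiced - start + 1)
            start = i
        last_voiced = i
    if start is not None:
        durations.append(last_voiced - start + 1)
    count = len(durations)
    return count, (sum(durations) / count if count else 0.0)

# 120 s of crying, 15 s silence (splits), 50 s cry, 5 s breath (merges), 50 s cry
window = [400.0] * 120 + [None] * 15 + [400.0] * 50 + [None] * 5 + [400.0] * 50
print(utterance_stats(window))  # → (2, 112.5)
```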
  • In this way, the number of utterances per hour of the infant 1 and the utterance duration of the infant 1 are obtained as voice parameters extracted by the voice parameter extraction unit 63 based on the time-series data 31.
  • The parameter values in the poor physical condition period and the parameter values in normal times are recorded separately and updated every predetermined period.
  • The poor physical condition period is the portion of the period in which the voice data 50 was acquired that is included in the above-mentioned period T1, T2, or T3, and is identified based on the poor physical condition period display flag set in the time-series data 31 illustrated in FIG. 3A. The normal period is part or all of the period in which the voice data 50 was acquired other than the poor physical condition period.
  • For the number of utterances per hour of the baby 1, the average of the values calculated at each one-second reference time over the past year is used, separately for the poor physical condition periods and the normal periods, at the timing of recording or updating.
  • For the utterance duration (seconds) per utterance of the baby 1, the average value over the past year in each of the poor physical condition periods and the normal periods is likewise used.
  • For the fundamental frequency, the first formant frequency, and the second formant frequency of the vocalization of the infant 1, the averages of the values recorded in the records of the time-series data 31 over the past year are used, again separately for the poor physical condition periods and the normal periods. Although the average over the past year is used for the above-mentioned five types of parameters, this average is not limited to a simple average; it may be, for example, a weighted average in which relatively recent values within the past year are given greater weight than older values.
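The recency-weighted average mentioned here might look like the following exponential-decay sketch; the 90-day half-life and the daily granularity are illustrative assumptions, not figures from the text:

```python
def recency_weighted_average(values, half_life_days=90.0):
    """Weighted average of daily values (oldest first) in which recent
    values count more, via exponential decay; the 90-day half-life is an
    illustrative assumption."""
    n = len(values)
    weights = [0.5 ** ((n - 1 - i) / half_life_days) for i in range(n)]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)

# A year of daily fundamental-frequency values that drifted from 300 to 400 Hz.
year = [300.0] * 180 + [400.0] * 180
print(round(sum(year) / len(year)))           # simple average → 350
print(round(recency_weighted_average(year)))  # recent-weighted → 380
```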
  • An example of the voice parameters extracted by the voice parameter extraction unit 63 in this way is shown in FIG. 3(b).
  • In the example of FIG. 3(b), the parameter values recorded for the poor physical condition period of the infant 1 are: number of utterances per hour, 30 times/hour; utterance duration per utterance, 120 seconds; fundamental frequency of the utterance, 380 Hz; first formant frequency of the utterance, 1250 Hz; second formant frequency of the utterance, 2700 Hz.
  • The parameter values recorded for the normal period of the infant 1 are: number of utterances per hour, 20 times/hour; utterance duration per utterance, 80 seconds; fundamental frequency of the utterance, 350 Hz; first formant frequency of the utterance, 1200 Hz; second formant frequency of the utterance, 2500 Hz.
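The recorded data 41 of FIG. 3(b) could be represented as a small per-baby mapping of the two profiles; the key names are illustrative assumptions:

```python
# recorded_data mirrors FIG. 3(b): five averaged parameters per period.
recorded_data = {
    "poor_condition": {"utterances_per_hour": 30.0, "duration_per_utterance_s": 120.0,
                       "fundamental_hz": 380.0, "formant1_hz": 1250.0, "formant2_hz": 2700.0},
    "normal":         {"utterances_per_hour": 20.0, "duration_per_utterance_s": 80.0,
                       "fundamental_hz": 350.0, "formant1_hz": 1200.0, "formant2_hz": 2500.0},
}
# In this example the sick baby cries more often, longer, and at higher pitch.
shift = recorded_data["poor_condition"]["fundamental_hz"] - recorded_data["normal"]["fundamental_hz"]
print(shift)  # → 30.0
```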
  • the recorded data 41 is recorded in the storage 4 by the voice parameter extraction unit 63 included in the processor 6 of the voice detection device 5.
  • FIG. 4 is a diagram showing an example of parameter recording processing executed by running a computer program on the voice detection device 5 according to the present embodiment.
  • The processor 6 of the voice detection device 5 executes the computer program stored in the memory 7 to carry out the process shown in FIG. 4, thereby functioning as the voice data acquisition unit 61, the frequency component detection unit 62, and the voice parameter extraction unit 63.
  • this parameter recording process is repeatedly executed every second.
  • In step S410, the voice data acquisition unit 61 acquires voice data 50 representing the utterance of the baby 1 via the microphone 3.
  • In step S420, the frequency component detection unit 62 detects a plurality of frequency components included in the voice data 50 acquired in step S410, and generates the time-series data 31 illustrated in FIG. 3A.
  • the fundamental frequency of the utterance of the infant 1, the first formant frequency of the utterance of the infant 1, and the second formant frequency of the utterance of the infant 1 are detected.
  • the time series data 31 is recorded in the storage 4 by the frequency component detection unit 62.
  • In step S430, the voice parameter extraction unit 63 specifies the poor physical condition period T3 based on the period information 142 acquired from the electronic device 10 via the communication module 8. For example, the period T3 from 15:00 on March 5, 2019 to 9:00 on March 6, 2019 illustrated in FIG. 2B is specified as the poor physical condition period. By referring to the specified poor physical condition period T3, the voice parameter extraction unit 63 identifies, in the time-series data 31 generated in step S420, the poor physical condition period in which the baby 1 was in a state of poor physical condition, and sets the poor physical condition period display flag in association with the plurality of frequency components included in the time-series data 31.
  • In step S440, the voice parameter extraction unit 63 extracts a plurality of parameters based on the plurality of frequency components detected in step S420.
  • In this processing step, for example, the average number of utterances of the infant 1 per unit time, the average duration of one utterance of the infant 1, the average fundamental frequency of the utterance of the infant 1, the average first formant frequency of the utterance of the infant 1, and the average second formant frequency of the utterance of the infant 1 are extracted for each of the poor physical condition period and the normal period other than it.
  • In step S450, the voice parameter extraction unit 63 records the values of the plurality of parameters extracted in step S440 in the storage 4 as the recorded data 41. When the process of step S450 is completed, this parameter recording process ends.
  • FIG. 5 is a diagram illustrating a case where voice data 51 representing the vocalization of the baby 1 is acquired and the baby 1 is determined to be in a poor physical condition.
  • the voice data is first recorded.
  • the acquisition unit 61 acquires new voice data 51 representing the utterance of the baby illustrated in FIG. 5 (a).
  • the frequency component detection unit 62 detects a plurality of frequency components included in the acquired new voice data 51.
  • a plurality of parameters are extracted from the acquired new voice data 51 by the voice parameter extraction unit 63 in the same manner as the parameter recording process described above.
  • the physical condition determination unit 64 determines whether or not the baby 1 is in a poor physical condition.
• the similarity used in the determination by the physical condition determination unit 64 described above is calculated, for example, as follows.
  • the sum of squares X of the difference between the value of each parameter extracted from the new voice data 51 and the value of each parameter in the period of poor physical condition of the baby 1 recorded in the recorded data 41 is calculated.
  • the sum of squares Y of the difference between the value of each parameter extracted from the new voice data 51 and the value of each parameter recorded in the recorded data 41 in normal times of the baby 1 is calculated.
• When the value of the sum of squares X is smaller than the value of the sum of squares Y, it means that the plurality of parameters extracted from the new voice data 51 are more similar to the plurality of parameters in the poor physical condition period of the baby 1 than to the plurality of parameters in the normal time of the baby 1.
• In that case, the physical condition determination unit 64 determines that the baby 1 is in a poor physical condition.
• The reciprocals 1/X and 1/Y of the sums of squares X and Y may be used as the similarity values mentioned above.
• The value of each parameter may also be weighted according to its importance. For example, among the plurality of parameters described above, the weights given to the duration of one vocalization of the infant 1 and to the fundamental frequency of the vocalization of the infant 1 may be set larger than the weights given to the number of vocalizations of the infant 1 per unit time, the first formant frequency of the vocalization of the infant 1, and the second formant frequency of the vocalization of the infant 1, and the sum of squares X and the sum of squares Y are then calculated using these weights.
• In the above description, not only the parameter values during the poor physical condition period but also the parameter values in normal times are recorded; alternatively, the parameter values in normal times may be left unrecorded and only the parameter values during the poor physical condition period recorded. In that case, in the similarity calculation described above, the sum of squares Y of the differences between the value of each parameter extracted from the new voice data 51 and the value of each parameter in normal times of the baby 1 recorded in the recorded data 41 is not calculated; only the sum of squares X of the differences between the value of each parameter extracted from the new voice data 51 and the value of each parameter during the poor physical condition period of the baby 1 recorded in the recorded data 41 is calculated.
• The value of the sum of squares X, or its reciprocal 1/X, is then used as the similarity value.
• When the value of the sum of squares X is smaller than a predetermined threshold value, it means that the plurality of parameters extracted from the new voice data 51 have a high degree of similarity with the plurality of parameters during the poor physical condition period of the baby 1.
• In that case, the physical condition determination unit 64 determines that the baby 1 is in a poor physical condition.
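The sum-of-squares similarity described above, including the optional weighting and the threshold-only variant, can be sketched as follows. The function names and the list-based parameter representation are assumptions for illustration, not from the patent.

```python
def sum_of_squares(new_vals, ref_vals, weights=None):
    """Weighted sum of squared differences between the parameter values
    extracted from the new voice data and a recorded reference profile."""
    if weights is None:
        weights = [1.0] * len(new_vals)
    return sum(w * (n - r) ** 2 for w, n, r in zip(weights, new_vals, ref_vals))

def judged_unwell(new_vals, unwell_vals, normal_vals=None, threshold=None):
    """X < Y when both profiles are recorded; X < threshold when only the
    poor-physical-condition profile is available."""
    x = sum_of_squares(new_vals, unwell_vals)
    if normal_vals is not None:
        return x < sum_of_squares(new_vals, normal_vals)
    return x < threshold
```

With both profiles, a smaller X than Y means the new vocalization is closer to the recorded poor-physical-condition profile; with only the poor-physical-condition profile, a fixed threshold plays the role of Y.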
• When the physical condition determination unit 64 determines that the baby 1 is in a poor physical condition, it sends a notification signal indicating that the baby 1 is in a poor physical condition to the electronic device 10 via the communication module 8.
• When the message control unit 112 of the processor 11 of the electronic device 10 receives the transmitted notification signal via the communication module 15, it outputs a message 145 to the message output device 14 indicating that the baby 1 is in poor physical condition, as illustrated in FIG. 5B.
• In the example shown in FIG. 5B, the message "Your child has or is getting sick. We recommend seeing a doctor." is displayed as the message 145 on the screen of the message output device 14.
• The message 145 also includes a search button display 146 that says "search for a nearby hospital" so that the caregiver of the baby 1 who sees this message can search for a hospital near the current location.
• In addition, the voice parameter extraction unit 63 may set the value "1" as the poor physical condition period display flag for the record in the time series data 31 corresponding to the time when the voice data 51 was acquired by the voice data acquisition unit 61.
  • FIG. 6 is a diagram showing an example of a physical condition determination process executed by running a computer program on the voice detection device 5 according to the present embodiment.
• The processor 6 of the voice detection device 5 activates the computer program stored in the memory 7 and executes the process shown in FIG. 6, thereby functioning as the voice data acquisition unit 61, the frequency component detection unit 62, the voice parameter extraction unit 63, and the physical condition determination unit 64.
• In step S610, the voice data acquisition unit 61 acquires voice data 51 representing the utterance of the baby 1 via the microphone 3.
• In step S620, the frequency component detection unit 62 detects a plurality of frequency components included in the voice data 51 acquired in step S610. In this processing step, for example, the fundamental frequency, the first formant frequency, and the second formant frequency of the utterance of the infant 1 are detected.
• In step S630, the voice parameter extraction unit 63 extracts a plurality of parameters based on the plurality of frequency components detected in step S620.
• In step S640, the voice parameter extraction unit 63 determines whether or not the fundamental frequency of the vocalization of the baby 1, among the plurality of parameters based on the plurality of frequency components included in the voice data 51 extracted in step S630, satisfies a predetermined condition.
• Here, the predetermined condition is that the fundamental frequency of the vocalization of the baby 1 is 300 Hz or more and 800 Hz or less. If this condition is not satisfied, it is highly probable that the utterance included in the voice data 51 acquired in step S610 was not actually an utterance of the baby; a negative determination is therefore obtained in the fundamental frequency condition determination process in step S640, and the physical condition determination process ends.
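The 300 Hz to 800 Hz gate of step S640 amounts to a single range check; the function name below is illustrative only.

```python
def is_infant_f0(f0_hz, low_hz=300.0, high_hz=800.0):
    """Return True only when the detected fundamental frequency falls in the
    range expected for an infant's vocalization; outside this range the sound
    is probably not the baby's, so the determination ends without a judgment."""
    return low_hz <= f0_hz <= high_hz
```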
• In step S650, the physical condition determination unit 64 refers to the recorded data 41 recorded in the storage 4 in the parameter recording process described above.
• In step S660, the physical condition determination unit 64 calculates the similarity between the values of the plurality of parameters extracted in step S630 and the values of the plurality of parameters recorded in the recorded data 41, using the similarity calculation method described above.
  • the values of the plurality of parameters extracted in step S630 are those extracted based on the plurality of frequency components included in the voice data 51.
• The values of the plurality of parameters recorded in the recorded data 41 are those extracted based on the plurality of frequency components included in the voice data 50 for each of the poor physical condition period and the normal time other than the poor physical condition period.
• In step S670, the physical condition determination unit 64 determines whether or not the baby 1 is in a poor physical condition based on the similarity calculated in step S660. If a negative determination is obtained in the determination process in step S670, the physical condition determination process ends.
• If a positive determination is obtained in step S670, then in step S680 the physical condition determination unit 64 sends a notification signal indicating that the baby 1 is in poor physical condition to the electronic device 10 via the communication module 8. As illustrated in FIG. 5B, the message output device 14 of the electronic device 10 then outputs a message 145 to the effect that the baby 1 is in poor physical condition.
• This physical condition determination process then ends.
  • the voice detection device 5 included in the physical condition detection system 2 includes a voice data acquisition unit 61, a frequency component detection unit 62, a voice parameter extraction unit 63, and a physical condition determination unit 64.
  • the voice data acquisition unit 61 acquires voice data 50 representing the utterance of the baby 1.
  • the frequency component detection unit 62 detects a plurality of frequency components included in the voice data 50.
  • the voice parameter extraction unit 63 extracts a plurality of parameters based on a plurality of frequency components included in the voice data 50.
• When the voice data acquisition unit 61 newly acquires voice data 51 representing the utterance of the baby 1, the physical condition determination unit 64 determines whether or not the baby 1 is in a poor physical condition based on the newly acquired voice data 51 and the plurality of parameters already extracted.
• Therefore, there is a high possibility that the voice detection device 5 included in the physical condition detection system 2 in the present embodiment can detect that the baby 1 has been affected by any of various possible diseases.
  • the frequency component detection unit 62 further detects a plurality of frequency components included in the newly acquired voice data 51.
  • the voice parameter extraction unit 63 further extracts a plurality of parameters based on the plurality of frequency components included in the newly acquired voice data 51.
• When the fundamental frequency of the utterance of the baby 1, among the plurality of parameters based on the plurality of frequency components included in the newly acquired voice data 51, satisfies the condition of being 300 Hz or more and 800 Hz or less, the physical condition determination unit 64 determines whether or not the baby 1 is in a poor physical condition. Therefore, when the utterance included in the voice data 51 is not actually an utterance of the baby 1, it is unlikely that the physical condition determination process for the baby 1 is mistakenly performed.
• The physical condition determination unit 64 determines whether or not the baby 1 is in a poor physical condition based on the degree of similarity between the plurality of parameters based on the plurality of frequency components included in the voice data 50 acquired in the past and the plurality of parameters based on the plurality of frequency components included in the newly acquired voice data 51. Therefore, a stable determination result regarding whether or not the baby 1 is in a poor physical condition can be obtained.
  • the physical condition detection system 2 further includes an electronic device 10 having a message output device 14.
• When the physical condition determination unit 64 determines that the baby 1 is in poor physical condition, the message output device 14 of the electronic device 10 outputs a message indicating that the baby 1 is in poor physical condition.
  • the caregiver of the baby 1 who sees the output message can promptly take measures such as searching for a hospital near the current location.
• FIG. 7 is a diagram illustrating product supply modes of the computer program run by the voice detection device 5 of the physical condition detection system 2 in the above-described embodiment.
  • the computer program running on the voice detection device 5 can be provided to the voice detection device 5 through a recording medium 45 such as a CD-ROM or a USB memory, or a data signal flowing through a communication network 30 such as the Internet.
• The computer program is read from the recording medium 45 by the operation terminal 46, which is connected to the voice detection device 5 wirelessly or by wire, and is provided to the voice detection device 5.
  • the computer program providing server 40 is a server computer that provides the above-mentioned computer program, and stores the computer program in a storage device such as a hard disk.
  • the communication network 30 is the Internet, a wireless LAN, a telephone network, a dedicated line, or the like.
  • the computer program providing server 40 reads out the computer program stored in the storage device, puts it on a carrier wave as a data signal, and transmits it to the voice detection device 5 or the operation terminal 46 via the communication network 30.
  • the operation terminal 46 provides the computer program to the voice detection device 5.
  • the computer program can be supplied as a computer-readable computer program product in various forms such as a recording medium and a data signal.
• In Modification 1, when the voice data 50 for the parameter recording process is acquired by the voice data acquisition unit 61 of the processor 6 included in the voice detection device 5 of the physical condition detection system 2, the voice parameter extraction unit 63 of the processor 6 identifies the poor physical condition period in which the infant 1 was in a state of poor physical condition based on the period information 142 acquired from the electronic device 10 via the communication module 8.
  • the period T3 from 15:00 on March 5, 2019 to 9:00 on March 6, 2019 is specified as the period of poor physical condition.
• The period T3 is a period combining the period T1 in which the baby 1 is sick and the period T2 in which the baby 1 is becoming sick.
• Alternatively, only the period T1 in which the infant 1 is sick (the period from 18:00 on March 5, 2019 to 9:00 on March 6, 2019 illustrated in FIG. 2B), or only the period T2 in which the infant 1 is becoming sick (the period from 15:00 on March 5, 2019 to 18:00 on March 5, 2019 illustrated in FIG. 2B), may be specified as the poor physical condition period.
  • the physical condition determination unit 64 determines whether or not the baby 1 is in a state of poor physical condition. At this time, since the state in which the baby 1 is getting sick is also detected, there is a high possibility that the deterioration of the physical condition of the baby 1 can be prevented.
• In Modification 2, the physical condition determination unit 64 included in the processor 6 of the voice detection device 5 determines whether or not the infant 1 is in a poor physical condition based on the degree of similarity between the values of the plurality of parameters based on the plurality of frequency components included in the new voice data 51 and the values of the plurality of parameters based on the plurality of frequency components included in the past voice data 50, recorded in the recorded data 41 for each of the poor physical condition period and the normal time other than the poor physical condition period.
  • FIG. 8 is a diagram illustrating recorded data 41 used in the physical condition determination process executed by the physical condition detection system in the second modification.
  • the recorded data 41 illustrated in FIG. 8 is different from the recorded data 41 illustrated in FIG. 3 (b) in that the standard deviation of the value of each parameter is recorded.
  • the recorded data 41 illustrated in FIG. 8 records the values of a plurality of parameters based on the plurality of frequency components included in the past voice data 50.
• Specifically, for each of the poor physical condition period x and the normal time y other than the poor physical condition period, the number of utterances of the infant 1 per unit time C1 (that is, C1x and C1y), the duration of one utterance of the infant 1 C2 (C2x and C2y), the fundamental frequency of the utterance of the infant 1 C3 (C3x and C3y), the first formant frequency of the utterance of the infant 1 C4 (C4x and C4y), and the second formant frequency of the utterance of the infant 1 C5 (C5x and C5y) are recorded, together with their standard deviations S1 (S1x and S1y), S2 (S2x and S2y), S3 (S3x and S3y), S4 (S4x and S4y), and S5 (S5x and S5y).
• From the newly acquired voice data 51, the number of vocalizations of the infant 1 per unit time M1, the duration of one utterance M2, the fundamental frequency M3, the first formant frequency M4, and the second formant frequency M5 are extracted.
  • the above-mentioned similarity may be determined based on the Euclidean distances Dx and Dy calculated by the following equations (1) and (2).
• The Euclidean distance Dx represents the distance between the value of each parameter extracted from the voice data 51 and the value of each parameter in the poor physical condition period x of the infant 1, and the Euclidean distance Dy represents the distance between the value of each parameter extracted from the voice data 51 and the value of each parameter in the normal time y of the infant 1.
• Dx² = {(C1x - M1)/S1x}² + {(C2x - M2)/S2x}² + {(C3x - M3)/S3x}² + {(C4x - M4)/S4x}² + {(C5x - M5)/S5x}²  … (1)
• Dy² = {(C1y - M1)/S1y}² + {(C2y - M2)/S2y}² + {(C3y - M3)/S3y}² + {(C4y - M4)/S4y}² + {(C5y - M5)/S5y}²  … (2)
• The Euclidean distances Dx and Dy calculated by the above equations (1) and (2) are compared with each other; the smaller the Euclidean distance, the higher the similarity.
• When Dx is smaller than Dy, the physical condition determination unit 64 determines that the baby 1 is in a state of poor physical condition.
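Equations (1) and (2) can be sketched as a standardized Euclidean distance; the function names below are illustrative, not from the patent.

```python
import math

def standardized_distance(recorded_means, recorded_stds, new_vals):
    """Distance of equations (1)/(2): each difference (Ci - Mi) is divided
    by the recorded standard deviation Si of that parameter before squaring."""
    return math.sqrt(sum(((c - m) / s) ** 2
                         for c, m, s in zip(recorded_means, new_vals, recorded_stds)))

def closer_to_unwell(dx, dy):
    """The smaller distance means the higher similarity: Dx < Dy -> unwell."""
    return dx < dy
```

Dividing by the standard deviation puts parameters with very different scales, such as utterance counts and formant frequencies in Hz, on a comparable footing before the distances Dx and Dy are compared.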
  • the caregiver of the baby 1 may determine whether the physical condition determination result obtained through the physical condition determination process illustrated in FIG. 6 is correct or not.
  • the determination result by the caregiver is used for the identification process of the poor physical condition period by the voice parameter extraction unit 63, which is performed in step S430 of the parameter recording process illustrated in FIG.
• As a result, the values of the parameters recorded in the recorded data 41 for each of the poor physical condition period and the normal period can become more accurate.
• Furthermore, by feeding back the caregiver's judgment results as teacher data for machine learning, it becomes possible to improve the identification accuracy of the poor physical condition period within the period in which the voice data 50 including the utterance of the baby 1 is acquired.
• If the baby 1 is in a poor physical condition, the value "1" is set in the poor physical condition period display flag; if the baby 1 is in a very healthy state, the value "2" is set; if the baby 1 is not in a very healthy state but is in a "normal" state that is not poor physical condition, the value "3" is set; and if the physical condition is unknown, the value "4" is set. By doing so, it may be possible to estimate the physical condition of the baby 1 not only when the baby 1 is in a poor physical condition.
  • FIG. 9 is a diagram showing a configuration of a voice detection device 5 included in the physical condition detection system 2 in the modified example 5.
  • the processor 6 of the voice detection device 5 shown in FIG. 9 is different from the processor 6 of the voice detection device 5 shown in FIG. 1B in that it has an emotion estimation unit 65.
• The frequency component detection process and the voice parameter extraction process for the voice data including the utterance of the baby 1 are performed in the same manner as the frequency component detection process and the voice parameter extraction process in the parameter recording process shown in FIG. 4 and the physical condition determination process shown in FIG. 6.
• The emotion estimation unit 65 calculates the degree of similarity between the parameters extracted from the newly acquired voice data and the parameters extracted from voice data acquired in the past and recorded in association with emotion types, and thereby estimates the emotion of the infant 1 as one of a plurality of classified emotion types.
• The emotion estimation result for the baby 1 produced by the emotion estimation unit 65 may be used as the emotion type that is associated with the parameters extracted from the acquired voice data and recorded.
• The voice parameters used for estimating the emotions of the baby 1 by the emotion estimation unit 65 and the voice parameters used for determining the poor physical condition of the baby 1 by the physical condition determination unit 64 may be, for example, the same kinds of parameters extracted by the voice parameter extraction unit 63.
• The parameters of the same kind are, for example, the number of utterances of the infant 1 per unit time, the duration of one utterance of the infant 1, the fundamental frequency of the utterance of the infant 1, the first formant frequency of the utterance of the infant 1, and the second formant frequency of the utterance of the infant 1.
  • the emotion estimation unit 65 and the physical condition determination unit 64 may be integrally configured.
  • a functional unit in which the emotion estimation unit 65 and the physical condition determination unit 64 are integrally configured is referred to as an emotion estimation and physical condition determination unit.
  • the parameter recording process illustrated in FIG. 4 is realized by letting the learning model learn the teacher data.
  • the teacher data is the frequency component detection result detected in step S420, and may be, for example, spectrogram image data corresponding to the audio data 50 acquired in step S410.
• Spectrogram image data is input to the input layer of the learning model, and, for example, six classification items "unwell", "happy", "angry", "hungry", "sleepy", and "wants to play" are set in the output layer.
  • Each of those classification items set in the output layer is classified by the learning model according to the above-mentioned parameters extracted based on the plurality of frequency components included in the voice data 50.
  • the learning model performs machine learning with the input layer and the output layer set. As a result, a series of processes such as parameter extraction in steps S430, S440 and S450 of FIG. 4 by the voice parameter extraction unit 63 are completed.
• In the combined emotion estimation and physical condition determination process, obtained by adding the emotion estimation process to the physical condition determination process illustrated in FIG. 6, the frequency component detection result detected in step S620, for example spectrogram image data, is input to the machine-learned learning model, and the learning model outputs the result of classifying the input data.
  • the learning model outputs the degree of similarity between the input image data and each of the above-mentioned six classification items as the classification result of the input data.
• This completes the parameter extraction process in step S630 of FIG. 6 by the voice parameter extraction unit 63, the parameter similarity calculation in steps S650 and S660 by the physical condition determination unit, and the emotion estimation process.
• The fundamental frequency condition determination processing in step S640 of FIG. 6 by the voice parameter extraction unit 63 can also be included in the output of the learning model, by having the model take into consideration the fundamental frequency condition that the fundamental frequency of the utterance of the infant 1 is 300 Hz or more and 800 Hz or less.
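The classification output described above can be sketched as follows. This is a toy stand-in for the learned model: the patent does not specify an architecture, so the single linear layer followed by a softmax, and all names, are assumptions for illustration only.

```python
import math

LABELS = ("unwell", "happy", "angry", "hungry", "sleepy", "wants to play")

def classify(features, weights, bias):
    """Map flattened spectrogram features to a similarity score per
    classification item: one linear layer, then a softmax over six items."""
    logits = [sum(w * x for w, x in zip(row, features)) + b
              for row, b in zip(weights, bias)]
    peak = max(logits)                      # subtract the peak for stability
    exp = [math.exp(v - peak) for v in logits]
    total = sum(exp)
    return {label: e / total for label, e in zip(LABELS, exp)}
```

The returned scores sum to one and can be read as the degree of similarity between the input data and each of the six classification items, as described for the learning model above.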
• In the above description, the physical condition detection system 2 includes the voice detection device 5 and the electronic device 10, and the processor 6 of the voice detection device 5 is logically provided with the voice data acquisition unit 61, the frequency component detection unit 62, the voice parameter extraction unit 63, the physical condition determination unit 64, and the emotion estimation unit 65.
• However, the voice detection device 5 and the electronic device 10 may be integrally configured.
  • the processor 6 of the voice detection device 5 may have a voice data acquisition unit 61 and a frequency component detection unit 62, and the voice parameter extraction unit 63 and the physical condition determination unit 64 may be included in the processor 11 of the electronic device 10.
• Alternatively, the processor 6 of the voice detection device 5 may have the voice data acquisition unit 61 and the frequency component detection unit 62, while the processor 11 of the electronic device 10 has the voice parameter extraction unit 63, the physical condition determination unit 64, and the emotion estimation unit 65.
  • FIG. 10 is a diagram showing the configuration of the physical condition detection system 2 in the modified example 7.
• In this modification, the physical condition detection system 2 includes the voice detection device 5, the electronic device 10, and the physical condition determination device 20, which are connected to each other via the communication network 30.
  • the physical condition determination device 20 may be, for example, a large-capacity server, and may perform physical condition determination processing of not only the baby 1 but also other infants.
  • the voice detection device 5 shown in FIG. 10B has a microphone 3, a storage 4, a processor 6, a memory 7, and a communication module 8 similar to the voice detection device 5 shown in FIG. 1B.
• The processor 6 of the voice detection device 5 shown in FIG. 10B logically has the frequency component detection unit 62 and the voice parameter extraction unit 63 by activating the computer program stored in the memory 7; unlike the voice detection device 5 shown in FIG. 1B, it does not perform the physical condition determination process and therefore is not provided with the physical condition determination unit 64.
  • the voice parameter extraction unit 63 does not record the extracted plurality of parameters in the storage 4 as recorded data 41, but transmits the extracted parameters to the electronic device 10 via the communication module 8 as described later.
• Steps S410 to S440 of the parameter recording process illustrated in FIG. 4 are executed, in this modification, by the voice detection device 5 shown in FIG. 10(b), in the same manner as by the voice detection device 5 shown in FIG. 1(b) in the above-described embodiment.
• The extracted values of the plurality of parameters are transmitted by the voice parameter extraction unit 63 to the electronic device 10 via the communication module 8, and are further transmitted from the electronic device 10 to the physical condition determination device 20 via the communication network 30.
• Likewise, steps S610 to S640 of the physical condition determination process illustrated in FIG. 6 are executed, in this modification, by the voice detection device 5 shown in FIG. 10(b), in the same manner as by the voice detection device 5 shown in FIG. 1(b) in the above-described embodiment.
• If a positive determination is obtained in step S640 of FIG. 6, the values of the plurality of parameters extracted in step S630 are transmitted by the voice parameter extraction unit 63 to the electronic device 10 via the communication module 8, and are further transmitted from the electronic device 10 to the physical condition determination device 20 via the communication network 30.
  • the physical condition determination device 20 includes a processor 21, a storage 24, a memory 27, and a communication module 28.
  • the processor 21 of the physical condition determination device 20 logically has the voice parameter acquisition unit 211 and the physical condition determination unit 214 by activating the computer program stored in the memory 27.
  • the voice parameter acquisition unit 211 acquires the values of a plurality of parameters extracted by the voice detection device 5 via the communication network 30 and the communication module 28.
  • the values of the plurality of parameters extracted in the parameter recording process are recorded in the storage 24 as recorded data 41 by the voice parameter acquisition unit 211.
  • the voice parameter acquisition unit 211 executes the determination process in step S640 for the fundamental frequency among the values of the plurality of parameters extracted in the physical condition determination process.
• The remainder of the physical condition determination process is executed by the physical condition determination unit 214 included in the processor 21 of the physical condition determination device 20 shown in FIG. 10(c), in the same manner as by the physical condition determination unit 64 included in the processor 6 of the voice detection device 5 shown in FIG. 1(b).
• In this way, in this modification, the function of the physical condition determination unit 64 included in the processor 6 of the voice detection device 5 shown in FIGS. 1 and 9 can be provided as the physical condition determination unit 214 in the processor 21 of the physical condition determination device 20.
  • the function of the emotion estimation unit 65 included in the voice detection device 5 shown in FIG. 9 may be provided in the processor 21 of the physical condition determination device 20 in this modification.
  • the present invention is not limited to the configurations in each of the above-described embodiments and modifications as long as the characteristic functions of the present invention are not impaired.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Medical Informatics (AREA)
  • Primary Health Care (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)

Abstract

This physical condition detection system comprises: a speech data acquisition unit for acquiring first speech data that represents a vocalization of an infant; a frequency component detection unit for detecting a plurality of frequency components included in the first speech data; a speech parameter extraction unit for extracting a plurality of parameters based on the plurality of frequency components; and a physical condition determination unit for determining whether or not the infant is in poor physical condition on the basis of second speech data that represents a vocalization of the infant and the plurality of parameters.

Description

Physical condition detection system
The present invention relates to a physical condition detection system.
Patent Document 1 discloses an infant health monitoring system that processes measurement data to determine whether there is a problem with the health of the infant. The measurement data collected by the infant monitoring device includes the infant's movement, body temperature, position, and awakening. Environmental sensor data collected from devices such as a microphone or camera includes audio levels and video streams.
Patent Document 1: International Publication No. 2016/164373
The infant monitoring system disclosed in Patent Document 1 determines infant health problems, such as SIDS (sudden infant death syndrome), epileptic seizures, disturbed sleep patterns, fever, or a state of stress, based on the measurement data and environmental sensor data collected by the infant monitoring device. However, infant health problems are not limited to these, and various other diseases are conceivable; it is therefore difficult for the infant monitoring system disclosed in Patent Document 1 to detect that an infant has contracted such various diseases.
According to a first aspect of the present invention, a physical condition detection system comprises: a voice data acquisition unit that acquires first voice data representing a vocalization of a baby; a frequency component detection unit that detects a plurality of frequency components included in the first voice data; a voice parameter extraction unit that extracts a plurality of parameters based on the plurality of frequency components included in the first voice data; and a physical condition determination unit that, when second voice data representing a vocalization of the baby is acquired by the voice data acquisition unit, determines whether or not the baby is in poor physical condition based on the acquired second voice data and the plurality of parameters.
According to a second aspect of the present invention, in the physical condition detection system of the first aspect, it is preferable that the frequency component detection unit further detects the plurality of frequency components included in the second voice data, that the voice parameter extraction unit further extracts the plurality of parameters based on the plurality of frequency components included in the second voice data, and that the physical condition determination unit performs the determination when at least some of the plurality of parameters based on the plurality of frequency components included in the second voice data satisfy a predetermined condition.
According to a third aspect of the present invention, in the physical condition detection system of the second aspect, it is preferable that the at least some of the parameters are the fundamental frequency of the vocalization included in the second voice data, and that the predetermined condition is that the fundamental frequency is 300 Hz or more and 800 Hz or less.
According to a fourth aspect of the present invention, in the physical condition detection system of the second or third aspect, it is preferable that the physical condition determination unit performs the determination based on the similarity between the plurality of parameters based on the plurality of frequency components included in the first voice data and the plurality of parameters based on the plurality of frequency components included in the second voice data.
According to a fifth aspect of the present invention, in the physical condition detection system of any one of the first to fourth aspects, it is preferable that the plurality of parameters include at least one of the number of vocalizations per unit time, the duration of one vocalization, the fundamental frequency of the vocalization, and the formant frequency of the vocalization.
According to a sixth aspect of the present invention, in the physical condition detection system of any one of the first to fifth aspects, it is preferable that the state of poor physical condition includes at least one of a state in which the baby is sick and a state in which the baby is becoming sick.
According to a seventh aspect of the present invention, it is preferable that the physical condition detection system of any one of the first to sixth aspects further comprises a message output device that outputs a message to the effect that the baby is in poor physical condition when the physical condition determination unit determines that the baby is in poor physical condition.
According to an eighth aspect of the present invention, it is preferable that the physical condition detection system of any one of the first to seventh aspects further comprises an emotion estimation unit that, when the second voice data is acquired by the voice data acquisition unit, estimates the baby's emotion based on the acquired second voice data and the plurality of parameters.
According to the present invention, there is a high possibility of being able to detect that a baby has contracted any of various conceivable diseases.
FIG. 1 is a diagram showing the configuration of a physical condition detection system according to an embodiment of the present invention.
FIG. 2 is a diagram illustrating how voice data representing a baby's vocalization is acquired.
FIG. 3 is a diagram illustrating recorded data in which a plurality of frequency components included in voice data representing a baby's vocalization and the values of a plurality of parameters based on those frequency components are recorded.
FIG. 4 is a diagram showing an example of the parameter recording process executed by running a computer program on the voice detection device according to the embodiment.
FIG. 5 is a diagram illustrating a case in which voice data representing a baby's vocalization is acquired and the baby is determined to be in poor physical condition.
FIG. 6 is a diagram showing an example of the physical condition determination process executed by running a computer program on the voice detection device according to the embodiment.
FIG. 7 is a diagram illustrating how the computer program that runs on the voice detection device according to the embodiment is supplied as a product.
FIG. 8 is a diagram illustrating recorded data used in the physical condition determination process executed by the physical condition detection system in Modification 2 of the embodiment of the present invention.
FIG. 9 is a diagram showing the configuration of a voice detection device included in the physical condition detection system in Modification 5 of the embodiment of the present invention.
FIG. 10 is a diagram showing the configuration of the physical condition detection system in Modification 7 of the embodiment of the present invention.
FIG. 1 is a diagram showing the configuration of a physical condition detection system 2 according to an embodiment of the present invention. In FIG. 1(a), the physical condition detection system 2 includes a voice detection device 5 and an electronic device 10. The voice detection device 5 and the electronic device 10 are connected to each other via wireless communication such as Bluetooth (registered trademark). The electronic device 10 is, for example, a mobile terminal such as a smartphone, a tablet computer, or a personal computer, and may be either a dedicated terminal or a general-purpose terminal.
As shown in FIG. 1(b), the voice detection device 5 includes a microphone 3, a storage 4, a processor 6, a memory 7, and a communication module 8. The microphone 3 converts sound, including human utterances and environmental sounds, into voice data in the form of an electric signal. By running a computer program stored in the memory 7, the processor 6 logically implements a voice data acquisition unit 61, a frequency component detection unit 62, a voice parameter extraction unit 63, and a physical condition determination unit 64.
The voice data acquisition unit 61 acquires, via the microphone 3, voice data representing a baby's vocalization together with, for example, voice data representing environmental sounds in the environment where the voice detection device 5 is installed and/or voice data representing adult utterances. The acquired voice data includes a plurality of frequency components. The frequency component detection unit 62 detects the plurality of frequency components included in the voice data of sound that includes the baby's vocalization. A baby's vocalizations include crying and babbling. From the plurality of detected frequency components, a fundamental frequency, a first formant frequency, and a second formant frequency are obtained. It is generally known that the fundamental frequency determines the pitch of a sound, while the first and second formant frequencies determine its timbre.
The voice parameter extraction unit 63 extracts a plurality of parameters based on the plurality of frequency components detected by the frequency component detection unit 62 in the voice data acquired by the voice data acquisition unit 61. As will be described later with reference to FIG. 3(b), the extracted parameters include at least one of the number of the baby's vocalizations per unit time, the duration of one vocalization, the fundamental frequency of the vocalization, and the formant frequencies (the first formant frequency and/or the second formant frequency) included in the vocalization. When a voice waveform is approximated as a sum of sine waves, the frequency of the sine wave with the lowest frequency among them is called the fundamental frequency. It is also known that the first formant frequency and the second formant frequency are obtained, in order of proximity to the fundamental frequency, as the frequencies corresponding to amplitude peaks among the sine waves at integer multiples of the fundamental frequency. Accordingly, the fundamental frequency, the first formant frequency, and the second formant frequency included in the plurality of frequency components detected by the frequency component detection unit 62 in the acquired voice data can be extracted as the above-mentioned parameters by the voice parameter extraction unit 63.
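The relationship between the waveform and its fundamental frequency can be illustrated with a short sketch. The description does not specify how the frequency component detection unit 62 performs this detection; the following is merely an assumed, minimal autocorrelation-based F0 estimator applied to a synthetic tone, not the patented implementation.

```python
import math

def estimate_f0(samples, sample_rate, f_min=100.0, f_max=1000.0):
    """Estimate the fundamental frequency of a voiced frame by finding
    the autocorrelation peak within the lag range implied by f_min..f_max.
    Returns None if no peak is found (e.g. silence)."""
    n = len(samples)
    lag_min = int(sample_rate / f_max)
    lag_max = int(sample_rate / f_min)
    best_lag, best_corr = 0, 0.0
    for lag in range(lag_min, min(lag_max, n - 1) + 1):
        corr = sum(samples[i] * samples[i + lag] for i in range(n - lag))
        if corr > best_corr:
            best_corr, best_lag = corr, lag
    return sample_rate / best_lag if best_lag else None

# Synthetic 400 Hz cry-like tone sampled at 8 kHz.
sr = 8000
frame = [math.sin(2 * math.pi * 400 * t / sr) for t in range(1024)]
f0 = estimate_f0(frame, sr)  # → 400.0
```

Formant frequencies would additionally require locating spectral-envelope peaks (e.g. via linear prediction), which is omitted here for brevity.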
In the parameter recording process described later with reference to FIG. 4, the values of the plurality of parameters extracted by the voice parameter extraction unit 63 from the voice data acquired by the voice data acquisition unit 61 are recorded in the storage 4 by the voice parameter extraction unit 63 as recorded data to be referenced when the physical condition determination unit 64 determines whether or not the baby is in poor physical condition. This recorded data may be recorded in a storage (not shown) outside the voice detection device 5 instead of the storage 4 inside the voice detection device 5.
In the physical condition determination process described later with reference to FIG. 6, when new voice data representing the baby's vocalization is acquired by the voice data acquisition unit 61, the physical condition determination unit 64 determines whether or not the baby is in poor physical condition based on the acquired new voice data and the values of the plurality of parameters recorded in the storage 4 as described above. When the new voice data is acquired by the voice data acquisition unit 61, the frequency component detection unit 62 detects the plurality of frequency components included in the new voice data, and the voice parameter extraction unit 63 extracts the above-mentioned plurality of parameters based on those frequency components. The physical condition determination unit 64 performs the determination when at least some of the parameters extracted from the new voice data satisfy a predetermined condition. At least some of the parameters satisfying a predetermined condition means, for example, that the fundamental frequency of the vocalization is 300 Hz or more and 800 Hz or less. When the fundamental frequency of the vocalization is below 300 Hz or above 800 Hz, the vocalization represented by the new voice data is unlikely to be a baby's vocalization; it may instead be, for example, an adult's utterance, or not a vocalization at all but an environmental sound such as a household noise.
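The 300–800 Hz gate described above reduces to a one-line predicate. Only the frequency bounds come from the description; the function name and the treatment of missing values are assumptions made for illustration.

```python
def is_baby_vocalization(f0_hz, low_hz=300.0, high_hz=800.0):
    """Gate applied before the poor-physical-condition determination:
    a frame whose fundamental frequency falls outside 300-800 Hz is
    treated as adult speech or environmental sound rather than a baby's
    vocalization. f0_hz may be None when no vocalization was detected."""
    return f0_hz is not None and low_hz <= f0_hz <= high_hz
```

For example, 387 Hz (a value appearing in FIG. 3(a)) passes the gate, while 150 Hz (typical adult speech) and silence do not.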
When determining whether or not the baby is in poor physical condition, the physical condition determination unit 64 calculates, as described later, the similarity between the plurality of parameters recorded in the storage 4 and the plurality of parameters extracted based on the plurality of frequency components included in the new voice data. Based on the calculated similarity, the physical condition determination unit 64 determines whether or not the baby is in poor physical condition. When the baby is determined to be in poor physical condition, the physical condition determination unit 64 sends a notification signal to that effect to the electronic device 10 by wireless communication via the communication module 8.
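The description does not specify which similarity measure the physical condition determination unit 64 uses. As one assumed possibility, the recorded parameter values and the newly extracted ones can be compared as vectors using cosine similarity; the parameter ordering, the example values, and the decision threshold below are all illustrative.

```python
import math

def cosine_similarity(p, q):
    """Cosine similarity between two parameter vectors, e.g.
    [vocalizations/hour, duration per vocalization (s), F0, F1, F2]."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0

recorded_unwell = [5.0, 102.0, 387.0, 1261.0, 2732.0]  # profile from storage 4
new_params      = [5.0, 100.0, 390.0, 1270.0, 2740.0]  # from the new voice data
similarity = cosine_similarity(recorded_unwell, new_params)
likely_unwell = similarity >= 0.999  # threshold is an assumption
```

A high similarity to parameters recorded during a known poor-physical-condition period would then trigger the notification signal to the electronic device 10.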
As shown in FIG. 1(c), the electronic device 10 includes a processor 11, a memory 12, an input interface 13, a message output device 14, and a communication module 15. By running a computer program stored in the memory 12, the processor 11 logically implements an input control unit 111 that controls the operation of the input interface 13 and a message control unit 112 that controls the message output device 14. The input interface 13 is, for example, a touch panel. The period information described later is input to the input interface 13 as screen input data by the user. The message output device 14 is, for example, a display and/or a touch panel, and has the functions of outputting a message notifying a user, such as the baby's caregiver, that the baby is in poor physical condition, and of displaying an input screen for the period information described later. The message control unit 112 receives, by wireless communication via the communication module 15, a notification signal from the voice detection device 5 indicating that the baby is in poor physical condition.
FIG. 2 is a diagram illustrating how voice data representing the vocalization of a baby 1 is acquired. As shown in FIG. 2(a), the voice detection device 5 is installed near the baby 1 so that voice data representing the baby's vocalization can be acquired. Since the voice data 50 acquired by the voice detection device 5 is an electric signal, it is represented by a waveform as illustrated in FIG. 2(b), with time on the horizontal axis and amplitude on the vertical axis. As described above, the voice data 50 represented by the waveform includes a plurality of frequency components. The above-mentioned plurality of parameters extracted based on the frequency components of voice data representing the vocalization of the baby 1 differ depending on whether or not the baby 1 is in poor physical condition. For example, when the baby 1 is in poor physical condition due to an inflamed throat or lungs, the baby's vocalizations are more likely to contain noise than in normal times. Therefore, by recording the values of the plurality of parameters in the storage 4 as recorded data as described above, it is possible to determine whether or not the baby 1 is in poor physical condition based on the recorded parameter values and the value of at least one parameter extracted from newly acquired voice data representing the baby's vocalization.
First, the parameter recording process by which the recorded data is recorded will be described. When the voice data 50 representing the vocalization of the baby 1 illustrated in FIG. 2(b) is acquired, the input screen illustrated in FIG. 2(c) is displayed on the message output device 14 of the electronic device 10. Through this input screen, the caregiver of the baby 1 can be asked for the period during which the baby 1 was sick. The input screen illustrated in FIG. 2(c) displays a message 141 and period information 142. Specifically, as the message 141, the message "Please enter the period during which your child was sick" is displayed with respect to the voice data 50 illustrated in FIG. 2(b). The figure shows an example in which, in response to this message, period information 142 indicating the beginning and end of the period during which the baby 1 was sick has been entered by the caregiver of the baby 1. In this example, the entered period information 142 indicates that the period during which the baby 1 was sick began at 18:00 on March 5, 2019 and ended at 9:00 on March 6, 2019. The period information 142 is entered by the caregiver of the baby 1 under the control of the input control unit 111 of the processor 11 of the electronic device 10, and is transmitted to the voice detection device via the communication module 15.
The period during which the baby 1 was sick, indicated by the period information 142, is shown as period T1 in the voice data 50 illustrated in FIG. 2(b). Of the period over which the voice data 50 was acquired, a waveform different from that of the normal state in which the baby 1 is not sick is considered to be observed during the period T1, in which the baby 1 was sick. Since this period T1 is a period during which the baby 1 was sick, the voice parameter extraction unit 63 of the processor 6 of the voice detection device 5 identifies the period T1, based on the period information 142 acquired from the electronic device 10 via the communication module 8, as a poor-physical-condition period during which the baby 1 was in poor physical condition. In the example shown in FIG. 2(b), the period T1 is from 18:00 on March 5, 2019 to 9:00 on March 6, 2019.
When the baby 1 transitions from not being sick to being sick, the baby 1 is considered to be in the process of becoming sick during the transitional period. Of the period over which the voice data 50 was acquired, a waveform different from that of the normal state in which the baby 1 is not sick is also considered to be observed during the period T2, in which the baby 1 was becoming sick. Since this period T2 is a period during which the baby 1 was becoming sick, the voice parameter extraction unit 63 of the processor 6 of the voice detection device 5 may also identify the period T2, which precedes the period T1, as a poor-physical-condition period. In the example shown in FIG. 2(b), the period T2 is from 15:00 on March 5, 2019 to 18:00 on March 5, 2019. The voice parameter extraction unit 63 identifies, for example, the three hours immediately preceding the period T1 as the period T2. Alternatively, the period T2 may be identified by the voice parameter extraction unit 63 as a period in the voice data 50, preceding the period T1, in which a waveform different from that of normal times is observed.
Furthermore, as described above, the period T1 is a period during which the baby 1 was sick and the period T2 is a period during which the baby 1 was becoming sick, so the voice parameter extraction unit 63 of the processor 6 of the voice detection device 5 may identify the period T3 obtained by merging the periods T1 and T2 illustrated in FIG. 2(b) as a poor-physical-condition period during which the baby 1 was in poor physical condition. In the example shown in FIG. 2(b), the period T3 is from 15:00 on March 5, 2019 to 9:00 on March 6, 2019.
As described above, the frequency component detection unit 62 of the processor 6 of the voice detection device 5 detects the plurality of frequency components included in the voice data 50 of sound that includes the vocalization of the baby 1. In the present embodiment, a plurality of frequency components are detected by the frequency component detection unit 62, and from among them, the fundamental frequency of the vocalization of the baby 1 and the first and second formant frequencies of that vocalization are obtained every second. These frequency components may instead be obtained at any other predetermined interval, such as every 5 seconds or every 10 seconds. As described above, the voice parameter extraction unit 63 extracts a plurality of parameters based on the detected frequency components. FIG. 3 is a diagram illustrating the plurality of frequency components included in the voice data 50 representing the vocalization of the baby 1 and recorded data 41 in which the values of a plurality of parameters based on those frequency components are recorded. The recorded data 41 is recorded in the storage 4 by the voice parameter extraction unit 63 for each baby.
FIG. 3(a) shows time-series data 31 including the fundamental frequency of the vocalization of the baby 1 and the first and second formant frequencies of that vocalization, detected every second by the frequency component detection unit 62. The time-series data 31 is recorded in the storage 4. The time-series data 31 shown in FIG. 3(a) corresponds to the voice data 50 in the poor-physical-condition period T1 or T3 of the baby 1 shown in FIG. 2(b), and is generated by adding, every second, a record containing the fundamental frequency and the first and second formant frequencies of the vocalization of the baby 1 at each time, based on that voice data 50. Furthermore, in the time-series data 31, a poor-physical-condition period flag is associated with each per-second record composed of those frequency components. After the time-series data 31 is recorded in the storage 4 by the frequency component detection unit 62, the poor-physical-condition period flag is associated with each record by the voice parameter extraction unit 63.
In the example shown in FIG. 3(a), at 18:31:45 on March 5, 2019, the baby 1 vocalized with a fundamental frequency of 387 Hz, a first formant frequency of 1261 Hz, and a second formant frequency of 2732 Hz, and the value "1", indicating that the baby 1 was in poor physical condition at this time, is set as the poor-physical-condition period flag for the record at this time. At 18:31:46 on March 5, 2019, the baby 1 vocalized with a fundamental frequency of 388 Hz, a first formant frequency of 1264 Hz, and a second formant frequency of 2735 Hz, and the flag value "1" is likewise set for the record at this time. At 19:31:44 on March 5, 2019, the baby 1 did not vocalize, so none of the fundamental frequency, the first formant frequency, and the second formant frequency were detected; the value "NULL" is therefore set in the record at this time, and the flag value "1" is set as well. In periods other than the above-mentioned period T1 or T3, that is, in periods not identified as poor-physical-condition periods of the baby 1, the flag value "0" is set in each record within the period. Alternatively, in a record at a time when the baby 1 did not vocalize, the processing load on the processor 6 may be reduced by setting nothing as each frequency component value instead of "NULL", that is, by treating each frequency component value as a missing value. In that case, in the process (described later) in which the voice parameter extraction unit 63 extracts the voice parameters based on the time-series data 31, "NULL" is logically used as each frequency component value of a record at a time when the baby 1 did not vocalize.
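One per-second record of the time-series data 31, together with its poor-physical-condition period flag, could be modeled as follows. The field names are assumptions for illustration; the example values are the ones given above for 18:31:45 and 19:31:44 on March 5, 2019.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class VoiceRecord:
    """One per-second record of the time-series data 31 (field names assumed).
    Frequency fields are None when no vocalization was detected (the
    missing-value variant described in the text)."""
    timestamp: str
    f0_hz: Optional[float]        # fundamental frequency
    formant1_hz: Optional[float]  # first formant frequency
    formant2_hz: Optional[float]  # second formant frequency
    unwell_flag: int              # 1 inside an identified poor-physical-condition period, else 0

voiced = VoiceRecord("2019-03-05 18:31:45", 387.0, 1261.0, 2732.0, 1)
silent = VoiceRecord("2019-03-05 19:31:44", None, None, None, 1)
```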
Based on the time-series data 31, the voice parameter extraction unit 63 calculates, every second, taking the time at that point as a reference time, the number of utterances of the baby 1 per unit time and the duration of a single utterance of the baby 1 over the one-hour period preceding that reference time. These are calculated as follows. For example, when the reference time is 19:31:44 on March 5, 2019, the calculation is based on whether a value of the fundamental frequency of the baby 1's utterance exists in each of the 3600 records of the time-series data 31 for the preceding hour, that is, from 18:31:45 on March 5, 2019 to 19:31:44 on March 5, 2019. At 18:31:45 on March 5, 2019 and at the following second, 18:31:46, fundamental frequency values exist. While fundamental frequency values exist in consecutive records in this way, a single utterance can be interpreted as continuing. At 19:31:44 on March 5, 2019 no fundamental frequency value exists. If no fundamental frequency value existed at any time from 18:31:47 to 19:31:44 on March 5, 2019, then for the reference time of 19:31:44 on March 5, 2019 the number of utterances of the baby 1 per unit time, that is, the value of utterances/hour, is calculated as 1 (utterance), and the duration of a single utterance of the baby 1, that is, the value of utterance duration (seconds)/utterance, is calculated as 2 (seconds). If, for example, the value of utterances/hour for the baby 1 were 5 (utterances) and the durations of those five utterances were, in order, 120 seconds, 100 seconds, 80 seconds, 110 seconds, and 100 seconds, then 102 (seconds), the per-utterance average of those five durations, would be calculated as the value of utterance duration (seconds)/utterance for the baby 1.
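The per-second counting just described can be sketched as follows. This is a minimal illustration, not the actual implementation of the voice parameter extraction unit 63; the record format (one value per second, `None` for a missing fundamental frequency) and all names are assumptions for the example.

```python
from typing import Optional, Sequence


def utterance_stats(f0_per_second: Sequence[Optional[float]]) -> tuple[int, float]:
    """Count utterances and their mean duration over a one-record-per-second
    window, where each record holds the fundamental frequency (Hz) of the
    baby's voice or None when no voice was detected.

    A run of consecutive records containing a fundamental frequency value
    is interpreted as one continuing utterance.
    """
    count = 0
    durations: list[int] = []
    run = 0
    for f0 in f0_per_second:
        if f0 is not None:
            run += 1          # utterance continues
        elif run > 0:
            count += 1        # silence closes the current utterance
            durations.append(run)
            run = 0
    if run > 0:               # window ends mid-utterance
        count += 1
        durations.append(run)
    mean = sum(durations) / count if count else 0.0
    return count, mean


# The example in the text: voice at 18:31:45 and 18:31:46, then silence
# for the rest of the one-hour (3600-record) window.
window = [387.0, 388.0] + [None] * 3598
assert utterance_stats(window) == (1, 2.0)  # 1 utterance/hour, 2 s/utterance
```

With five utterances of 120, 100, 80, 110, and 100 seconds, the same function yields a count of 5 and a mean duration of 102 seconds, matching the worked example above.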
Note, however, that even if the baby 1 has been uttering continuously for k seconds and is then silent for several seconds, it cannot necessarily be concluded on that basis that the baby 1's utterance ended after k seconds, because the baby 1 may resume uttering after pausing for those few seconds to take a breath. Therefore, when the time during which the baby 1 is not uttering is shorter than a predetermined number of seconds, for example shorter than 10 seconds, it may be interpreted that the baby 1's utterance did not end after k seconds but continues beyond k seconds.
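This breath-pause interpretation can be layered on top of the run detection by merging utterance intervals separated by a short silence. The interval representation, the function name, and the 10-second default below are illustrative assumptions consistent with the text, not the embodiment's actual code.

```python
def merge_breath_pauses(
    intervals: list[tuple[int, int]], max_pause: int = 10
) -> list[tuple[int, int]]:
    """Merge chronologically sorted utterance intervals [(start_s, end_s), ...]
    that are separated by a silence shorter than max_pause seconds, on the
    interpretation that the baby merely paused to take a breath and the
    utterance continued.
    """
    merged: list[tuple[int, int]] = []
    for start, end in intervals:
        if merged and start - merged[-1][1] < max_pause:
            # Silence too short to end the utterance: extend the previous one.
            merged[-1] = (merged[-1][0], end)
        else:
            merged.append((start, end))
    return merged


# k = 30 s of voice, a 4 s breath, then 20 s more: treated as one utterance.
assert merge_breath_pauses([(0, 30), (34, 54)]) == [(0, 54)]
# A 15 s silence exceeds the threshold, so the utterances stay separate.
assert merge_breath_pauses([(0, 30), (45, 60)]) == [(0, 30), (45, 60)]
```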
In the recorded data 41 illustrated in FIG. 3(b), the voice parameters extracted by the voice parameter extraction unit 63 from the time-series data 31, namely the number of utterances of the baby 1 per hour, the duration (seconds) of a single utterance of the baby 1, the fundamental frequency (Hz) of the baby 1's utterances, the first formant frequency (Hz) of the baby 1's utterances, and the second formant frequency (Hz) of the baby 1's utterances, are each recorded separately as parameter values during poor physical condition periods and parameter values during normal times, and are updated each time a predetermined period elapses. The poor physical condition periods are those portions of the period over which the voice data 50 was acquired that fall within the above-mentioned periods T1, T2, or T3, and are identified based on the poor physical condition period display flag set in the time-series data 31 illustrated in FIG. 3(a). The normal times are some or all of the period over which the voice data 50 was acquired other than the poor physical condition periods.
Among the voice parameters extracted by the voice parameter extraction unit 63, the value recorded or updated for the baby 1's utterances/hour is the average, taken separately over the poor physical condition periods and the normal times, of the utterances/hour values calculated at one-second reference-time intervals over, for example, the past year. Similarly, for the baby 1's utterance duration (seconds)/utterance, the averages over the poor physical condition periods and the normal times within the past year are used. Further, for the fundamental frequency, the first formant frequency, and the second formant frequency of the baby 1's utterances, the averages over the poor physical condition periods and the normal times are used, each based on the values recorded in the records of the time-series data 31 for the past year. Although averages over the past year are used for the five types of parameters described above, these averages are not limited to values obtained by a simple average; they may be, for example, values obtained by a weighted average in which relatively recent values within the past year are given greater weight than older values. An example of the voice parameters extracted by the voice parameter extraction unit 63 in this way is shown in FIG. 3(b).
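One way to realize a weighted average that favors recent values is an exponentially decaying weight, sketched below. The decay factor and function name are illustrative assumptions; the text only requires that newer values receive larger weights.

```python
def recency_weighted_average(values: list[float], decay: float = 0.99) -> float:
    """Average a chronologically ordered series (oldest first), giving each
    value `decay` times the weight of the value that follows it, so that
    recent values count more than old ones.  With decay=1.0 this reduces
    to the simple average.
    """
    n = len(values)
    weights = [decay ** (n - 1 - i) for i in range(n)]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


# With decay=1.0 the result equals the simple average of 300 and 400 Hz.
assert recency_weighted_average([300.0, 400.0], decay=1.0) == 350.0
# With decay=0.5 the recent 400 Hz value dominates, pulling the average up.
assert recency_weighted_average([300.0, 400.0], decay=0.5) > 350.0
```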
In the example shown in FIG. 3(b), the parameter values recorded for the baby 1's poor physical condition periods are 30 utterances/hour, an utterance duration of 120 seconds/utterance, a fundamental frequency of 380 Hz, a first formant frequency of 1250 Hz, and a second formant frequency of 2700 Hz. The parameter values recorded for the baby 1's normal times are 20 utterances/hour, an utterance duration of 80 seconds/utterance, a fundamental frequency of 350 Hz, a first formant frequency of 1200 Hz, and a second formant frequency of 2500 Hz. The recorded data 41 is recorded in the storage 4 by the voice parameter extraction unit 63 of the processor 6 of the voice detection device 5.
FIG. 4 shows an example of the parameter recording process executed by running a computer program on the voice detection device 5 according to the present embodiment. The processor 6 of the voice detection device 5 starts the computer program stored in the memory 7 and, by executing the process shown in FIG. 4, functions as the voice data acquisition unit 61, the frequency component detection unit 62, and the voice parameter extraction unit 63. In the present embodiment, this parameter recording process is repeatedly executed every second.
When this parameter recording process starts, in step S410 the voice data acquisition unit 61 acquires, via the microphone 3, voice data 50 representing utterances of the baby 1. In step S420, the frequency component detection unit 62 detects a plurality of frequency components contained in the voice data 50 acquired in step S410 and generates the time-series data 31 illustrated in FIG. 3(a). In this processing step, for example, the fundamental frequency, the first formant frequency, and the second formant frequency of the baby 1's utterances are detected. The time-series data 31 is recorded in the storage 4 by the frequency component detection unit 62.
As described above, the voice parameter extraction unit 63 can identify the poor physical condition period T3 based on the period information 142 acquired from the electronic device 10 via the communication module 8. For example, the period T3 from 15:00 on March 5, 2019 to 9:00 on March 6, 2019 illustrated in FIG. 2(b) is identified as a poor physical condition period. In step S430, by referring to the identified poor physical condition period T3, the voice parameter extraction unit 63 identifies, for the time-series data 31 generated in step S420, the poor physical condition period during which the baby 1 was in a state of poor physical condition. Based on the identified poor physical condition period T3, the voice parameter extraction unit 63 sets the poor physical condition period display flag in association with the plurality of frequency components included in the time-series data 31 generated in step S420.
In step S440, the voice parameter extraction unit 63 extracts a plurality of parameters based on the plurality of frequency components detected in step S420. In this processing step, for example, the average number of utterances of the baby 1 per unit time, the average duration of a single utterance of the baby 1, the average fundamental frequency of the baby 1's utterances, the average first formant frequency of the baby 1's utterances, and the average second formant frequency of the baby 1's utterances are each extracted for the poor physical condition periods and for the normal times other than the poor physical condition periods. In step S450, the voice parameter extraction unit 63 records the values of the plurality of parameters extracted in step S440 in the storage 4 as the recorded data 41. When the process of step S450 is completed, this parameter recording process ends.
FIG. 5 illustrates a case in which voice data 51 representing an utterance of the baby 1 is acquired and the baby 1 is determined to be in a state of poor physical condition. After the values of the parameters of the baby 1's utterances have been recorded as the recorded data 41 by the parameter recording process described with reference to FIG. 4, when the physical condition determination process described later with reference to FIG. 6 is executed, the voice data acquisition unit 61 first acquires new voice data 51 representing the baby's utterance illustrated in FIG. 5(a). The frequency component detection unit 62 then detects a plurality of frequency components contained in the acquired new voice data 51. Subsequently, as in the parameter recording process described above, the voice parameter extraction unit 63 extracts a plurality of parameters from the acquired new voice data 51. The physical condition determination unit 64 then determines whether the baby 1 is in a state of poor physical condition, based on the similarity between the values of the plurality of parameters thus extracted from the plurality of frequency components contained in the voice data 51 and the values, recorded in the recorded data 41, of the plurality of parameters based on the plurality of frequency components contained in the voice data 50 for the poor physical condition periods and for the normal times other than those periods.
The above-mentioned similarity used in the determination by the physical condition determination unit 64 is calculated, for example, as follows. The sum of squares X of the differences between the value of each parameter extracted from the new voice data 51 and the value of each parameter for the baby 1's poor physical condition periods recorded in the recorded data 41 is calculated. The smaller the value of the sum of squares X, the higher the similarity between the plurality of parameters extracted from the new voice data 51 and the plurality of parameters for the baby 1's poor physical condition periods. Likewise, the sum of squares Y of the differences between the value of each parameter extracted from the new voice data 51 and the value of each parameter for the baby 1's normal times recorded in the recorded data 41 is calculated. The smaller the value of the sum of squares Y, the higher the similarity between the plurality of parameters extracted from the new voice data 51 and the plurality of parameters for the baby 1's normal times. When the value of the sum of squares X is smaller than the value of the sum of squares Y, the plurality of parameters extracted from the new voice data 51 are more similar to the plurality of parameters for the baby 1's poor physical condition periods than to the plurality of parameters for the baby 1's normal times. In this case, the physical condition determination unit 64 determines that the baby 1 is in a state of poor physical condition. The reciprocals 1/X and 1/Y of the sums of squares X and Y may be used as the above-mentioned similarity values.
When calculating the above-mentioned sums of squares X and Y, the value of each parameter may be weighted according to its importance. For example, among the plurality of parameters described above, the sums of squares X and Y may be calculated with the weights given to the duration of a single utterance of the baby 1 and to the fundamental frequency of the baby 1's utterances set larger than the weights given to the number of utterances of the baby 1 per unit time, the first formant frequency of the baby 1's utterances, and the second formant frequency of the baby 1's utterances.
Although both the parameter values for the poor physical condition periods and the parameter values for normal times are recorded in the recorded data 41 illustrated in FIG. 3(b), only the parameter values for the poor physical condition periods may be recorded, without recording the parameter values for normal times. In that case, in the similarity calculation described above, the sum of squares Y of the differences between the value of each parameter extracted from the new voice data 51 and the value of each parameter for the baby 1's normal times is not calculated; only the sum of squares X of the differences between the value of each parameter extracted from the new voice data 51 and the value of each parameter for the baby 1's poor physical condition periods recorded in the recorded data 41 is calculated. The value of the sum of squares X or its reciprocal is used as the similarity value. When the value of the sum of squares X is smaller than a predetermined threshold, the similarity between the plurality of parameters extracted from the new voice data 51 and the plurality of parameters for the baby 1's poor physical condition periods is high, and the physical condition determination unit 64 therefore determines that the baby 1 is in a state of poor physical condition.
When the physical condition determination unit 64 determines that the baby 1 is in a state of poor physical condition, it sends a notification signal indicating that the baby 1 is in poor physical condition to the electronic device 10 via the communication module 8. When the message control unit 112 of the processor 11 of the electronic device 10 receives the transmitted notification signal via the communication module 15, it causes the message output device 14 to output a message 145 indicating that the baby 1 is in poor physical condition, as illustrated in FIG. 5(b). In the example shown in FIG. 5(b), the message "Your child has become ill or is becoming ill. We recommend that you see a doctor." is displayed on the screen of the message output device 14 as the message 145. The message 145 includes a search button display 146 reading "Search for nearby hospitals" so that the caregiver of the baby 1 who views this message can search for hospitals near the current location. When the caregiver of the baby 1 touches the search button display 146, hospitals near the current location are searched for via a communication network (not shown), and the screen displaying the message 145 switches to a screen displaying the search results.
When the physical condition determination unit 64 determines that the baby 1 is in poor physical condition, the voice parameter extraction unit 63 may set the value "1" as the poor physical condition period display flag for the record of the time-series data 31 corresponding to the time at which the voice data 51 was acquired by the voice data acquisition unit 61.
FIG. 6 shows an example of the physical condition determination process executed by running a computer program on the voice detection device 5 according to the present embodiment. The processor 6 of the voice detection device 5 starts the computer program stored in the memory 7 and, by executing the process shown in FIG. 6, functions as the voice data acquisition unit 61, the frequency component detection unit 62, the voice parameter extraction unit 63, and the physical condition determination unit 64.
When this physical condition determination process starts, in step S610 the voice data acquisition unit 61 acquires, via the microphone 3, voice data 51 representing an utterance of the baby 1. In step S620, the frequency component detection unit 62 detects a plurality of frequency components contained in the voice data 51 acquired in step S610. In this processing step, for example, the fundamental frequency, the first formant frequency, and the second formant frequency of the baby 1's utterance are detected.
In step S630, the voice parameter extraction unit 63 extracts a plurality of parameters based on the plurality of frequency components detected in step S620. In this processing step, for example, the number of utterances of the baby 1 per unit time, the duration of a single utterance of the baby 1, the fundamental frequency of the baby 1's utterance, the first formant frequency of the baby 1's utterance, and the second formant frequency of the baby 1's utterance are extracted. In step S640, the voice parameter extraction unit 63 determines whether, among the plurality of parameters extracted in step S630 based on the plurality of frequency components contained in the voice data 51, the fundamental frequency of the baby 1's utterance satisfies a predetermined condition. In the present embodiment, the predetermined condition is that the fundamental frequency of the baby 1's utterance is between 300 Hz and 800 Hz inclusive. If this predetermined condition is not satisfied, the utterance contained in the voice data 51 acquired in step S610 is considered highly likely not to have actually been an utterance of a baby, so a negative determination is obtained in the fundamental frequency condition determination process of step S640 and the physical condition determination process ends.
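The plausibility gate of step S640 amounts to a simple range check on the fundamental frequency. The function name and band defaults below are illustrative; only the 300–800 Hz condition comes from the text.

```python
def plausibly_a_baby(fundamental_hz: float,
                     low: float = 300.0, high: float = 800.0) -> bool:
    """Step S640 in sketch form: proceed to the physical condition judgment
    only when the fundamental frequency lies in the band expected of a
    baby's voice; otherwise the sound was probably not the baby at all."""
    return low <= fundamental_hz <= high


assert plausibly_a_baby(387.0)      # the baby's cry in FIG. 3(a) passes
assert not plausibly_a_baby(120.0)  # e.g. a typical adult voice is rejected
```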
When an affirmative determination is obtained in the fundamental frequency condition determination process of step S640, in step S650 the physical condition determination unit 64 refers to the recorded data 41 recorded in the storage 4 by the parameter recording process described above. In step S660, the physical condition determination unit 64 calculates, by the similarity calculation method described above, the similarity between the values of the plurality of parameters extracted in step S630 and the values of the plurality of parameters recorded in the recorded data 41. The values of the plurality of parameters extracted in step S630 were extracted based on the plurality of frequency components contained in the voice data 51. The values of the plurality of parameters recorded in the recorded data 41 were extracted based on the plurality of frequency components contained in the voice data 50, for the poor physical condition periods and for the normal times other than those periods.
In step S670, the physical condition determination unit 64 determines whether the baby 1 is in a state of poor physical condition based on the similarity calculated in step S660. If a negative determination is obtained in the determination process of step S670, the physical condition determination process ends. If an affirmative determination is obtained in the determination process of step S670, in step S680 the physical condition determination unit 64 sends a notification signal indicating that the baby 1 is in poor physical condition to the electronic device 10 via the communication module 8. The message output device 14 of the electronic device 10 outputs the message 145 indicating that the baby 1 is in poor physical condition, as illustrated in FIG. 5(b). When the process of step S680 is completed, this physical condition determination process ends.
According to the physical condition detection system 2 of the present embodiment, the following effects are obtained.
(1) The voice detection device 5 included in the physical condition detection system 2 comprises the voice data acquisition unit 61, the frequency component detection unit 62, the voice parameter extraction unit 63, and the physical condition determination unit 64. The voice data acquisition unit 61 acquires voice data 50 representing utterances of the baby 1. The frequency component detection unit 62 detects a plurality of frequency components contained in the voice data 50. The voice parameter extraction unit 63 extracts a plurality of parameters based on the plurality of frequency components contained in the voice data 50. When voice data 51 representing an utterance of the baby 1 is newly acquired by the voice data acquisition unit 61, the physical condition determination unit 64 determines whether the baby 1 is in a state of poor physical condition based on the newly acquired voice data 51 and the plurality of parameters already extracted. If the voice data 50 was acquired while the baby 1 was in a state of poor physical condition in the past, there is a high possibility that the voice detection device 5 included in the physical condition detection system 2 of the present embodiment can detect that the baby 1 has contracted any of various conceivable illnesses.
(2) In the voice detection device 5 included in the physical condition detection system 2, the frequency component detection unit 62 further detects a plurality of frequency components contained in the newly acquired voice data 51. The voice parameter extraction unit 63 further extracts a plurality of parameters based on the plurality of frequency components contained in the newly acquired voice data 51. The physical condition determination unit 64 performs the determination of whether the baby 1 is in a state of poor physical condition only when, among the plurality of parameters based on the plurality of frequency components contained in the newly acquired voice data 51, the fundamental frequency of the baby 1's utterance satisfies the condition of being between 300 Hz and 800 Hz inclusive. Accordingly, when the utterance contained in the voice data 51 was not actually an utterance of the baby 1, the physical condition determination process for the baby 1 is unlikely to be performed erroneously.
(3) In the voice detection device 5 included in the physical condition detection system 2, the physical condition determination unit 64 determines whether the baby 1 is in a state of poor physical condition based on the similarity between the plurality of parameters based on the plurality of frequency components contained in the voice data 50 acquired in the past and the plurality of parameters based on the plurality of frequency components contained in the newly acquired voice data 51. Accordingly, a stable determination result as to whether the baby 1 is in a state of poor physical condition is obtained.
(4) The physical condition detection system 2 further includes the electronic device 10 having the message output device 14. When the physical condition determination unit 64 of the processor 6 of the voice detection device 5 determines that the baby 1 is in poor physical condition, the message output device 14 of the electronic device 10 outputs a message indicating that the baby 1 is in poor physical condition. The caregiver of the baby 1, viewing the output message, can promptly take action such as searching for a hospital near the current location.
FIG. 7 illustrates how the computer program executed by the voice detection device 5 of the physical condition detection system 2 in the embodiment described above can be supplied as a product. The computer program executed by the voice detection device 5 can be provided to it through a recording medium 45 such as a CD-ROM or USB memory, or through a data signal carried over a communication network 30 such as the Internet. For example, the computer program is read from the recording medium 45 by an operation terminal 46 connected to the voice detection device 5 by wire or wirelessly, and is then provided to the voice detection device 5.
The computer program providing server 40 is a server computer that provides the above computer program and stores it in a storage device such as a hard disk. The communication network 30 is the Internet, a wireless LAN, a telephone network, a dedicated line, or the like. The computer program providing server 40 reads the computer program from the storage device, places it on a carrier wave as a data signal, and transmits it to the voice detection device 5 or the operation terminal 46 via the communication network 30. When the computer program is transmitted to the operation terminal 46, the operation terminal 46 in turn provides it to the voice detection device 5. In this way, the computer program can be supplied as a computer-readable computer program product in various forms, such as a recording medium or a data signal.
The following modifications are also within the scope of the present invention, and one or more of them can be combined with the embodiment described above.
(Modification 1) In the embodiment described above, when the voice data acquisition unit 61 of the processor 6 of the voice detection device 5 in the physical condition detection system 2 acquires voice data 50 for the parameter recording process, the voice parameter extraction unit 63 of the processor 6 identifies the poor physical condition period, during which the baby 1 is unwell, based on the period information 142 acquired from the electronic device 10 via the communication module 8. In the voice data 50 illustrated in FIG. 2(b), the period T3 from 15:00 on March 5, 2019 to 9:00 on March 6, 2019 is identified as the poor physical condition period.
As described above, the period T3 is the union of the period T1, during which the baby 1 is already ill, and the period T2, during which the baby 1 is becoming ill. Either the period T1 (from 18:00 on March 5, 2019 to 9:00 on March 6, 2019 in the example of FIG. 2(b)) or the period T2 (from 15:00 to 18:00 on March 5, 2019 in the same example) may instead be identified as the poor physical condition period. When the period T2, or the period T3 that includes it, is identified as the poor physical condition period, the determination by the physical condition determination unit 64 of whether the baby 1 is in poor physical condition also detects the state in which the baby 1 is becoming ill, so a deterioration in the baby 1's physical condition can very likely be prevented.
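The relationship between T1, T2 and T3 and the choice of which period to treat as the poor physical condition period can be sketched as follows, using the timestamps of the FIG. 2(b) example; the helper names are assumptions:

```python
from datetime import datetime

# Example timestamps from FIG. 2(b): T3 is the union of T2 (becoming
# ill) and T1 (already ill), which are adjacent in time.
T2_START = datetime(2019, 3, 5, 15, 0)   # onset period begins
T1_START = datetime(2019, 3, 5, 18, 0)   # fully ill period begins
T1_END = datetime(2019, 3, 6, 9, 0)      # illness ends

def in_poor_condition_period(t: datetime, include_onset: bool = True) -> bool:
    """True when t falls in T3 (= T1 + T2). With include_onset=False,
    only the fully ill period T1 is used, as the variation allows."""
    start = T2_START if include_onset else T1_START
    return start <= t <= T1_END
```

Including the onset period T2 is what lets the determination flag the baby while the illness is still developing.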
(Modification 2) In the embodiment and modification described above, the physical condition determination unit 64 of the processor 6 of the voice detection device 5 determines whether the baby 1 is in poor physical condition based on the degree of similarity between the values of a plurality of parameters based on the frequency components of new voice data 51 and the values of a plurality of parameters based on the frequency components of past voice data 50, recorded in the recorded data 41 both for the poor physical condition period and for normal times outside that period. The similarity was calculated using the sum of squares X of the differences between each parameter value extracted from the new voice data 51 and the corresponding value recorded in the recorded data 41 for the baby 1's poor physical condition period, and the sum of squares Y of the differences between each parameter value extracted from the new voice data 51 and the corresponding value recorded for the baby 1 in normal times. In this modification, the similarity is calculated by another method. FIG. 8 illustrates the recorded data 41 used in the physical condition determination process executed by the physical condition detection system in Modification 2. The recorded data 41 illustrated in FIG. 8 differs from the recorded data 41 illustrated in FIG. 3(b) in that the standard deviation of each parameter value is also recorded.
Through the parameter recording process described above, the recorded data 41 illustrated in FIG. 8 records the values of a plurality of parameters based on the frequency components of the past voice data 50. For each of the poor physical condition period x and normal times y, the recorded data 41 holds: the number of utterances of the baby 1 per unit time C1 (C1x and C1y); the duration of one utterance of the baby 1 C2 (C2x and C2y); the fundamental frequency of the baby 1's utterance C3 (C3x and C3y); the first formant frequency of the utterance C4 (C4x and C4y); and the second formant frequency of the utterance C5 (C5x and C5y). It also holds, for both x and y, the corresponding standard deviations: S1 (S1x and S1y) for C1, S2 (S2x and S2y) for C2, S3 (S3x and S3y) for C3, S4 (S4x and S4y) for C4, and S5 (S5x and S5y) for C5. In the example of FIG. 8, C1x = 30, S1x = 4.5, C1y = 20, S1y = 3, C2x = 120, S2x = 20, C2y = 80, S2y = 35, C3x = 380, S3x = 130, C3y = 350, S3y = 80, C4x = 1250, S4x = 120, C4y = 1200, S4y = 100, C5x = 2700, S5x = 200, C5y = 2500, S5y = 150.
Next, suppose that in the physical condition determination process described above, the following parameters are extracted from the new voice data 51 as the plurality of parameters based on its frequency components: the number of utterances of the baby 1 per unit time M1, the duration of one utterance M2, the fundamental frequency of the utterance M3, the first formant frequency M4, and the second formant frequency M5. For example, M1 = 28, M2 = 110, M3 = 360, M4 = 1230, M5 = 2600. The similarity described above may be determined based on the Euclidean distances Dx and Dy calculated by the following equations (1) and (2). The Euclidean distance Dx represents the distance between the parameter values extracted from the voice data 51 and the parameter values for the baby 1's poor physical condition period x, and the Euclidean distance Dy represents the distance between the parameter values extracted from the voice data 51 and the parameter values for the baby 1 in normal times y.

Dx² = {(C1x-M1)/S1x}² + {(C2x-M2)/S2x}²
   + {(C3x-M3)/S3x}² + {(C4x-M4)/S4x}²
   + {(C5x-M5)/S5x}²     ...(1)

Dy² = {(C1y-M1)/S1y}² + {(C2y-M2)/S2y}²
   + {(C3y-M3)/S3y}² + {(C4y-M4)/S4y}²
   + {(C5y-M5)/S5y}²     ...(2)
The Euclidean distances Dx and Dy calculated by equations (1) and (2) are compared with each other; the smaller the Euclidean distance, the higher the similarity. For example, when Dx < Dy, the physical condition of the baby 1 at the time the voice data 51 was acquired resembles the baby 1's condition during the poor physical condition period x, so the physical condition determination unit 64 determines that the baby 1 is in poor physical condition. Substituting M1 = 28, M2 = 110, M3 = 360, M4 = 1230 and M5 = 2600 into equations (1) and (2) yields Dx ≈ 0.87 and Dy ≈ 2.90, so Dx < Dy holds and the baby 1 is determined to be in poor physical condition.
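The worked example above can be reproduced directly from equations (1) and (2). This is a sketch; the variable names are assumptions:

```python
import math

# Recorded values from FIG. 8 (x: poor physical condition period,
# y: normal times), in the order C1, C2, C3, C4, C5.
C_x = [30, 120, 380, 1250, 2700]
S_x = [4.5, 20, 130, 120, 200]
C_y = [20, 80, 350, 1200, 2500]
S_y = [3, 35, 80, 100, 150]

# Newly extracted parameter values M1..M5 from voice data 51.
M = [28, 110, 360, 1230, 2600]

def standardized_distance(C, S, M):
    """Euclidean distance of equations (1)/(2): each parameter
    difference is divided by its recorded standard deviation."""
    return math.sqrt(sum(((c - m) / s) ** 2 for c, m, s in zip(C, M, S)))

dx = standardized_distance(C_x, S_x, M)
dy = standardized_distance(C_y, S_y, M)
# dx comes out near 0.87 and dy near 2.90, so dx < dy and the new data
# resembles the poor physical condition period.
```

Dividing each difference by its standard deviation keeps parameters with large absolute values, such as the formant frequencies, from dominating the distance.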
(Modification 3) In the embodiment and modifications described above, the caregiver of the baby 1 may judge whether the physical condition determination result obtained through the determination process illustrated in FIG. 6 is correct. The caregiver's judgment is used in the identification of the poor physical condition period performed by the voice parameter extraction unit 63 in step S430 of the parameter recording process illustrated in FIG. 4. In this way, the parameter values recorded in the recorded data 41 for the poor physical condition period and for normal times may become more accurate. Furthermore, by using this caregiver feedback as teacher data for machine learning, the accuracy of identifying the poor physical condition period within the period over which the voice data 50 containing the baby 1's utterances was acquired can be improved.
(Modification 4) In the embodiment and modifications described above, when the baby 1 is in poor physical condition, the poor physical condition period display flag illustrated in FIG. 3(a) is set to the value 1. However, to cover the case where the name of the disease affecting the baby 1 is known from a doctor's examination, the flag may instead be set to 2 when the baby 1 has, for example, an inflamed throat, to 3 in the case of pneumonia, and to 1 for other diseases or when the name of the disease is unknown. In this way, it may become possible to estimate the cause of the baby 1's poor physical condition.
Alternatively, the poor physical condition period display flag may be set to 1 when the baby 1 is in poor physical condition, to 2 when the baby 1 is in very good condition, to 3 when the baby 1 is in a "normal" state that is neither very good nor poor, and to 4 when the baby 1's condition is unknown. In this way, it may become possible to estimate the baby 1's physical condition in general, and not only whether the baby 1 is unwell.
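The four-state variant of the flag can be sketched as a simple lookup; the label strings are illustrative assumptions, not values from the specification:

```python
# Assumed mapping for the four-state poor physical condition period
# display flag described above.
CONDITION_FLAG_LABELS = {
    1: "unwell",
    2: "very well",
    3: "normal",
    4: "unknown",
}

def flag_label(flag: int) -> str:
    """Map a recorded flag value to a human-readable condition label,
    treating unrecognized values as unknown."""
    return CONDITION_FLAG_LABELS.get(flag, "unknown")
```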
(Modification 5) In the embodiment and modifications described above, the physical condition determination unit 64 determines whether the baby 1 is in poor physical condition using a plurality of parameters extracted by the voice parameter extraction unit 63 based on the frequency components detected by the frequency component detection unit 62 from the voice data acquired by the voice data acquisition unit 61 of the processor 6 of the voice detection device 5. In addition, a process for estimating the baby 1's emotion, using parameters extracted based on those or other frequency components, may be performed by an emotion estimation unit 65 logically implemented by the processor 6 of the voice detection device 5 shown in FIG. 9. FIG. 9 shows the configuration of the voice detection device 5 included in the physical condition detection system 2 in Modification 5. The processor 6 of the voice detection device 5 shown in FIG. 9 differs from the processor 6 shown in FIG. 1(b) in that it has the emotion estimation unit 65.
The frequency component detection and voice parameter extraction performed on the voice data containing the baby 1's utterances are carried out in the same way as in the parameter recording process of FIG. 4 and the physical condition determination process of FIG. 6 described above. The emotion estimation unit 65 estimates the baby 1's emotion as one of a plurality of classified emotion types by calculating the degree of similarity between the parameters extracted from newly acquired voice data and the parameters extracted from previously acquired voice data and recorded in association with each emotion type. The emotion estimation result produced by the emotion estimation unit 65 may in turn be used as the emotion type associated with the parameters that are extracted from the acquired voice data and recorded.
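The similarity-based emotion estimation can be sketched as a nearest-prototype classifier. The prototype values below are illustrative assumptions, not figures from the specification, and plain (unstandardized) Euclidean distance is used for brevity:

```python
import math

# Assumed recorded parameters per emotion type, in the order
# C1..C5 (utterance rate, duration, F0, first and second formants).
EMOTION_PROTOTYPES = {
    "hungry": [26, 100, 420, 1100, 2400],
    "sleepy": [12, 60, 330, 1000, 2300],
    "playful": [18, 70, 360, 1300, 2600],
}

def estimate_emotion(params):
    """Return the emotion whose recorded parameters are most similar
    (smallest Euclidean distance) to the newly extracted ones."""
    def dist(proto):
        return math.sqrt(sum((p - q) ** 2 for p, q in zip(proto, params)))
    return min(EMOTION_PROTOTYPES, key=lambda e: dist(EMOTION_PROTOTYPES[e]))
```

The same scheme extends to the physical condition determination by adding "unwell"/"normal" prototypes, which is what makes the two units natural to merge.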
The voice parameters used by the emotion estimation unit 65 to estimate the baby 1's emotion and the voice parameters used by the physical condition determination unit 64 to determine poor physical condition may be, for example, parameters of the same types extracted by the voice parameter extraction unit 63, namely the number of utterances of the baby 1 per unit time, the duration of one utterance, the fundamental frequency of the utterance, and the first and second formant frequencies of the utterance. In that case, the emotion estimation unit 65 and the physical condition determination unit 64 may be implemented as a single unit, referred to here as the emotion estimation and physical condition determination unit.
A trained machine learning model may be used for the emotion estimation and physical condition determination performed by that unit, and for the parameter extraction performed beforehand by the voice parameter extraction unit 63. To that end, the parameter recording process illustrated in FIG. 4 is first realized by training the learning model on teacher data. The teacher data is the frequency component detection result obtained in step S420; it may be, for example, spectrogram image data corresponding to the voice data 50 acquired in step S410. Spectrogram image data is fed to the model's input layer, and six classification items relating to physical condition and emotion, such as "unwell", "happy", "angry", "hungry", "sleepy" and "wants to play", are set on the output layer. Each of these classification items is classified by the learning model according to the parameters extracted based on the frequency components of the voice data 50. The learning model is trained with this input layer and output layer in place. This completes the series of processing steps by the voice parameter extraction unit 63, such as the parameter extraction in steps S430, S440 and S450 of FIG. 4.
The combined emotion estimation and physical condition determination process, obtained by adding emotion estimation to the determination process illustrated in FIG. 6, is then realized by feeding the trained model the frequency component detection result obtained in step S620 and reading its output over the six classification items relating to physical condition and emotion, such as "unwell", "happy", "angry", "hungry", "sleepy" and "wants to play". When the spectrogram image data corresponding to the voice data 51 acquired in step S610 is input, the learning model outputs the result of classifying that input according to the parameters extracted based on the frequency components of the voice data. As the classification result, the model outputs the degree of similarity between the input image data and each of the six classification items. This completes the parameter extraction of step S630 of FIG. 6 by the voice parameter extraction unit 63, the parameter similarity calculations of steps S650 and S660 by the emotion estimation and physical condition determination unit, and the emotion estimation process. In the supervised training of the learning model, the fundamental frequency condition that the baby 1's utterance has a fundamental frequency of 300 Hz or more and 800 Hz or less can also be taken into account, so that the fundamental frequency condition judgment of step S640 of FIG. 6, performed by the voice parameter extraction unit 63, is likewise covered by the model's output.
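The six-class output described above can be sketched as a softmax over the model's raw scores, giving one similarity value per class plus a top label. The logit values in the usage are placeholders standing in for a real trained model's output:

```python
import math

# The six physical-condition/emotion classification items set on the
# model's output layer.
CLASSES = ["unwell", "happy", "angry", "hungry", "sleepy", "wants to play"]

def softmax(logits):
    """Numerically stable softmax: normalized similarity per class."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]

def classify(logits):
    """Turn raw per-class scores for one spectrogram image into
    normalized similarity scores and the best-matching label."""
    scores = softmax(logits)
    best = max(range(len(CLASSES)), key=lambda i: scores[i])
    return dict(zip(CLASSES, scores)), CLASSES[best]
```

A real deployment would obtain the logits from a convolutional network over the spectrogram image; only the final normalization and argmax are shown here.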
(Modification 6) In the embodiment and modifications described above, as shown in FIGS. 1 and 9, the physical condition detection system 2 includes the voice detection device 5 and the electronic device 10, and the processor 6 of the voice detection device 5 logically comprises the voice data acquisition unit 61, the frequency component detection unit 62, the voice parameter extraction unit 63, the physical condition determination unit 64 and the emotion estimation unit 65. Other configurations are also possible. For example, the voice detection device 5 and the electronic device 10 may be integrated into one device. Alternatively, the processor 6 of the voice detection device 5 may have the voice data acquisition unit 61 and the frequency component detection unit 62, while the processor 11 of the electronic device 10 has the voice parameter extraction unit 63 and the physical condition determination unit 64. In Modification 5 described above, the processor 6 of the voice detection device 5 may have the voice data acquisition unit 61 and the frequency component detection unit 62, while the processor 11 of the electronic device 10 has the voice parameter extraction unit 63, the physical condition determination unit 64 and the emotion estimation unit 65.
(Modification 7) In the embodiment and modifications described above, the parameter recording process illustrated in FIG. 4 and the physical condition determination process illustrated in FIG. 6 are executed by the processor 6 of the voice detection device 5 included in the physical condition detection system 2. However, some of the processing may be executed by another device different from the voice detection device 5. FIG. 10 shows the configuration of the physical condition detection system 2 in Modification 7. In FIG. 10(a), the physical condition detection system 2 includes the voice detection device 5, the electronic device 10 and a physical condition determination device 20, which are connected to one another via the communication network 30. The physical condition determination device 20 may be, for example, a large-capacity server, and may perform the physical condition determination process not only for the baby 1 but also for other infants.
Like the voice detection device 5 shown in FIG. 1(b), the voice detection device 5 shown in FIG. 10(b) has the microphone 3, the storage 4, the processor 6, the memory 7 and the communication module 8. By running the computer program stored in the memory 7, the processor 6 of the voice detection device 5 shown in FIG. 10(b) logically comprises the frequency component detection unit 62 and the voice parameter extraction unit 63; unlike the voice detection device 5 shown in FIG. 1(b), it has no physical condition determination unit 64, because the determination process is not performed there. Instead of recording the extracted parameters in the storage 4 as recorded data 41, the voice parameter extraction unit 63 transmits them to the electronic device 10 via the communication module 8, as described below.
The processing steps S410 to S440 of the parameter recording process illustrated in FIG. 4 are executed by the voice detection device 5 shown in FIG. 10(b), just as they are by the voice detection device 5 of FIG. 1(b) in the embodiment described above. In this modification, instead of the parameter values being recorded in step S450 of FIG. 4, the extracted parameter values are transmitted by the voice parameter extraction unit 63 to the electronic device 10 via the communication module 8, and from the electronic device 10 onward to the physical condition determination device 20 via the communication network 30.
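A minimal sketch of what the transmitted parameter set might look like on the wire, assuming a JSON encoding; the field names and format are assumptions, as the specification does not define a wire format:

```python
import json

def build_parameter_payload(m1, m2, m3, m4, m5):
    """Serialize the five extracted parameter values for forwarding
    from the voice detection device to the determination server."""
    return json.dumps({
        "utterances_per_unit_time": m1,
        "utterance_duration": m2,
        "fundamental_hz": m3,
        "first_formant_hz": m4,
        "second_formant_hz": m5,
    })

payload = build_parameter_payload(28, 110, 360, 1230, 2600)
```

Sending only these derived parameters, rather than the raw audio, keeps the transmitted data small and avoids moving recordings of the baby off the device.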
The processing steps S610 to S640 of the physical condition determination process illustrated in FIG. 6 are likewise executed by the voice detection device 5 shown in FIG. 10(b). In this modification, when an affirmative judgment is obtained in step S640 of FIG. 6, the parameter values extracted in step S630 are transmitted by the voice parameter extraction unit 63 to the electronic device 10 via the communication module 8, and from the electronic device 10 onward to the physical condition determination device 20 via the communication network 30.
As shown in FIG. 10(c), the physical condition determination device 20 has a processor 21, a storage 24, a memory 27 and a communication module 28. By running the computer program stored in the memory 27, the processor 21 of the physical condition determination device 20 logically comprises a voice parameter acquisition unit 211 and a physical condition determination unit 214. The voice parameter acquisition unit 211 acquires the parameter values extracted by the voice detection device 5 via the communication network 30 and the communication module 28. The parameter values extracted in the parameter recording process are recorded by the voice parameter acquisition unit 211 in the storage 24 as the recorded data 41. For the fundamental frequency among the parameter values extracted in the physical condition determination process, the voice parameter acquisition unit 211 executes the judgment of step S640. The processing steps from step S650 onward, after an affirmative judgment is obtained, are executed by the physical condition determination unit 214 of the processor 21 of the physical condition determination device 20 shown in FIG. 10(c), in the same way as by the physical condition determination unit 64 of the processor 6 of the voice detection device 5 shown in FIG. 1(b).
 As described above, the function of the physical condition determination unit 64 of the processor 6 of the voice detection device 5 shown in FIGS. 1 and 9 can be deployed, as the physical condition determination unit 214, in the processor 21 of the physical condition determination device 20 of this modification. Similarly, the function of the emotion estimation unit 65 of the voice detection device 5 shown in FIG. 9 may be deployed in the processor 21 of the physical condition determination device 20 of this modification.
 The present invention is in no way limited to the configurations of the embodiments and modifications described above, as long as the characteristic functions of the invention are not impaired.
1 infant, 2 physical condition detection system, 3 microphone, 4 storage, 5 voice detection device, 6 processor, 7 memory, 8 communication module, 10 electronic device, 11 processor, 12 memory, 13 input interface, 14 message output device, 15 communication module, 20 physical condition determination device, 21 processor, 24 storage, 27 memory, 28 communication module, 30 communication network, 31 time-series data, 40 computer program providing server, 41 recorded data, 45 recording medium, 46 operation terminal, 50 voice data, 51 voice data, 61 voice data acquisition unit, 62 frequency component detection unit, 63 voice parameter extraction unit, 64 physical condition determination unit, 65 emotion estimation unit, 111 input control unit, 112 message control unit, 141 message, 142 period information, 145 message, 146 search button display, 211 voice parameter acquisition unit, 214 physical condition determination unit

Claims (8)

  1.  A physical condition detection system comprising:
     a voice data acquisition unit that acquires first voice data representing an infant's vocalization;
     a frequency component detection unit that detects a plurality of frequency components included in the first voice data;
     a voice parameter extraction unit that extracts a plurality of parameters based on the plurality of frequency components included in the first voice data; and
     a physical condition determination unit that, when second voice data representing the infant's vocalization is acquired by the voice data acquisition unit, determines whether or not the infant is in a state of poor physical condition based on the acquired second voice data and the plurality of parameters.
  2.  The physical condition detection system according to claim 1, wherein:
     the frequency component detection unit further detects the plurality of frequency components included in the second voice data;
     the voice parameter extraction unit further extracts the plurality of parameters based on the plurality of frequency components included in the second voice data; and
     the physical condition determination unit makes the determination when at least some of the plurality of parameters based on the plurality of frequency components included in the second voice data satisfy a predetermined condition.
  3.  The physical condition detection system according to claim 2, wherein:
     the at least some parameters comprise the fundamental frequency of the vocalization included in the second voice data; and
     the predetermined condition is that the fundamental frequency is 300 Hz or more and 800 Hz or less.
  4.  The physical condition detection system according to claim 2 or claim 3, wherein the physical condition determination unit makes the determination based on a degree of similarity between the plurality of parameters based on the plurality of frequency components included in the first voice data and the plurality of parameters based on the plurality of frequency components included in the second voice data.
  5.  The physical condition detection system according to any one of claims 1 to 4, wherein the plurality of parameters include at least one of: the number of vocalizations per unit time; the duration of a single vocalization; the fundamental frequency of the vocalization; and a formant frequency of the vocalization.
  6.  The physical condition detection system according to any one of claims 1 to 5, wherein the state of poor physical condition includes at least one of a state in which the infant suffers from an illness and a state in which the infant is developing the illness.
  7.  The physical condition detection system according to any one of claims 1 to 6, further comprising a message output device that outputs a message indicating that the infant is in poor physical condition when the physical condition determination unit determines that the infant is in poor physical condition.
  8.  The physical condition detection system according to any one of claims 1 to 7, further comprising an emotion estimation unit that, when the second voice data is acquired by the voice data acquisition unit, estimates an emotion of the infant based on the acquired second voice data and the plurality of parameters.
PCT/JP2019/014526 2019-04-01 2019-04-01 Physical condition detection system WO2020202444A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/014526 WO2020202444A1 (en) 2019-04-01 2019-04-01 Physical condition detection system


Publications (1)

Publication Number Publication Date
WO2020202444A1 true WO2020202444A1 (en) 2020-10-08

Family

ID=72666745

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/014526 WO2020202444A1 (en) 2019-04-01 2019-04-01 Physical condition detection system

Country Status (1)

Country Link
WO (1) WO2020202444A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004531800A (en) * 2001-03-15 2004-10-14 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Automated system for monitoring persons requiring monitoring and their caretakers
WO2007102505A1 (en) * 2006-03-06 2007-09-13 Nagasaki University Infant emotion judging method, and device and program therefor
KR20100000466A (en) * 2008-06-25 2010-01-06 김봉현 Infant diagnostic apparatus and diagnostic methode using it
US20150265206A1 (en) * 2012-08-29 2015-09-24 Brown University Accurate analysis tool and method for the quantitative acoustic assessment of infant cry


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ASTHANA, SHUBHAM ET AL.: "Preliminary Analysis of Causes of Infant Cry", IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 15 December 2014 (2014-12-15), pages 468 - 473, XP032795524 *


Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19922813

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19922813

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP