WO2023209799A1

WO2023209799A1 - Change point score calculation device, change point score calculation method, and program

Info

Publication number: WO2023209799A1
Application number: PCT/JP2022/018871
Authority: WO
Inventors: 彰子高橋; 恵竹下
Original assignee: 日本電信電話株式会社
Priority date: 2022-04-26
Filing date: 2022-04-26
Publication date: 2023-11-02

Abstract

A change point score calculation device according to the present disclosure: enters centroid coordinates, which are the centers of all clusters assigned to data at each time point that has dimension equal to the number of devices × the number of items × a time window length and constitutes time series data for a past period and the current period; enters all cluster transition patterns appearing in the past period and the current period; enters the cluster transition tensor for each of the past period and the current period; calculates the inter-centroid distance matrix for all cluster pairs on the basis of the centroid coordinates of all clusters; calculates the distance matrix for all cluster transition pattern pairs, on the basis of all cluster transition patterns and the inter-centroid distance matrix for all cluster pairs; and calculates the distance between the cluster transition tensors for the past period and the current period on the basis of the cluster transition tensor for each of the past period and the current period and the distance matrix for all cluster transition pattern pairs, taking into account the distance between the cluster transition patterns.

Description

変化点スコア算出装置、変化点スコア算出方法、及びプログラムChange point score calculation device, change point score calculation method, and program

　本開示内容は、距離を考慮した変化点スコアの算出に関する。 The present disclosure relates to calculating a change point score in consideration of distance.

　１つ又は複数の装置で構成されるシステムの各時点におけるシステム状態を表す時系列データを用いて、当該システムのシステム状態の変化点を検知する技術が従来から知られている。ここで、「システム状態」とは、「アクセス数」や「ユーザ数」などの量的変数により表されるシステムの稼働状態のことである。 A technique is conventionally known that uses time-series data representing the system state at each point in time of a system composed of one or more devices to detect a change point in the system state of the system. Here, the "system state" refers to the operating state of the system expressed by quantitative variables such as "number of accesses" and "number of users."

　変化点の発生位置に関する正解ラベルが付与されていない時系列データを対象に変化点を検知する技術としては、非特許文献１に記載されている技術が知られている。 A technique described in Non-Patent Document 1 is known as a technique for detecting a change point in time-series data to which no correct label regarding the position of the change point is attached.

　非特許文献１の方法は、クラスタリングにより変化点を検知する技術を拡張した方法であり、クラスタリングベースの方法であるため、対象時系列が定常性制約、独立同分布制約などの制約を受けることはない。また、非特許文献１の方法は、時系列データの各時点における時間窓をクラスタリングした後、各時点に割り当てられたクラスタを時間軸方向に追跡し遷移パターンを抽出している点で、時間軸の概念を導入した方法と言える。さらに、非特許文献１の方法は、時間窓よりも十分に長い過去期間と現在期間を設定し、両期間におけるクラスタ遷移パターンの分布を比較して変化点スコアを算出するという、時点ごとのスナップショットデータではなく一定の時間幅を持った区間データに対し変化点スコアを算出している点で、時系列パターンの変化も含めて変化点検知が可能な手法である。 The method of Non-Patent Document 1 is an expanded method of detecting change points by clustering, and since it is a clustering-based method, the target time series is not subject to constraints such as stationarity constraints and independent and equal distribution constraints. do not have. In addition, the method of Non-Patent Document 1 clusters the time windows at each point in time series data, and then tracks the clusters assigned to each point in the time axis direction to extract transition patterns. It can be said that this method introduced the concept of Furthermore, the method of Non-Patent Document 1 sets a past period and a current period that are sufficiently longer than the time window, and calculates a change point score by comparing the distribution of cluster transition patterns in both periods. This method is capable of detecting change points, including changes in time-series patterns, because the change point score is calculated for interval data with a fixed time width rather than shot data.

　非特許文献１が提案する変化点検知装置は、具体的には、１つ又は複数の装置で構成されるシステムの各時点におけるシステム状態を表す時系列データであって、前記システムを構成する装置数×前記装置の状態を表す項目数の次元のデータで構成される時系列データを入力する入力部と、前記各時点における前記時系列データを装置数×項目数の次元のデータから、装置数×項目数×時間窓長の次元のデータに変換することで変換データを生成する時間窓生成部と、前記各時点における前記変換データに基づいて算出された前記システム状態の変化点スコアがあらかじめ設定された閾値を超えた場合に変化点として検知する機能（クラスタリング部、クラスタ遷移系列作成部、クラスタ遷移テンソル算出部、変化点スコア算出部、検知部）を有する。 Specifically, the change point detection device proposed in Non-Patent Document 1 is time-series data representing the system state at each point in time of a system composed of one or more devices, and the device constituting the system. an input unit for inputting time-series data consisting of data in a dimension of number x number of items representing the state of the device; A time window generation unit that generates converted data by converting it into data with dimensions of x number of items x time window length, and a change point score of the system state calculated based on the converted data at each point in time are set in advance. It has a function (a clustering unit, a cluster transition sequence creation unit, a cluster transition tensor calculation unit, a change point score calculation unit, and a detection unit) to detect a change point when the threshold value is exceeded.

　また、変化点スコア算出部は、変化点スコアすなわち過去期間と現在期間のクラスタ遷移テンソル間距離の計算方法として、以下の平均平方二乗誤差により実行する。
dist（Ｄ_１，Ｄ_２）＝（Σ_ｌ＝１ ^ＬΣ_ｍ＝１ ^Ｍ（ｄ_２ ^{ｃ１，・・・，ｃＬ}－ｄ_１ ^{ｃ１，・・・，ｃＬ}）^２／Ｍ^Ｌ）^１／２
　ここで、Ｄ_１、Ｄ_２はそれぞれ過去期間、現在期間におけるクラスタ遷移テンソル、ｄ_１ ^{ｃ１，・・・，ｃＬ}、ｄ_２ ^{ｃ１，・・・，ｃＬ}はそれぞれクラスタ遷移パターン｛ｃ１，ｃ２，・・・，ｃＬ｝の滞在確率が格納されているテンソルＤ_１、Ｄ_２の要素である。 Further, the change point score calculation unit executes the following mean square error as a method of calculating the change point score, that is, the distance between cluster transition tensors between the past period and the current period.
dist(D ₁ , D ₂ )=(Σ _l=1 ^L Σ _m=1 ^M (d ₂ ^{c1,..., cL} - _{d 1} ^{c1,..., cL} ) ² /M ^L ) ^1/2
Here, D ₁ and D ₂ are cluster transition tensors in the past period and current period, respectively, and d ₁ ^{c1,..., cL} and d ₂ ^{c1,..., cL} are cluster transition patterns {c1, c2,... . . , cL} are elements of tensors D ₁ and D ₂ that store the stay probabilities.

　しかし、上記のテンソル間距離計算方法では、例えば、過去期間に観測されておらず現在期間に新たに観測されたクラスタ遷移パターンが存在する場合に、新たに観測されたクラスタ遷移パターンが過去観測されていた他のクラスタ遷移パターンからかけ離れたものであっても、似ているものであっても、距離計算上は同じ結果となってしまう。逆に、過去期間に観測されたものの現在期間に観測されていないクラスタ遷移パターンが存在する場合も、過去観測されていたクラスタ遷移パターンが現在観測されている他のクラスタ遷移パターンからかけ離れたものであっても、似ているものであっても、やはり距離計算上は同じ結果となってしまう。過去観測されたものとはかけ離れたクラスタ遷移パターンが新たに観測された場合、もしくは現在観測されているものとはかけ離れたクラスタ遷移パターンが過去観測されていた場合、変化点スコア、すなわち過去期間と現在期間のクラスタ遷移テンソル間距離が上昇することが望ましいが、上記のテンソル間距離計算方法ではこのようなクラスタ遷移パターン間のかけ離れ度合い（距離）を考慮することができないという課題が生じる。 However, with the above inter-tensor distance calculation method, for example, if there is a cluster transition pattern that has not been observed in the past period but is newly observed in the current period, the newly observed cluster transition pattern is not observed in the past. Even if the cluster transition pattern is far away from, or similar to, other cluster transition patterns, the distance calculation will give the same result. Conversely, if there is a cluster transition pattern that was observed in the past period but not in the current period, the cluster transition pattern observed in the past is far different from other cluster transition patterns currently observed. Even if they are similar, the distance calculation will still yield the same result. If a new cluster transition pattern is observed that is far from what has been observed in the past, or if a cluster transition pattern that is far from what is currently observed is observed in the past, the change point score, that is, the past period. Although it is desirable that the distance between cluster transition tensors in the current period increases, a problem arises in that the above-described inter-tensor distance calculation method cannot take into account the degree of separation (distance) between cluster transition patterns.

　本発明は、上記の点を鑑みてなされたものであり、過去観測されたものとはかけ離れたクラスタ遷移パターンが新たに観測された場合や、現在観測されているものとはかけ離れたクラスタ遷移パターンが過去観測されていた場合に、変化点スコアの上昇を促す仕組みを導入し、より精緻な変化点検知を実現することを目的とする。 The present invention has been made in view of the above points, and is applicable to cases where a cluster transition pattern that is far from what has been observed in the past is newly observed, or a cluster transition pattern that is far from what is currently observed. The aim is to introduce a mechanism that promotes an increase in the change point score when a change point has been observed in the past, and to achieve more precise change point detection.

　上記目的を達成するため、請求項１に係る発明は、クラスタリング部により生成された、過去期間と現在期間の時系列データを構成する各時点の装置数×項目数×時間窓長の次元のデータに対し割り当てられた全てのクラスタの中心であるセントロイド座標を入力するセントロイド座標入力部と、クラスタ遷移テンソル算出部において抽出された、過去期間と現在期間に出現した全てのクラスタ遷移パターンを入力するクラスタ遷移パターン入力部と、前記クラスタ遷移テンソル算出部において算出された、過去期間と現在期間それぞれのクラスタ遷移テンソルを入力するクラスタ遷移テンソル入力部と、前記セントロイド座標入力部により入力された全てのクラスタの前記セントロイド座標に基づき、全てのクラスタペアについてのセントロイド間距離行列を算出するセントロイド間距離行列算出部と、前記クラスタ遷移パターン入力部により入力された全ての前記クラスタ遷移パターンと、前記セントロイド間距離行列算出部により算出された全ての前記クラスタペアについてのセントロイド間距離行列に基づき、全ての前記クラスタ遷移パターンのペアについての距離行列を算出するクラスタ遷移パターン間距離行列算出部と、前記クラスタ遷移テンソル入力部により入力された過去期間と現在期間それぞれの前記クラスタ遷移テンソルと、クラスタ遷移パターン間距離行列算出部により算出された全ての前記クラスタ遷移パターンのペアについての距離行列に基づき、前記クラスタ遷移パターンの間の距離を考慮して、過去期間と現在期間の前記クラスタ遷移テンソルの間の距離を算出する変化点スコア算出部と、を有する変化点スコア算出装置である。 In order to achieve the above object, the invention according to claim 1 provides data in the dimensions of number of devices x number of items x time window length at each point in time that constitutes time series data of the past period and the current period, generated by the clustering unit. The centroid coordinate input section inputs the centroid coordinates that are the centers of all clusters assigned to , and the cluster transition tensor calculation section inputs all cluster transition patterns that appeared in the past and current periods, extracted by the cluster transition tensor calculation section. a cluster transition pattern input section that inputs the cluster transition tensors for the past period and the current period calculated in the cluster transition tensor calculation section; an inter-centroid distance matrix calculation unit that calculates an inter-centroid distance matrix for all cluster pairs based on the centroid coordinates of the clusters; and all the cluster transition patterns input by the cluster transition pattern input unit. , an inter-cluster transition pattern distance matrix calculation that calculates a distance matrix for all the cluster transition pattern pairs based on the inter-centroid distance matrix for all the cluster pairs calculated by the inter-centroid distance matrix calculation unit. , the cluster transition tensors for each of the past and current periods input by the cluster transition tensor input unit, and a distance matrix for all pairs of cluster transition patterns calculated by the inter-cluster transition pattern distance matrix calculation unit. a change point score calculation unit that calculates a distance between the cluster transition tensors of a past period and a current period, taking into account the distance between the cluster transition patterns, based on the above.

　以上説明したように本発明によれば、過去観測されたものとはかけ離れたクラスタ遷移パターンが新たに観測された場合や、現在観測されているものとはかけ離れたクラスタ遷移パターンが過去観測されていた場合に、変化点スコアの上昇を促す仕組みを導入し、より精緻な変化点検知を実現することができるという効果を奏する。 As explained above, according to the present invention, a cluster transition pattern that is far different from that observed in the past is newly observed, or a cluster transition pattern that is far different from what is currently observed is observed in the past. In this case, a mechanism is introduced to encourage an increase in the change point score, which has the effect of realizing more precise change point detection.

本実施形態の前提となる変化点検知装置の機能構成の一例を示す図である。It is a figure showing an example of the functional composition of the change point detection device which is a premise of this embodiment. 変化点の検知処理を示すフローチャートである。7 is a flowchart illustrating a process of detecting a change point. 本実施形態に係る距離を考慮した変化点スコア算出装置の機能構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of a functional configuration of a change point score calculation device that takes distance into consideration according to the present embodiment. 本実施形態に係る距離を考慮した変化点スコア算出処理の一例を示すフローチャートである。12 is a flowchart illustrating an example of a change point score calculation process that takes distance into consideration according to the present embodiment. 変化点検知装置、変化点スコア算出装置のハードウェア構成の一例を示す図である。It is a figure showing an example of the hardware constitutions of a change point detection device and a change point score calculation device.

　●本実施形態の前提となる変化点検知装置
　まず、本実施形態の変化点スコア算出装置２０を説明する前に、図１、図２、及び図５を用いて、本実施形態の前提となる変化点検知装置１０について説明する。なお、変化点検知装置１０及び変化点スコア算出装置２０は、同じ装置であり、各特徴の一面を示した名称が付されただけである。 - Change point detection device that is the premise of this embodiment First, before explaining the change point score calculation device 20 of this embodiment, using FIG. 1, FIG. 2, and FIG. The change point detection device 10 will be explained. Note that the change point detection device 10 and the change point score calculation device 20 are the same device, and are simply given names that indicate one aspect of each feature.

　ここでは、１つ又は複数の装置で構成されるシステム（Ｓ）の各時点におけるシステム状態を表す時系列データを用いて、システム状態に何らかの変化が発生した際に、その発生時点を変化点として検知することが可能な変化点検知装置１０について説明する。ここで、「システム状態」とは、「アクセス数」や「ユーザ数」などの量的変数により表されるシステムの稼働状態のことである。 Here, we use time-series data representing the system state at each point in time of a system (S) consisting of one or more devices, and when some change occurs in the system state, we use the point of occurrence as a change point. A change point detection device 10 capable of detecting a change point will be explained. Here, the "system state" refers to the operating state of the system expressed by quantitative variables such as "number of accesses" and "number of users."

　〔機能構成〕
　まず、変化点検知装置１０の機能構成について、図１を参照しながら説明する。図１は、変化点検知装置の機能構成の一例を示す図である。 [Functional configuration]
First, the functional configuration of the change point detection device 10 will be explained with reference to FIG. FIG. 1 is a diagram showing an example of the functional configuration of a change point detection device.

　図１に示すように、変化点検知装置１０は、入力部１１と、時間窓生成部１２と、期間設定部１３と、クラスタリング部１４と、クラスタ遷移系列作成部１５と、クラスタ遷移テンソル算出部１６と、変化点スコア算出部１７と、検知部１８と、出力部１９とを有する。なお、以下に示されている「装置数」及び「装置の状態」の「装置」は変化点検知装置１０によって変化点の検知対象となるシステムを構成する装置を示す。 As shown in FIG. 1, the change point detection device 10 includes an input section 11, a time window generation section 12, a period setting section 13, a clustering section 14, a cluster transition sequence generation section 15, and a cluster transition tensor calculation section. 16, a change point score calculation section 17, a detection section 18, and an output section 19. Note that the "devices" in "Number of devices" and "Device status" shown below indicate devices constituting the system whose change point is to be detected by the change point detection device 10.

　入力部１１は、１つ又は複数の装置で構成されるシステム（Ｓ）の各時点におけるシステム状態を表す時系列データであって、システム（Ｓ）を構成する（装置数×装置の状態を表す項目数）次元のデータで構成される時系列データを入力する。 The input unit 11 is time-series data representing the system state at each point in time of a system (S) composed of one or more devices, and is expressed as (number of devices x device state) constituting the system (S). Input time series data consisting of dimension data (number of items).

　時間窓生成部１２は、入力部１１により入力された時系列データを固定長の時間窓で区切り、各時点のデータを（装置数×項目数）次元のデータから、（装置数×項目数×時間窓長）次元のデータに変換して変換データを生成し、中間出力を行う。 The time window generation unit 12 divides the time series data inputted by the input unit 11 into time windows of fixed length, and converts the data at each point in time from data in the (number of devices x number of items) dimension to (number of devices x number of items x time window length) to generate converted data, and perform intermediate output.

　期間設定部１３は、時間窓生成部１２により生成された（装置数×項目数×時間窓長）次元の時系列データから、あらかじめ設定された過去期間と現在期間の時系列データを抽出して、中間出力を行う。 The period setting unit 13 extracts time series data of a preset past period and current period from the (number of devices x number of items x time window length) dimension time series data generated by the time window generation unit 12. , performs intermediate output.

　クラスタリング部１４は、期間設定部１３により抽出された過去期間と現在期間の時系列データを構成する各時点の（装置数×項目数×時間窓長）次元のデータをクラスタリング手法により状態分類して、中間出力を行う。 The clustering unit 14 uses a clustering method to classify the state of (number of devices x number of items x time window length) dimensional data at each point in time that constitutes the time series data of the past period and the current period extracted by the period setting unit 13. , performs intermediate output.

　クラスタ遷移系列作成部１５は、過去期間と現在期間の各時点における（装置数×項目数×時間窓長）次元のデータに対しクラスタリング部１４が割り当てたクラスタを時間軸方向に追跡し、過去期間と現在期間のそれぞれについて、異なるクラスタ間におけるクラスタ遷移の系列を作成すると同時に、このクラスタ遷移系列を構成する各クラスタに対し当該クラスタにおける滞在期間を付与して、中間出力を行う。 The cluster transition sequence creation unit 15 tracks the clusters assigned by the clustering unit 14 to the (number of devices x number of items x time window length) dimension data at each point in time in the past period and the current period in the time axis direction, and For each of the current period and the current period, a series of cluster transitions between different clusters is created, and at the same time, a period of stay in the cluster is assigned to each cluster constituting the cluster transition series, and intermediate output is performed.

　クラスタ遷移テンソル算出部１６は、クラスタ遷移系列作成部１５により作成されたクラスタ遷移系列から、あらかじめ設定された固定長のクラスタ遷移を抽出した上で、過去期間と現在期間における各クラスタ遷移パターンの出現確率を算出し、上記クラスタ遷移長（クラスタ遷移の長さ）を階数（即ち、次元）とし、過去期間と現在期間に出現した全てのクラスタのユニーク値を各次元のインデックスに持ち、クラスタ遷移パターンの出現確率を値に持つクラスタ遷移テンソルを過去期間と現在期間のそれぞれについて算出して、中間出力を行う。 The cluster transition tensor calculation unit 16 extracts cluster transitions of a preset fixed length from the cluster transition sequence created by the cluster transition sequence creation unit 15, and calculates the appearance of each cluster transition pattern in the past period and the current period. The probability is calculated, the cluster transition length (length of cluster transition) is taken as a rank (i.e., dimension), and the index of each dimension is the unique value of all the clusters that appeared in the past period and the current period, and the cluster transition pattern is calculated. A cluster transition tensor whose value is the probability of occurrence of is calculated for each of the past period and the current period, and intermediate output is performed.

　変化点スコア算出部１７は、クラスタ遷移テンソル算出部１６により算出された過去期間と現在期間それぞれのクラスタ遷移テンソルに基づき、過去期間から現在期間にかけての変化度として、過去期間におけるクラスタ遷移テンソルと現在期間におけるクラスタ遷移テンソルの距離を算出して、中間出力を行う。 The change point score calculation unit 17 calculates the cluster transition tensor for the past period and the current period as the degree of change from the past period to the current period based on the cluster transition tensors for the past period and the current period calculated by the cluster transition tensor calculation unit 16. Calculate the distance of the cluster transition tensor in the period and perform intermediate output.

　検知部１８は、変化点スコア算出部１７により算出された変化点スコアがあらかじめ設定された閾値を超えた場合に変化点として検知する。即ち、検知部１８は、各時点におけるデータ（変換データ）に基づいて算出されたシステム状態の変化点スコアがあらかじめ設定された閾値を超えた場合に変化点として検知する。 The detection unit 18 detects the change point as a change point when the change point score calculated by the change point score calculation unit 17 exceeds a preset threshold. That is, the detection unit 18 detects the change point as a change point when the change point score of the system state calculated based on the data (converted data) at each point in time exceeds a preset threshold.

　出力部１９は、検知部１８により検知された変化点を出力する。 The output unit 19 outputs the change point detected by the detection unit 18.

　〔変化点検知処理〕
　次に、変化点検知処理（手順）について、図２を参照しながら説明する。図２は、変化点検知処理の一例を示すフローチャートである。 [Change point detection processing]
Next, the change point detection process (procedure) will be explained with reference to FIG. FIG. 2 is a flowchart illustrating an example of the change point detection process.

　以降では、システム（Ｓ）を構成する装置数をＭ、各時点におけるシステム状態を表すデータの項目数をＫ、時系列データの観測時点数をＮとして、Ｎ個のＭ×Ｋ次元データで時系列データが構成されているものとする。 In the following, time will be expressed using N pieces of M×K dimensional data, where M is the number of devices configuring the system (S), K is the number of data items representing the system state at each point in time, and N is the number of observation points in time series data. It is assumed that series data is configured.

　なお、各時点におけるＭ×Ｋ次元データの各要素は、当該時点におけるＭ個の装置の状態を表すＫ個の観測値である。具体的には、或る時点におけるＭ×Ｋ次元データを［ｘ１，・・・，ｘＫ，ｘＫ＋１，・・・，ｘ２Ｋ，・・・，ｘ（Ｍ－１）Ｋ＋１，・・・，ｘＭＫ］とした場合、例えば、ｍ＝１，・・・，Ｍに対してｘ（ｍ－１）Ｋ＋１，・・・，ｘｍＫは当該時点におけるｍ番目の装置のＫ個の観測値である。 Note that each element of the M×K dimensional data at each point in time is K observed values representing the states of M devices at that point in time. Specifically, M×K dimensional data at a certain point in time is expressed as [x1,...,xK,xK+1,...,x2K,...,x(M-1)K+1,...,xMK] For example, for m=1, . . . , M, x(m−1)K+1, .

　ステップＳ１１：まず、入力部１１は、Ｎ個のＭ×Ｋ（装置数×項目数）次元データで構成される時系列データを入力する。すなわち、時点ｎにおけるＭ×Ｋ次元データをＸｎとすれば、入力部１１は、時系列データ｛Ｘ１，・・・，ＸＮ｝を入力する。 Step S11: First, the input unit 11 inputs time series data composed of N pieces of M×K (number of devices×number of items) dimensional data. That is, if the M×K dimensional data at time n is Xn, the input unit 11 inputs time series data {X1, . . . , XN}.

　ステップＳ１２：次に、時間窓生成部１２は、ステップＳ１１で入力された時系列データを固定長Ｗの時間窓で区切ることによって、各時点のデータをＭ×Ｋ（装置数×項目数）次元のデータからＭ×Ｋ×Ｗ（装置数×項目数×時間窓長）次元のデータに変換して変換データを生成し、中間出力を行う。具体的には、時点ｎ－（Ｗ－１），ｎ－（Ｗ－２），・・・，ｎそれぞれのＭ×Ｋ次元データＸｎ－（Ｗ－１），Ｘｎ－（Ｗ－２），・・・，Ｘｎで構成されるＭ×Ｋ×Ｗ次元ベクトルＹｎ＝（Ｘｎ－（Ｗ－１），Ｘｎ－（Ｗ－２），・・・，Ｘｎ）を時点ｎのＭ×Ｋ×Ｗ次元データとする。なお、元のＭ×Ｋ次元データＸｎが時点ｎ＝１，・・・，Ｎについて観測されている場合、変換後のＭ×Ｋ×Ｗ次元データＹｎは時点ｎ＝Ｗ，・・・，Ｎについて得られることになる。 Step S12: Next, the time window generation unit 12 divides the time series data input in step S11 into time windows of fixed length W, so that the data at each time point is divided into M×K (number of devices×number of items) dimensions. The data is converted into M×K×W (number of devices×number of items×time window length) dimensional data to generate converted data and intermediate output is performed. Specifically, M×K dimensional data Xn-(W-1), Xn-(W-2), ..., Xn is an M×K×W dimensional vector Yn=(Xn-(W-1), Let it be dimensional data. Note that if the original M×K dimensional data Xn is observed at time n=1,...,N, the converted M×K×W dimensional data Yn is observed at time n=W,...,N You will get about.

　ステップＳ１３：次に、期間設定部１３は、ステップＳ１２で生成されたＭ×Ｋ×Ｗ（装置数×項目数×時間窓長）次元の時系列データから、あらかじめ設定された過去期間と現在期間の時系列データを抽出する。具体的には、過去期間を［ｓ１，ｅ１］、現在期間を［ｓ２，ｅ２］とした場合、時点ｎ＝Ｗ，・・・，ＮのＭ×Ｋ×Ｗ次元データＹｎから過去期間のデータ｛Ｙｓ１，・・・，Ｙｅ１｝と現在期間のデータ｛Ｙｓ２，・・・，Ｙｅ２｝を抽出する。 Step S13: Next, the period setting unit 13 sets a preset past period and current period from the M×K×W (number of devices x number of items x time window length) dimension time series data generated in step S12. Extract time series data. Specifically, when the past period is [s1, e1] and the current period is [s2, e2], data of the past period is obtained from M×K×W dimensional data Yn at time n=W,...,N. {Ys1, . . . , Ye1} and current period data {Ys2, . . . , Ye2} are extracted.

　ステップＳ１４：次に、クラスタリング部１４は、ステップＳ１３で抽出された長さ（ｅ１－ｓ１＋１）時点の過去期間と長さ（ｅ２－ｓ２＋１）時点の現在期間の時系列データを構成する（ｅ１－ｓ１＋ｅ２－ｓ２＋２）個のＭ×Ｋ×Ｗ（装置数×項目数×時間窓長）次元データをクラスタリング手法により状態分類することで、当該時系列データに対応するクラスタ系列を得る。具体的には、クラスタリング部１４は、時点ｎのＭ×Ｋ×Ｗ次元データＹｎが属するクラスタをＣｎとした場合、過去期間の時系列データ｛Ｙｓ１，・・・，Ｙｅ１｝からクラスタ系列｛Ｃｓ１，・・・，Ｃｅ１｝、現在期間の時系列データ｛Ｙｓ２，・・・，Ｙｅ２｝からクラスタ系列｛Ｃｓ２，・・・，Ｃｅ２｝が得られる。なお、クラスタリングは、（ｅ１－ｓ１＋ｅ２－ｓ２＋２）個のＭ×Ｋ×Ｗ次元データを互いの距離に基づいて近いデータ同士を同一クラスタに分類する処理である。各Ｍ×Ｋ×Ｗ次元データに割り当てられたクラスタを時系列順に並べることでクラスタ系列が得られる。クラスタリング手法としては、階層的手法（例えば、最短距離法、最長距離法、群平均法、ウォード法等）が用いられてもよいし、非階層的手法（例えば、Ｋ－Ｍｅａｎｓ法等）が用いられてもよい。 Step S14: Next, the clustering unit 14 configures time series data of the past period at the time of length (e1-s1+1) extracted at step S13 and the current period at the time of length (e2-s2+1) (e1- s1+e2-s2+2) M×K×W (number of devices×number of items×time window length) dimensional data is classified into states using a clustering method to obtain a cluster sequence corresponding to the time series data. Specifically, when the cluster to which the M×K×W dimensional data Yn of time n belongs is defined as Cn, the clustering unit 14 calculates the cluster sequence {Cs1 from the time series data {Ys1, ..., Ye1} of the past period. , ..., Ce1}, and the cluster sequence {Cs2, ..., Ce2} is obtained from the time series data {Ys2, ..., Ye2} of the current period. Note that clustering is a process of classifying (e1-s1+e2-s2+2) pieces of M×K×W dimensional data into the same cluster based on their distances. A cluster series is obtained by arranging clusters assigned to each M×K×W dimensional data in chronological order. As a clustering method, a hierarchical method (for example, shortest distance method, longest distance method, group average method, Ward method, etc.) may be used, or a non-hierarchical method (for example, K-Means method, etc.) may be used. It's okay to be hit.

　ステップＳ１５：次に、クラスタ遷移系列作成部１５は、過去期間［ｓ１，ｅ１］と現在期間［ｓ２，ｅ２］の各時点におけるＭ×Ｋ×Ｗ（装置数×項目数×時間窓長）次元データに対しステップＳ１４で割り当てたクラスタを時間軸方向に追跡し、過去期間と現在期間のそれぞれについて、異なるクラスタ間におけるクラスタ遷移の系列を作成すると共に、このクラスタ遷移系列を構成する各クラスタに対し当該クラスタにおける滞在期間を付与する。具体的に過去期間［ｓ１，ｅ１］の時系列データ｛Ｙｓ１，・・・，Ｙｅ１｝から得られたクラスタ系列｛Ｃｓ１，・・・，Ｃｅ１｝を例にとり説明すると、区間［ｓ１，ｅ１］で異なるクラスタ間におけるクラスタ遷移が発生した時点をτｉ（ｉ＝１，２，・・・，Ｉ）（ただし、τ１＝ｓ１）、時点τｉにおける遷移後クラスタのクラスタをｃ（τｉ）とした場合、これを時系列順に並べることで長さＩのクラスタ遷移系列ｃ（τ１）→ｃ（τ２）→・・・→ｃ（τＩ）が得られる。また、このクラスタ遷移系列を構成する各クラスタｃ（τｉ）に対し、当該クラスタｃ（τｉ）における滞在期間ｄ（τｉ）＝τｉ＋１－τｉ（ただし、τＩ＋１＝ｅ１）を付与することにより、滞在期間つきクラスタ遷移系列ｃ（τ１）｜ｄ（τ１）→ｃ（τ２）｜ｄ（τ２）→・・・→ｃ（τＩ）｜ｄ（τＩ）が得られる。 Step S15: Next, the cluster transition sequence creation unit 15 generates M×K×W (number of devices×number of items×time window length) dimensions at each point in the past period [s1, e1] and the current period [s2, e2]. The clusters assigned to the data in step S14 are tracked in the time axis direction, and cluster transition sequences between different clusters are created for each of the past period and the current period, and each cluster constituting this cluster transition sequence is The period of stay in the cluster is assigned. Specifically, taking as an example the cluster sequence {Cs1, ..., Ce1} obtained from the time series data {Ys1, ..., Ye1} of the past period [s1, e1], the interval [s1, e1] When the time point at which cluster transition occurs between different clusters is τi (i = 1, 2, ..., I) (where τ1 = s1), and the cluster after the transition at time τi is c(τi). , by arranging them in chronological order, a cluster transition sequence c(τ1)→c(τ2)→...→c(τI) of length I is obtained. In addition, by assigning the stay period d (τi) = τi + 1 - τi (however, τI + 1 = e1) in the cluster c (τi) to each cluster c (τi) constituting this cluster transition series, the stay period A cluster transition sequence c(τ1)|d(τ1)→c(τ2)|d(τ2)→...→c(τI)|d(τI) is obtained.

　ステップＳ１６：次に、クラスタ遷移テンソル算出部１６は、ステップＳ１５で作成されたクラスタ遷移系列から、あらかじめ設定された固定長Ｌのクラスタ遷移を抽出した上で、過去期間と現在期間における各クラスタ遷移パターンの出現確率を算出し、上記クラスタ遷移長Ｌを階数（次元）とし、過去期間と現在期間に出現した全てのクラスタのユニーク値を各次元のインデックスに持ち、クラスタ遷移パターンの出現確率を値に持つクラスタ遷移テンソルを過去期間と現在期間のそれぞれについて算出する。具体的に過去期間［ｓ１，ｅ１］の時系列データ｛Ｙｓ１，・・・，Ｙｅ１｝から得られた長さＩのクラスタ遷移系列ｃ（τ１）→ｃ（τ２）→・・・→ｃ（τＩ）を例にとり説明すると、このクラスタ遷移系列から長さＬ（ただし、Ｌ≦Ｉ）のクラスタ遷移は（Ｉ－（Ｌ－１））本抽出することができ、ｃ（τｉ－（Ｌ－１））→ｃ（τｉ－（Ｌ－２））→・・・→ｃ（τｉ）（ｉ＝Ｌ，・・・，Ｉ）で表される。クラスタ遷移テンソル算出部１６は、この（Ｉ－（Ｌ－１））本のクラスタ遷移をパターンごとにまとめて出現確率を算出し、これに基づきＬ次元のクラスタ遷移テンソルを算出する。ここで、クラスタ遷移パターンの出現確率とは、当該クラスタ遷移パターンの出現度数を全てのクラスタ遷移パターンの出現度数合計で割った値である。 Step S16: Next, the cluster transition tensor calculation unit 16 extracts cluster transitions with a preset fixed length L from the cluster transition sequence created in step S15, and then Calculate the appearance probability of the pattern, take the cluster transition length L above as the rank (dimension), have the unique values of all clusters that appeared in the past period and the current period as indexes for each dimension, and calculate the appearance probability of the cluster transition pattern as the value. The cluster transition tensor with is calculated for each of the past period and the current period. Specifically, the cluster transition sequence c(τ1)→c(τ2)→・・・→c( Taking τI) as an example, (I-(L-1)) cluster transitions of length L (L≦I) can be extracted from this cluster transition sequence, and c(τi-(L- 1))→c(τi-(L-2))→...→c(τi) (i=L,...,I). The cluster transition tensor calculating unit 16 calculates the appearance probability of these (I−(L−1)) cluster transitions for each pattern, and calculates an L-dimensional cluster transition tensor based on this. Here, the appearance probability of a cluster transition pattern is a value obtained by dividing the frequency of appearance of the cluster transition pattern by the total frequency of appearance of all cluster transition patterns.

　なお、クラスタ遷移パターンの出現度数は、当該クラスタ遷移パターンの滞在期間で重みづけした値を用いても良い。以下、クラスタ遷移パターンの出現確率をテンソルに格納する方法について、簡単のため、Ｌ＝２、過去期間と現在期間を通じて出現した全てのクラスタのユニーク値がα、β、γだった場合の例で説明する。このとき、クラスタ遷移テンソルは２次元で、各次元のインデックスは３つの値α、β、γをとる。クラスタ遷移テンソルは３×３の配列により表すことができ、クラスタ遷移パターンα→βの出現確率が０．１だった場合、第１軸のインデックス（クラスタ遷移パターンの第１要素）が値α、第２軸のインデックス（クラスタ遷移パターンの第２要素）が値βをとる配列要素に出現確率０．１を格納する。 Note that the frequency of appearance of a cluster transition pattern may be a value weighted by the stay period of the cluster transition pattern. Below, we will explain how to store the probability of occurrence of a cluster transition pattern in a tensor. For simplicity, we will use an example where L = 2 and the unique values of all clusters that have appeared in the past period and the current period are α, β, and γ. explain. At this time, the cluster transition tensor is two-dimensional, and the index of each dimension takes three values α, β, and γ. The cluster transition tensor can be represented by a 3 × 3 array, and if the probability of appearance of the cluster transition pattern α → β is 0.1, the index of the first axis (the first element of the cluster transition pattern) is the value α, An appearance probability of 0.1 is stored in an array element whose index on the second axis (second element of the cluster transition pattern) takes the value β.

　ステップＳ１７：次に、変化点スコア算出部１７は、ステップＳ１６で算出された過去期間と現在期間それぞれのクラスタ遷移テンソルに基づき、過去期間から現在期間にかけての変化度として、過去期間におけるクラスタ遷移テンソルと現在期間におけるクラスタ遷移テンソルの距離を算出する。過去期間のクラスタ遷移テンソルＤ１の要素をｄ１ｉ１，・・・，ｉＬ、現在期間のクラスタ遷移テンソルＤ２の要素をｄ２ｉ１，・・・，ｉＬとした場合、両者間の距離はたとえば以下の平均平方二乗誤差により表すことができる。
（Σｌ＝１ＬΣｍ＝１Ｍ（ｄ２ｉ１，・・・，ｉＬ－ｄ１ｉ１，・・・，ｉＬ）２／ＭＬ）１／２
　なお、下記テンソル間距離におけるＭは、過去期間と現在期間を通じて出現した全てのクラスタのユニーク値の個数である。 Step S17: Next, the change point score calculation unit 17 calculates the cluster transition tensor in the past period as the degree of change from the past period to the current period based on the cluster transition tensors for the past period and the current period calculated in step S16. and calculate the distance of the cluster transition tensor in the current period. If the elements of the cluster transition tensor D1 in the past period are d1i1,..., iL, and the elements of the cluster transition tensor D2 in the current period are d2i1,..., iL, the distance between them is, for example, the following mean square It can be expressed by an error.
(Σl=1LΣm=1M(d2i1,...,iL-d1i1,...,iL)2/ML)1/2
Note that M in the inter-tensor distance below is the number of unique values of all clusters that have appeared throughout the past period and the current period.

　ステップＳ１８：次に、検知部１８は、ステップＳ１７で算出された変化点スコアがあらかじめ設定された閾値を超えた場合に変化点として検知する。即ち、検知部１８は、各時点におけるデータ（変換データ）に基づいて算出されたシステム状態の変化点スコアがあらかじめ設定された閾値を超えた場合に変化点として検知する。 Step S18: Next, the detection unit 18 detects the change point score as a change point when the change point score calculated in step S17 exceeds a preset threshold. That is, the detection unit 18 detects the change point as a change point when the change point score of the system state calculated based on the data (converted data) at each point in time exceeds a preset threshold.

　ステップＳ１９：最後に、出力部１９は、ステップＳ１８で検知された変化点を出力する。 Step S19: Finally, the output unit 19 outputs the change point detected in step S18.

　〔ハードウェア構成〕
　続いて、図３を用いて、変化点検知装置１０のハードウェア構成について説明する。図３は、変化点検知装置のハードウェア構成図である。 [Hardware configuration]
Next, the hardware configuration of the change point detection device 10 will be described using FIG. 3. FIG. 3 is a hardware configuration diagram of the change point detection device.

　図３に示されているように、変化点検知装置１０は、プロセッサ１０１、メモリ１０２、補助記憶装置１０３、接続装置１０４、通信装置１０５、ドライブ装置１０６を有する。なお、変化点検知装置１０を構成する各ハードウェアは、バス１０７を介して相互に接続される。 As shown in FIG. 3, the change point detection device 10 includes a processor 101, a memory 102, an auxiliary storage device 103, a connection device 104, a communication device 105, and a drive device 106. Note that each piece of hardware that constitutes the change point detection device 10 is interconnected via a bus 107.

　プロセッサ１０１は、変化点検知装置１０全体の制御を行う制御部の役割を果たし、ＣＰＵ（Central Processing Unit）等の各種演算デバイスを有する。プロセッサ１０１は、各種プログラムをメモリ１０２上に読み出して実行する。なお、プロセッサ１０１には、ＧＰＧＰＵ(General-purpose computing on graphics processing units)が含まれていてもよい。 The processor 101 plays the role of a control unit that controls the entire change point detection device 10, and includes various calculation devices such as a CPU (Central Processing Unit). The processor 101 reads various programs onto the memory 102 and executes them. Note that the processor 101 may include GPGPU (General-purpose computing on graphics processing units).

　メモリ１０２は、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）等の主記憶デバイスを有する。プロセッサ１０１とメモリ１０２とは、いわゆるコンピュータを形成し、プロセッサ１０１が、メモリ１０２上に読み出した各種プログラムを実行することで、当該コンピュータは各種機能を実現する。 The memory 102 includes main storage devices such as ROM (Read Only Memory) and RAM (Random Access Memory). The processor 101 and the memory 102 form a so-called computer, and when the processor 101 executes various programs read onto the memory 102, the computer realizes various functions.

　補助記憶装置１０３は、各種プログラムや、各種プログラムがプロセッサ１０１によって実行される際に用いられる各種情報を格納する。 The auxiliary storage device 103 stores various programs and various information used when the various programs are executed by the processor 101.

　接続装置１０４は、外部装置（例えば、表示装置１１０、操作装置１１１）と変化点検知装置１０とを接続する接続デバイスである。 The connection device 104 is a connection device that connects an external device (for example, the display device 110, the operation device 111) and the change point detection device 10.

　通信装置１０５は、他の装置との間で各種情報を送受信するための通信デバイスである。 The communication device 105 is a communication device for transmitting and receiving various information to and from other devices.

　ドライブ装置１０６は記録媒体１３０をセットするためのデバイスである。ここでいう記録媒体１３０には、ＣＤ－ＲＯＭ(Compact Disc Read-Only Memory)、フレキシブルディスク、光磁気ディスク等のように情報を光学的、電気的あるいは磁気的に記録する媒体が含まれる。また、記録媒体１３０には、ＲＯＭ(Read Only Memory)、フラッシュメモリ等のように情報を電気的に記録する半導体メモリ等が含まれていてもよい。 The drive device 106 is a device for setting the recording medium 130. The recording medium 130 herein includes a medium that records information optically, electrically, or magnetically, such as a CD-ROM (Compact Disc Read-Only Memory), a flexible disk, and a magneto-optical disk. Further, the recording medium 130 may include a semiconductor memory that electrically records information, such as a ROM (Read Only Memory) or a flash memory.

　なお、補助記憶装置１０３にインストールされる各種プログラムは、例えば、配布された記録媒体１３０がドライブ装置１０６にセットされ、当該記録媒体１３０に記録された各種プログラムがドライブ装置１０６により読み出されることでインストールされる。あるいは、補助記憶装置１０３にインストールされる各種プログラムは、通信装置１０５を介してネットワークからダウンロードされることで、インストールされてもよい。 Note that the various programs to be installed in the auxiliary storage device 103 are installed by, for example, setting the distributed recording medium 130 in the drive device 106 and reading out the various programs recorded on the recording medium 130 by the drive device 106. be done. Alternatively, various programs installed in the auxiliary storage device 103 may be installed by being downloaded from a network via the communication device 105.

　〔変化点検知装置による主な効果〕
　以上のように、変化点検知装置１０は、１つ又は複数の装置で構成されるシステム（Ｓ）の各時点におけるシステム状態を表す時系列データを用いて、システム状態に何らかの変化が発生した際に、その発生時点を変化点として検知することができる。 [Main effects of the change point detection device]
As described above, the change point detection device 10 uses time-series data representing the system state at each point in time of the system (S) composed of one or more devices, and detects when some change occurs in the system state. It is possible to detect the point of occurrence as a point of change.

　しかも、変化点検知装置１０は、各時点におけるシステム状態をクラスタリング手法により状態分類する方法を前提としていることで、周期変動を示すなど定常性制約やiid制約を満たさないデータも含めた時系列データを対象とすることができる。さらに、変化点検知装置１０は、時間経過に伴うシステム（Ｓ）の状態遷移（つまり、各時点でシステム状態が属するクラスタとその滞在期間の遷移）を考慮することでシステム（Ｓ）の周期変動をモデル化しており、周期変動の変化など時間変化パターンの変化も含めた変化を検知することができる。 Moreover, since the change point detection device 10 is based on a method of classifying the system state at each point in time using a clustering method, time-series data including data that does not satisfy the stationarity constraint or the IID constraint, such as data that shows periodic fluctuations, can be targeted. Furthermore, the change point detection device 10 detects periodic fluctuations in the system (S) by considering state transitions of the system (S) over time (that is, transitions in the cluster to which the system state belongs and its stay period at each point in time). It is possible to detect changes, including changes in time-varying patterns, such as changes in periodic fluctuations.

　なお、後述の変化点スコア算出装置２０は、変化点検知装置１０と同様のハードウェア構成であるため、その説明を省略する。 Note that the change point score calculation device 20, which will be described later, has the same hardware configuration as the change point detection device 10, so a description thereof will be omitted.

　●本実施形態の変化点スコア算出装置
　続いて、本発明の一実施形態について説明する。本実施形態では、変化点検知装置１０が、変化点スコア算出部１７において過去期間と現在期間のクラスタ遷移テンソル間距離を計算する際に、過去期間と現在期間におけるクラスタ遷移パターンごとの滞在確率の二乗誤差を各クラスタ遷移パターンのその他パターンからの距離で重みづけすることで、過去観測されたものとはかけ離れたクラスタ遷移パターンが新たに観測された場合や、現在観測されているものとはかけ離れたクラスタ遷移パターンが過去観測されていた場合に、変化点スコアの上昇を促す仕組みを導入し、より精緻な変化点検知を実現することが可能な変化点スコア算出装置２０について説明する。 ●Change point score calculation device according to the present embodiment Next, an embodiment of the present invention will be described. In this embodiment, when the change point detection device 10 calculates the distance between cluster transition tensors in the past period and the current period in the change point score calculation unit 17, the change point detection device 10 calculates the stay probability for each cluster transition pattern in the past period and the current period. By weighting the squared error by the distance of each cluster transition pattern from other patterns, we can calculate whether a cluster transition pattern is newly observed that is far from what has been observed in the past, or is far from what is currently observed. A change point score calculation device 20 that is capable of realizing more precise change point detection by introducing a mechanism to encourage an increase in the change point score when a cluster transition pattern has been observed in the past will be described.

　〔機能構成〕
　まず、本実施形態に係る変化点スコア算出装置２０の機能構成について、図３を参照しながら説明する。図３は、本実施形態に係る変化点スコア算出装置の機能構成の一例を示す図である。 [Functional configuration]
First, the functional configuration of the change point score calculation device 20 according to this embodiment will be described with reference to FIG. 3. FIG. 3 is a diagram showing an example of the functional configuration of the change point score calculation device according to the present embodiment.

　図３に示すように、本実施形態に係る変化点スコア算出装置２０は、セントロイド座標入力部２１と、クラスタ遷移パターン入力部２２と、クラスタ遷移テンソル入力部２３と、セントロイド間距離行列算出部２４と、クラスタ遷移パターン間距離行列算出部２５と、変化点スコア算出部２６と、出力部２７とを有する。 As shown in FIG. 3, the change point score calculation device 20 according to the present embodiment includes a centroid coordinate input section 21, a cluster transition pattern input section 22, a cluster transition tensor input section 23, and an inter-centroid distance matrix calculation. , a cluster transition pattern distance matrix calculation section 25 , a change point score calculation section 26 , and an output section 27 .

　セントロイド座標入力部２１は、クラスタリング部１４により生成された全てのクラスタ（過去期間と現在期間の時系列データを構成する各時点の装置数×項目数×時間窓長の次元のデータに対し割り当てられた全てのクラスタ）のセントロイド（クラスタ中心）座標を入力する。 The centroid coordinate input unit 21 is used to allocate all the clusters generated by the clustering unit 14 (data in the dimensions of number of devices x number of items x time window length at each point in time that constitutes the time series data of the past period and the current period). Enter the centroid (cluster center) coordinates of all clusters that were created.

　クラスタ遷移パターン入力部２２は、クラスタ遷移テンソル算出部１６において抽出された全てのクラスタ遷移パターン（過去期間と現在期間に出現した全てのクラスタ遷移パターン）を入力する。 The cluster transition pattern input unit 22 inputs all cluster transition patterns (all cluster transition patterns that appeared in the past period and the current period) extracted by the cluster transition tensor calculation unit 16.

　クラスタ遷移テンソル入力部２３は、クラスタ遷移テンソル算出部１６において算出された過去期間と現在期間それぞれのクラスタ遷移テンソルを入力する。 The cluster transition tensor input unit 23 inputs the cluster transition tensors for each of the past period and the current period calculated by the cluster transition tensor calculation unit 16.

　セントロイド間距離行列算出部２４は、セントロイド座標入力部２１により入力された全てのクラスタのセントロイド座標に基づき、全てのクラスタペアについてのセントロイド間距離行列を算出し、中間出力を行う。 The inter-centroid distance matrix calculation unit 24 calculates the inter-centroid distance matrices for all cluster pairs based on the centroid coordinates of all clusters input by the centroid coordinate input unit 21, and performs intermediate output.

　クラスタ遷移パターン間距離行列算出部２５は、クラスタ遷移パターン入力部２２により入力された全てのクラスタ遷移パターンと、セントロイド間距離行列算出部２４により算出された全てのクラスタペアについてのセントロイド間距離行列に基づき、全てのクラスタ遷移パターンペアについての距離行列を算出し、中間出力を行う。 The inter-cluster transition pattern distance matrix calculation unit 25 calculates the inter-centroid distances for all cluster transition patterns input by the cluster transition pattern input unit 22 and all cluster pairs calculated by the inter-centroid distance matrix calculation unit 24. Based on the matrix, distance matrices for all cluster transition pattern pairs are calculated and intermediate output is performed.

　変化点スコア算出部２６は、クラスタ遷移テンソル入力部２３により入力された過去期間と現在期間それぞれのクラスタ遷移テンソルと、クラスタ遷移パターン間距離行列算出部２５により算出された全てのクラスタ遷移パターンペアについての距離行列に基づき、クラスタ遷移パターン間の距離を考慮した過去期間と現在期間のクラスタ遷移テンソル間距離を算出し、中間出力を行う。なお、変化点スコア算出部２６は、変化点スコア算出部１７の一面の機能を示した機能部である。 The change point score calculation unit 26 calculates the cluster transition tensors for each of the past period and the current period input by the cluster transition tensor input unit 23 and all the cluster transition pattern pairs calculated by the inter-cluster transition pattern distance matrix calculation unit 25. Based on the distance matrix, the distance between cluster transition tensors for the past period and the current period is calculated, taking into account the distance between cluster transition patterns, and intermediate output is performed. Note that the change point score calculation section 26 is a functional section that shows one aspect of the function of the change point score calculation section 17.

　出力部２７は、変化点スコア算出部２６により算出された過去期間と現在期間のクラスタ遷移テンソル間距離を変化点スコアとして出力し、検知部１８に渡す。 The output unit 27 outputs the distance between cluster transition tensors between the past period and the current period calculated by the change point score calculation unit 26 as a change point score, and passes it to the detection unit 18.

　〔変化点スコア算出処理〕
　次に、本実施形態に係る変化点スコア算出処理（手順）について、図４を参照しながら説明する。図４は、本実施形態に係る変化点スコア算出処理の一例を示すフローチャートである。 [Change point score calculation process]
Next, the change point score calculation process (procedure) according to this embodiment will be described with reference to FIG. 4. FIG. 4 is a flowchart illustrating an example of the change point score calculation process according to the present embodiment.

　ステップＳ２１：まず、セントロイド座標入力部２１は、クラスタリング部１４により生成された全てのクラスタ（過去期間と現在期間の時系列データを構成する各時点の装置数×項目数×時間窓長の次元のデータに対し割り当てられた全てのクラスタ）のセントロイド（クラスタ中心）座標を入力する。すなわち、過去期間と現在期間の時系列を構成するＫ（装置数×項目数×時間窓長）次元の時系列データから、クラスタリング部１４によりＩ個のクラスタｃ_１，ｃ_２，...，ｃ_Ｉが生成されたとして、そのｉ（＝１，２，...，Ｉ）番目のクラスタｃ_ｉのセントロイド座標（Ｋ次元ベクトル）をＸ_ｉ＝（Ｘ_ｉ１，Ｘ_ｉ２，...，Ｘ_ｉＫ）とすれば、セントロイド座標入力部２１は、Ｉ個のセントロイド座標Ｘ_１，Ｘ_２，...，Ｘ_Ｉを入力する。 Step S21: First, the centroid coordinate input unit 21 selects all the clusters generated by the clustering unit 14 (dimensions of number of devices x number of items x time window length at each point in time that constitute the time series data of the past period and the current period). Enter the centroid (cluster center) coordinates of all clusters assigned to the data. That is, the clustering unit 14 creates I clusters c ₁ , c ₂ , . Assuming that c _I is generated, the centroid coordinates (K-dimensional vector) of the i (=1, 2, ..., I)-th cluster c _i are X _i = (X _i1 , X _i2 , ... , X _iK ), the centroid coordinate input unit 21 inputs I centroid coordinates X ₁ , X ₂ , . . . , X _I.

　ステップＳ２２：クラスタ遷移パターン入力部２２は、クラスタ遷移テンソル算出部１６において抽出された全てのクラスタ遷移パターン（過去期間と現在期間に出現した全てのクラスタ遷移パターン）を入力する。すなわち、クラスタ遷移パターン入力部２２は、クラスタ遷移テンソル算出部１６において固定長Ｌのクラスタ遷移パターンがＭ本抽出されたとして、そのｍ（＝１，２，...，Ｍ）番目のクラスタ遷移パターンを｛ｃ_ｍ１，ｃ_ｍ２，...，ｃ_ｍＬ｝とすれば、クラスタ遷移パターン入力部２２は、Ｍ本のクラスタ遷移パターン｛ｃ_ｍ１，ｃ_ｍ２，...，ｃ_ｍＬ｝（ｍ＝１，２，...，Ｍ）を入力する。 Step S22: The cluster transition pattern input unit 22 inputs all cluster transition patterns extracted by the cluster transition tensor calculation unit 16 (all cluster transition patterns that appeared in the past period and the current period). That is, assuming that M cluster transition patterns of fixed length L are extracted in the cluster transition tensor calculation unit 16, the cluster transition pattern input unit 22 inputs the m (=1, 2, ..., M)th cluster transition pattern. If the pattern is {c _m1 , c _m2 , ..., c _mL }, the cluster transition pattern input unit 22 inputs M cluster transition patterns {c _m1 , c _m2 , ..., c _mL } (m =1,2,...,M).

　ステップＳ１３：クラスタ遷移テンソル入力部２３は、クラスタ遷移テンソル算出部１６において算出された過去期間と現在期間それぞれのクラスタ遷移テンソルを入力する。すなわち、過去期間と現在期間のそれぞれについて算出されたクラスタ遷移テンソルをそれぞれＤ_１，Ｄ_２とすれば、クラスタ遷移テンソル入力部２３は、この２つのテンソルＤ_１，Ｄ_２を入力する。なお、ここでのクラスタ遷移テンソルとは、Ｄ_１，Ｄ_２ともに、前記クラスタ遷移パターンの長さＬを階数（次元数）とし、過去期間と現在期間に出現した全てのクラスタのユニーク値を各次元のインデックスに持ち、各次元のインデックス（クラスタ）組合せに対応した要素には各次元のインデックス（クラスタ）を次元順に並べたクラスタ遷移パターンの滞在確率が格納されているテンソルである。 Step S13: The cluster transition tensor input unit 23 inputs the cluster transition tensors for each of the past period and the current period calculated by the cluster transition tensor calculation unit 16. That is, if the cluster transition tensors calculated for the past period and the current period are respectively D ₁ and D ₂ , the cluster transition tensor input unit 23 inputs these two tensors D ₁ and D ₂ . Note that the cluster transition tensor here means that both D ₁ and D ₂ have the length L of the cluster transition pattern as the rank (number of dimensions), and each unique value of all clusters that appeared in the past period and the current period is It is a tensor that has indexes of dimensions, and elements corresponding to combinations of indexes (clusters) of each dimension store the stay probabilities of cluster transition patterns in which the indexes (clusters) of each dimension are arranged in order of dimension.

　ステップＳ２４：セントロイド間距離行列算出部２４は、セントロイド座標入力部２１により入力された全てのクラスタのセントロイド座標に基づき、全てのクラスタペアについてのセントロイド間距離行列を算出する。すなわち、セントロイド座標入力部２１により入力されたＩ個のクラスタｃ_１，ｃ_２，...，ｃ_Ｉのセントロイド座標Ｘ_１，Ｘ_２，...，Ｘ_Ｉに基づき、全てのクラスタペア｛ｃ_ｉ，ｃ_ｊ｝（ｉ＝１，２，...，Ｉ，ｊ＝１，２，...，Ｉ）についてセントロイド座標ペア｛Ｘ_ｉ，Ｘ_ｊ｝間の距離ｄ_ｃ（Ｘ_ｉ，Ｘ_ｊ）を算出し、これを（ｉ，ｊ）成分に格納したＩ行Ｉ列のセントロイド間距離行列Ｍ_ｃを求める。なお、セントロイド座標ペア｛Ｘ_ｉ，Ｘ_ｊ｝間の距離ｄ_ｃ（Ｘ_ｉ，Ｘ_ｊ）としては、次式により表されるユークリッド距離を用いても良いし、他の距離（マンハッタン距離、チェビシェフ距離、マハラノビス距離など）を用いても良い。 Step S24: The inter-centroid distance matrix calculation unit 24 calculates the inter-centroid distance matrix for all cluster pairs based on the centroid coordinates of all clusters input by the centroid coordinate input unit 21. That _is _, based on the _centroid coordinates _X ₁ _, The distance _{d c} _between the centroid coordinate pair {X _i , X j } for the pair {c i , c _j } (i=1,2,...,I, j=1,2, _... ,I) (X _i , X _j ) is calculated, and an I-row, I-column inter-centroid distance matrix M _c is obtained by storing this in the (i, j) components. Note that as the distance d _c (X _i , X _j ) between the centroid coordinate pair {X _i , X _j }, the Euclidean distance expressed by the following formula may be used, or other distances (Manhattan distance, Chebyshev distance, Mahalanobis distance, etc.) may be used.

　ｄ_ｃ（Ｘ_ｉ，Ｘ_ｊ）＝（Σ_ｋ＝１ ^Ｋ（Ｘ_ｊｋ－Ｘ_ｉｋ）^２）^１／２
　また、セントロイド座標ペア｛Ｘ_ｉ，Ｘ_ｊ｝間の距離ｄ_ｃ（Ｘ_ｉ，Ｘ_ｊ）はクラスタｉとクラスタｊを入れ替えても変わらないためＭ_ｃは対象行列である。 d _c (X _i , X _j )=(Σ _k=1 ^K (X _jk −X _ik ) ² ) ^1/2
Further, since the distance d _c (X _i , X _j ) between the centroid coordinate pair {X _i , X _j } does not change even if cluster i and cluster j are swapped, M _c is a symmetric matrix.

　ステップＳ２５：クラスタ遷移パターン間距離行列算出部２５は、クラスタ遷移パターン入力部２２により入力された全てのクラスタ遷移パターンと、セントロイド間距離行列算出部２４により算出された全てのクラスタペアについてのセントロイド間距離行列に基づき、全てのクラスタ遷移パターンペアについての距離行列を算出する。すなわち、クラスタ遷移パターン入力部２２により入力された全てのクラスタ遷移パターンをπ_ｍ＝｛ｃ_ｍ１，ｃ_ｍ２，...，ｃ_ｍＬ｝（ｍ＝１，２，...，Ｍ）、セントロイド間距離行列算出部２４により算出された全てのクラスタペア｛ｃ_ｉ，ｃ_ｊ｝（ｉ＝１，２，...，Ｉ，ｊ＝１，２，...，Ｉ）についてのセントロイド間距離をｄ_ｃ（Ｘ_ｉ，Ｘ_ｊ）とすれば、クラスタ遷移パターン間距離行列算出部２５は、これらの入力に基づき、全てのクラスタ遷移パターンペア｛π_ｍ，π_ｎ｝（ｍ＝１，２，...，Ｍ，ｎ＝１，２，...，Ｍ）についてクラスタ遷移パターン間距離ｄ_ｐ（π_ｍ，π_ｎ）を算出し、これを（ｍ，ｎ）成分に格納したＭ行Ｍ列の距離行列Ｍ_ｐを求める。 Step S25: The inter-cluster transition pattern distance matrix calculation unit 25 calculates the centroids for all cluster transition patterns input by the cluster transition pattern input unit 22 and all cluster pairs calculated by the inter-centroid distance matrix calculation unit 24. Distance matrices for all cluster transition pattern pairs are calculated based on the interroid distance matrix. That is, all cluster transition patterns input by the cluster transition pattern input unit 22 are calculated as π _m ={c _m1 , _cm2 ,..., c _mL } (m=1, 2,..., M), cents for all cluster pairs {c _i , c _j } (i=1, 2, ..., I, j=1, 2, ..., I) calculated by the interroid distance matrix calculation unit 24 If the inter-roid distance is d _c (X _i , X _j ), the inter-cluster transition pattern distance matrix calculation unit 25 calculates all cluster transition pattern pairs {π _m , π _n } (m= 1, 2, ..., M, n = 1, 2, ..., M), calculate the distance d _p (π _m , π _n ) between cluster transition patterns, and convert this into the (m, n) component. The stored distance matrix M _p of M rows and M columns is determined.

　なお、クラスタ遷移パターンπ_ｍ＝｛ｃ_ｍ１，ｃ_ｍ２，...，ｃ_ｍＬ｝とクラスタ遷移パターンπ_ｎ＝｛ｃ_ｎ１，ｃ_ｎ２，...，ｃ_ｎＬ｝の距離を算出するにあたっては、例えば、バイオインフォマティクス分野の技術であるアラインメント技術を利用する。アラインメント技術とは、２本もしくは３本以上の配列間の類似性の判定に利用可能な技術であり、配列長を同じにするようにギャップ記号を挿入しながら配列間の最適な対応関係を求める最適化問題を解く技術である。ステップＳ２５では、特に、２本の配列間の最適な対応関係を求めるペアワイズ・アラインメント技術を利用する。ペアワイズ・アラインメント技術において最適化（最大化）されるのは、２本の配列間の対応関係から算出されるスコアであり、対応づけられた文字ペアが同じ場合、違う場合、一方がギャップの場合のそれぞれに対してあらかじめスコアを設定し、その総和により算出されるものである。ステップＳ２５では、このペアワイズ・アラインメント技術を利用して、２本のクラスタ遷移パターン間の最適な対応関係を求め、最大スコアを算出することができるため、この最大スコアを２本のクラスタ遷移パターン間の類似度と見なし、この類似度から２本のクラスタ遷移パターン間の距離を算出することができる。例えば、類似度を区間［０，１］で正規化した後、１から引くなどの方法により距離を算出することができる。なお、一般的なアラインメント技術においては、対応づけられた文字ペアが同じ場合、違う場合、一方がギャップの場合のそれぞれに対してあらかじめ定数スコアを設定することが多いが、ステップＳ２５の場合、配列（クラスタ遷移パターン）を構成する文字がクラスタに該当するため、違う文字ペア（クラスタペア）に対して設定するスコアとしてステップＳ２４で算出したセントロイド間距離を用いる。例えば、同じ文字ペア（クラスタペア）に対して設定するスコアを＋ｓ、一方がギャップの場合に設定するスコアを－ｇとした場合、セントロイド間距離行列Ｍ_ｃの最小値が＋ｓ、最大値－ｇとなるよう正規化する。これは、セントロイド間距離行列Ｍ_ｃの最小値は同じクラスタペア間の距離に該当するため、これを同じ文字ペア（クラスタペア）に対して設定したスコア＋ｓに対応づけ、またセントロイド間距離行列Ｍ_ｃの最大値は最も離れたクラスタペア間の距離に該当するため、これを一方がギャップの場合のスコア－ｇに対応づけることを意味する。 In addition, when calculating the distance between the cluster transition pattern π _m ={c _m1 , c _m2 , ..., c _mL } and the cluster transition pattern π _n = {c _n1 , c _n2 , ..., c _nL }, For example, alignment technology, which is a technology in the bioinformatics field, is used. Alignment technology is a technology that can be used to determine the similarity between two or more sequences, and it seeks the optimal correspondence between sequences by inserting gap symbols to make the sequence lengths the same. It is a technique for solving optimization problems. In step S25, in particular, a pairwise alignment technique is used to find the optimal correspondence between two sequences. What is optimized (maximized) in pairwise alignment technology is the score calculated from the correspondence between two sequences, and when the matched character pairs are the same, different, or one has a gap A score is set in advance for each, and the score is calculated by summing the scores. In step S25, by using this pairwise alignment technology, it is possible to find the optimal correspondence between the two cluster transition patterns and calculate the maximum score. The distance between two cluster transition patterns can be calculated from this similarity. For example, the distance can be calculated by normalizing the similarity in the interval [0, 1] and then subtracting it from 1. In addition, in general alignment technology, constant scores are often set in advance for each of the cases where the matched character pairs are the same, when they are different, and when one of them has a gap, but in the case of step S25, the alignment Since the characters constituting the (cluster transition pattern) correspond to clusters, the inter-centroid distance calculated in step S24 is used as the score set for different character pairs (cluster pairs). For example, if the score set for the same character pair (cluster pair) is +s and the score set when one is a gap is -g, then the minimum value of the inter-centroid distance matrix M _c is +s and the maximum value - Normalize it so that it becomes g. This is because the minimum value of the inter-centroid distance matrix M _c corresponds to the distance between the same cluster pair, so this is associated with the score + s set for the same character pair (cluster pair), and the inter-centroid distance Since the maximum value of the matrix M _c corresponds to the distance between the farthest cluster pair, this means that it is associated with the score -g when one of the clusters is a gap.

　ステップＳ２６：変化点スコア算出部２６は、クラスタ遷移テンソル入力部２３により入力された過去期間と現在期間それぞれのクラスタ遷移テンソルと、クラスタ遷移パターン間距離行列算出部２５により算出された全てのクラスタ遷移パターンペアについての距離行列に基づき、クラスタ遷移パターン間の距離を考慮した過去期間と現在期間のクラスタ遷移テンソル間距離を算出する。すなわち、クラスタ遷移テンソル入力部２３により入力された過去期間と現在期間のクラスタ遷移テンソルＤ_１、Ｄ_２に格納されているクラスタ遷移パターンπ_ｍ＝｛ｃ_ｍ１，ｃ_ｍ２，...，ｃ_ｍＬ｝の滞在確率をｐ_１ ^ｍ、ｐ_２ ^ｍ、クラスタ遷移パターン間距離行列算出部２５により算出された全てのクラスタ遷移パターンペアについての距離行列Ｍ_ｐの行平均をとった距離ベクトルを Step S26: The change point score calculation unit 26 calculates the cluster transition tensors for each of the past period and the current period input by the cluster transition tensor input unit 23, and all cluster transitions calculated by the cluster transition pattern distance matrix calculation unit 25. Based on the distance matrix for pattern pairs, distances between cluster transition tensors in the past period and current period are calculated, taking into account the distances between cluster transition patterns. That is, the cluster transition patterns π _m ={c _m1 , _cm2 , ..., c _mL stored in the cluster transition tensors D ₁ and D ₂ for the past period and the current period input by the cluster transition tensor input unit 23 } are the stay probabilities p ₁ ^m , p ₂ ^m , and the distance vector obtained by taking the row average of the distance matrix M _p for all the cluster transition pattern pairs calculated by the inter-cluster transition pattern distance matrix calculation unit 25 is

とし、その第ｍ要素をδ_ｍとすれば、クラスタ遷移パターン間の距離を考慮した過去期間と現在期間のクラスタ遷移テンソル間距離ｄ（Ｄ_１，Ｄ_２）は例えば次式により算出される。

If the m-th element is δ _m , the distance d (D ₁ , D ₂ ) between cluster transition tensors between the past period and the current period, taking into account the distance between cluster transition patterns, is calculated, for example, by the following equation.

　ｄ（Ｄ_１，Ｄ_２）＝（Σ_ｍδ_ｍ（ｐ_２ ^ｍ－ｐ_１ ^ｍ）^２）^１／２
　なお、ここで距離ベクトル d(D ₁ , D ₂ )=(Σ _m δ _m (p ₂ ^m −p ₁ ^m ) ² ) ^1/2
In addition, here the distance vector

の第ｍ要素δ_ｍは、クラスタ遷移パターンπ_ｍのその他パターンからの平均距離を意味し、次式により算出される（Ｎは距離行列Ｍ_ｐの列数であり、過去期間、現在期間を通じて出現したクラスタ遷移パターン数を意味する）。

The m-th element δ _m means the average distance of the cluster transition pattern π _m from other patterns, and is calculated by the following formula (N is the number of columns of the distance matrix _M (means the number of cluster transition patterns).

　δ_ｍ＝Σ_ｎ≠ｍｄ_ｐ（π_ｍ，π_ｎ）／（Ｎ－１）
　ステップＳ２７：出力部２７は、変化点スコア算出部２６により算出された過去期間と現在期間のクラスタ遷移テンソル間距離ｄ（Ｄ_１，Ｄ_２）を変化点スコアとして出力し、検知部８に渡す。 δ _m =Σ _n≠m d _p (π _m , π _n )/(N-1)
Step S27: The output unit 27 outputs the distance d (D ₁ , D ₂ ) between cluster transition tensors between the past period and the current period calculated by the change point score calculation unit 26 as a change point score, and passes it to the detection unit 8. .

　〔実施形態の主な効果〕
　以上のように、本実施形態に係る変化点スコア算出装置２０は、変化点スコア算出部１７において過去期間と現在期間のクラスタ遷移テンソル間距離を計算する際に、過去期間と現在期間におけるクラスタ遷移パターンごとの滞在確率の二乗誤差を各クラスタ遷移パターンのその他パターンからの距離で重みづけすることで、過去観測されたものとはかけ離れたクラスタ遷移パターンが新たに観測された場合や、現在観測されているものとはかけ離れたクラスタ遷移パターンが過去観測されていた場合に、変化点スコアの上昇を促す仕組みを導入し、より精緻な変化点検知を実現することができる。 [Main effects of the embodiment]
As described above, when the change point score calculation unit 17 calculates the distance between cluster transition tensors between the past period and the current period, the change point score calculation device 20 according to the present embodiment By weighting the squared error of the stay probability for each pattern by the distance of each cluster transition pattern from other patterns, it is possible to detect cases where a new cluster transition pattern is observed that is far from those observed in the past, or when a cluster transition pattern that is currently observed is If a cluster transition pattern that is far different from the current one has been observed in the past, a mechanism can be introduced to encourage the change point score to increase, allowing for more precise change point detection.

　〔補足〕
　本発明は上述の実施形態に限定されるものではなく、以下に示すような構成又は処理（動作）であってもよい。〔supplement〕
The present invention is not limited to the above-described embodiments, and may have the following configuration or processing (operation).

　変化点検知装置１０及び変化点スコア算出装置２０は、コンピュータとプログラムによっても実現できるが、このプログラムを（非一時的）記録媒体に記録することも、インターネット等のネットワークを通して提供することも可能である。 The change point detection device 10 and the change point score calculation device 20 can be realized by a computer and a program, but this program can also be recorded on a (non-temporary) recording medium or provided through a network such as the Internet. be.

１０　変化点検知装置
１１　入力部
１２　時間窓生成部
１３　期間設定部
１４　クラスタリング部
１５　クラスタ遷移系列作成部
１６　クラスタ遷移テンソル算出部
１７　変化点スコア算出部
１８　検知部
１９　出力部
２０　変化点スコア算出装置
２１　セントロイド座標入力部
２２　クラスタ遷移パターン入力部
２３　クラスタ遷移テンソル入力部
２４　セントロイド間距離行列算出部
２５　クラスタ遷移パターン間距離行列算出部
２６　変化点スコア算出部
２７　出力部 10 Change point detection device 11 Input section 12 Time window generation section 13 Period setting section 14 Clustering section 15 Cluster transition series creation section 16 Cluster transition tensor calculation section 17 Change point score calculation section 18 Detection section 19 Output section 20 Change point score calculation device 21 Centroid coordinate input section 22 Cluster transition pattern input section 23 Cluster transition tensor input section 24 Inter-centroid distance matrix calculation section 25 Inter-cluster transition pattern distance matrix calculation section 26 Change point score calculation section 27 Output section

Claims

　クラスタリング部により生成された、過去期間と現在期間の時系列データを構成する各時点の装置数×項目数×時間窓長の次元のデータに対し割り当てられた全てのクラスタの中心であるセントロイド座標を入力するセントロイド座標入力部と、
　クラスタ遷移テンソル算出部において抽出された、過去期間と現在期間に出現した全てのクラスタ遷移パターンを入力するクラスタ遷移パターン入力部と、
　前記クラスタ遷移テンソル算出部において算出された、過去期間と現在期間それぞれのクラスタ遷移テンソルを入力するクラスタ遷移テンソル入力部と、
　前記セントロイド座標入力部により入力された全てのクラスタの前記セントロイド座標に基づき、全てのクラスタペアについてのセントロイド間距離行列を算出するセントロイド間距離行列算出部と、
　前記クラスタ遷移パターン入力部により入力された全ての前記クラスタ遷移パターンと、前記セントロイド間距離行列算出部により算出された全ての前記クラスタペアについてのセントロイド間距離行列に基づき、全ての前記クラスタ遷移パターンのペアについての距離行列を算出するクラスタ遷移パターン間距離行列算出部と、
　前記クラスタ遷移テンソル入力部により入力された過去期間と現在期間それぞれの前記クラスタ遷移テンソルと、クラスタ遷移パターン間距離行列算出部により算出された全ての前記クラスタ遷移パターンのペアについての距離行列に基づき、前記クラスタ遷移パターンの間の距離を考慮して、過去期間と現在期間の前記クラスタ遷移テンソルの間の距離を算出する変化点スコア算出部と、
　を有する変化点スコア算出装置。 Centroid coordinates, which are the centers of all clusters, assigned to the data generated by the clustering unit in the dimensions of number of devices x number of items x time window length at each time point that make up the time series data of the past and current periods. a centroid coordinate input section for inputting
a cluster transition pattern input unit that inputs all cluster transition patterns that appeared in the past period and the current period, extracted by the cluster transition tensor calculation unit;
a cluster transition tensor input unit that inputs cluster transition tensors for each of the past period and the current period calculated in the cluster transition tensor calculation unit;
an inter-centroid distance matrix calculation unit that calculates an inter-centroid distance matrix for all cluster pairs based on the centroid coordinates of all clusters input by the centroid coordinate input unit;
All the cluster transitions are calculated based on all the cluster transition patterns input by the cluster transition pattern input unit and the inter-centroid distance matrices for all the cluster pairs calculated by the inter-centroid distance matrix calculation unit. a cluster transition pattern distance matrix calculation unit that calculates a distance matrix for a pair of patterns;
Based on the cluster transition tensors for each of the past period and the current period input by the cluster transition tensor input unit, and the distance matrix for all pairs of cluster transition patterns calculated by the cluster transition pattern distance matrix calculation unit, a change point score calculation unit that calculates a distance between the cluster transition tensors of the past period and the current period, taking into account the distance between the cluster transition patterns;
A change point score calculation device having:
　前記クラスタ遷移パターン間距離行列算出部は、前記クラスタ遷移パターンの間の距離を算出する際にアラインメント技術を利用し、当該アラインメント技術により算出される前記クラスタ遷移パターンの間の最適な対応関係を達成するスコアを正規化した値を類似度と見なし、当該類似度に基づき前記クラスタ遷移パターンの間の距離を算出する、請求項１に記載の変化点スコア算出装置。 The inter-cluster transition pattern distance matrix calculation unit utilizes alignment technology when calculating the distance between the cluster transition patterns, and achieves an optimal correspondence between the cluster transition patterns calculated by the alignment technology. 2. The change point score calculation device according to claim 1, wherein a value obtained by normalizing the score is regarded as a degree of similarity, and a distance between the cluster transition patterns is calculated based on the degree of similarity.
　クラスタリング部により生成された、過去期間と現在期間の時系列データを構成する各時点の装置数×項目数×時間窓長の次元のデータに対し割り当てられた全てのクラスタの中心であるセントロイド座標を入力するセントロイド座標入力処理と、
　クラスタ遷移テンソル算出部において抽出された、過去期間と現在期間に出現した全てのクラスタ遷移パターンを入力するクラスタ遷移パターン入力処理と、
　前記クラスタ遷移テンソル算出部において算出された、過去期間と現在期間それぞれのクラスタ遷移テンソルを入力するクラスタ遷移テンソル入力処理と、
　セントロイド座標入力処理により入力された全てのクラスタの前記セントロイド座標に基づき、全てのクラスタペアについてのセントロイド間距離行列を算出するセントロイド間距離行列算出処理と、
　クラスタ遷移パターン入力処理により入力された全ての前記クラスタ遷移パターンと、セントロイド間距離行列算出処理により算出された全ての前記クラスタペアについてのセントロイド間距離行列に基づき、全ての前記クラスタ遷移パターンのペアについての距離行列を算出するクラスタ遷移パターン間距離行列算出処理と、
　クラスタ遷移テンソル入力処理により入力された過去期間と現在期間それぞれの前記クラスタ遷移テンソルと、クラスタ遷移パターン間距離行列算出処理により算出された全ての前記クラスタ遷移パターンのペアについての距離行列に基づき、前記クラスタ遷移パターンの間の距離を考慮して過去期間と現在期間の前記クラスタ遷移テンソルの間の距離を算出する変化点スコア算出処理と、
　をコンピュータが実行する変化点スコア算出方法。 Centroid coordinates, which are the centers of all clusters, assigned to the data generated by the clustering unit in the dimensions of number of devices x number of items x time window length at each time point that make up the time series data of the past and current periods. Centroid coordinate input processing to input
Cluster transition pattern input processing that inputs all cluster transition patterns that appeared in the past period and the current period, extracted by the cluster transition tensor calculation unit;
Cluster transition tensor input processing of inputting cluster transition tensors for each of the past period and the current period calculated by the cluster transition tensor calculation unit;
An inter-centroid distance matrix calculation process that calculates an inter-centroid distance matrix for all cluster pairs based on the centroid coordinates of all clusters input by the centroid coordinate input process;
All of the cluster transition patterns are calculated based on all the cluster transition patterns input by the cluster transition pattern input process and the inter-centroid distance matrices for all the cluster pairs calculated by the inter-centroid distance matrix calculation process. A distance matrix calculation process between cluster transition patterns that calculates a distance matrix for the pair;
Based on the cluster transition tensors for each of the past period and the current period input by the cluster transition tensor input process and the distance matrix for all the pairs of cluster transition patterns calculated by the inter-cluster transition pattern distance matrix calculation process, a change point score calculation process that calculates the distance between the cluster transition tensors of the past period and the current period by considering the distance between the cluster transition patterns;
A change point score calculation method performed by a computer.
　コンピュータに、請求項３に記載の方法を実行させるプログラム。 A program that causes a computer to execute the method according to claim 3.