JP5569984B2

JP5569984B2 - Network failure handling method, network and host

Info

Publication number: JP5569984B2
Application number: JP2012024864A
Authority: JP
Inventors: 達哉坂本
Original assignee: NEC Engineering Ltd
Current assignee: NEC Engineering Ltd
Priority date: 2012-02-08
Filing date: 2012-02-08
Publication date: 2014-08-13
Anticipated expiration: 2032-02-08
Also published as: JP2013162446A

Description

The present invention relates to a network failure handling method, a network, and a host for use within a packet communication device equipped with a fabric switch.

In recent years, the volume of communication data inside wireless base station devices has increased dramatically. As the data volume grows, the communication speed between the cards housed in a wireless base station device is also rising. Consequently, there is a growing need to detect communication failures between cards in a wireless base station device at an early stage and to minimize their impact.

To begin, we describe the relationship between AdvancedTCA (Advanced Telecommunications Computing Architecture), a standard for core applications defined by the PICMG (PCI Industrial Computer Manufacturers Group), and MicroTCA (Micro Telecommunications Computing Architecture).

Normally, an AdvancedMC (hereinafter referred to as an "AMC card") is connected as a mezzanine card on a board compliant with AdvancedTCA.

MicroTCA, however, provides an architecture standard for realizing the functions and characteristics of AdvancedTCA even with boards that are not AdvancedTCA-compliant.

Specifically, with MicroTCA, an AMC card can be connected directly to the backplane and used even when the shelf contains no AdvancedTCA-compliant board. This reduces physical size and keeps introduction costs down while still providing the functions and characteristics of AdvancedTCA. For these reasons, MicroTCA is widely used in equipment for telecommunications carriers.

The AMC cards connected in a MicroTCA-compliant shelf each include an MMC (Module Management Controller) function conforming to the IPMI (Intelligent Platform Management Interface) specification, which uses the I2C (Inter-Integrated Circuit)-based IPMB (Intelligent Platform Management Bus).

The MMC monitors each AMC card for failures. When it detects a failure, the MMC notifies the MCMC (MicroTCA Carrier Management Controller) of the failure information over I2C. On receiving the failure information, the MCMC notifies the host AMC card of the failure details over I2C. Three communication rates are defined for I2C: 100 kbps, 400 kbps, and 3.4 Mbps.

Meanwhile, sRIO (Serial Rapid IO), a high-speed serial interface, is widely used as the main data communication interface between AMC cards. "Rapid IO" is a registered trademark of the Rapid IO Trade Association. sRIO is described in detail in Non-Patent Document 1.

sRIO is a standard for connecting the various devices inside AMC cards to a fabric switch over serial lines, enabling high-speed communication of up to 10 Gbps between devices. Devices conforming to the sRIO standard are already in practical use in wireless base station devices.

An example of the operation of a network system conforming to the MicroTCA standard is described with reference to FIG. 8.

The MicroTCA network in this example includes the following functional blocks: a host AMC 60, a fabric switch 70, a plurality of adapter AMCs 80, and an MCMC 90.

The host AMC 60 manages the entire network. The fabric switch 70 distributes packets. A device conforming to the sRIO standard is connected to each adapter AMC 80 as an AMC card. The MCMC 90 consolidates the maintenance and monitoring functions for the entire network. Each functional block is described in more detail below.

The host AMC 60 includes a CPU 61, an MMC 62, a TXBFR 63, and an RXBFR 64.

The CPU 61 is a processing unit that controls the entire network. The MMC 62 provides network maintenance and monitoring functions. The TXBFR 63 is a transmission buffer with packet generation and transmission functions. The RXBFR 64 is a reception buffer with a packet reception function.

The fabric switch 70 includes RXBFR 71 and RXBFR 74-1 to RXBFR 74-n, as well as TXBFR 72 and TXBFR 73-1 to TXBFR 73-n.

RXBFR 71 and RXBFR 74-1 to RXBFR 74-n are reception buffers.

TXBFR 72 and TXBFR 73-1 to TXBFR 73-n, on the other hand, are transmission buffers. These reception and transmission buffers distribute packets between the host AMC 60 and the adapter AMCs 80 (adapter AMC 80-1 to adapter AMC 80-n).

Each of the adapter AMCs 80 includes an MMC 81, an RXBFR 82, and a TXBFR 83.

The MMC 81 provides network maintenance and monitoring functions. The RXBFR 82 is a reception buffer with a packet reception function. The TXBFR 83 is a transmission buffer with packet generation and transmission functions. Two types of data are transmitted and received in this example. One is failure notification data exchanged over I2C among the MMC 81, the MCMC 90, and the MMC 62. The other is data exchanged over sRIO between the host AMC 60 and the adapter AMCs 80 via the fabric switch 70. In the following description, the former is called the "I2C failure notification" and the latter is called "main data".

First, the exchange of main data on the network is described. Main data sent from the host AMC 60 to an adapter AMC 80 is generated by the CPU 61 of the host AMC 60 and passed to the TXBFR 63.

The main data is then divided into packets and transmitted to the RXBFR 71 of the fabric switch 70.

Within the fabric switch 70, the packets are distributed according to their destination adapter AMC 80 and stored in the corresponding TXBFR 73 (TXBFR 73-1 to TXBFR 73-n).

The packets are sent to the RXBFR 82 of the adapter AMC 80 in the order in which they were stored. They are then processed inside the AMC card, which is the device connected to the adapter AMC 80.

In the opposite direction, main data from an adapter AMC 80 to the host AMC 60 is packetized by the TXBFR 83 of the adapter AMC 80. The packets are then transmitted to, and stored in, the corresponding RXBFR 74 (RXBFR 74-1 to RXBFR 74-n) of the fabric switch 70.

The stored packets from the individual adapter AMCs 80 are merged in the TXBFR 72 and transmitted to the RXBFR 64 of the host AMC 60. The packets are then processed by the CPU 61 of the host AMC 60.

This concludes the description of main data. Next, the transmission and reception of the I2C failure notification is described.

For maintenance and monitoring of the network system, the MMC 81 (MMC 81-1 to MMC 81-n) in each adapter AMC 80 monitors the state of that adapter. When an adapter AMC 80 detects a failure, its MMC 81 reports the failure to the MCMC 90 over I2C.

On receiving the report, the MCMC 90 sends the information that a failure has occurred to the MMC 62 of the host AMC 60 as an I2C failure notification, which serves as a status notification for the entire network.

The I2C failure notification is passed to the CPU 61 via the MMC 62. The CPU 61 can thereby recognize that a failure has occurred.

Patent Document 1 describes an example of a technique for realizing the fabric switch operation described above. The packet switching integrated circuit chip described in Patent Document 1 is configured to receive packets, for example packets conforming to RapidIO (registered trademark), from a plurality of external sources and to selectively forward the data in the received packets to a plurality of external receiving devices. The chip can further be configured to manage received packets based on the destination address in each received packet.

Patent Document 1: Japanese Unexamined Patent Application Publication (Translation of PCT Application) No. 2008-539613

Non-Patent Document 1: "Rapid IO Interconnect Specification Rev1.3" [online], retrieved December 8, 2011, Internet <http://www.rapidio.org>

However, the general MicroTCA network described above has the following problems.

The first problem is that, under the general MicroTCA standard, I2C is used to transmit and receive the failure notification data of each adapter AMC 80 (MMC 81-1 to MMC 81-n), so it takes time before the host AMC 60 is notified that a failure has occurred. Because of this delay, packets continue to accumulate in the buffers and efficient packet transmission becomes impossible.

A specific example of this phenomenon is described with reference to FIGS. 9 and 10. In this description, a failure is assumed to have occurred on the sRIO path between the fabric switch 70 and the adapter AMC 80-1.

When a failure occurs on the sRIO path, the MMC 81-1 of the adapter AMC 80-1 detects the sRIO path failure. The MMC 81-1 then generates an I2C failure notification and sends it to the MCMC 90 over I2C.

The I2C failure notification received by the MCMC 90 is passed to the CPU 61 in the host AMC 60 via the MMC 62 of the host AMC 60.

However, as noted above, the I2C communication rate is at most 3.4 Mbps, which is slow compared with sRIO. As shown in FIG. 10, it therefore takes several tens of milliseconds before the CPU 61 of the host AMC 60 detects the failure information.

Main data between the host AMC 60 and the adapter AMCs 80 (adapter AMC 80-1 to adapter AMC 80-n), by contrast, is carried as high-speed packets (communication rate: 1 Gbps or more) over sRIO via the fabric switch 70. During the several tens of milliseconds that elapse before the CPU 61 of the host AMC 60 detects the failure, as shown in FIG. 10, the host AMC 60 therefore remains able to send a large number of packets to the adapter AMCs 80 (adapter AMC 80-1 to adapter AMC 80-n).
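As a rough illustration of the scale involved, the following sketch estimates how much main data the host could emit before the I2C failure notification reaches it. The text gives only "several tens of milliseconds" and "1 Gbps or more"; the 30 ms delay and the 256-byte packet size used here are assumed values chosen purely for illustration, and the function name is hypothetical.

```python
def backlog_before_detection(link_rate_bps: int, delay_ms: int, packet_bytes: int) -> tuple[int, int]:
    """Return (bytes, whole packets) the host can emit before it learns of the failure."""
    total_bytes = link_rate_bps * delay_ms // 1000 // 8
    return total_bytes, total_bytes // packet_bytes


# Assumed values: 1 Gbps link, 30 ms notification delay, 256-byte packets.
backlog_bytes, backlog_packets = backlog_before_detection(1_000_000_000, 30, 256)
print(backlog_bytes, backlog_packets)
```

Under these assumptions the host could emit about 3.75 MB, or roughly 14,600 packets, before it is aware of the failure, which is far more than on-chip FIFO buffers would typically hold.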

The fabric switch 70, however, has a FIFO (first in, first out) configuration: received packets are stored in a buffer in the order they arrive and sent out in that same order, so the order of packet transmission never changes. Therefore, when a failure occurs on an sRIO path, the packets in the buffer for the failed path (TXBFR 73-1) in the fabric switch 70 can no longer be output, and subsequent packets accumulate.

Then, as shown in FIG. 9, TXBFR 73-1 in the fabric switch 70 fills up first. Packets can then no longer be sent from RXBFR 71 to TXBFR 73-1, so RXBFR 71 fills up next. Eventually packets also accumulate in the transmission buffer (TXBFR 63) of the host AMC 60, and packets can no longer be sent efficiently onto the sRIO paths. This is the first problem.
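The blocking sequence described above can be reproduced with a small simulation. This is a minimal sketch, not a model of real sRIO hardware: the one-packet-per-tick timing, the buffer capacity of four packets, the round-robin traffic pattern, and all names are assumptions made for illustration. Adapters are indexed from 0, so index 0 stands for adapter AMC 80-1.

```python
from collections import deque


def simulate(ticks, failed, n_adapters=3, cap=4):
    """Simulate the conventional FIFO pipeline; return per-adapter and cumulative deliveries."""
    host_tx = deque()                              # single host FIFO (stands for TXBFR 63)
    sw_rx = deque()                                # switch ingress FIFO (stands for RXBFR 71)
    sw_tx = [deque() for _ in range(n_adapters)]   # per-adapter egress FIFOs (TXBFR 73-x)
    delivered = [0] * n_adapters
    history = []
    for t in range(ticks):
        # 1. Each healthy link drains one packet from its egress FIFO.
        for d in range(n_adapters):
            if d != failed and sw_tx[d]:
                sw_tx[d].popleft()
                delivered[d] += 1
        # 2. The switch forwards the head of the ingress FIFO only if the
        #    matching egress FIFO has room; otherwise everything behind it waits.
        if sw_rx and len(sw_tx[sw_rx[0]]) < cap:
            d = sw_rx.popleft()
            sw_tx[d].append(d)
        # 3. The host sends its head packet if the ingress FIFO has room.
        if host_tx and len(sw_rx) < cap:
            sw_rx.append(host_tx.popleft())
        # 4. The CPU generates one packet per tick, cycling through the
        #    destinations, unless the host FIFO is full.
        if len(host_tx) < cap:
            host_tx.append(t % n_adapters)
        history.append(sum(delivered))
    return delivered, history


delivered, history = simulate(ticks=60, failed=0)
healthy, healthy_history = simulate(ticks=60, failed=None)
```

With one failed path, deliveries to the two healthy adapters also stop once the failed adapter's egress FIFO, the ingress FIFO, and the host FIFO have filled, whereas the run without a failure keeps delivering packets for the whole simulation.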

The second problem is that, under the sRIO standard, the host AMC 60 and the adapter AMCs 80 (adapter AMC 80-1 to adapter AMC 80-n) connected to the fabric switch 70 on the sRIO network cannot tell whether any AMC other than themselves is able to communicate with the fabric switch 70 unless they read a register in the fabric switch 70.

Therefore, when a failure occurs on an sRIO path between the fabric switch 70 and an adapter AMC 80 (adapter AMC 80-1 to adapter AMC 80-n), the failure cannot be detected in real time unless the registers of the fabric switch 70 are dumped continuously. As a result, packets accumulate as described above, and the first problem, namely that packets cannot be transmitted efficiently, becomes even more pronounced.

Accordingly, an object of the present invention is to provide a network failure handling method, a network, and a host that can properly transmit packets addressed to the other AMCs even when a failure occurs on the path used for communication between the fabric switch and a given adapter AMC.

According to a first aspect of the present invention, there is provided a network in which a plurality of adapters are connected to a host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, wherein the host comprises: a plurality of transmission buffers respectively corresponding to the plurality of adapters; means for outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and means for stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of such data from that buffer, and wherein the protocol used for notification differs between the case of notifying the host that a failure relating to an adapter has occurred and the case of notifying the host that a failure has occurred on the path between the switch and an adapter.

According to a second aspect of the present invention, there is provided a host in a network in which a plurality of adapters are connected to the host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, the host comprising: a plurality of transmission buffers respectively corresponding to the plurality of adapters; means for outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and means for stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of such data from that buffer, wherein the protocol used for notification differs between the case of notifying the host that a failure relating to an adapter has occurred and the case of notifying the host that a failure has occurred on the path between the switch and an adapter.

According to a third aspect of the present invention, there is provided a network failure handling method for a network in which a plurality of adapters are connected to a host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, the method comprising, at the host: a step of buffering data in a plurality of transmission buffers respectively corresponding to the plurality of adapters; a step of outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and a step of stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of such data from that buffer, wherein the protocol used for notification differs between the case of notifying the host that a failure relating to an adapter has occurred and the case of notifying the host that a failure has occurred on the path between the switch and an adapter.

According to the present invention, when a failure occurs on the path used for communication between the fabric switch and an adapter AMC, the source host can be notified of the failure promptly.

FIG. 1 is a diagram showing the basic configuration of a MicroTCA network system according to an embodiment of the present invention.
FIG. 2 is a diagram showing the basic configuration of the host AMC in the embodiment of the present invention.
FIG. 3 is a diagram showing the format of the control symbol (NOP) used for the sRIO failure notification.
FIG. 4 is a diagram showing the flow of data transmitted and received in the embodiment of the present invention.
FIG. 5 is a flowchart showing the basic operation of the embodiment of the present invention.
FIG. 6 is an explanatory diagram of a path failure between the fabric switch and an adapter AMC in the embodiment of the present invention.
FIG. 7 is a time chart for a path failure between the fabric switch and an adapter AMC in the embodiment of the present invention.
FIG. 8 is a diagram showing the basic configuration of a general MicroTCA network system.
FIG. 9 is a diagram explaining the situation when a failure occurs in a general MicroTCA network system.
FIG. 10 is a time chart for a path failure between the fabric switch and an adapter AMC in a general MicroTCA network system.

Next, an embodiment of the present invention is described in detail with reference to the drawings.

FIG. 1 is a block diagram of the MicroTCA network system according to this embodiment.

Referring to FIG. 1, this embodiment includes the following functional blocks: a host AMC 10, a fabric switch 20, a plurality of adapter AMCs 30, and an MCMC 40.

The host AMC 10 manages the entire network. The fabric switch 20 distributes packets. A device conforming to the sRIO standard is connected to each adapter AMC 30 as an AMC card. The MCMC 40 consolidates the maintenance and monitoring functions for the entire network. Each functional block is described in more detail below.

The host AMC 10 includes a CPU 11, an MMC 12, a TXBFR 13, an RXBFR 14, and an sRIO failure processing unit 15.

The CPU 11 is a processing unit that controls the entire network. The MMC 12 provides network maintenance and monitoring functions. The TXBFR 13 is a transmission buffer with packet generation and transmission functions. The RXBFR 14 is a reception buffer with a packet reception function. The sRIO failure processing unit 15 handles sRIO path failures.

The fabric switch 20 includes RXBFR 21 and RXBFR 24-1 to RXBFR 24-n, TXBFR 22 and TXBFR 23-1 to TXBFR 23-n, and an sRIO failure processing unit 25.

RXBFR 21 and RXBFR 24-1 to RXBFR 24-n are reception buffers.

TXBFR 22 and TXBFR 23-1 to TXBFR 23-n, on the other hand, are transmission buffers. These reception and transmission buffers distribute packets between the host AMC 10 and the adapter AMCs 30 (adapter AMC 30-1 to adapter AMC 30-n). That is, in this embodiment, the main data is demultiplexed by the fabric switch 20, and the separated main data is allocated to the appropriate transmission buffers.

The sRIO failure processing unit 25 handles sRIO path failures. Specifically, on detecting a failure, the sRIO failure processing unit 25 sends a failure notification to the sRIO failure processing unit 15 over sRIO.

Each of the adapter AMCs 30 includes an MMC 31, an RXBFR 32, and a TXBFR 33.

The MMC 31 provides network maintenance and monitoring functions. The RXBFR 32 is a reception buffer with a packet reception function. The TXBFR 33 is a transmission buffer with packet generation and transmission functions.

This embodiment differs from a general MicroTCA network system in that it additionally includes the sRIO failure processing unit 15 and the sRIO failure processing unit 25.

Although FIG. 1 shows only the adapter AMC 30-1, the adapter AMC 30-2, and the adapter AMC 30-n, other adapter AMCs 30 may also be present. In other words, the number of adapter AMCs 30 is arbitrary. When the number of adapter AMCs 30 increases, the numbers of TXBFR 23 and RXBFR 24 in the fabric switch 20 increase accordingly.

Three types of data are transmitted and received in this embodiment.

The first is data exchanged over sRIO between the host AMC 10 and the adapter AMCs 30 via the fabric switch 20, and is called "main data".

The second is failure notification data exchanged over I2C among the MMC 31, the MCMC 40, and the MMC 12, and is called the "I2C failure notification".

The third is failure notification data exchanged between the sRIO failure processing unit 25 of the fabric switch 20 and the sRIO failure processing unit 15 of the host AMC 10, and is called the "sRIO failure notification".

Next, the configuration of the host AMC 10 is described in more detail with reference to FIG. 2.

Referring to FIG. 2, the TXBFR 13 includes a transmission buffer control unit 13-1, a plurality of transmission buffers 13-2, a selector 13-3, and a packet transmission function unit 13-4.

The RXBFR 14 includes a packet reception function unit 14-1, a reception buffer 14-2, and a reception buffer control unit 14-3.

The transmission buffer control unit 13-1 stores the main data sent from the CPU 11 in the transmission buffers 13-2. The transmission buffers 13-2-1 to 13-2-n each correspond to the adapter AMC 30 whose reference numeral ends in the same number. For example, the transmission buffer 13-2-1 corresponds to the adapter AMC 30-1. The transmission buffer control unit 13-1 therefore selects a transmission buffer 13-2 according to the destination adapter AMC 30 and stores the main data in it.

The transmission buffers 13-2-1 to 13-2-n are FIFO buffers for accumulating main data. The main data accumulated in the transmission buffers 13-2-1 to 13-2-n is read out by the selector 13-3.

The selector 13-3 selects the transmission buffer 13-2 from which main data is to be read. The selector 13-3 then passes the main data it has read to the packet transmission function unit 13-4.

The packet transmission function unit 13-4 transmits the main data received from the selector 13-3 to the fabric switch 20 over sRIO.
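The transmit-side structure described above, combined with the means for stopping writes to or reads from a failed adapter's buffer, can be sketched as follows. This is a minimal illustration under stated assumptions: the class and method names are hypothetical, the round-robin selection policy is an assumption (the text does not say how the selector 13-3 chooses a buffer), and adapters are indexed from 0, so index 0 stands for adapter AMC 30-1.

```python
from collections import deque


class HostTxBuffers:
    """Per-adapter FIFO transmission buffers with a selector (illustrative model of TXBFR 13)."""

    def __init__(self, n_adapters: int):
        self.fifos = [deque() for _ in range(n_adapters)]  # stand for buffers 13-2-1 .. 13-2-n
        self.failed = set()                                # adapters whose path has failed
        self._next = 0                                     # round-robin position (assumed policy)

    def write(self, dest: int, data) -> bool:
        """Buffer control: store data by destination; refuse writes for a failed adapter."""
        if dest in self.failed:
            return False
        self.fifos[dest].append(data)
        return True

    def mark_failed(self, dest: int) -> None:
        self.failed.add(dest)

    def select(self):
        """Selector: read from the next non-empty buffer whose adapter has not failed."""
        n = len(self.fifos)
        for i in range(n):
            d = (self._next + i) % n
            if d not in self.failed and self.fifos[d]:
                self._next = (d + 1) % n
                return d, self.fifos[d].popleft()
        return None  # nothing eligible to send


tx = HostTxBuffers(3)
for dest in (0, 1, 2, 0, 1, 2):
    tx.write(dest, f"packet-for-{dest}")
tx.mark_failed(0)

sent_to = []
while (item := tx.select()) is not None:
    sent_to.append(item[0])
```

Because each destination has its own FIFO, data queued for the failed adapter stays in its own buffer and does not block the data destined for the other adapters.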

パケット受信機能部１４−１は、ファブリックスイッチ２０より送信されてきた主要データ又はｓＲＩＯ障害通知を受け取る。そして、受け取った主要データを受信バッファ１４−２に送る。また、受け取ったｓＲＩＯ障害通知をｓＲＩＯ障害処理部１５に送る。 The packet reception function unit 14-1 receives main data or sRIO failure notification transmitted from the fabric switch 20. Then, the received main data is sent to the reception buffer 14-2. The received sRIO failure notification is sent to the sRIO failure processing unit 15.

受信バッファ１４−２は、パケット受信機能部１４−１より受け取った主要データを蓄積する為のバッファでありＦＩＦＯのバッファである。受信バッファ１４−２は、蓄積した主要データを受信バッファ制御部１４−３に送る。 The reception buffer 14-2 is a buffer for storing the main data received from the packet reception function unit 14-1, and is a FIFO buffer. The reception buffer 14-2 sends the accumulated main data to the reception buffer control unit 14-3.

受信バッファ制御部１４−３は受信バッファ１４−２から送り出される主要データを受け取る。そして、受信バッファ制御部１４−３は受け取った主要データをＣＰＵ１１に送る。 The reception buffer control unit 14-3 receives main data sent from the reception buffer 14-2. Then, the reception buffer control unit 14-3 sends the received main data to the CPU 11.

ｓＲＩＯ障害処理部１５は、ｓＲＩＯ障害通知を受け取ると、ＣＰＵ１１に対してｓＲＩＯ経路上で障害が発生した旨を通知する。ＣＰＵ１１はこの通知を受けてｓＲＩＯ障害処理部１５のレジスタを参照する。ｓＲＩＯ障害処理部１５のレジスタには、ｓＲＩＯ障害通知が格納されている。そして、ＣＰＵ１１はこのｓＲＩＯ障害通知を参照することにより、障害が発生した経路がどの経路であるかということを検知する。 Upon receiving the sRIO failure notification, the sRIO failure processing unit 15 notifies the CPU 11 that a failure has occurred on the sRIO path. Upon receiving this notification, the CPU 11 refers to the register of the sRIO failure processing unit 15. The register of the sRIO failure processing unit 15 stores an sRIO failure notification. Then, the CPU 11 refers to this sRIO failure notification to detect which route the failure has occurred.

The sRIO failure processing unit 15 then generates an sRIO failure notification response, carried in a control symbol, to acknowledge that the CPU 11 has detected the sRIO path failure information. It sends this response to the packet transmission function unit 13-4 of the TXBFR 13.

次に、本実施形態全体での３種類のデータの送受信についてそれぞれ説明する。 Next, transmission / reception of three types of data in the entire embodiment will be described.

まず主要データの送受信について説明する。 First, transmission / reception of main data will be described.

ネットワークシステム上でのホストＡＭＣ１０からアダプタＡＭＣ３０への主要データは、ホストＡＭＣ１０のＣＰＵ１１で生成され、ＴＸＢＦＲ１３に送信される。 Main data from the host AMC 10 to the adapter AMC 30 on the network system is generated by the CPU 11 of the host AMC 10 and transmitted to the TXBFR 13.

In the TXBFR 13, as shown in FIG. 2, the transmission buffer control unit 13-1 stores the main data in the transmission buffer 13-2 provided for each destination adapter AMC 30. The main data is then read from the transmission buffer 13-2 under the control of the selector 13-3. The packet transmission function unit 13-4 packetizes the read main data and sends it to the RXBFR 21 of the fabric switch 20.

Inside the fabric switch 20, the RXBFR 21 accumulates the packets. The accumulated packets are distributed according to their destination adapter AMC 30 and stored in the corresponding TXBFR 23 (TXBFR 23-1 to TXBFR 23-n). Each TXBFR 23 sends its packets, in the order in which they were stored, to the RXBFR 32 of the corresponding adapter AMC 30. The packets are then processed inside the AMC card, which is the device connected to the adapter AMC 30.
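The distribution step inside the fabric switch can be pictured with a small sketch. The Python model below is illustrative only: the class name `FabricSwitchDownlink`, its method names, and the use of integer adapter indices are assumptions made for this example and are not specified by the embodiment.

```python
from collections import deque


class FabricSwitchDownlink:
    """Toy model of RXBFR 21 feeding the per-adapter TXBFR 23-1..23-n (hypothetical names)."""

    def __init__(self, num_adapters):
        self.rx_buffer = deque()  # stands in for RXBFR 21
        # One FIFO per adapter, standing in for TXBFR 23-1..23-n.
        self.tx_buffers = {i: deque() for i in range(1, num_adapters + 1)}

    def receive(self, dest, payload):
        """Accumulate a packet from the host together with its destination adapter."""
        self.rx_buffer.append((dest, payload))

    def distribute(self):
        """Move every accumulated packet into the FIFO of its destination adapter."""
        while self.rx_buffer:
            dest, payload = self.rx_buffer.popleft()
            self.tx_buffers[dest].append(payload)

    def send_next(self, dest):
        """Return the oldest packet queued for one adapter, or None if its FIFO is empty."""
        queue = self.tx_buffers[dest]
        return queue.popleft() if queue else None


switch = FabricSwitchDownlink(num_adapters=2)
switch.receive(2, "a")
switch.receive(1, "b")
switch.receive(2, "c")
switch.distribute()
to_adapter_2 = [switch.send_next(2), switch.send_next(2)]
to_adapter_1 = [switch.send_next(1), switch.send_next(1)]
```

Each adapter sees its own packets in arrival order, and an empty FIFO simply yields nothing, which mirrors the per-destination ordering described above.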

一方、各アダプタＡＭＣ３０からホストＡＭＣ１０への主要データは、各アダプタＡＭＣ３０のＴＸＢＦＲ３３でパケット化され、ファブリックスイッチ２０の対応するＲＸＢＦＲ２４（ＲＸＢＦＲ２４−１〜ＲＸＢＦＲ２４−ｎ）に送信されて蓄積される。 On the other hand, main data from each adapter AMC 30 to the host AMC 10 is packetized by the TXBFR 33 of each adapter AMC 30 and transmitted to and stored in the corresponding RXBFR 24 (RXBFR 24-1 to RXBFR 24-n) of the fabric switch 20.

蓄積された個々のアダプタＡＭＣ３０からのパケットはＴＸＢＦＲ２２でまとめられてホストＡＭＣ１０のＲＸＢＦＲ１４へ送信される。ＲＸＢＦＲ１４はパケットを受信して、受信したパケットをＣＰＵ１１に送る。そして、パケットはＣＰＵ１１にて処理される。 The accumulated packets from the individual adapters AMC 30 are collected by the TXBFR 22 and transmitted to the RXBFR 14 of the host AMC 10. RXBFR 14 receives the packet and sends the received packet to CPU 11. The packet is processed by the CPU 11.

次に、Ｉ２Ｃ障害通知の送受信について説明する。 Next, transmission / reception of an I2C failure notification will be described.

ネットワークシステム上の保守監視においては、個々のアダプタＡＭＣ３０に含まれるＭＭＣ３１が、アダプタＡＭＣ３０内部のうちのｓＲＩＯ経路に関連しない部分の状態監視を行う。例えば、アダプタＡＭＣ３０に接続されたデバイスに障害が発生したか否かの監視を行う。そして、アダプタＡＭＣ３０内部のうちのｓＲＩＯ経路に関連しない部分で障害が発生した場合は、ＭＣＭＣ４０宛てにＩ２Ｃ障害通知を行う。 In the maintenance monitoring on the network system, the MMC 31 included in each adapter AMC 30 monitors the state of the part not related to the sRIO path in the adapter AMC 30. For example, it is monitored whether a failure has occurred in a device connected to the adapter AMC 30. When a failure occurs in a portion not related to the sRIO path in the adapter AMC 30, an I2C failure notification is sent to the MCMC 40.

一方、本実施形態では、ＭＭＣ３１はｓＲＩＯ経路に関連する障害発生に関しては監視を行わない。そのため、アダプタＡＭＣ３０とファブリックスイッチ２０間でのｓＲＩＯ経路に障害が発生したとしてもＭＭＣ３１は特定の処理を行わない。 On the other hand, in this embodiment, the MMC 31 does not monitor the occurrence of a failure related to the sRIO path. Therefore, even if a failure occurs in the sRIO path between the adapter AMC 30 and the fabric switch 20, the MMC 31 does not perform a specific process.

そして、アダプタＡＭＣ３０内部のＭＭＣ３１による状態監視下の部分で障害が発生した場合は、ＭＭＣ３１がＩ２ＣにてＭＣＭＣ４０へＩ２Ｃ障害通知を行う。 When a failure occurs in the portion under the status monitoring by the MMC 31 inside the adapter AMC 30, the MMC 31 notifies the MCMC 40 of the I2C failure through I2C.

On receiving the notification, the MCMC 40 forwards the information that a failure has occurred, as a status notification for the entire network, to the MMC 12 of the host AMC 10 in the form of an I2C failure notification.

送られてきた、Ｉ２Ｃ障害通知はＭＭＣ１２を介してＣＰＵ１１へと送られる。これにより、ＣＰＵ１１は障害の発生を認識できる。 The sent I2C failure notification is sent to the CPU 11 via the MMC 12. Thereby, the CPU 11 can recognize the occurrence of the failure.

次に、ｓＲＩＯ障害通知の送受信について説明する。 Next, transmission / reception of the sRIO failure notification will be described.

ネットワークシステム上でのｓＲＩＯ経路障害に関しては、ファブリックスイッチ２０内のｓＲＩＯ障害処理部２５で監視を行う。この監視について説明する。 An sRIO path failure on the network system is monitored by the sRIO failure processing unit 25 in the fabric switch 20. This monitoring will be described.

In this embodiment, the fabric switch 20 and each adapter AMC 30 exchange idle signals with each other. The sRIO failure processing unit 25 monitors these idle signals and, when an idle signal can no longer be received, detects that a failure has occurred between the fabric switch 20 and the adapter AMC 30 concerned. Having detected the failure, the sRIO failure processing unit 25 sends a failure notification over sRIO to the sRIO failure processing unit 15.
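The idle-signal monitoring described above can be sketched as a per-port timeout check. This is a minimal illustration under stated assumptions: the description only says that a failure is detected when the idle signal can no longer be received, so the timeout threshold, the microsecond timestamps, and the names `IdleMonitor`, `idle_received`, and `failed_ports` are all hypothetical.

```python
class IdleMonitor:
    """Toy model of idle-signal monitoring in the sRIO failure processing unit 25."""

    def __init__(self, ports, timeout_us):
        self.timeout_us = timeout_us             # assumed threshold, not from the source
        self.last_seen = {p: 0 for p in ports}   # time of the last idle signal per port

    def idle_received(self, port, now_us):
        """Record that an idle signal arrived on a port at time now_us."""
        self.last_seen[port] = now_us

    def failed_ports(self, now_us):
        """Ports whose idle signal has been missing for longer than the timeout."""
        return sorted(
            port for port, seen in self.last_seen.items()
            if now_us - seen > self.timeout_us
        )


monitor = IdleMonitor(ports=[1, 2, 3], timeout_us=10)
monitor.idle_received(1, now_us=100)
monitor.idle_received(2, now_us=100)
monitor.idle_received(3, now_us=80)   # port 3 then goes silent
silent = monitor.failed_ports(now_us=105)
```

In this run only port 3 exceeds the assumed threshold, so only the path to the adapter on that port would be reported.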

そして、ｓＲＩＯ障害処理部２５は障害の発生を検知すると、コントロールシンボル（ＮＯＰ）を使用してｓＲＩＯ障害発生をホストＡＭＣ１０へ通知する。 When the occurrence of the failure is detected, the sRIO failure processing unit 25 notifies the host AMC 10 of the occurrence of the sRIO failure using a control symbol (NOP).

ホストＡＭＣ１０で受信されたｓＲＩＯ経路障害は、ｓＲＩＯ障害処理部１５を介してＣＰＵ１１に通知される。 The sRIO path failure received by the host AMC 10 is notified to the CPU 11 via the sRIO failure processing unit 15.

ｓＲＩＯ経路障害通知に対する応答メッセージは、ホストＡＭＣ１０のｓＲＩＯ障害処理部１５で生成される。具体的には、コントロールシンボルを生成し、このコントロールシンボル内に応答メッセージがマッピングされる。そして、応答メッセージは、ＴＸＢＦＲ１３とＲＸＢＦＲ２１を介してファブリックスイッチ２０のｓＲＩＯ障害処理部２５へ通知される。 A response message to the sRIO path failure notification is generated by the sRIO failure processing unit 15 of the host AMC 10. Specifically, a control symbol is generated, and a response message is mapped in this control symbol. Then, the response message is notified to the sRIO failure processing unit 25 of the fabric switch 20 via the TXBFR 13 and the RXBFR 21.
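The two notification routes described so far can be summarized side by side. The sketch below only restates the hops named in the description; the dictionary keys `adapter_internal` and `srio_path` and the function name are hypothetical labels chosen for this example.

```python
# Hops as named in the description; the keys are hypothetical labels.
NOTIFICATION_PATHS = {
    "adapter_internal": ["MMC 31", "MCMC 40", "MMC 12", "CPU 11"],
    "srio_path": [
        "sRIO failure processing unit 25",
        "sRIO failure processing unit 15",
        "CPU 11",
    ],
}

# Protocol that carries each kind of notification.
PROTOCOLS = {
    "adapter_internal": "I2C",
    "srio_path": "sRIO control symbol",
}


def describe_notification(kind):
    """Return a one-line summary of how a failure of the given kind reaches the CPU 11."""
    hops = " -> ".join(NOTIFICATION_PATHS[kind])
    return f"{PROTOCOLS[kind]}: {hops}"


i2c_route = describe_notification("adapter_internal")
srio_route = describe_notification("srio_path")
```

The sRIO route stays on the data path and never passes through the MCMC 40.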

ここで、コントロールシンボルへのマッピングについて説明する。 Here, mapping to control symbols will be described.

FIG. 3 shows an example of a control symbol. Control symbols, which handle flow control, retry control, and packet delimiting, travel on the Serial RapidIO path. Depending on its type, a control symbol starts with one of two 10-bit K-characters as its start mark: K28.0 (K-code "SC") or K28.3 (K-code "PD"). FIG. 3 is described for the case where K28.0 (K-code "SC") is used.

The body of the control symbol that follows the start mark K28.0 (K-code "SC") consists of 24 bits. As shown in FIG. 3, the body comprises stype0, param0, param1, stype1, CMD, and CRC. The number in parentheses for each field is its width in bits. The fields used in this embodiment are shaded in FIG. 3.

A control symbol can carry two pieces of status information, stype0 and stype1. stype0 is status information conveyed from the adapter AMC 30 to the host AMC 10. It can convey, for example, a response indicating that a packet has arrived or a response indicating that a packet was not received.

Information conveyed from the host AMC 10 to the adapter AMC 30, on the other hand, is carried in stype1. stype1 can indicate, for example, the start of a packet, the end of a packet, or a retry of packet transmission.

In this embodiment, the param0 and param1 fields of the control symbol carry the information that a failure has occurred and the information identifying the path on which it occurred. Specifically, they carry the port number of the port of the fabric switch 20 that is connected to the adapter AMC 30 over the failed path. By referring to the param0 and param1 fields of the control symbol stored in the sRIO failure processing unit 15, the CPU 11 can determine between which adapter AMC 30 and the fabric switch 20 the failure occurred.

なお、今回ＣＭＤは利用しないため、Ｎｕｌｌ（０００）となっている。また、ＣＲＣはエラーチェックに用いられる。 Since CMD is not used this time, it is Null (000). CRC is used for error checking.
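The mapping into the 24-bit control symbol body can be sketched as simple bit packing. Several points here are assumptions rather than statements from the description: the field widths (3, 5, 5, 3, 3, 5 bits, which sum to the 24 bits mentioned above and follow the commonly documented Serial RapidIO short control symbol layout), the choice to put a failure flag in param0 and the port number in param1, and all function names. CRC calculation is omitted and the CRC field is left at zero.

```python
# (field name, width in bits), most significant field first. Widths are an assumption.
FIELDS = (
    ("stype0", 3), ("param0", 5), ("param1", 5),
    ("stype1", 3), ("cmd", 3), ("crc", 5),
)


def pack_symbol(**values):
    """Pack named field values into a 24-bit integer; unspecified fields are 0."""
    word = 0
    for name, width in FIELDS:
        value = values.get(name, 0)
        if not 0 <= value < (1 << width):
            raise ValueError(f"{name} does not fit in {width} bits")
        word = (word << width) | value
    return word


def unpack_symbol(word):
    """Split a 24-bit integer back into its named fields."""
    fields = {}
    shift = sum(width for _, width in FIELDS)
    for name, width in FIELDS:
        shift -= width
        fields[name] = (word >> shift) & ((1 << width) - 1)
    return fields


def failure_notification(port):
    """Hypothetical encoding: param0 = failure flag, param1 = failed switch port."""
    return pack_symbol(param0=1, param1=port, cmd=0b000)


symbol = failure_notification(port=3)
decoded = unpack_symbol(symbol)
```

With 5 bits available in param1, this assumed encoding could distinguish up to 32 switch ports. A real implementation would also have to fill in stype0, stype1, and the CRC as the protocol requires.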

Next, the operation of this embodiment is described with reference to FIGS. 4 and 5. FIG. 4 shows the flow of data transmitted and received in this embodiment, and FIG. 5 is a flowchart of its basic operation.

今回の説明ではアダプタＡＭＣ３０−１とファブリックスイッチ２０間のｓＲＩＯ経路上で障害が発生した場合を例として説明する。 In this description, a case where a failure occurs on the sRIO path between the adapter AMC 30-1 and the fabric switch 20 will be described as an example.

First, the sRIO failure processing unit 25 in the fabric switch 20 constantly monitors whether the sRIO paths between the fabric switch 20 and the adapter AMCs 30 (adapter AMC 30-1 to adapter AMC 30-n) are able to communicate (step A101 in FIG. 5).

If a failure occurs on an sRIO path (Yes in step A102 in FIG. 5), the sRIO failure processing unit 25 sends an sRIO failure notification (S1 in FIG. 4), which uses the control symbol shown in FIG. 3, to the TXBFR 22 (step A103 in FIG. 5).

The param0 and param1 fields of the control symbol used for this sRIO failure notification carry information identifying the path at the fabric switch 20 on which the failure occurred, together with a failure occurrence flag. In this example, the failure is on the sRIO path between the adapter AMC 30-1 and the fabric switch 20, so the fields carry information identifying that path together with the failure occurrence flag.

コントロールシンボルはｓＲＩＯ上り経路（図４のＳ２）にてホストＡＭＣ１０へ送信される。そして、コントロールシンボルは、ＲＸＢＦＲ１４を経由してｓＲＩＯ障害メッセージ（図４のＳ３）としてｓＲＩＯ障害処理部１５にて受信される（図５のステップＡ１０４）。 The control symbol is transmitted to the host AMC 10 through the sRIO upstream path (S2 in FIG. 4). The control symbol is received by the sRIO failure processing unit 15 via the RXBFR 14 as an sRIO failure message (S3 in FIG. 4) (step A104 in FIG. 5).

ｓＲＩＯ障害処理部１５では、コントロールシンボルのｐａｒａｍ０，ｐａｒａｍ１領域を参照する。そして、ｓＲＩＯ障害処理部１５は、ｓＲＩＯ経路上で障害が発生したことを、経路障害発生通知（図４のＳ７）によってＣＰＵ１１へ報告する（図５のステップＡ１０５）。 The sRIO failure processing unit 15 refers to the param0 and param1 areas of the control symbol. Then, the sRIO failure processing unit 15 reports that a failure has occurred on the sRIO route to the CPU 11 by a route failure occurrence notification (S7 in FIG. 4) (step A105 in FIG. 5).

ＣＰＵ１１は経路障害発生通知（図４のＳ７）の報告を受けて、障害発生経路に関する経路確認情報（図４のＳ８）をｓＲＩＯ障害処理部１５から取得する（図５のステップＡ１０６）。この取得はＣＰＵ１１が、ｓＲＩＯ障害処理部１５のレジスタを参照することにより行われる。今回の例では、アダプタＡＭＣ３０−１とファブリックスイッチ２０間のｓＲＩＯ経路で障害が発生したという情報を取得する。 In response to the report of the path failure occurrence notification (S7 in FIG. 4), the CPU 11 acquires the route confirmation information (S8 in FIG. 4) regarding the failure occurrence path from the sRIO failure processing unit 15 (step A106 in FIG. 5). This acquisition is performed by the CPU 11 referring to the register of the sRIO failure processing unit 15. In this example, information that a failure has occurred in the sRIO path between the adapter AMC 30-1 and the fabric switch 20 is acquired.
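Steps A104 to A106 amount to a latch-and-read pattern: the failure processing unit stores the notification in a register, signals the CPU, and the CPU reads the register back. The sketch below illustrates that pattern only. The class names, the callback mechanism, and the use of a plain port number as the register content are assumptions for this example.

```python
class SrioFailureUnit:
    """Toy model of the sRIO failure processing unit 15 (hypothetical interface)."""

    def __init__(self, notify_cpu):
        self.register = None           # holds the latest failure notification
        self._notify_cpu = notify_cpu  # stands in for the path failure notice (S7)

    def on_failure_notification(self, failed_port):
        """Latch the notification, then tell the CPU that a path failure occurred."""
        self.register = failed_port
        self._notify_cpu()

    def read_register(self):
        """Register read used by the CPU to obtain the path information (S8)."""
        return self.register


class Cpu:
    """Toy model of the CPU 11 reacting to the path failure notice."""

    def __init__(self):
        self.unit = None
        self.failed_ports = []

    def on_path_failure(self):
        # The notice itself carries no detail; the CPU reads the register to learn the path.
        self.failed_ports.append(self.unit.read_register())


cpu = Cpu()
unit = SrioFailureUnit(notify_cpu=cpu.on_path_failure)
cpu.unit = unit
unit.on_failure_notification(failed_port=1)
```

Separating the notice from the register read keeps the notice minimal, with the path details fetched on demand.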

そして、ＣＰＵ１１は、経路障害が発生したアダプタＡＭＣ３０−１へのデータ送信処理を停止する。併せて、ＴＸＢＦＲ１３内のセレクタ部への経路選択指示信号（図４のＳ９）によって経路障害が発生したアダプタＡＭＣ３０用の送信バッファ１３−２からのデータ選択を禁止する制御を行う。今回の例では、アダプタＡＭＣ３０−１用の送信バッファである送信バッファ１３−２−１からのデータ選択を禁止する制御を行う（図５のステップＡ１０７）。 Then, the CPU 11 stops the data transmission process to the adapter AMC 30-1 in which the path failure has occurred. At the same time, control for prohibiting data selection from the transmission buffer 13-2 for the adapter AMC 30 in which a path failure has occurred is performed by a path selection instruction signal (S9 in FIG. 4) to the selector unit in the TXBFR 13. In this example, control for prohibiting data selection from the transmission buffer 13-2-1 which is a transmission buffer for the adapter AMC 30-1 is performed (step A107 in FIG. 5).

In addition, the sRIO failure processing unit 15 sends an sRIO failure notification response (S4 in FIG. 4), carried in a control symbol, to the packet transmission function unit 13-4 of the TXBFR 13 to acknowledge that the CPU 11 has detected the sRIO path failure information (step A108 in FIG. 5).

そして、パケット送信機能部１３−４は、ｓＲＩＯ下り経路（図４のＳ５）を経由してｓＲＩＯ障害通知応答をファブリックスイッチ２０へ送信する（図５のステップＡ１０９）。 Then, the packet transmission function unit 13-4 transmits an sRIO failure notification response to the fabric switch 20 via the sRIO downlink path (S5 in FIG. 4) (step A109 in FIG. 5).

ｓＲＩＯ障害通知応答のコントロールシンボルはＲＸＢＦＲ２１を介してｓＲＩＯ障害処理部２５へ通知され（図４のＳ６）、応答確認がｓＲＩＯ障害処理部２５で受け取られる。これによりｓＲＩＯ障害処理部２５は、ホストＡＭＣ１０がｓＲＩＯ障害通知を受け取ったことが確認できる。そして、ｓＲＩＯ障害処理部２５でのｓＲＩＯ障害通知（図４のＳ１）の送信が終了となる（図５のステップＡ１１０）。 The control symbol of the sRIO failure notification response is notified to the sRIO failure processing unit 25 via the RXBFR 21 (S6 in FIG. 4), and the response confirmation is received by the sRIO failure processing unit 25. Accordingly, the sRIO failure processing unit 25 can confirm that the host AMC 10 has received the sRIO failure notification. Then, the transmission of the sRIO failure notification (S1 in FIG. 4) in the sRIO failure processing unit 25 ends (step A110 in FIG. 5).
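The description states that transmission of the sRIO failure notification ends once the response is confirmed. One way to read this, offered here purely as an assumption, is that the switch repeats the notification until the response arrives. The sketch below illustrates that reading. The function name, the callbacks, and the attempt limit are hypothetical.

```python
def notify_until_acknowledged(send_notification, response_received, max_attempts):
    """Send the notification repeatedly until a response is seen.

    Returns the number of attempts used, or None if no response arrived
    within max_attempts (the limit itself is an assumption of this sketch).
    """
    for attempt in range(1, max_attempts + 1):
        send_notification()
        if response_received():
            return attempt
    return None


sent = []


def send():
    sent.append("S1")  # stands in for the sRIO failure notification (S1)


def acked():
    # Simulated host: the response (S4 to S6) shows up after the third notification.
    return len(sent) >= 3


attempts = notify_until_acknowledged(send, acked, max_attempts=5)
```

If the embodiment instead sends the notification once and simply waits, the same structure applies with `max_attempts=1`.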

続いて、図６を参照してファブリックスイッチ２０とアダプタＡＭＣ３０−１間のｓＲＩＯ経路で障害が発生した際の主要データの流れについて説明する。 Next, the flow of main data when a failure occurs in the sRIO path between the fabric switch 20 and the adapter AMC 30-1 will be described with reference to FIG.

Through the sRIO path failure processing of this embodiment, data transmission to the transmission buffer 13-2-1 for the adapter AMC 30-1 is prohibited in the TXBFR 13 of the host AMC 10, and the selector 13-3 is prohibited from selecting the transmission buffer 13-2-1 for the adapter AMC 30-1. This removes any effect on data transmission to the other, normally operating adapter AMCs 30. If the only aim is to stop sending packets to the fabric switch 20, it is sufficient for the selector 13-3 to stop selecting the transmission buffer 13-2-1 for the adapter AMC 30-1. To cut unnecessary processing, however, it is preferable also to prohibit data transmission to the transmission buffer 13-2-1, as described above. Furthermore, once it has detected the failure, the CPU 11 may stop generating packets addressed to the adapter AMC 30-1 altogether. The transmission buffer control unit 13-1 may also prohibit data transmission to the transmission buffer 13-2-1 in response to an instruction from the CPU 11.
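The two prohibitions described above, one on the write side and one on the selector side, can be sketched as follows. This is an illustrative model, not the embodiment's implementation: the class `HostTxBuffers`, its methods, and the round-robin selection policy are assumptions, since the description does not say how the selector 13-3 chooses among buffers.

```python
from collections import deque


class HostTxBuffers:
    """Toy model of the transmission buffers 13-2 and selector 13-3 (hypothetical names)."""

    def __init__(self, adapters):
        self.buffers = {a: deque() for a in adapters}
        self.blocked = set()
        self._order = deque(adapters)  # round-robin order (assumed policy)

    def block(self, adapter):
        """Mark an adapter's path as failed."""
        self.blocked.add(adapter)

    def write(self, adapter, data):
        """Write-side gate: refuse data addressed to a blocked adapter."""
        if adapter in self.blocked:
            return False
        self.buffers[adapter].append(data)
        return True

    def select_next(self):
        """Selector-side gate: skip blocked or empty buffers; None if nothing is sendable."""
        for _ in range(len(self._order)):
            adapter = self._order[0]
            self._order.rotate(-1)
            if adapter not in self.blocked and self.buffers[adapter]:
                return adapter, self.buffers[adapter].popleft()
        return None


tx = HostTxBuffers(["30-1", "30-2", "30-3"])
tx.write("30-1", "a1")
tx.write("30-2", "b1")
tx.block("30-1")                    # failure on the path to adapter AMC 30-1
accepted = tx.write("30-1", "a2")   # rejected by the write-side gate
first = tx.select_next()            # data for a healthy adapter still flows
second = tx.select_next()           # nothing else is eligible
```

The selector gate alone is enough to keep stale data for the failed adapter off the link. The write gate additionally stops that buffer from filling.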

In FIG. 6, the MMC 12, the RXBFR 14, and the sRIO failure processing unit 15, which are not directly involved in the transmission of main data, are omitted.

(Annotation in FIG. 6: "Data transmission to the transmission buffer 13-2-1 for the adapter AMC 30-1 is prohibited.")

Next, the effect of this embodiment is described concretely with reference to FIG. 7. In this embodiment, notification by control symbol greatly shortens the time until the host AMC 10 detects an sRIO path failure notification, from several tens of milliseconds with conventional I2C to several microseconds.
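To give a feel for the scale of this reduction, the ratio below uses illustrative values that are assumptions rather than measurements from the description: 20 ms stands in for "several tens of milliseconds" and 2 μs for "several microseconds".

```latex
\[
\frac{t_{\mathrm{I2C}}}{t_{\mathrm{sRIO}}}
= \frac{20\ \mathrm{ms}}{2\ \mu\mathrm{s}}
= \frac{20 \times 10^{-3}\ \mathrm{s}}{2 \times 10^{-6}\ \mathrm{s}}
= 10^{4}
\]
```

Under these assumed values, detection would be about four orders of magnitude faster.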

As described above, when an sRIO path failure occurs in a MicroTCA network system, this embodiment allows the host AMC to detect the failure early and achieves stable sRIO high-speed serial communication that is unaffected by the sRIO path failure.

なお、上記のホストＡＭＣ、ファブリックスイッチ、アダプタＡＭＣ及びＭＣＭＣは、ハードウェア、ソフトウェア又はこれらの組合わせにより実現することができる。また、上記のホストＡＭＣ、ファブリックスイッチ、アダプタＡＭＣ及びＭＣＭＣの何れか又は組合せにより行なわれるネットワーク障害対処方法も、ハードウェア、ソフトウェア又はこれらに組合わせにより実現することができる。ここで、ソフトウェアによって実現されるとは、コンピュータがプログラムを読み込んで実行することにより実現されることを意味する。 The host AMC, fabric switch, adapter AMC, and MCMC described above can be realized by hardware, software, or a combination thereof. In addition, a network failure handling method performed by any one or combination of the host AMC, fabric switch, adapter AMC, and MCMC can be realized by hardware, software, or a combination thereof. Here, “realized by software” means realized by a computer reading and executing a program.

The program can be stored on various types of non-transitory computer readable media and supplied to a computer. Non-transitory computer readable media include various types of tangible storage media. Examples of non-transitory computer readable media include magnetic recording media (for example, flexible disks, magnetic tapes, and hard disk drives), magneto-optical recording media (for example, magneto-optical disks), CD-ROM (Read Only Memory), CD-R, CD-R/W, and semiconductor memory (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, and RAM (random access memory)). The program may also be supplied to a computer through various types of transitory computer readable media. Examples of transitory computer readable media include electrical signals, optical signals, and electromagnetic waves. A transitory computer readable medium can supply the program to a computer over a wired communication path, such as an electric wire or optical fiber, or over a wireless communication path.

上記の実施形態の一部又は全部は、以下の付記のようにも記載されうるが、以下には限られない。 A part or all of the above-described embodiment can be described as in the following supplementary notes, but is not limited thereto.

(Supplementary note 1) A network in which a plurality of adapters are connected to a host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, wherein
the host comprises:
a plurality of transmission buffers respectively corresponding to the plurality of adapters;
means for outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and
means for stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of that data from the buffer.

(Supplementary note 2) The network according to supplementary note 1, further comprising means for notifying, when a failure relating to one of the adapters is detected, the host of the failure from the switch using the protocol used for data communication between them.

(Supplementary note 3) The network according to supplementary note 1 or 2, wherein
the switch exchanges, with each of the plurality of adapters, a signal for confirming the communication state; detects, when the signal cannot be received from a certain adapter, that a failure has occurred on the path to that adapter; and notifies the host of the failure by inserting information identifying that adapter into a control symbol conforming to the protocol used for data communication between the switch and the host and transmitting the control symbol to the host.

(Supplementary note 4) The network according to any one of supplementary notes 1 to 3, wherein
the switch
comprises a reception buffer and a plurality of transmission buffers respectively corresponding to the plurality of adapters, and
performs the demultiplexing by first storing data received from the host in the reception buffer and then distributing the data addressed to each adapter to the transmission buffer corresponding to that adapter.

(Supplementary note 5) The network according to any one of supplementary notes 1 to 4, wherein
the network
conforms to a standard under which, when a failure occurs on the path between the switch and an adapter, the adapter detects the failure and notifies the host of the failure using a protocol different from the protocol used for data communication between the switch and the host.

(Supplementary note 6) A host in a network in which a plurality of adapters are connected to the host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, the host comprising:
a plurality of transmission buffers respectively corresponding to the plurality of adapters;
means for outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and
means for stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of that data from the buffer.

(Supplementary note 7) A network failure handling method for a network in which a plurality of adapters are connected to a host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, the method comprising,
performed by the host:
a step of buffering data in a plurality of transmission buffers respectively corresponding to the plurality of adapters;
a step of outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and
a step of stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of that data from the buffer.

The present invention is suitable for infrastructure equipment, such as a mobile phone base station apparatus, that cannot tolerate delay even when a failure occurs on a network path.

10, 60 Host AMC
11, 61 CPU
12, 62 MMC
13, 63 TXBFR
13-1 Transmission buffer control unit
13-2-1, 13-2-2, 13-2-n Transmission buffer
13-3 Selector
13-4 Packet transmission function unit
14, 64 RXBFR
14-1 Packet reception function unit
14-2 Reception buffer
14-3 Reception buffer control unit
15, 25 sRIO failure processing unit
20, 70 Fabric switch
21, 71 RXBFR
22, 72 TXBFR
23-1, 23-2, 23-n, 73-1, 73-2, 73-n TXBFR
24-1, 24-2, 24-n, 74-1, 74-2, 74-n RXBFR
30-1, 30-2, 30-n, 70-1, 70-2, 70-n Adapter AMC
31-1, 31-2, 31-n, 71-1, 71-2, 71-n MMC
32-1, 32-2, 32-n, 72-1, 72-2, 72-n RXBFR
33-1, 33-2, 33-n, 73-1, 73-2, 73-n TXBFR
40, 90 MCMC

Claims

A network in which a plurality of adapters are connected to a host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, wherein
the host comprises:
a plurality of transmission buffers respectively corresponding to the plurality of adapters;
means for outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and
means for stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of that data from the buffer,
and wherein
the protocol used for notification differs between notifying the host of a failure that has occurred in relation to an adapter and notifying the host of a failure that has occurred on the path between the switch and the adapter.

The network according to claim 1,
further comprising means for notifying, when a failure is detected on the path between the switch and one of the adapters, the host of the failure from the switch using the protocol used for data communication between them.

The network according to claim 1 or 2, wherein
the switch exchanges, with each of the plurality of adapters, a signal for confirming the communication state; detects, when the signal cannot be received from a certain adapter, that a failure has occurred on the path to that adapter; and notifies the host of the failure by inserting information identifying that adapter into a control symbol conforming to the protocol used for data communication between the switch and the host and transmitting the control symbol to the host.

The network according to any one of claims 1 to 3, wherein
the switch
comprises a reception buffer and a plurality of transmission buffers respectively corresponding to the plurality of adapters, and
performs the demultiplexing by first storing data received from the host in the reception buffer and then distributing the data addressed to each adapter to the transmission buffer corresponding to that adapter.

The network according to any one of claims 1 to 4, wherein
the network
conforms to a standard under which, when a failure occurs on the path between the switch and an adapter, the adapter detects the failure and notifies the host of the failure using a protocol different from the protocol used for data communication between the switch and the host, but the adapter notifies the host of a failure only when a failure relating to the adapter itself occurs.

A host in a network in which a plurality of adapters are connected to the host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, the host comprising:
a plurality of transmission buffers respectively corresponding to the plurality of adapters;
means for outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and
means for stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of that data from the buffer,
wherein
the protocol used for notification differs between notifying the host of a failure that has occurred in relation to an adapter and notifying the host of a failure that has occurred on the path between the switch and the adapter.

A network failure handling method for a network in which a plurality of adapters are connected to a host via a switch, and data output from the host is demultiplexed by the switch and transmitted to the plurality of adapters, the method comprising,
performed by the host:
a step of buffering data in a plurality of transmission buffers respectively corresponding to the plurality of adapters;
a step of outputting data after passing it through the transmission buffer corresponding to the adapter that is its destination; and
a step of stopping the writing of data addressed to an adapter affected by a failure into the buffer corresponding to that adapter, or the reading of that data from the buffer,
wherein
the protocol used for notification differs between notifying the host of a failure that has occurred in relation to an adapter and notifying the host of a failure that has occurred on the path between the switch and the adapter.