CN117201276A

CN117201276A - Fault processing method, device, gateway, system and storage medium

Info

Publication number: CN117201276A
Application number: CN202311303100.7A
Authority: CN
Inventors: 王岗; 宋卿; 王鹏
Original assignee: Dawning Information Industry Beijing Co Ltd
Current assignee: Dawning Information Industry Beijing Co Ltd
Priority date: 2023-10-09
Filing date: 2023-10-09
Publication date: 2023-12-08

Abstract

The application relates to a fault processing method, a fault processing device, a gateway, a system and a storage medium. The method comprises the following steps: firstly, periodically acquiring the network port state of each border gateway in a border gateway cluster, then detecting whether each border gateway has a fault according to the change condition between the network port states of each border gateway adjacent to each border gateway twice, obtaining a detection result, and finally determining the target state which can normally work after each border gateway is subjected to fault recovery according to the detection result. The method can realize the automatic fault recovery of each boundary gateway of the fault and avoid network interruption.

Description

Fault processing method, device, gateway, system and storage medium

Technical Field

The present application relates to the field of communications technologies, and in particular, to a fault processing method, device, gateway, system, and storage medium.

Background

With the rapid development of virtualization and cloud computing technologies, the traditional network technologies and architectures have become increasingly unable to meet the requirements of rapid configuration, on-demand calling and automatic load balancing. Thus, the software-defined network SDN (Software Defined Networking) becomes an important part of the software-defined data center. The SDN with the Tungsten Fabric as an open source can provide single-point control, visibility and management services of network and security for deployment of different types of data centers or multi-cloud environments. The device management component in tunesten Fabric implements the function of configuration management on physical routers, but the device management component only supports physical routers of specific series products at present and cannot adapt to other types of physical routers.

In order to meet the requirements of physical router diversity, high performance, scalability, high availability, etc., an SDN network integration solution may be implemented by establishing a border gateway device cluster in combination with tunesten Fabric. However, the performance of the present border gateway is not high, and network disruption may be caused when the border gateway fails.

Disclosure of Invention

In view of the foregoing, it is desirable to provide a fault handling method, device, gateway, system, and storage medium capable of automatically performing fault recovery.

In a first aspect, the present application provides a fault handling method, the method comprising:

periodically acquiring the network port state of each border gateway in the border gateway cluster;

detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain a detection result;

determining the target state of each border gateway according to the detection result; the target state is a state that the border gateway can work normally after fault recovery.

In one embodiment, detecting whether each border gateway has a fault according to a change condition between two adjacent network port states of each border gateway, to obtain a detection result includes:

if the change condition is that the network port states of all the border gateways are not changed between two adjacent network port states, acquiring the number of the network port states of all the border gateways as a first state;

and detecting whether each border gateway has faults according to the number, and obtaining a detection result.

In the embodiment, by acquiring the network port states of two adjacent times, whether the network port states change or not is judged, so that whether faults exist in the Bond mode or not is determined, and automatic real-time monitoring of the faults is realized.

In one embodiment, detecting whether each border gateway has a fault according to the number, to obtain a detection result includes:

if the number reaches a preset threshold, determining that the detection result is that each border gateway has faults;

if the number does not reach the preset threshold, determining that the detection result is that at least one border gateway in the border gateways has faults.

if the change condition is that the network port states of the adjacent two times of the border gateway are changed, determining the state change trend of each border gateway according to the change condition of the network port states of the adjacent two times of the border gateway;

and detecting whether each border gateway has faults according to the state change trend of each border gateway, and obtaining a detection result.

In this embodiment, the detection result of the network port state is determined according to the network port state change trend, which is more accurate than the method of determining the detection result only through the current state.

In one embodiment, determining the target state of each border gateway according to the detection result includes:

and if the state change trend of the border gateway is changed from the first state to the second state, determining that the target state of the border gateway is the second state.

In this embodiment, according to the state change trend of the border gateway, when the state is changed from the first state to the second state, the management state in the network port state of the border gateway is set as the target state, so that the network port fault of the border gateway can be automatically recovered, and the high availability of the border gateway is ensured.

In one embodiment, each border gateway in the border gateway cluster has a cluster internet protocol IP and a bottom layer bearer network interconnection IP;

the cluster IP of each border gateway is the same, and the interconnection IP of the bottom layer bearing network of each border gateway is different.

In this embodiment, each border gateway in the border gateway cluster has the same cluster IP and different interconnection IPs, so that the border gateway cluster can be packaged as a whole, then traffic sharing is performed inside the border gateway cluster, and the border gateway is increased or decreased inside the border gateway cluster, so that transmission of external data is not affected, and high availability and scalability characteristics of the border gateway cluster are ensured.

In a second aspect, the present application also provides a fault handling apparatus, including:

the acquisition module is used for periodically acquiring the network port state of each border gateway in the border gateway cluster;

the detection module is used for detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain a detection result;

the determining module is used for determining the target state of each border gateway according to the detection result; the target state is a state that the border gateway can work normally after fault recovery.

In a third aspect, the present application further provides a border gateway, including a memory and a processor, where the memory stores a computer program, and the processor implements the fault handling method according to any one of the first aspects when executing the computer program.

In a fourth aspect, the present application further provides a border gateway system, where the border gateway system includes a control node, a computing node, an intranet router, and a plurality of border gateways;

the border gateway is configured to perform the fault handling method according to any one of the first aspect;

the control node is in communication connection with the intranet router, and is used for receiving the route information of a plurality of border gateways and determining a target border gateway from the border gateways according to the route information;

and the computing node is in communication connection with the intranet router and is used for establishing a network virtualization technology VXLAN tunnel with the target border gateway.

In a fifth aspect, the present application also provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the fault handling method of any of the first aspects described above.

In a sixth aspect, the application also provides a computer program product comprising a computer program which, when executed by a processor, implements the fault handling method of any of the first aspects described above.

According to the fault processing method, the device, the gateway, the system and the storage medium, firstly, the network port state of each border gateway in the border gateway cluster is periodically obtained, then whether each border gateway has a fault or not is detected according to the change condition between the network port states of two adjacent border gateways, a detection result is obtained, and finally, the target state which can normally work after the border gateways are subjected to fault recovery is determined according to the detection result. By means of the method, the change conditions of the network port states of the border gateways acquired in two adjacent periods are compared, the detection result of each border gateway can be determined according to the change conditions, the target state is set according to the detection result, and each border gateway can work normally after the setting is successful, so that the automatic fault recovery of each border gateway with faults is achieved, and network interruption is avoided.

Drawings

In order to more clearly illustrate the embodiments of the present application or the technical solutions in the related art, the drawings that are required to be used in the embodiments or the related technical descriptions will be briefly described, and it is apparent that the drawings in the following description are only some embodiments of the present application, and other drawings may be obtained according to the drawings without inventive effort for those skilled in the art.

FIG. 1 is an application environment diagram of a fault handling method in one embodiment;

FIG. 2 is a schematic diagram of one embodiment of a possible failure of a border gateway cluster architecture;

FIG. 3 is a second schematic diagram of a possible failure of the border gateway cluster architecture in one embodiment;

FIG. 4 is a third exemplary diagram illustrating a possible failure of the border gateway cluster architecture in one embodiment;

FIG. 5 is a flow diagram of a fault handling method in one embodiment;

FIG. 6 is a flow chart of a fault handling method in another embodiment;

FIG. 7 is a flow chart of a fault handling method in another embodiment;

FIG. 8 is a flow chart of a fault handling method in another embodiment;

FIG. 9 is a flow chart of a fault handling method in another embodiment;

FIG. 10 is a block diagram of a fault handling apparatus in one embodiment;

fig. 11 is an internal structural diagram of a border gateway in one embodiment.

Detailed Description

The present application will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present application more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.

The fault processing method provided by the embodiment of the application can be applied to an application environment shown in figure 1. The TF control node 10 is communicatively connected to the intranet router 30, the TF computing node 20 is communicatively connected to the intranet router 30, the intranet router 30 is communicatively connected to the border gateway cluster 40, the border gateway cluster 40 is communicatively connected to the external network router 50 and the switch 60, the terminal can access the virtual machine on the TF computing node 20 through the switch 60, and the virtual machine on the TF computing node 20 can also access the external network through the external network router.

In the control plane, TF control node 10 is responsible for implementing control plane BGP (Border Gateway Protocol ) EVPN (Ethernet Virtual Private Network, next generation virtual private network) routing information advertisement with each border gateway in border gateway group 40.

Each border gateway in border gateway cluster 40 configures a different underway (underlying bearer network) interconnect IP, but configures the same loopback interface IP, i.e., cluster IP as VTEP (VXLAN Tunnel Endpoint ) IP.

Each border gateway generates a Type3 route according to VTEP IP and sends it to TF control node 10. The TF control node 10 performs route preference according to the Type3 route sent by the different border gateways, and selects a target border gateway for creating the VXLAN tunnel.

In the data plane, TF calculation node 20 creates a data plane VXLAN tunnel with the target border gateway selected by TF control node 10 described above.

An intranet router 30 for generating ECMP (Equal-cost multi-path routing) routes to border gateway clusters IP, there are multiple internetworks IP to different border gateways. The external network router 50 is configured to generate ECMP routes to the border gateway cluster external network IP, and to carry traffic of the external network accessed by the virtual machines on the TF computing nodes 20. A switch 60 for two-layer access of external terminals and virtual machines in TF computing node 20.

After TF computing node 20 establishes a VXLAN tunnel with border gateway cluster 40, data transfer may take place. When a virtual machine on the TF computing node 20 sends a message, the message is encapsulated into a VXLAN message on the TF computing node 20, the source IP of the VXLAN message is the IP of the TF computing node, the destination IP is the border gateway cluster IP, the VXLAN message arrives in the intranet router, the destination IP of the VXLAN message is used to find a routing table, match the ECMP route and perform traffic sharing, send the message to any border gateway in the border gateway cluster 40, then perform VXLAN message decapsulation, and send the message to the extranet router or switch according to the destination address of the decapsulated message.

When the TF computing node 20 receives the message, in the process of sending the message, the switch 60 learns the MAC (Media Access Control Address ) of the virtual machine, so that the backhaul traffic sent from the terminal to the virtual machine will find the port where the MAC of the virtual machine is located in the L2 forwarding table in the switch 60 to forward, and the message arrives at the corresponding border gateway, is encapsulated into a VXLAN message, and then is sent to the virtual machine of the TF computing node.

In the above network architecture, the border gateway cluster 40 may have a failure, as shown in fig. 2, such as an intranet port failure, when an intranet port of one border gateway 01 is down, that is, closed, a BGP Peer failure on the border gateway is triggered, and the TF computing node 20 performs priority again.

As shown in fig. 3, after the external network port fails, for example, after the external network port of the border gateway 01 is down, the TF computing node 20 cannot sense the failure to perform re-optimization, so that when the virtual machine on the TF computing node accesses the external network, a message may be sent to the border gateway 01, resulting in a flow interruption.

As shown in fig. 4, the failure of the border gateway 01 causes the internal and external ports to simultaneously down, in which case the internal network router, the external network router and the switch can all re-converge to select, and the transmitted message can be directly switched to other border gateways in the border gateway cluster 40, so that the forwarding of the traffic is not affected.

At present, when the border gateway cluster has the faults, the system cannot automatically detect and process the faults, which can cause network interruption and affect network performance.

Based on the above, the application provides a fault processing method, which is applied to the network architecture, compares the change condition of the network port states of the border gateways acquired in two adjacent periods, can determine the detection result of each border gateway according to the change condition, and sets a target state according to the detection result, so that each border gateway can work normally after the setting is successful, thereby realizing the automatic fault recovery of each border gateway of the fault and avoiding the network interruption.

In an exemplary embodiment, as shown in fig. 5, a fault handling method is provided, and an example of application of the method to the border gateway cluster in fig. 1 is described, which includes the following steps S502 to S506.

Wherein:

s502, periodically acquiring the network port state of each border gateway in the border gateway cluster.

The network port state of each border gateway may include a management state of the network port and a connection state of the network port, and optionally, the current network port state of each border gateway may be obtained through a network port query command. In order to detect the working state of each border gateway, the network port state of each border gateway is periodically acquired, for example, the network port state is acquired once every 5 seconds, so that the dynamic monitoring of the network port of each border gateway is realized. Optionally, in the Bond mode, the ports of the border gateways form a Bond port, and the port state of each border gateway is obtained, that is, the port state of the Bond port, that is, the member port of the border gateway is obtained.

S504, detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain detection results.

After the network port states of the border gateways are acquired at two times of intervals of one period, judging according to the change condition of the network port states of the adjacent two times, wherein the change condition may be that the network port states of the border gateways acquired at two times are unchanged or may be changed, whether each border gateway has a fault or not is determined according to the actual change condition, and optionally, if the connection state of the network port states of the border gateways is a normal connection state in the first period, and if the connection state of the network port states of the border gateways is an disconnection state in the second period, the fault of the border gateway can be determined, and optionally, the detection result can be further determined according to the number of the border gateways with faults in the border gateway cluster.

S506, determining the target state of each border gateway according to the detection result.

The target state is a state that the border gateway can normally work after fault recovery. For example, the management state and the connection interface are both up, i.e. are both in a target state when opened, so that after the hardware fault is repaired, the border gateway can automatically recover normal operation after automatically detecting and setting the management state and the connection interface to be in the target state in terms of software, and the fault is automatically repaired.

In the above embodiment, firstly, the network port states of the border gateways in the border gateway cluster are periodically obtained, then, according to the change condition between the network port states of the border gateways adjacent to each other twice, whether each border gateway has a fault is detected, a detection result is obtained, and finally, the target state which can normally work after the fault recovery of each border gateway is determined according to the detection result. By means of the method, the change conditions of the network port states of the border gateways acquired in two adjacent periods are compared, the detection result of each border gateway can be determined according to the change conditions, the target state is set according to the detection result, and each border gateway can work normally after the setting is successful, so that the automatic fault recovery of each border gateway with faults is achieved, and network interruption is avoided.

In one embodiment, the step of obtaining the detection result in step S504 includes, as shown in fig. 6:

s602, if the change condition is that the network port states of all border gateways are not changed between two adjacent network port states, obtaining the number of the network port states of all border gateways as the first state.

The first state of the border gateway is a closed state, that is, a down state, of the connection state in the portal state of the border gateway. The second state is a state in which the connection state in the portal state of the border gateway is open, i.e., up state. When the network port state of each border gateway in the border gateway cluster acquired in the first period is the same as the network port state of each border gateway in the border gateway cluster acquired in the second period, that is, the network port state of each border gateway is recorded as the down number.

S604, detecting whether each border gateway has faults according to the number, and obtaining a detection result.

Judging according to the number of the network port states of the border gateways as down and a preset threshold, wherein the preset threshold is a fault threshold, and setting according to requirements, and determining whether the condition of the Bond port down is met according to a judging result.

Wherein, detecting whether each border gateway has a fault according to the number, the step of obtaining a detection result is shown in fig. 7, and includes:

s702, if the number reaches a preset threshold, determining that the detection result is that each border gateway has a fault.

If the number of the network port states is equal to or greater than the preset threshold, determining that the detection result meets the condition of the Bond port down, namely that each border gateway may have faults at the moment, and waiting for the next period to continue to acquire the network port states.

And S704, if the number does not reach the preset threshold, determining that the detection result is that at least one border gateway in the border gateways has faults.

If the network port state is that the number of down is smaller than the preset threshold, determining that the detection result is that at least one border gateway in all border gateways has faults. Optionally, in this case, the Bond port is closed first, and fault recovery is performed, that is, the connection states of all Bond member ports are set to down.

In one embodiment, there is another case in the step 504, as shown in fig. 8, including:

s802, if the change condition is that the network port states of two adjacent boundary gateways are changed, determining the state change trend of each boundary gateway according to the change condition of the network port states of two adjacent boundary gateways.

If the connection state in the network port states acquired in two adjacent periods changes, the connection success of reestablishing partial network ports is indicated. The state of the portal may change from down to up or up to down, and thus the state change trend may be down_to_up or up_to_down.

S804, detecting whether each border gateway has a fault according to the state change trend of each border gateway, and obtaining a detection result.

When the state change trend of the network port state is down_to_up, the detection result can be determined as the current normal opening, and no fault exists. The state change trend is up_to_down, and it can be determined that the detection result is that the state is changed from the on state to the off state, and a fault may exist.

In one embodiment, the determining the target state of each border gateway according to the detection result includes:

If the state change trend of the border gateway is changed from the first state to the second state, that is, the connection state in the network port state of the border gateway is changed from down to up, it is indicated that the network port connection state of the border gateway is recovered to be normal, and at the moment, the management state in the network port state of the border gateway is set to up, that is, the border gateway can work normally.

If the state change trend of the border gateway is changed from the second state to the first state, that is, the connection state in the network port state of the border gateway is changed from up to down, it is indicated that the network port connection state of the border gateway is changed from the open state to the closed state, and a fault may occur.

In one embodiment, as shown in FIG. 1 above, each border gateway in the border gateway cluster has a cluster Internet protocol IP and an underlying bearer network interconnect IP; the cluster IP of each border gateway is the same, and the interconnection IP of the bottom layer bearing network of each border gateway is different.

Optionally, the trunking IPs of the border gateways in the border gateway trunking are the same, and the interconnection IPs of the underlying bearer network are different, so that when a message is to be sent to the border gateway trunking, the same trunking IPs can be used, and an intranet router performs forwarding by optimizing one of the border gateways.

In the embodiment of the present application, for convenience of understanding of the reader, the flow of the fault handling method provided in the embodiment of the present application is illustrated, and specific software flow is shown in fig. 9.

(1) And starting the border gateway service.

(2) And acquiring link states (connection states) and admin states (management states) of Bond and member ports thereof, and recording network port state information acquired for the first time. After waiting for 5 seconds, continuously acquiring the network port state information, comparing the network port state information with the network port state information acquired last time, if the network port state information is the same, executing the step (3), and if the network port state information is not the same, executing the step (5).

(3) And (3) calculating the number of link down states of the Bond member ports, judging whether the condition of the Bond port down is met, if all the Bond member ports are in the down or up state, waiting for 5 seconds, acquiring network port state information again, and otherwise, executing the step (4).

(4) Setting the link states of all member ports except the link down as down, and continuing to acquire network port state information after waiting for 5 seconds.

(5) If the link state is changed after the two network port state information are compared, judging whether the state change trend is down_to_up or up_to_down, and if the state change trend is down_to_up, setting the admin state of bond and member ports thereof as up. If the address is up_to_down, the admin state of all bond member ports except the network port is set as down. Wait 5 seconds and re-enter the loop.

It should be understood that, although the steps in the flowcharts related to the embodiments described above are sequentially shown as indicated by arrows, these steps are not necessarily sequentially performed in the order indicated by the arrows. The steps are not strictly limited to the order of execution unless explicitly recited herein, and the steps may be executed in other orders. Moreover, at least some of the steps in the flowcharts described in the above embodiments may include a plurality of steps or a plurality of stages, which are not necessarily performed at the same time, but may be performed at different times, and the order of the steps or stages is not necessarily performed sequentially, but may be performed alternately or alternately with at least some of the other steps or stages.

Based on the same inventive concept, the embodiment of the application also provides a fault processing device for realizing the fault processing method. The implementation of the solution provided by the device is similar to the implementation described in the above method, so the specific limitation of one or more embodiments of the fault handling device provided below may refer to the limitation of the fault handling method described above, and will not be repeated here.

In an exemplary embodiment, as shown in fig. 10, there is provided a fault handling apparatus including: the device comprises an acquisition module, a detection module and a determination module, wherein:

the acquisition module is used for periodically acquiring the network port state of each border gateway in the border gateway cluster.

The detection module is used for detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain detection results.

In one embodiment, the detection module comprises:

the quantity obtaining unit is used for obtaining the quantity that the network port states of all the border gateways are the first state if the change condition is that the network port states of all the border gateways are not changed between two adjacent network port states.

And the first fault detection unit is used for detecting whether each border gateway has faults according to the number and obtaining a detection result.

In one embodiment, the fault detection unit is specifically configured to determine that each border gateway has a fault as a detection result if the number reaches a preset threshold; if the number does not reach the preset threshold, determining that the detection result is that at least one border gateway in the border gateways has faults.

In one embodiment, the detection module further comprises:

the change trend determining unit is used for determining the state change trend of each border gateway according to the change condition of the network port states of the border gateway adjacent two times if the change condition is that the network port states of the border gateway adjacent two times change;

and the second fault detection unit is used for detecting whether each border gateway has faults according to the state change trend of each border gateway and obtaining a detection result.

In one embodiment, the determining module is specifically configured to determine that the target state of the border gateway is the second state if the state change trend of the border gateway is from the first state to the second state.

In one embodiment, each border gateway in the border gateway cluster has a cluster internet protocol IP and a bottom layer bearer network interconnection IP; the cluster IP of each border gateway is the same, and the interconnection IP of the bottom layer bearing network of each border gateway is different.

Each of the modules in the fault handling apparatus described above may be implemented in whole or in part by software, hardware, and combinations thereof. The above modules may be embedded in hardware or may be independent of a processor in the computer device, or may be stored in software in a memory in the computer device, so that the processor may call and execute operations corresponding to the above modules.

In one exemplary embodiment, a border gateway is provided, the internal structure of which may be as shown in fig. 11. The computer device includes a processor, a memory, an Input/Output interface (I/O) and a communication interface. The processor, the memory and the input/output interface are connected through a system bus, and the communication interface is connected to the system bus through the input/output interface. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer programs, and a database. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The database of the computer device is used for storing network port state data. The input/output interface of the computer device is used to exchange information between the processor and the external device. The communication interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a fault handling method.

It will be appreciated by those skilled in the art that the structure shown in FIG. 11 is merely a block diagram of some of the structures associated with the present inventive arrangements and is not limiting of the computer device to which the present inventive arrangements may be applied, and that a particular computer device may include more or fewer components than shown, or may combine some of the components, or have a different arrangement of components.

In one exemplary embodiment, a computer device is provided comprising a memory and a processor, the memory having stored therein a computer program, the processor when executing the computer program performing the steps of: periodically acquiring the network port state of each border gateway in the border gateway cluster; detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain a detection result; determining the target state of each border gateway according to the detection result; the target state is a state that the border gateway can work normally after fault recovery.

In one embodiment, the processor when executing the computer program further performs the steps of: if the change condition is that the network port states of all the border gateways are not changed between two adjacent network port states, acquiring the number of the network port states of all the border gateways as a first state; and detecting whether each border gateway has faults according to the number, and obtaining a detection result.

In one embodiment, the processor when executing the computer program further performs the steps of: if the number reaches a preset threshold, determining that the detection result is that each border gateway has faults; if the number does not reach the preset threshold, determining that the detection result is that at least one border gateway in the border gateways has faults.

In one embodiment, the processor when executing the computer program further performs the steps of: if the change condition is that the network port states of the adjacent two times of the border gateway are changed, determining the state change trend of each border gateway according to the change condition of the network port states of the adjacent two times of the border gateway; and detecting whether each border gateway has faults according to the state change trend of each border gateway, and obtaining a detection result.

In one embodiment, the processor when executing the computer program further performs the steps of: and if the state change trend of the border gateway is changed from the first state to the second state, determining that the target state of the border gateway is the second state.

In one embodiment, referring to fig. 1, a border gateway system is further provided, where the border gateway system includes a control node, a computing node, an intranet router, and a plurality of border gateways;

the border gateway is used for executing the fault processing method in the method embodiment.

The control node, which is TF control node 10 in fig. 1, is communicatively connected to the intranet router, and is configured to receive routing information of a plurality of border gateways, and determine a target border gateway from the plurality of border gateways according to the routing information.

The computing node, which is TF computing node 20 in fig. 1, is communicatively connected to the intranet router, and is configured to establish a VXLAN tunnel with the target border gateway, and perform data transmission with an external network or a terminal through the VXLAN tunnel.

In one embodiment, a computer readable storage medium is provided having a computer program stored thereon, which when executed by a processor, performs the steps of: periodically acquiring the network port state of each border gateway in the border gateway cluster; detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain a detection result; determining the target state of each border gateway according to the detection result; the target state is a state that the border gateway can work normally after fault recovery.

In one embodiment, the computer program when executed by the processor further performs the steps of: if the change condition is that the network port states of all the border gateways are not changed between two adjacent network port states, acquiring the number of the network port states of all the border gateways as a first state; and detecting whether each border gateway has faults according to the number, and obtaining a detection result.

In one embodiment, the computer program when executed by the processor further performs the steps of: if the number reaches a preset threshold, determining that the detection result is that each border gateway has faults; if the number does not reach the preset threshold, determining that the detection result is that at least one border gateway in the border gateways has faults.

In one embodiment, the computer program when executed by the processor further performs the steps of: if the change condition is that the network port states of the adjacent two times of the border gateway are changed, determining the state change trend of each border gateway according to the change condition of the network port states of the adjacent two times of the border gateway; and detecting whether each border gateway has faults according to the state change trend of each border gateway, and obtaining a detection result.

In one embodiment, the computer program when executed by the processor further performs the steps of: and if the state change trend of the border gateway is changed from the first state to the second state, determining that the target state of the border gateway is the second state.

In one embodiment, a computer program product is provided comprising a computer program which, when executed by a processor, performs the steps of: periodically acquiring the network port state of each border gateway in the border gateway cluster; detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain a detection result; determining the target state of each border gateway according to the detection result; the target state is a state that the border gateway can work normally after fault recovery.

It should be noted that, the user information (including but not limited to user equipment information, user personal information, etc.) and the data (including but not limited to data for analysis, stored data, presented data, etc.) related to the present application are both information and data authorized by the user or sufficiently authorized by each party, and the collection, use and processing of the related data are required to meet the related regulations.

Those skilled in the art will appreciate that implementing all or part of the above described methods may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed, may comprise the steps of the embodiments of the methods described above. Any reference to memory, database, or other medium used in embodiments provided herein may include at least one of non-volatile and volatile memory. The nonvolatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical Memory, high density embedded nonvolatile Memory, resistive random access Memory (ReRAM), magnetic random access Memory (Magnetoresistive Random Access Memory, MRAM), ferroelectric Memory (Ferroelectric Random Access Memory, FRAM), phase change Memory (Phase Change Memory, PCM), graphene Memory, and the like. Volatile memory can include random access memory (Random Access Memory, RAM) or external cache memory, and the like. By way of illustration, and not limitation, RAM can be in the form of a variety of forms, such as static random access memory (Static Random Access Memory, SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM), and the like. The databases referred to in the embodiments provided herein may include at least one of a relational database and a non-relational database. The non-relational database may include, but is not limited to, a blockchain-based distributed database, and the like. The processor referred to in the embodiments provided in the present application may be a general-purpose processor, a central processing unit, a graphics processor, a digital signal processor, a programmable logic unit, a data processing logic unit based on quantum computing, or the like, but is not limited thereto.

The technical features of the above embodiments may be arbitrarily combined, and all possible combinations of the technical features in the above embodiments are not described for brevity of description, however, as long as there is no contradiction between the combinations of the technical features, they should be considered as the scope of the description.

The foregoing examples illustrate only a few embodiments of the application and are described in detail herein without thereby limiting the scope of the application. It should be noted that it will be apparent to those skilled in the art that several variations and modifications can be made without departing from the spirit of the application, which are all within the scope of the application. Accordingly, the scope of the application should be assessed as that of the appended claims.

Claims

1. A method of fault handling, the method comprising:

detecting whether each border gateway has a fault according to the change condition between the network port states of two adjacent border gateways to obtain a detection result;

determining the target state of each border gateway according to the detection result; the target state is a state which can normally work after the border gateway is subjected to fault recovery.

2. The method according to claim 1, wherein detecting whether each border gateway has a fault according to a change condition between network port states of two adjacent border gateways, and obtaining a detection result includes:

3. The method according to claim 2, wherein detecting whether each border gateway has a fault according to the number, and obtaining a detection result includes:

and if the number does not reach the preset threshold value, determining that the detection result is that at least one border gateway in the border gateways has faults.

4. The method according to claim 1, wherein detecting whether each border gateway has a fault according to a change condition between network port states of two adjacent border gateways, and obtaining a detection result includes:

and detecting whether each border gateway has a fault according to the state change trend of each border gateway, and obtaining a detection result.

5. The method of claim 4, wherein said determining the target state of each border gateway based on the detection results comprises:

and if the state change trend of the border gateway is changed from the first state to the second state, determining the target state of the border gateway as the second state.

6. The method according to any of claims 1 to 5, wherein each of said border gateways in said border gateway cluster has a cluster internet protocol IP and an underlying bearer network interconnect IP;

the cluster IP of each border gateway is identical, and the interconnection IP of the bottom layer bearing network of each border gateway is different.

7. A fault handling apparatus, the apparatus comprising:

the detection module is used for detecting whether each border gateway has faults according to the change condition between the network port states of two adjacent border gateways to obtain detection results;

the determining module is used for determining the target state of each border gateway according to the detection result; the target state is a state which can normally work after the border gateway is subjected to fault recovery.

8. A border gateway comprising a memory storing a computer program and a processor, characterized in that the processor implements the steps of the method of any of claims 1 to 6 when executing the computer program.

9. The border gateway system is characterized by comprising a control node, a computing node, an intranet router and a plurality of border gateways;

the border gateway for performing the steps of the method of any one of claims 1 to 6;

10. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any of claims 1 to 6.