WO2011143876A1 - Master/backup switching method and device for service nodes - Google Patents

Master/backup switching method and device for service nodes

Info

Publication number
WO2011143876A1
Authority
WO
WIPO (PCT)
Prior art keywords
active
standby
service node
switching
state
Prior art date
Application number
PCT/CN2010/076850
Other languages
French (fr)
Chinese (zh)
Inventor
袁博
范亮
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2011143876A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0654 Management of faults, events, alarms or notifications using network fault recovery
    • H04L41/0663 Performing the actions predefined by failover planning, e.g. switching to standby network elements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/16 Threshold monitoring

Definitions

  • The present invention relates to the field of communications, and in particular to a service node active/standby switching method and apparatus.
  • Background
  • A service node (SN) may also be referred to as a user service control device, for example a Broadband Remote Access Server (BRAS) or a Service Router (SR). As more users access broadband through SNs and demand higher reliability, operators increasingly require backup access for users, so user information backup is widely used between SNs.
  • BRAS Broadband Remote Access Server
  • SR Service Router
  • In the prior art, user backup refers to dual-node or multi-node hot backup: an access node (AN) connects to multiple upstream SNs, and the access ports of the active SN correspond one-to-one to the backup ports of the standby SN.
  • The active and standby SNs use the Virtual Router Redundancy Protocol (VRRP) and Bidirectional Forwarding Detection (BFD) to switch states.
  • VRRP Virtual Router Redundancy Protocol
  • BFD Bidirectional Forwarding Detection
  • The active SN synchronizes access user information to the standby SN.
  • When the standby SN receives user information from the active SN, it immediately delivers that information to the interface module in the form of a user table.
  • When the link through which users reach the access port of the active SN fails, the standby SN becomes the active SN and user traffic switches at the same time. After the switchover, the user's uplink and downlink traffic is forwarded through the standby SN, which controls and manages user service flows based on the backed-up user information.
  • Currently, the prior art associates the active/standby switching protocol with the state of the SN's downlink port or link to the AN and relies on link connectivity detection: when a fault is detected on the SN's downlink to the AN or on the SN port, the active and standby SNs are switched.
  • Link detection cannot detect internal abnormalities of the active SN device. For example, excessive CPU or memory utilization on the active SN can also cause abnormal user forwarding, but because the internal abnormality is not detected, the link state still appears normal and no SN active/standby switchover takes place. The result is abnormal user forwarding or abnormal user access.
  • Summary of the invention
  • The present invention provides a service node active/standby switching method and apparatus to solve the prior-art problem of abnormal user forwarding or user access caused by the failure to detect internal abnormalities of the active SN device.
  • An active/standby switching method for a service node includes:
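  • establishing an active/standby relationship between service nodes through an active/standby state switching protocol, associating the active/standby state with the system state of the active service node, and configuring an abnormal threshold for the system state;
  • performing the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship.
  • The active/standby state switching protocol includes one of the following: the Virtual Router Redundancy Protocol (VRRP) or the multi-chassis link aggregation (MC-LAG) protocol.
  • The system state includes at least one of the following: hardware operating state, software operating state, device system alarm.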
  • The process of performing the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship includes: determining that a first switching condition is reached, and performing the service node active/standby switchover through the active/standby state switching protocol according to the active/standby relationship of the service nodes; where the first switching condition is: an alarm is raised on the current system state of the active service node, or the abnormal value of the current system state of the active service node is greater than or equal to the abnormal threshold;
  • determining that a second switching condition is reached, and performing the service node active/standby switchback through the active/standby state switching protocol according to the active/standby relationship of the service nodes; where the second switching condition is: the alarm on the current system state of the active service node has cleared, or the abnormal value of the current system state of the active service node is less than the abnormal threshold.
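The two switching conditions can be sketched as a pair of predicates. This is only an illustration of the wording above, not the patent's implementation: the names `SystemState`, `first_condition_met`, and `second_condition_met` are hypothetical, and the sketch assumes that a monitored state carries either an alarm flag, a numeric abnormal value, or both.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SystemState:
    """Snapshot of the active service node (hypothetical structure)."""
    alarm_active: Optional[bool] = None     # None: no alarm is associated
    abnormal_value: Optional[float] = None  # None: no metric is associated


def first_condition_met(state: SystemState, threshold: float) -> bool:
    """Switchover: an alarm is raised, or the abnormal value >= threshold."""
    if state.alarm_active:
        return True
    return state.abnormal_value is not None and state.abnormal_value >= threshold


def second_condition_met(state: SystemState, threshold: float) -> bool:
    """Switchback: the alarm has cleared, or the abnormal value < threshold."""
    alarm_cleared = state.alarm_active is False
    value_ok = state.abnormal_value is not None and state.abnormal_value < threshold
    return alarm_cleared or value_ok
```

Note that the first condition uses "greater than or equal to" while the second uses strictly "less than", so with a single threshold exactly one of the two holds for any metric value.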
  • The service node active/standby switchover is performed through the active/standby state switching protocol by modifying the priority of the service node.
  • Where the active/standby state is associated with a device system alarm, the method further includes:
  • controlling the service node active/standby switchover by a network management device.
  • A service node active/standby switching device includes:
  • an establishing module, configured to establish an active/standby relationship between service nodes through an active/standby state switching protocol, associate the active/standby state with the system state of the active service node, and configure an abnormal threshold for the system state;
  • a switching module, configured to perform the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship.
  • The active/standby state switching protocol includes one of the following: the Virtual Router Redundancy Protocol (VRRP) or the multi-chassis link aggregation (MC-LAG) protocol;
  • The system state includes at least one of the following: hardware operating state, software operating state, device system alarm.
  • When performing the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship, the switching module is configured to:
  • determine that the first switching condition is reached and perform the service node active/standby switchover, where the first switching condition is: an alarm is raised on the current system state of the active service node, or the abnormal value of the current system state of the active service node is greater than or equal to the abnormal threshold;
  • The switching module is configured to:
  • perform the service node active/standby switchover through the active/standby state switching protocol by modifying the priority of the service node.
  • The switching module is further configured to:
  • perform the service node active/standby switchover under the control of the network management device.
  • FIG. 1 is a flowchart of a method for switching between active and standby of a service node according to an embodiment of the present invention
  • FIG. 2 is a detailed processing flowchart of a service node active/standby switching method according to an embodiment of the present invention
  • FIG. 3 is a schematic structural diagram of a service node active/standby switching device according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of a network topology according to an embodiment of the present invention.
  • Detailed description
  • the present invention provides a service node active/standby switching method and apparatus.
  • the technical solution of the embodiment of the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
  • FIG. 1 is a flowchart of a service node active/standby switching method according to an embodiment of the present invention. As shown in FIG. 1, the service node active/standby switching method according to an embodiment of the present invention includes:
  • Step 101: Establish an active/standby relationship between service nodes through the active/standby state switching protocol, associate the active/standby state with the system state of the active service node, and configure an abnormal threshold for the system state.
  • The active/standby state switching protocol may include one of the following: the Virtual Router Redundancy Protocol or the multi-chassis link aggregation protocol. That is, in step 101 the active and standby SNs can establish the active/standby relationship through an active/standby state switching protocol such as VRRP, but the protocol is not limited to VRRP and may also be the multi-chassis link aggregation (MC-LAG) protocol.
  • MC-LAG multi-chassis link aggregation
  • The system state can include at least one of the following: hardware operating state, software operating state, device system alarm. That is, the active/standby state may be associated with the device-level operating state, such as CPU or memory utilization; with the operating state of a particular piece of hardware, such as a switch fabric board; or with the operating state of a functional module of the device.
  • When the active/standby switching protocol is associated with the system state, multiple different system states can be associated at the same time. The logical relationship between these states can be "and" or "or".
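As a small illustration of associating several system states at once, the sketch below combines per-state results under a configured logical relationship. It assumes the two relationships intended by the text are logical AND and logical OR; the function name `combine_states` and the `mode` strings are hypothetical.

```python
from typing import Iterable


def combine_states(abnormal_flags: Iterable[bool], mode: str) -> bool:
    """Combine per-state abnormality flags under a logical relationship.

    mode "and": every associated state must be abnormal to trigger.
    mode "or":  any single abnormal state is enough to trigger.
    """
    flags = list(abnormal_flags)
    if mode == "and":
        return all(flags)
    if mode == "or":
        return any(flags)
    raise ValueError(f"unsupported logical relationship: {mode!r}")
```

For example, with CPU utilization abnormal and memory utilization normal, `combine_states([True, False], "or")` triggers a switchover while `combine_states([True, False], "and")` does not.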
  • The active and standby SNs can associate the device's system operating state and important alarms with the active/standby state switching protocol, and configure the corresponding critical thresholds (that is, the abnormal thresholds above).
  • Step 102: Perform the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship.
  • When the operating state of the device system is severely abnormal and reaches the critical threshold, or a related serious alarm is raised, the SN triggers the active/standby state switching protocol to perform the SN active/standby switchover; when the original active SN's device system returns to normal or the serious alarm clears, the SN triggers the active/standby state switching protocol to switch the active/standby state back.
  • When the active/standby state is associated with the system state, the switchover and the switchback can use different conditions. For example, switching condition A results in an active/standby switchover (the active SN becomes the standby SN), and switching condition B results in a switchback (the standby SN becomes the active SN again).
  • Specifically, when the first switching condition is determined to be reached, the service node active/standby switchover is performed through the active/standby state switching protocol according to the active/standby relationship of the service nodes, where the first switching condition is: an alarm is raised on the current system state of the active service node, or the abnormal value of the current system state of the active service node is greater than or equal to the abnormal threshold. When the second switching condition is determined to be reached, the service node active/standby switchback is performed through the active/standby state switching protocol according to the active/standby relationship of the service nodes, where the second switching condition is: the alarm on the current system state of the active service node has cleared, or the abnormal value of the current system state of the active service node is less than the abnormal threshold.
  • The service node active/standby switchover may be performed through the active/standby state switching protocol by modifying the priority of the service node.
  • The network management device can control the service node active/standby switchover. That is, when the active/standby protocol is associated with certain serious device system alarms, the network management device can notify the SN through SNMP to switch the active/standby state.
  • Step 201: The SNs establish an active/standby relationship through the active/standby state switching protocol; for example, the active and standby SNs can establish the relationship through VRRP.
  • Step 202: The SN associates the operating state of the device system, or the operating state of a particular software or hardware module of the device, or an important related alarm with the active/standby state, and configures a critical threshold;
  • Step 203: When the operating state of the device system is severely abnormal and reaches the critical threshold, or a related serious alarm is raised, the SN triggers the active/standby state switching protocol to perform the SN active/standby switchover.
  • Step 204: When the operating state of the original active SN's device system returns to normal or the serious alarm clears, the SN triggers the active/standby state switching protocol to switch the active/standby state back.
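Steps 201 to 204 amount to a simple cycle between two roles. The following is a minimal sketch of that cycle, leaving out the protocol exchange (VRRP or MC-LAG) entirely; the class name `SnPair`, its method, and the node labels are hypothetical and chosen only for illustration.

```python
class SnPair:
    """Tracks which of two service nodes currently holds the active role."""

    def __init__(self, original_active: str, original_standby: str):
        self.original_active = original_active
        self.original_standby = original_standby
        # Step 201: the relationship is established; the original active SN serves.
        self.current_active = original_active

    def on_health_change(self, original_active_healthy: bool) -> str:
        """Apply Step 203 (switchover) or Step 204 (switchback)."""
        if original_active_healthy:
            # Step 204: system back to normal or serious alarm cleared.
            self.current_active = self.original_active
        else:
            # Step 203: critical threshold reached or serious alarm raised.
            self.current_active = self.original_standby
        return self.current_active
```

The `original_active_healthy` flag stands in for the result of the threshold and alarm checks configured in Step 202.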
  • With the above processing, the dual-node backup switching mechanism is associated with the system state of the device.
  • When the system state is severely abnormal, the device's active/standby switching protocol is triggered to start an active/standby switchover, which avoids the problem of being unable to switch when the system is severely abnormal.
  • Example 1: Associate the system CPU state and perform the active/standby SN switchover.
  • Step 1: The active and standby SNs establish the active/standby relationship through the active/standby switching protocol VRRP.
  • Step 2: Configure a switchover when system CPU utilization stays above 80% for 5 minutes and a switchback when it stays below 60% for 5 minutes, and link this switching policy to VRRP;
  • Step 3: When system CPU utilization exceeds 80% and stays there for 5 minutes, the VRRP priority of the original active SN is reduced. Because its priority is now lower than the VRRP priority of the original standby SN, the active/standby switchover is triggered according to the VRRP election rules;
  • Step 4: When CPU utilization of the original active SN falls below 60% and stays there for 5 minutes, the VRRP priority of the original active SN is restored and the switchback is triggered.
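Example 1 combines two thresholds (80% and 60%) with a hold time, and expresses the outcome as a priority change. The sketch below models that policy on periodic CPU samples. It is an assumption-laden illustration: the class name `CpuPriorityPolicy`, the priority values 120 and 80, and the idea of five one-minute samples standing in for "5 minutes" are all hypothetical and not taken from the source.

```python
class CpuPriorityPolicy:
    """Lowers a VRRP-style priority after sustained high CPU utilization
    and restores it after sustained low CPU utilization."""

    def __init__(self, base_priority: int = 120, reduced_priority: int = 80,
                 high: float = 80.0, low: float = 60.0, hold_samples: int = 5):
        self.base_priority = base_priority
        self.reduced_priority = reduced_priority
        self.high = high                  # switchover threshold, percent
        self.low = low                    # switchback threshold, percent
        self.hold_samples = hold_samples  # e.g. 5 one-minute samples
        self.priority = base_priority
        self._streak = 0                  # consecutive samples supporting a change

    def observe(self, cpu_percent: float) -> int:
        """Feed one CPU sample and return the priority to advertise."""
        degraded = self.priority == self.reduced_priority
        if not degraded and cpu_percent > self.high:
            self._streak += 1
        elif degraded and cpu_percent < self.low:
            self._streak += 1
        else:
            self._streak = 0              # the condition was interrupted
        if self._streak >= self.hold_samples:
            self.priority = self.base_priority if degraded else self.reduced_priority
            self._streak = 0
        return self.priority
```

If the original standby SN advertises a priority between the two values (say 100, again an assumption), it wins the election while the original active SN is degraded and loses it again once the priority is restored. The gap between 80% and 60% keeps the pair from flapping when utilization hovers near a single threshold.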
  • Example 2: Associate the state of the switch fabric boards and perform the active/standby SN switchover.
  • Step 1: The active and standby SNs establish the active/standby relationship through the active/standby switching protocol VRRP.
  • Step 2: Configure a switchover when two of the three switch fabric boards are in an abnormal state and a switchback when two of the three switch fabric boards are in a normal state, and link this switching policy to the VRRP active/standby switching protocol;
  • Step 3: When two of the three switch fabric boards are in an abnormal state, the VRRP priority of the original active SN is reduced. Because its priority is now lower than the VRRP priority of the original standby SN, the active/standby switchover is triggered according to the VRRP election rules;
  • Step 4: When two of the three switch fabric boards are in a normal state, the VRRP priority of the original active SN is restored and the switchback is triggered.
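The board-count rule in Example 2 can be sketched as two small predicates over the per-board health flags. The function names are hypothetical, and the sketch assumes "two of the three" means "at least two", which the source does not state explicitly.

```python
from typing import Sequence


def fabric_switchover_needed(board_ok: Sequence[bool], abnormal_needed: int = 2) -> bool:
    """True when at least `abnormal_needed` switch fabric boards are abnormal."""
    abnormal = sum(1 for ok in board_ok if not ok)
    return abnormal >= abnormal_needed


def fabric_switchback_allowed(board_ok: Sequence[bool], normal_needed: int = 2) -> bool:
    """True when at least `normal_needed` switch fabric boards are normal."""
    normal = sum(1 for ok in board_ok if ok)
    return normal >= normal_needed
```

With three boards and both counts set to two, the predicates are complementary: any board state satisfies exactly one of them.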
  • Example 3: Associate a routing alarm and perform the active/standby SN switchover.
  • Step 1: The active and standby SNs establish the active/standby relationship through the active/standby switching protocol VRRP.
  • Step 2: Configure a switchover when the BGP peer state of the system has been abnormal for more than 3 minutes and a switchback when the BGP peer state has been normal for more than 10 minutes, and link this switching policy to VRRP;
  • Step 3: When the BGP peer state of the system has been abnormal for more than 3 minutes, the VRRP priority of the original active SN is reduced. Because its priority is now lower than the VRRP priority of the original standby SN, the active/standby switchover is triggered;
  • Step 4: When the BGP peer state of the system has been normal again for more than 10 minutes, the VRRP priority of the original active SN is restored and the switchback is triggered.
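Example 3 uses different hold times for switchover (3 minutes) and switchback (10 minutes). A timestamp-based hold timer is one way to sketch this; the class name `HoldTimer` and the caller-supplied `now` argument (seconds on any monotonic clock) are hypothetical and used only for illustration.

```python
from typing import Optional


class HoldTimer:
    """Reports True once a condition has held continuously for more than
    `hold_seconds`; any interruption restarts the measurement."""

    def __init__(self, hold_seconds: float):
        self.hold_seconds = hold_seconds
        self._since: Optional[float] = None  # when the condition first became true

    def update(self, condition: bool, now: float) -> bool:
        if not condition:
            self._since = None
            return False
        if self._since is None:
            self._since = now
        return now - self._since > self.hold_seconds


# Hypothetical wiring for Example 3: one timer per direction.
bgp_down_timer = HoldTimer(3 * 60)   # peer abnormal for more than 3 minutes
bgp_up_timer = HoldTimer(10 * 60)    # peer normal for more than 10 minutes
```

The longer switchback timer means a peer that recovers only briefly does not pull traffic back to the original active SN.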
  • With the above processing, when the active SN device is internally abnormal, user forwarding or user access can be switched quickly to the standby SN.
  • FIG. 3 is a schematic structural diagram of a service node active/standby switching device according to an embodiment of the present invention. As shown in FIG. 3, the service node active/standby switching device of the embodiment of the present invention includes an establishing module 30 and a switching module 32.
  • The network topology is shown schematically in FIG. 4. The active SN and the standby SN are located between the upper-layer network and the lower-layer network, and an active/standby relationship is established between the active SN and the standby SN. Users gain access through the SN.
  • The service node active/standby switching device in the embodiment of the present invention is described in detail below.
  • The establishing module 30 is configured to establish an active/standby relationship between service nodes through the active/standby state switching protocol, associate the active/standby state with the system state of the active service node, and configure an abnormal threshold for the system state;
  • The active/standby state switching protocol may include one of the following: the Virtual Router Redundancy Protocol or the multi-chassis link aggregation protocol. That is, the establishing module 30 can establish the active/standby relationship through an active/standby state switching protocol such as VRRP, but the protocol is not limited to VRRP and may also be the multi-chassis link aggregation (MC-LAG) protocol.
  • The system state can include at least one of the following: hardware operating state, software operating state, device system alarm. That is, the active/standby state may be associated with the device-level operating state, such as CPU or memory utilization; with the operating state of a particular piece of hardware, such as a switch fabric board; or with the operating state of a functional module of the device.
  • When the active/standby switching protocol is associated with the system state, multiple different system states can be associated at the same time. The logical relationship between these states can be "and" or "or".
  • The switching module 32 is configured to perform the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship.
  • When the operating state of the device system is severely abnormal and reaches the critical threshold, or a related serious alarm is raised, the switching module 32 triggers the active/standby state switching protocol to perform the SN active/standby switchover; when the operating state of the original active SN's device system returns to normal or the serious alarm clears, the switching module 32 triggers the active/standby state switching protocol to switch the active/standby state back.
  • When the active/standby state is associated with the system state, the switching module 32 can use different conditions for the switchover and the switchback.
  • For example, switching condition A causes the switching module 32 to perform the active/standby switchover (the active SN becomes the standby SN), and switching condition B causes the switching module 32 to perform the switchback (the standby SN becomes the active SN again).
  • Specifically, the switching module 32 may operate as follows:
  • The switching module 32 determines that the first switching condition is reached and performs the service node active/standby switchover through the active/standby state switching protocol according to the active/standby relationship of the service nodes, where the first switching condition is: an alarm is raised on the current system state of the active service node, or the abnormal value of the current system state of the active service node is greater than or equal to the abnormal threshold;
  • The switching module 32 determines that the second switching condition is reached and performs the service node active/standby switchback through the active/standby state switching protocol according to the active/standby relationship of the service nodes, where the second switching condition is: the alarm on the current system state of the active service node has cleared, or the abnormal value of the current system state of the active service node is less than the abnormal threshold.
  • The switching module 32 may perform the service node active/standby switchover through the active/standby state switching protocol by modifying the priority of the service node.
  • The switching module 32 can perform the service node active/standby switchover or switchback under the control of the network management device. That is, when the active/standby state protocol is associated with certain serious device system alarms, the network management device can notify the switching module 32 through SNMP to switch the active/standby state.
  • By associating the active/standby state switching protocol with the system state of the active service node, the technical solution of the present invention solves the prior-art problem of abnormal user forwarding or user access caused by the failure to detect internal abnormalities of the active SN device. When the active SN device is internally abnormal, user forwarding or user access can be switched quickly to the standby SN.

Abstract

The present invention provides a master/backup switching method and device for service nodes. The method includes: establishing a master/backup relationship between service nodes using a master/backup state switching protocol, associating the master/backup state with the system state of the master service node, and configuring an abnormality threshold for the system state (101); and performing master/backup switching of the service nodes using the master/backup state switching protocol according to the current system state of the master service node, the abnormality threshold, and the master/backup relationship (102). By associating the master/backup state switching protocol with the system state of the master service node, the method and device solve the prior-art problem of abnormal user forwarding or abnormal user access caused by the failure to detect an internal abnormality of the master service node device, so that user forwarding or user access can be switched rapidly to the backup service node when such an internal abnormality occurs.

Description

Service node active/standby switching method and apparatus
Technical field
The present invention relates to the field of communications, and in particular to a service node active/standby switching method and apparatus.
Background
A service node (SN) may also be referred to as a user service control device, for example a Broadband Remote Access Server (BRAS) or a Service Router (SR). As the number of users accessing broadband through SNs keeps growing, and as broadband users demand ever higher reliability, operators place increasingly strict requirements on backup access for users. User information backup is therefore widely used between SNs.
In the prior art, user backup refers to dual-node or multi-node hot backup. An access node (AN) connects to multiple upstream SNs, the access ports of the active SN correspond one-to-one to the backup ports of the standby SN, and the active and standby SNs use the Virtual Router Redundancy Protocol (VRRP) and Bidirectional Forwarding Detection (BFD) to switch states. The active SN synchronizes access user information to the standby SN; on receiving it, the standby SN immediately delivers it to the interface module in the form of a user table. When the link through which users reach the access port of the active SN fails, the standby SN becomes the active SN and user traffic switches at the same time. After the switchover, the user's uplink and downlink traffic is forwarded through the standby SN, which controls and manages user service flows based on the backed-up user information.
Currently, the prior art associates the active/standby switching protocol with the state of the SN's downlink port or link to the AN and relies on link connectivity detection: when a fault is detected on the SN's downlink to the AN or on the SN port, the active and standby SNs are switched.
In the prior art, link detection cannot detect internal abnormalities of the active SN device. For example, excessive CPU or memory utilization on the active SN can also cause abnormal user forwarding, but because the internal abnormality is not detected, the link state still appears normal and no SN active/standby switchover takes place. The result is abnormal user forwarding or abnormal user access.
Summary of the invention
The present invention provides a service node active/standby switching method and apparatus to solve the prior-art problem of abnormal user forwarding or user access caused by the failure to detect internal abnormalities of the active SN device.
To solve the above problem, the technical solution of the present invention is implemented as follows:
A service node active/standby switching method includes:
establishing an active/standby relationship between service nodes through an active/standby state switching protocol, associating the active/standby state with the system state of the active service node, and configuring an abnormal threshold for the system state;
performing the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship.
The active/standby state switching protocol includes one of the following: the Virtual Router Redundancy Protocol or the multi-chassis link aggregation protocol.
The system state includes at least one of the following: hardware operating state, software operating state, device system alarm.
The process of performing the service node active/standby switchover through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship includes:
determining that a first switching condition is reached, and performing the service node active/standby switchover through the active/standby state switching protocol according to the active/standby relationship of the service nodes; where the first switching condition is: an alarm is raised on the current system state of the active service node, or the abnormal value of the current system state of the active service node is greater than or equal to the abnormal threshold;
determining that a second switching condition is reached, and performing the service node active/standby switchback through the active/standby state switching protocol according to the active/standby relationship of the service nodes; where the second switching condition is: the alarm on the current system state of the active service node has cleared, or the abnormal value of the current system state of the active service node is less than the abnormal threshold.
The method for performing the service node active/standby switchover through the active/standby state switching protocol is: performing the switchover through the active/standby state switching protocol by modifying the priority of the service node.
Where the active/standby state is associated with the device system alarm, the method further includes:
controlling the service node active/standby switchover by a network management device.
A service node active/standby switching apparatus includes:
an establishing module, configured to establish an active/standby relationship between service nodes through an active/standby state switching protocol, associate the active/standby state with the system state of the active service node, and configure an abnormality threshold for the system state; and
a switching module, configured to perform an active/standby switchover of the service node through the active/standby state switching protocol according to the current system state of the active service node, the abnormality threshold, and the active/standby relationship.
The active/standby state switching protocol includes one of the following: a virtual router redundancy protocol or a multi-chassis link aggregation protocol.
The system state includes at least one of the following: a hardware running state, a software running state, and a device system alarm.
When performing the active/standby switchover of the service node through the active/standby state switching protocol according to the current system state of the active service node, the abnormality threshold, and the active/standby relationship, the switching module is configured to:
determine that a first switching condition is met, and perform the active/standby switchover of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, where the first switching condition is that an alarm occurs in the current system state of the active service node, or that the abnormal value of the current system state of the active service node is greater than or equal to the abnormality threshold;
determine that a second switching condition is met, and perform the active/standby switchback of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, where the second switching condition is that the alarm in the current system state of the active service node has cleared, or that the abnormal value of the current system state of the active service node is less than the abnormality threshold.
When performing the active/standby switchover of the service node through the active/standby state switching protocol, the switching module is configured to:
perform the active/standby switchover of the service node through the active/standby state switching protocol by modifying the priority of the service node.
The switching module is further configured to:
perform, when the active/standby state is associated with the device system alarm, the active/standby switchover of the service node under the control of a network management device.
The beneficial effects of the present invention are as follows:
By associating the active/standby state switching protocol with the system state of the active service node, the technical solution of the present invention solves the prior-art problem of abnormal user forwarding or user access caused by the failure to detect internal abnormalities of the active SN device. When an internal abnormality occurs in the active SN device, user forwarding or user access can be quickly switched to the standby SN.
Brief Description of the Drawings
FIG. 1 is a flowchart of a service node active/standby switching method according to an embodiment of the present invention;
FIG. 2 is a detailed processing flowchart of the service node active/standby switching method according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a service node active/standby switching apparatus according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of a network topology according to an embodiment of the present invention.
Detailed Description
To solve the prior-art problem of abnormal user forwarding or user access caused by the failure to detect internal abnormalities of the active SN device, the present invention provides a service node active/standby switching method and apparatus. The technical solutions of the embodiments of the present invention are described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely intended to explain the present invention and do not limit it.
Method embodiment
According to an embodiment of the present invention, a service node active/standby switching method is provided. FIG. 1 is a flowchart of the service node active/standby switching method according to an embodiment of the present invention. As shown in FIG. 1, the method includes the following steps:
Step 101: Establish an active/standby relationship between service nodes through an active/standby state switching protocol, associate the active/standby state with the system state of the active service node, and configure an abnormality threshold for the system state. The active/standby state switching protocol may include one of the following: the Virtual Router Redundancy Protocol (VRRP) or the Multi-Chassis Link Aggregation (MC-LAG) protocol. That is, in step 101 the active and standby SNs may establish the active/standby relationship through an active/standby state switching protocol such as VRRP; the protocol is not limited to VRRP, however, and may also be the MC-LAG protocol.
In addition, the system state may include at least one of the following: a hardware running state, a software running state, and a device system alarm. That is, the active/standby state may be associated with a device-level system running state, such as CPU or memory usage; with the running state of a particular hardware component of the device, such as a switching fabric board; or with the running state of a particular functional module of the device. Furthermore, when the active/standby switching protocol is associated with the system state, multiple different system states may be associated at the same time, and the logical relationship among them may be "AND" or "OR".
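The "AND"/"OR" relationship among several associated system states can be illustrated with a minimal Python sketch. The state keys (`cpu_percent`, `memory_percent`), the threshold values, and the helper names `any_of` and `all_of` are assumptions made for illustration only; the specification does not define them.

```python
from typing import Callable, Dict, List

# Hypothetical snapshot of monitored values; the keys are illustrative only.
State = Dict[str, float]
Check = Callable[[State], bool]


def any_of(checks: List[Check]) -> Check:
    """'OR' relationship: abnormal if at least one associated state is abnormal."""
    return lambda state: any(check(state) for check in checks)


def all_of(checks: List[Check]) -> Check:
    """'AND' relationship: abnormal only if every associated state is abnormal."""
    return lambda state: all(check(state) for check in checks)


def cpu_high(state: State) -> bool:
    return state["cpu_percent"] >= 80.0      # assumed threshold


def memory_high(state: State) -> bool:
    return state["memory_percent"] >= 90.0   # assumed threshold


snapshot = {"cpu_percent": 85.0, "memory_percent": 40.0}
or_result = any_of([cpu_high, memory_high])(snapshot)
and_result = all_of([cpu_high, memory_high])(snapshot)
print(or_result, and_result)
```

With the sample snapshot, only the CPU check reports an abnormality, so the "OR" combination would trigger while the "AND" combination would not.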
In summary, in step 101 the active and standby SNs may associate the system running state of the device and important alarms with the active/standby state switching protocol, and configure the corresponding critical threshold (that is, the abnormality threshold described above).
Step 102: Perform the active/standby switchover of the service node through the active/standby state switching protocol according to the current system state of the active service node, the abnormality threshold, and the active/standby relationship.
That is, when the device system state is seriously abnormal and reaches the critical threshold, or when a related critical alarm is raised, the SN triggers the active/standby state switching protocol to perform an active/standby switchover of the SN; when the system running state of the original active SN device returns to normal or the critical alarm clears, the SN triggers the active/standby state switching protocol to switch back. In practice, when the active/standby state is associated with the system state, different conditions may be chosen for switchover and for switchback. For example, switching condition A causes a switchover (the active SN switches to the standby SN), and switching condition B causes a switchback (the standby SN switches back to the active SN).
Specifically, the processing may be as follows:
Determine that the first switching condition is met, and perform the active/standby switchover of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, where the first switching condition is that an alarm occurs in the current system state of the active service node, or that the abnormal value of the current system state of the active service node is greater than or equal to the abnormality threshold.
Determine that the second switching condition is met, and perform the active/standby switchback of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, where the second switching condition is that the alarm in the current system state of the active service node has cleared, or that the abnormal value of the current system state of the active service node is less than the abnormality threshold.
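The two switching conditions can be sketched as a small state transition in Python. The names `Role` and `next_role` are hypothetical. The sketch also assumes that, when both an alarm and a measured value are associated, switchback waits until neither indicates a fault; the text itself states each condition per associated state.

```python
from enum import Enum


class Role(Enum):
    ACTIVE = "active"
    STANDBY = "standby"


def next_role(role: Role, alarm_raised: bool,
              abnormal_value: float, threshold: float) -> Role:
    """Role of the original active SN after evaluating the two conditions."""
    abnormal = alarm_raised or abnormal_value >= threshold
    if role is Role.ACTIVE and abnormal:
        return Role.STANDBY   # first switching condition met: switch over
    if role is Role.STANDBY and not abnormal:
        return Role.ACTIVE    # second switching condition met: switch back
    return role               # neither condition met: keep the current role


role = Role.ACTIVE
role = next_role(role, alarm_raised=False, abnormal_value=85.0, threshold=80.0)
after_fault = role
role = next_role(role, alarm_raised=False, abnormal_value=50.0, threshold=80.0)
after_recovery = role
print(after_fault, after_recovery)
```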
Preferably, in practice, the active/standby switchover of the service node may be performed through the active/standby state switching protocol by modifying the priority of the service node.
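Priority-based switching can be sketched as follows, assuming a simplified election in which the node with the highest priority becomes the active node and preemption is enabled. The node names, the configured priorities, and the penalty value are hypothetical, and tie-breaking is not modelled.

```python
from typing import Dict

CONFIGURED = {"SN-A": 120, "SN-B": 100}  # hypothetical configured priorities
PENALTY = 30                             # hypothetical reduction applied on a fault


def elect_master(priorities: Dict[str, int]) -> str:
    """Return the node with the highest priority (ties are not modelled)."""
    return max(priorities, key=lambda node: priorities[node])


def effective_priorities(active_faulty: bool) -> Dict[str, int]:
    """Lower SN-A's priority while its associated system state is abnormal."""
    result = dict(CONFIGURED)
    if active_faulty:
        result["SN-A"] -= PENALTY
    return result


normal_master = elect_master(effective_priorities(active_faulty=False))
faulty_master = elect_master(effective_priorities(active_faulty=True))
print(normal_master, faulty_master)
```

Lowering the priority rather than forcing a state change leaves the election itself to the redundancy protocol, so restoring the configured priority is enough to bring about the switchback.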
In addition, when the active/standby state switching protocol is associated with a device system alarm (including a network management alarm), the network management device may control the active/standby switchover of the service node. That is, when the active/standby state protocol is associated with certain critical device system alarms, the network management device may notify the SN via SNMP to switch its active/standby state.
FIG. 2 is a detailed processing flowchart of the service node active/standby switching method according to an embodiment of the present invention. As shown in FIG. 2, the technical solution according to the embodiment includes the following processing:
Step 201: The SNs establish an active/standby relationship through the active/standby state switching protocol; for example, the active and standby SNs may establish the relationship through VRRP.
Step 202: The SN associates the device system running state, the running state of a software or hardware module of the device, or important related alarms with the active/standby state, and configures a critical threshold.
Step 203: When the device system running state is seriously abnormal and reaches the critical threshold, or a related critical alarm is raised, the SN triggers the active/standby state switching protocol to perform an active/standby switchover of the SN.
Step 204: When the system running state of the original active SN device returns to normal or the critical alarm clears, the SN triggers the active/standby state switching protocol to switch back.
As can be seen from the above processing, the embodiment of the present invention associates the switching mechanism of dual-node backup with the device system state. When the device state is found to be seriously abnormal or a critical alarm is raised, the device's active/standby switching protocol is triggered to start a switchover, which avoids the situation in which no switchover takes place even though the system is seriously abnormal.
The above technical solution of the embodiments of the present invention is illustrated below with examples.
Example 1: Associate the system CPU state and trigger active/standby SN switching.
Step 1: The active and standby SNs establish an active/standby relationship through the active/standby switching protocol VRRP.
Step 2: Configure a switchover to be performed when system CPU usage exceeds 80% for 5 minutes and a switchback to be performed when CPU usage stays below 60% for 5 minutes, and link this switching policy to VRRP.
Step 3: When system CPU usage exceeds 80% for 5 minutes, the VRRP priority of the original active SN is lowered. Because the priority of the original active SN is now lower than the VRRP priority of the original standby SN, a switchover is triggered according to the VRRP master election mechanism.
Step 4: When the CPU usage of the original active SN stays below 60% for 5 minutes, the VRRP priority of the original active SN is restored, and a switchback is triggered.
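The trigger logic of Example 1 (above 80% for 5 minutes, below 60% for 5 minutes) can be sketched as a small tracker in Python. The class name `CpuTrigger`, the sampling interface, and the returned action strings are assumptions; how the CPU is sampled and how the VRRP priority is actually changed are outside the sketch.

```python
from typing import Optional


class CpuTrigger:
    """Tracks how long CPU usage has continuously met the current condition."""

    def __init__(self, high: float = 80.0, low: float = 60.0,
                 hold_seconds: float = 300.0) -> None:
        self.high = high
        self.low = low
        self.hold = hold_seconds
        self.degraded = False                 # True once a switchover was triggered
        self._since: Optional[float] = None   # start of the current qualifying run

    def update(self, now: float, cpu_percent: float) -> Optional[str]:
        """Feed one sample; return 'switchover', 'switchback', or None."""
        if self.degraded:
            qualifies = cpu_percent < self.low
        else:
            qualifies = cpu_percent > self.high
        if not qualifies:
            self._since = None                # the run was interrupted
            return None
        if self._since is None:
            self._since = now
        if now - self._since >= self.hold:
            self.degraded = not self.degraded
            self._since = None
            return "switchover" if self.degraded else "switchback"
        return None


trigger = CpuTrigger()
samples = [(0, 90.0), (150, 90.0), (300, 90.0), (360, 70.0), (420, 50.0), (720, 50.0)]
actions = [trigger.update(t, cpu) for t, cpu in samples]
print(actions)
```

Using a lower bound for recovery than for the fault (60% versus 80%) keeps the pair of SNs from flapping when the load hovers around a single threshold.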
Example 2: Associate the switching fabric board state and trigger active/standby SN switching.
Step 1: The active and standby SNs establish an active/standby relationship through the active/standby switching protocol VRRP.
Step 2: Configure a switchover to be performed when 2 of the 3 switching fabric boards are abnormal and a switchback to be performed when 2 of the 3 switching fabric boards are normal, and link this switching policy to VRRP.
Step 3: When 2 of the 3 switching fabric boards are abnormal, the VRRP priority of the original active SN is lowered. Because the priority of the original active SN is now lower than the VRRP priority of the original standby SN, a switchover is triggered according to the VRRP master election mechanism.
Step 4: When 2 of the 3 switching fabric boards are normal, the VRRP priority of the original active SN is restored, and a switchback is triggered.
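The board-count policy of Example 2 can be sketched as two predicates in Python. The sketch reads "2 of the 3 boards" as "at least 2", which is an assumption, and the function names are hypothetical.

```python
from typing import List


def should_switch_over(board_ok: List[bool], min_abnormal: int = 2) -> bool:
    """Switch over when at least `min_abnormal` boards report an abnormal state."""
    return board_ok.count(False) >= min_abnormal


def should_switch_back(board_ok: List[bool], min_normal: int = 2) -> bool:
    """Switch back when at least `min_normal` boards report a normal state."""
    return board_ok.count(True) >= min_normal


boards = [True, False, False]   # one healthy board, two abnormal boards
over = should_switch_over(boards)
back = should_switch_back(boards)
print(over, back)
```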
Example 3: Associate a routing alarm and trigger active/standby SN switching.
Step 1: The active and standby SNs establish an active/standby relationship through the active/standby switching protocol VRRP.
Step 2: Configure a switchover to be performed when a BGP peer has been in an abnormal state for more than 3 minutes and a switchback to be performed when the BGP peer has been back to normal for more than 10 minutes, and link this switching policy to VRRP.
Step 3: When a BGP peer of the system has been in an abnormal state for more than 3 minutes, the VRRP priority of the original active SN is lowered. Because the priority of the original active SN is now lower than the VRRP priority of the original standby SN, a switchover is triggered according to the VRRP master election mechanism.
Step 4: When the state of that BGP peer has been back to normal for more than 10 minutes, the VRRP priority of the original active SN is restored, and a switchback is triggered.
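Example 3 uses different hold times for the fault (3 minutes) and for the recovery (10 minutes). A stateless Python sketch of that decision follows; the function name `bgp_action` and its parameters are hypothetical. The caller is assumed to record when the peer last changed state and to ignore a "switchback" result if no switchover has occurred.

```python
from typing import Optional


def bgp_action(peer_up: bool, state_since: float, now: float,
               down_hold: float = 180.0, up_hold: float = 600.0) -> Optional[str]:
    """Decide from how long the BGP peer has been in its current state."""
    elapsed = now - state_since
    if not peer_up and elapsed > down_hold:
        return "switchover"   # abnormal for more than 3 minutes
    if peer_up and elapsed > up_hold:
        return "switchback"   # back to normal for more than 10 minutes
    return None


early = bgp_action(peer_up=False, state_since=0.0, now=120.0)
late = bgp_action(peer_up=False, state_since=0.0, now=200.0)
recovered = bgp_action(peer_up=True, state_since=200.0, now=900.0)
print(early, late, recovered)
```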
With the technical solution of the present invention, when the active SN device becomes abnormal, user forwarding or user access can be quickly switched to the standby SN.
Apparatus embodiment
According to an embodiment of the present invention, a service node active/standby switching apparatus is provided; the apparatus may be located in a service node. FIG. 3 is a schematic structural diagram of the service node active/standby switching apparatus according to an embodiment of the present invention. As shown in FIG. 3, the apparatus includes an establishing module 30 and a switching module 32.
Before the apparatus is described in detail, the network topology is described first. FIG. 4 is a schematic diagram of the network topology according to an embodiment of the present invention. As shown in FIG. 4, the active SN and the standby SN are located between the upper-layer network and the lower-layer network, an active/standby relationship is established in advance between the active SN and the standby SN, and users access the network through the SNs.
The service node active/standby switching apparatus according to the embodiment of the present invention is described in detail below.
Specifically, the establishing module 30 is configured to establish an active/standby relationship between service nodes through an active/standby state switching protocol, associate the active/standby state with the system state of the active service node, and configure an abnormality threshold for the system state. The active/standby state switching protocol may include one of the following: the Virtual Router Redundancy Protocol (VRRP) or the Multi-Chassis Link Aggregation (MC-LAG) protocol. That is, the establishing module 30 may establish the active/standby relationship through an active/standby state switching protocol such as VRRP; the protocol is not limited to VRRP, however, and may also be the MC-LAG protocol.
In addition, the system state may include at least one of the following: a hardware running state, a software running state, and a device system alarm. That is, the active/standby state may be associated with a device-level system running state, such as CPU or memory usage; with the running state of a particular hardware component of the device, such as a switching fabric board; or with the running state of a particular functional module of the device. Furthermore, when the active/standby switching protocol is associated with the system state, multiple different system states may be associated at the same time, and the logical relationship among them may be "AND" or "OR".
The switching module 32 is configured to perform the active/standby switchover of the service node through the active/standby state switching protocol according to the current system state of the active service node, the abnormality threshold, and the active/standby relationship.
That is, when the device system state is seriously abnormal and reaches the critical threshold, or when a related critical alarm is raised, the switching module 32 triggers the active/standby state switching protocol to perform an active/standby switchover of the SN; when the system running state of the original active SN device returns to normal or the critical alarm clears, the switching module 32 triggers the active/standby state switching protocol to switch back. In practice, when the active/standby state is associated with the system state, the switching module 32 may use different conditions for switchover and for switchback. For example, switching condition A causes the switching module 32 to perform a switchover (the active SN switches to the standby SN), and switching condition B causes the switching module 32 to perform a switchback (the standby SN switches back to the active SN).
Specifically, the switching module 32 may perform the following processing:
The switching module 32 determines that the first switching condition is met, and performs the active/standby switchover of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, where the first switching condition is that an alarm occurs in the current system state of the active service node, or that the abnormal value of the current system state of the active service node is greater than or equal to the abnormality threshold.
The switching module 32 determines that the second switching condition is met, and performs the active/standby switchback of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, where the second switching condition is that the alarm in the current system state of the active service node has cleared, or that the abnormal value of the current system state of the active service node is less than the abnormality threshold.
Preferably, in practice, the switching module 32 may perform the active/standby switchover of the service node through the active/standby state switching protocol by modifying the priority of the service node.
In addition, when the active/standby state switching protocol is associated with a device system alarm (including a network management alarm), the switching module 32 may perform the active/standby switchover or switchback of the service node under the control of the network management device. That is, when the active/standby state protocol is associated with certain critical device system alarms, the network management device may notify the switching module 32 via SNMP to switch the active/standby state.
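The division of work between the establishing module 30 and the switching module 32 can be outlined in Python as below. This is only a structural sketch under stated assumptions: the class names, the `Association` record, and the `read_value` callback are hypothetical, only a threshold-type state is modelled, and no real protocol messages are exchanged.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Association:
    protocol: str                     # for example "VRRP" or "MC-LAG"
    read_value: Callable[[], float]   # reads the associated system state
    threshold: float                  # configured abnormality threshold


class EstablishingModule:
    """Records the protocol, the associated state, and its threshold."""

    def establish(self, protocol: str, read_value: Callable[[], float],
                  threshold: float) -> Association:
        return Association(protocol, read_value, threshold)


class SwitchingModule:
    """Compares the current state with the threshold and decides the action."""

    def __init__(self, association: Association) -> None:
        self.association = association
        self.switched = False   # True while traffic is on the standby SN

    def evaluate(self) -> Optional[str]:
        value = self.association.read_value()
        abnormal = value >= self.association.threshold
        if abnormal and not self.switched:
            self.switched = True
            return "switchover"
        if not abnormal and self.switched:
            self.switched = False
            return "switchback"
        return None


reading = [50.0]   # mutable stand-in for a monitored value
association = EstablishingModule().establish("VRRP", lambda: reading[0], 80.0)
module = SwitchingModule(association)
results = [module.evaluate()]
reading[0] = 85.0
results.append(module.evaluate())
reading[0] = 40.0
results.append(module.evaluate())
print(results)
```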
By associating the active/standby state switching protocol with the system state of the active service node, the technical solution of the present invention solves the prior-art problem of abnormal user forwarding or user access caused by the failure to detect internal abnormalities of the active SN device. When an internal abnormality occurs in the active SN device, user forwarding or user access can be quickly switched to the standby SN.
Although preferred embodiments of the present invention have been disclosed for illustrative purposes, those skilled in the art will appreciate that various improvements, additions, and substitutions are also possible; therefore, the scope of the present invention should not be limited to the embodiments described above.

Claims

1. A service node active/standby switching method, characterized by comprising:
establishing an active/standby relationship between service nodes through an active/standby state switching protocol, associating the active/standby state with the system state of an active service node, and configuring an abnormality threshold for the system state; and
performing an active/standby switchover of the service node through the active/standby state switching protocol according to the current system state of the active service node, the abnormality threshold, and the active/standby relationship.
2. The method according to claim 1, wherein the active/standby state switching protocol comprises one of the following: a virtual router redundancy protocol or a multi-chassis link aggregation protocol.
3. The method according to claim 1, wherein the system state comprises at least one of the following: a hardware running state, a software running state, and a device system alarm.
4. The method according to any one of claims 1 to 3, wherein the process of performing the active/standby switchover of the service node through the active/standby state switching protocol according to the current system state of the active service node, the abnormality threshold, and the active/standby relationship comprises:
determining that a first switching condition is met, and performing the active/standby switchover of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, wherein the first switching condition is that an alarm occurs in the current system state of the active service node, or that the abnormal value of the current system state of the active service node is greater than or equal to the abnormality threshold; and
determining that a second switching condition is met, and performing the active/standby switchback of the service node through the active/standby state switching protocol according to the active/standby relationship of the service node, wherein the second switching condition is that the alarm in the current system state of the active service node has cleared, or that the abnormal value of the current system state of the active service node is less than the abnormality threshold.
5. The method according to claim 4, wherein the active/standby switchover of the service node is performed through the active/standby state switching protocol as follows:
performing the active/standby switchover of the service node through the active/standby state switching protocol by modifying the priority of the service node.
6. The method according to claim 4, wherein, when the active/standby state is associated with the device system alarm, the method further comprises:
controlling, by a network management device, the active/standby switchover of the service node.
7、 一种业务节点主备切换装置, 其特征在于, 包括:  A service node active/standby switching device, comprising:
建立模块, 用于通过主备状态切换协议建立业务节点的主备关系, 将 所述主备状态与主用业务节点的***状态相关联, 并配置所述***状态的 异常阔值;  Establishing a module, configured to establish an active/standby relationship of the service node by using an active/standby state switching protocol, associate the active/standby state with a system state of the active service node, and configure an abnormal threshold of the system state;
切换模块, 用于根据所述主用业务节点的当前***状态、 所述异常阔 值、 以及所述主备关系, 通过所述主备状态切换协议进行业务节点主备切 换。  And a switching module, configured to perform active/standby switching of the service node by using the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship.
8、 如权利要求 7所述的装置, 其特征在于,  8. Apparatus according to claim 7 wherein:
所述主备状态切换协议包括以下之一: 虚拟路由冗余协议、 多机架链 路聚合协议;  The active/standby state switching protocol includes one of the following: a virtual routing redundancy protocol, a multi-rack link aggregation protocol;
所述***状态包括以下至少之一: 硬件运行状态、 软件运行状态、 设 备***告警。  The system status includes at least one of the following: hardware operating status, software operating status, and device system alarm.
9. The apparatus according to claim 7 or 8, wherein, when performing active/standby switching of the service nodes through the active/standby state switching protocol according to the current system state of the active service node, the abnormal threshold, and the active/standby relationship, the switching module is configured to:
determine that a first switching condition is met, and perform active/standby switchover of the service nodes through the active/standby state switching protocol according to the active/standby relationship of the service nodes; wherein the first switching condition is: it is determined that an alarm has occurred in the current system state of the active service node, or that the abnormal value of the current system state of the active service node is greater than or equal to the abnormal threshold;
determine that a second switching condition is met, and perform active/standby switchback of the service nodes through the active/standby state switching protocol according to the active/standby relationship of the service nodes; wherein the second switching condition is: it is determined that the alarm in the current system state of the active service node has cleared, or that the abnormal value of the current system state of the active service node is less than the abnormal threshold.
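The two switching conditions of claim 9 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `SystemState`, `Action`, and `decide_action` are hypothetical, and the sketch assumes that the first switching condition takes precedence, so a switchback is only considered once no alarm is present and the abnormal value is below the threshold.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    NONE = "none"
    SWITCH_TO_STANDBY = "switch_to_standby"
    SWITCH_BACK = "switch_back"


@dataclass
class SystemState:
    # Hypothetical fields; the claims only speak of an alarm and an abnormal value.
    alarm_active: bool      # True while a system alarm is raised on the active node
    abnormal_value: float   # monitored value, e.g. a load or error figure


def decide_action(state: SystemState, threshold: float, switched_over: bool) -> Action:
    """Return the action for the current state of the (original) active node.

    switched_over is True when service has already moved to the standby node.
    """
    # First switching condition: alarm present, or abnormal value >= threshold.
    first_condition = state.alarm_active or state.abnormal_value >= threshold
    if first_condition:
        return Action.NONE if switched_over else Action.SWITCH_TO_STANDBY
    # Reaching this point means no alarm and abnormal value < threshold
    # (assumed reading of the second switching condition).
    return Action.SWITCH_BACK if switched_over else Action.NONE
```

Called periodically with the latest state, `decide_action` only reports what should happen; carrying out the switch is left to the active/standby state switching protocol.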
10. The apparatus according to claim 9, wherein, when performing active/standby switching of the service nodes through the active/standby state switching protocol, the switching module is configured to:
perform the active/standby switching of the service nodes through the active/standby state switching protocol by modifying the priority of a service node.
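Switching by priority modification, as in claim 10, can be sketched as follows. This is a simplified model under stated assumptions, not a VRRP implementation: `Node`, `elect_master`, `lower_priority`, and the priority values are all hypothetical, and the only protocol behaviour modelled is that the node with the highest priority becomes the active one (ties are not handled).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Node:
    name: str
    priority: int  # VRRP-style priority: the higher value wins the election


def elect_master(nodes: List[Node]) -> Node:
    """Pick the node with the highest priority (no tie-breaking in this sketch)."""
    return max(nodes, key=lambda n: n.priority)


def lower_priority(node: Node, decrement: int, floor: int = 1) -> None:
    """Reduce a node's priority so that a healthier peer can take over."""
    node.priority = max(floor, node.priority - decrement)


# Usage: node-a starts as the active node, degrades, then recovers.
node_a = Node("node-a", 120)
node_b = Node("node-b", 100)
nodes = [node_a, node_b]

before = elect_master(nodes).name        # node-a is active
lower_priority(node_a, decrement=30)     # abnormal state detected on node-a
during = elect_master(nodes).name        # node-b takes over
node_a.priority = 120                    # state recovers, priority restored
after = elect_master(nodes).name         # node-a is active again
```

The design point is that the switching module never forces a role change directly; it only adjusts a priority and lets the protocol's own election move the active role.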
11. The apparatus according to claim 9, wherein the switching module is further configured to:
when the active/standby state is associated with the device system alarm, perform the active/standby switching of the service nodes under the control of a network management device.
PCT/CN2010/076850 2010-05-21 2010-09-13 Master/backup switching method and device for service nodes WO2011143876A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2010101790150A CN102255740A (en) 2010-05-21 2010-05-21 Method and device for active-standby switch of service nodes
CN201010179015.0 2010-05-21

Publications (1)

Publication Number Publication Date
WO2011143876A1 (en) 2011-11-24

Family

ID=44982746

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/076850 WO2011143876A1 (en) 2010-05-21 2010-09-13 Master/backup switching method and device for service nodes

Country Status (2)

Country Link
CN (1) CN102255740A (en)
WO (1) WO2011143876A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106411589A (en) * 2016-09-29 2017-02-15 北京神州绿盟信息安全科技股份有限公司 Method and apparatus for realizing high availability
CN109450707A (en) * 2018-12-13 2019-03-08 京信通信系统(中国)有限公司 Data transmission method, device, gateway and readable storage medium

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102594696B (en) * 2012-03-08 2016-09-07 中兴通讯股份有限公司 A kind of method and device managing network device link aggregation group
CN102739451B (en) * 2012-06-29 2014-12-03 华为技术有限公司 Method and device for updating master-slave switchover condition, server and system
CN103560913A (en) * 2013-10-31 2014-02-05 华为技术有限公司 Disaster recovery switching method, equipment and system
CN112866004B (en) * 2018-08-23 2024-04-12 华为技术有限公司 Control plane equipment switching method and device and transfer control separation system
CN111277596A (en) * 2020-01-20 2020-06-12 广东电网有限责任公司电力调度控制中心 Power grid regulation and control safety zone data transmission system, method and equipment
CN114584866A (en) * 2020-11-18 2022-06-03 中兴通讯股份有限公司 Service protection method, optical transport network device and storage medium
CN112887204A (en) * 2021-01-29 2021-06-01 上海弘积信息科技有限公司 Method for periodically setting VRRP node as main node
CN112929265A (en) * 2021-02-08 2021-06-08 上海弘积信息科技有限公司 Method for any node to place VRRP any node in main in real time
CN112929266A (en) * 2021-02-09 2021-06-08 上海弘积信息科技有限公司 Method for switching VRRP (virtual router redundancy protocol) main and standby according to SSL (secure socket layer) accelerator card state
CN112929267A (en) * 2021-02-09 2021-06-08 上海弘积信息科技有限公司 Method for switching VRRP main and standby in advance according to CPU temperature
CN113067780B (en) * 2021-03-15 2022-11-01 杭州迪普科技股份有限公司 Flow processing method of virtual switching matrix and electronic equipment
CN114020532A (en) * 2021-11-03 2022-02-08 安天科技集团股份有限公司 Dual-computer hot standby method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1558604A (en) * 2004-01-15 2004-12-29 烽火通信科技股份有限公司 Method for master and backup cross connection modules protection switching
CN101068140A (en) * 2007-06-27 2007-11-07 中兴通讯股份有限公司 Apparatus and method for realizing primary/standby PCI equipment switchover
CN101119187A (en) * 2007-09-06 2008-02-06 上海可鲁系统软件有限公司 Master-slave equipment switch control method
CN101166069A (en) * 2007-09-05 2008-04-23 中兴通讯股份有限公司 Method and system for automatic switching of optical module in optical transmission network

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7036051B1 (en) * 2002-12-18 2006-04-25 Juniper Networks, Inc. Responsive virtual routing system
CN100558122C (en) * 2005-12-21 2009-11-04 华为技术有限公司 The method that single board main/standby switches in a kind of access layer equipment
CN101447858B (en) * 2008-01-17 2012-01-11 中兴通讯股份有限公司 Method for realizing synchronous switching of virtual router redundancy protocol in dual-machine hot backup system
CN101267392B (en) * 2008-04-24 2015-05-20 中兴通讯股份有限公司 A realizing method for notifying downstream device in case of switch of uplink link status

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106411589A (en) * 2016-09-29 2017-02-15 北京神州绿盟信息安全科技股份有限公司 Method and apparatus for realizing high availability
CN109450707A (en) * 2018-12-13 2019-03-08 京信通信系统(中国)有限公司 Data transmission method, device, gateway and readable storage medium
CN109450707B (en) * 2018-12-13 2021-12-14 京信网络系统股份有限公司 Data transmission method and device, gateway equipment and readable storage medium

Also Published As

Publication number Publication date
CN102255740A (en) 2011-11-23

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10851633

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10851633

Country of ref document: EP

Kind code of ref document: A1