CN103931139B - Method and device for redundancy protection, and device and system - Google Patents

Method and device for redundancy protection, and device and system Download PDF

Info

Publication number
CN103931139B
CN103931139B CN201380003769.2A CN201380003769A CN103931139B CN 103931139 B CN103931139 B CN 103931139B CN 201380003769 A CN201380003769 A CN 201380003769A CN 103931139 B CN103931139 B CN 103931139B
Authority
CN
China
Prior art keywords
subsystem
primary
node
standby
secondary node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201380003769.2A
Other languages
Chinese (zh)
Other versions
CN103931139A (en
Inventor
朱家悦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN103931139A publication Critical patent/CN103931139A/en
Application granted granted Critical
Publication of CN103931139B publication Critical patent/CN103931139B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Hardware Redundancy (AREA)

Abstract

An embodiment of the invention provides a method and a device for redundancy protection, and a device and system, and relates to the field of computers. The method comprises: detecting whether a fault node exists; if the fault node exists and the fault node comprises a first main node carrying a first main subsystem, and a first standby node carrying a first standby subsystem, the first main subsystem being bound to a second standby node of a second standby subsystem, so that the second standby node carries the first main subsystem. The first standby subsystem is a default standby subsystem of the first main subsystem. The second standby subsystem is a non-default standby subsystem of the first main subsystem. The device comprises a detection module and a first executive module. The method, the device, and the system support a plurality of fault processes, and improve reliability of the system.

Description

A kind of redundancy protected method, device, equipment and system
Technical field
The present invention relates to computer realm, particularly to a kind of redundancy protected method, device, equipment and system.
Background technology
Cluster (cluster) refers to, some separate service entities (computer or work station etc.), by HVN The service platform that network interconnects and constitutes.It is different from single service entity, Cluster has equally loaded and two spies of redundancy protecting Property.Wherein, redundancy protecting refers to, when the resource (can be the process on server or server) of certain task of execution is due to certain When kind of reason breaks down, to the standby resources of resource should then completing this task.
Specifically, with execute the resource of certain task as server on process as a example, existing redundancy protecting flow process is led to Often as follows.First, primary resources and the standby resources of equal amount are disposed on each service entities of Cluster.Secondly, adopt Mode with ring-like backup is primary resources setting standby resources it is ensured that primary resources and the standby resources of oneself do not take at one In entity.For example, Cluster includes numbering respectively 1 ..., the service entities of N-1, N and N+1.Divide equally on each service entities Cloth M primary resources and M standby resources.Standby resources corresponding to the upper M primary resources of N-1 are distributed on N;Upper M of N Standby resources corresponding to primary resources are distributed on N+1;Standby resources corresponding to the upper M primary resources of N+1 are distributed in 1 On;By that analogy.Then, the running status of each primary resources of real-time detection;Meanwhile, primary resources are in the task of execution, corresponding Standby resources back up primary resources all data.When primary resources is detected and breaking down, the primary money of closing fault Source, the corresponding standby resources of primary resources of startup separator.Because standby resources have backed up all numbers of the primary resources of fault According to therefore, it is possible to ensure that task do not interrupt.
During realizing the present invention, inventor finds that prior art at least has problems with:
If primary resources and standby resources corresponding with this primary resources all break down, the task on primary resources will Interrupt, the reliability of impact Cluster.
Content of the invention
In order to increase the reliability of Cluster, embodiments provide a kind of redundancy protected method, device, equipment And system.Described technical scheme is as follows:
On the one hand, embodiments provide a kind of redundancy protected method, methods described includes:
Detect whether there is malfunctioning node;
When there is malfunctioning node, and described malfunctioning node include carrying the first primary subsystem the first primary node and Carry the first standby subsystem the first secondary node when, by the described first primary binding of subsystems to carry the second standby subsystem Second secondary node of system, so that described second secondary node carries the described first primary subsystem, described first standby subsystem System is the standby subsystem of acquiescence of the described first primary subsystem, and described second standby subsystem is the described first primary subsystem The standby subsystem of non-default.
Preferably, described the second standby section by the described first primary binding of subsystems to carrying the second standby subsystem Point, so that described second secondary node carries described primary subsystem, including:
In the standby subsystem of non-default of the described first primary subsystem, select the minimum standby subsystem of a priority System, the secondary node that selected described standby subsystem is located is as described second secondary node and main by described first With binding of subsystems to described second secondary node.
Preferably, in described the second standby section by the described first primary binding of subsystems to carrying the second standby subsystem Point, so that before described second secondary node carries described primary subsystem, methods described also includes:
Judge whether the described first primary subsystem possesses the ability of seizing;
If described first primary subsystem possesses the ability of seizing, by the described first primary binding of subsystems to described second Secondary node.
Preferably, after by the described first primary binding of subsystems to described second secondary node, methods described is also wrapped Include:
Detect whether the described first primary node and described first secondary node recover normal;
If described first primary node first recovers normal, the described first standby binding of subsystems is primary to described first Node, so that described first primary node carries the described first standby subsystem;
If described first secondary node first recovers normal, the described first standby binding of subsystems is standby to described first Node, so that described first secondary node carries the described first standby subsystem.
Preferably, described by the described first standby binding of subsystems to described first primary node, so that described first After primary node carries the described first standby subsystem, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described masterslave switchover is carried out to the described first primary node and described second secondary node after, institute Method of stating also includes:
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first standby section Put and discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
Preferably, described by the described first standby binding of subsystems to described first secondary node so that described first is standby After carrying the described first standby subsystem with node, methods described also includes:
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described masterslave switchover is carried out to described first secondary node and described second secondary node after, institute Method of stating also includes:
After the described first primary node recovers normal, the described first standby binding of subsystems is primary to described first Node simultaneously discharges described second secondary node, so that described first primary node carries the described first standby subsystem.
Preferably, described by the described first standby binding of subsystems to described first primary node discharge described second After secondary node, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes the second primary node of carrying the second primary subsystem, and The standby subsystem that the secondary node that described malfunctioning node does not include secondary node or described malfunctioning node includes carries is all not Be the described second primary subsystem acquiescence standby subsystem when,
Masterslave switchover is carried out to the 3rd secondary node of the second primary node and carrying the 3rd standby subsystem, so that described 3rd secondary node carries the described second primary subsystem, and described 3rd standby subsystem is the silent of the described second primary subsystem Recognize standby subsystem.
Preferably, to the second primary node and carry the 3rd standby subsystem the 3rd secondary node carry out active and standby fall Change, so that before described 3rd secondary node carries the described second primary subsystem, methods described also includes:
Judge that the described second primary subsystem whether there is and give tacit consent to standby subsystem;
If existing, by the described second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem;
If not existing, by the described second primary binding of subsystems to the 4th standby section carrying the 4th standby subsystem Point, described 4th standby subsystem is the standby subsystem of non-default of the described second primary subsystem.
Preferably, by the described second primary binding of subsystems to described 3rd secondary node or described 4th secondary node Afterwards, methods described also includes:
Detect whether the described second primary node recovers normal;
If described second primary node recovers normal, by the described second primary binding of subsystems to described second primary section Point.
Preferably, methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes carrying the 5th secondary node of the 5th standby subsystem, and The primary subsystem that the primary node that described malfunctioning node does not include primary node or described malfunctioning node includes carries silent When to recognize sub standby subsystem be not described five standby subsystem,
According to the priority of described 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, So that described 6th secondary node carries described 5th standby subsystem.
Preferably, methods described also includes:
It is primary subsystem and standby subsystem setting priority.
Preferably, methods described also includes:
Arrange for primary subsystem and give tacit consent to standby subsystem.
Preferably, the quantity of primary node carrying primary subsystem and the quantity of the secondary node carrying standby subsystem Equal, each described primary subsystem has a standby subsystem of acquiescence.
Preferably, described primary node and described secondary node are the process on server, the institute on same server State the process that the standby subsystem of acquiescence of the primary subsystem that primary node carries is located, be evenly distributed in and described server phase On adjacent two servers.
Another further aspect, embodiments provides a kind of redundancy protection device, and described device includes:
, for detecting whether there is malfunctioning node in detection module;
, there is malfunctioning node for working as, and described malfunctioning node include carrying the first primary subsystem in the first performing module The first primary node and carry the first standby subsystem the first secondary node when, by the described first primary binding of subsystems To the second secondary node carrying the second standby subsystem, so that described second secondary node carries the described first primary subsystem System, described first standby subsystem is the standby subsystem of acquiescence of the described first primary subsystem, described second standby subsystem It is the standby subsystem of non-default of the described first primary subsystem.
Preferably, described first performing module is used for,
When there is malfunctioning node, and described malfunctioning node include carrying the first primary subsystem the first primary node and Carry the first standby subsystem the first secondary node when, in the standby subsystem of non-default of the described first primary subsystem, Select the minimum standby subsystem of a priority, the secondary node that selected described standby subsystem is located is as described Second secondary node, and by the described first primary binding of subsystems to described second secondary node.
Preferably, described device also includes:
First judge module, for judging whether the described first primary subsystem possesses the ability of seizing;
Correspondingly, described first performing module is additionally operable to, when the described first primary subsystem possesses and seizes ability, by institute State the first primary binding of subsystems to described second secondary node.
Preferably, described detection module is additionally operable to, and whether detection the described first primary node and described first secondary node Recover normal;
Correspondingly, described first performing module is additionally operable to, when the described first primary node first recovers normal, then will be described First standby binding of subsystems is to described first primary node, so that described first primary node carries the described first standby subsystem System;When described first secondary node first recovers normal, then by the described first standby binding of subsystems to described first standby section Point, so that described first secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to, described by the described first standby binding of subsystems to described the One primary node, so that after described first primary node carries the described first standby subsystem,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to,
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first standby section Put and discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to, described by the described first standby binding of subsystems to described first Secondary node, so that after described first secondary node carries the described first standby subsystem,
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to,
After the described first primary node recovers normal, the described first standby binding of subsystems is primary to described first Node simultaneously discharges described second secondary node, so that described first primary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described device also includes:
Second performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes Carry the second primary node of the second primary subsystem, and described malfunctioning node does not include secondary node or described malfunctioning node Including secondary node carry standby subsystem be not the described second primary subsystem acquiescence standby subsystem when, by institute State the second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem, so that described 3rd secondary node holds Carry the described second primary subsystem, described 3rd standby subsystem is the standby subsystem of acquiescence of the described second primary subsystem.
Preferably, described device also includes:
Second judge module, gives tacit consent to standby subsystem for judging that the described second primary subsystem whether there is;
Correspondingly, described second performing module is additionally operable to, in the presence of the judged result of described second judge module is, will Described second primary binding of subsystems is to the 3rd secondary node carrying the 3rd standby subsystem;When described second judge module When judged result is not exist, by the described second primary binding of subsystems to the 4th standby section carrying the 4th standby subsystem Point, described 4th standby subsystem is the standby subsystem of non-default of the described second primary subsystem.
Preferably, described detection module is additionally operable to, and whether detection the described second primary node recovers normal;
Correspondingly, described second performing module is additionally operable to, when the described second primary node recovers normal, then by described second Primary binding of subsystems is to described second primary node.
Preferably, described device also includes:
3rd performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes Carry the 5th secondary node of the 5th standby subsystem, and described malfunctioning node does not include primary node or described malfunctioning node Including the standby subsystem of acquiescence of primary subsystem that carries of primary node be not described five standby subsystem when, root According to the priority of described 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, so that described the Six secondary nodes carry described 5th standby subsystem.
Preferably, described device also includes:
First setup module, for being primary subsystem and standby subsystem setting priority.
Preferably, described device also includes:
Second setup module, gives tacit consent to standby subsystem for arranging for primary subsystem.
Another aspect, embodiments provides a kind of redundancy protecting equipment, and described equipment includes:Processor and storage Device, for executing aforementioned redundancy protected method.
On the other hand, a kind of redundancy protection systems are embodiments provided, described system includes:Multiple nodes, with And aforementioned redundancy protection device, the plurality of node is used for executing task under the control of described device.
Alternatively, described node can be the process on server or server.
Preferably, when described node be server on process when, carry primary subsystem the quantity of primary node and The quantity carrying the secondary node of standby subsystem is equal, and each described primary subsystem has a standby subsystem of acquiescence System, the process at the acquiescence standby subsystem place of the primary subsystem that the described primary node on same server carries, averagely It is distributed on the two neighboring server with described server.
The beneficial effect that technical scheme provided in an embodiment of the present invention is brought is:By carrying the first primary subsystem First primary node and the first standby section carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) During the equal fault of point, by the first primary binding of subsystems, to carrying the second standby subsystem, (non-default of the first primary subsystem is standby With subsystem) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first The normal operation of primary subsystem;Therefore, the embodiment of the present invention can support multiple node failures, and effectively increase system can By property.
Brief description
For the technical scheme being illustrated more clearly that in the embodiment of the present invention, will make to required in embodiment description below Accompanying drawing be briefly described it should be apparent that, drawings in the following description are only some embodiments of the present invention, for For those of ordinary skill in the art, on the premise of not paying creative work, other can also be obtained according to these accompanying drawings Accompanying drawing.
Fig. 1 is a kind of flow chart of redundancy protected method that the embodiment of the present invention 1 provides;
Fig. 2 is the structural representation of the radio network controller that the embodiment of the present invention 2 provides;
Fig. 3 is a kind of flow chart of redundancy protected method that the embodiment of the present invention 2 provides;
Fig. 4 is the schematic diagram of the two-way backup mode that the embodiment of the present invention 2 provides;
Fig. 5-Figure 12 is the schematic diagram of the concrete application of redundancy protected method that the embodiment of the present invention 2 provides;
Figure 13 is a kind of structural representation of redundancy protection device that the embodiment of the present invention 3 provides;
Figure 14 is a kind of structural representation of redundancy protection device that the embodiment of the present invention 4 provides;
Figure 15 is a kind of structural representation of redundancy protecting equipment that the embodiment of the present invention 5 provides;
Figure 16 is a kind of structural representation of redundancy protection systems that the embodiment of the present invention 6 provides.
Specific embodiment
For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with accompanying drawing to embodiment party of the present invention Formula is described in further detail.
Embodiment 1
Referring to Fig. 1, embodiments provide a kind of redundancy protected method, the method flow process includes:
Step 101:Detect whether there is malfunctioning node.
Specifically, in embodiments of the present invention, node is the resource for executing certain task, can be a service A process on device or server.
Step 102:When there is malfunctioning node, and malfunctioning node includes the first primary section of carrying the first primary subsystem During the first secondary node of point and carrying the first standby subsystem, by the first primary binding of subsystems to carrying the second spare Second secondary node of system, so that this second secondary node carries the first primary subsystem, this first standby subsystem is the The standby subsystem of acquiescence of one primary subsystem, this second standby subsystem is the standby subsystem of non-default of the first primary subsystem System.
Specifically, in embodiments of the present invention, the standby subsystem of the acquiescence of primary subsystem, referring to can this master of backed up in synchronization Standby subsystem with the service data of subsystem.The standby subsystem of non-default of primary subsystem refers to backed up in synchronization to be somebody's turn to do The standby subsystem of the service data of primary subsystem, in implementing, primary subsystem and the standby subsystem of acquiescence are permissible It is one-to-one it is also possible to not be one-to-one.In the case of corresponding, the acquiescence of a primary subsystem is standby Subsystem is the non-default subsystem of another primary subsystem;In the case of not being to correspond, a primary subsystem The standby subsystem of non-default can be the standby subsystem of acquiescence of another primary subsystem it is also possible to not be any primary son The standby subsystem of acquiescence of system.
The beneficial effect of said method provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system Property.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem simultaneous faults, primary subsystem is acceptable Seize other secondary nodes, therefore can ensure that the operation of system using less secondary node, saved resource.
Embodiment 2
As a example the embodiment of the present invention will be with RNC (Radio Network Controller, radio network controller), to this The redundancy protected method of invention illustrates.The understanding of the technical scheme for ease of describing to the embodiment of the present invention, introduces first RNC once.RNC is a key network element of 3G (3rd Generation, 3G (Third Generation) Moblie) network and the group of access network Become part, for providing mobile management, call treatment, url management and handover mechanism.In order to ensure the reliability of mobile communication Property, RNC is typically a Cluster system.Referring to Fig. 2, RNC is made up of the veneer (i.e. service entities) of some interconnections.Veneer For functional module, it can be the blade server that specific function is provided.Veneer includes some processes (i.e. node).Process is single The abstract embodiment of CPU computing resource on plate, a process is exactly a physical manipulations unit.Each process carries an independence Logic application (i.e. task).Here, an independent Logic application is referred to as a subsystem.
With reference to Fig. 3, redundancy protected method provided in an embodiment of the present invention is described, referring to Fig. 3, the method includes:
Step 201:Arrange for primary subsystem and give tacit consent to standby subsystem.
Specifically, by setting up the corresponding relation between primary subsystem and standby subsystem, you can for primary subsystem Standby subsystem is given tacit consent in setting.In the running of primary subsystem, the standby subsystem of acquiescence of this primary subsystem can be right It synchronizes backup.
Preferably, referring to Fig. 4, in embodiments of the present invention, primary subsystem is equal with the quantity of standby subsystem, primary Subsystem is corresponded with standby subsystem, each primary subsystem have one give tacit consent to standby subsystem that is to say, that RNC is by the way of 1+1 backup.
It is highly preferred that the embodiment of the present invention can adopt two-way backup mode.Two-way backup mode refers to, first, in each list The primary subsystem of equal amount and standby subsystem are disposed on plate;Secondly, make a part of primary subsystem on a veneer With the spare system relationship on previous veneer, the standby subsystem pass on the primary subsystem of another part and a rear veneer Connection is that is to say, that two lists given tacit consent to before and after standby subsystem is evenly distributed in this veneer of primary subsystem on this veneer On plate.So, when veneer breaks down, the load of this veneer will be split on former and later two veneers, in front and back's veneer Impact is less.It is readily apparent that, the forward and backward veneer referring to same type adjacent in logic here, such as, in same machine frame Interior, including two kinds of veneer, one kind is redundancy protecting veneer, and the subsystem running thereon is all using the side of redundancy protecting Formula, and another kind is non-redundancy protecting veneer, the subsystem running thereon is not by the way of redundancy protecting.Assume that redundancy is protected The slot number that shield veneer is located in this machine frame is 0,1,2,4,5,6, then, for the veneer of No. 2 groove positions, before and after it Two veneers are respectively the veneer of No. 1 groove position and the veneer of No. 4 groove positions.
Obviously, existing Ring-backup mode may also be employed, by all primary subsystem on a veneer and adjacent The spare system relationship of individual veneer.It is readily apparent that, here the veneer of the adjacent same type also referring to Logic adjacent.
What deserves to be explained is, in the present embodiment, employ the mode of 1+1 backup, that is, primary subsystem with standby The quantity of subsystem is equal, and each primary subsystem has a standby subsystem of acquiescence, however, in other embodiments, main The quantity of standby subsystem can also be more than or less than with the quantity of subsystem.When the quantity of standby subsystem is more than primary subsystem During the quantity of system, there is the standby subsystem of redundancy.When the quantity of primary subsystem is more than the quantity of standby subsystem, permissible Priority (see step 202) according to primary subsystem sets up primary subsystem and the corresponding relation of standby subsystem, such as, first Set up the corresponding relation between the high primary subsystem of priority and standby subsystem.
Step 202:It is primary subsystem and standby subsystem setting priority.
Further, this step 202 can include:According to the type of service of subsystem, user type and/or region, it is Primary subsystem and standby subsystem setting priority.
Preferably, in the present embodiment, can be primary subsystem setting priority according to the type of service of subsystem. For example, the type of service according to subsystem, primary system subdivision is 4 priority, high priority, medium priority, low excellent First level and 0 priority.Wherein, high priority can be responsible for the subsystem that signaling is processed, this primary subsystem fault in load RNC All business in RNC will be affected, impact is maximum.Medium priority can be for being responsible for the subsystem of cell management in load RNC, this is primary The cell of this management subsystem that can lead to subsystem fault exits service, and impact is larger.Low priority can be reserved, is easy to follow-up Extension.0 priority can be for being responsible for the subsystem of call treatment in load RNC, this primary subsystem fault can lead to this subsystem On currently ongoing call, impact is minimum.
The priority of the corresponding primary subsystem of priority inheritance of standby subsystem, that is, primary subsystem is preset preferential Level is identical with the priority of the standby subsystem of acquiescence of this primary subsystem.The priority of standby subsystem is only used for standby subsystem Priority height is compared between system.
Preferably, the method can also include:Arrange primary subsystem seizes ability.
Specifically, the seizing ability and can also be represented using priority of primary subsystem.Such as, can be in setting priority Shi Dingyi, the primary subsystem of 0 priority does not possess the ability of seizing that is to say, that working as the primary subsystem carrying 0 priority During process failure, it can not seize the standby subsystem of other non-default.Certainly, seize ability and can not also adopt priority list Show, and adopt other specification to represent.
In the present embodiment, the ability of seizing of primary subsystem is also adopted by priority and represents, so only non-zero priority Primary subsystem just possesses the ability of seizing that is to say, that the priority of the primary subsystem of all non-zero priority is more than arbitrarily standby Priority with subsystem.
Alternatively, in other embodiments it is also possible to according to user type, be primary subsystem and the setting of standby subsystem Priority.Such as, the business of honored guest user all can be assigned on some subsystems, and the business of domestic consumer is all distributed To on other subsystems, high towards the priority of the subsystem than domestic consumer for the subsystem of honored guest user.
Alternatively, in other embodiments, can also be preferential according to being arranged with subsystem and standby subsystem based on region Level.Such as, can be and close by population by the traffic assignments of service area (such as city) big for population density to some subsystems The traffic assignments spending little service area (such as mountain area) are to other subsystems, the corresponding son of the big service area of population density The priority of the corresponding subsystem of the system service area less than population density is high.
It should be noted that in implementing, step 201 and 202 does not have sequencing, can execute simultaneously.Additionally, Step 201 and 202 is optional step, can pre-set.
Step 203:Detect whether there is malfunctioning node.
When there is malfunctioning node, and this malfunctioning node includes carrying the first primary node of the first primary subsystem and holds Carry the first standby subsystem the first secondary node when, execution step 204, wherein, the first standby subsystem is the first primary son The standby subsystem of acquiescence of system;
When there is malfunctioning node, and this malfunctioning node includes the second primary node of carrying the second primary subsystem, and is somebody's turn to do The standby subsystem that the secondary node that malfunctioning node does not include secondary node or this malfunctioning node includes carries be not this During the standby subsystem of acquiescence of two primary subsystems, execution step 214;
When there is malfunctioning node, and this malfunctioning node includes carrying the 5th secondary node of the 5th standby subsystem, and is somebody's turn to do Acquiescence of the primary subsystem that the primary node that malfunctioning node does not include primary node or this malfunctioning node includes carries is standby When being not five standby subsystems with subsystem, execution step 218.
Further, in the present embodiment, detect whether there is malfunctioning node, including following two situations:
If the first simply certain node (i.e. process) on veneer break down, the detection Programmable detection on veneer arrives After this fault, can be by operating system interface to control centre (i.e. in the device in embodiment three or four or embodiment five Equipment) reporting fault node, control centre knows malfunctioning node according to the information that veneer reports.
If second single board default, such as hardware fault or network failure, in this case, due to control centre and list Timing handshake message can be sent, once there is single board default, control centre in the scheduled time or does not for a long time receive report between plate Literary composition, now then it represents that all nodes all faults on this veneer.
Be readily apparent that, generally preserve in control centre the corresponding relation of node and subsystem and primary subsystem and Give tacit consent to the corresponding relation (such as in the form of a list) of standby subsystem, so, when there is malfunctioning node, aforementioned by inquiring about Corresponding relation is it is possible to know the type of the subsystem that malfunctioning node carries.
Specifically, referring to Fig. 5, in the present embodiment it can be assumed that malfunctioning node includes the first primary node a and first standby Use node b, a carries the first primary subsystem X, b carries the first standby subsystem X ', X ' is the standby subsystem of acquiescence of X.
Preferably, after detecting whether to there is malfunctioning node, the method also includes:
Detect whether after restart the scheduled time interior recovery is normal for the first primary node and the first secondary node.
In the present embodiment, node is the process on veneer, so, detecting the first primary node and the first standby section After point failure, this first primary node and the first secondary node should be restarted immediately in control centre.And, first should be detected after restarting Whether primary node and the first secondary node recover normal in the given time.When the first primary node or the first secondary node are extensive When multiple normal, initiation message can be sent to control centre, control centre can judge the first primary node according to this initiation message Whether recover normal with the first secondary node.
Specifically, this scheduled time is to pre-set, and is used for making malfunctioning node complete to restart, when being typically additionally provided with Between protect surplus.For example, the starting time 90 seconds+guardtime surplus of this scheduled time=subsystem 30 seconds=120 seconds.
Correspondingly, if the first primary node and the first secondary node all do not recover normal after restart in the scheduled time, then Execution subsequent step 204, if the first primary node and the first secondary node recover normal in the given time, does not execute follow-up Step 204.
Step 204:Judge whether the first primary subsystem possesses the ability of seizing.If this first primary subsystem possesses seizing Ability, then execution step 205;If this first primary subsystem does not possess the ability of seizing, exit this flow process.
Specifically, if seizing ability using what priority represented primary subsystem, then, judge the first primary subsystem Whether possess the ability of seizing to include:Whether the priority judging this first primary subsystem is non-zero priority.If this is first primary The priority of subsystem is non-zero priority, then this first primary subsystem possesses the ability of seizing.If this first primary subsystem Priority is 0 priority, then this first primary subsystem does not possess the ability of seizing.
Step 205:By the first primary binding of subsystems to the second secondary node carrying the second standby subsystem, so that the Two secondary nodes carry the first primary subsystem;This second standby subsystem is the standby subsystem of non-default of the first primary subsystem System.
Specifically, this step 205 includes:
In the standby subsystem of non-default of the first primary subsystem, select the minimum standby subsystem of a priority, The secondary node that selected standby subsystem is located is as the second secondary node, and the first primary binding of subsystems extremely should Second secondary node.
In the rnc, by the first primary binding of subsystems to the second secondary node, including:
The start-up parameter data of the first primary subsystem is sent to the second secondary node, so that the second secondary node is adopted Run with the start-up parameter data receiving, this is the state of the art, here is omitted and described in detail.
Referring to Fig. 6 it can be assumed that the second secondary node is c, c initially carries the second standby subsystem Y '.Step 205 it Afterwards, X is bound to c.
Be readily apparent that, by the first primary binding of subsystems to carry the second standby subsystem the second secondary node it Before, the corresponding relation between the subsystem of its preservation and node can refresh in control centre, then according to the corresponding relation after refreshing, Send aforementioned start-up parameter.
Step 206:Detect whether the first primary node and the first secondary node recover normal;If the first primary node is first extensive Multiple normal, then execution step 207;If the first secondary node first recovers normal, execution step 210.
Specifically, when malfunctioning node recovers normal, initiation message can be sent to control centre, so, control centre can With according to whether the initiation message receiving malfunctioning node transmission to judge whether this malfunctioning node recovers normal.
In the present embodiment, only describe the first primary node and the first secondary node recovers normal situation (in succession One primary node first recovers normal or the first secondary node and first recovers normal).In theory, it is understood that there may be the first primary node and One secondary node recovers normal situation simultaneously.However, in the implementing of the embodiment of the present invention, due to holding of all steps Row is all to be processed in the way of interacting message, for control centre, is only possible to first receive the first primary node and The one of initiation message sending of one secondary node.Therefore, in actual applications, be not in the first primary node and One secondary node recovers normal situation simultaneously.
Step 207:By the first standby binding of subsystems to the first primary node, so that the first primary node carrying first is standby Use subsystem.
Specifically, when the first primary node first recovers normal, by the first standby binding of subsystems to this first primary section Point so that after the first primary node starts, can be previously run the first standby subsystem the first primary subsystem is carried out standby Part, after the completion of being easy to backup, the first primary subsystem is tied up back the first primary node again.Referring to Fig. 7, when a first recovers normal, X ' is tied to a.
Step 208:Masterslave switchover is carried out to the first primary node and the second secondary node, so that the first primary node carries First primary subsystem, and the second secondary node carries the first standby subsystem.
Specifically, the backup (BACKUP TIME can be set) to the first primary subsystem is completed in the first standby subsystem Afterwards, masterslave switchover is carried out to the first primary node and the second secondary node:By the first primary binding of subsystems to the first primary section Point, so that the first primary node carries the first primary subsystem again;By the first standby binding of subsystems to the second secondary node, So that the second secondary node carries this first standby subsystem.Referring to Fig. 8, when carrying out the masterslave switchover between node:X is tied up back a, A is made again to carry X.X ' is tied to c, makes c start to carry X '.
Step 209:After the first secondary node recovers normal, by the first standby binding of subsystems to the first secondary node simultaneously Discharge the second secondary node, so that the first secondary node carries the first standby subsystem.
Specifically, after the first primary node recovers normal, the first secondary node has also recovered normally, at this moment, by first Standby binding of subsystems to the first secondary node and discharges the second secondary node, so that the first secondary node carries the first spare System, and restart the second secondary node being released, so that it recovers original standby subsystem (the second standby subsystem).Ginseng See Fig. 9, after b also recovers normal, X ' is tied up back b, and discharges c.
So, by step 207-209, the first primary subsystem, the first standby subsystem and the second standby subsystem are equal Recover to original state.
Step 210:By the first standby binding of subsystems to the first secondary node, so that the first secondary node carrying first is standby Use subsystem.
Specifically, when the first secondary node first recovers normal, by the first standby binding of subsystems to the first secondary node, So that the first standby subsystem backs up to the first primary subsystem.Referring to Figure 10 it is assumed that after a and b fault, b just first recovers Often.Then X ' is tied to b.
Step 211:Masterslave switchover is carried out to the first secondary node and the second secondary node, so that the first secondary node carries First primary subsystem, and the second secondary node carries the first standby subsystem.
Specifically, the backup (BACKUP TIME can be set) to the first primary subsystem is completed in the first standby subsystem Afterwards, masterslave switchover is carried out to the first secondary node and the second secondary node:By the first primary binding of subsystems to the first standby section Point, by the first standby binding of subsystems to the second secondary node.Referring to Figure 11, b first recovers normal and is tied to after b by X ', to b Carry out masterslave switchover with c:X is bind to b, X ' is bind to c.
Step 212:After the first primary node recovers normal, by the first standby binding of subsystems to the first primary node And discharge the second secondary node, so that the first primary node carries the first standby subsystem.
Specifically, after the first secondary node recovers normal, the first primary node has also recovered normally, at this moment, by first Standby binding of subsystems to the first primary node and discharges the second secondary node, so that the first primary node carries the first spare System.After first standby binding of subsystems to the first primary node, start the first primary subsystem is backed up.Referring to figure 12, first recover a after normally in b and also recovered normally, at this moment, X ' to be bind to a, and discharge c.
Step 213:Masterslave switchover is carried out to the first primary node and the first secondary node, so that the first primary node carries First primary subsystem, and the first secondary node carries the first standby subsystem.
Specifically, treat that the first standby subsystem completes the backup to the first primary subsystem, to the first primary node and One secondary node carries out masterslave switchover:By the first primary binding of subsystems to the first primary node, the first standby subsystem is tied up Determine to the first secondary node.Refer again to Fig. 9, after X ' completes the backup to X, then the masterslave switchover carrying out a and b:X is tied up back a, X ' ties up back b.
Similarly, by step 210-213, the first primary subsystem, the first standby subsystem and the second standby subsystem All recover to original state.
It should be noted that step 208-209 and step 211-213 are preferred steps, facilitate implementation the load of system Balance.
Step 214:Judge that the second primary subsystem whether there is and give tacit consent to standby subsystem.If existing, execution step 215;If not existing, execution step 216.
Specifically, judge that the second primary subsystem whether there is and give tacit consent to standby subsystem, including:Preserve according in server Primary subsystem and give tacit consent to standby subsystem corresponding relation, judge the second primary subsystem whether there is acquiescence standby subsystem System.
Step 215:Masterslave switchover is carried out to the 3rd secondary node of the second primary node and carrying the 3rd standby subsystem, So that the 3rd secondary node carries the second primary subsystem, the 3rd standby subsystem is that the acquiescence of the second primary subsystem is standby Subsystem.
Specifically, if the second primary subsystem exists gives tacit consent to standby subsystem, and the standby subsystem of this acquiescence is standby for the 3rd With subsystem, then by the second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem.
Step 216:By the second primary binding of subsystems to carry the 4th standby subsystem the 4th secondary node, the 4th Standby subsystem is the standby subsystem of non-default of the second primary subsystem.
Specifically, by the second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, same to step The process of 204~step 205 description, will not be described in detail herein.
Step 217:Detect whether the second primary node recovers normal;If the second primary node recovers normal, by second Primary binding of subsystems is to the second primary node.
Specifically, detection process is identical with the detection process that step 206 describes, and will not be described in detail herein.If the second primary section Point recovers normal, then by the second primary binding of subsystems to the second primary node;If this second primary node does not recover normal, Proceed to detect.
Specifically, if the second primary node recovers normal, the second primary binding of subsystems to the second primary node wraps Include two kinds of situations:
The first situation:As step 215 describes, the 3rd secondary node carries the second primary subsystem, at this moment, if second is main Recover normal with node, then first by the 3rd standby binding of subsystems to the second primary node so that the second primary node carries the Three standby subsystems;Then, masterslave switchover is carried out to the second primary node and the 3rd secondary node, so that the second primary node holds Carry the second primary subsystem, and the 3rd secondary node carries the 3rd standby subsystem.
Second situation:As step 216 describes, the 4th secondary node carries the second primary subsystem, at this moment, if second is main Recover normal with node, then by the second primary binding of subsystems to the second primary node, and discharge the 4th secondary node.
Step 218:According to the priority of the 5th standby subsystem, seize the 6th standby section carrying the 6th standby subsystem Point, so that the 6th secondary node carries the 5th standby subsystem.
Wherein, the priority of the standby subsystem of priority ratio the 5th of the 6th standby subsystem is low.Further, seize Process can be found in step 205, will not be described in detail herein.
Preferably, after detecting the 5th secondary node fault, the 5th secondary node should be restarted immediately in control centre.Weight Qi Hou, should detect whether the 5th secondary node recovers normal in the given time;If so, then do not seized;If it is not, then executing Seize.
It is highly preferred that after the 5th standby subsystem seizes the 6th secondary node, the method also includes:
After the 5th secondary node recovers normal, discharge the 6th secondary node, and by the 5th standby binding of subsystems extremely 5th secondary node, so that the 5th secondary node carries the 5th standby subsystem again.
What deserves to be explained is, in the present embodiment, node is a process on veneer in RNC, in other embodiments, Node can be a money on a server or server in a veneer or other Cluster systems in RNC Source.
The beneficial effect of said method provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem System) the second secondary node so that this second secondary node carry the first primary subsystem, such that it is able to fast quick-recovery fault The first primary subsystem it is ensured that the normal operation of the first primary subsystem;Therefore, the embodiment of the present invention can support multiple sections Point failure, effectively increases the reliability of system.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem During simultaneous faults, primary subsystem can also seize other secondary nodes, therefore be can ensure that using less secondary node The operation of system, has saved resource.In addition, when carrying the secondary node fault of standby subsystem of high priority, Ke Yigen Seize the secondary node of the standby subsystem carrying low priority according to priority, therefore can preferentially ensure the primary of high priority Subsystem exists gives tacit consent to standby subsystem, that is, ensure the high primary subsystem of priority, preferentially have redundancy protecting, such that it is able to Reduce the risk of service disconnection, improve the reliability of system further, and the mode seized according to priority, realize simple.
Embodiment 3
Embodiments provide a kind of redundancy protection device, referring to Figure 13, this device includes:
, for detecting whether there is malfunctioning node in detection module 301.
First performing module 302, for detecting the presence of malfunctioning node when detection module 301, and malfunctioning node includes holding The the first primary node carrying the first primary subsystem and carry the first standby subsystem the first secondary node when, main by first With binding of subsystems to the second secondary node carrying the second standby subsystem, so that the second secondary node carries the first primary son System, this first standby subsystem is the standby subsystem of acquiescence of the first primary subsystem, and this second standby subsystem is first The standby subsystem of non-default of primary subsystem.
The beneficial effect of said apparatus provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system Property.
Embodiment 4
Embodiments provide a kind of redundancy protection device, referring to Figure 14, the difference with the device of embodiment 3 It is, the first performing module 302 is additionally operable to, when detection module 301 detects the presence of malfunctioning node, and malfunctioning node includes carrying During the first secondary node of the first primary node of the first primary subsystem and carrying the first standby subsystem, primary first In the standby subsystem of non-default of subsystem, select the minimum standby subsystem of a priority, by selected standby subsystem Unite the secondary node being located as the second secondary node, and by the first primary binding of subsystems to the second secondary node.
Further, this device also includes:First judge module 403, for judging whether the first primary subsystem possesses Seize ability;
Correspondingly, the first performing module 302 is additionally operable to, when the first primary subsystem possesses and seizes ability, main by first With binding of subsystems to the second secondary node.
Further, detection module 301 is additionally operable to, and whether detection the first primary node and the first secondary node just recover Often;
Correspondingly, the first performing module 302 is additionally operable to, when the first primary node first recovers normal, by the first spare System binding is to the first primary node, so that the first primary node carries the first standby subsystem;When the first secondary node is first extensive When multiple normal, by the first standby binding of subsystems to the first secondary node, so that the first secondary node carries the first standby subsystem System.
Further, the first performing module 302 is additionally operable to, by the first standby binding of subsystems to the first primary node, So that after the first primary node carries the first standby subsystem, the first primary node and the second secondary node are carried out active and standby fall Change, so that the first primary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Further, the first performing module 302 is additionally operable to, after the first secondary node recovers normal, by the first spare System binding to the first secondary node and discharges the second secondary node, so that the first secondary node carries the described first standby subsystem System.
Further, the first performing module 302 is additionally operable to, by the first standby binding of subsystems to the first secondary node, So that first secondary node carry the first standby subsystem after, the first secondary node and the second secondary node are carried out active and standby fall Change, so that the first secondary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Further, the first performing module 302 is additionally operable to, after the first primary node recovers normal, standby by first Binding of subsystems to the first primary node and discharges the second secondary node, so that the first primary node carries the first standby subsystem System.
Further, the first performing module 302 is additionally operable to, the first primary node and the second secondary node are carried out active and standby fall Change, so that the first primary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Further, this device also includes:
Second performing module 404, for detecting the presence of malfunctioning node when detection module 301, and malfunctioning node includes holding Carry the second primary node of the second primary subsystem, and malfunctioning node does not include secondary node or that malfunctioning node includes is standby When the standby subsystem that node carries not is the standby subsystem of acquiescence of the second primary subsystem, the second primary subsystem is tied up Fixed to the 3rd secondary node carrying the 3rd standby subsystem so that the 3rd secondary node carries the second primary subsystem, this Three standby subsystems are the standby subsystem of acquiescence of the second primary subsystem.
Further, this device also includes:Second judge module 405, for judging that the second primary subsystem whether there is Give tacit consent to standby subsystem;
Correspondingly, the second performing module 404 is additionally operable to, in the presence of the judged result of the second judge module 405 is, by Two primary binding of subsystems are to the 3rd secondary node carrying the 3rd standby subsystem;Judgement knot when the second judge module 405 When fruit is not exist, by the second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, the 4th is standby The standby subsystem of non-default being the second primary subsystem with subsystem.
Further, detection module 301 is additionally operable to, and whether detection the second primary node recovers normal;Correspondingly, second hold Row module 404 is additionally operable to, when the second primary node recovers normal, then by the second primary binding of subsystems to the second primary node.
Further, this device also includes:
3rd performing module 406, for detecting the presence of malfunctioning node when detection module 301, and malfunctioning node includes holding Carry the 5th secondary node of the 5th standby subsystem, and malfunctioning node does not include primary node or that malfunctioning node includes is primary When the standby subsystem of acquiescence of the primary subsystem that node carries not is five standby subsystem, according to the 5th standby subsystem The priority of system, seizes the 6th secondary node carrying the 6th standby subsystem, so that the 6th secondary node carrying the 5th is standby Subsystem.
Further, this device also includes:
First setup module 407, for being primary subsystem and standby subsystem setting priority.
Further, this device also includes:
Second setup module 408, gives tacit consent to standby subsystem for arranging for primary subsystem.
The beneficial effect of said apparatus provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system Property.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem simultaneous faults, primary subsystem is acceptable Seize other secondary nodes, therefore can ensure that the operation of system using less secondary node, saved resource.In addition, When carrying the secondary node fault of standby subsystem of high priority, can be seized according to priority and carry the standby of low priority With the secondary node of subsystem, therefore can preferentially ensure that the primary subsystem of high priority exists and give tacit consent to standby subsystem, from And improve the reliability of system further, and the mode seized according to priority, realize simple.
Embodiment 5
Embodiments provide a kind of redundancy protecting equipment, referring to Figure 15, this equipment includes processor 51 (for example CPU), memorizer 52 and network interface 53.
With reference to Figure 15, each component parts of redundancy protecting equipment 50 are specifically introduced:
Network interface 53 is used for and the communication connection of each node.
Memorizer 52 can be used for storing software program and/or application module, thus executing the various of redundancy protecting equipment 50 Application of function and data processing.Memorizer 52 can mainly include storing program area and storage data field, wherein, storing program area Can application program needed for storage program area, at least one function etc.;Storage data field can store according to redundancy protecting equipment The data that 50 use is created, such as corresponding relation between primary subsystem and standby subsystem, the priority of subsystem, And primary subsystem seize ability etc..Additionally, memorizer 52 can include high-speed RAM (Random Access Memory, Random access memory), nonvolatile memory (non-volatile memory), for example, at least one magnetic can also be included Disk storage device, flush memory device or other volatile solid-state parts.
Specifically, it is stored in software program and/or the application module in memorizer 52 by operation or execution, and call It is stored in the data in memorizer 52, processor 51 is it is achieved that detect whether there is malfunctioning node, and determines that malfunctioning node holds The subsystem carrying.Specifically, when there is malfunctioning node, and this malfunctioning node includes carrying the first primary of the first primary subsystem During the first secondary node of node and carrying the first standby subsystem, processor 51 passes through network interface 53 to the second standby section Point sends the instruction message (including the start-up parameter of the first primary subsystem) carrying the first primary subsystem, and this instruction message is used In by the second secondary node of the first primary binding of subsystems to carrying the second standby subsystem, so that this second secondary node holds Carry the first primary subsystem, the first standby subsystem is the standby subsystem of acquiescence of the first primary subsystem, this second spare System is the standby subsystem of non-default of the first primary subsystem.
Preferably, processor 51 is specifically for, in the standby subsystem of non-default of the first primary subsystem, selecting one The minimum standby subsystem of priority, using the secondary node at selected standby subsystem place as the second secondary node, and By the first primary binding of subsystems to this second secondary node.
Preferably, processor 51 can also be realized, and the first primary binding of subsystems is extremely being carried the second standby subsystem The second secondary node so that second secondary node carry primary subsystem before,
Judge whether the first primary subsystem possesses the ability of seizing;
If the first primary subsystem possesses the ability of seizing, by the first primary binding of subsystems to the second secondary node.
Preferably, processor 51 can also be realized, after by the first primary binding of subsystems to the second secondary node,
Detect whether the first primary node and the first secondary node recover normal;
If the first primary node first recovers normal, by the first standby binding of subsystems to the first primary node, so that the One primary node carries the first standby subsystem;
If the first secondary node first recovers normal, by the first standby binding of subsystems to the first secondary node, so that the One secondary node carries the first standby subsystem.
Further, processor 51 can also be realized, by the first standby binding of subsystems to the first primary node so that After first primary node carries the first standby subsystem, masterslave switchover is carried out to the first primary node and the second secondary node, So that the first primary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, and is carrying out masterslave switchover to the first primary node and the second secondary node Afterwards, after the first secondary node recovers normal, by the first standby binding of subsystems, to the first secondary node and to discharge second standby With node, so that the first secondary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, by the first standby binding of subsystems to the first secondary node so that After one secondary node carries the first standby subsystem, masterslave switchover is carried out to the first secondary node and the second secondary node, with Make the first secondary node carry the first primary subsystem, and the second secondary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, and is carrying out masterslave switchover to the first secondary node and the second secondary node Afterwards,
After the first primary node recovers normal, the first standby binding of subsystems to the first primary node and is discharged the Two secondary nodes, so that the first primary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, by the described first standby binding of subsystems to the first primary node simultaneously After discharging the second secondary node,
Masterslave switchover is carried out to the first primary node and the second secondary node, so that the first primary node carrying first is primary Subsystem, and the second secondary node carries the first standby subsystem.
Further, processor 51 can also be realized, when there is malfunctioning node, and this malfunctioning node include carrying second main With the second primary node of subsystem, and this malfunctioning node does not include the secondary node that secondary node or this malfunctioning node include When the standby subsystem carrying not is the standby subsystem of acquiescence of this second primary subsystem, processor 51 passes through network interface 53 to the 3rd secondary nodes send the main message of liter, and (the 3rd standby subsystem is the standby subsystem of acquiescence of the second primary subsystem System), this message is used for instruction the 3rd secondary node and carries the second primary subsystem;Or, processor 51 passes through network interface 53 Send the configured information (including the start-up parameter of the second primary subsystem) carrying the second primary subsystem to the 4th secondary node, This configured information is used for the second primary binding of subsystems to the 4th secondary node, so that the 4th secondary node carrying second is primary Subsystem, the 4th standby subsystem is the standby subsystem of non-default of the second primary subsystem.
Preferably, processor 51 can also be realized, before the 3rd secondary node carries the second primary subsystem,
Judge that the second primary subsystem whether there is and give tacit consent to standby subsystem;
If existing, by the second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem;
If not existing, by the second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, the Four standby subsystems are the standby subsystem of non-default of the second primary subsystem.
Preferably, processor 51 can also be realized, by the second primary binding of subsystems to the 3rd secondary node or the 4th After secondary node,
Detect whether the second primary node recovers normal;
If the second primary node recovers normal, by the second primary binding of subsystems to the second primary node.
Further, processor 51 can also be realized, when there is malfunctioning node, and this malfunctioning node include carrying the 5th standby With the 5th secondary node of subsystem, and this malfunctioning node does not include the primary node that primary node or this malfunctioning node include When the standby subsystem of acquiescence of the primary subsystem carrying not is five standby subsystem, processor 51 is connect by network Mouth 53 to the 6th secondary nodes send the configured information (priority ratio of the 6th standby subsystem carrying the 5th standby subsystems The priority of the 5th standby subsystem is low), this configured information is used for the 5th standby binding of subsystems to the 6th secondary node, with The 6th secondary node is made to carry the 5th standby subsystem.
Further, processor 51 can also be realized, and is primary subsystem and standby subsystem setting priority.
Further, processor 51 can also be realized, and is that standby subsystem is given tacit consent in primary subsystem setting.
It will be understood by those skilled in the art that the structure shown in Figure 15 does not constitute the limit to redundancy protecting equipment Fixed, it can include ratio and illustrate more or less of part, or combines some parts, or different part arrangements.
The beneficial effect of said system provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system Property.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem simultaneous faults, primary subsystem is acceptable Seize other secondary nodes, therefore can ensure that the operation of system using less secondary node, saved resource.In addition, When carrying the secondary node fault of standby subsystem of high priority, can be seized according to priority and carry the standby of low priority With the secondary node of subsystem, therefore can preferentially ensure that the primary subsystem of high priority exists and give tacit consent to standby subsystem, from And improve the reliability of system further, and the mode seized according to priority, realize simple.
Embodiment 6
Embodiments provide a kind of redundancy protection systems, referring to Figure 16, this system include multiple nodes 601, with And redundancy protection device 602.Wherein, this redundancy protection device 602 can be protected for the redundancy providing in the embodiment of the present invention 3 or 4 The redundancy protecting equipment providing in protection unit or the embodiment of the present invention 5.
Wherein, node 601 can be the process on server or server.
Further, when node 601 be server on process when, carry primary subsystem the quantity of primary node and The quantity carrying the secondary node of standby subsystem is equal, and each primary subsystem has a standby subsystem of acquiescence, with The process that the standby subsystem of acquiescence of the primary subsystem that the primary node on one server carries is located, is evenly distributed in and is somebody's turn to do On the two neighboring server of server.
As it was previously stated, the system of the present embodiment is applied to the RNC and IT (Information of other Cluster form Technology, information technology) product.
The beneficial effect of said system provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system Property.
It should be noted that:The redundancy protection device that above-described embodiment provides when executing redundancy protecting, only with above-mentioned each The division of functional module is illustrated, and in practical application, can distribute above-mentioned functions by different work(as desired Module can complete, the internal structure of device will be divided into different functional modules, with complete described above all or portion Divide function.In addition, the redundancy protection device that above-described embodiment provides belongs to same design with redundancy protected method embodiment, its tool Body is realized process and is referred to embodiment of the method, repeats no more here.
The embodiments of the present invention are for illustration only, do not represent the quality of embodiment.
One of ordinary skill in the art will appreciate that all or part of step realizing above-described embodiment can pass through hardware To complete it is also possible to the hardware being instructed correlation by program is completed, described program can be stored in a kind of computer-readable In storage medium, storage medium mentioned above can be read only memory, disk or CD etc..
The foregoing is only presently preferred embodiments of the present invention, not in order to limit the present invention, all spirit in the present invention and Within principle, any modification, equivalent substitution and improvement made etc., should be included within the scope of the present invention.

Claims (36)

1. a kind of redundancy protected method is it is characterised in that methods described includes:
Detect whether there is malfunctioning node;
When there is malfunctioning node, and described malfunctioning node includes carrying the first primary node of the first primary subsystem and carries During the first secondary node of the first standby subsystem, by the described first primary binding of subsystems to carrying the second standby subsystem Second secondary node, so that described second secondary node carries the described first primary subsystem, described first standby subsystem is The standby subsystem of acquiescence of described first primary subsystem, described second standby subsystem is the non-of the described first primary subsystem Give tacit consent to standby subsystem.
2. method according to claim 1 it is characterised in that described by the described first primary binding of subsystems to carrying the Second secondary node of two standby subsystems, so that described second secondary node carries described primary subsystem, including:
In the standby subsystem of non-default of the described first primary subsystem, select the minimum standby subsystem of a priority, The secondary node that selected described standby subsystem is located as described second secondary node, and by the described first primary son System binding is to described second secondary node.
3. method according to claim 1 and 2 it is characterised in that described by the described first primary binding of subsystems extremely Carry the second secondary node of the second standby subsystem, so that before described second secondary node carries described primary subsystem, Methods described also includes:
Judge whether the described first primary subsystem possesses the ability of seizing;
If described first primary subsystem possesses the ability of seizing, the described first primary binding of subsystems is standby to described second Node.
4. method according to claim 1 it is characterised in that by the described first primary binding of subsystems to described second After secondary node, methods described also includes:
Detect whether the described first primary node and described first secondary node recover normal;
If described first primary node first recovers normal, by the described first standby binding of subsystems to described first primary section Point, so that described first primary node carries the described first standby subsystem;
If described first secondary node first recovers normal, by the described first standby binding of subsystems to described first standby section Point, so that described first secondary node carries the described first standby subsystem.
5. method according to claim 4 it is characterised in that described by the described first standby binding of subsystems to described First primary node, so that after described first primary node carries the described first standby subsystem, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
6. method according to claim 5 is it is characterised in that described to the described first primary node and described second standby After carrying out masterslave switchover with node, methods described also includes:
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first secondary node simultaneously Discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
7. method according to claim 4 it is characterised in that described by the described first standby binding of subsystems to described First secondary node, so that after described first secondary node carries the described first standby subsystem, methods described also includes:
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node carries Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
8. method according to claim 7 is it is characterised in that described to described first secondary node and described second standby After carrying out masterslave switchover with node, methods described also includes:
After the described first primary node recovers normal, by the described first standby binding of subsystems to described first primary node And discharge described second secondary node, so that described first primary node carries the described first standby subsystem.
9. method according to claim 8 it is characterised in that described by the described first standby binding of subsystems to described First primary node after discharging described second secondary node, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
10. method according to claim 1 is it is characterised in that methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes the second primary node of carrying the second primary subsystem, and described The standby subsystem that the secondary node that malfunctioning node does not include secondary node or described malfunctioning node includes carries not is institute State the second primary subsystem acquiescence standby subsystem when,
Masterslave switchover is carried out to the 3rd secondary node of the second primary node and carrying the 3rd standby subsystem, so that the described 3rd Secondary node carries the described second primary subsystem, and described 3rd standby subsystem is that the acquiescence of the described second primary subsystem is standby Use subsystem.
11. methods according to claim 10 are it is characterised in that described to the second primary node with carry the 3rd standby 3rd secondary node of subsystem carries out masterslave switchover so that described 3rd secondary node carry the described second primary subsystem it Before, methods described also includes:
Judge that the described second primary subsystem whether there is and give tacit consent to standby subsystem;
If existing, by the described second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem;
If not existing, by the described second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, institute State the standby subsystem of non-default that the 4th standby subsystem is the described second primary subsystem.
12. methods according to claim 11 are it is characterised in that by the described second primary binding of subsystems to described the After three secondary nodes or described 4th secondary node, methods described also includes:
Detect whether the described second primary node recovers normal;
If described second primary node recovers normal, by the described second primary binding of subsystems to described second primary node.
13. methods according to claim 1 are it is characterised in that methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes carrying the 5th secondary node of the 5th standby subsystem, and described Acquiescence of the primary subsystem that the primary node that malfunctioning node does not include primary node or described malfunctioning node includes carries When standby subsystem is not described five standby subsystem,
According to the priority of described 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, so that Described 6th secondary node carries described 5th standby subsystem.
14. methods according to claim 1 are it is characterised in that methods described also includes:
It is primary subsystem and standby subsystem setting priority.
15. methods according to claim 1 are it is characterised in that methods described also includes:
Arrange for primary subsystem and give tacit consent to standby subsystem.
16. methods according to claim 1 are it is characterised in that carrying the quantity of primary node of primary subsystem and holding The quantity carrying the secondary node of standby subsystem is equal, and each described primary subsystem has a standby subsystem of acquiescence.
17. methods according to claim 16 are it is characterised in that described primary node and described secondary node are service Process on device, what the standby subsystem of acquiescence of the primary subsystem that the described primary node on same server carries was located enters Journey, is evenly distributed on two adjacent with described server server.
A kind of 18. redundancy protection devices are it is characterised in that described device includes:
, for detecting whether there is malfunctioning node in detection module;
First performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes carrying During the first secondary node of the first primary node of the first primary subsystem and carrying the first standby subsystem, by described first Primary binding of subsystems is to the second secondary node carrying the second standby subsystem, so that described second secondary node carrying is described First primary subsystem, described first standby subsystem is the standby subsystem of acquiescence of the described first primary subsystem, described Two standby subsystems are the standby subsystems of non-default of the described first primary subsystem.
19. devices according to claim 18 it is characterised in that described first performing module is used for,
When there is malfunctioning node, and described malfunctioning node includes carrying the first primary node of the first primary subsystem and carries During the first secondary node of the first standby subsystem, in the standby subsystem of non-default of the described first primary subsystem, select The minimum standby subsystem of one priority, the secondary node that selected described standby subsystem is located is as described second Secondary node, and by the described first primary binding of subsystems to described second secondary node.
20. devices according to claim 18 or 19 are it is characterised in that described device also includes:
First judge module, for judging whether the described first primary subsystem possesses the ability of seizing;
Correspondingly, described first performing module is additionally operable to, when the described first primary subsystem possesses and seizes ability, by described One primary binding of subsystems is to described second secondary node.
21. devices according to claim 18 it is characterised in that
Described detection module is additionally operable to, and whether detection the described first primary node and described first secondary node recover normal;
Correspondingly, described first performing module is additionally operable to, when the described first primary node first recovers normal, then by described first Standby binding of subsystems is to described first primary node, so that described first primary node carries the described first standby subsystem; When described first secondary node first recovers normal, then by the described first standby binding of subsystems to described first secondary node, So that described first secondary node carries the described first standby subsystem.
22. devices according to claim 21 it is characterised in that described first performing module is additionally operable to,
Described by the described first standby binding of subsystems to described first primary node, so that described first primary node carries After described first standby subsystem,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
23. devices according to claim 22 it is characterised in that described first performing module is additionally operable to,
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first secondary node simultaneously Discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
24. devices according to claim 21 it is characterised in that described first performing module is additionally operable to,
Described by the described first standby binding of subsystems to described first secondary node, so that described first secondary node carries After described first standby subsystem,
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node carries Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
25. devices according to claim 24 it is characterised in that described first performing module is additionally operable to,
After the described first primary node recovers normal, by the described first standby binding of subsystems to described first primary node And discharge described second secondary node, so that described first primary node carries the described first standby subsystem.
26. devices according to claim 25 it is characterised in that described first performing module is additionally operable to,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
27. devices according to claim 18 are it is characterised in that described device also includes:
Second performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes carrying Second primary node of the second primary subsystem, and described malfunctioning node does not include secondary node or described malfunctioning node includes The standby subsystem that carries of secondary node be not the described second primary subsystem acquiescence standby subsystem when, by described the Two primary binding of subsystems are to the 3rd secondary node carrying the 3rd standby subsystem, so that described 3rd secondary node carries institute State the second primary subsystem, described 3rd standby subsystem is the standby subsystem of acquiescence of the described second primary subsystem.
28. devices according to claim 27 are it is characterised in that described device also includes:
Second judge module, gives tacit consent to standby subsystem for judging that the described second primary subsystem whether there is;
Correspondingly, described second performing module is additionally operable to, in the presence of the judged result of described second judge module is, will be described Second primary binding of subsystems is to the 3rd secondary node carrying the 3rd standby subsystem;Judgement when described second judge module When result is not exist, by the described second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, institute State the standby subsystem of non-default that the 4th standby subsystem is the described second primary subsystem.
29. devices according to claim 28 it is characterised in that
Described detection module is additionally operable to, and whether detection the described second primary node recovers normal;
Correspondingly, described second performing module is additionally operable to, when the described second primary node recovers normal, then primary by described second Binding of subsystems is to described second primary node.
30. devices according to claim 18 are it is characterised in that described device also includes:
3rd performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes carrying 5th secondary node of the 5th standby subsystem, and described malfunctioning node does not include primary node or described malfunctioning node includes The standby subsystem of acquiescence of primary subsystem that carries of primary node when being not described five standby subsystem, according to institute State the priority of the 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, so that the described 6th is standby Carry described 5th standby subsystem with node.
31. devices according to claim 18 are it is characterised in that described device also includes:
First setup module, for being primary subsystem and standby subsystem setting priority.
32. devices according to claim 18 are it is characterised in that described device also includes:
Second setup module, gives tacit consent to standby subsystem for arranging for primary subsystem.
A kind of 33. redundancy protecting equipment are it is characterised in that described equipment includes:Processor and memorizer, for execution such as right Require the redundancy protected method any one of 1 to 17.
A kind of 34. redundancy protection systems are it is characterised in that described system includes:Multiple nodes and such as claim 18-32 Device described in any one, the plurality of node is used for executing task under the control of described device.
35. systems according to claim 34 are it is characterised in that described node is process or server on server.
36. systems according to claim 34 or 35 it is characterised in that when the process that described node is on server, The quantity of primary node carrying primary subsystem and the quantity of the secondary node carrying standby subsystem are equal, described in each Primary subsystem has a standby subsystem of acquiescence, the primary subsystem that the described primary node on same server carries Give tacit consent to the process that standby subsystem is located, be evenly distributed on two adjacent with described server server.
CN201380003769.2A 2013-03-19 2013-03-19 Method and device for redundancy protection, and device and system Active CN103931139B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013072890 2013-03-19

Publications (2)

Publication Number Publication Date
CN103931139A CN103931139A (en) 2014-07-16
CN103931139B true CN103931139B (en) 2017-02-15

Family

ID=51147972

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380003769.2A Active CN103931139B (en) 2013-03-19 2013-03-19 Method and device for redundancy protection, and device and system

Country Status (1)

Country Link
CN (1) CN103931139B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105634779B (en) * 2014-11-04 2019-09-03 南京中兴软件有限责任公司 The operation processing method and device of master/slave device
CN107612720B (en) * 2017-08-29 2021-06-22 深圳市盛路物联通讯技术有限公司 Access point backup method and gateway
JP7421052B2 (en) * 2019-03-15 2024-01-24 アイコム株式会社 How to make server systems and processes redundant
CN110149366B (en) * 2019-04-16 2022-03-18 平安科技(深圳)有限公司 Method and device for improving availability of cluster system and computer equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102143063A (en) * 2010-12-29 2011-08-03 华为技术有限公司 Method and device for protecting business in cluster system
CN102231681A (en) * 2011-06-27 2011-11-02 中国建设银行股份有限公司 High availability cluster computer system and fault treatment method thereof

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102143063A (en) * 2010-12-29 2011-08-03 华为技术有限公司 Method and device for protecting business in cluster system
CN102231681A (en) * 2011-06-27 2011-11-02 中国建设银行股份有限公司 High availability cluster computer system and fault treatment method thereof

Also Published As

Publication number Publication date
CN103931139A (en) 2014-07-16

Similar Documents

Publication Publication Date Title
US10601657B2 (en) Instance node management method and management device
CN103152419B (en) A kind of high availability cluster management method of cloud computing platform
CN105187249B (en) A kind of fault recovery method and device
CN103931139B (en) Method and device for redundancy protection, and device and system
TWI701916B (en) Method and device for self-recovering management ability in distributed system
CN102882704B (en) Link protection method in the soft reboot escalation process of a kind of ISSU and equipment
CN105159798A (en) Dual-machine hot-standby method for virtual machines, dual-machine hot-standby management server and system
CN111209110B (en) Task scheduling management method, system and storage medium for realizing load balancing
CN102025562A (en) Path detection method and device
CN109768884A (en) The implementation method of communication system and its high availability, device and computer equipment
CN105790825A (en) Method and apparatus for carrying out hot backup on controllers in distributed protection
CN105530116A (en) Virtual network backup and recovery method and corresponding device
CN107656847A (en) Node administration method, system, device and storage medium based on distributed type assemblies
CN110166115A (en) A kind of quantum optical network service fault recovery method and device based on pool of keys
CN109324549A (en) A kind of dual master control equipment configuration method and dual master control equipment
CN107329830A (en) A kind of method and device of distributed cluster system optimization
CN111490932A (en) Communication method and device
CN107992752B (en) Data processing method and device and computer equipment
CN116184895A (en) Redundant unit control system, method, device, computer equipment and storage medium
CN113098709B (en) Network recovery method and device based on distributed networking system and computer equipment
CN113794541A (en) Main/standby switching management method, device, equipment and machine readable storage medium
CN103716186B (en) Artificial telephone traffic system with network fault tolerance capability and method thereof
CN106341323A (en) VRRP state synchronizing method and device
CN113472642B (en) Protection switching method and device
CN103227994B (en) The management method and system of shared resource

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant