CN103931139B - Method and device for redundancy protection, and device and system - Google Patents
Method and device for redundancy protection, and device and system Download PDFInfo
- Publication number
- CN103931139B CN103931139B CN201380003769.2A CN201380003769A CN103931139B CN 103931139 B CN103931139 B CN 103931139B CN 201380003769 A CN201380003769 A CN 201380003769A CN 103931139 B CN103931139 B CN 103931139B
- Authority
- CN
- China
- Prior art keywords
- subsystem
- primary
- node
- standby
- secondary node
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
- Hardware Redundancy (AREA)
Abstract
An embodiment of the invention provides a method and a device for redundancy protection, and a device and system, and relates to the field of computers. The method comprises: detecting whether a fault node exists; if the fault node exists and the fault node comprises a first main node carrying a first main subsystem, and a first standby node carrying a first standby subsystem, the first main subsystem being bound to a second standby node of a second standby subsystem, so that the second standby node carries the first main subsystem. The first standby subsystem is a default standby subsystem of the first main subsystem. The second standby subsystem is a non-default standby subsystem of the first main subsystem. The device comprises a detection module and a first executive module. The method, the device, and the system support a plurality of fault processes, and improve reliability of the system.
Description
Technical field
The present invention relates to computer realm, particularly to a kind of redundancy protected method, device, equipment and system.
Background technology
Cluster (cluster) refers to, some separate service entities (computer or work station etc.), by HVN
The service platform that network interconnects and constitutes.It is different from single service entity, Cluster has equally loaded and two spies of redundancy protecting
Property.Wherein, redundancy protecting refers to, when the resource (can be the process on server or server) of certain task of execution is due to certain
When kind of reason breaks down, to the standby resources of resource should then completing this task.
Specifically, with execute the resource of certain task as server on process as a example, existing redundancy protecting flow process is led to
Often as follows.First, primary resources and the standby resources of equal amount are disposed on each service entities of Cluster.Secondly, adopt
Mode with ring-like backup is primary resources setting standby resources it is ensured that primary resources and the standby resources of oneself do not take at one
In entity.For example, Cluster includes numbering respectively 1 ..., the service entities of N-1, N and N+1.Divide equally on each service entities
Cloth M primary resources and M standby resources.Standby resources corresponding to the upper M primary resources of N-1 are distributed on N;Upper M of N
Standby resources corresponding to primary resources are distributed on N+1;Standby resources corresponding to the upper M primary resources of N+1 are distributed in 1
On;By that analogy.Then, the running status of each primary resources of real-time detection;Meanwhile, primary resources are in the task of execution, corresponding
Standby resources back up primary resources all data.When primary resources is detected and breaking down, the primary money of closing fault
Source, the corresponding standby resources of primary resources of startup separator.Because standby resources have backed up all numbers of the primary resources of fault
According to therefore, it is possible to ensure that task do not interrupt.
During realizing the present invention, inventor finds that prior art at least has problems with:
If primary resources and standby resources corresponding with this primary resources all break down, the task on primary resources will
Interrupt, the reliability of impact Cluster.
Content of the invention
In order to increase the reliability of Cluster, embodiments provide a kind of redundancy protected method, device, equipment
And system.Described technical scheme is as follows:
On the one hand, embodiments provide a kind of redundancy protected method, methods described includes:
Detect whether there is malfunctioning node;
When there is malfunctioning node, and described malfunctioning node include carrying the first primary subsystem the first primary node and
Carry the first standby subsystem the first secondary node when, by the described first primary binding of subsystems to carry the second standby subsystem
Second secondary node of system, so that described second secondary node carries the described first primary subsystem, described first standby subsystem
System is the standby subsystem of acquiescence of the described first primary subsystem, and described second standby subsystem is the described first primary subsystem
The standby subsystem of non-default.
Preferably, described the second standby section by the described first primary binding of subsystems to carrying the second standby subsystem
Point, so that described second secondary node carries described primary subsystem, including:
In the standby subsystem of non-default of the described first primary subsystem, select the minimum standby subsystem of a priority
System, the secondary node that selected described standby subsystem is located is as described second secondary node and main by described first
With binding of subsystems to described second secondary node.
Preferably, in described the second standby section by the described first primary binding of subsystems to carrying the second standby subsystem
Point, so that before described second secondary node carries described primary subsystem, methods described also includes:
Judge whether the described first primary subsystem possesses the ability of seizing;
If described first primary subsystem possesses the ability of seizing, by the described first primary binding of subsystems to described second
Secondary node.
Preferably, after by the described first primary binding of subsystems to described second secondary node, methods described is also wrapped
Include:
Detect whether the described first primary node and described first secondary node recover normal;
If described first primary node first recovers normal, the described first standby binding of subsystems is primary to described first
Node, so that described first primary node carries the described first standby subsystem;
If described first secondary node first recovers normal, the described first standby binding of subsystems is standby to described first
Node, so that described first secondary node carries the described first standby subsystem.
Preferably, described by the described first standby binding of subsystems to described first primary node, so that described first
After primary node carries the described first standby subsystem, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node
Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described masterslave switchover is carried out to the described first primary node and described second secondary node after, institute
Method of stating also includes:
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first standby section
Put and discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
Preferably, described by the described first standby binding of subsystems to described first secondary node so that described first is standby
After carrying the described first standby subsystem with node, methods described also includes:
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node
Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described masterslave switchover is carried out to described first secondary node and described second secondary node after, institute
Method of stating also includes:
After the described first primary node recovers normal, the described first standby binding of subsystems is primary to described first
Node simultaneously discharges described second secondary node, so that described first primary node carries the described first standby subsystem.
Preferably, described by the described first standby binding of subsystems to described first primary node discharge described second
After secondary node, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node
Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes the second primary node of carrying the second primary subsystem, and
The standby subsystem that the secondary node that described malfunctioning node does not include secondary node or described malfunctioning node includes carries is all not
Be the described second primary subsystem acquiescence standby subsystem when,
Masterslave switchover is carried out to the 3rd secondary node of the second primary node and carrying the 3rd standby subsystem, so that described
3rd secondary node carries the described second primary subsystem, and described 3rd standby subsystem is the silent of the described second primary subsystem
Recognize standby subsystem.
Preferably, to the second primary node and carry the 3rd standby subsystem the 3rd secondary node carry out active and standby fall
Change, so that before described 3rd secondary node carries the described second primary subsystem, methods described also includes:
Judge that the described second primary subsystem whether there is and give tacit consent to standby subsystem;
If existing, by the described second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem;
If not existing, by the described second primary binding of subsystems to the 4th standby section carrying the 4th standby subsystem
Point, described 4th standby subsystem is the standby subsystem of non-default of the described second primary subsystem.
Preferably, by the described second primary binding of subsystems to described 3rd secondary node or described 4th secondary node
Afterwards, methods described also includes:
Detect whether the described second primary node recovers normal;
If described second primary node recovers normal, by the described second primary binding of subsystems to described second primary section
Point.
Preferably, methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes carrying the 5th secondary node of the 5th standby subsystem, and
The primary subsystem that the primary node that described malfunctioning node does not include primary node or described malfunctioning node includes carries silent
When to recognize sub standby subsystem be not described five standby subsystem,
According to the priority of described 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem,
So that described 6th secondary node carries described 5th standby subsystem.
Preferably, methods described also includes:
It is primary subsystem and standby subsystem setting priority.
Preferably, methods described also includes:
Arrange for primary subsystem and give tacit consent to standby subsystem.
Preferably, the quantity of primary node carrying primary subsystem and the quantity of the secondary node carrying standby subsystem
Equal, each described primary subsystem has a standby subsystem of acquiescence.
Preferably, described primary node and described secondary node are the process on server, the institute on same server
State the process that the standby subsystem of acquiescence of the primary subsystem that primary node carries is located, be evenly distributed in and described server phase
On adjacent two servers.
Another further aspect, embodiments provides a kind of redundancy protection device, and described device includes:
, for detecting whether there is malfunctioning node in detection module;
, there is malfunctioning node for working as, and described malfunctioning node include carrying the first primary subsystem in the first performing module
The first primary node and carry the first standby subsystem the first secondary node when, by the described first primary binding of subsystems
To the second secondary node carrying the second standby subsystem, so that described second secondary node carries the described first primary subsystem
System, described first standby subsystem is the standby subsystem of acquiescence of the described first primary subsystem, described second standby subsystem
It is the standby subsystem of non-default of the described first primary subsystem.
Preferably, described first performing module is used for,
When there is malfunctioning node, and described malfunctioning node include carrying the first primary subsystem the first primary node and
Carry the first standby subsystem the first secondary node when, in the standby subsystem of non-default of the described first primary subsystem,
Select the minimum standby subsystem of a priority, the secondary node that selected described standby subsystem is located is as described
Second secondary node, and by the described first primary binding of subsystems to described second secondary node.
Preferably, described device also includes:
First judge module, for judging whether the described first primary subsystem possesses the ability of seizing;
Correspondingly, described first performing module is additionally operable to, when the described first primary subsystem possesses and seizes ability, by institute
State the first primary binding of subsystems to described second secondary node.
Preferably, described detection module is additionally operable to, and whether detection the described first primary node and described first secondary node
Recover normal;
Correspondingly, described first performing module is additionally operable to, when the described first primary node first recovers normal, then will be described
First standby binding of subsystems is to described first primary node, so that described first primary node carries the described first standby subsystem
System;When described first secondary node first recovers normal, then by the described first standby binding of subsystems to described first standby section
Point, so that described first secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to, described by the described first standby binding of subsystems to described the
One primary node, so that after described first primary node carries the described first standby subsystem,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node
Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to,
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first standby section
Put and discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to, described by the described first standby binding of subsystems to described first
Secondary node, so that after described first secondary node carries the described first standby subsystem,
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node
Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to,
After the described first primary node recovers normal, the described first standby binding of subsystems is primary to described first
Node simultaneously discharges described second secondary node, so that described first primary node carries the described first standby subsystem.
Preferably, described first performing module is additionally operable to,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node
Carry the described first primary subsystem, and described second secondary node carries the described first standby subsystem.
Preferably, described device also includes:
Second performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes
Carry the second primary node of the second primary subsystem, and described malfunctioning node does not include secondary node or described malfunctioning node
Including secondary node carry standby subsystem be not the described second primary subsystem acquiescence standby subsystem when, by institute
State the second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem, so that described 3rd secondary node holds
Carry the described second primary subsystem, described 3rd standby subsystem is the standby subsystem of acquiescence of the described second primary subsystem.
Preferably, described device also includes:
Second judge module, gives tacit consent to standby subsystem for judging that the described second primary subsystem whether there is;
Correspondingly, described second performing module is additionally operable to, in the presence of the judged result of described second judge module is, will
Described second primary binding of subsystems is to the 3rd secondary node carrying the 3rd standby subsystem;When described second judge module
When judged result is not exist, by the described second primary binding of subsystems to the 4th standby section carrying the 4th standby subsystem
Point, described 4th standby subsystem is the standby subsystem of non-default of the described second primary subsystem.
Preferably, described detection module is additionally operable to, and whether detection the described second primary node recovers normal;
Correspondingly, described second performing module is additionally operable to, when the described second primary node recovers normal, then by described second
Primary binding of subsystems is to described second primary node.
Preferably, described device also includes:
3rd performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes
Carry the 5th secondary node of the 5th standby subsystem, and described malfunctioning node does not include primary node or described malfunctioning node
Including the standby subsystem of acquiescence of primary subsystem that carries of primary node be not described five standby subsystem when, root
According to the priority of described 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, so that described the
Six secondary nodes carry described 5th standby subsystem.
Preferably, described device also includes:
First setup module, for being primary subsystem and standby subsystem setting priority.
Preferably, described device also includes:
Second setup module, gives tacit consent to standby subsystem for arranging for primary subsystem.
Another aspect, embodiments provides a kind of redundancy protecting equipment, and described equipment includes:Processor and storage
Device, for executing aforementioned redundancy protected method.
On the other hand, a kind of redundancy protection systems are embodiments provided, described system includes:Multiple nodes, with
And aforementioned redundancy protection device, the plurality of node is used for executing task under the control of described device.
Alternatively, described node can be the process on server or server.
Preferably, when described node be server on process when, carry primary subsystem the quantity of primary node and
The quantity carrying the secondary node of standby subsystem is equal, and each described primary subsystem has a standby subsystem of acquiescence
System, the process at the acquiescence standby subsystem place of the primary subsystem that the described primary node on same server carries, averagely
It is distributed on the two neighboring server with described server.
The beneficial effect that technical scheme provided in an embodiment of the present invention is brought is:By carrying the first primary subsystem
First primary node and the first standby section carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem)
During the equal fault of point, by the first primary binding of subsystems, to carrying the second standby subsystem, (non-default of the first primary subsystem is standby
With subsystem) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first
The normal operation of primary subsystem;Therefore, the embodiment of the present invention can support multiple node failures, and effectively increase system can
By property.
Brief description
For the technical scheme being illustrated more clearly that in the embodiment of the present invention, will make to required in embodiment description below
Accompanying drawing be briefly described it should be apparent that, drawings in the following description are only some embodiments of the present invention, for
For those of ordinary skill in the art, on the premise of not paying creative work, other can also be obtained according to these accompanying drawings
Accompanying drawing.
Fig. 1 is a kind of flow chart of redundancy protected method that the embodiment of the present invention 1 provides;
Fig. 2 is the structural representation of the radio network controller that the embodiment of the present invention 2 provides;
Fig. 3 is a kind of flow chart of redundancy protected method that the embodiment of the present invention 2 provides;
Fig. 4 is the schematic diagram of the two-way backup mode that the embodiment of the present invention 2 provides;
Fig. 5-Figure 12 is the schematic diagram of the concrete application of redundancy protected method that the embodiment of the present invention 2 provides;
Figure 13 is a kind of structural representation of redundancy protection device that the embodiment of the present invention 3 provides;
Figure 14 is a kind of structural representation of redundancy protection device that the embodiment of the present invention 4 provides;
Figure 15 is a kind of structural representation of redundancy protecting equipment that the embodiment of the present invention 5 provides;
Figure 16 is a kind of structural representation of redundancy protection systems that the embodiment of the present invention 6 provides.
Specific embodiment
For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with accompanying drawing to embodiment party of the present invention
Formula is described in further detail.
Embodiment 1
Referring to Fig. 1, embodiments provide a kind of redundancy protected method, the method flow process includes:
Step 101:Detect whether there is malfunctioning node.
Specifically, in embodiments of the present invention, node is the resource for executing certain task, can be a service
A process on device or server.
Step 102:When there is malfunctioning node, and malfunctioning node includes the first primary section of carrying the first primary subsystem
During the first secondary node of point and carrying the first standby subsystem, by the first primary binding of subsystems to carrying the second spare
Second secondary node of system, so that this second secondary node carries the first primary subsystem, this first standby subsystem is the
The standby subsystem of acquiescence of one primary subsystem, this second standby subsystem is the standby subsystem of non-default of the first primary subsystem
System.
Specifically, in embodiments of the present invention, the standby subsystem of the acquiescence of primary subsystem, referring to can this master of backed up in synchronization
Standby subsystem with the service data of subsystem.The standby subsystem of non-default of primary subsystem refers to backed up in synchronization to be somebody's turn to do
The standby subsystem of the service data of primary subsystem, in implementing, primary subsystem and the standby subsystem of acquiescence are permissible
It is one-to-one it is also possible to not be one-to-one.In the case of corresponding, the acquiescence of a primary subsystem is standby
Subsystem is the non-default subsystem of another primary subsystem;In the case of not being to correspond, a primary subsystem
The standby subsystem of non-default can be the standby subsystem of acquiescence of another primary subsystem it is also possible to not be any primary son
The standby subsystem of acquiescence of system.
The beneficial effect of said method provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem
First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal
During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem
System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary
The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system
Property.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem simultaneous faults, primary subsystem is acceptable
Seize other secondary nodes, therefore can ensure that the operation of system using less secondary node, saved resource.
Embodiment 2
As a example the embodiment of the present invention will be with RNC (Radio Network Controller, radio network controller), to this
The redundancy protected method of invention illustrates.The understanding of the technical scheme for ease of describing to the embodiment of the present invention, introduces first
RNC once.RNC is a key network element of 3G (3rd Generation, 3G (Third Generation) Moblie) network and the group of access network
Become part, for providing mobile management, call treatment, url management and handover mechanism.In order to ensure the reliability of mobile communication
Property, RNC is typically a Cluster system.Referring to Fig. 2, RNC is made up of the veneer (i.e. service entities) of some interconnections.Veneer
For functional module, it can be the blade server that specific function is provided.Veneer includes some processes (i.e. node).Process is single
The abstract embodiment of CPU computing resource on plate, a process is exactly a physical manipulations unit.Each process carries an independence
Logic application (i.e. task).Here, an independent Logic application is referred to as a subsystem.
With reference to Fig. 3, redundancy protected method provided in an embodiment of the present invention is described, referring to Fig. 3, the method includes:
Step 201:Arrange for primary subsystem and give tacit consent to standby subsystem.
Specifically, by setting up the corresponding relation between primary subsystem and standby subsystem, you can for primary subsystem
Standby subsystem is given tacit consent in setting.In the running of primary subsystem, the standby subsystem of acquiescence of this primary subsystem can be right
It synchronizes backup.
Preferably, referring to Fig. 4, in embodiments of the present invention, primary subsystem is equal with the quantity of standby subsystem, primary
Subsystem is corresponded with standby subsystem, each primary subsystem have one give tacit consent to standby subsystem that is to say, that
RNC is by the way of 1+1 backup.
It is highly preferred that the embodiment of the present invention can adopt two-way backup mode.Two-way backup mode refers to, first, in each list
The primary subsystem of equal amount and standby subsystem are disposed on plate;Secondly, make a part of primary subsystem on a veneer
With the spare system relationship on previous veneer, the standby subsystem pass on the primary subsystem of another part and a rear veneer
Connection is that is to say, that two lists given tacit consent to before and after standby subsystem is evenly distributed in this veneer of primary subsystem on this veneer
On plate.So, when veneer breaks down, the load of this veneer will be split on former and later two veneers, in front and back's veneer
Impact is less.It is readily apparent that, the forward and backward veneer referring to same type adjacent in logic here, such as, in same machine frame
Interior, including two kinds of veneer, one kind is redundancy protecting veneer, and the subsystem running thereon is all using the side of redundancy protecting
Formula, and another kind is non-redundancy protecting veneer, the subsystem running thereon is not by the way of redundancy protecting.Assume that redundancy is protected
The slot number that shield veneer is located in this machine frame is 0,1,2,4,5,6, then, for the veneer of No. 2 groove positions, before and after it
Two veneers are respectively the veneer of No. 1 groove position and the veneer of No. 4 groove positions.
Obviously, existing Ring-backup mode may also be employed, by all primary subsystem on a veneer and adjacent
The spare system relationship of individual veneer.It is readily apparent that, here the veneer of the adjacent same type also referring to Logic adjacent.
What deserves to be explained is, in the present embodiment, employ the mode of 1+1 backup, that is, primary subsystem with standby
The quantity of subsystem is equal, and each primary subsystem has a standby subsystem of acquiescence, however, in other embodiments, main
The quantity of standby subsystem can also be more than or less than with the quantity of subsystem.When the quantity of standby subsystem is more than primary subsystem
During the quantity of system, there is the standby subsystem of redundancy.When the quantity of primary subsystem is more than the quantity of standby subsystem, permissible
Priority (see step 202) according to primary subsystem sets up primary subsystem and the corresponding relation of standby subsystem, such as, first
Set up the corresponding relation between the high primary subsystem of priority and standby subsystem.
Step 202:It is primary subsystem and standby subsystem setting priority.
Further, this step 202 can include:According to the type of service of subsystem, user type and/or region, it is
Primary subsystem and standby subsystem setting priority.
Preferably, in the present embodiment, can be primary subsystem setting priority according to the type of service of subsystem.
For example, the type of service according to subsystem, primary system subdivision is 4 priority, high priority, medium priority, low excellent
First level and 0 priority.Wherein, high priority can be responsible for the subsystem that signaling is processed, this primary subsystem fault in load RNC
All business in RNC will be affected, impact is maximum.Medium priority can be for being responsible for the subsystem of cell management in load RNC, this is primary
The cell of this management subsystem that can lead to subsystem fault exits service, and impact is larger.Low priority can be reserved, is easy to follow-up
Extension.0 priority can be for being responsible for the subsystem of call treatment in load RNC, this primary subsystem fault can lead to this subsystem
On currently ongoing call, impact is minimum.
The priority of the corresponding primary subsystem of priority inheritance of standby subsystem, that is, primary subsystem is preset preferential
Level is identical with the priority of the standby subsystem of acquiescence of this primary subsystem.The priority of standby subsystem is only used for standby subsystem
Priority height is compared between system.
Preferably, the method can also include:Arrange primary subsystem seizes ability.
Specifically, the seizing ability and can also be represented using priority of primary subsystem.Such as, can be in setting priority
Shi Dingyi, the primary subsystem of 0 priority does not possess the ability of seizing that is to say, that working as the primary subsystem carrying 0 priority
During process failure, it can not seize the standby subsystem of other non-default.Certainly, seize ability and can not also adopt priority list
Show, and adopt other specification to represent.
In the present embodiment, the ability of seizing of primary subsystem is also adopted by priority and represents, so only non-zero priority
Primary subsystem just possesses the ability of seizing that is to say, that the priority of the primary subsystem of all non-zero priority is more than arbitrarily standby
Priority with subsystem.
Alternatively, in other embodiments it is also possible to according to user type, be primary subsystem and the setting of standby subsystem
Priority.Such as, the business of honored guest user all can be assigned on some subsystems, and the business of domestic consumer is all distributed
To on other subsystems, high towards the priority of the subsystem than domestic consumer for the subsystem of honored guest user.
Alternatively, in other embodiments, can also be preferential according to being arranged with subsystem and standby subsystem based on region
Level.Such as, can be and close by population by the traffic assignments of service area (such as city) big for population density to some subsystems
The traffic assignments spending little service area (such as mountain area) are to other subsystems, the corresponding son of the big service area of population density
The priority of the corresponding subsystem of the system service area less than population density is high.
It should be noted that in implementing, step 201 and 202 does not have sequencing, can execute simultaneously.Additionally,
Step 201 and 202 is optional step, can pre-set.
Step 203:Detect whether there is malfunctioning node.
When there is malfunctioning node, and this malfunctioning node includes carrying the first primary node of the first primary subsystem and holds
Carry the first standby subsystem the first secondary node when, execution step 204, wherein, the first standby subsystem is the first primary son
The standby subsystem of acquiescence of system;
When there is malfunctioning node, and this malfunctioning node includes the second primary node of carrying the second primary subsystem, and is somebody's turn to do
The standby subsystem that the secondary node that malfunctioning node does not include secondary node or this malfunctioning node includes carries be not this
During the standby subsystem of acquiescence of two primary subsystems, execution step 214;
When there is malfunctioning node, and this malfunctioning node includes carrying the 5th secondary node of the 5th standby subsystem, and is somebody's turn to do
Acquiescence of the primary subsystem that the primary node that malfunctioning node does not include primary node or this malfunctioning node includes carries is standby
When being not five standby subsystems with subsystem, execution step 218.
Further, in the present embodiment, detect whether there is malfunctioning node, including following two situations:
If the first simply certain node (i.e. process) on veneer break down, the detection Programmable detection on veneer arrives
After this fault, can be by operating system interface to control centre (i.e. in the device in embodiment three or four or embodiment five
Equipment) reporting fault node, control centre knows malfunctioning node according to the information that veneer reports.
If second single board default, such as hardware fault or network failure, in this case, due to control centre and list
Timing handshake message can be sent, once there is single board default, control centre in the scheduled time or does not for a long time receive report between plate
Literary composition, now then it represents that all nodes all faults on this veneer.
Be readily apparent that, generally preserve in control centre the corresponding relation of node and subsystem and primary subsystem and
Give tacit consent to the corresponding relation (such as in the form of a list) of standby subsystem, so, when there is malfunctioning node, aforementioned by inquiring about
Corresponding relation is it is possible to know the type of the subsystem that malfunctioning node carries.
Specifically, referring to Fig. 5, in the present embodiment it can be assumed that malfunctioning node includes the first primary node a and first standby
Use node b, a carries the first primary subsystem X, b carries the first standby subsystem X ', X ' is the standby subsystem of acquiescence of X.
Preferably, after detecting whether to there is malfunctioning node, the method also includes:
Detect whether after restart the scheduled time interior recovery is normal for the first primary node and the first secondary node.
In the present embodiment, node is the process on veneer, so, detecting the first primary node and the first standby section
After point failure, this first primary node and the first secondary node should be restarted immediately in control centre.And, first should be detected after restarting
Whether primary node and the first secondary node recover normal in the given time.When the first primary node or the first secondary node are extensive
When multiple normal, initiation message can be sent to control centre, control centre can judge the first primary node according to this initiation message
Whether recover normal with the first secondary node.
Specifically, this scheduled time is to pre-set, and is used for making malfunctioning node complete to restart, when being typically additionally provided with
Between protect surplus.For example, the starting time 90 seconds+guardtime surplus of this scheduled time=subsystem 30 seconds=120 seconds.
Correspondingly, if the first primary node and the first secondary node all do not recover normal after restart in the scheduled time, then
Execution subsequent step 204, if the first primary node and the first secondary node recover normal in the given time, does not execute follow-up
Step 204.
Step 204:Judge whether the first primary subsystem possesses the ability of seizing.If this first primary subsystem possesses seizing
Ability, then execution step 205;If this first primary subsystem does not possess the ability of seizing, exit this flow process.
Specifically, if seizing ability using what priority represented primary subsystem, then, judge the first primary subsystem
Whether possess the ability of seizing to include:Whether the priority judging this first primary subsystem is non-zero priority.If this is first primary
The priority of subsystem is non-zero priority, then this first primary subsystem possesses the ability of seizing.If this first primary subsystem
Priority is 0 priority, then this first primary subsystem does not possess the ability of seizing.
Step 205:By the first primary binding of subsystems to the second secondary node carrying the second standby subsystem, so that the
Two secondary nodes carry the first primary subsystem;This second standby subsystem is the standby subsystem of non-default of the first primary subsystem
System.
Specifically, this step 205 includes:
In the standby subsystem of non-default of the first primary subsystem, select the minimum standby subsystem of a priority,
The secondary node that selected standby subsystem is located is as the second secondary node, and the first primary binding of subsystems extremely should
Second secondary node.
In the rnc, by the first primary binding of subsystems to the second secondary node, including:
The start-up parameter data of the first primary subsystem is sent to the second secondary node, so that the second secondary node is adopted
Run with the start-up parameter data receiving, this is the state of the art, here is omitted and described in detail.
Referring to Fig. 6 it can be assumed that the second secondary node is c, c initially carries the second standby subsystem Y '.Step 205 it
Afterwards, X is bound to c.
Be readily apparent that, by the first primary binding of subsystems to carry the second standby subsystem the second secondary node it
Before, the corresponding relation between the subsystem of its preservation and node can refresh in control centre, then according to the corresponding relation after refreshing,
Send aforementioned start-up parameter.
Step 206:Detect whether the first primary node and the first secondary node recover normal;If the first primary node is first extensive
Multiple normal, then execution step 207;If the first secondary node first recovers normal, execution step 210.
Specifically, when malfunctioning node recovers normal, initiation message can be sent to control centre, so, control centre can
With according to whether the initiation message receiving malfunctioning node transmission to judge whether this malfunctioning node recovers normal.
In the present embodiment, only describe the first primary node and the first secondary node recovers normal situation (in succession
One primary node first recovers normal or the first secondary node and first recovers normal).In theory, it is understood that there may be the first primary node and
One secondary node recovers normal situation simultaneously.However, in the implementing of the embodiment of the present invention, due to holding of all steps
Row is all to be processed in the way of interacting message, for control centre, is only possible to first receive the first primary node and
The one of initiation message sending of one secondary node.Therefore, in actual applications, be not in the first primary node and
One secondary node recovers normal situation simultaneously.
Step 207:By the first standby binding of subsystems to the first primary node, so that the first primary node carrying first is standby
Use subsystem.
Specifically, when the first primary node first recovers normal, by the first standby binding of subsystems to this first primary section
Point so that after the first primary node starts, can be previously run the first standby subsystem the first primary subsystem is carried out standby
Part, after the completion of being easy to backup, the first primary subsystem is tied up back the first primary node again.Referring to Fig. 7, when a first recovers normal,
X ' is tied to a.
Step 208:Masterslave switchover is carried out to the first primary node and the second secondary node, so that the first primary node carries
First primary subsystem, and the second secondary node carries the first standby subsystem.
Specifically, the backup (BACKUP TIME can be set) to the first primary subsystem is completed in the first standby subsystem
Afterwards, masterslave switchover is carried out to the first primary node and the second secondary node:By the first primary binding of subsystems to the first primary section
Point, so that the first primary node carries the first primary subsystem again;By the first standby binding of subsystems to the second secondary node,
So that the second secondary node carries this first standby subsystem.Referring to Fig. 8, when carrying out the masterslave switchover between node:X is tied up back a,
A is made again to carry X.X ' is tied to c, makes c start to carry X '.
Step 209:After the first secondary node recovers normal, by the first standby binding of subsystems to the first secondary node simultaneously
Discharge the second secondary node, so that the first secondary node carries the first standby subsystem.
Specifically, after the first primary node recovers normal, the first secondary node has also recovered normally, at this moment, by first
Standby binding of subsystems to the first secondary node and discharges the second secondary node, so that the first secondary node carries the first spare
System, and restart the second secondary node being released, so that it recovers original standby subsystem (the second standby subsystem).Ginseng
See Fig. 9, after b also recovers normal, X ' is tied up back b, and discharges c.
So, by step 207-209, the first primary subsystem, the first standby subsystem and the second standby subsystem are equal
Recover to original state.
Step 210:By the first standby binding of subsystems to the first secondary node, so that the first secondary node carrying first is standby
Use subsystem.
Specifically, when the first secondary node first recovers normal, by the first standby binding of subsystems to the first secondary node,
So that the first standby subsystem backs up to the first primary subsystem.Referring to Figure 10 it is assumed that after a and b fault, b just first recovers
Often.Then X ' is tied to b.
Step 211:Masterslave switchover is carried out to the first secondary node and the second secondary node, so that the first secondary node carries
First primary subsystem, and the second secondary node carries the first standby subsystem.
Specifically, the backup (BACKUP TIME can be set) to the first primary subsystem is completed in the first standby subsystem
Afterwards, masterslave switchover is carried out to the first secondary node and the second secondary node:By the first primary binding of subsystems to the first standby section
Point, by the first standby binding of subsystems to the second secondary node.Referring to Figure 11, b first recovers normal and is tied to after b by X ', to b
Carry out masterslave switchover with c:X is bind to b, X ' is bind to c.
Step 212:After the first primary node recovers normal, by the first standby binding of subsystems to the first primary node
And discharge the second secondary node, so that the first primary node carries the first standby subsystem.
Specifically, after the first secondary node recovers normal, the first primary node has also recovered normally, at this moment, by first
Standby binding of subsystems to the first primary node and discharges the second secondary node, so that the first primary node carries the first spare
System.After first standby binding of subsystems to the first primary node, start the first primary subsystem is backed up.Referring to figure
12, first recover a after normally in b and also recovered normally, at this moment, X ' to be bind to a, and discharge c.
Step 213:Masterslave switchover is carried out to the first primary node and the first secondary node, so that the first primary node carries
First primary subsystem, and the first secondary node carries the first standby subsystem.
Specifically, treat that the first standby subsystem completes the backup to the first primary subsystem, to the first primary node and
One secondary node carries out masterslave switchover:By the first primary binding of subsystems to the first primary node, the first standby subsystem is tied up
Determine to the first secondary node.Refer again to Fig. 9, after X ' completes the backup to X, then the masterslave switchover carrying out a and b:X is tied up back a,
X ' ties up back b.
Similarly, by step 210-213, the first primary subsystem, the first standby subsystem and the second standby subsystem
All recover to original state.
It should be noted that step 208-209 and step 211-213 are preferred steps, facilitate implementation the load of system
Balance.
Step 214:Judge that the second primary subsystem whether there is and give tacit consent to standby subsystem.If existing, execution step
215;If not existing, execution step 216.
Specifically, judge that the second primary subsystem whether there is and give tacit consent to standby subsystem, including:Preserve according in server
Primary subsystem and give tacit consent to standby subsystem corresponding relation, judge the second primary subsystem whether there is acquiescence standby subsystem
System.
Step 215:Masterslave switchover is carried out to the 3rd secondary node of the second primary node and carrying the 3rd standby subsystem,
So that the 3rd secondary node carries the second primary subsystem, the 3rd standby subsystem is that the acquiescence of the second primary subsystem is standby
Subsystem.
Specifically, if the second primary subsystem exists gives tacit consent to standby subsystem, and the standby subsystem of this acquiescence is standby for the 3rd
With subsystem, then by the second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem.
Step 216:By the second primary binding of subsystems to carry the 4th standby subsystem the 4th secondary node, the 4th
Standby subsystem is the standby subsystem of non-default of the second primary subsystem.
Specifically, by the second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, same to step
The process of 204~step 205 description, will not be described in detail herein.
Step 217:Detect whether the second primary node recovers normal;If the second primary node recovers normal, by second
Primary binding of subsystems is to the second primary node.
Specifically, detection process is identical with the detection process that step 206 describes, and will not be described in detail herein.If the second primary section
Point recovers normal, then by the second primary binding of subsystems to the second primary node;If this second primary node does not recover normal,
Proceed to detect.
Specifically, if the second primary node recovers normal, the second primary binding of subsystems to the second primary node wraps
Include two kinds of situations:
The first situation:As step 215 describes, the 3rd secondary node carries the second primary subsystem, at this moment, if second is main
Recover normal with node, then first by the 3rd standby binding of subsystems to the second primary node so that the second primary node carries the
Three standby subsystems;Then, masterslave switchover is carried out to the second primary node and the 3rd secondary node, so that the second primary node holds
Carry the second primary subsystem, and the 3rd secondary node carries the 3rd standby subsystem.
Second situation:As step 216 describes, the 4th secondary node carries the second primary subsystem, at this moment, if second is main
Recover normal with node, then by the second primary binding of subsystems to the second primary node, and discharge the 4th secondary node.
Step 218:According to the priority of the 5th standby subsystem, seize the 6th standby section carrying the 6th standby subsystem
Point, so that the 6th secondary node carries the 5th standby subsystem.
Wherein, the priority of the standby subsystem of priority ratio the 5th of the 6th standby subsystem is low.Further, seize
Process can be found in step 205, will not be described in detail herein.
Preferably, after detecting the 5th secondary node fault, the 5th secondary node should be restarted immediately in control centre.Weight
Qi Hou, should detect whether the 5th secondary node recovers normal in the given time;If so, then do not seized;If it is not, then executing
Seize.
It is highly preferred that after the 5th standby subsystem seizes the 6th secondary node, the method also includes:
After the 5th secondary node recovers normal, discharge the 6th secondary node, and by the 5th standby binding of subsystems extremely
5th secondary node, so that the 5th secondary node carries the 5th standby subsystem again.
What deserves to be explained is, in the present embodiment, node is a process on veneer in RNC, in other embodiments,
Node can be a money on a server or server in a veneer or other Cluster systems in RNC
Source.
The beneficial effect of said method provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem
First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal
During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem
System) the second secondary node so that this second secondary node carry the first primary subsystem, such that it is able to fast quick-recovery fault
The first primary subsystem it is ensured that the normal operation of the first primary subsystem;Therefore, the embodiment of the present invention can support multiple sections
Point failure, effectively increases the reliability of system.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem
During simultaneous faults, primary subsystem can also seize other secondary nodes, therefore be can ensure that using less secondary node
The operation of system, has saved resource.In addition, when carrying the secondary node fault of standby subsystem of high priority, Ke Yigen
Seize the secondary node of the standby subsystem carrying low priority according to priority, therefore can preferentially ensure the primary of high priority
Subsystem exists gives tacit consent to standby subsystem, that is, ensure the high primary subsystem of priority, preferentially have redundancy protecting, such that it is able to
Reduce the risk of service disconnection, improve the reliability of system further, and the mode seized according to priority, realize simple.
Embodiment 3
Embodiments provide a kind of redundancy protection device, referring to Figure 13, this device includes:
, for detecting whether there is malfunctioning node in detection module 301.
First performing module 302, for detecting the presence of malfunctioning node when detection module 301, and malfunctioning node includes holding
The the first primary node carrying the first primary subsystem and carry the first standby subsystem the first secondary node when, main by first
With binding of subsystems to the second secondary node carrying the second standby subsystem, so that the second secondary node carries the first primary son
System, this first standby subsystem is the standby subsystem of acquiescence of the first primary subsystem, and this second standby subsystem is first
The standby subsystem of non-default of primary subsystem.
The beneficial effect of said apparatus provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem
First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal
During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem
System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary
The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system
Property.
Embodiment 4
Embodiments provide a kind of redundancy protection device, referring to Figure 14, the difference with the device of embodiment 3
It is, the first performing module 302 is additionally operable to, when detection module 301 detects the presence of malfunctioning node, and malfunctioning node includes carrying
During the first secondary node of the first primary node of the first primary subsystem and carrying the first standby subsystem, primary first
In the standby subsystem of non-default of subsystem, select the minimum standby subsystem of a priority, by selected standby subsystem
Unite the secondary node being located as the second secondary node, and by the first primary binding of subsystems to the second secondary node.
Further, this device also includes:First judge module 403, for judging whether the first primary subsystem possesses
Seize ability;
Correspondingly, the first performing module 302 is additionally operable to, when the first primary subsystem possesses and seizes ability, main by first
With binding of subsystems to the second secondary node.
Further, detection module 301 is additionally operable to, and whether detection the first primary node and the first secondary node just recover
Often;
Correspondingly, the first performing module 302 is additionally operable to, when the first primary node first recovers normal, by the first spare
System binding is to the first primary node, so that the first primary node carries the first standby subsystem;When the first secondary node is first extensive
When multiple normal, by the first standby binding of subsystems to the first secondary node, so that the first secondary node carries the first standby subsystem
System.
Further, the first performing module 302 is additionally operable to, by the first standby binding of subsystems to the first primary node,
So that after the first primary node carries the first standby subsystem, the first primary node and the second secondary node are carried out active and standby fall
Change, so that the first primary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Further, the first performing module 302 is additionally operable to, after the first secondary node recovers normal, by the first spare
System binding to the first secondary node and discharges the second secondary node, so that the first secondary node carries the described first standby subsystem
System.
Further, the first performing module 302 is additionally operable to, by the first standby binding of subsystems to the first secondary node,
So that first secondary node carry the first standby subsystem after, the first secondary node and the second secondary node are carried out active and standby fall
Change, so that the first secondary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Further, the first performing module 302 is additionally operable to, after the first primary node recovers normal, standby by first
Binding of subsystems to the first primary node and discharges the second secondary node, so that the first primary node carries the first standby subsystem
System.
Further, the first performing module 302 is additionally operable to, the first primary node and the second secondary node are carried out active and standby fall
Change, so that the first primary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Further, this device also includes:
Second performing module 404, for detecting the presence of malfunctioning node when detection module 301, and malfunctioning node includes holding
Carry the second primary node of the second primary subsystem, and malfunctioning node does not include secondary node or that malfunctioning node includes is standby
When the standby subsystem that node carries not is the standby subsystem of acquiescence of the second primary subsystem, the second primary subsystem is tied up
Fixed to the 3rd secondary node carrying the 3rd standby subsystem so that the 3rd secondary node carries the second primary subsystem, this
Three standby subsystems are the standby subsystem of acquiescence of the second primary subsystem.
Further, this device also includes:Second judge module 405, for judging that the second primary subsystem whether there is
Give tacit consent to standby subsystem;
Correspondingly, the second performing module 404 is additionally operable to, in the presence of the judged result of the second judge module 405 is, by
Two primary binding of subsystems are to the 3rd secondary node carrying the 3rd standby subsystem;Judgement knot when the second judge module 405
When fruit is not exist, by the second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, the 4th is standby
The standby subsystem of non-default being the second primary subsystem with subsystem.
Further, detection module 301 is additionally operable to, and whether detection the second primary node recovers normal;Correspondingly, second hold
Row module 404 is additionally operable to, when the second primary node recovers normal, then by the second primary binding of subsystems to the second primary node.
Further, this device also includes:
3rd performing module 406, for detecting the presence of malfunctioning node when detection module 301, and malfunctioning node includes holding
Carry the 5th secondary node of the 5th standby subsystem, and malfunctioning node does not include primary node or that malfunctioning node includes is primary
When the standby subsystem of acquiescence of the primary subsystem that node carries not is five standby subsystem, according to the 5th standby subsystem
The priority of system, seizes the 6th secondary node carrying the 6th standby subsystem, so that the 6th secondary node carrying the 5th is standby
Subsystem.
Further, this device also includes:
First setup module 407, for being primary subsystem and standby subsystem setting priority.
Further, this device also includes:
Second setup module 408, gives tacit consent to standby subsystem for arranging for primary subsystem.
The beneficial effect of said apparatus provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem
First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal
During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem
System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary
The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system
Property.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem simultaneous faults, primary subsystem is acceptable
Seize other secondary nodes, therefore can ensure that the operation of system using less secondary node, saved resource.In addition,
When carrying the secondary node fault of standby subsystem of high priority, can be seized according to priority and carry the standby of low priority
With the secondary node of subsystem, therefore can preferentially ensure that the primary subsystem of high priority exists and give tacit consent to standby subsystem, from
And improve the reliability of system further, and the mode seized according to priority, realize simple.
Embodiment 5
Embodiments provide a kind of redundancy protecting equipment, referring to Figure 15, this equipment includes processor 51 (for example
CPU), memorizer 52 and network interface 53.
With reference to Figure 15, each component parts of redundancy protecting equipment 50 are specifically introduced:
Network interface 53 is used for and the communication connection of each node.
Memorizer 52 can be used for storing software program and/or application module, thus executing the various of redundancy protecting equipment 50
Application of function and data processing.Memorizer 52 can mainly include storing program area and storage data field, wherein, storing program area
Can application program needed for storage program area, at least one function etc.;Storage data field can store according to redundancy protecting equipment
The data that 50 use is created, such as corresponding relation between primary subsystem and standby subsystem, the priority of subsystem,
And primary subsystem seize ability etc..Additionally, memorizer 52 can include high-speed RAM (Random Access Memory,
Random access memory), nonvolatile memory (non-volatile memory), for example, at least one magnetic can also be included
Disk storage device, flush memory device or other volatile solid-state parts.
Specifically, it is stored in software program and/or the application module in memorizer 52 by operation or execution, and call
It is stored in the data in memorizer 52, processor 51 is it is achieved that detect whether there is malfunctioning node, and determines that malfunctioning node holds
The subsystem carrying.Specifically, when there is malfunctioning node, and this malfunctioning node includes carrying the first primary of the first primary subsystem
During the first secondary node of node and carrying the first standby subsystem, processor 51 passes through network interface 53 to the second standby section
Point sends the instruction message (including the start-up parameter of the first primary subsystem) carrying the first primary subsystem, and this instruction message is used
In by the second secondary node of the first primary binding of subsystems to carrying the second standby subsystem, so that this second secondary node holds
Carry the first primary subsystem, the first standby subsystem is the standby subsystem of acquiescence of the first primary subsystem, this second spare
System is the standby subsystem of non-default of the first primary subsystem.
Preferably, processor 51 is specifically for, in the standby subsystem of non-default of the first primary subsystem, selecting one
The minimum standby subsystem of priority, using the secondary node at selected standby subsystem place as the second secondary node, and
By the first primary binding of subsystems to this second secondary node.
Preferably, processor 51 can also be realized, and the first primary binding of subsystems is extremely being carried the second standby subsystem
The second secondary node so that second secondary node carry primary subsystem before,
Judge whether the first primary subsystem possesses the ability of seizing;
If the first primary subsystem possesses the ability of seizing, by the first primary binding of subsystems to the second secondary node.
Preferably, processor 51 can also be realized, after by the first primary binding of subsystems to the second secondary node,
Detect whether the first primary node and the first secondary node recover normal;
If the first primary node first recovers normal, by the first standby binding of subsystems to the first primary node, so that the
One primary node carries the first standby subsystem;
If the first secondary node first recovers normal, by the first standby binding of subsystems to the first secondary node, so that the
One secondary node carries the first standby subsystem.
Further, processor 51 can also be realized, by the first standby binding of subsystems to the first primary node so that
After first primary node carries the first standby subsystem, masterslave switchover is carried out to the first primary node and the second secondary node,
So that the first primary node carries the first primary subsystem, and the second secondary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, and is carrying out masterslave switchover to the first primary node and the second secondary node
Afterwards, after the first secondary node recovers normal, by the first standby binding of subsystems, to the first secondary node and to discharge second standby
With node, so that the first secondary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, by the first standby binding of subsystems to the first secondary node so that
After one secondary node carries the first standby subsystem, masterslave switchover is carried out to the first secondary node and the second secondary node, with
Make the first secondary node carry the first primary subsystem, and the second secondary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, and is carrying out masterslave switchover to the first secondary node and the second secondary node
Afterwards,
After the first primary node recovers normal, the first standby binding of subsystems to the first primary node and is discharged the
Two secondary nodes, so that the first primary node carries the first standby subsystem.
Preferably, processor 51 can also be realized, by the described first standby binding of subsystems to the first primary node simultaneously
After discharging the second secondary node,
Masterslave switchover is carried out to the first primary node and the second secondary node, so that the first primary node carrying first is primary
Subsystem, and the second secondary node carries the first standby subsystem.
Further, processor 51 can also be realized, when there is malfunctioning node, and this malfunctioning node include carrying second main
With the second primary node of subsystem, and this malfunctioning node does not include the secondary node that secondary node or this malfunctioning node include
When the standby subsystem carrying not is the standby subsystem of acquiescence of this second primary subsystem, processor 51 passes through network interface
53 to the 3rd secondary nodes send the main message of liter, and (the 3rd standby subsystem is the standby subsystem of acquiescence of the second primary subsystem
System), this message is used for instruction the 3rd secondary node and carries the second primary subsystem;Or, processor 51 passes through network interface 53
Send the configured information (including the start-up parameter of the second primary subsystem) carrying the second primary subsystem to the 4th secondary node,
This configured information is used for the second primary binding of subsystems to the 4th secondary node, so that the 4th secondary node carrying second is primary
Subsystem, the 4th standby subsystem is the standby subsystem of non-default of the second primary subsystem.
Preferably, processor 51 can also be realized, before the 3rd secondary node carries the second primary subsystem,
Judge that the second primary subsystem whether there is and give tacit consent to standby subsystem;
If existing, by the second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem;
If not existing, by the second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, the
Four standby subsystems are the standby subsystem of non-default of the second primary subsystem.
Preferably, processor 51 can also be realized, by the second primary binding of subsystems to the 3rd secondary node or the 4th
After secondary node,
Detect whether the second primary node recovers normal;
If the second primary node recovers normal, by the second primary binding of subsystems to the second primary node.
Further, processor 51 can also be realized, when there is malfunctioning node, and this malfunctioning node include carrying the 5th standby
With the 5th secondary node of subsystem, and this malfunctioning node does not include the primary node that primary node or this malfunctioning node include
When the standby subsystem of acquiescence of the primary subsystem carrying not is five standby subsystem, processor 51 is connect by network
Mouth 53 to the 6th secondary nodes send the configured information (priority ratio of the 6th standby subsystem carrying the 5th standby subsystems
The priority of the 5th standby subsystem is low), this configured information is used for the 5th standby binding of subsystems to the 6th secondary node, with
The 6th secondary node is made to carry the 5th standby subsystem.
Further, processor 51 can also be realized, and is primary subsystem and standby subsystem setting priority.
Further, processor 51 can also be realized, and is that standby subsystem is given tacit consent in primary subsystem setting.
It will be understood by those skilled in the art that the structure shown in Figure 15 does not constitute the limit to redundancy protecting equipment
Fixed, it can include ratio and illustrate more or less of part, or combines some parts, or different part arrangements.
The beneficial effect of said system provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem
First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal
During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem
System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary
The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system
Property.Further, since the embodiment of the present invention is in primary subsystem and standby subsystem simultaneous faults, primary subsystem is acceptable
Seize other secondary nodes, therefore can ensure that the operation of system using less secondary node, saved resource.In addition,
When carrying the secondary node fault of standby subsystem of high priority, can be seized according to priority and carry the standby of low priority
With the secondary node of subsystem, therefore can preferentially ensure that the primary subsystem of high priority exists and give tacit consent to standby subsystem, from
And improve the reliability of system further, and the mode seized according to priority, realize simple.
Embodiment 6
Embodiments provide a kind of redundancy protection systems, referring to Figure 16, this system include multiple nodes 601, with
And redundancy protection device 602.Wherein, this redundancy protection device 602 can be protected for the redundancy providing in the embodiment of the present invention 3 or 4
The redundancy protecting equipment providing in protection unit or the embodiment of the present invention 5.
Wherein, node 601 can be the process on server or server.
Further, when node 601 be server on process when, carry primary subsystem the quantity of primary node and
The quantity carrying the secondary node of standby subsystem is equal, and each primary subsystem has a standby subsystem of acquiescence, with
The process that the standby subsystem of acquiescence of the primary subsystem that the primary node on one server carries is located, is evenly distributed in and is somebody's turn to do
On the two neighboring server of server.
As it was previously stated, the system of the present embodiment is applied to the RNC and IT (Information of other Cluster form
Technology, information technology) product.
The beneficial effect of said system provided in an embodiment of the present invention is:By carrying the first of the first primary subsystem
First secondary node of primary node and carrying the first standby subsystem (the standby subsystem of acquiescence of the first primary subsystem) is equal
During fault, by the first primary binding of subsystems to carrying the second standby subsystem (non-default spare of the first primary subsystem
System) the second secondary node so that this second secondary node carry the first primary subsystem, thereby may be ensured that first is primary
The normal operation of subsystem;Therefore, the embodiment of the present invention can support multiple node failures, effectively increases the reliability of system
Property.
It should be noted that:The redundancy protection device that above-described embodiment provides when executing redundancy protecting, only with above-mentioned each
The division of functional module is illustrated, and in practical application, can distribute above-mentioned functions by different work(as desired
Module can complete, the internal structure of device will be divided into different functional modules, with complete described above all or portion
Divide function.In addition, the redundancy protection device that above-described embodiment provides belongs to same design with redundancy protected method embodiment, its tool
Body is realized process and is referred to embodiment of the method, repeats no more here.
The embodiments of the present invention are for illustration only, do not represent the quality of embodiment.
One of ordinary skill in the art will appreciate that all or part of step realizing above-described embodiment can pass through hardware
To complete it is also possible to the hardware being instructed correlation by program is completed, described program can be stored in a kind of computer-readable
In storage medium, storage medium mentioned above can be read only memory, disk or CD etc..
The foregoing is only presently preferred embodiments of the present invention, not in order to limit the present invention, all spirit in the present invention and
Within principle, any modification, equivalent substitution and improvement made etc., should be included within the scope of the present invention.
Claims (36)
1. a kind of redundancy protected method is it is characterised in that methods described includes:
Detect whether there is malfunctioning node;
When there is malfunctioning node, and described malfunctioning node includes carrying the first primary node of the first primary subsystem and carries
During the first secondary node of the first standby subsystem, by the described first primary binding of subsystems to carrying the second standby subsystem
Second secondary node, so that described second secondary node carries the described first primary subsystem, described first standby subsystem is
The standby subsystem of acquiescence of described first primary subsystem, described second standby subsystem is the non-of the described first primary subsystem
Give tacit consent to standby subsystem.
2. method according to claim 1 it is characterised in that described by the described first primary binding of subsystems to carrying the
Second secondary node of two standby subsystems, so that described second secondary node carries described primary subsystem, including:
In the standby subsystem of non-default of the described first primary subsystem, select the minimum standby subsystem of a priority,
The secondary node that selected described standby subsystem is located as described second secondary node, and by the described first primary son
System binding is to described second secondary node.
3. method according to claim 1 and 2 it is characterised in that described by the described first primary binding of subsystems extremely
Carry the second secondary node of the second standby subsystem, so that before described second secondary node carries described primary subsystem,
Methods described also includes:
Judge whether the described first primary subsystem possesses the ability of seizing;
If described first primary subsystem possesses the ability of seizing, the described first primary binding of subsystems is standby to described second
Node.
4. method according to claim 1 it is characterised in that by the described first primary binding of subsystems to described second
After secondary node, methods described also includes:
Detect whether the described first primary node and described first secondary node recover normal;
If described first primary node first recovers normal, by the described first standby binding of subsystems to described first primary section
Point, so that described first primary node carries the described first standby subsystem;
If described first secondary node first recovers normal, by the described first standby binding of subsystems to described first standby section
Point, so that described first secondary node carries the described first standby subsystem.
5. method according to claim 4 it is characterised in that described by the described first standby binding of subsystems to described
First primary node, so that after described first primary node carries the described first standby subsystem, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries
Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
6. method according to claim 5 is it is characterised in that described to the described first primary node and described second standby
After carrying out masterslave switchover with node, methods described also includes:
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first secondary node simultaneously
Discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
7. method according to claim 4 it is characterised in that described by the described first standby binding of subsystems to described
First secondary node, so that after described first secondary node carries the described first standby subsystem, methods described also includes:
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node carries
Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
8. method according to claim 7 is it is characterised in that described to described first secondary node and described second standby
After carrying out masterslave switchover with node, methods described also includes:
After the described first primary node recovers normal, by the described first standby binding of subsystems to described first primary node
And discharge described second secondary node, so that described first primary node carries the described first standby subsystem.
9. method according to claim 8 it is characterised in that described by the described first standby binding of subsystems to described
First primary node after discharging described second secondary node, methods described also includes:
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries
Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
10. method according to claim 1 is it is characterised in that methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes the second primary node of carrying the second primary subsystem, and described
The standby subsystem that the secondary node that malfunctioning node does not include secondary node or described malfunctioning node includes carries not is institute
State the second primary subsystem acquiescence standby subsystem when,
Masterslave switchover is carried out to the 3rd secondary node of the second primary node and carrying the 3rd standby subsystem, so that the described 3rd
Secondary node carries the described second primary subsystem, and described 3rd standby subsystem is that the acquiescence of the described second primary subsystem is standby
Use subsystem.
11. methods according to claim 10 are it is characterised in that described to the second primary node with carry the 3rd standby
3rd secondary node of subsystem carries out masterslave switchover so that described 3rd secondary node carry the described second primary subsystem it
Before, methods described also includes:
Judge that the described second primary subsystem whether there is and give tacit consent to standby subsystem;
If existing, by the described second primary binding of subsystems to the 3rd secondary node carrying the 3rd standby subsystem;
If not existing, by the described second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, institute
State the standby subsystem of non-default that the 4th standby subsystem is the described second primary subsystem.
12. methods according to claim 11 are it is characterised in that by the described second primary binding of subsystems to described the
After three secondary nodes or described 4th secondary node, methods described also includes:
Detect whether the described second primary node recovers normal;
If described second primary node recovers normal, by the described second primary binding of subsystems to described second primary node.
13. methods according to claim 1 are it is characterised in that methods described also includes:
When there is malfunctioning node, and described malfunctioning node includes carrying the 5th secondary node of the 5th standby subsystem, and described
Acquiescence of the primary subsystem that the primary node that malfunctioning node does not include primary node or described malfunctioning node includes carries
When standby subsystem is not described five standby subsystem,
According to the priority of described 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, so that
Described 6th secondary node carries described 5th standby subsystem.
14. methods according to claim 1 are it is characterised in that methods described also includes:
It is primary subsystem and standby subsystem setting priority.
15. methods according to claim 1 are it is characterised in that methods described also includes:
Arrange for primary subsystem and give tacit consent to standby subsystem.
16. methods according to claim 1 are it is characterised in that carrying the quantity of primary node of primary subsystem and holding
The quantity carrying the secondary node of standby subsystem is equal, and each described primary subsystem has a standby subsystem of acquiescence.
17. methods according to claim 16 are it is characterised in that described primary node and described secondary node are service
Process on device, what the standby subsystem of acquiescence of the primary subsystem that the described primary node on same server carries was located enters
Journey, is evenly distributed on two adjacent with described server server.
A kind of 18. redundancy protection devices are it is characterised in that described device includes:
, for detecting whether there is malfunctioning node in detection module;
First performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes carrying
During the first secondary node of the first primary node of the first primary subsystem and carrying the first standby subsystem, by described first
Primary binding of subsystems is to the second secondary node carrying the second standby subsystem, so that described second secondary node carrying is described
First primary subsystem, described first standby subsystem is the standby subsystem of acquiescence of the described first primary subsystem, described
Two standby subsystems are the standby subsystems of non-default of the described first primary subsystem.
19. devices according to claim 18 it is characterised in that described first performing module is used for,
When there is malfunctioning node, and described malfunctioning node includes carrying the first primary node of the first primary subsystem and carries
During the first secondary node of the first standby subsystem, in the standby subsystem of non-default of the described first primary subsystem, select
The minimum standby subsystem of one priority, the secondary node that selected described standby subsystem is located is as described second
Secondary node, and by the described first primary binding of subsystems to described second secondary node.
20. devices according to claim 18 or 19 are it is characterised in that described device also includes:
First judge module, for judging whether the described first primary subsystem possesses the ability of seizing;
Correspondingly, described first performing module is additionally operable to, when the described first primary subsystem possesses and seizes ability, by described
One primary binding of subsystems is to described second secondary node.
21. devices according to claim 18 it is characterised in that
Described detection module is additionally operable to, and whether detection the described first primary node and described first secondary node recover normal;
Correspondingly, described first performing module is additionally operable to, when the described first primary node first recovers normal, then by described first
Standby binding of subsystems is to described first primary node, so that described first primary node carries the described first standby subsystem;
When described first secondary node first recovers normal, then by the described first standby binding of subsystems to described first secondary node,
So that described first secondary node carries the described first standby subsystem.
22. devices according to claim 21 it is characterised in that described first performing module is additionally operable to,
Described by the described first standby binding of subsystems to described first primary node, so that described first primary node carries
After described first standby subsystem,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries
Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
23. devices according to claim 22 it is characterised in that described first performing module is additionally operable to,
After described first secondary node recovers normal, by the described first standby binding of subsystems to described first secondary node simultaneously
Discharge described second secondary node, so that described first secondary node carries the described first standby subsystem.
24. devices according to claim 21 it is characterised in that described first performing module is additionally operable to,
Described by the described first standby binding of subsystems to described first secondary node, so that described first secondary node carries
After described first standby subsystem,
Masterslave switchover is carried out to described first secondary node and described second secondary node, so that described first secondary node carries
Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
25. devices according to claim 24 it is characterised in that described first performing module is additionally operable to,
After the described first primary node recovers normal, by the described first standby binding of subsystems to described first primary node
And discharge described second secondary node, so that described first primary node carries the described first standby subsystem.
26. devices according to claim 25 it is characterised in that described first performing module is additionally operable to,
Masterslave switchover is carried out to the described first primary node and described second secondary node, so that described first primary node carries
Described first primary subsystem, and described second secondary node carries the described first standby subsystem.
27. devices according to claim 18 are it is characterised in that described device also includes:
Second performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes carrying
Second primary node of the second primary subsystem, and described malfunctioning node does not include secondary node or described malfunctioning node includes
The standby subsystem that carries of secondary node be not the described second primary subsystem acquiescence standby subsystem when, by described the
Two primary binding of subsystems are to the 3rd secondary node carrying the 3rd standby subsystem, so that described 3rd secondary node carries institute
State the second primary subsystem, described 3rd standby subsystem is the standby subsystem of acquiescence of the described second primary subsystem.
28. devices according to claim 27 are it is characterised in that described device also includes:
Second judge module, gives tacit consent to standby subsystem for judging that the described second primary subsystem whether there is;
Correspondingly, described second performing module is additionally operable to, in the presence of the judged result of described second judge module is, will be described
Second primary binding of subsystems is to the 3rd secondary node carrying the 3rd standby subsystem;Judgement when described second judge module
When result is not exist, by the described second primary binding of subsystems to the 4th secondary node carrying the 4th standby subsystem, institute
State the standby subsystem of non-default that the 4th standby subsystem is the described second primary subsystem.
29. devices according to claim 28 it is characterised in that
Described detection module is additionally operable to, and whether detection the described second primary node recovers normal;
Correspondingly, described second performing module is additionally operable to, when the described second primary node recovers normal, then primary by described second
Binding of subsystems is to described second primary node.
30. devices according to claim 18 are it is characterised in that described device also includes:
3rd performing module, for detecting the presence of malfunctioning node when described detection module, and described malfunctioning node includes carrying
5th secondary node of the 5th standby subsystem, and described malfunctioning node does not include primary node or described malfunctioning node includes
The standby subsystem of acquiescence of primary subsystem that carries of primary node when being not described five standby subsystem, according to institute
State the priority of the 5th standby subsystem, seize the 6th secondary node carrying the 6th standby subsystem, so that the described 6th is standby
Carry described 5th standby subsystem with node.
31. devices according to claim 18 are it is characterised in that described device also includes:
First setup module, for being primary subsystem and standby subsystem setting priority.
32. devices according to claim 18 are it is characterised in that described device also includes:
Second setup module, gives tacit consent to standby subsystem for arranging for primary subsystem.
A kind of 33. redundancy protecting equipment are it is characterised in that described equipment includes:Processor and memorizer, for execution such as right
Require the redundancy protected method any one of 1 to 17.
A kind of 34. redundancy protection systems are it is characterised in that described system includes:Multiple nodes and such as claim 18-32
Device described in any one, the plurality of node is used for executing task under the control of described device.
35. systems according to claim 34 are it is characterised in that described node is process or server on server.
36. systems according to claim 34 or 35 it is characterised in that when the process that described node is on server,
The quantity of primary node carrying primary subsystem and the quantity of the secondary node carrying standby subsystem are equal, described in each
Primary subsystem has a standby subsystem of acquiescence, the primary subsystem that the described primary node on same server carries
Give tacit consent to the process that standby subsystem is located, be evenly distributed on two adjacent with described server server.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013072890 | 2013-03-19 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103931139A CN103931139A (en) | 2014-07-16 |
CN103931139B true CN103931139B (en) | 2017-02-15 |
Family
ID=51147972
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201380003769.2A Active CN103931139B (en) | 2013-03-19 | 2013-03-19 | Method and device for redundancy protection, and device and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103931139B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105634779B (en) * | 2014-11-04 | 2019-09-03 | 南京中兴软件有限责任公司 | The operation processing method and device of master/slave device |
CN107612720B (en) * | 2017-08-29 | 2021-06-22 | 深圳市盛路物联通讯技术有限公司 | Access point backup method and gateway |
JP7421052B2 (en) * | 2019-03-15 | 2024-01-24 | アイコム株式会社 | How to make server systems and processes redundant |
CN110149366B (en) * | 2019-04-16 | 2022-03-18 | 平安科技(深圳)有限公司 | Method and device for improving availability of cluster system and computer equipment |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102143063A (en) * | 2010-12-29 | 2011-08-03 | 华为技术有限公司 | Method and device for protecting business in cluster system |
CN102231681A (en) * | 2011-06-27 | 2011-11-02 | 中国建设银行股份有限公司 | High availability cluster computer system and fault treatment method thereof |
-
2013
- 2013-03-19 CN CN201380003769.2A patent/CN103931139B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102143063A (en) * | 2010-12-29 | 2011-08-03 | 华为技术有限公司 | Method and device for protecting business in cluster system |
CN102231681A (en) * | 2011-06-27 | 2011-11-02 | 中国建设银行股份有限公司 | High availability cluster computer system and fault treatment method thereof |
Also Published As
Publication number | Publication date |
---|---|
CN103931139A (en) | 2014-07-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10601657B2 (en) | Instance node management method and management device | |
CN103152419B (en) | A kind of high availability cluster management method of cloud computing platform | |
CN105187249B (en) | A kind of fault recovery method and device | |
CN103931139B (en) | Method and device for redundancy protection, and device and system | |
TWI701916B (en) | Method and device for self-recovering management ability in distributed system | |
CN102882704B (en) | Link protection method in the soft reboot escalation process of a kind of ISSU and equipment | |
CN105159798A (en) | Dual-machine hot-standby method for virtual machines, dual-machine hot-standby management server and system | |
CN111209110B (en) | Task scheduling management method, system and storage medium for realizing load balancing | |
CN102025562A (en) | Path detection method and device | |
CN109768884A (en) | The implementation method of communication system and its high availability, device and computer equipment | |
CN105790825A (en) | Method and apparatus for carrying out hot backup on controllers in distributed protection | |
CN105530116A (en) | Virtual network backup and recovery method and corresponding device | |
CN107656847A (en) | Node administration method, system, device and storage medium based on distributed type assemblies | |
CN110166115A (en) | A kind of quantum optical network service fault recovery method and device based on pool of keys | |
CN109324549A (en) | A kind of dual master control equipment configuration method and dual master control equipment | |
CN107329830A (en) | A kind of method and device of distributed cluster system optimization | |
CN111490932A (en) | Communication method and device | |
CN107992752B (en) | Data processing method and device and computer equipment | |
CN116184895A (en) | Redundant unit control system, method, device, computer equipment and storage medium | |
CN113098709B (en) | Network recovery method and device based on distributed networking system and computer equipment | |
CN113794541A (en) | Main/standby switching management method, device, equipment and machine readable storage medium | |
CN103716186B (en) | Artificial telephone traffic system with network fault tolerance capability and method thereof | |
CN106341323A (en) | VRRP state synchronizing method and device | |
CN113472642B (en) | Protection switching method and device | |
CN103227994B (en) | The management method and system of shared resource |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |