CN106452846A - Fault processing method, virtual architecture management system and business management system - Google Patents
Fault processing method, virtual architecture management system and business management system Download PDFInfo
- Publication number
- CN106452846A CN106452846A CN201610841392.3A CN201610841392A CN106452846A CN 106452846 A CN106452846 A CN 106452846A CN 201610841392 A CN201610841392 A CN 201610841392A CN 106452846 A CN106452846 A CN 106452846A
- Authority
- CN
- China
- Prior art keywords
- virtual machine
- management system
- virtual
- business management
- service application
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention provides a fault processing method, a virtual architecture management system, a business management system and a virtual computer system. The fault processing method comprises the following steps: the virtual architecture management system acquires a fault alarm message, wherein the fault alarm message carries identification information and a fault type of a faulty device; the virtual architecture management system determines a first virtual machine set according to the fault warning message, wherein the first virtual machine set comprises at least one first virtual machine that is affected by the faulty device; and the virtual architecture management system sends a state alarm message to the business management system, wherein the state alarm message carries the information of the first virtual machine set. The technical scheme of the invention can quickly notify affected business of the influence of hardware faults, thereby improving the reliability of the business.
Description
Technical field
The present invention relates to field of cloud calculation, more particularly, to a kind of fault handling method, virtual architecture management system, business pipe
Reason system and virtualization computer system.
Background technology
In field of cloud calculation, the operation system of industry-by-industry is implementing virtualization or cloudization deployment.At present, operation system
In service operation on a virtual machine, on the hardware device as shared resource pond, that is, business no longer adopts deploying virtual machine
Traditional specialized hardware or physical server deployment, to realize hardware and software decoupling and to improve resource utilization.
At present, after equipment (as physical host, storage device etc.) breaks down, the fault of faulty equipment can be passed through event
Barrier alarm information is sent to virtual architecture management system, and virtual architecture management system is sent to business pipe fault warning message again
Reason system, determines affected virtual machine and service application by business management system according to fault warning message, and to impacted
Service application execution fault handling operation.This makes business management system need to perceive hardware and the corresponding business of hardware fault
Application, could execute troubleshooting to service application, such that faulty equipment can not rapidly be led to by business management system
Know the service application being affected to faulty equipment, affect service application reliability.
Content of the invention
The present invention provides a kind of fault handling method, virtual architecture management system business management system and Virtualization Computer
System, can rapidly by hardware fault, the impact to virtual machine notifies the business that affected to affected virtual machine, thus
Improve service reliability.
In a first aspect, a kind of the invention provides fault handling method.This fault handling method is used for calculating in virtualization
Carry out troubleshooting, this virtualization computer system includes in machine system:Virtual architecture management system, business management system and
At least one virtual machine, at least one virtual machine operates at least one physical equipment, and at least one virtual machine is used for executing
Service application, business management system is used for management business and applies, and virtual architecture management system is used for managing at least one virtual machine
At least one physical equipment.This fault handling method includes:Virtual architecture management system obtains fault warning message, and fault is accused
Alarm message carries identification information and the fault type of faulty equipment;Virtual architecture management system determines according to fault warning message
One virtual machine set, the first virtual machine set includes at least one first virtual machine being affected by described faulty equipment;Virtual frame
Structure management system sends state alarm information to business management system, and state alarm information carries the letter of the first virtual machine set
Breath.
In this fault handling method, after virtual architecture management system gets the fault warning message on faulty equipment, directly
Connect and this fault warning message is analyzed process, obtain one or more virtual machines of faulty equipment impact, and to business pipe
Reason system sends the information of these virtual machines so that business management system can directly obtain according to the information analysiss of these virtual machines
To affected service application, and then affected service application can be processed.Compared with prior art, by virtual frame
The information of the virtual machine that structure management system is directly affected by faulty equipment according to the determination of the fault warning message of faulty equipment so that
Business management system directly can obtain affected service application according to the state alarm information analysis of the first virtual machine set,
Rather than go analysis to obtain affected virtual machine, analyze affected service application again according to the alarm information of faulty equipment.
So that business management system does not need directly to perceive hardware fault, and then can quickly trigger at the impact of service application
Reason, reduces traffic lost, improves the reliability of service application.
In a kind of possible implementation, described fault handling method also includes:Virtual architecture management system is according to event
The fault warning message of barrier hardware determines the impact information of the first virtual machine set, and this impact information is used for indicating faulty equipment pair
The type of impact and/or rank that the first virtual machine in first virtual machine set produces;Correspondingly, state alarm information also may be used
To carry the impact information of the first virtual machine set.
In this fault handling method, virtual architecture management system is according to the fault warning information of faulty equipment except obtaining
Take at least one virtual machine affected, the type of the impact to these virtual machines for the fault of faulty equipment generation can also be obtained
And/or rank, then also carry in the state alarm information sending to business management system for indicating faulty equipment to
The type of impact and/or the impact information of rank that the first virtual machine in one virtual machine set produces, so that business pipe
Reason system or operation system can more be processed to service application according to this impact information, improve service application further
Reliability.
Alternatively, state warning information can also include the identification information of the first virtual machine in the first virtual machine set,
Alarm identifier information, alarm name information, alarm object type information, alarm type information, raising Time information, alarm
Unit type information, alarm components identification information and alarm components name information.
Alternatively, described state warning information can include the fault type information of faulty equipment.
In a kind of possible implementation, the shadow that faulty equipment produces to the first virtual machine in the first virtual machine set
Ring type include fault, excessive risk, risk, low-risk or no impact one or more of type.
Alternatively, faulty equipment includes tightly to the rank of the impact that the first virtual machine in the first virtual machine set produces
Anxious, important or inessential.
In a kind of possible implementation, this fault handling method also includes:Virtual architecture management system receives business
The first request message that management system sends, the first request message is used for indicating virtual machine to be restored, this is to be restored virtual
Machine is a subset in the first virtual machine set;It is to be restored that virtual architecture management system preferentially recovers this according to the first solicited message
Virtual machine.
In this fault handling method, virtual architecture management system can be according to the request of business management system, according to business
The priority of management system instruction, at least one of first virtual machine set that the fault by faulty equipment is affected is virtual
Machine carries out recovery process.
Alternatively, the recovery that virtual architecture management system is carried out to virtual machine processes and can include:Live migration of virtual machine.
In a kind of possible implementation, this fault handling method also includes:If virtual architecture management system is preset
Do not receive the first solicited message of business management system transmission in time threshold, then recover according to preset virtual machine recovery policy
The first virtual machine in first virtual machine set.
This fault handling method can ensure that how business management system does not have information instruction virtual architecture management system
When recovering the virtual machine in the first virtual machine set, virtual architecture management system can be actively according to the recovery policy being pre-configured with
The first virtual machine in first virtual machine set is recovered.
In a kind of possible implementation, this fault handling method also includes:Virtual architecture management system is to business pipe
Reason system sends state alarm clearance message.
In this fault handling method, virtual architecture management system carries out after recovery process to virtual machine, to operational control system
System transmission state alarm clearance message is so that business management system can receive according to before this state alarm clearance message erasing
Correlation state alarm information, thus avoiding the business management system state alarm information related to recovered virtual machine
It is analyzed processing.
Second aspect, the invention provides a kind of virtual architecture management system, described virtual architecture management system includes using
The modules of the fault handling method in execution first aspect or any one possible implementation of first aspect.
The virtual architecture management system that the present invention provides is after getting the fault warning message on faulty equipment, directly right
This fault warning message is analyzed processing, and obtains one or more virtual machines of faulty equipment impact, and to operational control system
System sends the information of these virtual machines so that business management system can be directly subject to according to the information analysiss of these virtual machines
The service application of impact, and then can be processed with affected service application.Compared with prior art, managed by virtual architecture
The information of the virtual machine that system is directly affected by faulty equipment according to the determination of the fault warning message of faulty equipment is so that business pipe
Reason system directly can obtain affected service application according to the state alarm information analysis of the first virtual machine set, rather than
Analysis is gone to obtain affected virtual machine, analyze affected service application again according to the alarm information of faulty equipment.So that
Obtaining business management system does not need directly to perceive hardware fault, and then can quickly trigger the impact process of service application, reduces
Traffic lost, improves the reliability of service application.
The third aspect, the present invention provides a kind of virtual architecture management system, and described virtual architecture management system includes processing
Device, memorizer, communication interface and bus.Wherein, processor, memorizer, communication interface are communicated by bus it is also possible to be led to
Cross and other means such as be wirelessly transferred and realize communication.This memorizer is used for store instruction, and this processor is used for executing this memorizer to be deposited
The instruction of storage.This memory storage program code, and processor can call the program code execution first of storage in memorizer
Fault handling method in aspect and any one possible implementation of first aspect.
Fourth aspect, the invention provides a kind of computer-readable medium, described computer-readable medium storage is used for empty
Intend the program code of architecture management system execution, described program code include for execute first aspect and first aspect any one
The instruction of the fault handling method in possible implementation.
5th aspect, present invention also offers a kind of fault handling method, this fault handling method is based in virtualization
Carry out troubleshooting, virtualization computer system includes in calculation machine system:Virtual architecture management system, business management system and
At least one virtual machine, this at least one virtual machine operates at least one physical equipment, and this at least one virtual machine is used for
Execution service application, business management system is used for management business and applies, and virtual architecture management system is used for managing this at least one
Virtual machine and this at least one physical equipment;This fault handling method includes:Business management system receives virtual architecture management system
The state alarm information that system sends, this state alarm information carries the information of the first virtual machine set being affected by faulty equipment,
First virtual machine set includes at least one first virtual machine;Business management system determines at least one according to state alarm information
The service application of individual first virtual machine association;Business management system processes operation to the service application execution of association.
In this fault handling method, business management system receives from virtual architecture management system to be affected by faulty equipment
After the information of virtual machine in first virtual machine set, can directly be obtained affected according to the information analysiss of these virtual machines
Service application, and then affected service application can be processed.Compared with prior art, business management system can be straight
Connect and affected service application is obtained according to the state alarm information analysis of the first virtual machine set, rather than according to faulty equipment
Alarm information go analysis to obtain affected virtual machine, analyze affected service application again.So that operational control system
System does not need directly to perceive hardware fault, and then can quickly trigger the impact process of service application, reduces traffic lost, improves
The reliability of service application.
In a kind of possible implementation, the state alarm information of the first virtual machine set also carries the first virtual machine collection
The impact information closed, this impact information is used for indicating faulty equipment at least one of the first virtual machine set the first virtual machine
The type of impact producing and/or rank.Correspondingly, business management system processes operation to service application execution and includes:Business
Management system processes operation according to the impact information of the first virtual machine set to service application execution.
In this fault handling method, the first virtual machine set that business management system receives from virtual architecture management system
Also carry in state alarm information for indicating the impact that faulty equipment produces to the first virtual machine in the first virtual machine set
Type and/or rank impact information so that business management system or operation system can more be believed according to this impact
Breath is processed to service application, improves the reliability of service application further.
Alternatively, state warning information can also include the identification information of the first virtual machine in the first virtual machine set,
Alarm identifier information, alarm name information, alarm object type information, alarm type information, raising Time information, alarm
Unit type information, alarm components identification information and alarm components name information.
Alternatively, described state warning information can include the fault type information of faulty equipment.
In a kind of possible implementation, first virtual machine set produce impact type include fault, excessive risk,
Risk, low-risk or no impact one or more of type.
Alternatively, faulty equipment includes tightly to the rank of the impact that the first virtual machine in the first virtual machine set produces
Anxious, important or inessential.
In a kind of possible implementation, process operation and include at least one in the following manner:
The service application of at least one the first virtual machine association is switched to by business management system to be affected by faulty equipment
Virtual machine execution;Or
The application state information of at least one the first virtual machine is designated isolation, isolation by business management system
For indicating the service application of at least one first at least one first virtual machine association of virtual machine stopping execution;Or
Business management system sends the first request message to virtual architecture management system, and the first request message is treated for instruction
The virtual machine recovering, virtual machine to be restored is a subset in the first virtual machine set;Or
The control node of the service application at least one the first virtual machine association for the business management system sends state alarm
Message, so that the service application of at least one the first virtual machine association is switched to not by control node according to state alarm information
The virtual machine being affected by faulty equipment executes or the application state information of at least one the first virtual machine is designated isolation.
In a kind of possible implementation, this fault handling method also includes:Business management system is virtual according to first
The impact information of machine set determines the first request message.
In this fault handling method, according to the impact information of the first virtual machine, business management system can determine that first is virtual
Need the priority of the virtual machine to be restored of virtual architecture management system recovery in machine set, and send out to virtual machine architecture management
The first request message sending recovery priority for indicating these virtual machines to be restored is so that virtual architecture management system can
With the priority being indicated according to business management system, in the first virtual machine set that the fault by faulty equipment is affected extremely
A few virtual machine carries out recovery process.
In a kind of possible implementation, business management system to virtual architecture management system send for instruction treat extensive
A kind of specific implementation of the first request message of the recovery priority of multiple virtual machine can be:Business management system according to
The priority of service application sends the first request message to virtual architecture management system.
In this fault handling method, business management system is associated according to the first virtual machine in the first virtual machine set
The priority of service application, that is, the priority indication virtual architecture management system of the service application according to faulty equipment impact is to the
Virtual machine to be restored in one virtual machine set carries out recovery process, thereby may be ensured that the service application of high priority is permissible
Preferentially it is restored, be further ensured that the reliability of service application.
Alternatively, business management system can be according to the impact information of the first virtual machine set and associated service application
Priority to virtual architecture management system send the first request message.
In a kind of possible implementation, business management system sends the first request message to virtual architecture management system
A kind of specific implementation be:Business management system sends to virtual architecture management system according to the deployment mode of service application
First request message, the deployment mode of service application is included in active-standby mode, load sharing pattern and single virtual machine mode extremely
Few one kind.
In this fault handling method, business management system according to the deployment mode of service application, that is, according to faulty equipment shadow
The deployment mode instruction virtual architecture management system of the service application rung is to the virtual machine to be restored in the first virtual machine set
Carry out recovery process.
Alternatively, business management system can be according to the deployment mould of the impact information of the first virtual machine set and service application
Formula sends the first request message to virtual architecture management system, or can be according to the deployment mode of service application and service application
Priority to virtual architecture management system send the first request message, or can according to the impact information of the first virtual machine set,
The priority of the deployment mode of service application and service application sends the first request message to virtual architecture management system.
In a kind of possible implementation, this fault handling method also includes:Business management system receives virtual architecture
The state alarm clearance message that management system sends;Business management system receives according to before this state alarm clearance message erasing
Correlation state alarm information.
In this fault handling method, business management system can alert clear according to the state that virtual architecture management system sends
Except the state alarm information of the correlation receiving before message erasing, thus avoiding the related state of recovered virtual machine is accused
Alarm message is analyzed processing.
6th aspect, the invention provides a kind of business management system, described business management system is included for execution the
The modules of the fault handling method in the arbitrary possible implementation of five aspects or the 5th aspect.
7th aspect, the invention provides a kind of business management system, described business management system includes processor, storage
Device, communication interface and bus.Wherein, processor, memorizer, communication interface are communicated by bus it is also possible to be passed through wireless
Other means such as transmission realize communication.This memorizer is used for store instruction, and this processor is used for executing the finger of this memory storage
Order.This memory storage program code, and processor can call in memorizer the program code of storage execute the 5th aspect and
Fault handling method in any one possible implementation of 5th aspect.
Eighth aspect, the invention provides a kind of computer-readable medium, described computer-readable medium storage is for industry
The program code of business management system execution, described program code includes the arbitrary possibility for execution the 5th aspect or the 5th aspect
Implementation in fault handling method instruction.
9th aspect, the invention provides a kind of virtualization computer system, including virtual management node and operational control
Node, this virtual management node is used for executing at the fault in first aspect or the arbitrary possible implementation of first aspect
Reason method, this operational control node is used at the fault in execution the 5th aspect or the arbitrary possible implementation of the 5th aspect
Reason method.
Brief description
In order to be illustrated more clearly that the technical scheme of the embodiment of the present invention, will make to required in the embodiment of the present invention below
Accompanying drawing be briefly described it should be apparent that, drawings described below be only some embodiments of the present invention attached
Figure.
Figure 1A is the exemplary system structure chart of the fault handling method of the application embodiment of the present invention.
Figure 1B is another kind of exemplary system structure chart of the fault handling method of the application embodiment of the present invention.
Fig. 2 is the indicative flowchart of the fault handling method of one embodiment of the invention.
Fig. 3 is the indicative flowchart of the fault handling method of another embodiment of the present invention.
Fig. 4 is the indicative flowchart of the fault handling method of another embodiment of the present invention.
Fig. 5 is the schematic diagram of the virtual architecture management system of one embodiment of the invention.
Fig. 6 is the schematic diagram of the business management system of one embodiment of the invention.
Fig. 7 is the schematic diagram of the virtual architecture management system of another embodiment of the present invention.
Fig. 8 is the schematic diagram of the business management system of another embodiment of the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation description is it is clear that described embodiment a part of embodiment that is the present invention, rather than whole embodiments.
In order to make it easy to understand, first description can implement the system tray of the fault handling method of the embodiment of the present invention on the whole
The exemplary plot of structure.It should be understood that the embodiment of the present invention is not limited in the virtualization computer system shown in Figure 1A and Figure 1B, this
Outward, the device in Figure 1A and Figure 1B can be hardware or the software functionally dividing or the above combination of the two.
Hardware resource (Hardware Resources) 110 can include one or more equipment, and each equipment can be
The hardware device resources such as X86 server, storage device, the network equipment, can be used for providing the hardware capabilitys such as calculating, storage, network.
Virtualization layer (Virtualization Layer) 120 passes through Intel Virtualization Technology to hardware such as calculating, storage, networks
Resource is virtualized, and wherein, Intel Virtualization Technology can use Xen, HyperV, it is possible to use KVM, the present invention does not limit
System.
Virtual resource (Virtual Resources) 130 refers to by Intel Virtualization Technology, hardware resource 110 to be carried out virtual
Change the virtual resource being formed, such as virtual computing, virtual network, virtual memory etc..
Hardware resource 110, virtualization layer 120 and virtual resource 130 can become virtual architecture layer (Virtualized again
Infrastructure Layer), provide the infrastructure layer such as virtual resource or virtualization pool for upper-layer service.
One or more service application functions are disposed, each service application is deployed in one or more in operation system 140
On virtual machine, that is, these virtual machines are used for executing service application.Deploying virtual machine is on the equipment in hardware resource 110.
Each service application has corresponding control node.Control node is used for corresponding service application is managed.Control
Node processed is alternatively referred to as arbitration node.Control node can be deployed in operation system, and a control node can manage respectively
A corresponding service application, as shown in Figure 1A;One control node can also manage multiple service application, as shown in Figure 1B.
Control node can refer to the hardware unit that corresponding service application is managed it is also possible to refer to the many of service application operation
One of individual virtual machine virtual machine.
Virtual architecture management (Virtualized Infrastructure Manager) system 150 realizes virtualization basis
The management of facility, the equipment be responsible for physical hardware (i.e. hardware resource 110) virtual resources and be deployed in hardware resource 110
On virtual machine carry out being managed collectively, monitor, scheduling of resource, troubleshooting etc., running for operation system provides resource to support, and
Open interface etc. is provided.Virtualization architecture management system 150 is referred to as being the ingredient of virtualization layer.
Business management system 160, for being managed to the service application run on a virtual machine, such as creating business should
With schedule virtual resources in the application of, distributing service, service application and close service application etc..Business management system can manage
One or more service application.Business management system calls the interface that virtual architecture management system provides, and is that service application is run
Resource is provided, realizes service application granting, deployment etc..Business management system 160 is docked with virtual architecture management system 150.When
So, business management system can be docked with multiple virtual architecture management systems.
Wherein, business management system 160 and operation system 140 can be collectively referred to as application layer again.Business management system 160 and industry
Business system 140 can be the separate system of logic, as shown in Figure 1A and 1B it is also possible to realize the function of the two by a system.
It is specifically described in the following specifically describes of the embodiment of the present invention taking virtualization computer system shown in Figure 1A as a example.
As shown in the above, operation system operates on the virtual machine in virtual resource 130, and operation system does not need to close
The specific hardware device of the heart is it is not required that know virtual machine specifically operation, the industry on which hardware device that service application is located
Business management system and operation system all do not need the impact to service application of direct awareness apparatus and faulty equipment.
Therefore the present invention proposes new fault handling method, virtual architecture management system, business management system and virtualization
Computer system so that business management system impact to service application without direct awareness apparatus and equipment fault, but
The impact to VM for the equipment fault can be known from virtual architecture management system, such that it is able to rapidly know that affected business should
With, and then enable affected service application to be quickly obtained process.
As a example virtualization computer system shown in below by Figure 1A, the fault handling method of the embodiment of the present invention is carried out
Detailed introduction.
Fig. 2 is the indicative flowchart of the fault handling method of the embodiment of the present invention.It should be understood that Fig. 2 shows at fault
The step of reason method or operation, but these steps or operation be only example, the embodiment of the present invention can also carry out other operation or
The deformation of each operation in person Fig. 2.Additionally, each step in Fig. 2 can be held according to from the different order that Fig. 2 presents
Row is not and it is possible to really want to execute all operationss in Fig. 2.
S210, virtual architecture management system obtains fault warning message, and fault warning message carries the mark of faulty equipment
Information and fault type.
Wherein, faulty equipment can be any one or more equipment, fault type in hardware resource 110 shown in Figure 1A
Including machine failure or fractional hardware fault.
For example, if faulty equipment is X86 server, fault type can for X86 server machine failure or
At least one of CPU, internal memory, network interface card, disk hardware fault in X86 server.
In the embodiment of the present invention, faulty equipment (as server, storage device etc.) can be with quick detection faults itself, then
Virtual architecture management system can in several ways or agreement obtains the fault warning message of faulty equipment, and such as faulty equipment can
To be managed to virtual architecture by Simple Network Management Protocol (Simple Network Management Protocol, SNMP)
The fault warning message of system reporting fault equipment, or virtual architecture management system can be transmitted by declarative state
The fault warning message of (Representational State Transfer, REST) interface polls faulty equipment.
S220, virtual architecture management system determines the first virtual machine set according to the fault warning message of faulty equipment, the
One virtual machine set includes at least one first virtual machine being affected by faulty equipment.
After virtual architecture management system gets the fault warning message of faulty equipment, determined according to this fault warning message
The the first virtual machine set being affected by faulty equipment.Virtual architecture management system gets the fault warning message of faulty equipment
Afterwards, determine that the first virtual machine set determines that the specific implementation of the first virtual machine set is permissible according to this fault warning message
It is:Virtual architecture management system according to the identification information of faulty equipment and fault type, from the data of virtual architecture management system
In storehouse, inquire about all or part of virtual machine that the fault being deployed on this faulty equipment and being occurred by this faulty equipment is affected
Information.In order to subsequent descriptions are convenient, each virtual machine in affected virtual machine can be referred to as the first virtual machine, own
First virtual machine form the first virtual machine set.
S230, virtual architecture management system sends state alarm information wherein to business management system, state alarm information
Carry the information of the first virtual machine set.
When virtual architecture management system generates a bar state alarm information respectively for multiple first virtual machines, virtual architecture
These state alarm informations once can be sent to business management system it is also possible to be sent to business pipe several times by management system
Reason system.
Certainly, virtual architecture management system can also generate a state alarm information for all affected virtual machines,
I.e. in the first virtual machine set, all first virtual machines generate a state alarm information, the invention is not limited in this regard.
After business management system receives the state alarm information of the first virtual machine set that virtual architecture management system sends,
Can store this state alarm information, such as by this state alarm information record or the data base being saved in business management system
In.
S240, business management system determines in the first virtual machine set according to the state alarm information of the first virtual machine set
The service application of at least one the first virtual machine association.
Business management system receives the state alarm information of the first virtual machine set of virtual architecture management system transmission
Afterwards, this state warning information is associated with service application, identification specifically affected service application, specific implementation is permissible
For:The information of affected first virtual machine carrying in the state alarm information according to the first virtual machine set, from business pipe
In the data base of reason system or configuration file, the corresponding relation of inquiry the first virtual machine and service application, identifies and is specifically subject to shadow
The service application rung.
S250, the service application that business management system associates at least one of first virtual machine set first virtual machine
Execution processes operation.
Specifically, the service application execution to the first virtual machine association in the first virtual machine set for the business management system
Processing a kind of implementation operating can be:It is virtual that business management system sends first to the corresponding control node of service application
The information of machine set.Wherein, the information of the first virtual machine set is used for indicating that control node is carried out at recovery to this service application
Reason.
Alternatively, the business that business management system associates at least one of first virtual machine set first virtual machine should
Process operation with execution and include at least one in the following manner:
Mode one:The service application of at least one the first virtual machine association affected is switched to not by business management system
The virtual machine being affected by described faulty equipment executes
Mode two:The application state information of business management system at least one the first virtual machine just described is designated isolation
State, described isolation is used for indicating that at least one first virtual machine described stops execution at least one first virtual machine described
The service application of association, isolates affected virtual machine in service application.
Mode three:Business management system sends the first request message to virtual architecture management system, and the first request message is used
In indicating virtual machine to be restored, virtual machine to be restored is a subset in described first virtual machine set.
Mode four:The control node of the service application at least one the first virtual machine association for the business management system sends shape
State alarm information, so that control node should by the business of at least one the first virtual machine association according to described state alarm information
With switching to the virtual machine execution being affected by described faulty equipment or the application state by least one the first virtual machine described
Message identification is described isolation.
In the embodiment of the present invention, after virtual architecture management system gets the fault warning message on faulty equipment, directly
This fault warning message is analyzed process, obtains one or more virtual machines of faulty equipment impact, and to operational control
System sends the information of these virtual machines.Business management system can directly obtain by shadow according to the information analysiss of these virtual machines
The service application rung, and then affected service application can be processed.Compared with prior art, managed by virtual architecture
The information of the virtual machine that system is directly affected by faulty equipment according to the determination of the fault warning message of faulty equipment is so that business pipe
Reason system directly can obtain affected service application according to the state alarm information analysis of the first virtual machine set, rather than
Analysis is gone to obtain affected virtual machine, analyze affected service application again according to the alarm information of faulty equipment.So that
Obtaining business management system does not need directly to perceive hardware fault, and then can quickly trigger the impact process of service application, reduces
Traffic lost, improves the reliability of service application.
In the embodiment of the present invention, alternatively, virtual architecture management system can be according to the fault warning message of faulty equipment
Determine the impact information of the first virtual machine set, this impact information be used for indicating faulty equipment in the first virtual machine set at least
The type of impact of one the first virtual machine generation and/or rank.Accordingly, virtual machine architecture management system is to operational control system
The state alarm information that system sends can also carry this impact information, and business management system goes out to receive from virtual architecture management system
State alarm information can carry this impact information.Then business management system is according to the impact information of the first virtual machine set
Service application execution to the first virtual machine association in described first virtual machine set processes operation.
User can define the type that affected of fault and/or the rank that virtual machine is subject to faulty equipment according to demand, under
Face is that the virtual machine of the embodiment of the present invention is subject to the type of impact of faulty equipment and the example of rank.
When physical server occurs machine failure (to include electricity, host operating system fault etc. under physical server can not carry
Fault for computing resource), storage device failure situations such as (electric under storage device, whole chain rupture) and other hardware faults
When leading to virtual machine cannot run, providing service for business, then the affected type of virtual machine may be configured as fault, and rank can set
It is set to urgent.For network interface card or other hardware faults, if lead to virtual machine cannot normal work when, the affected class of virtual machine
Type can be fault, and rank could be arranged to promptly.
When physical server generation part fault, such as central processing unit (Central Processing Unit, CPU), interior
Depositing, when part network interface card breaks down, if temporarily do not affect virtual machine running, but there is the situation of operation risk, then virtual machine is subject to
The type of impact could be arranged to excessive risk, and rank may be configured as important.
When storage device generation part fault, such as part of links interruption, segment controller fault etc., if temporarily do not affect void
Plan machine runs, but there is the situation of operation risk, then the affected type of virtual machine could be arranged to risk, and rank can set
It is set to secondary.
Under normal circumstances, when every hardware fault leads to virtual machine cannot run or externally cannot provide service, virtual machine
Affected type all could be arranged to fault, and rank all could be arranged to promptly.
And for not affecting the hardware fault that any virtual machine runs, then can be not provided with the affected type of virtual machine
And rank, or the affected type of virtual machine can be set for low-risk or devoid of risk, rank is prompting.
By the description of content above, virtual architecture management system disappears to the state alarm that business management system sends
Breath can include the information of the first virtual machine set, i.e. the mark of at least one the first virtual machine affected.Can also include
The impact information of the first virtual machine set, that is, faulty equipment is at least one first virtual machine generation in the first virtual machine set
The type of impact and/or rank.
Alternatively, alarm statuses message can also include generation time, checkout time, alarm synchronization number, alarm name, announcement
Alert object type etc..Except above- mentioned information, the state alarm information of virtual machine can also carry failure cause of faulty equipment etc.
Information.Certainly, the information that the state alarm information of virtual machine includes is not limited to the above-mentioned content enumerated.
In the method for above-mentioned virtual architecture management system execution, although virtual architecture management system is to faulty equipment
Alarm information is analyzed processing, and the information obtaining to business management system offer analysis, but virtual architecture management system
This fault handling method of execution is the follow-up virtual machine being affected by fault process or subsequently on the industry being affected by fault
Effective method before the process of business application, therefore can be referred to as fault handling method without meaning.
In the embodiment of the present invention, alternatively, business management system can call the interface that virtual architecture management system provides,
Request virtual architecture management system is processed to affected virtual machine.Specifically, business management system can be according to
The impact information of one virtual machine set determines the first request message needing the preferential virtual machine to be restored recovering for instruction,
This virtual machine virtual machine to be restored is a subset in the first virtual machine set.Then business management system is to virtual architecture pipe
Reason system sends this first request message.
Business management system determines in the first virtual machine set according to the impact information of the first virtual machine needs virtual architecture
The priority of the virtual machine that management system is recovered, and send for indicating the extensive of virtual machine to be restored to virtual machine architecture management
First request message of multiple priority so that virtual architecture management system can according to the priority of business management system instruction,
At least one of the first virtual machine set that fault by faulty equipment is affected virtual machine carries out recovery process.
Alternatively, business management system sends the recovery for indicating virtual machine to be restored to virtual architecture management system
Another kind of specific implementation of the first request message of priority can be:Business management system is preferential according to service application
Level sends the first request message to virtual architecture management system.
Specifically, the service application that business management system is associated according to the first virtual machine in the first virtual machine set
Priority, that is, according to faulty equipment impact service application priority indication virtual architecture management system to the first virtual machine
Virtual machine to be restored in set carries out recovery process, thereby may be ensured that the service application of high priority can preferentially obtain
Recover, be further ensured that the reliability of service application.
Recover first as by the first request message, business management system can indicate that virtual machine architecture management system is preferential
The first high virtual machine of virtual machine set medium priority.
Alternatively, business management system can be according to the impact information of the first virtual machine set and associated service application
Priority to virtual architecture management system send the first request message.
Alternatively, business management system sends one kind side of implementing of the first request message to virtual architecture management system
Formula is:Business management system sends the first request message, industry according to the deployment mode of service application to virtual architecture management system
The deployment mode of business application includes at least one in active-standby mode, load sharing pattern and single virtual machine mode.
Specifically, business management system is according to the deployment mode of service application, the i.e. business according to faulty equipment impact
The deployment mode instruction virtual architecture management system of application is recovered to the virtual machine to be restored in the first virtual machine set
Process.
As by the first request message, business management system can indicate that virtual architecture management system is preferentially recovered to dispose mould
Formula is the host virtual machine in the active and standby virtual machine of service application of active-standby mode.
Alternatively, business management system can be according to the deployment mould of the impact information of the first virtual machine set and service application
Formula sends the first request message to virtual architecture management system, or can be according to the deployment mode of service application and service application
Priority to virtual architecture management system send the first request message, or can according to the impact information of the first virtual machine set,
The priority of the deployment mode of service application and service application sends the first request message to virtual architecture management system.
After virtual architecture management system receives the first request message that business management system sends, can be according to the first request
The instruction of message carries out recovery process to the virtual machine to be restored in the first virtual machine set according to certain priority.Virtual
Architecture management system can be virtual machine (vm) migration to the concrete recovery form of virtual machine, will virtual machine move to from faulty equipment
Other normal device;Can also be, using virtual machine snapshot, this virtual machine is recovered on other normal device.
Alternatively, if virtual architecture management system does not receive the use of business management system transmission in preset time threshold value
In the first solicited message indicating the virtual machine needing preferentially to recover in the first virtual machine set, then recover according to preset virtual machine
Strategy recovers the first virtual machine in the first virtual machine set.
So can ensure that there is no how information instruction virtual architecture management system recovers the first void in business management system
During virtual machine in plan machine set, virtual architecture management system can be actively virtual to first according to preset virtual machine recovery policy
At least one of machine set first virtual machine is recovered.
Alternatively, either business management system request virtual architecture management system to affected virtual machine at
Reason, or virtual architecture management system active is processed to affected virtual machine, and virtual architecture management system is to impacted
Virtual machine processed after, all can to business management system send state alarm clearance message, to indicate business management system
The state alarm information corresponding with the virtual machine that this is processed receiving before can be removed.
After business management system receives the state alarm clearance message of virtual architecture management system transmission, can will be corresponding
The state alarm information of virtual machine is removed, and reduces business management system to the maintenance work recovering alarm, such that it is able to save
Resource, improves efficiency.
The concrete form that business management system removes state alarm information can be to delete the state alarm information of storage,
Can also be certain information in modification state alarm information so that this information indicates this corresponding virtual machine of state alarm information
Recovered.
In the embodiment of the present invention, alternatively, according to the information of the first virtual machine set, business management system determines that first is empty
After the service application of the first virtual machine association in plan machine set, first can be sent to the control node of service application association empty
The information of plan machine set.
After the control node of service application receives the information of the first virtual machine set of business management system transmission, can
To be processed to affected service application according to the information of the first virtual machine in the first virtual machine set.
Alternatively, the control node of service application can also according to the deployment mode of service application to service application at
Reason.As when service application is disposed for active-standby mode, if host virtual machine fault, control node needs to carry out active-standby switch;If
Standby VM fault, control node does not need active-standby switch.As when service application is disposed for load sharing pattern, control node will be subject to
The VM isolation of impact.
Alternatively, the control node of service application can be according to the deployment mode of service application and the first virtual machine set
Impact information is processed to service application.As virtual to first in the impact information instruction faulty equipment when the first virtual machine set
The type of the impact of machine is fault, rank is urgent, and service application is disposed for active-standby mode, if main VM fault, controls section
Point needs to carry out active-standby switch, if standby VM fault or service application are inessential, control node can not deal with, that is, control section
Point does not need active-standby switch.It will be appreciated that above-mentioned according to the affected type of virtual machine, rank and deployment mode etc. to service application
The mode being processed is exemplary illustration, and it implements and can be defined according to the demand of user, and the present invention is to this not
It is restricted.
Alternatively, after the completion of the control node of service application is to service application process, can send to business management system
Business Processing feedback message, its result to service application of informing business management system.
With reference to Fig. 3, the fault handling method of the embodiment of the present invention taking physical host fault as a example, is discussed in detail.As
Shown in Fig. 3, including three equipment, respectively calculate node 1, calculate node 2 and calculate node 3.Calculate node 1, calculating
Node 2 and calculate node 3 can be respectively equipment 1 in Figure 1A or Figure 1B, equipment 2 and equipment 3.
2 kinds of service application (Application, APP) are deployed in virtualization computer system.One kind is applied and is
App1, is associated with VM1 and VM2, using active-standby mode deployment.Wherein, VM1 is deployed in calculate node 1, is the primary void of APP1
Plan machine;VM2 is deployed in calculate node 2, is the standby virtual machine of APP1.Another kind is applied as App2, associates with VM3 and VM4,
Using the deployment of load sharing pattern, VM3 is deployed on server computing node 2, and VM4 is deployed in calculate node 3.
S402, when calculate node 1 occurs power down fault, calculate node 1 passes through snmp protocol in virtual architecture management
The fault warning message of report calculate node 1.
S404, virtual architecture management receives fault warning message, and according to this fault warning message, determining is affected by fault
Virtual machine, and produce the state alarm information of virtual machine, comprise the following steps that.
(1) virtual architecture management system receives the hardware fault alarm information of calculate node 1, from virtual architecture management system
Data base in inquiry calculate node 1 on run virtual machine list, getting affected virtual machine has VM1, obtains VM1's
The information such as ID.
(2) because calculate node 1 power down fault leads to VM1 fault, VM1 cannot run offer service, therefore can be by VM1
Affected type is set to fault, and the affected rank of VM1 is set to promptly.
(3) virtual architecture management system produces the state alarm information of VM1, and it carries information and includes:VM1ID, VM1 are subject to shadow
The type (for fault) of sound, generation time, the affected rank of VM1 (being urgent), the fault type of faulty equipment are (for calculating section
Point 1 machine failure) etc..
S406, virtual architecture management system sends the state alarm information of VM1 to business management system.
S408, business management system receives the state alarm information of the virtual machine that virtual architecture management system sends, and obtains
The information such as the ID of VM1, go out the corresponding relation of VM1 and service application from the data base querying of business management system, obtain impacted
Service application be App1.
Business management system sends a notification message to the control node of App1, notifies the fault of VM1.Then this control node
Determined according to notification message and VM2 is upgraded to primary server.
S410, business management system calls the interface that virtual architecture management system provides, and sends out to virtual architecture management system
Send the first request message, ask virtual architecture management system fast quick-recovery VM1.
S412, VM1 is moved in calculate node 3 by virtual architecture management system, and now, VM1 is changed into the standby void of App1
Plan machine.
Now, in specific implementation process, virtual architecture management system can also carry out Fault Isolation to calculate node 1.
S414, after virtual architecture management system recovers VM1, sends VM1 state alarm clearance message to operational control.
After troubleshooting, the deployment scenario of the application in operation system is as shown in Figure 4.Wherein, App1 is using active and standby
Pattern is disposed, and it is primary virtual machine that VM2 is deployed in calculate node 2, and it is standby virtual machine that VM1 is deployed in calculate node 3.
App2 adopts load sharing pattern to dispose, and VM3 is deployed in calculate node 2, and VM4 is deployed in calculate node 3.Calculate node 1
Fault, from resource pool isolation.
In above-described embodiment, after calculate node 1 breaks down, send alarm information to virtual architecture management system, virtual
According to alarm information, architecture management system determines that affected virtual machine is VM1, and determine VM1 by the type that affected and level
Not.Business management system is processed without the direct alarm information to hardware, you can directly at virtual architecture management system
Obtain the impact information that the information of affected VM1 and VM1 are subject to, and then determine that the service application run on VM1 is App1, industry
Business management system notifies the control node of App1 to App1 process, and it is extensive to ask virtual architecture management system that VM1 is carried out
Multiple.Virtual architecture management system moves to VM1 in calculate node 3 according to the request of business management system.The control section of App1
Put the information obtaining VM1 at business management system and VM1 by being affected after information, by standby virtual machine VM2 original for App1
Switch to host virtual machine, and the VM1 that will migrate in calculate node 3 is set to standby virtual machine, thus ensureing the operation of App1,
Improve the reliability of App1.
Describe the fault handling method of the embodiment of the present invention above in conjunction with Fig. 2 to Fig. 4, with reference to Fig. 5 to Fig. 8 introduction
The virtual architecture management system of the embodiment of the present invention and business management system.
Fig. 5 is the schematic diagram of the virtual architecture management system of one embodiment of the invention.It should be understood that Fig. 5 illustrates
Virtual architecture management system 500 be only example, the virtual architecture management system of the embodiment of the present invention may also include other modules
Or unit, or include the intimate module with the modules in Fig. 5, or do not really want including all moulds in Fig. 5
Block.
Acquisition module 510, for obtaining fault warning message, described fault warning message carries the mark letter of faulty equipment
Breath and fault type.
Determining module 520, for determining the first virtual machine set, described first virtual machine according to described fault warning message
Set includes at least one first virtual machine being affected by described faulty equipment.
Sending module 530, for sending state alarm information to described business management system, described state alarm information is taken
Information with described first virtual machine set.
In the embodiment of the present invention, after virtual architecture management system gets the fault warning message on faulty equipment, directly
This fault warning message is analyzed process, obtains one or more virtual machines of faulty equipment impact, and to operational control
System sends the information of these virtual machines so that business management system can directly obtain according to the information analysiss of these virtual machines
Affected service application, and then affected service application can be processed.Compared with prior art, by virtual architecture
The information of the virtual machine that management system is directly affected by faulty equipment according to the determination of the fault warning message of faulty equipment is so that industry
Business management system directly can obtain affected service application according to the state alarm information analysis of the first virtual machine set, and
It is not to go analysis to obtain affected virtual machine, analyze affected service application again according to the alarm information of faulty equipment.From
And make business management system not need directly to perceive hardware fault, and then can quickly trigger the impact process of service application,
Reduce traffic lost, improve the reliability of service application.
Alternatively, as an embodiment, described determining module is additionally operable to according to described fault warning message determines
The impact information of the first virtual machine set, described impact information is used for indicating described faulty equipment at least one first void described
The type of impact and/or rank that plan machine produces.Then described virtual architecture management system sends shape to described business management system
State alarm information also carries the impact information of described first virtual machine set.
In the embodiment of the present invention, virtual architecture management system is according to the fault warning information of faulty equipment except obtaining
At least one virtual machine affected, can also obtain the type of the impact to these virtual machines for the fault of faulty equipment generation
And/or rank, then also carry in the state alarm information sending to business management system for indicating faulty equipment to
The type of impact and/or the impact information of rank that the first virtual machine in one virtual machine set produces, so that business pipe
Reason system or operation system can more be processed to service application according to this impact information, improve service application further
Reliability.
Alternatively, as an embodiment, the impact that described faulty equipment produces at least one first virtual machine described
Type include following at least one:Fault, excessive risk, risk, low-risk or no affect.
Alternatively, as an embodiment, described virtual architecture management system also includes receiver module and recovery module.Institute
State receiver module and be used for indicating for receiving the first request message that described business management system sends, described first request message
Need the preferential virtual machine to be restored recovering, described virtual machine to be restored is a son in described first virtual machine set
Collection.Described recovery module is used for according to the preferential recovery of described first solicited message virtual machine to be restored.
In the embodiment of the present invention, virtual architecture management system can be according to the request of business management system, according to business pipe
The priority of reason system instruction, at least one of first virtual machine set that the fault by faulty equipment is affected virtual machine
Carry out recovery process.
Alternatively, as an embodiment, described recovery module is additionally operable to not receive in preset time threshold value described
During described first solicited message that business management system sends, according to preset virtual machine recovery policy recover described at least one the
One virtual machine.
The embodiment of the present invention can ensure that does not have information instruction virtual architecture management system how extensive in business management system
When answering the virtual machine in the first virtual machine set, virtual architecture management system can be actively according to the recovery policy pair being pre-configured with
The first virtual machine in first virtual machine set is recovered.
Alternatively, as an embodiment, described sending module is additionally operable to send state announcement to described business management system
Alert removing message, described state alarm clearance message is used for indicating that described business management system is removed in described business management system
Described state alarm information.
In the embodiment of the present invention, virtual architecture management system carries out after recovery process to virtual machine, to business management system
Transmission state alarm clearance message is so that business management system can be according to reception before this state alarm clearance message erasing
Related state alarm information, thus avoid the business management system state alarm information related to recovered virtual machine to enter
Row analyzing and processing.
It should be understood that the virtual architecture management system 500 of the embodiment of the present invention can pass through special IC
(Application Specific Integrated Circuit, ASIC) realizes, or PLD
(Programmable Logic Device, PLD) realizes, and above-mentioned PLD can be complicated process logical device (Complex
Programmable Logic Device, CPLD), field programmable gate array (Field-Programmable Gate
Array, FPGA), GAL (Generic Array Logic, GAL) or its combination in any.Fig. 2 is realized by software
During the step being executed by virtual architecture management system in shown fault handling method, virtual architecture management system 500 and its each
Individual module can also be software module.
It should be understood that the virtual architecture management system 500 shown in Fig. 5 may correspond to the void in fault handling method shown in Fig. 2
Intend the above and other operation of unit in architecture management system, and virtual architecture management system 500 and/or function is divided
Not in order to realize the corresponding flow process of the fault handling method in Fig. 2, for sake of simplicity, will not be described here.
Fig. 6 is the schematic diagram of the business management system of one embodiment of the invention.It should be understood that the industry shown in Fig. 6
Business management system 600 be only example, the operation system of the embodiment of the present invention may also include other modules or unit, or include with
Intimate module of the modules in Fig. 6, or do not really want including all modules in Fig. 6.
Receiver module 610, for receiving the state alarm information that described virtual architecture management system sends, described state is accused
Alarm message carries the information of the first virtual machine set being affected by faulty equipment, and described first virtual machine set includes at least one
Individual first virtual machine.
Determining module 620, at least one first virtual machine association according to the determination of described state alarm information
Service application.
Processing module 630, for the service application execution process operation at least one the first virtual machine association described.
In the embodiment of the present invention, business management system receives, from virtual architecture management system, being affected by faulty equipment
After the information of virtual machine in one virtual machine set, directly can obtain affected industry according to the information analysiss of these virtual machines
Business application, and then affected service application can be processed.Compared with prior art, business management system can be direct
Affected service application is obtained according to the state alarm information analysis of the first virtual machine set, rather than according to faulty equipment
Alarm information goes analysis to obtain affected virtual machine, analyze affected service application again.So that business management system
Do not need directly to perceive hardware fault, and then can quickly trigger the impact of service application and process, reduce traffic lost, improve industry
The reliability of business application.
Alternatively, as an embodiment, described state alarm information also carries the impact of described first virtual machine set
Information, described impact information is used for indicating the type to the impact that at least one first virtual machine described produces for the described faulty equipment
And/or rank.Wherein, described processing module specifically for the impact information according to described first virtual machine set to described at least
The service application execution of one the first virtual machine association processes operation.
In the embodiment of the present invention, the shape of the first virtual machine set that business management system receives from virtual architecture management system
Also carry in state alarm information for indicating faulty equipment to the impact that the first virtual machine in the first virtual machine set produces
The impact information of type and/or rank, so that business management system or operation system can be more according to this impact information
Service application is processed, improves the reliability of service application further.
Alternatively, as an embodiment, described first virtual machine set produce impact type include following at least
A kind of:Fault, excessive risk, risk, low-risk or no affect.
Alternatively, as an embodiment, described process operation includes at least one in the following manner:
Described business management system the service application of at least one first virtual machine association described is switched to be not subject to described
The virtual machine execution of faulty equipment impact;Or
The application state information of at least one the first virtual machine described is designated isolation by described business management system,
Described isolation is used for indicating that at least one first virtual machine described stops execution at least one first virtual machine association described
Service application;Or
Described business management system sends the first request message to described virtual architecture management system, and described first request disappears
Cease for indicating virtual machine to be restored, described virtual machine to be restored is a subset in described first virtual machine set;Or
The control node of the service application at least one the first virtual machine association described for the described business management system sends
Described state alarm information so that described control node according to described state alarm information, at least one is first virtual by described
The service application of office's connection switch to described in the virtual machine execution that affected by described faulty equipment or by described at least one the
The application state information of one virtual machine is designated described isolation.
Alternatively, as an embodiment, described determining module is additionally operable to the impact according to described first virtual machine set
Information determines the first request message, and described first request message is used for instruction needs the preferential virtual machine to be restored recovering, institute
Stating virtual machine to be restored is a subset in described first virtual machine set.Described business management system also includes sending mould
Block, for sending described first request message to described virtual architecture management system.
In the embodiment of the present invention, business management system can determine the first virtual machine according to the impact information of the first virtual machine
Need the priority of the virtual machine to be restored of virtual architecture management system recovery in set, and manage transmission to virtual machine architecture
For indicate these virtual machines to be restored recovery priority the first request message so that virtual architecture management system is permissible
The priority being indicated according to business management system, in the first virtual machine set that the fault by faulty equipment is affected at least
One virtual machine carries out recovery process.
Alternatively, as an embodiment, described sending module is additionally operable to according at least one first virtual office described
The priority of the service application of connection sends described first request message to described virtual architecture management system.
In the embodiment of the present invention, the industry that business management system is associated according to the first virtual machine in the first virtual machine set
The priority of business application, that is, the priority indication virtual architecture management system of the service application according to faulty equipment impact is to first
Virtual machine to be restored in virtual machine set carries out recovery process, thereby may be ensured that the service application of high priority can be excellent
First it is restored, be further ensured that the reliability of service application.
Alternatively, as an embodiment, described sending module is additionally operable to according at least one first virtual office described
Connection service application deployment mode to described virtual architecture management system send described first request message, described at least one
The deployment mode of the service application of the first virtual machine association is included in active-standby mode, load sharing pattern and single virtual machine mode
At least one.
In the embodiment of the present invention, business management system according to the deployment mode of affected service application, that is, according to fault
The deployment mode instruction virtual architecture management system of the service application of equipment impact is to be restored in the first virtual machine set
Virtual machine carries out recovery process.
Alternatively, as an embodiment, described receiver module is additionally operable to receive described virtual architecture management system transmission
State alarm clearance message, described processing module is additionally operable to according to described state alarm clearance message erasing state alarm
Message.
In the embodiment of the present invention, the state alarm clearance that business management system can send according to virtual architecture management system
The state alarm information of the correlation receiving before message erasing, thus avoid the state alarm related to recovered virtual machine
Message is analyzed processing.
It should be understood that the business management system 600 of the embodiment of the present invention can be realized by special IC, or can
Programmed logic device is realized, and above-mentioned PLD can be complicated process logical device, field programmable gate array, GAL
Or its combination in any.When the step being executed by business management system in fault handling method shown in Fig. 2 is realized by software, industry
Business management system 600 and its modules can also be software module.
It should be understood that the business management system 600 shown in Fig. 6 may correspond to the business pipe in fault handling method shown in Fig. 2
Above and other operation of the unit in reason system, and business management system 600 and/or function are respectively in order to realize figure
The corresponding flow process of the fault handling method in 2, for sake of simplicity, will not be described here.
Fig. 7 is the schematic diagram of the virtual architecture management system 700 of another embodiment of the present invention.Virtual architecture pipe
Reason system 700 includes processor 710, memorizer 720, communication interface 730 and bus 740.Wherein, processor 710, memorizer
720th, communication interface 730 is communicated by bus 740 it is also possible to other means such as be passed through to be wirelessly transferred to realize communication.This is deposited
Reservoir 720 is used for store instruction, and this processor 710 is used for executing the instruction of this memorizer 720 storage.This memorizer 720 stores
Program code, and processor 710 can call the following operation of program code execution of storage in memorizer 720:
Obtain fault warning message, described fault warning message carries identification information and the fault type of faulty equipment;Root
Determine the first virtual machine set according to described fault warning message, described first virtual machine set includes being affected by described faulty equipment
At least one first virtual machine;Send state alarm information to described business management system, described state alarm information carries
The information of described first virtual machine set.
In the embodiment of the present invention, after virtual architecture management system gets the fault warning message on faulty equipment, directly
This fault warning message is analyzed process, obtains one or more virtual machines of faulty equipment impact, and to operational control
System sends the information of these virtual machines so that business management system can directly obtain according to the information analysiss of these virtual machines
Affected service application, and then affected service application can be processed.Compared with prior art, by virtual architecture
The information of the virtual machine that management system is directly affected by faulty equipment according to the determination of the fault warning message of faulty equipment is so that industry
Business management system directly can obtain affected service application according to the state alarm information analysis of the first virtual machine set, and
It is not to go analysis to obtain affected virtual machine, analyze affected service application again according to the alarm information of faulty equipment.From
And make business management system not need directly to perceive hardware fault, and then can quickly trigger the impact process of service application,
Reduce traffic lost, improve the reliability of service application.
Alternatively, as an embodiment, the program code that processor 710 may call upon storage in memorizer 720 is held
Row is following to be operated:Determine the impact information of described first virtual machine set, described impact information according to described fault warning message
For indicating type and/or the rank of the impact at least one first virtual machine generation described for the described faulty equipment.Wherein, institute
The state alarm information of stating also carries described impact information.
In the embodiment of the present invention, virtual architecture management system is according to the fault warning information of faulty equipment except obtaining
At least one virtual machine affected, can also obtain the type of the impact to these virtual machines for the fault of faulty equipment generation
And/or rank, then also carry in the state alarm information sending to business management system for indicating faulty equipment to
The type of impact and/or the impact information of rank that the first virtual machine in one virtual machine set produces, so that business pipe
Reason system or operation system can more be processed to service application according to this impact information, improve service application further
Reliability.
Alternatively, as an embodiment, the impact that described faulty equipment produces at least one first virtual machine described
Type include following at least one:Fault, excessive risk, risk, low-risk or no affect.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:Receive the first request message that described business management system sends, described first request message is used for instruction to be needed
The preferential virtual machine to be restored recovering, described virtual machine to be restored is a subset in described first virtual machine set.Institute
State processor to be additionally operable to preferentially recover described virtual machine to be restored according to described first solicited message.
In the embodiment of the present invention, virtual architecture management system can be according to the request of business management system, according to business pipe
The priority of reason system instruction, at least one of first virtual machine set that the fault by faulty equipment is affected virtual machine
Carry out recovery process.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:When not receiving described first solicited message that described business management system sends in preset time threshold value, press
Recover at least one first virtual machine described according to preset virtual machine recovery policy.
The embodiment of the present invention can ensure that does not have information instruction virtual architecture management system how extensive in business management system
When answering the virtual machine in the first virtual machine set, virtual architecture management system can be actively according to the recovery policy pair being pre-configured with
The first virtual machine in first virtual machine set is recovered.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:Send state alarm clearance message to described business management system, described state alarm clearance message is used for indicating
Described business management system removes the described state alarm information in described business management system.
In the embodiment of the present invention, virtual architecture management system carries out after recovery process to virtual machine, to business management system
Transmission state alarm clearance message is so that business management system can be according to reception before this state alarm clearance message erasing
Related state alarm information, thus avoid the business management system state alarm information related to recovered virtual machine to enter
Row analyzing and processing.
It should be understood that the virtual architecture management system of the embodiment of the present invention shown in Fig. 7 may correspond to the virtual architecture shown in Fig. 5
Above and other operation of the unit in management system, and the virtual architecture management system of the embodiment of the present invention and/or
Function respectively in order to realize the corresponding flow process being executed in the fault handling method shown in Fig. 2 by virtual architecture management system, in order to
Succinctly, will not be described here.
Fig. 8 is the schematic diagram of the business management system 800 of another embodiment of the present invention.Business management system
800 include processor 810, memorizer 820, communication interface 830 and bus 840.Wherein, processor 810, memorizer 820, communication
Interface 830 is communicated by bus 840 it is also possible to other means such as be passed through to be wirelessly transferred to realize communication.This memorizer 820 is used
In store instruction, this processor 810 is used for executing the instruction of this memorizer 820 storage.This memorizer 820 store program codes,
And processor 810 can call the following operation of program code execution of storage in memorizer 820:
Receive the state alarm information that described virtual architecture management system sends, described state alarm information carries by fault
The information of the first virtual machine set of equipment impact, described first virtual machine set includes at least one first virtual machine;Root
Determine the service application of at least one the first virtual machine association described according to described state alarm information;To described at least one first
The service application execution of virtual machine association processes operation.
In the embodiment of the present invention, business management system receives, from virtual architecture management system, being affected by faulty equipment
After the information of virtual machine in one virtual machine set, directly can obtain affected industry according to the information analysiss of these virtual machines
Business application, and then affected service application can be processed.Compared with prior art, business management system can be direct
Affected service application is obtained according to the state alarm information analysis of the first virtual machine set, rather than according to faulty equipment
Alarm information goes analysis to obtain affected virtual machine, analyze affected service application again.So that business management system
Do not need directly to perceive hardware fault, and then can quickly trigger the impact of service application and process, reduce traffic lost, improve industry
The reliability of business application.
Alternatively, as an embodiment, described state alarm information also carries the impact of described first virtual machine set
Information, described impact information is used for indicating the type to the impact that at least one first virtual machine described produces for the described faulty equipment
And/or rank.Wherein, described processor specifically for the impact information according to described first virtual machine set to described at least one
The service application execution of individual first virtual machine association processes operation.
In the embodiment of the present invention, the shape of the first virtual machine set that business management system receives from virtual architecture management system
Also carry in state alarm information for indicating faulty equipment to the impact that the first virtual machine in the first virtual machine set produces
The impact information of type and/or rank, so that business management system or operation system can be more according to this impact information
Service application is processed, improves the reliability of service application further.
Alternatively, as an embodiment, described first virtual machine set produce impact type include following at least
A kind of:Fault, excessive risk, risk, low-risk or no affect.
Alternatively, as an embodiment, described process operation includes at least one in the following manner:
Described business management system the service application of at least one first virtual machine association described is switched to be not subject to described
The virtual machine execution of faulty equipment impact;Or
The application state information of at least one the first virtual machine described is designated isolation by described business management system,
Described isolation is used for indicating that at least one first virtual machine described stops execution at least one first virtual machine association described
Service application;Or
Described business management system sends the first request message to described virtual architecture management system, and described first request disappears
Cease for indicating virtual machine to be restored, described virtual machine to be restored is a subset in described first virtual machine set;Or
The control node of the service application at least one the first virtual machine association described for the described business management system sends
Described state alarm information so that described control node according to described state alarm information, at least one is first virtual by described
The service application of office's connection switch to described in the virtual machine execution that affected by described faulty equipment or by described at least one the
The application state information of one virtual machine is designated described isolation.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:First request message is determined according to the impact information of described first virtual machine set.Described transmitter 840 be used for
Described virtual architecture management system sends described first request message.
In the embodiment of the present invention, business management system can determine the first virtual machine according to the impact information of the first virtual machine
Need the priority of the virtual machine to be restored of virtual architecture management system recovery in set, and manage transmission to virtual machine architecture
For indicate these virtual machines to be restored recovery priority the first request message so that virtual architecture management system is permissible
The priority being indicated according to business management system, in the first virtual machine set that the fault by faulty equipment is affected at least
One virtual machine carries out recovery process.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:The priority of the service application according at least one the first virtual machine association described manages system to described virtual architecture
System sends described first request message.
In the embodiment of the present invention, the industry that business management system is associated according to the first virtual machine in the first virtual machine set
The priority of business application, that is, the priority indication virtual architecture management system of the service application according to faulty equipment impact is to first
Virtual machine to be restored in virtual machine set carries out recovery process, thereby may be ensured that the service application of high priority can be excellent
First it is restored, be further ensured that the reliability of service application.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:The deployment mode of the service application according at least one the first virtual machine association described manages to described virtual architecture
System sends described first request message, and the deployment mode of the service application of at least one the first virtual machine association described includes leading
At least one in standby pattern, load sharing pattern and single virtual machine mode.
In the embodiment of the present invention, business management system, according to the deployment mode of service application, affects according to faulty equipment
Service application deployment mode instruction virtual architecture management system the virtual machine to be restored in the first virtual machine set is entered
Row recovery is processed.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720
Hereinafter operate:Receive the state alarm clearance message that described virtual architecture management system sends, described processor is additionally operable to basis
State alarm information described in described state alarm clearance message erasing.
In the embodiment of the present invention, the state alarm clearance that business management system can send according to virtual architecture management system
The state alarm information of the correlation receiving before message erasing, thus avoid the state alarm related to recovered virtual machine
Message is analyzed processing.
It should be understood that the business management system of the embodiment of the present invention shown in Fig. 8 may correspond to the operational control system shown in Fig. 6
Unite, and above and other operation of the unit in the business management system of the embodiment of the present invention and/or function are respectively
Realize the corresponding flow process being executed in the fault handling method shown in Fig. 2 by business management system, for sake of simplicity, here is no longer superfluous
State.
It is appreciated that the processor in the embodiment of the present invention can be a kind of IC chip, there is the process of signal
Ability.During realizing, each step of said method embodiment can be by the integrated logic circuit of the hardware in processor
Or the instruction of software form completes.Above-mentioned processor can be general processor, digital signal processor (Digital
Signal Processor, DSP), special IC (Application Specific Integrated Circuit,
ASIC), ready-made programmable gate array (Field Programmable Gate Array, FPGA) or other FPGAs
Device, discrete gate or transistor logic, discrete hardware components.Can realize or execute the public affairs in the embodiment of the present invention
Each method, step and the logic diagram opened.General processor can be microprocessor or this processor can also be any normal
Processor of rule etc..Step in conjunction with the method disclosed in the embodiment of the present invention can be embodied directly in hardware decoding processor and holds
Row completes, or is completed with the hardware in decoding processor and software module combination execution.Software module may be located at deposits at random
Reservoir, flash memory, read only memory, this area such as programmable read only memory or electrically erasable programmable memory, depositor
In ripe storage medium.This storage medium is located at memorizer, and processor reads the information in memorizer, completes in conjunction with its hardware
The step of said method.
It is appreciated that the memorizer in the embodiment of the present invention can be volatile memory or nonvolatile memory, or
May include volatibility and nonvolatile memory.Wherein, nonvolatile memory can be read only memory (Read-
Only Memory, ROM), programmable read only memory (Programmable ROM, PROM), the read-only storage of erasable programmable
Device (Erasable PROM, EPROM), Electrically Erasable Read Only Memory (Electrically EPROM, EEPROM) or
Flash memory.Volatile memory can be random access memory (Random Access Memory, RAM), and it is used as outside high
Speed caching.By exemplary but be not restricted explanation, the RAM of many forms can use, such as static RAM
(Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), Synchronous Dynamic Random Access Memory
(Synchronous DRAM, SDRAM), double data speed synchronous dynamic RAM (Double Data Rate
SDRAM, DDR SDRAM), enhancement mode Synchronous Dynamic Random Access Memory (Enhanced SDRAM, ESDRAM), synchronized links
Dynamic random access memory (Synchlink DRAM, SLDRAM) and direct rambus random access memory (Direct
Rambus RAM, DR RAM).It should be noted that the memorizer of system and method described herein be intended to including but not limited to these and
The memorizer of any other suitable type.
In addition, the terms " system " and " network " are herein often used interchangeably.The terms " and/
Or ", only a kind of incidence relation of description affiliated partner, represents there may be three kinds of relations, for example, A and/or B, can be with table
Show:, there are A and B, these three situations of individualism B in individualism A simultaneously.In addition, character "/" herein, before and after typicallying represent
Affiliated partner is a kind of relation of "or".
It should be understood that in embodiments of the present invention, " B corresponding with A " represents that B is associated with A, can determine B according to A.But
It should also be understood that determining that B is not meant to determine B only according to A according to A, B can also be determined according to A and/or other information.
Those of ordinary skill in the art are it is to be appreciated that combine the list of each example of the embodiments described herein description
Unit and algorithm steps, being capable of being implemented in combination in electronic hardware or computer software and electronic hardware.These functions are actually
To be executed with hardware or software mode, the application-specific depending on technical scheme and design constraint.Professional and technical personnel
Each specific application can be used different methods to realize described function, but this realization is it is not considered that exceed
The scope of the present invention.
Those skilled in the art can be understood that, for convenience and simplicity of description, the system of foregoing description,
Device and the specific work process of unit, may be referred to the corresponding process in preceding method embodiment, will not be described here.
It should be understood that disclosed system, apparatus and method in several embodiments provided herein, permissible
Realize by another way.For example, device embodiment described above is only schematically, for example, described unit
Divide, only a kind of division of logic function, actual can have other dividing mode when realizing, for example multiple units or assembly
Can in conjunction with or be desirably integrated into another system, or some features can be ignored, or does not execute.Another, shown or
The coupling each other discussing or direct-coupling or communication connection can be by some interfaces, the indirect coupling of device or unit
Close or communicate to connect, can be electrical, mechanical or other forms.
The described unit illustrating as separating component can be or may not be physically separate, show as unit
The part showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple
On NE.The mesh to realize this embodiment scheme for some or all of unit therein can be selected according to the actual needs
's.
In addition, can be integrated in a processing unit in each functional unit in each embodiment of the present invention it is also possible to
It is that unit is individually physically present it is also possible to two or more units are integrated in a unit.
If described function realized using in the form of SFU software functional unit and as independent production marketing or use when, permissible
It is stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantially in other words
Partly being embodied in the form of software product of part that prior art is contributed or this technical scheme, this meter
Calculation machine software product is stored in a storage medium, including some instructions with so that a computer equipment (can be individual
People's computer, server, or network equipment etc.) execution each embodiment methods described of the present invention all or part of step.
And aforesaid storage medium includes:USB flash disk, portable hard drive, read only memory (Read-Only Memory, ROM), random access memory are deposited
Reservoir (Random Access Memory, RAM), magnetic disc or CD etc. are various can be with the medium of store program codes.
Claims (30)
1. a kind of fault handling method, for carrying out troubleshooting, described Virtualization Computer in virtualization computer system
System includes:Virtual architecture management system, business management system and at least one virtual machine, described at least one virtual machine fortune
On at least physical equipment, at least one virtual machine described is used for executing service application row, and described business management system is used
In managing described service application, described virtual architecture management system is used for managing described at least one virtual machine and described at least
Platform physical equipment is it is characterised in that described fault handling method includes:
Described virtual architecture management system obtains fault warning message, and described fault warning message carries the mark letter of faulty equipment
Breath and fault type;
Described virtual architecture management system determines the first virtual machine set, described first virtual machine according to described fault warning message
Set includes at least one first virtual machine being affected by described faulty equipment;
Described virtual architecture management system sends state alarm information to described business management system, and described state alarm information is taken
Information with described first virtual machine set.
2. fault handling method according to claim 1 is it is characterised in that described fault handling method also includes:
Described virtual architecture management system determines the impact information of described first virtual machine set according to described fault warning message,
Described impact information be used for indicate described faulty equipment to described at least one first virtual machine produce impact type and/or
Rank;
Then described virtual architecture management system sends state alarm information to described business management system and also carries described first void
The impact information of plan machine set.
3. fault handling method according to claim 2 it is characterised in that described faulty equipment to described at least one
The type of the impact that one virtual machine produces includes following at least one:Fault, excessive risk, risk, low-risk or no affect.
4. according to described fault handling method arbitrary in claims 1 to 3 it is characterised in that described fault handling method also wraps
Include:
Described virtual architecture management system receives the first request message that described business management system sends, and described first request disappears
Cease for indicating virtual machine to be restored, described virtual machine to be restored is a subset in described first virtual machine set;
Described virtual architecture management system preferentially recovers described virtual machine to be restored according to described first solicited message.
5. according to any one of claims 1 to 3 fault handling method it is characterised in that described fault handling method also
Including:
If described virtual architecture management system does not receive described in described business management system transmission in preset time threshold value
First solicited message, then recover at least one first virtual machine described according to preset virtual machine recovery policy.
6. fault handling method according to any one of claim 1 to 5 is it is characterised in that described fault handling method
Also include:
Described virtual architecture management system sends state alarm clearance message to described business management system.
7. a kind of virtual architecture management system, for carrying out troubleshooting, described virtualization meter in virtualization computer system
Calculation machine system includes:Virtual architecture management system, business management system and at least one virtual machine, described at least one is virtual
Machine operates at least one physical equipment, and at least one virtual machine described is used for executing service application, described operational control system
Unite for managing described service application, described virtual architecture management system be used for managing at least one virtual machine described and described extremely
A few physical equipment is it is characterised in that described virtual architecture management system includes:
Acquisition module, for obtaining fault warning message, described fault warning message carries identification information and the event of faulty equipment
Barrier type;
Determining module, for determining the first virtual machine set, described first virtual machine set bag according to described fault warning message
Include at least one first virtual machine being affected by described faulty equipment;
Sending module, for sending state alarm information to described business management system, described state alarm information carries described
The information of the first virtual machine set.
8. virtual architecture management system according to claim 7 is it is characterised in that described determining module is additionally operable to according to institute
State the impact information that fault warning message determines described first virtual machine set, described impact information is used for indicating that described fault sets
The standby type to the impact that at least one first virtual machine described produces and/or rank;
Then also to carry described first to the described state alarm information that described business management system sends virtual for described sending module
The impact information of machine set.
9. virtual architecture management system according to claim 8 is it is characterised in that described faulty equipment is to described at least one
The type of the impact that individual first virtual machine produces includes following at least one:Fault, excessive risk, risk, low-risk or no shadow
Ring.
10. the virtual architecture management system according to any one of claim 7 to 9 is it is characterised in that described virtual architecture
Management system also includes receiver module and recovery module;
Described receiver module, for receiving the first request message that described business management system sends, described first request message
For indicating virtual machine to be restored, described virtual machine to be restored is a subset in described first virtual machine set;
Described recovery module, for preferentially recovering described virtual machine to be restored according to described first solicited message.
11. virtual architecture management systems according to any one of claim 7 to 9 are it is characterised in that described recovery module
When being additionally operable to described first solicited message not receiving described business management system transmission in preset time threshold value, according to pre-
Put virtual machine recovery policy and recover at least one first virtual machine described.
12. virtual architecture management systems according to any one of claim 7 to 11 are it is characterised in that described transmission mould
Block is additionally operable to send state alarm clearance message to described business management system.
A kind of 13. fault handling methods, for carrying out troubleshooting, described Virtualization Computer in virtualization computer system
System includes:Virtual architecture management system, business management system and at least one virtual machine, described at least one virtual machine fortune
On at least physical equipment, at least one virtual machine described is used for executing service application row, and described business management system is used
In managing described service application, described virtual architecture management system is used for managing described at least one virtual machine and described at least
Platform physical equipment;It is characterized in that, described fault handling method includes:
Described business management system receives the state alarm information that described virtual architecture management system sends, and described state alarm disappears
Breath carries the information of the first virtual machine set being affected by faulty equipment, described first virtual machine set include at least one the
One virtual machine;
The business of described business management system at least one first virtual machine association according to described state alarm information determines
Application;
Described business management system processes operation to the service application execution of at least one the first virtual machine association described.
14. fault handling methods according to claim 13 it is characterised in that described state alarm information also carry described
The impact information of the first virtual machine set, described impact information is used for indicating described faulty equipment at least one first void described
The type of impact and/or rank that plan machine produces;
Then described business management system processes operation, bag to the service application execution of at least one the first virtual machine association described
Include:
Described business management system is according to the impact information of described first virtual machine set at least one first virtual machine described
The service application execution of association processes operation.
15. fault handling methods according to claim 14 it is characterised in that described first virtual machine set produce shadow
The type rung includes following at least one:Fault, excessive risk, risk, low-risk or no affect.
16. according to described fault handling method arbitrary in claim 13 to 15 it is characterised in that described business management system pair
The service application execution of at least one the first virtual machine association described processes operation and includes at least one in the following manner:
The service application of at least one the first virtual machine association described is switched to and is not subject to described fault by described business management system
The virtual machine execution of equipment impact;Or
The application state information of at least one the first virtual machine described is designated isolation by described business management system, described
Isolation is used for indicating the industry that at least one first virtual machine described stops execution at least one the first virtual machine association described
Business application;Or
Described business management system sends the first request message to described virtual architecture management system, and described first request message is used
In indicating virtual machine to be restored, described virtual machine to be restored is a subset in described first virtual machine set;Or
The control node of the service application at least one the first virtual machine association described for the described business management system sends described
State alarm information so that described control node according to described state alarm information by least one first virtual office described
The service application of connection switch to described in the virtual machine execution that affected by described faulty equipment or will at least one first void described
The application state information of plan machine is designated described isolation.
17. fault handling methods according to claim 16 are it is characterised in that described business management system is to described virtual
Architecture management system sends the first request message and includes:
Described business management system determines the first request message according to the impact information of described first virtual machine set;
Described business management system sends described first request message to described virtual architecture management system.
18. fault handling methods according to claim 17 are it is characterised in that described business management system is to described virtual
Architecture management system sends described first request message, including:
The priority of the service application according at least one the first virtual machine association described for the described business management system is to described void
Intend architecture management system and send described first request message.
19. fault handling methods according to claim 17 or 18 are it is characterised in that described business management system is to described
Virtual architecture management system sends described first request message, including:
The deployment mode of the service application according at least one first virtual machine association described for the described business management system is to described
Virtual architecture management system sends described first request message, the portion of the service application of at least one the first virtual machine association described
Administration's pattern includes at least one in active-standby mode, load sharing pattern and single virtual machine mode.
20. fault handling methods according to any one of claim 13 to 19 are it is characterised in that described troubleshooting side
Method also includes:
Described business management system receives the state alarm clearance message that described virtual architecture management system sends;
Described business management system state alarm information according to described state alarm clearance message erasing.
A kind of 21. business management systems, for carrying out troubleshooting, described Virtualization Computer in virtualization computer system
System includes:Virtual architecture management system, business management system and at least one virtual machine, described at least one virtual machine fortune
On at least physical equipment, at least one virtual machine described is used for executing service application row, and described business management system is used
In managing described service application, described virtual architecture management system is used for managing described at least one virtual machine and described at least
Platform physical equipment is it is characterised in that described business management system includes:
Receiver module, for receiving the state alarm information that described virtual architecture management system sends, described state alarm information
Carry the information of the first virtual machine set being affected by faulty equipment, described first virtual machine set include at least one first
Virtual machine;
Determining module, the business at least one first virtual machine association according to the determination of described state alarm information should
With;
Processing module, for the service application execution process operation at least one the first virtual machine association described.
22. business management systems according to claim 21 it is characterised in that described state alarm information also carry described
The impact information of the first virtual machine set, described impact information is used for indicating described faulty equipment at least one first void described
The type of impact and/or rank that plan machine produces;
Then described processing module processes operation to the service application execution of at least one the first virtual machine association described, including basis
The impact information of described first virtual machine set processes behaviour to the service application execution of at least one the first virtual machine association described
Make.
23. business management systems according to claim 22 it is characterised in that described first virtual machine set produce shadow
The type rung includes following at least one:Fault, excessive risk, risk, low-risk or no affect.
24. according to described business management system arbitrary in claim 21 to 23 it is characterised in that described processing module is to described
The service application execution of at least one the first virtual machine association processes operation and includes at least one in the following manner:
The service application of at least one the first virtual machine association described is switched to the virtual machine being affected by described faulty equipment
Execution;Or
The application state information of at least one the first virtual machine described is designated isolation, described isolation is used for indicating
At least one first virtual machine described stops the service application of execution at least one the first virtual machine association described;Or
Send the first request message to described virtual architecture management system, described first request message is used for indicating void to be restored
Plan machine, described virtual machine to be restored is a subset in described first virtual machine set;Or
Control node to the service application of at least one the first virtual machine association described sends described state alarm information, so that
Obtain described control node to be switched the service application of at least one first virtual machine association described according to described state alarm information
Execute to the described virtual machine being affected by described faulty equipment or the application state of at least one the first virtual machine described is believed
Breath is designated described isolation.
25. according to the arbitrary described business management system of claim 21 to 24 it is characterised in that described determining module is additionally operable to
First request message is determined according to the impact information of described first virtual machine set;
Wherein, described business management system also includes sending module, for sending described the to described virtual architecture management system
One request message.
26. business management systems according to claim 25 are it is characterised in that described sending module is additionally operable to according to described
The priority of the service application of at least one the first virtual machine association sends described first to described virtual architecture management system please
Seek message.
27. business management systems according to claim 25 or 26 are it is characterised in that described sending module is additionally operable to basis
The deployment mode of the service application of at least one the first virtual machine association described sends described to described virtual architecture management system
First request message, the deployment mode of the service application of at least one the first virtual machine association described includes active-standby mode, load
At least one in apportionment mode and single virtual machine mode.
28. business management systems according to any one of claim 21 to 27 it is characterised in that
Described receiver module is additionally operable to receive the state alarm clearance message that described virtual architecture management system sends;
Described processing module is additionally operable to state alarm information according to described state alarm clearance message erasing.
A kind of 29. virtual architecture management systems it is characterised in that described virtual architecture management system include processor, memorizer,
Communication interface and bus.Wherein, processor, memorizer, communication interface are communicated by bus;Described memorizer is used for storing
Instruction, when described virtual architecture management system is run, described in described computing device, the instruction of memory storage is with using described
Hardware resource perform claim in virtual architecture management system requires arbitrary methods described in 1 to 6.
A kind of 30. business management systems are it is characterised in that described business management system includes processor, memorizer, communication interface
And bus.Wherein, processor, memorizer, communication interface are communicated by bus;Described memorizer is used for store instruction, institute
When stating business management system operation, described in described computing device, the instruction of memory storage is with using described business management system
In hardware resource perform claim require 13 to 20 in arbitrary methods described.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610841392.3A CN106452846A (en) | 2016-09-22 | 2016-09-22 | Fault processing method, virtual architecture management system and business management system |
PCT/CN2017/085356 WO2018054081A1 (en) | 2016-09-22 | 2017-05-22 | Fault processing method, virtual infrastructure management system and service management system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610841392.3A CN106452846A (en) | 2016-09-22 | 2016-09-22 | Fault processing method, virtual architecture management system and business management system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106452846A true CN106452846A (en) | 2017-02-22 |
Family
ID=58166295
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610841392.3A Pending CN106452846A (en) | 2016-09-22 | 2016-09-22 | Fault processing method, virtual architecture management system and business management system |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN106452846A (en) |
WO (1) | WO2018054081A1 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018054081A1 (en) * | 2016-09-22 | 2018-03-29 | 华为技术有限公司 | Fault processing method, virtual infrastructure management system and service management system |
CN108874411A (en) * | 2017-05-12 | 2018-11-23 | 华为技术有限公司 | A kind of method and related system of system infrastructure upgrading |
CN109039740A (en) * | 2018-08-01 | 2018-12-18 | 平安科技(深圳)有限公司 | A kind of method and apparatus handling O&M monitoring alarm |
CN110083584A (en) * | 2019-05-07 | 2019-08-02 | 深信服科技股份有限公司 | File reconstruction method, apparatus, equipment and computer readable storage medium |
CN110120146A (en) * | 2019-04-25 | 2019-08-13 | 新浪网技术(中国)有限公司 | It is a kind of based on platform system in the alarm method of platform system in alarm and alarm |
CN110888754A (en) * | 2019-11-14 | 2020-03-17 | 北京金山云网络技术有限公司 | Message obtaining method and device |
CN111771190A (en) * | 2018-04-12 | 2020-10-13 | 华为技术有限公司 | Method and equipment for detecting state of virtual machine |
CN114048004A (en) * | 2021-11-22 | 2022-02-15 | 北京志凌海纳科技有限公司 | High-availability batch scheduling method, device, equipment and storage medium for virtual machines |
CN115858222A (en) * | 2022-12-19 | 2023-03-28 | 安超云软件有限公司 | Virtual machine fault processing method and system and electronic equipment |
CN116401009A (en) * | 2023-03-28 | 2023-07-07 | 北京益安在线科技股份有限公司 | Intelligent management system based on kvm virtualization |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109600250B (en) * | 2018-09-29 | 2023-07-18 | 中国平安人寿保险股份有限公司 | Service system fault notification method, device, electronic device and storage medium |
CN116458118A (en) * | 2020-10-01 | 2023-07-18 | 上海诺基亚贝尔股份有限公司 | Method, apparatus and computer program |
CN112866009B (en) * | 2021-01-04 | 2023-03-10 | 国网山东省电力公司青岛供电公司 | Virtual network fault diagnosis method and device for comprehensive service station |
CN113315653B (en) * | 2021-04-30 | 2022-07-12 | 新华三大数据技术有限公司 | Network equipment management method and device, network equipment and computer equipment |
CN115086143A (en) * | 2022-04-28 | 2022-09-20 | 阿里巴巴(中国)有限公司 | Fault early warning method and device |
CN116643906A (en) * | 2023-06-01 | 2023-08-25 | 北京首都在线科技股份有限公司 | Cloud platform fault processing method and device, electronic equipment and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102984214A (en) * | 2012-11-08 | 2013-03-20 | 华为技术有限公司 | Method and device which are capable of achieving telecom cloud business migration |
US20130151885A1 (en) * | 2010-08-18 | 2013-06-13 | Fujitsu Limited | Computer management apparatus, computer management system and computer system |
US20150378831A1 (en) * | 2014-06-28 | 2015-12-31 | Vmware, Inc. | Using a recovery snapshot during live migration |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10037237B2 (en) * | 2013-03-28 | 2018-07-31 | Telefonaktiebolaget L M Ericsson (Publ) | Method and arrangement for fault management in infrastructure as a service clouds |
CN103559124B (en) * | 2013-10-24 | 2017-04-12 | 华为技术有限公司 | Fast fault detection method and device |
US9727439B2 (en) * | 2014-05-28 | 2017-08-08 | Vmware, Inc. | Tracking application deployment errors via cloud logs |
CN106452846A (en) * | 2016-09-22 | 2017-02-22 | 华为技术有限公司 | Fault processing method, virtual architecture management system and business management system |
-
2016
- 2016-09-22 CN CN201610841392.3A patent/CN106452846A/en active Pending
-
2017
- 2017-05-22 WO PCT/CN2017/085356 patent/WO2018054081A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130151885A1 (en) * | 2010-08-18 | 2013-06-13 | Fujitsu Limited | Computer management apparatus, computer management system and computer system |
CN102984214A (en) * | 2012-11-08 | 2013-03-20 | 华为技术有限公司 | Method and device which are capable of achieving telecom cloud business migration |
US20150378831A1 (en) * | 2014-06-28 | 2015-12-31 | Vmware, Inc. | Using a recovery snapshot during live migration |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018054081A1 (en) * | 2016-09-22 | 2018-03-29 | 华为技术有限公司 | Fault processing method, virtual infrastructure management system and service management system |
CN108874411A (en) * | 2017-05-12 | 2018-11-23 | 华为技术有限公司 | A kind of method and related system of system infrastructure upgrading |
US11777830B2 (en) | 2018-04-12 | 2023-10-03 | Huawei Cloud Computing Technolgies Co., Ltd. | Virtual machine status probe method, and device |
CN111771190B (en) * | 2018-04-12 | 2024-04-09 | 华为云计算技术有限公司 | Method and equipment for detecting state of virtual machine |
CN111771190A (en) * | 2018-04-12 | 2020-10-13 | 华为技术有限公司 | Method and equipment for detecting state of virtual machine |
CN109039740A (en) * | 2018-08-01 | 2018-12-18 | 平安科技(深圳)有限公司 | A kind of method and apparatus handling O&M monitoring alarm |
CN110120146A (en) * | 2019-04-25 | 2019-08-13 | 新浪网技术(中国)有限公司 | It is a kind of based on platform system in the alarm method of platform system in alarm and alarm |
CN110083584A (en) * | 2019-05-07 | 2019-08-02 | 深信服科技股份有限公司 | File reconstruction method, apparatus, equipment and computer readable storage medium |
CN110888754A (en) * | 2019-11-14 | 2020-03-17 | 北京金山云网络技术有限公司 | Message obtaining method and device |
CN114048004A (en) * | 2021-11-22 | 2022-02-15 | 北京志凌海纳科技有限公司 | High-availability batch scheduling method, device, equipment and storage medium for virtual machines |
CN115858222A (en) * | 2022-12-19 | 2023-03-28 | 安超云软件有限公司 | Virtual machine fault processing method and system and electronic equipment |
CN115858222B (en) * | 2022-12-19 | 2024-01-02 | 安超云软件有限公司 | Virtual machine fault processing method, system and electronic equipment |
CN116401009A (en) * | 2023-03-28 | 2023-07-07 | 北京益安在线科技股份有限公司 | Intelligent management system based on kvm virtualization |
Also Published As
Publication number | Publication date |
---|---|
WO2018054081A1 (en) | 2018-03-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106452846A (en) | Fault processing method, virtual architecture management system and business management system | |
US10644952B2 (en) | VNF failover method and apparatus | |
WO2017161984A1 (en) | Method, device and system for deploying data clusters, and computer storage medium | |
CN105095001B (en) | Virtual machine abnormal restoring method under distributed environment | |
US10541862B2 (en) | VNF processing policy determining method, apparatus, and system | |
CN106462450A (en) | Notification about virtual machine live migration to VNF manager | |
CN110661641B (en) | Virtual network function VNF deployment method and device | |
CN110134518A (en) | A kind of method and system improving big data cluster multinode high application availability | |
CN109960634A (en) | A kind of method for monitoring application program, apparatus and system | |
CN107659948B (en) | Method and device for controlling access of AP (access point) | |
CN106790436A (en) | A kind of traffic system monitoring method and control centre's Cloud Server based on cloud framework | |
CN105119736A (en) | Data check method and device in network function virtualization architecture | |
CN112948063A (en) | Cloud platform creation method and device, cloud platform and cloud platform implementation system | |
CN109309617A (en) | Disaster tolerance switching method, relevant device and computer storage medium | |
CN109343944A (en) | Data processing method, device, terminal and the storage medium of eSIM card | |
CN116095145B (en) | Data control method and system of VPC cluster | |
CN112804202A (en) | Multi-internetwork data security interaction method and device, server and storage medium | |
CN114124803B (en) | Device management method and device, electronic device and storage medium | |
CN105187482B (en) | A kind of method and message server of the realization of PaaS platform fault self-recovery | |
CN110795202B (en) | Resource allocation method and device of virtualized cluster resource management system | |
CN104657240B (en) | The Failure Control method and device of more kernel operating systems | |
CN116126457A (en) | Container migration method and server cluster | |
CN108737144B (en) | Method and device for resource management | |
CN115421891A (en) | Task distribution method, device, equipment and medium | |
CN112787840B (en) | Method, device and system for realizing service function deployment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170222 |