CN106452846A - Fault processing method, virtual architecture management system and business management system - Google Patents

Publication number
CN106452846A
Authority
CN
China
Prior art keywords
virtual machine
management system
virtual
business management
service application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610841392.3A
Other languages
Chinese (zh)
Inventor
李候青
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to CN201610841392.3A
Publication of CN106452846A
Priority to PCT/CN2017/085356 (WO2018054081A1)

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a fault processing method, a virtual architecture management system, a business management system and a virtual computer system. The fault processing method comprises the following steps: the virtual architecture management system acquires a fault alarm message, wherein the fault alarm message carries identification information and a fault type of a faulty device; the virtual architecture management system determines a first virtual machine set according to the fault alarm message, wherein the first virtual machine set comprises at least one first virtual machine that is affected by the faulty device; and the virtual architecture management system sends a state alarm message to the business management system, wherein the state alarm message carries information about the first virtual machine set. The technical scheme of the invention can quickly notify the affected services of the impact of hardware faults, thereby improving the reliability of the services.

Description

Fault handling method, virtual architecture management system and business management system
Technical field
The present invention relates to the field of cloud computing, and more particularly to a fault handling method, a virtual architecture management system, a business management system, and a virtualized computer system.
Background
In the field of cloud computing, business systems across industries are being deployed in virtualized or cloud-based form. Today, the services of a business system run on virtual machines, and the virtual machines are deployed on hardware devices that form a shared resource pool. In other words, services are no longer deployed on traditional dedicated hardware or physical servers, which decouples software from hardware and improves resource utilization.
At present, after a device (such as a physical host or a storage device) fails, the fault is reported to the virtual architecture management system in a fault alarm message, and the virtual architecture management system forwards the fault alarm message to the business management system. The business management system then determines the affected virtual machines and service applications from the fault alarm message and performs fault handling operations on the affected service applications. The business management system therefore has to be aware of the hardware, and of which service applications correspond to a hardware fault, before it can perform fault handling on a service application. As a result, the service applications affected by the faulty device cannot be notified quickly, which reduces the reliability of the service applications.
Summary of the invention
The present invention provides a fault handling method, a virtual architecture management system, a business management system, and a virtualized computer system that can quickly notify the affected services of the impact of a hardware fault on virtual machines, thereby improving service reliability.
In a first aspect, the present invention provides a fault handling method. The method is used for fault handling in a virtualized computer system that includes a virtual architecture management system, a business management system, and at least one virtual machine. The at least one virtual machine runs on at least one physical device and is used to execute service applications; the business management system is used to manage the service applications; and the virtual architecture management system is used to manage the at least one virtual machine and the at least one physical device. The method includes: the virtual architecture management system obtains a fault alarm message, where the fault alarm message carries identification information and a fault type of a faulty device; the virtual architecture management system determines a first virtual machine set according to the fault alarm message, where the first virtual machine set includes at least one first virtual machine affected by the faulty device; and the virtual architecture management system sends a state alarm message to the business management system, where the state alarm message carries information about the first virtual machine set.
In this method, after obtaining the fault alarm message of the faulty device, the virtual architecture management system analyzes the message directly, determines the one or more virtual machines affected by the faulty device, and sends information about those virtual machines to the business management system. The business management system can then derive the affected service applications directly from that information and handle them. Compared with the prior art, the virtual architecture management system determines the affected virtual machines from the fault alarm message of the faulty device, so the business management system can derive the affected service applications from the state alarm message of the first virtual machine set, instead of first deriving the affected virtual machines from the device alarm and only then the affected service applications. The business management system therefore does not need to be directly aware of the hardware fault, can trigger impact handling for the service applications sooner, reduces service loss, and improves the reliability of the service applications.
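The three steps of the first aspect can be sketched in Python as follows. This is a minimal illustration, not the claimed implementation: the class names, the `PLACEMENT` inventory that maps a device to the virtual machines using it, and the fault type string are all assumptions introduced for the example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class FaultAlarm:
    device_id: str   # identification information of the faulty device
    fault_type: str  # fault type; the values used here are illustrative

@dataclass(frozen=True)
class StateAlarm:
    vm_ids: tuple    # the first virtual machine set
    fault_type: str

# Hypothetical inventory: device ID -> virtual machines running on or using it.
PLACEMENT = {
    "host-1": ["vm-a", "vm-b"],
    "storage-7": ["vm-b", "vm-c"],
}

def determine_affected_vms(alarm, placement):
    """Step 2: derive the first virtual machine set from the fault alarm."""
    return tuple(sorted(placement.get(alarm.device_id, [])))

def build_state_alarm(alarm, placement):
    """Step 3: build the VM-level state alarm sent to the business management system."""
    vms = determine_affected_vms(alarm, placement)
    if not vms:
        return None  # no virtual machine is affected, so nothing is reported
    return StateAlarm(vm_ids=vms, fault_type=alarm.fault_type)

# Step 1: a fault alarm arrives for a storage device.
state_alarm = build_state_alarm(FaultAlarm("storage-7", "disk_failure"), PLACEMENT)
```

The point of the sketch is that the device-to-VM lookup happens on the virtual architecture management side, so the receiver only ever sees virtual machine identifiers.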
In a possible implementation, the method further includes: the virtual architecture management system determines impact information of the first virtual machine set according to the fault alarm message of the faulty device, where the impact information indicates the type and/or level of the impact that the faulty device has on the first virtual machines in the first virtual machine set. Correspondingly, the state alarm message may also carry the impact information of the first virtual machine set.
In this method, besides determining the at least one affected virtual machine from the fault alarm message of the faulty device, the virtual architecture management system can also determine the type and/or level of the impact that the fault has on those virtual machines. The state alarm message sent to the business management system then also carries impact information indicating the type and/or level of the impact on the first virtual machines in the first virtual machine set, so that the business management system or the business system can better handle the service applications according to the impact information, which further improves the reliability of the service applications.
Optionally, the state alarm message may further include identification information of the first virtual machines in the first virtual machine set, alarm identifier information, alarm name information, alarm object type information, alarm type information, alarm raise time information, alarm component type information, alarm component identification information, and alarm component name information.
Optionally, the state alarm message may include the fault type information of the faulty device.
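The optional fields listed above could be carried in a record such as the one below. This is only a sketch: the field names, the string types, and every example value are assumptions, since the text names the kinds of information but not a concrete encoding.

```python
from dataclasses import dataclass, asdict
from typing import Optional

@dataclass(frozen=True)
class StateAlarmMessage:
    vm_id: str              # identification information of the first virtual machine
    alarm_id: str           # alarm identifier information
    alarm_name: str         # alarm name information
    alarm_object_type: str  # alarm object type information
    alarm_type: str         # alarm type information
    raised_at: str          # alarm raise time information
    component_type: str     # alarm component type information
    component_id: str       # alarm component identification information
    component_name: str     # alarm component name information
    fault_type: Optional[str] = None  # optional fault type of the faulty device

# All values below are made up for illustration.
example = StateAlarmMessage(
    vm_id="vm-b",
    alarm_id="alarm-0001",
    alarm_name="VM storage degraded",
    alarm_object_type="virtual_machine",
    alarm_type="state",
    raised_at="2016-09-22T08:00:00Z",
    component_type="storage_device",
    component_id="storage-7",
    component_name="shared-array-7",
    fault_type="disk_failure",
)
payload = asdict(example)  # plain dict, ready to serialize for transport
```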
In a possible implementation, the type of impact that the faulty device has on the first virtual machines in the first virtual machine set includes one or more of the following types: fault, high risk, risk, low risk, or no impact.
Optionally, the level of the impact that the faulty device has on the first virtual machines in the first virtual machine set includes urgent, important, or unimportant.
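The impact types and levels named above map naturally onto two enumerations. In the sketch below the enumeration members come from the text, while the `IMPACT_RULES` table, the fault type strings, and the conservative default for unmapped fault types are assumptions made for illustration.

```python
from enum import Enum

class ImpactType(Enum):
    FAULT = "fault"
    HIGH_RISK = "high risk"
    RISK = "risk"
    LOW_RISK = "low risk"
    NO_IMPACT = "no impact"

class ImpactLevel(Enum):
    URGENT = "urgent"
    IMPORTANT = "important"
    UNIMPORTANT = "unimportant"

# Hypothetical rule table: fault type of the device -> impact on its virtual machines.
IMPACT_RULES = {
    "host_down": (ImpactType.FAULT, ImpactLevel.URGENT),
    "fan_failure": (ImpactType.RISK, ImpactLevel.IMPORTANT),
    "redundant_psu_lost": (ImpactType.LOW_RISK, ImpactLevel.UNIMPORTANT),
}

def impact_for(fault_type):
    """Return (type, level); unmapped fault types get a conservative default (an assumption)."""
    return IMPACT_RULES.get(fault_type, (ImpactType.RISK, ImpactLevel.IMPORTANT))
```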
In a possible implementation, the method further includes: the virtual architecture management system receives a first request message sent by the business management system, where the first request message indicates the virtual machines to be recovered, and the virtual machines to be recovered are a subset of the first virtual machine set; and the virtual architecture management system preferentially recovers the virtual machines to be recovered according to the first request message.
In this method, the virtual architecture management system can, at the request of the business management system and according to the priority indicated by the business management system, recover at least one virtual machine in the first virtual machine set affected by the fault of the faulty device.
Optionally, the recovery that the virtual architecture management system performs on a virtual machine may include live migration of the virtual machine.
In a possible implementation, the method further includes: if the virtual architecture management system does not receive the first request message from the business management system within a preset time threshold, it recovers the first virtual machines in the first virtual machine set according to a preset virtual machine recovery policy.
This ensures that, when the business management system provides no information indicating how the virtual architecture management system should recover the virtual machines in the first virtual machine set, the virtual architecture management system can proactively recover the first virtual machines according to the preconfigured recovery policy.
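The wait-then-fall-back behavior can be sketched with a blocking queue read that has a timeout. The function name, the use of `queue.Queue` as the message channel, and the choice to recover the remaining virtual machines after the requested ones are assumptions; the text only states that requested virtual machines are recovered preferentially and that a preset policy applies when no request arrives in time.

```python
import queue

def choose_recovery_order(requests, affected_vms, timeout_s, default_policy):
    """Wait up to timeout_s for a first request message, else use the preset policy."""
    try:
        requested = requests.get(timeout=timeout_s)
    except queue.Empty:
        # No first request message within the preset time threshold.
        return default_policy(affected_vms)
    affected = set(affected_vms)
    # Only virtual machines from the first virtual machine set are honored.
    first = [vm for vm in requested if vm in affected]
    rest = [vm for vm in affected_vms if vm not in first]
    return first + rest

# Case 1: the business management system asked for vm-c first.
inbox = queue.Queue()
inbox.put(["vm-c"])
with_request = choose_recovery_order(inbox, ["vm-a", "vm-b", "vm-c"], 0.01, sorted)

# Case 2: nothing arrives, so the preset policy (here: alphabetical) decides.
without_request = choose_recovery_order(queue.Queue(), ["vm-b", "vm-a"], 0.01, sorted)
```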
In a possible implementation, the method further includes: the virtual architecture management system sends a state alarm clear message to the business management system.
In this method, after recovering a virtual machine, the virtual architecture management system sends a state alarm clear message to the business management system, so that the business management system can clear the related state alarm messages it received earlier. This prevents the business management system from analyzing and processing state alarm messages related to virtual machines that have already been recovered.
In a second aspect, the present invention provides a virtual architecture management system that includes modules for performing the fault handling method in the first aspect or in any possible implementation of the first aspect.
After obtaining the fault alarm message of the faulty device, the virtual architecture management system provided by the present invention analyzes the message directly, determines the one or more virtual machines affected by the faulty device, and sends information about those virtual machines to the business management system. The business management system can then derive the affected service applications directly from that information and handle them. Compared with the prior art, the virtual architecture management system determines the affected virtual machines from the fault alarm message of the faulty device, so the business management system can derive the affected service applications from the state alarm message of the first virtual machine set, instead of first deriving the affected virtual machines from the device alarm and only then the affected service applications. The business management system therefore does not need to be directly aware of the hardware fault, can trigger impact handling for the service applications sooner, reduces service loss, and improves the reliability of the service applications.
In a third aspect, the present invention provides a virtual architecture management system that includes a processor, a memory, a communication interface, and a bus. The processor, the memory, and the communication interface communicate through the bus, or may communicate by other means such as wireless transmission. The memory is used to store instructions, and the processor is used to execute the instructions stored in the memory. The memory stores program code, and the processor can invoke the program code stored in the memory to perform the fault handling method in the first aspect or in any possible implementation of the first aspect.
In a fourth aspect, the present invention provides a computer-readable medium that stores program code to be executed by the virtual architecture management system, where the program code includes instructions for performing the fault handling method in the first aspect or in any possible implementation of the first aspect.
In a fifth aspect, the present invention further provides a fault handling method. The method is used for fault handling in a virtualized computer system that includes a virtual architecture management system, a business management system, and at least one virtual machine. The at least one virtual machine runs on at least one physical device and is used to execute service applications; the business management system is used to manage the service applications; and the virtual architecture management system is used to manage the at least one virtual machine and the at least one physical device. The method includes: the business management system receives a state alarm message sent by the virtual architecture management system, where the state alarm message carries information about a first virtual machine set affected by a faulty device, and the first virtual machine set includes at least one first virtual machine; the business management system determines, according to the state alarm message, the service application associated with the at least one first virtual machine; and the business management system performs a processing operation on the associated service application.
In this method, after receiving from the virtual architecture management system the information about the virtual machines in the first virtual machine set affected by the faulty device, the business management system can derive the affected service applications directly from that information and handle them. Compared with the prior art, the business management system can derive the affected service applications from the state alarm message of the first virtual machine set, instead of first deriving the affected virtual machines from the alarm message of the faulty device and only then the affected service applications. The business management system therefore does not need to be directly aware of the hardware fault, can trigger impact handling for the service applications sooner, reduces service loss, and improves the reliability of the service applications.
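On the business management side, determining the associated service applications amounts to a lookup from virtual machine to application. The sketch below assumes a hypothetical `VM_TO_APPS` table maintained by the business management system; the application names are invented.

```python
# Hypothetical deployment table: virtual machine -> service applications it executes.
VM_TO_APPS = {
    "vm-a": ["billing"],
    "vm-b": ["billing", "portal"],
    "vm-c": ["portal"],
}

def affected_applications(vm_ids, vm_to_apps):
    """Group the virtual machines from a state alarm by the applications they run."""
    apps = {}
    for vm in vm_ids:
        for app in vm_to_apps.get(vm, []):
            apps.setdefault(app, []).append(vm)
    return apps

# The state alarm named vm-b and vm-c as the first virtual machine set.
impacted = affected_applications(["vm-b", "vm-c"], VM_TO_APPS)
```

No hardware identifier appears anywhere in this lookup, which is the decoupling the method aims for.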
In a possible implementation, the state alarm message of the first virtual machine set also carries impact information of the first virtual machine set, where the impact information indicates the type and/or level of the impact that the faulty device has on the at least one first virtual machine in the first virtual machine set. Correspondingly, the business management system performing a processing operation on the service application includes: the business management system performs the processing operation on the service application according to the impact information of the first virtual machine set.
In this method, the state alarm message of the first virtual machine set that the business management system receives from the virtual architecture management system also carries impact information indicating the type and/or level of the impact that the faulty device has on the first virtual machines in the first virtual machine set, so that the business management system or the business system can better handle the service applications according to the impact information, which further improves the reliability of the service applications.
Optionally, the state alarm message may further include identification information of the first virtual machines in the first virtual machine set, alarm identifier information, alarm name information, alarm object type information, alarm type information, alarm raise time information, alarm component type information, alarm component identification information, and alarm component name information.
Optionally, the state alarm message may include the fault type information of the faulty device.
In a possible implementation, the type of impact on the first virtual machine set includes one or more of the following types: fault, high risk, risk, low risk, or no impact.
Optionally, the level of the impact that the faulty device has on the first virtual machines in the first virtual machine set includes urgent, important, or unimportant.
In a possible implementation, the processing operation includes at least one of the following:
The business management system switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution; or
The business management system marks the application state information of the at least one first virtual machine as isolated, where the isolated state indicates that the at least one first virtual machine stops executing the service application associated with it; or
The business management system sends a first request message to the virtual architecture management system, where the first request message indicates the virtual machines to be recovered, and the virtual machines to be recovered are a subset of the first virtual machine set; or
The business management system sends the state alarm message to the control node of the service application associated with the at least one first virtual machine, so that the control node, according to the state alarm message, switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution, or marks the application state information of the at least one first virtual machine as isolated.
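A minimal sketch of choosing among these operations for one service application follows. The text presents the operations as alternatives; combining isolation with either a switch or a recovery request, as done here, is one possible policy and an assumption, as are the function name and the returned dictionary shape.

```python
def handle_application(app_vms, affected):
    """Pick a processing operation for one service application.

    app_vms:  virtual machines that execute the application
    affected: virtual machines from the first virtual machine set
    """
    hit = [vm for vm in app_vms if vm in affected]
    healthy = [vm for vm in app_vms if vm not in affected]
    # Affected virtual machines are marked isolated so they stop serving the application.
    states = {vm: "isolated" for vm in hit}
    if not hit:
        return {"action": "none", "target": None, "states": states}
    if healthy:
        # Switch execution to a virtual machine not affected by the faulty device.
        return {"action": "switch", "target": healthy[0], "states": states}
    # No unaffected virtual machine is left: ask for recovery instead.
    return {"action": "request_recovery", "target": None, "states": states}

switch_case = handle_application(["vm-a", "vm-b"], {"vm-b", "vm-c"})
recover_case = handle_application(["vm-b", "vm-c"], {"vm-b", "vm-c"})
```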
In a possible implementation, the method further includes: the business management system determines the first request message according to the impact information of the first virtual machine set.
In this method, the business management system can determine, according to the impact information of the first virtual machine set, the recovery priority of the virtual machines in the first virtual machine set that the virtual architecture management system needs to recover, and send to the virtual architecture management system a first request message indicating that recovery priority. The virtual architecture management system can then, according to the priority indicated by the business management system, recover at least one virtual machine in the first virtual machine set affected by the fault of the faulty device.
In a possible implementation, one specific way for the business management system to send to the virtual architecture management system the first request message indicating the recovery priority of the virtual machines to be recovered is: the business management system sends the first request message to the virtual architecture management system according to the priority of the service applications.
In this method, the business management system instructs the virtual architecture management system to recover the virtual machines to be recovered in the first virtual machine set according to the priority of the service applications associated with the first virtual machines, that is, according to the priority of the service applications affected by the faulty device. This ensures that high-priority service applications are recovered first, which further ensures the reliability of the service applications.
Optionally, the business management system may send the first request message to the virtual architecture management system according to both the impact information of the first virtual machine set and the priority of the associated service applications.
In a possible implementation, one specific way for the business management system to send the first request message to the virtual architecture management system is: the business management system sends the first request message to the virtual architecture management system according to the deployment mode of the service application, where the deployment mode of the service application includes at least one of an active-standby mode, a load-sharing mode, and a single-virtual-machine mode.
In this method, the business management system instructs the virtual architecture management system to recover the virtual machines to be recovered in the first virtual machine set according to the deployment mode of the service application, that is, according to the deployment mode of the service applications affected by the faulty device.
Optionally, the business management system may send the first request message to the virtual architecture management system according to the impact information of the first virtual machine set and the deployment mode of the service application; or according to the deployment mode and the priority of the service application; or according to the impact information of the first virtual machine set, the deployment mode of the service application, and the priority of the service application.
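Combining impact information, application priority, and deployment mode into one recovery order can be sketched as a sort with a composite key. The rank tables, the rule that a lower number means higher priority, and the idea that a single-virtual-machine application is recovered before redundant deployments are all assumptions; the text does not say how the three inputs are weighed.

```python
from dataclasses import dataclass

# Hypothetical rankings; a lower rank is recovered earlier.
LEVEL_RANK = {"urgent": 0, "important": 1, "unimportant": 2}
MODE_RANK = {"single_vm": 0, "active_standby": 1, "load_sharing": 2}

@dataclass(frozen=True)
class AffectedVm:
    vm_id: str
    impact_level: str     # from the impact information in the state alarm
    app_priority: int     # priority of the associated application, 1 = highest (assumption)
    deployment_mode: str  # deployment mode of the associated application

def build_first_request(vms):
    """Return the virtual machine IDs in the order they should be recovered."""
    ordered = sorted(
        vms,
        key=lambda v: (LEVEL_RANK[v.impact_level], v.app_priority, MODE_RANK[v.deployment_mode]),
    )
    return [v.vm_id for v in ordered]

request = build_first_request([
    AffectedVm("vm-a", "important", 1, "active_standby"),
    AffectedVm("vm-b", "urgent", 2, "load_sharing"),
    AffectedVm("vm-c", "urgent", 2, "single_vm"),
])
```

Reordering the tuple in the sort key changes which input dominates, so the same structure covers the optional combinations described above.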
In a possible implementation, the method further includes: the business management system receives a state alarm clear message sent by the virtual architecture management system; and the business management system clears, according to the state alarm clear message, the related state alarm messages it received earlier.
In this method, the business management system can clear the related state alarm messages it received earlier according to the state alarm clear message sent by the virtual architecture management system, which avoids analyzing and processing state alarm messages related to virtual machines that have already been recovered.
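Clearing can be sketched as removing a stored alarm by its identifier. Matching the clear message to an earlier state alarm through an `alarm_id` key is an assumption; the text does not specify how the two messages are correlated.

```python
def apply_clear(active_alarms, clear_message):
    """Remove the state alarm named by the clear message; return True if one was removed."""
    return active_alarms.pop(clear_message["alarm_id"], None) is not None

# State alarms received earlier, keyed by a hypothetical alarm identifier.
active = {
    "alarm-0001": {"vm_id": "vm-b"},
    "alarm-0002": {"vm_id": "vm-c"},
}
removed = apply_clear(active, {"alarm_id": "alarm-0001"})
```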
In a sixth aspect, the present invention provides a business management system that includes modules for performing the fault handling method in the fifth aspect or in any possible implementation of the fifth aspect.
In a seventh aspect, the present invention provides a business management system that includes a processor, a memory, a communication interface, and a bus. The processor, the memory, and the communication interface communicate through the bus, or may communicate by other means such as wireless transmission. The memory is used to store instructions, and the processor is used to execute the instructions stored in the memory. The memory stores program code, and the processor can invoke the program code stored in the memory to perform the fault handling method in the fifth aspect or in any possible implementation of the fifth aspect.
In an eighth aspect, the present invention provides a computer-readable medium that stores program code to be executed by the business management system, where the program code includes instructions for performing the fault handling method in the fifth aspect or in any possible implementation of the fifth aspect.
In a ninth aspect, the present invention provides a virtualized computer system that includes a virtualization management node and a business management node. The virtualization management node is used to perform the fault handling method in the first aspect or in any possible implementation of the first aspect, and the business management node is used to perform the fault handling method in the fifth aspect or in any possible implementation of the fifth aspect.
Brief description of the drawings
To describe the technical solutions of the embodiments of the present invention more clearly, the accompanying drawings required in the embodiments are briefly described below. Evidently, the drawings described below are only drawings of some embodiments of the present invention.
Figure 1A is an exemplary system architecture diagram to which the fault handling method of an embodiment of the present invention is applied.
Figure 1B is another exemplary system architecture diagram to which the fault handling method of an embodiment of the present invention is applied.
Figure 2 is a schematic flowchart of a fault handling method according to an embodiment of the present invention.
Figure 3 is a schematic flowchart of a fault handling method according to another embodiment of the present invention.
Figure 4 is a schematic flowchart of a fault handling method according to another embodiment of the present invention.
Figure 5 is a schematic diagram of a virtual architecture management system according to an embodiment of the present invention.
Figure 6 is a schematic diagram of a business management system according to an embodiment of the present invention.
Figure 7 is a schematic diagram of a virtual architecture management system according to another embodiment of the present invention.
Figure 8 is a schematic diagram of a business management system according to another embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Evidently, the described embodiments are some rather than all of the embodiments of the present invention.
For ease of understanding, an example of a system architecture that can implement the fault handling method of the embodiments of the present invention is first described as a whole. It should be understood that the embodiments of the present invention are not limited to the virtualized computer systems shown in Figure 1A and Figure 1B. In addition, the apparatuses in Figure 1A and Figure 1B may be hardware, functionally divided software, or a combination of the two.
Hardware resources (Hardware Resources) 110 may include one or more devices. Each device may be a hardware resource such as an x86 server, a storage device, or a network device, and can provide hardware capabilities such as computing, storage, and networking.
The virtualization layer (Virtualization Layer) 120 virtualizes hardware resources such as computing, storage, and networking by means of virtualization technology. The virtualization technology may be Xen, Hyper-V, or KVM; the present invention does not limit this.
Virtual resources (Virtual Resources) 130 are the virtual resources formed by virtualizing hardware resources 110 through virtualization technology, such as virtual computing, virtual networking, and virtual storage.
Hardware resources 110, virtualization layer 120, and virtual resources 130 may also be collectively referred to as the virtualized infrastructure layer (Virtualized Infrastructure Layer), an infrastructure layer that provides virtual resources or a virtualized resource pool to upper-layer services.
One or more service application functions are deployed in the business system 140. Each service application is deployed on one or more virtual machines; that is, these virtual machines are used to execute the service application. The virtual machines are deployed on the devices in hardware resources 110.
Each service application has a corresponding control node, which manages that service application. A control node may also be called an arbitration node. Control nodes may be deployed in the business system. Each control node may manage one corresponding service application, as shown in Figure 1A, or one control node may manage multiple service applications, as shown in Figure 1B. A control node may be a hardware apparatus that manages the corresponding service application, or one of the multiple virtual machines on which the service application runs.
The virtual architecture management (Virtualized Infrastructure Manager) system 150 manages the virtualized infrastructure. It is responsible for unified management, monitoring, resource scheduling, and fault handling of the physical hardware (that is, the devices in hardware resources 110), the virtual resources, and the virtual machines deployed on hardware resources 110. It provides resource support for running the business system and provides open interfaces. The virtual architecture management system 150 may also be regarded as a component of the virtualization layer.
The business management system 160 manages the service applications that run on virtual machines, for example by creating service applications, provisioning service applications, scheduling virtual resources in service applications, and shutting down service applications. The business management system can manage one or more service applications. It calls the interfaces provided by the virtual architecture management system to provide resources for running service applications and to provision and deploy service applications. The business management system 160 interfaces with the virtual architecture management system 150. The business management system may also interface with multiple virtual architecture management systems.
The business management system 160 and the business system 140 may be collectively referred to as the application layer. The business management system 160 and the business system 140 may be logically separate systems, as shown in Figure 1A and Figure 1B, or the functions of both may be implemented by a single system. The following description of the embodiments of the present invention uses the virtualized computer system shown in Figure 1A as an example.
As described above, the operation system runs on virtual machines in virtual resources 130. The operation system does not need to be concerned with the specific hardware devices, nor does it need to know on which hardware device the virtual machine hosting a service application actually runs. Neither the business management system nor the operation system needs to directly perceive the devices or the impact of faulty equipment on service applications.
The present invention therefore proposes a new fault handling method, virtual architecture management system, business management system, and virtualized computer system, so that the business management system does not need to directly perceive devices or the impact of equipment faults on service applications. Instead, it learns the impact of an equipment fault on VMs from the virtual architecture management system, so that it can rapidly identify the affected service applications and those applications can be handled quickly.
The fault handling method of the embodiments of the present invention is described in detail below, taking the virtualized computer system shown in Figure 1A as an example.
Fig. 2 is a schematic flowchart of the fault handling method of an embodiment of the present invention. It should be understood that Fig. 2 shows the steps or operations of the fault handling method, but these steps or operations are only examples; the embodiments of the present invention can also perform other operations or variations of the operations in Fig. 2. In addition, the steps in Fig. 2 may be executed in an order different from that presented in Fig. 2, and it is possible that not all operations in Fig. 2 need to be executed.
S210, the virtual architecture management system obtains a fault warning message, which carries the identification information and the fault type of the faulty equipment.
The faulty equipment can be any one or more devices in hardware resources 110 shown in Figure 1A, and the fault type includes whole-machine failure or partial hardware failure.
For example, if the faulty equipment is an x86 server, the fault type can be a whole-machine failure of the x86 server, or a hardware failure of at least one of the CPU, memory, network interface card, or disk in the x86 server.
In the embodiments of the present invention, the faulty equipment (such as a server or storage device) can quickly detect its own faults, and the virtual architecture management system can then obtain the fault warning message of the faulty equipment in various ways or through various protocols. For example, the faulty equipment can report its fault warning message to the virtual architecture management system through the Simple Network Management Protocol (SNMP), or the virtual architecture management system can poll the faulty equipment for its fault warning message through a Representational State Transfer (REST) interface.
S220, the virtual architecture management system determines a first virtual machine set according to the fault warning message of the faulty equipment. The first virtual machine set includes at least one first virtual machine affected by the faulty equipment.
After obtaining the fault warning message of the faulty equipment, the virtual architecture management system determines, according to this message, the first virtual machine set affected by the faulty equipment. One specific implementation is as follows: according to the identification information and fault type of the faulty equipment, the virtual architecture management system queries its database for information about all or some of the virtual machines that are deployed on the faulty equipment and affected by the fault. For convenience in the following description, each affected virtual machine is referred to as a first virtual machine, and all first virtual machines together form the first virtual machine set.
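The database lookup in S220 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `FaultAlarm` fields, the `DEPLOYMENT_DB` dictionary, and the device names are hypothetical, and the sketch assumes that every virtual machine on the faulty equipment is affected, whereas the text also allows only some of them to be affected depending on the fault type.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FaultAlarm:
    device_id: str   # identification information of the faulty equipment
    fault_type: str  # e.g. "whole_machine" or "partial_hardware" (assumed labels)


# Hypothetical stand-in for the virtual architecture management system's
# database: device id -> virtual machines deployed on that device.
DEPLOYMENT_DB = {
    "compute-node-1": ["VM1"],
    "compute-node-2": ["VM2", "VM3"],
    "compute-node-3": ["VM4"],
}


def first_vm_set(alarm, db):
    """Return the virtual machines deployed on the faulty equipment.

    Simplifying assumption: all VMs on the device count as affected.
    """
    return list(db.get(alarm.device_id, []))
```

For example, `first_vm_set(FaultAlarm("compute-node-2", "whole_machine"), DEPLOYMENT_DB)` yields the two virtual machines recorded for that device.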
S230, the virtual architecture management system sends state alarm information to the business management system. The state alarm information carries the information of the first virtual machine set.
When the virtual architecture management system generates separate state alarm information for each of multiple first virtual machines, it can send all of this state alarm information to the business management system at once or in several batches.
Of course, the virtual architecture management system can also generate a single piece of state alarm information for all affected virtual machines, that is, for all first virtual machines in the first virtual machine set. The present invention is not limited in this regard.
After receiving the state alarm information of the first virtual machine set sent by the virtual architecture management system, the business management system can store it, for example by recording or saving it in the database of the business management system.
S240, the business management system determines, according to the state alarm information of the first virtual machine set, the service application associated with at least one first virtual machine in the first virtual machine set.
After receiving the state alarm information of the first virtual machine set sent by the virtual architecture management system, the business management system associates this state alarm information with service applications and identifies the specific service applications affected. One specific implementation is as follows: according to the information of the affected first virtual machines carried in the state alarm information of the first virtual machine set, the business management system queries its database or configuration file for the correspondence between first virtual machines and service applications, and thereby identifies the specific service applications affected.
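The association in S240 amounts to a lookup from virtual machine to service application. The sketch below assumes a simple in-memory mapping; the `VM_TO_APP` table and the function name are hypothetical placeholders for whatever database or configuration file the business management system actually uses.

```python
# Hypothetical VM-to-application mapping, as might be kept in the business
# management system's database or configuration file.
VM_TO_APP = {"VM1": "App1", "VM2": "App1", "VM3": "App2", "VM4": "App2"}


def affected_applications(first_vm_set, vm_to_app):
    """Group the affected virtual machines by the service application they run."""
    affected = {}
    for vm in first_vm_set:
        app = vm_to_app.get(vm)
        if app is not None:  # skip VMs with no known service application
            affected.setdefault(app, []).append(vm)
    return affected
```

Grouping by application rather than returning a flat list keeps the affected virtual machines available for the later processing step, where the handling may differ per application.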
S250, the business management system performs a processing operation on the service application associated with at least one first virtual machine in the first virtual machine set.
Specifically, one way in which the business management system performs the processing operation on the service application associated with a first virtual machine in the first virtual machine set is: the business management system sends the information of the first virtual machine set to the control node corresponding to the service application. The information of the first virtual machine set instructs the control node to perform recovery processing on this service application.
Optionally, the processing operation that the business management system performs on the service application associated with at least one first virtual machine in the first virtual machine set includes at least one of the following modes:
Mode one: The business management system switches the service application associated with the at least one affected first virtual machine to a virtual machine that is not affected by the faulty equipment.
Mode two: The business management system marks the application state information of the at least one first virtual machine as isolated. The isolated state indicates that the at least one first virtual machine stops executing its associated service application, so that the affected virtual machine is isolated within the service application.
Mode three: The business management system sends a first request message to the virtual architecture management system. The first request message indicates the virtual machines to be restored, which are a subset of the first virtual machine set.
Mode four: The business management system sends the state alarm information to the control node of the service application associated with the at least one first virtual machine, so that the control node, according to the state alarm information, either switches that service application to a virtual machine not affected by the faulty equipment or marks the application state information of the at least one first virtual machine as isolated.
In the embodiments of the present invention, after obtaining the fault warning message of the faulty equipment, the virtual architecture management system analyzes this message directly, obtains the one or more virtual machines affected by the faulty equipment, and sends the information of these virtual machines to the business management system. The business management system can identify the affected service applications directly from the information of these virtual machines and then process them. Compared with the prior art, the virtual architecture management system determines the affected virtual machines directly from the fault warning message of the faulty equipment, so the business management system can obtain the affected service applications directly from the state alarm information of the first virtual machine set, instead of first analyzing the alarm message of the faulty equipment to find the affected virtual machines and then analyzing which service applications are affected. As a result, the business management system does not need to directly perceive hardware faults and can quickly trigger handling of the impact on service applications, which reduces service loss and improves the reliability of service applications.
In the embodiments of the present invention, optionally, the virtual architecture management system can determine impact information of the first virtual machine set according to the fault warning message of the faulty equipment. This impact information indicates the type and/or level of the impact that the faulty equipment has on at least one first virtual machine in the first virtual machine set. Accordingly, the state alarm information that the virtual architecture management system sends to the business management system, and that the business management system receives from it, can also carry this impact information. The business management system then performs the processing operation on the service application associated with a first virtual machine in the first virtual machine set according to the impact information of the first virtual machine set.
Users can define, according to their needs, the type and/or level of the impact that a fault of the faulty equipment has on a virtual machine. The following are examples of impact types and levels in the embodiments of the present invention.
When a whole-machine failure of a physical server (including faults that prevent the server from providing computing resources, such as the physical server powering down or a host operating system failure), a storage device failure (such as the storage device powering down or all links being broken), or another hardware fault leaves a virtual machine unable to run and provide service for the business, the impact type of the virtual machine may be set to fault and the level may be set to urgent. Likewise, if a network interface card fault or another hardware fault prevents a virtual machine from working normally, the impact type may be fault and the level may be set to urgent.
When a physical server suffers a partial fault, for example a fault in the central processing unit (Central Processing Unit, CPU), memory, or some of the network interface cards, and the fault does not affect virtual machine operation for the time being but poses an operational risk, the impact type of the virtual machine may be set to high risk and the level may be set to important.
When a storage device suffers a partial fault, for example interruption of some links or failure of some controllers, and the fault does not affect virtual machine operation for the time being but poses an operational risk, the impact type of the virtual machine may be set to risk and the level may be set to secondary.
In general, whenever a hardware fault leaves a virtual machine unable to run or unable to provide service externally, the impact type of the virtual machine may be set to fault and the level may be set to urgent.
For a hardware fault that does not affect the operation of any virtual machine, the impact type and level may be left unset, or the impact type may be set to low risk or no risk with the level set to prompt.
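The example types and levels above can be condensed into a small classification function. This is only a sketch of the examples given: the function name, the `component` argument, and its string values are assumptions, and since the text says users may define their own types and levels, a real system would likely make this table configurable.

```python
def classify_impact(vm_can_run, component=""):
    """Return (impact type, level) for one virtual machine.

    vm_can_run: whether the VM can still run and provide service.
    component:  "server" or "storage" for a partial fault that leaves the VM
                running but at risk; "" if the VM is not put at risk.
    """
    if not vm_can_run:
        return ("fault", "urgent")
    if component == "server":
        return ("high risk", "important")
    if component == "storage":
        return ("risk", "secondary")
    return ("low risk", "prompt")
```

Checking `vm_can_run` first mirrors the general rule that any hardware fault which stops a virtual machine from running or serving is treated as a fault with the urgent level, regardless of which component failed.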
As described above, the state alarm information that the virtual architecture management system sends to the business management system can include the information of the first virtual machine set, that is, the identifier of at least one affected first virtual machine. It can also include the impact information of the first virtual machine set, that is, the type and/or level of the impact that the faulty equipment has on at least one first virtual machine in the first virtual machine set.
Optionally, the state alarm information can also include the generation time, clearance time, alarm synchronization number, alarm name, alarm object type, and so on. In addition to the above, the state alarm information of a virtual machine can also carry information such as the fault cause of the faulty equipment. Of course, the information included in the state alarm information of a virtual machine is not limited to the items listed above.
In the method performed by the virtual architecture management system described above, the virtual architecture management system only analyzes the alarm message of the faulty equipment and provides the resulting information to the business management system. Nevertheless, this method is an effective step that precedes the subsequent processing of the virtual machines affected by the fault and of the service applications affected by the fault, and can therefore reasonably be referred to as a fault handling method.
In the embodiments of the present invention, optionally, the business management system can call an interface provided by the virtual architecture management system to request that the virtual architecture management system process the affected virtual machines. Specifically, the business management system can determine, according to the impact information of the first virtual machine set, a first request message indicating the virtual machines to be restored that need to be recovered preferentially. The virtual machines to be restored are a subset of the first virtual machine set. The business management system then sends this first request message to the virtual architecture management system.
According to the impact information of the first virtual machine set, the business management system determines the priority of the virtual machines in the first virtual machine set that it needs the virtual architecture management system to recover, and sends the virtual architecture management system a first request message indicating the recovery priority of the virtual machines to be restored. The virtual architecture management system can then, following the priority indicated by the business management system, perform recovery processing on at least one virtual machine in the first virtual machine set affected by the fault of the faulty equipment.
Optionally, another specific implementation by which the business management system sends the virtual architecture management system the first request message indicating the recovery priority of the virtual machines to be restored is: the business management system sends the first request message according to the priority of the service application.
Specifically, according to the priority of the service applications associated with the first virtual machines in the first virtual machine set, that is, the priority of the service applications affected by the faulty equipment, the business management system instructs the virtual architecture management system to perform recovery processing on the virtual machines to be restored in the first virtual machine set. This ensures that high-priority service applications are recovered first, which further safeguards the reliability of service applications.
For example, through the first request message, the business management system can instruct the virtual architecture management system to preferentially recover the first virtual machines with higher priority in the first virtual machine set.
Optionally, the business management system can send the first request message to the virtual architecture management system according to both the impact information of the first virtual machine set and the priority of the associated service applications.
Optionally, one specific implementation by which the business management system sends the first request message to the virtual architecture management system is: the business management system sends the first request message according to the deployment mode of the service application. The deployment mode of a service application includes at least one of active-standby mode, load sharing mode, and single virtual machine mode.
Specifically, according to the deployment mode of the service application, that is, the deployment mode of the service application affected by the faulty equipment, the business management system instructs the virtual architecture management system to perform recovery processing on the virtual machines to be restored in the first virtual machine set.
For example, through the first request message, the business management system can instruct the virtual architecture management system to preferentially recover the primary virtual machine among the primary and standby virtual machines of a service application deployed in active-standby mode.
Optionally, the business management system can send the first request message to the virtual architecture management system according to the impact information of the first virtual machine set and the deployment mode of the service application; or according to the deployment mode and the priority of the service application; or according to the impact information of the first virtual machine set, the deployment mode of the service application, and the priority of the service application.
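One way to combine the three criteria (impact information, service application priority, and deployment mode) into a recovery order is a compound sort key. The sketch below is an assumption about how such a combination might look, not a rule stated in the text: the field names, the numeric priority convention, and the relative weighting of the three criteria are all hypothetical.

```python
from dataclasses import dataclass

# Impact levels from most to least severe, using the example level names.
LEVEL_RANK = {"urgent": 0, "important": 1, "secondary": 2, "prompt": 3}


@dataclass
class AffectedVm:
    vm_id: str
    level: str         # impact level carried in the state alarm information
    app_priority: int  # hypothetical convention: lower number = more important
    is_primary: bool   # True for the primary VM of an active-standby application


def recovery_order(vms):
    """Order VMs for the first request message.

    Assumed weighting: most severe impact first, then higher application
    priority, then primary virtual machines before the others.
    """
    ranked = sorted(
        vms,
        key=lambda v: (LEVEL_RANK[v.level], v.app_priority, not v.is_primary),
    )
    return [v.vm_id for v in ranked]
```

A tuple key keeps each criterion independent, so dropping one of them (for example using only deployment mode and priority) only requires removing an element from the key.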
After receiving the first request message sent by the business management system, the virtual architecture management system can, as instructed by the first request message, perform recovery processing on the virtual machines to be restored in the first virtual machine set in a certain priority order. The specific form of recovery can be virtual machine migration, that is, migrating the virtual machine from the faulty equipment to another normal device, or using a virtual machine snapshot to restore the virtual machine on another normal device.
Optionally, if within a preset time threshold the virtual architecture management system does not receive a first request message from the business management system indicating the virtual machines in the first virtual machine set that need to be recovered preferentially, it recovers the first virtual machines in the first virtual machine set according to a preset virtual machine recovery policy.
This ensures that, when the business management system provides no information instructing the virtual architecture management system how to recover the virtual machines in the first virtual machine set, the virtual architecture management system can proactively recover at least one first virtual machine in the set according to the preset virtual machine recovery policy.
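The timeout fallback can be sketched with a blocking queue read. This is a minimal illustration under stated assumptions: the queue stands in for whatever interface delivers the first request message, and the preset policy shown (recover every affected virtual machine in set order) is a hypothetical placeholder, since the text does not define the policy.

```python
import queue


def choose_recovery_plan(requests, first_vm_set, timeout_s):
    """Wait up to timeout_s for a first request message, else use the preset policy.

    requests: a queue.Queue on which the business management system's first
              request message (a list of VM ids) would arrive.
    """
    try:
        requested = requests.get(timeout=timeout_s)
        return ("requested", list(requested))
    except queue.Empty:
        # Hypothetical preset policy: recover every affected VM in set order.
        return ("preset", list(first_vm_set))
```

Returning the source of the plan alongside the plan itself makes it easy to log whether recovery was driven by the business management system or by the fallback.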
Optionally, whether the business management system requests the virtual architecture management system to process the affected virtual machines or the virtual architecture management system processes them on its own initiative, after processing an affected virtual machine the virtual architecture management system can send a state alarm clearance message to the business management system, indicating that the business management system can clear the previously received state alarm information corresponding to the processed virtual machine.
After receiving the state alarm clearance message sent by the virtual architecture management system, the business management system can clear the state alarm information of the corresponding virtual machine. This reduces the business management system's work of maintaining alarms that have already been resolved, thereby saving resources and improving efficiency.
The business management system can clear state alarm information either by deleting the stored state alarm information or by modifying a field in the state alarm information so that it indicates that the corresponding virtual machine has been recovered.
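Both clearing forms can be shown in a few lines. The sketch assumes the business management system stores state alarm information in a dictionary keyed by virtual machine id, and the `recovered` field name is hypothetical.

```python
def clear_alarm(alarms, vm_id, delete=True):
    """Clear the stored state alarm information for a recovered virtual machine.

    alarms: dict mapping VM id -> stored state alarm information (a dict).
    delete: True to delete the record, False to mark it as recovered instead.
    """
    if vm_id not in alarms:
        return  # nothing stored for this VM, so nothing to clear
    if delete:
        del alarms[vm_id]
    else:
        alarms[vm_id]["recovered"] = True
```

Marking instead of deleting keeps a history of past alarms, at the cost of having to filter out recovered entries when listing active ones.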
In the embodiments of the present invention, optionally, after determining the service application associated with a first virtual machine in the first virtual machine set according to the information of the first virtual machine set, the business management system can send the information of the first virtual machine set to the control node associated with that service application.
After receiving the information of the first virtual machine set sent by the business management system, the control node of the service application can process the affected service application according to the information of the first virtual machines in the first virtual machine set.
Optionally, the control node of the service application can also process the service application according to its deployment mode. For example, when the service application is deployed in active-standby mode, the control node needs to perform an active-standby switchover if the primary virtual machine has failed, and does not need to if the standby VM has failed. When the service application is deployed in load sharing mode, the control node isolates the affected VM.
Optionally, the control node of the service application can process the service application according to both its deployment mode and the impact information of the first virtual machine set. For example, suppose the impact information of the first virtual machine set indicates that the type of the impact of the faulty equipment on the first virtual machine is fault and the level is urgent, and the service application is deployed in active-standby mode. If the primary VM has failed, the control node needs to perform an active-standby switchover; if the standby VM has failed or the service application is unimportant, the control node may take no action, that is, no switchover is needed. It should be understood that the above ways of processing a service application according to the impact type, level, deployment mode, and so on are illustrative examples. The specific implementation can be defined according to the user's needs, and the present invention is not limited in this regard.
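The control node's decision by deployment mode can be written as a small dispatch function. The mode and role strings and the returned action names are hypothetical labels chosen for this sketch; the text does not describe the action for single virtual machine mode, so the sketch returns `"unspecified"` there rather than guessing.

```python
def control_node_action(deployment_mode, failed_vm_role=""):
    """Return the action a control node takes for one failed virtual machine.

    deployment_mode: "active_standby", "load_sharing", or "single_vm".
    failed_vm_role:  "primary" or "standby" (only used in active-standby mode).
    """
    if deployment_mode == "active_standby":
        # Only a failed primary VM requires an active-standby switchover.
        return "switchover" if failed_vm_role == "primary" else "none"
    if deployment_mode == "load_sharing":
        # The affected VM is isolated so the remaining VMs carry the load.
        return "isolate"
    # Not described in the text for other modes; left open on purpose.
    return "unspecified"
```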
Optionally, after the control node of the service application finishes processing the service application, it can send a service processing feedback message to the business management system to inform the business management system of the result of processing the service application.
The fault handling method of the embodiments of the present invention is described in detail below with reference to Fig. 3, taking a physical host failure as an example. As shown in Fig. 3, there are three devices: compute node 1, compute node 2, and compute node 3, which can respectively be equipment 1, equipment 2, and equipment 3 in Figure 1A or Figure 1B.
Two service applications (Application, APP) are deployed in the virtualized computer system. One is App1, which is associated with VM1 and VM2 and deployed in active-standby mode: VM1 is deployed on compute node 1 and is the primary virtual machine of App1; VM2 is deployed on compute node 2 and is the standby virtual machine of App1. The other is App2, which is associated with VM3 and VM4 and deployed in load sharing mode: VM3 is deployed on compute node 2, and VM4 is deployed on compute node 3.
S402, when a power-down fault occurs on compute node 1, compute node 1 reports its fault warning message to the virtual architecture management system through SNMP.
S404, the virtual architecture management system receives the fault warning message, determines the virtual machines affected by the fault according to this message, and generates the state alarm information of the virtual machines. The specific steps are as follows.
(1) The virtual architecture management system receives the hardware fault warning message of compute node 1, queries its database for the list of virtual machines running on compute node 1, finds that the affected virtual machine is VM1, and obtains information such as the ID of VM1.
(2) Because the power-down fault of compute node 1 causes VM1 to fail, VM1 cannot run and provide service. The impact type of VM1 can therefore be set to fault, and the impact level of VM1 can be set to urgent.
(3) The virtual architecture management system generates the state alarm information of VM1, which carries: the ID of VM1, the impact type of VM1 (fault), the generation time, the impact level of VM1 (urgent), the fault type of the faulty equipment (whole-machine failure of compute node 1), and so on.
S406, the virtual architecture management system sends the state alarm information of VM1 to the business management system.
S408, the business management system receives the state alarm information of the virtual machine sent by the virtual architecture management system, obtains information such as the ID of VM1, queries its database for the correspondence between VM1 and service applications, and finds that the affected service application is App1.
The business management system sends a notification message to the control node of App1 to notify it of the VM1 fault. According to the notification message, the control node decides to promote VM2 to primary.
S410, the business management system calls the interface provided by the virtual architecture management system and sends it a first request message, requesting that the virtual architecture management system quickly recover VM1.
S412, the virtual architecture management system migrates VM1 to compute node 3. VM1 now becomes the standby virtual machine of App1.
In a specific implementation, the virtual architecture management system can also perform fault isolation on compute node 1 at this point.
S414, after recovering VM1, the virtual architecture management system sends a VM1 state alarm clearance message to the business management system.
After the fault has been handled, the deployment of the applications in the operation system is as shown in Fig. 4. App1 is deployed in active-standby mode: VM2 is deployed on compute node 2 as the primary virtual machine, and VM1 is deployed on compute node 3 as the standby virtual machine. App2 is deployed in load sharing mode: VM3 is deployed on compute node 2, and VM4 is deployed on compute node 3. Compute node 1 has failed and is isolated from the resource pool.
In the above embodiment, after compute node 1 fails, it sends an alarm message to the virtual architecture management system. According to the alarm message, the virtual architecture management system determines that the affected virtual machine is VM1 and determines the type and level of the impact on VM1. Without processing the hardware alarm message directly, the business management system obtains the information of the affected VM1 and the impact information of VM1 directly from the virtual architecture management system, and then determines that the service application running on VM1 is App1. The business management system notifies the control node of App1 to process App1 and requests the virtual architecture management system to recover VM1. The virtual architecture management system migrates VM1 to compute node 3 according to the request of the business management system. After obtaining the information of VM1 and the impact on VM1 from the business management system, the control node of App1 switches App1's original standby virtual machine VM2 to primary and sets VM1, now migrated to compute node 3, as the standby virtual machine, thereby keeping App1 running and improving the reliability of App1.
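The outcome of this walkthrough can be reproduced with a small state model. The sketch below only tracks virtual machine placement and the primary and standby roles of one active-standby application; the function name, node labels, and dictionary layout are hypothetical, and the message exchange between the systems is deliberately left out.

```python
def handle_node_failure(placement, roles, failed_node, target_node):
    """Return the (placement, roles) state after a node failure is handled.

    placement: VM id -> node it is deployed on.
    roles:     VM id -> "primary" or "standby" for one active-standby application.
    """
    placement = dict(placement)  # work on copies, leave the inputs untouched
    roles = dict(roles)
    for vm, node in list(placement.items()):
        if node != failed_node:
            continue
        if roles.get(vm) == "primary":
            # The control node promotes the standby VM to primary ...
            for peer, role in roles.items():
                if role == "standby":
                    roles[peer] = "primary"
            # ... and the failed VM rejoins as standby once it is recovered.
            roles[vm] = "standby"
        # The virtual architecture management system migrates the VM.
        placement[vm] = target_node
    return placement, roles
```

Starting from VM1 on node 1 as primary and VM2 on node 2 as standby, a failure of node 1 with node 3 as the migration target leaves VM2 as primary and VM1 as standby on node 3, matching the deployment described for Fig. 4.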
Describe the fault handling method of the embodiment of the present invention above in conjunction with Fig. 2 to Fig. 4, with reference to Fig. 5 to Fig. 8 introduction The virtual architecture management system of the embodiment of the present invention and business management system.
Fig. 5 is the schematic diagram of the virtual architecture management system of one embodiment of the invention.It should be understood that Fig. 5 illustrates Virtual architecture management system 500 be only example, the virtual architecture management system of the embodiment of the present invention may also include other modules Or unit, or include the intimate module with the modules in Fig. 5, or do not really want including all moulds in Fig. 5 Block.
An acquisition module 510, configured to obtain a fault alarm message, where the fault alarm message carries identification information of a faulty device and a fault type.
A determining module 520, configured to determine a first virtual machine set according to the fault alarm message, where the first virtual machine set includes at least one first virtual machine affected by the faulty device.
A sending module 530, configured to send a state alarm message to the business management system, where the state alarm message carries information about the first virtual machine set.
In this embodiment of the present invention, after obtaining the fault alarm message of the faulty device, the virtual architecture management system analyzes the message directly, determines the one or more virtual machines affected by the faulty device, and sends information about these virtual machines to the business management system. The business management system can then determine the affected service applications directly from this information and handle them. In the prior art, by contrast, the business management system must first analyze the alarm message of the faulty device to find the affected virtual machines and only then work out the affected service applications. Here the business management system does not need to perceive hardware faults directly, so handling of the impact on service applications can be triggered more quickly, which reduces service loss and improves the reliability of the service applications.
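The step of turning a fault alarm message into a state alarm message can be sketched as follows. This is a minimal illustration under stated assumptions: the `FaultAlarm` and `StateAlarm` records, the `determine_first_vm_set` function, and the `placement` inventory (a mapping from device identifiers to hosted virtual machines) are all hypothetical and are not defined by the embodiment.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class FaultAlarm:
    """Fault alarm message: faulty device identifier plus fault type."""
    device_id: str
    fault_type: str


@dataclass(frozen=True)
class StateAlarm:
    """State alarm message carrying the first virtual machine set."""
    affected_vms: Tuple[str, ...]


def determine_first_vm_set(alarm: FaultAlarm,
                           placement: Dict[str, List[str]]) -> StateAlarm:
    """Look up which VMs are hosted on the faulty device.

    `placement` is a hypothetical inventory mapping device IDs to the
    VMs running on them; a real system would hold richer topology data.
    """
    vms = placement.get(alarm.device_id, [])
    return StateAlarm(affected_vms=tuple(vms))


placement = {"compute-node-1": ["VM1"], "compute-node-2": ["VM2"]}
state_alarm = determine_first_vm_set(
    FaultAlarm(device_id="compute-node-1", fault_type="host_down"),
    placement,
)
```

The point of the sketch is that only the virtual architecture management system needs the device-to-VM inventory; the receiver of `StateAlarm` sees virtual machine identifiers only.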
Optionally, in an embodiment, the determining module is further configured to determine impact information of the first virtual machine set according to the fault alarm message, where the impact information indicates the type and/or level of the impact that the faulty device has on the at least one first virtual machine. In this case, the state alarm message that the virtual architecture management system sends to the business management system also carries the impact information of the first virtual machine set.
In this embodiment of the present invention, in addition to determining the at least one affected virtual machine from the fault alarm message of the faulty device, the virtual architecture management system can determine the type and/or level of the impact that the fault has on these virtual machines. The state alarm message sent to the business management system then also carries impact information indicating the type and/or level of the impact on the first virtual machines in the first virtual machine set, so that the business management system or the service system can handle the service applications more appropriately according to the impact information, which further improves the reliability of the service applications.
Optionally, in an embodiment, the type of the impact that the faulty device has on the at least one first virtual machine includes at least one of the following: fault, high risk, risk, low risk, or no impact.
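The impact types listed above can be represented as an ordered enumeration. The following is a sketch only: the numeric ordering, the fault-type names, and the mapping from fault type to impact are assumptions made for illustration and are not specified by the embodiment.

```python
from enum import IntEnum


class Impact(IntEnum):
    """Impact types from the embodiment; the numeric order is an assumption."""
    NO_IMPACT = 0
    LOW_RISK = 1
    RISK = 2
    HIGH_RISK = 3
    FAULT = 4


# Hypothetical mapping from fault type to impact on the hosted VMs.
IMPACT_BY_FAULT_TYPE = {
    "host_down": Impact.FAULT,
    "redundant_power_lost": Impact.HIGH_RISK,
    "fan_degraded": Impact.LOW_RISK,
}


def classify_impact(fault_type: str) -> Impact:
    """Return the impact for a fault type, defaulting to RISK when unknown."""
    return IMPACT_BY_FAULT_TYPE.get(fault_type, Impact.RISK)
```

Using an ordered type lets a receiver compare impacts, for example to handle `FAULT` before `LOW_RISK`.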
Optionally, in an embodiment, the virtual architecture management system further includes a receiving module and a recovery module. The receiving module is configured to receive a first request message sent by the business management system, where the first request message indicates the virtual machines to be recovered that need to be recovered preferentially, and the virtual machines to be recovered are a subset of the first virtual machine set. The recovery module is configured to preferentially recover the virtual machines to be recovered according to the first request message.
In this embodiment of the present invention, the virtual architecture management system can, at the request of the business management system and according to the priority indicated by the business management system, recover at least one virtual machine in the first virtual machine set affected by the fault of the faulty device.
Optionally, in an embodiment, the recovery module is further configured to recover the at least one first virtual machine according to a preset virtual machine recovery policy when the first request message sent by the business management system is not received within a preset time threshold.
This embodiment of the present invention ensures that, when the business management system provides no information indicating how the virtual architecture management system should recover the virtual machines in the first virtual machine set, the virtual architecture management system can actively recover the first virtual machines in the first virtual machine set according to the preconfigured recovery policy.
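The choice between a requested recovery order and the preset policy can be sketched as follows. This is a minimal illustration: the function names, the use of an in-process queue as the message channel, and the decision to recover the remaining affected virtual machines after the requested ones are assumptions, not details given by the embodiment.

```python
import queue
from typing import Callable, List, Optional, Sequence


def wait_for_request(inbox: "queue.Queue[List[str]]",
                     timeout_s: float) -> Optional[List[str]]:
    """Wait up to `timeout_s` seconds for a first request message."""
    try:
        return inbox.get(timeout=timeout_s)
    except queue.Empty:
        return None  # threshold elapsed without a request


def recovery_order(affected: Sequence[str],
                   requested: Optional[Sequence[str]],
                   preset_policy: Callable[[Sequence[str]], List[str]]) -> List[str]:
    """Return the order in which the affected VMs are recovered."""
    if requested is None:
        # No request arrived in time: fall back to the preset policy.
        return preset_policy(affected)
    # Requested VMs must be a subset of the affected set; recover them first.
    first = [vm for vm in requested if vm in affected]
    # Assumption: the remaining affected VMs are recovered afterwards.
    rest = [vm for vm in affected if vm not in first]
    return first + rest
```

For example, with affected virtual machines `["VM1", "VM3", "VM5"]` and a request naming `["VM5"]`, VM5 is recovered first; with no request, the preset policy (here any callable, such as `sorted`) decides the order.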
Optionally, in an embodiment, the sending module is further configured to send a state alarm clearance message to the business management system, where the state alarm clearance message instructs the business management system to clear the state alarm message held in the business management system.
In this embodiment of the present invention, after recovering the virtual machines, the virtual architecture management system sends a state alarm clearance message to the business management system, so that the business management system can clear the related state alarm message it received earlier. This prevents the business management system from analyzing and processing state alarm messages related to virtual machines that have already been recovered.
It should be understood that the virtual architecture management system 500 of this embodiment of the present invention may be implemented by an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC) or by a programmable logic device (Programmable Logic Device, PLD). The PLD may be a complex programmable logic device (Complex Programmable Logic Device, CPLD), a field-programmable gate array (Field-Programmable Gate Array, FPGA), generic array logic (Generic Array Logic, GAL), or any combination thereof. When the steps performed by the virtual architecture management system in the fault handling method shown in Fig. 2 are implemented by software, the virtual architecture management system 500 and its modules may also be software modules.
It should be understood that the virtual architecture management system 500 shown in Fig. 5 may correspond to the virtual architecture management system in the fault handling method shown in Fig. 2, and that the above and other operations and/or functions of the units in the virtual architecture management system 500 respectively implement the corresponding procedures of the fault handling method in Fig. 2. For brevity, details are not repeated here.
Fig. 6 is a schematic diagram of a business management system according to an embodiment of the present invention. It should be understood that the business management system 600 shown in Fig. 6 is only an example. A business management system of the embodiments of the present invention may also include other modules or units, may include modules whose functions are similar to those of the modules in Fig. 6, or need not include all the modules in Fig. 6.
A receiving module 610, configured to receive a state alarm message sent by the virtual architecture management system, where the state alarm message carries information about a first virtual machine set affected by a faulty device, and the first virtual machine set includes at least one first virtual machine.
A determining module 620, configured to determine, according to the state alarm message, the service application associated with the at least one first virtual machine.
A processing module 630, configured to perform a processing operation on the service application associated with the at least one first virtual machine.
In this embodiment of the present invention, after receiving from the virtual architecture management system the information about the virtual machines in the first virtual machine set affected by the faulty device, the business management system can determine the affected service applications directly from this information and handle them. In the prior art, by contrast, the business management system must first analyze the alarm message of the faulty device to find the affected virtual machines and only then work out the affected service applications. Here the business management system does not need to perceive hardware faults directly, so handling of the impact on service applications can be triggered more quickly, which reduces service loss and improves the reliability of the service applications.
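The lookup from affected virtual machines to affected service applications can be sketched as follows. The `vm_to_apps` mapping is a hypothetical deployment record assumed to be held by the business management system; its structure is not defined by the embodiment.

```python
from typing import Dict, Iterable, List


def affected_applications(affected_vms: Iterable[str],
                          vm_to_apps: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Map each affected service application to its affected VMs.

    `vm_to_apps` is a hypothetical record of which applications run on
    which virtual machine.
    """
    result: Dict[str, List[str]] = {}
    for vm in affected_vms:
        for app in vm_to_apps.get(vm, []):
            result.setdefault(app, []).append(vm)
    return result
```

For instance, if VM1 and VM2 both run App1 and VM3 runs App2, a state alarm message naming VM1 and VM3 yields App1 (via VM1) and App2 (via VM3) as the applications to handle. No hardware information is needed for this step.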
Optionally, in an embodiment, the state alarm message also carries impact information of the first virtual machine set, where the impact information indicates the type and/or level of the impact that the faulty device has on the at least one first virtual machine. The processing module is specifically configured to perform the processing operation on the service application associated with the at least one first virtual machine according to the impact information of the first virtual machine set.
In this embodiment of the present invention, the state alarm message of the first virtual machine set that the business management system receives from the virtual architecture management system also carries impact information indicating the type and/or level of the impact that the faulty device has on the first virtual machines in the first virtual machine set, so that the business management system or the service system can handle the service applications more appropriately according to the impact information, which further improves the reliability of the service applications.
Optionally, in an embodiment, the type of the impact on the first virtual machine set includes at least one of the following: fault, high risk, risk, low risk, or no impact.
Optionally, in an embodiment, the processing operation includes at least one of the following:
The business management system switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution; or
The business management system marks the application state information of the at least one first virtual machine as isolated, where the isolated state indicates that the at least one first virtual machine stops executing the service application associated with it; or
The business management system sends a first request message to the virtual architecture management system, where the first request message indicates the virtual machines to be recovered, and the virtual machines to be recovered are a subset of the first virtual machine set; or
The business management system sends the state alarm message to the control node of the service application associated with the at least one first virtual machine, so that the control node, according to the state alarm message, switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution, or marks the application state information of the at least one first virtual machine as isolated.
Optionally, in an embodiment, the determining module is further configured to determine a first request message according to the impact information of the first virtual machine set, where the first request message indicates the virtual machines to be recovered that need to be recovered preferentially, and the virtual machines to be recovered are a subset of the first virtual machine set. The business management system further includes a sending module, configured to send the first request message to the virtual architecture management system.
In this embodiment of the present invention, the business management system can determine, according to the impact information of the first virtual machines, the priority of the virtual machines in the first virtual machine set that need to be recovered by the virtual architecture management system, and send to the virtual architecture management system a first request message indicating the recovery priority of these virtual machines. The virtual architecture management system can then, according to the priority indicated by the business management system, recover at least one virtual machine in the first virtual machine set affected by the fault of the faulty device.
Optionally, in an embodiment, the sending module is further configured to send the first request message to the virtual architecture management system according to the priority of the service application associated with the at least one first virtual machine.
In this embodiment of the present invention, the business management system instructs the virtual architecture management system to recover the virtual machines to be recovered in the first virtual machine set according to the priority of the service applications associated with the first virtual machines, that is, the priority of the service applications affected by the faulty device. This ensures that high-priority service applications are recovered first, which further ensures the reliability of the service applications.
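Ordering the virtual machines to be recovered by the priority of their associated service applications can be sketched as follows. The `vm_to_app` and `app_priority` mappings, and the convention that a larger number means a higher priority, are assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def order_by_app_priority(affected_vms: Sequence[str],
                          vm_to_app: Dict[str, str],
                          app_priority: Dict[str, int]) -> List[str]:
    """Order affected VMs so that VMs of higher-priority applications come first.

    Assumption: a larger number means a higher priority, and a VM whose
    application or priority is unknown is treated as priority 0.
    """
    def priority(vm: str) -> int:
        return app_priority.get(vm_to_app.get(vm, ""), 0)

    # sorted() is stable, so VMs with equal priority keep their input order.
    return sorted(affected_vms, key=priority, reverse=True)
```

The resulting list could serve as the content of the first request message, with the virtual machine of the highest-priority application at the front.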
Optionally, in an embodiment, the sending module is further configured to send the first request message to the virtual architecture management system according to the deployment mode of the service application associated with the at least one first virtual machine, where the deployment mode includes at least one of an active/standby mode, a load-sharing mode, and a single-virtual-machine mode.
In this embodiment of the present invention, the business management system instructs the virtual architecture management system to recover the virtual machines to be recovered in the first virtual machine set according to the deployment mode of the affected service applications, that is, the deployment mode of the service applications affected by the faulty device.
Optionally, in an embodiment, the receiving module is further configured to receive a state alarm clearance message sent by the virtual architecture management system, and the processing module is further configured to clear the state alarm message according to the state alarm clearance message.
In this embodiment of the present invention, the business management system can clear the related state alarm message it received earlier according to the state alarm clearance message sent by the virtual architecture management system, which avoids analyzing and processing state alarm messages related to virtual machines that have already been recovered.
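The bookkeeping of state alarm messages and their clearance on the business management side can be sketched as follows. The `AlarmStore` class and the use of an alarm identifier to match a clearance message with an earlier alarm are assumptions; the embodiment does not specify how the two messages are correlated.

```python
from typing import Dict, Iterable, List


class AlarmStore:
    """Hypothetical store of state alarm messages that are still pending."""

    def __init__(self) -> None:
        self._active: Dict[str, List[str]] = {}

    def on_state_alarm(self, alarm_id: str, affected_vms: Iterable[str]) -> None:
        """Record a state alarm message and its affected VMs."""
        self._active[alarm_id] = list(affected_vms)

    def on_clearance(self, alarm_id: str) -> bool:
        """Clear a recorded alarm; return False if it was not recorded."""
        return self._active.pop(alarm_id, None) is not None

    def pending(self) -> List[str]:
        """Identifiers of alarms that still need analysis."""
        return sorted(self._active)
```

Once a clearance message has been handled, the corresponding alarm no longer appears in `pending()`, so no further analysis is spent on virtual machines that have been recovered.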
It should be understood that the business management system 600 of this embodiment of the present invention may be implemented by an application-specific integrated circuit or by a programmable logic device. The programmable logic device may be a complex programmable logic device, a field-programmable gate array, generic array logic, or any combination thereof. When the steps performed by the business management system in the fault handling method shown in Fig. 2 are implemented by software, the business management system 600 and its modules may also be software modules.
It should be understood that the business management system 600 shown in Fig. 6 may correspond to the business management system in the fault handling method shown in Fig. 2, and that the above and other operations and/or functions of the units in the business management system 600 respectively implement the corresponding procedures of the fault handling method in Fig. 2. For brevity, details are not repeated here.
Fig. 7 is a schematic diagram of a virtual architecture management system 700 according to another embodiment of the present invention. The virtual architecture management system 700 includes a processor 710, a memory 720, a communication interface 730, and a bus 740. The processor 710, the memory 720, and the communication interface 730 communicate through the bus 740, and may also communicate by other means such as wireless transmission. The memory 720 is configured to store instructions, and the processor 710 is configured to execute the instructions stored in the memory 720. The memory 720 stores program code, and the processor 710 can call the program code stored in the memory 720 to perform the following operations:
Obtain a fault alarm message, where the fault alarm message carries identification information of a faulty device and a fault type; determine a first virtual machine set according to the fault alarm message, where the first virtual machine set includes at least one first virtual machine affected by the faulty device; and send a state alarm message to the business management system, where the state alarm message carries information about the first virtual machine set.
In this embodiment of the present invention, after obtaining the fault alarm message of the faulty device, the virtual architecture management system analyzes the message directly, determines the one or more virtual machines affected by the faulty device, and sends information about these virtual machines to the business management system. The business management system can then determine the affected service applications directly from this information and handle them. In the prior art, by contrast, the business management system must first analyze the alarm message of the faulty device to find the affected virtual machines and only then work out the affected service applications. Here the business management system does not need to perceive hardware faults directly, so handling of the impact on service applications can be triggered more quickly, which reduces service loss and improves the reliability of the service applications.
Optionally, in an embodiment, the processor 710 may also call the program code stored in the memory 720 to perform the following operation: determine impact information of the first virtual machine set according to the fault alarm message, where the impact information indicates the type and/or level of the impact that the faulty device has on the at least one first virtual machine. The state alarm message also carries the impact information.
In this embodiment of the present invention, in addition to determining the at least one affected virtual machine from the fault alarm message of the faulty device, the virtual architecture management system can determine the type and/or level of the impact that the fault has on these virtual machines. The state alarm message sent to the business management system then also carries impact information indicating the type and/or level of the impact on the first virtual machines in the first virtual machine set, so that the business management system or the service system can handle the service applications more appropriately according to the impact information, which further improves the reliability of the service applications.
Optionally, in an embodiment, the type of the impact that the faulty device has on the at least one first virtual machine includes at least one of the following: fault, high risk, risk, low risk, or no impact.
Optionally, in an embodiment, the processor 710 may call the program code stored in the memory 720 to perform the following operation: receive a first request message sent by the business management system, where the first request message indicates the virtual machines to be recovered that need to be recovered preferentially, and the virtual machines to be recovered are a subset of the first virtual machine set. The processor is further configured to preferentially recover the virtual machines to be recovered according to the first request message.
In this embodiment of the present invention, the virtual architecture management system can, at the request of the business management system and according to the priority indicated by the business management system, recover at least one virtual machine in the first virtual machine set affected by the fault of the faulty device.
Optionally, in an embodiment, the processor 710 may call the program code stored in the memory 720 to perform the following operation: when the first request message sent by the business management system is not received within a preset time threshold, recover the at least one first virtual machine according to a preset virtual machine recovery policy.
This embodiment of the present invention ensures that, when the business management system provides no information indicating how the virtual architecture management system should recover the virtual machines in the first virtual machine set, the virtual architecture management system can actively recover the first virtual machines in the first virtual machine set according to the preconfigured recovery policy.
Optionally, in an embodiment, the processor 710 may call the program code stored in the memory 720 to perform the following operation: send a state alarm clearance message to the business management system, where the state alarm clearance message instructs the business management system to clear the state alarm message held in the business management system.
In this embodiment of the present invention, after recovering the virtual machines, the virtual architecture management system sends a state alarm clearance message to the business management system, so that the business management system can clear the related state alarm message it received earlier. This prevents the business management system from analyzing and processing state alarm messages related to virtual machines that have already been recovered.
It should be understood that the virtual architecture management system shown in Fig. 7 may correspond to the virtual architecture management system shown in Fig. 5, and that the above and other operations and/or functions of its units respectively implement the corresponding procedures performed by the virtual architecture management system in the fault handling method shown in Fig. 2. For brevity, details are not repeated here.
Fig. 8 is a schematic diagram of a business management system 800 according to another embodiment of the present invention. The business management system 800 includes a processor 810, a memory 820, a communication interface 830, and a bus 840. The processor 810, the memory 820, and the communication interface 830 communicate through the bus 840, and may also communicate by other means such as wireless transmission. The memory 820 is configured to store instructions, and the processor 810 is configured to execute the instructions stored in the memory 820. The memory 820 stores program code, and the processor 810 can call the program code stored in the memory 820 to perform the following operations:
Receive a state alarm message sent by the virtual architecture management system, where the state alarm message carries information about a first virtual machine set affected by a faulty device, and the first virtual machine set includes at least one first virtual machine; determine, according to the state alarm message, the service application associated with the at least one first virtual machine; and perform a processing operation on the service application associated with the at least one first virtual machine.
In this embodiment of the present invention, after receiving from the virtual architecture management system the information about the virtual machines in the first virtual machine set affected by the faulty device, the business management system can determine the affected service applications directly from this information and handle them. In the prior art, by contrast, the business management system must first analyze the alarm message of the faulty device to find the affected virtual machines and only then work out the affected service applications. Here the business management system does not need to perceive hardware faults directly, so handling of the impact on service applications can be triggered more quickly, which reduces service loss and improves the reliability of the service applications.
Optionally, in an embodiment, the state alarm message also carries impact information of the first virtual machine set, where the impact information indicates the type and/or level of the impact that the faulty device has on the at least one first virtual machine. The processor is specifically configured to perform the processing operation on the service application associated with the at least one first virtual machine according to the impact information of the first virtual machine set.
In this embodiment of the present invention, the state alarm message of the first virtual machine set that the business management system receives from the virtual architecture management system also carries impact information indicating the type and/or level of the impact that the faulty device has on the first virtual machines in the first virtual machine set, so that the business management system or the service system can handle the service applications more appropriately according to the impact information, which further improves the reliability of the service applications.
Optionally, in an embodiment, the type of the impact on the first virtual machine set includes at least one of the following: fault, high risk, risk, low risk, or no impact.
Optionally, in an embodiment, the processing operation includes at least one of the following:
The business management system switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution; or
The business management system marks the application state information of the at least one first virtual machine as isolated, where the isolated state indicates that the at least one first virtual machine stops executing the service application associated with it; or
The business management system sends a first request message to the virtual architecture management system, where the first request message indicates the virtual machines to be recovered, and the virtual machines to be recovered are a subset of the first virtual machine set; or
The business management system sends the state alarm message to the control node of the service application associated with the at least one first virtual machine, so that the control node, according to the state alarm message, switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution, or marks the application state information of the at least one first virtual machine as isolated.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720 Hereinafter operate:First request message is determined according to the impact information of described first virtual machine set.Described transmitter 840 be used for Described virtual architecture management system sends described first request message.
In the embodiment of the present invention, business management system can determine the first virtual machine according to the impact information of the first virtual machine Need the priority of the virtual machine to be restored of virtual architecture management system recovery in set, and manage transmission to virtual machine architecture For indicate these virtual machines to be restored recovery priority the first request message so that virtual architecture management system is permissible The priority being indicated according to business management system, in the first virtual machine set that the fault by faulty equipment is affected at least One virtual machine carries out recovery process.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720 Hereinafter operate:The priority of the service application according at least one the first virtual machine association described manages system to described virtual architecture System sends described first request message.
In the embodiment of the present invention, the industry that business management system is associated according to the first virtual machine in the first virtual machine set The priority of business application, that is, the priority indication virtual architecture management system of the service application according to faulty equipment impact is to first Virtual machine to be restored in virtual machine set carries out recovery process, thereby may be ensured that the service application of high priority can be excellent First it is restored, be further ensured that the reliability of service application.
Alternatively, as an embodiment, processor 710 can call the program code execution of storage in memorizer 720 Hereinafter operate:The deployment mode of the service application according at least one the first virtual machine association described manages to described virtual architecture System sends described first request message, and the deployment mode of the service application of at least one the first virtual machine association described includes leading At least one in standby pattern, load sharing pattern and single virtual machine mode.
In the embodiment of the present invention, business management system, according to the deployment mode of service application, affects according to faulty equipment Service application deployment mode instruction virtual architecture management system the virtual machine to be restored in the first virtual machine set is entered Row recovery is processed.
Optionally, as an embodiment, the processor 710 may call the program code stored in the memory 720 to perform the following operation: receiving a state alarm clearance message sent by the virtual architecture management system. The processor is further configured to clear the state alarm message according to the state alarm clearance message.
In this embodiment of the present invention, the business management system can clear the related state alarm messages it received earlier according to the state alarm clearance message sent by the virtual architecture management system, and thereby avoids analyzing and processing state alarm messages related to virtual machines that have already been recovered.
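The alarm clearance step can be sketched as a small store on the business management system side that records state alarms per virtual machine and drops them when a clearance message arrives. The `StateAlarmStore` class and its method names are hypothetical, and the sketch assumes that a clearance message lists the ids of the recovered virtual machines, which the source does not specify.

```python
class StateAlarmStore:
    """Pending state alarms kept by the business management system (sketch)."""

    def __init__(self):
        # Maps a VM id to the impact information carried by its state alarm.
        self._alarms = {}

    def on_state_alarm(self, vm_ids, impact):
        """Record a state alarm for every VM in the first virtual machine set."""
        for vm_id in vm_ids:
            self._alarms[vm_id] = impact

    def on_clearance(self, vm_ids):
        """Drop alarms for recovered VMs so they are not analyzed again."""
        for vm_id in vm_ids:
            self._alarms.pop(vm_id, None)  # ignore VMs with no pending alarm

    def pending(self):
        """Return the ids of VMs that still have an uncleared state alarm."""
        return sorted(self._alarms)


store = StateAlarmStore()
store.on_state_alarm(["vm-1", "vm-2", "vm-3"], impact="high risk")
store.on_clearance(["vm-2", "vm-9"])  # vm-9 was never alarmed; it is ignored
remaining = store.pending()
print(remaining)
```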
It should be understood that the business management system of the embodiment of the present invention shown in Fig. 8 may correspond to the business management system shown in Fig. 6, and that the foregoing and other operations and/or functions of the units in the business management system are respectively intended to implement the corresponding procedures performed by the business management system in the fault handling method shown in Fig. 2. For brevity, details are not repeated here.
It can be understood that the processor in the embodiments of the present invention may be an integrated circuit chip with signal processing capability. During implementation, each step of the foregoing method embodiments may be completed by an integrated logic circuit of hardware in the processor or by instructions in the form of software. The processor may be a general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field-programmable gate array (Field Programmable Gate Array, FPGA) or another programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component, and can implement or execute the methods, steps, and logical block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor. The steps of the methods disclosed in the embodiments of the present invention may be performed and completed directly by a hardware decoding processor, or by a combination of hardware and software modules in a decoding processor. The software module may be located in a storage medium that is mature in the art, such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory, an electrically erasable programmable memory, or a register. The storage medium is located in the memory; the processor reads the information in the memory and completes the steps of the foregoing methods in combination with its hardware.
It can be understood that the memory in the embodiments of the present invention may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be a read-only memory (Read-Only Memory, ROM), a programmable read-only memory (Programmable ROM, PROM), an erasable programmable read-only memory (Erasable PROM, EPROM), an electrically erasable programmable read-only memory (Electrically EPROM, EEPROM), or a flash memory. The volatile memory may be a random access memory (Random Access Memory, RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), synchronous dynamic random access memory (Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (Double Data Rate SDRAM, DDR SDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchlink dynamic random access memory (Synchlink DRAM, SLDRAM), and direct Rambus random access memory (Direct Rambus RAM, DR RAM). It should be noted that the memory of the systems and methods described herein is intended to include, but is not limited to, these and any other suitable types of memory.
In addition, the terms "system" and "network" are often used interchangeably herein. The term "and/or" herein merely describes an association between associated objects and indicates that three relationships may exist. For example, "A and/or B" may indicate three cases: A exists alone, both A and B exist, and B exists alone. In addition, the character "/" herein generally indicates an "or" relationship between the associated objects before and after it.
It should be understood that, in the embodiments of the present invention, "B corresponding to A" means that B is associated with A and that B can be determined according to A. It should also be understood that determining B according to A does not mean that B is determined according to A alone; B may also be determined according to A and/or other information.
A person of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in the embodiments disclosed herein can be implemented in electronic hardware or in a combination of computer software and electronic hardware. Whether these functions are performed in hardware or in software depends on the specific application and the design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each specific application, but such an implementation should not be considered to go beyond the scope of the present invention.
A person skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, devices, and units described above may be found in the corresponding processes of the foregoing method embodiments, and are not repeated here.
It should be understood that the systems, devices, and methods disclosed in the several embodiments provided in this application may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in another form.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
If the functions are implemented in the form of a software functional unit and sold or used as an independent product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part contributing to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention. The foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disc.

Claims (30)

1. A fault handling method for performing fault handling in a virtualized computer system, wherein the virtualized computer system comprises a virtual architecture management system, a business management system, and at least one virtual machine; the at least one virtual machine runs on at least one physical device and is used to execute a service application; the business management system is used to manage the service application; and the virtual architecture management system is used to manage the at least one virtual machine and the at least one physical device; characterized in that the fault handling method comprises:
the virtual architecture management system obtains a fault alarm message, where the fault alarm message carries identification information of a faulty device and a fault type;
the virtual architecture management system determines a first virtual machine set according to the fault alarm message, where the first virtual machine set includes at least one first virtual machine affected by the faulty device;
the virtual architecture management system sends a state alarm message to the business management system, where the state alarm message carries information about the first virtual machine set.
2. The fault handling method according to claim 1, characterized in that the fault handling method further comprises:
the virtual architecture management system determines impact information of the first virtual machine set according to the fault alarm message, where the impact information is used to indicate the type and/or level of the impact of the faulty device on the at least one first virtual machine;
the state alarm message sent by the virtual architecture management system to the business management system then also carries the impact information of the first virtual machine set.
3. The fault handling method according to claim 2, characterized in that the type of the impact of the faulty device on the at least one first virtual machine includes at least one of the following: fault, high risk, risk, low risk, or no impact.
4. The fault handling method according to any one of claims 1 to 3, characterized in that the fault handling method further comprises:
the virtual architecture management system receives a first request message sent by the business management system, where the first request message is used to indicate virtual machines to be restored, and the virtual machines to be restored are a subset of the first virtual machine set;
the virtual architecture management system preferentially recovers the virtual machines to be restored according to the first request message.
5. The fault handling method according to any one of claims 1 to 3, characterized in that the fault handling method further comprises:
if the virtual architecture management system does not receive the first request message sent by the business management system within a preset time threshold, it recovers the at least one first virtual machine according to a preset virtual machine recovery policy.
6. The fault handling method according to any one of claims 1 to 5, characterized in that the fault handling method further comprises:
the virtual architecture management system sends a state alarm clearance message to the business management system.
7. A virtual architecture management system for performing fault handling in a virtualized computer system, wherein the virtualized computer system comprises a virtual architecture management system, a business management system, and at least one virtual machine; the at least one virtual machine runs on at least one physical device and is used to execute a service application; the business management system is used to manage the service application; and the virtual architecture management system is used to manage the at least one virtual machine and the at least one physical device; characterized in that the virtual architecture management system comprises:
an acquisition module, configured to obtain a fault alarm message, where the fault alarm message carries identification information of a faulty device and a fault type;
a determining module, configured to determine a first virtual machine set according to the fault alarm message, where the first virtual machine set includes at least one first virtual machine affected by the faulty device;
a sending module, configured to send a state alarm message to the business management system, where the state alarm message carries information about the first virtual machine set.
8. The virtual architecture management system according to claim 7, characterized in that the determining module is further configured to determine impact information of the first virtual machine set according to the fault alarm message, where the impact information is used to indicate the type and/or level of the impact of the faulty device on the at least one first virtual machine;
the state alarm message sent by the sending module to the business management system then also carries the impact information of the first virtual machine set.
9. The virtual architecture management system according to claim 8, characterized in that the type of the impact of the faulty device on the at least one first virtual machine includes at least one of the following: fault, high risk, risk, low risk, or no impact.
10. The virtual architecture management system according to any one of claims 7 to 9, characterized in that the virtual architecture management system further comprises a receiving module and a recovery module;
the receiving module is configured to receive a first request message sent by the business management system, where the first request message is used to indicate virtual machines to be restored, and the virtual machines to be restored are a subset of the first virtual machine set;
the recovery module is configured to preferentially recover the virtual machines to be restored according to the first request message.
11. The virtual architecture management system according to any one of claims 7 to 9, characterized in that the recovery module is further configured to recover the at least one first virtual machine according to a preset virtual machine recovery policy when the first request message sent by the business management system is not received within a preset time threshold.
12. The virtual architecture management system according to any one of claims 7 to 11, characterized in that the sending module is further configured to send a state alarm clearance message to the business management system.
13. A fault handling method for performing fault handling in a virtualized computer system, wherein the virtualized computer system comprises a virtual architecture management system, a business management system, and at least one virtual machine; the at least one virtual machine runs on at least one physical device and is used to execute a service application; the business management system is used to manage the service application; and the virtual architecture management system is used to manage the at least one virtual machine and the at least one physical device; characterized in that the fault handling method comprises:
the business management system receives a state alarm message sent by the virtual architecture management system, where the state alarm message carries information about a first virtual machine set affected by a faulty device, and the first virtual machine set includes at least one first virtual machine;
the business management system determines, according to the state alarm message, the service application associated with the at least one first virtual machine;
the business management system performs a processing operation on the service application associated with the at least one first virtual machine.
14. The fault handling method according to claim 13, characterized in that the state alarm message also carries impact information of the first virtual machine set, where the impact information is used to indicate the type and/or level of the impact of the faulty device on the at least one first virtual machine;
the business management system performing a processing operation on the service application associated with the at least one first virtual machine then comprises:
the business management system performs the processing operation on the service application associated with the at least one first virtual machine according to the impact information of the first virtual machine set.
15. The fault handling method according to claim 14, characterized in that the type of the impact on the first virtual machine set includes at least one of the following: fault, high risk, risk, low risk, or no impact.
16. The fault handling method according to any one of claims 13 to 15, characterized in that the processing operation performed by the business management system on the service application associated with the at least one first virtual machine includes at least one of the following:
the business management system switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution; or
the business management system marks the application state information of the at least one first virtual machine as an isolated state, where the isolated state is used to indicate that the at least one first virtual machine stops executing the service application associated with the at least one first virtual machine; or
the business management system sends a first request message to the virtual architecture management system, where the first request message is used to indicate virtual machines to be restored, and the virtual machines to be restored are a subset of the first virtual machine set; or
the business management system sends the state alarm message to a control node of the service application associated with the at least one first virtual machine, so that the control node, according to the state alarm message, switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution, or marks the application state information of the at least one first virtual machine as the isolated state.
17. The fault handling method according to claim 16, characterized in that the business management system sending the first request message to the virtual architecture management system comprises:
the business management system determines the first request message according to the impact information of the first virtual machine set;
the business management system sends the first request message to the virtual architecture management system.
18. The fault handling method according to claim 17, characterized in that the business management system sending the first request message to the virtual architecture management system comprises:
the business management system sends the first request message to the virtual architecture management system according to the priority of the service application associated with the at least one first virtual machine.
19. The fault handling method according to claim 17 or 18, characterized in that the business management system sending the first request message to the virtual architecture management system comprises:
the business management system sends the first request message to the virtual architecture management system according to the deployment mode of the service application associated with the at least one first virtual machine, where the deployment mode of the service application associated with the at least one first virtual machine includes at least one of an active/standby mode, a load-sharing mode, and a single virtual machine mode.
20. The fault handling method according to any one of claims 13 to 19, characterized in that the fault handling method further comprises:
the business management system receives a state alarm clearance message sent by the virtual architecture management system;
the business management system clears the state alarm message according to the state alarm clearance message.
21. A business management system for performing fault handling in a virtualized computer system, wherein the virtualized computer system comprises a virtual architecture management system, a business management system, and at least one virtual machine; the at least one virtual machine runs on at least one physical device and is used to execute a service application; the business management system is used to manage the service application; and the virtual architecture management system is used to manage the at least one virtual machine and the at least one physical device; characterized in that the business management system comprises:
a receiving module, configured to receive a state alarm message sent by the virtual architecture management system, where the state alarm message carries information about a first virtual machine set affected by a faulty device, and the first virtual machine set includes at least one first virtual machine;
a determining module, configured to determine, according to the state alarm message, the service application associated with the at least one first virtual machine;
a processing module, configured to perform a processing operation on the service application associated with the at least one first virtual machine.
22. The business management system according to claim 21, characterized in that the state alarm message also carries impact information of the first virtual machine set, where the impact information is used to indicate the type and/or level of the impact of the faulty device on the at least one first virtual machine;
the processing module performing a processing operation on the service application associated with the at least one first virtual machine then comprises performing the processing operation on the service application associated with the at least one first virtual machine according to the impact information of the first virtual machine set.
23. The business management system according to claim 22, characterized in that the type of the impact on the first virtual machine set includes at least one of the following: fault, high risk, risk, low risk, or no impact.
24. The business management system according to any one of claims 21 to 23, characterized in that the processing operation performed by the processing module on the service application associated with the at least one first virtual machine includes at least one of the following:
switching the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution; or
marking the application state information of the at least one first virtual machine as an isolated state, where the isolated state is used to indicate that the at least one first virtual machine stops executing the service application associated with the at least one first virtual machine; or
sending a first request message to the virtual architecture management system, where the first request message is used to indicate virtual machines to be restored, and the virtual machines to be restored are a subset of the first virtual machine set; or
sending the state alarm message to a control node of the service application associated with the at least one first virtual machine, so that the control node, according to the state alarm message, switches the service application associated with the at least one first virtual machine to a virtual machine that is not affected by the faulty device for execution, or marks the application state information of the at least one first virtual machine as the isolated state.
25. The business management system according to any one of claims 21 to 24, characterized in that the determining module is further configured to determine a first request message according to the impact information of the first virtual machine set;
wherein the business management system further comprises a sending module, configured to send the first request message to the virtual architecture management system.
26. The business management system according to claim 25, characterized in that the sending module is further configured to send the first request message to the virtual architecture management system according to the priority of the service application associated with the at least one first virtual machine.
27. The business management system according to claim 25 or 26, characterized in that the sending module is further configured to send the first request message to the virtual architecture management system according to the deployment mode of the service application associated with the at least one first virtual machine, where the deployment mode of the service application associated with the at least one first virtual machine includes at least one of an active/standby mode, a load-sharing mode, and a single virtual machine mode.
28. The business management system according to any one of claims 21 to 27, characterized in that
the receiving module is further configured to receive a state alarm clearance message sent by the virtual architecture management system;
the processing module is further configured to clear the state alarm message according to the state alarm clearance message.
29. A virtual architecture management system, characterized in that the virtual architecture management system comprises a processor, a memory, a communication interface, and a bus, wherein the processor, the memory, and the communication interface communicate through the bus; the memory is used to store instructions; and when the virtual architecture management system runs, the processor executes the instructions stored in the memory so as to perform, using the hardware resources of the virtual architecture management system, the method according to any one of claims 1 to 6.
30. A business management system, characterized in that the business management system comprises a processor, a memory, a communication interface, and a bus, wherein the processor, the memory, and the communication interface communicate through the bus; the memory is used to store instructions; and when the business management system runs, the processor executes the instructions stored in the memory so as to perform, using the hardware resources of the business management system, the method according to any one of claims 13 to 20.
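The steps performed by the virtual architecture management system in claims 1, 4, and 5 can be sketched roughly as follows in Python. The sketch is illustrative only: the dictionary layout of the fault alarm message, the `placement` map from virtual machine to physical device, the function names, and the use of a queue to deliver the first request message are all assumptions. Recovering the remaining affected virtual machines after the requested subset is likewise an assumed reading of "preferentially recovers".

```python
import queue


def affected_vms(fault_alarm, placement):
    """Claim 1 sketch: VMs hosted on the faulty device named in the fault alarm.

    `placement` maps a VM id to the id of the physical device it runs on.
    """
    faulty = fault_alarm["device_id"]
    return sorted(vm for vm, host in placement.items() if host == faulty)


def recovery_plan(first_vm_set, requests, timeout_s, default_policy=sorted):
    """Claims 4 and 5 sketch: requested subset first, preset policy on timeout.

    `requests` is a queue.Queue on which the first request message (a list of
    VM ids) is expected to arrive within `timeout_s` seconds.
    """
    try:
        wanted = requests.get(timeout=timeout_s)
    except queue.Empty:
        # No first request message within the threshold: use the preset policy.
        return list(default_policy(first_vm_set))
    preferred = [vm for vm in wanted if vm in first_vm_set]
    rest = [vm for vm in first_vm_set if vm not in preferred]
    return preferred + rest


# Hypothetical placement and fault alarm message.
placement = {"vm-1": "host-a", "vm-2": "host-b", "vm-3": "host-a"}
alarm = {"device_id": "host-a", "fault_type": "power"}
first_set = affected_vms(alarm, placement)

requests = queue.Queue()
requests.put(["vm-3"])  # the business management system asks for vm-3 first
plan_with_request = recovery_plan(first_set, requests, timeout_s=0.01)
plan_on_timeout = recovery_plan(first_set, queue.Queue(), timeout_s=0.01)
print(first_set, plan_with_request, plan_on_timeout)
```

The default policy here is a plain sort by VM id; any callable that returns an ordered sequence of VM ids could be substituted for the preset virtual machine recovery policy.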
CN201610841392.3A 2016-09-22 2016-09-22 Fault processing method, virtual architecture management system and business management system Pending CN106452846A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610841392.3A CN106452846A (en) 2016-09-22 2016-09-22 Fault processing method, virtual architecture management system and business management system
PCT/CN2017/085356 WO2018054081A1 (en) 2016-09-22 2017-05-22 Fault processing method, virtual infrastructure management system and service management system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610841392.3A CN106452846A (en) 2016-09-22 2016-09-22 Fault processing method, virtual architecture management system and business management system

Publications (1)

Publication Number Publication Date
CN106452846A true CN106452846A (en) 2017-02-22

Family

ID=58166295

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610841392.3A Pending CN106452846A (en) 2016-09-22 2016-09-22 Fault processing method, virtual architecture management system and business management system

Country Status (2)

Country Link
CN (1) CN106452846A (en)
WO (1) WO2018054081A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018054081A1 (en) * 2016-09-22 2018-03-29 华为技术有限公司 Fault processing method, virtual infrastructure management system and service management system
CN108874411A (en) * 2017-05-12 2018-11-23 华为技术有限公司 A kind of method and related system of system infrastructure upgrading
CN109039740A (en) * 2018-08-01 2018-12-18 平安科技(深圳)有限公司 A kind of method and apparatus handling O&M monitoring alarm
CN110083584A (en) * 2019-05-07 2019-08-02 深信服科技股份有限公司 File reconstruction method, apparatus, equipment and computer readable storage medium
CN110120146A (en) * 2019-04-25 2019-08-13 新浪网技术(中国)有限公司 It is a kind of based on platform system in the alarm method of platform system in alarm and alarm
CN110888754A (en) * 2019-11-14 2020-03-17 北京金山云网络技术有限公司 Message obtaining method and device
CN111771190A (en) * 2018-04-12 2020-10-13 华为技术有限公司 Method and equipment for detecting state of virtual machine
CN114048004A (en) * 2021-11-22 2022-02-15 北京志凌海纳科技有限公司 High-availability batch scheduling method, device, equipment and storage medium for virtual machines
CN115858222A (en) * 2022-12-19 2023-03-28 安超云软件有限公司 Virtual machine fault processing method and system and electronic equipment
CN116401009A (en) * 2023-03-28 2023-07-07 北京益安在线科技股份有限公司 Intelligent management system based on kvm virtualization

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109600250B (en) * 2018-09-29 2023-07-18 中国平安人寿保险股份有限公司 Service system fault notification method, device, electronic device and storage medium
CN116458118A (en) * 2020-10-01 2023-07-18 上海诺基亚贝尔股份有限公司 Method, apparatus and computer program
CN112866009B (en) * 2021-01-04 2023-03-10 国网山东省电力公司青岛供电公司 Virtual network fault diagnosis method and device for comprehensive service station
CN113315653B (en) * 2021-04-30 2022-07-12 新华三大数据技术有限公司 Network equipment management method and device, network equipment and computer equipment
CN115086143A (en) * 2022-04-28 2022-09-20 阿里巴巴(中国)有限公司 Fault early warning method and device
CN116643906A (en) * 2023-06-01 2023-08-25 北京首都在线科技股份有限公司 Cloud platform fault processing method and device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102984214A (en) * 2012-11-08 2013-03-20 华为技术有限公司 Method and device which are capable of achieving telecom cloud business migration
US20130151885A1 (en) * 2010-08-18 2013-06-13 Fujitsu Limited Computer management apparatus, computer management system and computer system
US20150378831A1 (en) * 2014-06-28 2015-12-31 Vmware, Inc. Using a recovery snapshot during live migration

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10037237B2 (en) * 2013-03-28 2018-07-31 Telefonaktiebolaget L M Ericsson (Publ) Method and arrangement for fault management in infrastructure as a service clouds
CN103559124B (en) * 2013-10-24 2017-04-12 华为技术有限公司 Fast fault detection method and device
US9727439B2 (en) * 2014-05-28 2017-08-08 Vmware, Inc. Tracking application deployment errors via cloud logs
CN106452846A (en) * 2016-09-22 2017-02-22 华为技术有限公司 Fault processing method, virtual architecture management system and business management system

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018054081A1 (en) * 2016-09-22 2018-03-29 华为技术有限公司 Fault processing method, virtual infrastructure management system and service management system
CN108874411A (en) * 2017-05-12 2018-11-23 华为技术有限公司 Method and related system for system infrastructure upgrade
US11777830B2 (en) 2018-04-12 2023-10-03 Huawei Cloud Computing Technologies Co., Ltd. Virtual machine status probe method, and device
CN111771190B (en) * 2018-04-12 2024-04-09 华为云计算技术有限公司 Method and equipment for detecting state of virtual machine
CN111771190A (en) * 2018-04-12 2020-10-13 华为技术有限公司 Method and equipment for detecting state of virtual machine
CN109039740A (en) * 2018-08-01 2018-12-18 平安科技(深圳)有限公司 Method and apparatus for handling O&M monitoring alarms
CN110120146A (en) * 2019-04-25 2019-08-13 新浪网技术(中国)有限公司 Alarm method based on an alarm middle-platform system, and alarm middle-platform system
CN110083584A (en) * 2019-05-07 2019-08-02 深信服科技股份有限公司 File reconstruction method, apparatus, equipment and computer readable storage medium
CN110888754A (en) * 2019-11-14 2020-03-17 北京金山云网络技术有限公司 Message obtaining method and device
CN114048004A (en) * 2021-11-22 2022-02-15 北京志凌海纳科技有限公司 High-availability batch scheduling method, device, equipment and storage medium for virtual machines
CN115858222A (en) * 2022-12-19 2023-03-28 安超云软件有限公司 Virtual machine fault processing method and system and electronic equipment
CN115858222B (en) * 2022-12-19 2024-01-02 安超云软件有限公司 Virtual machine fault processing method, system and electronic equipment
CN116401009A (en) * 2023-03-28 2023-07-07 北京益安在线科技股份有限公司 Intelligent management system based on KVM virtualization

Also Published As

Publication number Publication date
WO2018054081A1 (en) 2018-03-29

Similar Documents

Publication Publication Date Title
CN106452846A (en) Fault processing method, virtual architecture management system and business management system
US10644952B2 (en) VNF failover method and apparatus
WO2017161984A1 (en) Method, device and system for deploying data clusters, and computer storage medium
CN105095001B (en) Virtual machine abnormal restoring method under distributed environment
US10541862B2 (en) VNF processing policy determining method, apparatus, and system
CN106462450A (en) Notification about virtual machine live migration to VNF manager
CN110661641B (en) Virtual network function VNF deployment method and device
CN110134518A (en) Method and system for improving multi-node application high availability in big data clusters
CN109960634A (en) Application program monitoring method, apparatus and system
CN107659948B (en) Method and device for controlling access of AP (access point)
CN106790436A (en) Traffic system monitoring method and control center cloud server based on cloud architecture
CN105119736A (en) Data check method and device in network function virtualization architecture
CN112948063A (en) Cloud platform creation method and device, cloud platform and cloud platform implementation system
CN109309617A (en) Disaster tolerance switching method, relevant device and computer storage medium
CN109343944A (en) Data processing method, device, terminal and storage medium for eSIM card
CN116095145B (en) Data control method and system of VPC cluster
CN112804202A (en) Multi-internetwork data security interaction method and device, server and storage medium
CN114124803B (en) Device management method and device, electronic device and storage medium
CN105187482B (en) Method and message server for implementing PaaS platform fault self-recovery
CN110795202B (en) Resource allocation method and device of virtualized cluster resource management system
CN104657240B (en) Failure control method and device for multi-kernel operating systems
CN116126457A (en) Container migration method and server cluster
CN108737144B (en) Method and device for resource management
CN115421891A (en) Task distribution method, device, equipment and medium
CN112787840B (en) Method, device and system for realizing service function deployment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170222