CN117194104A

CN117194104A - Disaster recovery switching processing method and device applied to multiple systems

Info

Publication number: CN117194104A
Application number: CN202311134972.5A
Authority: CN
Inventors: 许天一; 吴泽君; 王文勃; 张展程; 孟朝雄
Original assignee: CCB Finetech Co Ltd
Current assignee: CCB Finetech Co Ltd
Priority date: 2023-09-04
Filing date: 2023-09-04
Publication date: 2023-12-08

Abstract

The invention discloses a disaster recovery switching processing method and device applied to multiple systems, and relates to the technical field of automatic programming. One embodiment of the method comprises the following steps: responding to disaster recovery switching operation of a plurality of systems, triggering a scheme information configuration interface to be displayed, and receiving scheme information configured for each system to obtain a change scheme of each system; carrying out instantiation processing on each execution task in each modification scheme to obtain an instance task so as to generate an instance task set; and carrying out aggregation processing on the instance tasks of the same task type in all the instance task sets, integrating to obtain an aggregation instance task to be transmitted into an instance task pool based on parameters in the aggregated instance task, calling a task scheduling engine to scan so as to carry out automatic processing on the aggregation instance task, obtaining a processing result, and taking the processing result as an execution result of each aggregated instance task. The embodiment gathers the task scheduling of the same type so as to reduce the task scheduling times and improve the execution efficiency.

Description

Disaster recovery switching processing method and device applied to multiple systems

Technical Field

The invention relates to the technical field of automatic programming, in particular to a disaster recovery switching processing method and device applied to multiple systems.

Background

Currently, with the development of large-scale IT business, especially in the IT business of the financial industry, extremely high business stability is required. According to the related policies issued by the regulatory authorities, nationwide large banks should in principle adopt both co-city and off-site disaster backup and recovery policies. Therefore, in disaster recovery in large financial industry, a large number of operation and maintenance operations are extremely complicated and cumbersome.

In the process of implementing the present invention, the inventor finds that at least the following problems exist in the prior art: the large-scale financial multi-IT system has more switching tasks of the same type during switching and changing, and the firewall is changed and operated by thousands of firewalls. The existing automatic operation and maintenance tool only performs automatic issuing of operation and maintenance operation, executes regularly and feeds back whether script execution is successful or not, can not automatically collect tasks of the same type, causes extremely large task scheduling amount, has low execution efficiency and easily causes task queue blocking.

Disclosure of Invention

In view of the above, the embodiments of the present invention provide a disaster recovery switching processing method and apparatus applied to multiple systems, which at least can solve the problems of low switching exercise efficiency and even failure caused by high-concurrency task scheduling blocking, which can be solved by replacing manual execution for disaster recovery operation scheduling of a financial IT system with a complex large scene.

In order to achieve the above object, according to an aspect of the embodiments of the present invention, there is provided a disaster recovery switching processing method applied to a multi-system, including:

responding to disaster recovery switching operation of a plurality of systems, triggering a scheme information configuration interface to be displayed, and receiving scheme information configured for each system to obtain a change scheme of each system;

carrying out instantiation processing on each execution task in each change scheme to obtain an instance task corresponding to each execution task so as to generate an instance task set corresponding to each change scheme;

analyzing the task type of each execution task, carrying out aggregation processing on the instance tasks with the same task type in all instance task sets, and integrating to obtain an aggregation instance task based on parameters in the aggregated instance task;

and transferring the converged instance task into an instance task pool, calling a task scheduling engine to scan the instance task pool so as to automatically process the converged instance task, obtaining a processing result, and taking the processing result as an execution result of each converged instance task.

Optionally, the receiving the scheme information configured for each system, to obtain a modification scheme of each system includes:

For each system, receiving input scheme basic information;

responding to a system identifier input in an interface, displaying a switching template set, receiving a selection operation of one target switching template, and determining a flow set corresponding to the target switching template;

displaying node sets under each flow, inquiring the node sets successfully drilled last time in each system from a node library, receiving selection operation on one or more nodes, and inputting information of the selected nodes into corresponding node information in an interface;

displaying a step set under each node, and receiving confirmation or configuration operation of execution parameters of each step; displaying an execution task set in each step, and receiving a confirmation operation of an execution parameter of each execution task;

and generating a change scheme corresponding to each system based on the scheme basic information, the target switching template, the flow set, the node information, the step information and the execution task parameters.

Optionally, after the obtaining the modification scheme of each system, the method further includes:

and displaying the flow of each change scheme, and receiving configuration operation of the dependency relationship among the flows of different change schemes.

and carrying out auditing and checking treatment on each change scheme, and submitting each change scheme to a manual auditing link for manual rechecking in response to the passing of the checking result.

Optionally, the automatically processing the converged instance task to obtain a processing result includes:

and acquiring a convergence instance task to be processed, and transmitting the convergence instance task to an automation platform for executing processing to obtain a processing result.

Optionally, after the processing result is obtained, the method further includes:

acquiring an execution state, and determining whether the converged instance task is successfully issued and executed according to the execution state;

responding to failure of issuing execution, and automatically triggering the issuing execution operation again for the converged instance task;

and in response to failure of issuing execution states in a preset time period, determining that the issuing execution of the converged instance task is abnormal, and triggering a manual skipping mechanism for the converged instance task.

Optionally, the method further comprises:

under the condition that the execution states of all steps under the node are judged to be successful, storing the node and corresponding information into a node library; and

And calling the execution progress detail log to acquire and display the current execution progress of each converged instance task.

To achieve the above object, according to another aspect of the embodiments of the present invention, there is provided a disaster recovery switching processing device applied to a multi-system, including:

the configuration module is used for responding to disaster recovery switching operation of a plurality of systems, triggering and displaying a scheme information configuration interface, receiving scheme information configured for each system and obtaining a change scheme of each system;

the instance module is used for carrying out instantiation processing on each execution task in each change scheme to obtain instance tasks corresponding to each execution task so as to generate an instance task set corresponding to each change scheme;

the aggregation module is used for analyzing the task type of each execution task, carrying out aggregation processing on the instance tasks with the same task type in all instance task sets, and integrating to obtain an aggregation instance task based on parameters in the aggregated instance task;

the scheduling module is used for transmitting the converged instance task into an instance task pool, calling a task scheduling engine to scan the instance task pool so as to automatically process the converged instance task, obtaining a processing result, and taking the processing result as an execution result of each converged instance task.

Optionally, the configuration module is configured to:

for each system, receiving input scheme basic information;

Optionally, the configuration module is further configured to:

Optionally, the device further comprises a verification module, configured to:

Optionally, the scheduling module is configured to:

Optionally, the apparatus further includes a timeout mechanism module configured to:

Optionally, the apparatus further includes:

In order to achieve the above object, according to still another aspect of the embodiments of the present invention, there is provided a disaster recovery switching processing electronic device applied to a multi-system.

The electronic equipment of the embodiment of the invention comprises: one or more processors; and the storage device is used for storing one or more programs, and when the one or more programs are executed by the one or more processors, the one or more processors realize any disaster recovery switching processing method applied to the multiple systems.

To achieve the above object, according to still another aspect of the embodiments of the present invention, there is provided a computer readable medium having stored thereon a computer program, which when executed by a processor, implements any of the disaster recovery handover processing methods described above applied to multiple systems.

To achieve the above object, according to still another aspect of an embodiment of the present invention, there is provided a computer program product. The computer program product of the embodiment of the invention comprises a computer program, and the program is executed by a processor to realize the disaster recovery switching processing method applied to the multiple systems.

According to the solution provided by the present invention, one embodiment of the above invention has the following advantages or beneficial effects: based on the existing automatic technical design, in the configuration of a multi-system switching scheme of a complex scene, parameters of a plurality of tasks of the same type are highly converged and issued, so that task scheduling frequency is reduced, the execution efficiency of the scheme is improved, and the problem of execution blocking caused by excessive existing schemes is solved.

Further effects of the above-described non-conventional alternatives are described below in connection with the embodiments.

Drawings

The drawings are included to provide a better understanding of the invention and are not to be construed as unduly limiting the invention. Wherein:

FIG. 1 is a schematic flow chart of a disaster recovery switching processing method applied to multiple systems according to an embodiment of the present invention;

FIG. 2 is a flow chart of an alternative disaster recovery handover processing method applied to multiple systems according to an embodiment of the present invention;

FIG. 3 is a flow chart of another alternative disaster recovery handoff processing method applied to multiple systems according to an embodiment of the present invention;

FIG. 4 is a flow chart of yet another alternative disaster recovery handoff processing method applied to multiple systems according to an embodiment of the present invention;

FIG. 5 is a schematic diagram of a disaster recovery switching method applied to multiple systems according to an embodiment of the present invention;

FIG. 6 is a schematic diagram of main modules of a disaster recovery switching device for multiple systems according to an embodiment of the present invention;

FIG. 7 is an exemplary system architecture diagram in which embodiments of the present invention may be applied;

fig. 8 is a schematic diagram of a computer system suitable for use in implementing an embodiment of the present invention.

Detailed Description

Exemplary embodiments of the present invention will now be described with reference to the accompanying drawings, in which various details of the embodiments of the present invention are included to facilitate understanding, and are to be considered merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

It is noted that embodiments of the invention and features of the embodiments may be combined with each other without conflict. In the technical scheme of the invention, the related aspects of acquisition, analysis, use, transmission, storage and the like of the personal information of the user accord with the regulations of related laws and regulations, are used for legal and reasonable purposes, are not shared, leaked or sold outside the legal use aspects and the like, and are subjected to supervision and management of a supervision department. Necessary measures should be taken for the personal information of the user to prevent illegal access to such personal information data, ensure that personnel having access to the personal information data comply with the regulations of the relevant laws and regulations, and ensure the personal information of the user.

Once these user personal information data are no longer needed, the risk should be minimized by limiting or even prohibiting the data collection and/or deletion. User privacy is protected, when applicable, by de-identifying the data, including in some related applications, such as by removing a particular identifier (e.g., date of birth, etc.), controlling the amount or specificity of stored data (e.g., collecting location data at a city level rather than at a specific address level), controlling how the data is stored, and/or other methods.

Banking financial institutions in the existing supervision requirement data set should plan synchronously, build synchronously, run information systems synchronously, and disaster recovery systems synchronously. For example, the expansion of business fields in banks can change more and more system operation and maintenance work from manual operation to automatic operation and maintenance operation due to the complexity of application and system components and the development of IT automatic operation and maintenance tools.

The automation tool is a set of scripts which help enterprises to program, combine and write operation constraints on the scripts in a mode of programming operation and issue and execute a set of programmed scripts to the target machine. The script writing, the script operation and the script arrangement are split, and the input and output parameters can be transmitted between the script operation and the script operation. The automation tool improves the reusability of operation, improves the configuration speed of the flow and the normalization of the nodes, and decouples script writing and flow configuration, so that the whole automation arrangement is easier to be created and executed.

The automatic execution improves the operation and maintenance efficiency compared with manual execution, but only improves the execution efficiency of a single task, and can not improve the execution quality and the stable and efficient execution of the whole operation and maintenance scene, the problem is particularly prominent in banking industry, a large-scale bank core system is basically in more than 60 sets, the core system is required to complete the switching change of the whole service within 15 minutes according to the disaster-backup switching requirement of the supervision E scene, and each set of system switching is complicated in overall switching scheme and is inevitably caused by the fact that the production environment parameters are complex, the variety is numerous, the switching operation is different, and the like.

In addition, when the automatic issuing is performed, each switching step involves thousands of task scheduling issues, so that the switching efficiency is greatly reduced, and when the automatic switching change is performed, even if the automatic switching change is performed for a plurality of times, the problems of configuration execution authority, environment parameters, database version parameters, plug-in version parameters and the like are unavoidable, so that the switching operation of the production environment is performed in a failed mode, and the specified switching requirement is difficult to reach. Particularly, when some special change operations are performed, such as disaster recovery switching of a firewall, calling firewall equipment once for each network table, tens of systems, tens of thousands of network changes, low execution efficiency, or failure of executing a certain task can cause system switching failure, even affect the overall upstream and downstream systems, and cause a large-scale service stagnation effect. Thus, in disaster recovery in the large financial industry, a large number of operations and maintenance are extremely tedious and cumbersome.

Referring to fig. 1, a main flowchart of a disaster recovery switching processing method applied to a multi-system according to an embodiment of the present invention is shown, including the following steps:

s101: responding to disaster recovery switching operation of a plurality of systems, triggering a scheme information configuration interface to be displayed, and receiving scheme information configured for each system to obtain a change scheme of each system;

s102: carrying out instantiation processing on each execution task in each change scheme to obtain an instance task corresponding to each execution task so as to generate an instance task set corresponding to each change scheme;

s103: analyzing the task type of each execution task, carrying out aggregation processing on the instance tasks with the same task type in all instance task sets, and integrating to obtain an aggregation instance task based on parameters in the aggregated instance task;

s104: and transferring the converged instance task into an instance task pool, calling a task scheduling engine to scan the instance task pool so as to automatically process the converged instance task, obtaining a processing result, and taking the processing result as an execution result of each converged instance task.

In the above embodiment, for step S101, the multisystem handover: the disaster recovery switching of the core service system in the large IT system comprises application start-stop, database switching, redis, MQ middleware switching and network equipment switching waiting, the service scene is complex, the related task amount is large, the time consumption of the general switching is long, and the RTO (Recovery Time Objective, recovery time target) efficiency is affected. The multi-system switching in the scheme, in particular to a multi-system switching scene in daily disaster recovery switching exercise in the financial industry.

The user firstly selects a plurality of systems as systems for switching the disaster to be treated, such as a database, a machine room and a specific application, wherein the number of the systems can be a plurality of systems, and the systems can be a drilling scene or a real scene. The scheme information configuration interface is preset to receive scheme information configured by a user for each system and obtain a change scheme of each system. The scheme information here includes scheme basic information, flow information, node information, step information, and task information.

For example, a first page is set, and the first page is used for configuring scheme basic information, wherein the scheme basic information includes scheme classification, scheme switching object, service classification, physical subsystem, scheme name, whether an unscheduled scheme is an unscheduled scheme, a scheme name and scheme description information, and after information input is completed, clicking "next" to enter a configuration association flow and execution parameter interface. In the interface, a flow configuration interface can be clicked to enter, such as configuration flow use, selection of a physical subsystem, name prefix encoding, flow custom name, sequencing, selection of a flow template, and selection of a flow node. If the user selects the process template, a process template set is displayed, the user can select one of the process templates, corresponding process nodes corresponding to the process template selected by the user are displayed in the process nodes, and then the user can select one or more process nodes from the process nodes. For each node, the following steps also need to be configured, specifically, the configuration step execution parameters.

The steps are specific tasks, and the tasks refer to steps in an operation of an operation and maintenance business in the scheme, and generally correspond to one or more execution scripts, such as stopping an application service on a server or a plurality of servers. The operation and maintenance refers to the maintenance of network software and hardware which is established by a large organization, essentially the operation and maintenance of each stage of the life cycle of a network, a server and a service, and achieves acceptable states in terms of cost, stability and efficiency, wherein the traditional operation and maintenance refers to the operation and maintenance of information technology.

For step S102, the configuration operations of the flow, node, step, task, and the like are completed one by one through the above steps, so as to determine the execution range and the execution condition record of the modification scheme of each system. The tasks are specific implementation objects in the schemes, so that each execution task in each modification scheme needs to be subjected to instantiation processing to obtain an instance task, for example, an instance task 1-an instance task 2-an instance task 3, so as to generate an instance task set corresponding to each modification scheme.

In the instantiation, when the switching scheme flow is actually needed to be executed, the execution information is needed to be supplemented, wherein the execution information comprises scheme purpose and scheme action information, and the purpose information can be used for executing the specific instance of the generation and the assignment of the flow model scheme which is configured in advance. Each instance is independently executed, can be executed only once, and the execution of the instance does not affect the original scheme. Here instantiation is similar to the packing process, packing does not affect the original task, e.g., task 1 is instantiated to generate instance task 1, and even if the parameters of subsequent task 1 change, the instance task 1 does not change.

For step S103, for the modification of the multiple systems, the above steps result in an instance task set of the multiple systems. The task type of each executing task is analyzed, including but not limited to firewall handoff, DNS handoff, service registration, etc. And carrying out aggregation processing on the tasks of the same type in the example task sets of the systems, and automatically extracting the parameters of the same type so as to integrate the original example tasks needing to be executed for many times into one aggregated example task.

In the switching verification execution of the P4 service registration task in the switching scheme, a large number of service registrations are needed for multi-system switching, and the path, the component number, the effective center, the protocol type and the security node number of the java cacheupdate tools folder related to the task can be scanned, extracted and aggregated in advance.

For step S104, the method only integrates the high aggregation of the same type of repetitive tasks, reduces task issuing actions, including daily switching inspection, configuration change and switching execution, and complex whole change flow, and changes multiple calls into one-time automation operation, thereby reducing or even eliminating delay in operation and maintenance, improving switching execution scheduling efficiency, and realizing zero-delay IT operation and maintenance.

The integrated aggregate instance task is written into an instance task pool. The task scheduling engine is a central module of the product, and utilizes the task scheduling engine to acquire the to-be-processed converged instance task from the instance task pool in near real time, and issues the converged instance task to the automation platform for execution, so that an execution result is obtained. And synchronizing the execution result, and taking the execution result as the execution result of each converged instance task. For example, the task a of the system a and the task C of the system C are the same in type, and if the result after the aggregation processing is x, then x is taken as the execution result of the task a and as the execution result of the task C.

The method provided by the embodiment can greatly reduce the task scheduling times and improve the execution efficiency by highly converging the task scheduling of the same type, thereby integrally improving the success rate of the automatic operation of the change scheme, ensuring and improving the service reliability.

Referring to fig. 2, a flowchart of an optional disaster recovery switching processing method applied to multiple systems according to an embodiment of the present invention is shown, including the following steps:

s201: for each system, receiving input scheme basic information;

s202: responding to a system identifier input in an interface, displaying a switching template set, receiving a selection operation of one target switching template, and determining a flow set corresponding to the target switching template;

S203: displaying node sets under each flow, inquiring the node sets successfully drilled last time in each system from a node library, receiving selection operation on one or more nodes, and inputting information of the selected nodes into corresponding node information in an interface;

s204: displaying a step set under each node, and receiving confirmation or configuration operation of execution parameters of each step; displaying an execution task set in each step, and receiving a confirmation operation of an execution parameter of each execution task;

s205: and generating a change scheme corresponding to each system based on the scheme basic information, the target switching template, the flow set, the node information, the step information and the execution task parameters.

In the above embodiment, for steps S201 to S203, the existing automatic switching requires that the service personnel repeatedly check the switching scheme before each exercise, and confirm whether the user, the IP, the path, the version parameters and the like are properly configured in the scheme, otherwise, the switching exercise fails, and this process takes too much time, meanwhile, in the switching exercise of the large IT system, the same type of nodes and parameters are, each operation needs to perform one interface call and task circulation, which results in resource waste, resulting in redundancy and complexity of the configuration work of the existing system scheme and low execution efficiency. The traditional automatic operation tool can not switch and change for the copied system, and flexible and configurable arrangement capacity is provided; wherein, the arrangement refers to a series of operation flows predefined according to certain scenes, and can be set to be manually or automatically executed at fixed time.

In order to solve the problems, the scheme provides a reusable idea, links, steps and programs in the process of actual system business can be reused, and the next business process can be directly referenced without configuration and verification again.

For each system, the scheme receives input scheme basic information, such as scheme classification, scheme switching object, service classification, physical subsystem, scheme name, whether the scheme is an unscheduled scheme, scheme name and scheme description information, wherein the scheme basic information does not comprise scheme type, and after the input information is finished, clicking the next step. Because of the large number of systems, a user is required to input system identifiers in the interface to define schemes one by one. The number of switching templates is large, and a user needs to select one template from the templates, and the switching templates are B on the assumption that the system identifier is A. The switching template is preset, so that a flow set is preconfigured in the switching template, and a user only needs to use the switching template.

For each flow in the flow set, a set of nodes under the flow is displayed, and the information of the nodes needs to be configured one by one. During the creation of the schema, the background may automatically detect whether there are referenceable nodes. Because the exercise template is standardized, each system has a fixed exercise flow, and after the exercise is successful for each node of each system, the background will record the ID of the successful node in the exercise. The next time the system related exercise is performed, the nodes that the system has successfully performed can be recommended when the scheme is programmed.

Specifically, according to the system identification, the identification of the node which is successfully drilled by the system last time is queried from the node library, the nodes are displayed, the user can select all the nodes or only select part of the nodes, and the information of the nodes is input into the nodes in the switching template B flow. For example, the node of the system a, which was successful in drilling before, is XYZ, and a certain flow in the switching template B selected at this time includes two XY nodes, and then the information of the XY node of the successful drilling is input into the XY node of the flow.

Through the steps, the scheme continues multiplexing the switching nodes which are finished with the exercise, and the configuration error problems of authority, parameters, file paths and the like in the complex scheme are avoided. If the user selects one of the nodes, the nodes can be directly multiplexed to omit the configuration step and directly generated. After the system is exercised this time, the node which is successfully exercised is also used as a unique identifier and stored in a node library, so that the subsequent direct reference in the multi-system disaster recovery switching scheme is facilitated, and manual configuration is not needed.

The steps S204 to S205 further include configuration of the steps below the node after the above node configuration is completed. The node is a basic logic unit in disaster recovery switching, the specific steps are performed under the node, and one step is the minimum unit of specific operation. The steps are configured, and the configuration mainly comprises the configuration of parameters such as an execution path, an execution user and the like. Here, the confirmation operation of the execution parameters may be performed, and if the user clicks confirmation, no change is required, but if the user clicks configuration, a change is required. The steps are specific execution tasks, and the execution tasks also need to comprise execution parameters, wherein the execution parameters are only confirmation operations, and the user does not have modification rights.

In the prior art, the change of the focus automation is executed, or the change operation is pre-executed in advance in a simulation environment, so that on one hand, the method only improves the execution efficiency, only pays attention to whether the change operation is executed smoothly, or improves the simulation authenticity, and therefore, verification links such as environmental parameters and the like are lost, and the success rate of the change still cannot be completely ensured.

In order to solve the problems, after the configuration operation of the execution parameters of the nodes under the process and all steps under the nodes is completed, the whole scheme is established to obtain a change scheme, the background automatically carries out basic audit and verification on the change scheme, for example, the background audit and verification is based on basic rules to verify whether special characters, user names and paths accord with standard specifications, and the relatively complex background can carry out regular verification. After the verification is correct, the verification can be submitted to relevant auditors for manual review. The pedestrian rechecking is more based on the requirement of one flow compliance and the bank compliance requirement, and rechecking information comprises execution users, paths, script names, execution IP (Internet Protocol ), dependence among tasks and the like.

It should be noted that, a certain dependency relationship may be provided between the systems, for example, after a certain flow in the system Q is executed, a result needs to be sent to a certain flow in the system W for execution, and then a certain flow execution result of the system W needs to be sent to a certain flow in the system E for execution. Thus, after the generation of the alternations for each system, the flow between these alternations can be configured, similar to the scribing, to establish a dependency relationship, as shown in FIG. 3.

According to the method provided by the embodiment, the node reference function is added, redundant operation of repeated configuration of a scheme is avoided by continuously multiplexing the node information which is finished and successfully processed, the problem of configuration errors of authority, parameters, file paths and the like in a complex scheme is solved, the scheme information configuration speed is improved, the condition that operation and maintenance operation of disaster switching fails is reduced to the minimum, and the success rate of the operation and maintenance operation is greatly improved.

Referring to fig. 4, a flowchart of another alternative disaster recovery switching processing method applied to multiple systems according to an embodiment of the present invention is shown, including the following steps:

s401: acquiring an execution state, and determining whether the converged instance task is successfully issued and executed according to the execution state;

s402: responding to failure of issuing execution, and automatically triggering the issuing execution operation again for the converged instance task;

s403: in response to failure of issuing execution states in a preset time period, determining issuing execution abnormality of the converged instance task to trigger a manual skipping mechanism of the converged instance task;

s404: under the condition that the execution states of all steps under the node are judged to be successful, storing the node and corresponding information into a node library;

S405: and calling the execution progress detail log to acquire and display the current execution progress of each converged instance task.

In the above embodiment, for steps S401 to S404, the automation platform processes the aggregate instance task, and includes an execution state in addition to the processing result, where the execution state may be success or failure.

Considering that the task issuing execution has various accidents, the method can determine whether the task of the current aggregation instance is successfully issued and executed according to the execution state. If the failure occurs, the operation can be automatically performed again. In addition, the scheme is also provided with a task verification timeout mechanism to configure execution timeliness of the task. The method can automatically execute again after the first execution failure, and the abnormal execution can be judged after a certain time is exceeded, so that the abnormal task can be manually skipped, and the influence on the continuous execution of the subsequent task after the failure of the execution of the preceding task is avoided.

The task that is successfully executed can be recorded, if all tasks of all steps under a certain node are successfully executed, the node is recorded to a node library for reuse in the configuration of other schemes in the later period. Specifically, the execution state of the task may be recorded first, and then the execution state of the step may be determined according to the execution state of each task, so as to determine the execution state of the node.

For step S405, for the case that the modification scheme includes multiple execution tasks, the execution progress detail log is called, so that failure information can be checked through verification report, and it is ensured that the execution problem can be rapidly located through log analysis. In the green light verification of the P4 service registration task, it is necessary to implement debugging and maintenance on the device and perform switching execution again by using an application administrator in advance to implement error of the related javacacheupdate tools file path of the task, or error configuration of the device security code, or abnormal service process after verification.

According to the method provided by the embodiment, verification timeout, exception handling and execution progress detail display mechanisms are set so as to fully process task scheduling and ensure stable switching of complex disaster recovery switching scenes.

Referring to fig. 5, a schematic diagram of a disaster recovery switching processing method framework applied to a multi-system is shown, and the arrangement includes task management, task scheduling and example task pools according to an embodiment of the present invention:

1. the Web interface is a scheme information configuration interface, and a user can configure scheme basic information, select a switching template, select node information for multiplexing, configure step execution parameters and confirm task execution parameters.

2. And auditing and checking treatment is required to be carried out on the change scheme, so that the efficiency and reliability of switching change are improved. The execution tasks in the modification of the system Q, W, E are instantiated, and before that, the dependency relationship between the flows of the three systems can be configured. Subsequently, the tasks are converged according to the type, so that the tasks which need to be issued and executed for a plurality of times are originally set as one time.

3. Task scheduling, which is used for reading the task and issuing and executing, and the task is automatically executed and can be repeatedly executed for use; and writing the results into an instance task pool, which may be a MySQL database.

From a development point of view, there are mainly the following works to ensure the integrity of the implementation:

1. the architecture logic relationship of the complex IT system needs to be deeply understood, and the switching operation is split to form a universal switching task.

2. And the automatic operation and maintenance switching script with universality is required to be independently designed and written, an automatic quoting function component is independently designed and developed, and unified switching task standard parameters are extracted to realize task quoting of the multi-system switching scene.

3. The background analysis judging and converging module needs to be developed, the protection tools are extracted for the standardized switching operation of the same type, the protection tools are converged into a large task, and the calling times of switching change are simplified.

4. The whole switching system can generate a verification report for the verification result, the execution process of the switching operation task can be checked in real time, and the functions of exception skip, exception handling, re-execution, result inquiry and the like are provided.

5. And controlling whether the switching change operation task can continue to be referenced or not by verifying whether the result passes or not.

6. The operation and maintenance operation scheduling based on the automatic operation and maintenance tool is based on a process engine technology, and the issuing execution of scheduling is based on the deployment execution of the automatic operation and maintenance tool and agents. I.e., flow engine technology and automated operation and maintenance, are general basic technologies and are not described in detail herein.

The embodiment of the scheme provides a method for changing disaster recovery of a converged and highly multiplexed high-complexity multi-IT system, such as disaster recovery of a large commercial bank core system, and node information, such as operation configuration, parameters, paths and the like, which are successfully exercised before multiplexing, solves the problem that the current configuration is easy to be in error, so as to rapidly realize the configuration purpose of a multi-system scheme, and performs audit verification after the configuration is completed, thereby ensuring the reliability of scheme configuration, improving the execution success rate of the scheme, providing the arrangement function of dependency relations among the scheme flows of the multi-system change, and increasing the flexible configuration arrangement function. The scheme also extracts, aggregates and issues parameters of a plurality of instance tasks with the same type, reduces task scheduling frequency, solves the problem of current execution blocking, and further improves scheme execution efficiency and reliability.

Referring to fig. 6, a schematic diagram of main modules of a disaster recovery switching processing device 600 applied to a multi-system according to an embodiment of the present invention is shown, including:

the configuration module 601 is configured to trigger a display scheme information configuration interface in response to disaster recovery switching operations on a plurality of systems, and receive scheme information configured for each system to obtain a change scheme of each system;

an instance module 602, configured to perform an instantiation process on each execution task in each modification scheme, to obtain an instance task corresponding to each execution task, so as to generate an instance task set corresponding to each modification scheme;

the aggregation module 603 is configured to analyze a task type of each execution task, perform aggregation processing on instance tasks with the same task type in all instance task sets, and integrate to obtain an aggregated instance task based on parameters in the aggregated instance task;

the scheduling module 604 is configured to transfer the converged instance task into an instance task pool, invoke a task scheduling engine to scan the instance task pool, perform automatic processing on the converged instance task, obtain a processing result, and use the processing result as an execution result of each converged instance task.

In the embodiment of the present invention, the configuration module 601 is configured to:

for each system, receiving input scheme basic information;

In the embodiment of the present invention, the configuration module 601 is further configured to:

The implementation device of the invention further comprises a verification module for:

In the embodiment of the present invention, the scheduling module 604 is configured to:

The implementation device of the invention also comprises a timeout mechanism module for:

The implementation device of the invention further comprises:

In addition, the implementation of the apparatus in the embodiments of the present invention has been described in detail in the above method, so that the description is not repeated here.

Fig. 7 shows an exemplary system architecture 700, including terminal devices 701, 702, 703, a network 704, and a server 705 (by way of example only), to which embodiments of the invention may be applied.

The terminal devices 701, 702, 703 may be various electronic devices having a display screen and supporting web browsing, are installed with various communication client applications, and a user may interact with the server 705 through the network 704 using the terminal devices 701, 702, 703 to receive or transmit messages, etc.

The network 704 is the medium used to provide communication links between the terminal devices 701, 702, 703 and the server 705. The network 704 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.

The server 705 may be a server providing various services, and it should be noted that the method provided by the embodiment of the present invention is generally performed by the server 705, and accordingly, the apparatus is generally disposed in the server 705.

It should be understood that the number of terminal devices, networks and servers in fig. 7 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.

Referring now to FIG. 8, there is illustrated a schematic diagram of a computer system 800 suitable for use in implementing an embodiment of the present invention. The terminal device shown in fig. 8 is only an example, and should not impose any limitation on the functions and the scope of use of the embodiment of the present invention.

As shown in fig. 8, the computer system 800 includes a Central Processing Unit (CPU) 801 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 802 or a program loaded from a storage section 808 into a Random Access Memory (RAM) 803. In the RAM 803, various programs and data required for the operation of the system 800 are also stored. The CPU 801, ROM 802, and RAM 803 are connected to each other by a bus 804. An input/output (I/O) interface 805 is also connected to the bus 804.

The following components are connected to the I/O interface 805: an input portion 806 including a keyboard, mouse, etc.; an output portion 807 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and a speaker; a storage section 808 including a hard disk or the like; and a communication section 809 including a network interface card such as a LAN card, a modem, or the like. The communication section 809 performs communication processing via a network such as the internet. The drive 810 is also connected to the I/O interface 805 as needed. A removable medium 811 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 810 as needed so that a computer program read out therefrom is mounted into the storage section 808 as needed.

In particular, according to embodiments of the present disclosure, the processes described above with reference to flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method shown in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network via the communication section 809, and/or installed from the removable media 811. The above-described functions defined in the system of the present invention are performed when the computer program is executed by a Central Processing Unit (CPU) 801.

The computer readable medium shown in the present invention may be a computer readable signal medium or a computer readable storage medium, or any combination of the two. The computer readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples of the computer-readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present invention, however, the computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, with the computer-readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.

The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.

The modules involved in the embodiments of the present invention may be implemented in software or in hardware. The described modules may also be provided in a processor, for example, as: a processor comprises a receiving module, a determining module and an acquiring module. The names of these modules do not in some way constitute a limitation of the module itself, for example, the acquisition module may also be described as "".

As another aspect, the present invention also provides a computer-readable medium that may be contained in the apparatus described in the above embodiments; or may be present alone without being fitted into the device. The computer readable medium carries one or more programs which, when executed by one of the devices, cause the device to perform any of the disaster recovery device handoff processing methods described above for multiple systems.

The computer program product of the invention comprises a computer program which, when being executed by a processor, realizes the disaster recovery switching processing method applied to multiple systems in the embodiment of the invention.

The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives can occur depending upon design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.

Claims

1. The disaster recovery switching processing method applied to the multiple systems is characterized by comprising the following steps:

2. The method of claim 1, wherein the receiving the scheme information configured for each system, resulting in a change scheme for each system, comprises:

for each system, receiving input scheme basic information;

3. The method according to claim 1 or 2, wherein after said deriving a change of each system, the method further comprises:

4. The method of claim 1, wherein after the deriving a change for each system, the method further comprises:

5. The method of claim 1, wherein the automating the aggregate instance task to obtain a processing result comprises:

6. The method according to claim 1 or 5, characterized in that after the obtaining of the processing result, the method further comprises:

7. The method of claim 6, wherein the method further comprises:

8. The disaster recovery switching processing device applied to the multiple systems is characterized by comprising:

9. The apparatus of claim 8, wherein the configuration module is configured to:

For each system, receiving input scheme basic information;

10. The apparatus of claim 8 or 9, wherein the configuration module is further configured to:

11. The apparatus of claim 8, further comprising a verification module to:

12. The apparatus of claim 8, wherein the scheduling module is configured to:

13. The apparatus according to claim 8 or 12, further comprising a timeout mechanism module for:

14. An electronic device, comprising:

One or more processors;

storage means for storing one or more programs,

when executed by the one or more processors, causes the one or more processors to implement the method of any of claims 1-7.

15. A computer readable medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the method according to any of claims 1-7.

16. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any of claims 1-7.