CN107423028A - A parallel scheduling method for large-scale processes - Google Patents

A parallel scheduling method for large-scale processes

Info

Publication number
CN107423028A
CN107423028A (application CN201610343434.0A)
Authority
CN
China
Prior art keywords
pipeline
flow
node
processing
template
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610343434.0A
Other languages
Chinese (zh)
Inventor
袁雪宁
何铭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN201610343434.0A
Publication of CN107423028A
Legal status: Pending

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline or look ahead
    • G06F9/3818Decoding for concurrent execution
    • G06F9/3822Parallel decoding, e.g. parallel decode units

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multi Processors (AREA)

Abstract

The invention discloses a parallel scheduling method for large-scale processes that changes the structure and pattern of process execution, decouples the processing efficiency of an active process instance from the total number of active instances, substantially reduces the server resources required for large-scale process handling, and reduces the system concurrency conflicts caused by large numbers of active process instances. The method builds a processing pipeline for each process template node; physically, a pipeline corresponds to a process, a thread, or an event, and each node corresponds to one pipeline instance. During initialization a pipeline extracts the template and node information and precompiles or loads the node's execution code. Each pipeline then runs independently and continuously until all process computations are completed, and all pipelines share the same processing method. The invention is particularly suitable for the scheduling of large-scale processes.

Description

A parallel scheduling method for large-scale processes
Technical field
The present invention relates to process scheduling methods in process management systems or process handling systems, and in particular to a scheduling method for large-scale processes.
Background art
In a process management system or process handling system, at design time a process designer or administrator designs process templates (Process Template) according to business requirements and stores them in the system. A process template consists of one or more nodes (Node), each node defining one execution step of the process. At run time, the business system or an upstream system maps incoming information and data, according to their attributes and relations, onto a process template, and the system creates a new process instance (Process Instance) from that template; every new business item or item of upstream data is matched to a template and spawns a new process instance. A process instance has the same nodes and relations as its source template, and the system executes the instance's nodes forward in sequence, following their logical relations, until the process completes. All process instances created from the same template are identical except for their state (the information and data carried on the instance); a process instance that has not yet completed is called an active process instance.
In existing process execution and handling mechanisms, active process instances are separated and executed independently: processing is performed with the active process instance as the unit of execution, and each executing instance runs in its own thread (Thread), pseudo-thread, or pipeline (Pipeline). For example, under the current mechanism, 100 active process instances that need to execute may be run in 100 separate threads, or divided among 10 threads with the instances inside each thread queued and executed serially.
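The prior-art mechanism described above, where the active process instance is the unit of execution and instances are divided among a fixed set of threads and queued serially inside each thread, can be sketched as follows. This is an illustrative sketch only; the function and parameter names are assumptions, not from the patent:

```python
from concurrent.futures import ThreadPoolExecutor

def run_instances_prior_art(instances, execute, n_threads=10):
    """Prior-art style: each thread owns a queue of process instances
    and executes them serially; threads run in parallel."""
    results = [None] * len(instances)

    def worker(indices):
        for i in indices:          # serial queue inside one thread
            results[i] = execute(instances[i])

    # Round-robin partition of the instances across the threads.
    partitions = [list(range(k, len(instances), n_threads))
                  for k in range(n_threads)]
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        for p in partitions:
            pool.submit(worker, p)
    return results                 # pool shutdown waits for all workers
```

With 100 instances and 10 threads, each thread serially executes its own queue of 10 instances; this is the horizontal, instance-as-entry-point mode that the invention replaces.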
The problem with the existing mechanism is that in large-scale process operation or handling the number of active process instances can be very large (hundreds of thousands or millions). Independent execution is then inefficient: the per-unit processing time of each active instance grows linearly with the total number of instances, and the performance of all access operations such as queries and updates degrades across the board. At the same time, the existing mechanism consumes enormous computing resources: a typical general-purpose server can instantaneously carry no more than about 1,000 active process instances, so 100,000 active instances would require on the order of a hundred servers.
Summary of the invention
The technical problem to be solved by the invention is to provide a parallel scheduling method for large-scale processes that changes the structure and pattern of process execution, decouples the processing efficiency of an active process instance from the total number of active instances, substantially reduces the server resources required for large-scale process handling, and reduces the system concurrency conflicts caused by large numbers of active process instances.
To solve the above problems, the invention adopts a parallel scheduling method for large-scale processes, in which each process consists of several process template nodes. The parallel scheduling method is:
1) A processing pipeline is built for each process template node; physically, a pipeline corresponds to a process, a thread, or an event, and each node corresponds to one pipeline instance. During initialization the pipeline extracts the template and node information and precompiles or loads the node's execution code.
2) Each pipeline runs independently and continuously until all process computations are completed. All pipelines share the same processing method, whose concrete steps are:
A) Query the persistent store or the process-instance pool for all active process instances whose current node is this pipeline's node; the result is a set of active process instances.
B) Iterate over the set from step A, concurrently or sequentially, taking each active process instance as an input parameter, executing the node's processing action, and obtaining the instance's result state and output data; at the same time, advance the instance's current node to the next node according to the node settings of the process template.
C) After the iteration in step B completes, the result-state set and the output-data set of all the active process instances in the set from step A have been obtained.
D) The pipeline performs a single batch update or store of the result-state set from step C, and a single batch write or store of the output-data set from step C.
E) The pipeline has finished this round of processing; it performs cleanup, then returns to step A and repeats.
The beneficial effect of the invention is as follows: using a two-dimensional reference-table structure, the invention takes a node column (Column) as the execution entry point, replacing the existing processing mode in which the process instance is the execution entry point, and turns the existing row-wise (horizontal) computation into column-wise (vertical) computation. This solves the problems of low execution efficiency and heavy consumption of computing resources that arise under existing methods when the number of active process instances in large-scale process handling is huge (for example, more than 10,000 active instances).
Brief description of the drawings
Fig. 1 is a schematic diagram of the parallel scheduling method of the present invention.
Detailed description of the embodiments
A specific embodiment of the parallel scheduling method for large-scale processes of the present invention is described in detail below with reference to the accompanying drawing.
The process data structure on which the method is based is a two-dimensional reference table (Reference Table; hereinafter, 2D table). As shown in Fig. 1, each 2D table represents one kind of process. The table columns (Column) correspond to the process template (Process Template), each column corresponding to one node (Node) of the template; the table rows (Row) correspond to active process instances (Process Instance), each row corresponding to one active instance; each cell corresponds to the node of its column on the process instance of its row. Depending on actual needs, the 2D table can be implemented with data structures such as an array (Array), a linked list (Linked List), a tree (Tree), or a database table (Database Table).
The above data structure is described in pseudo-code as follows (the pseudo-code is given as a figure in the original publication):
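Since that figure is not reproduced here, the following Python sketch is an illustrative reconstruction of the 2D reference table; the class and field names are assumptions, not the patent's own pseudo-code:

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """One execution step defined by the process template (a table column)."""
    name: str

@dataclass
class ProcessTemplate:
    """Ordered node list shared by all instances of this kind of process."""
    nodes: list  # list[Node], in execution order

@dataclass
class ProcessInstance:
    """One table row: an active instance carrying its own information/data."""
    template: ProcessTemplate
    current_index: int = 0                       # which column it sits on
    state: dict = field(default_factory=dict)    # data carried by the instance

    @property
    def current_node(self):
        return self.template.nodes[self.current_index]

@dataclass
class ReferenceTable:
    """The 2D table: rows are instances, columns are template nodes."""
    template: ProcessTemplate
    rows: list = field(default_factory=list)     # list[ProcessInstance]

    def active_instances_at(self, node):
        """Column query: all active instances whose current node is `node`."""
        return [r for r in self.rows
                if r.current_index < len(self.template.nodes)
                and r.current_node is node]
```

The column-oriented query (`active_instances_at`) is what lets a per-node pipeline fetch its entire work set in one step; an array, linked list, tree, or database table could back the same interface.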
The concrete steps of the parallel scheduling method for large-scale processes of the present invention are:
1) A processing pipeline (Pipeline) is built for each process template node; physically, a pipeline may correspond to a process, a thread, or an event according to actual needs. Each node corresponds to one pipeline instance. During initialization the pipeline extracts the template and node information and precompiles or loads the node's execution code;
2) Each pipeline runs independently and persistently until all process computations are completed. All pipelines share exactly the same processing strategy and algorithm, as follows:
A) Query the persistent store or the process-instance pool for all active process instances whose current node is this pipeline's node; the result is a set of active process instances.
B) Iterate over the set from step A, concurrently or sequentially, taking each active process instance as an input parameter, executing the node's processing action, and obtaining the instance's result state and output data; at the same time, advance the instance's current node to the next node according to the node settings of the process template.
C) After the iteration in step B completes, the result-state set and the output-data set of all the active process instances in the set from step A have been obtained.
D) The pipeline performs a single batch update or store of the result-state set from step C, and a single batch write or store of the output-data set from step C.
E) The pipeline has finished this round of processing; it performs cleanup and enters the next round, i.e. returns to step A and repeats.
The above strategy and algorithm are described in pseudo-code as follows (some variables and objects refer to the data-structure pseudo-code above; the pseudo-code is given as a figure in the original publication):
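As that figure is likewise not reproduced here, the following self-contained Python sketch of one pipeline round (steps A through E) is an illustrative reconstruction under assumed names; `instance_pool`, `handlers`, and the dict layout are inventions for this sketch, not from the patent:

```python
def run_pipeline_round(node_name, instance_pool, handlers, node_order):
    """One round of steps A-E for the pipeline bound to `node_name`.

    instance_pool: list of dicts {"id", "node", "data"} standing in for the
                   persistent store / process-instance pool.
    handlers:      {node_name: fn(data) -> (state, output)}, the precompiled
                   or loaded execution code of each node.
    node_order:    the template's node names in execution order.
    """
    # Step A: query all active instances whose current node is this node.
    batch = [inst for inst in instance_pool if inst["node"] == node_name]

    # Step B: execute the node action per instance and advance each
    # instance to the next template node.
    states, outputs = {}, {}
    nxt = node_order.index(node_name) + 1
    for inst in batch:
        state, output = handlers[node_name](inst["data"])
        states[inst["id"]] = state
        outputs[inst["id"]] = output
        inst["node"] = node_order[nxt] if nxt < len(node_order) else "DONE"

    # Step C: the full result-state set and output-data set are collected.
    # Step D: hand them back for a single batch update/write.
    return states, outputs
    # Step E: the caller cleans up and invokes the next round.
```

In a real deployment, step D would be a single batch update/insert against the persistent store rather than a return value, and each node's pipeline would call this round in its own independent loop.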
Obviously, the above embodiment is merely an example given for the sake of clear illustration and is not a limitation on the possible embodiments. For a person of ordinary skill in the art, changes or variations of other different forms can also be made on the basis of the above description. It is neither necessary nor possible to exhaust all embodiments here. Obvious changes or variations derived therefrom remain within the protection scope of the invention.

Claims (1)

1. A parallel scheduling method for large-scale processes, each process consisting of several process template nodes, characterized in that the parallel scheduling method is:
1) A processing pipeline is built for each process template node; physically, a pipeline corresponds to a process, a thread, or an event, and each node corresponds to one pipeline instance. During initialization the pipeline extracts the template and node information and precompiles or loads the node's execution code.
2) Each pipeline runs independently and continuously until all process computations are completed. All pipelines share the same processing method, whose concrete steps are:
A) Query the persistent store or the process-instance pool for all active process instances whose current node is this pipeline's node; the result is a set of active process instances.
B) Iterate over the set from step A, concurrently or sequentially, taking each active process instance as an input parameter, executing the node's processing action, and obtaining the instance's result state and output data; at the same time, advance the instance's current node to the next node according to the node settings of the process template.
C) After the iteration in step B completes, the result-state set and the output-data set of all the active process instances in the set from step A have been obtained.
D) The pipeline performs a single batch update or store of the result-state set from step C, and a single batch write or store of the output-data set from step C.
E) The pipeline has finished this round of processing; it performs cleanup, then returns to step A and repeats.
CN201610343434.0A 2016-05-23 2016-05-23 A kind of parallel scheduling method of extensive flow Pending CN107423028A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610343434.0A CN107423028A (en) 2016-05-23 2016-05-23 A kind of parallel scheduling method of extensive flow

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610343434.0A CN107423028A (en) 2016-05-23 2016-05-23 A kind of parallel scheduling method of extensive flow

Publications (1)

Publication Number Publication Date
CN107423028A true CN107423028A (en) 2017-12-01

Family

ID=60421910

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610343434.0A Pending CN107423028A (en) 2016-05-23 2016-05-23 A kind of parallel scheduling method of extensive flow

Country Status (1)

Country Link
CN (1) CN107423028A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109062691A (en) * 2018-07-19 2018-12-21 芯视图(常州)微电子有限公司 A kind of light weight vertex coloring thread generates the method and device of distribution
CN109062691B (en) * 2018-07-19 2023-07-04 南京军微半导体科技有限公司 Method for generating and distributing lightweight vertex dyeing threads
WO2021135699A1 (en) * 2019-12-31 2021-07-08 思必驰科技股份有限公司 Decision scheduling customization method and device based on information flow
CN111399851A (en) * 2020-06-06 2020-07-10 四川新网银行股份有限公司 Batch processing execution method based on distributed system

Similar Documents

Publication Publication Date Title
Pan et al. An improved migrating birds optimisation for a hybrid flowshop scheduling with total flowtime minimisation
Iandola et al. Firecaffe: near-linear acceleration of deep neural network training on compute clusters
Zhou et al. Invasive weed optimization algorithm for optimization no-idle flow shop scheduling problem
Jin et al. A scalable hierarchical clustering algorithm using spark
Zou et al. Mariana: Tencent deep learning platform and its applications
CN104750780B (en) A kind of Hadoop configuration parameter optimization methods based on statistical analysis
CN103324765A (en) Multi-core synchronization data query optimization method based on column storage
CN105373517A (en) Spark-based distributed matrix inversion parallel operation method
CN107609141A (en) It is a kind of that quick modelling method of probabilistic is carried out to extensive renewable energy source data
CN107423028A (en) A kind of parallel scheduling method of extensive flow
Rico-Garcia et al. Parallel implementation of metaheuristics for optimizing tool path computation on CNC machining
CN105183880A (en) Hash join method and device
Wang et al. Parallel k-pso based on mapreduce
Tschaikowski et al. Tackling continuous state-space explosion in a Markovian process algebra
CN106156142A (en) The processing method of a kind of text cluster, server and system
CN110473593A (en) A kind of Smith-Waterman algorithm implementation method and device based on FPGA
CN107436865A (en) A kind of word alignment training method, machine translation method and system
Sun et al. An improvement to feature selection of random forests on spark
CN105653680A (en) Method and system for storing data on the basis of document database
CN106970840A (en) A kind of Method for HW/SW partitioning of combination task scheduling
CN101093472A (en) Method for calculating expression in WEB dynamic line type report forms
CN105631047A (en) Hierarchically-cascaded data processing method and hierarchically-cascaded data processing system
CN105701291A (en) Finite element analysis device, information acquisition method and method for parallel generation of system matrix
CN105630896A (en) Method for quickly importing mass data
CN110059378A (en) A kind of automated manufacturing system Petri network state generation method based on GPU parallel computation

Legal Events

Date Code Title Description
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication (application publication date: 20171201)