CN109814991A - A kind of data administer in task management method and device - Google Patents

A kind of data administer in task management method and device Download PDF

Info

Publication number
CN109814991A
CN109814991A CN201811594698.9A CN201811594698A CN109814991A CN 109814991 A CN109814991 A CN 109814991A CN 201811594698 A CN201811594698 A CN 201811594698A CN 109814991 A CN109814991 A CN 109814991A
Authority
CN
China
Prior art keywords
task
data
operator
management
nexus
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811594698.9A
Other languages
Chinese (zh)
Inventor
张毅然
杨威
接钧靖
陈元
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Mininglamp Software System Co ltd
Original Assignee
Beijing Mininglamp Software System Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Mininglamp Software System Co ltd filed Critical Beijing Mininglamp Software System Co ltd
Priority to CN201811594698.9A priority Critical patent/CN109814991A/en
Publication of CN109814991A publication Critical patent/CN109814991A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This application discloses the task management methods and device in a kind of improvement of data;Above-mentioned task management method, comprising: the initial data that building is administered using data is the task nexus figure of starting point, wherein task nexus figure is directed acyclic graph;Task based access control relational graph provides the task management interface for supporting visualized operation;In response to the operation detected in task management interface, data management task is managed.

Description

A kind of data administer in task management method and device
Technical field
This application involves but be not limited to field of computer technology, espespecially a kind of data administer in task management method and dress It sets.
Background technique
With the development of big data technology and application, promoting traditional industries (such as public security, finance, industry) to need magnanimity Business datum is migrated to big data platform, and in this process, the process converted, cleaned to data, referred to as data are administered. However, due to lacking unified standard, the problems such as initial data is lack of standardization, cause data to be administered difficult.Currently, data Improvement is primarily present following problems: data providing different style, and multi-source data usually requires effectively use by fusion; Most of initial data it is second-rate, there is a large amount of wrong row, wrongly written character, ambiguity, need the problems such as converting;For quality compared with High initial data needs to carry out some extra process, for example, multiple and different values of same field, it usually needs deposit respectively Storage just can guarantee subsequent use needs;The data of many data sources are all sustainable growths, need a reliable, stable stream Formula processing or cyclical process process.
Summary of the invention
The embodiment of the present application provide a kind of data administer in task management method and device, realize that flexible management data are controlled Reason task administers demand so as to cope with complicated data, promotes data governance efficiency.
On the one hand, the embodiment of the present application provide a kind of data administer in task management method, comprising: building is controlled with data The initial data of reason is the task nexus figure of starting point;Task based access control relational graph provides the task management for supporting visualized operation Interface;In response to the operation detected in task management interface, data management task is managed;Wherein, task nexus figure is to have To acyclic figure.
On the other hand, the embodiment of the present application provide a kind of data administer in task management device, comprising: task nexus figure Module is constructed, suitable for constructing the initial data administered using data as the task nexus figure of starting point;Interface provides module, is suitable for base In task nexus figure, the task management interface for supporting visualized operation is provided;Task management module is adapted in task pipe The operation detected on reason interface, manages data management task;Wherein, task nexus figure is directed acyclic graph.
On the other hand, the embodiment of the present application provides a kind of terminal device, comprising: display unit, input unit, memory and Processor;The display unit connects the processor, is suitable for display task management interface;The input unit connects the place Manage device, the operation being adapted to detect in the task management interface;The memory is suitable for the task management in storing data improvement The step of program, the task management program realizes above-mentioned task management method when being executed by the processor.
On the other hand, the embodiment of the present application provides a kind of computer-readable medium, the task pipe being stored in data improvement The step of reason program, the task management program realizes above-mentioned task management method when being executed by processor.
In the embodiment of the present application, task nexus figure is constructed by starting point of initial data, and provide support visualized operation Task management interface, realize flexible management data management task, administer demand so as to cope with complicated data, promote number According to governance efficiency.
Other features and advantage will illustrate in the following description, also, partly become from specification It obtains it is clear that being understood and implementing the application.The purpose of the application and other advantages can be by specifications, right Specifically noted structure is achieved and obtained in claim and attached drawing.
Detailed description of the invention
Attached drawing is used to provide to further understand technical scheme, and constitutes part of specification, with this The embodiment of application is used to explain the technical solution of the application together, does not constitute the limitation to technical scheme.
Fig. 1 is the flow chart of the task management method during data provided by the embodiments of the present application are administered;
The exemplary diagram of Fig. 2 dependence between the operator in the embodiment of the present application;
Fig. 3 is the exemplary diagram of the global DAG in the embodiment of the present application;
Fig. 4 is the global DAG in the embodiment of the present application and the exemplary relationship figure between task queue;
Fig. 5 is the schematic diagram of the task management device during data provided by the embodiments of the present application are administered;
Fig. 6 is the schematic diagram of terminal device provided by the embodiments of the present application.
Specific embodiment
Embodiments herein is described in detail below in conjunction with attached drawing.It should be noted that in the feelings not conflicted Under condition, the features in the embodiments and the embodiments of the present application can mutual any combination.
Step shown in the flowchart of the accompanying drawings can be in a computer system such as a set of computer executable instructions It executes.Also, although logical order is shown in flow charts, and it in some cases, can be to be different from herein suitable Sequence executes shown or described step.
The embodiment of the present application propose a kind of data administer in task management method and device, can provide based on original The task management mode of the support visualized operation of data can cope with complexity so as to promote the flexibility of task management Data administer demand, promoted data governance efficiency.In the embodiment of the present application, for data administer in there is it is a large amount of answer The data management task miscellaneous, repeatability is relatively high can execute orderly and automatically they, to maximumlly reduce Implementation cost.
Fig. 1 is the flow chart of the task management method during data provided by the embodiments of the present application are administered.The embodiment of the present application The task management method of offer can be administered by data in task management device (for example, the task tune in data governing system Spend device) it executes.However, the application does not limit this.
As shown in Figure 1, task management method provided in this embodiment may comprise steps of:
The initial data that step 101, building are administered using data is the task nexus figure of starting point, wherein task nexus figure For directed acyclic graph (DAG, Directed Acyclic Graph);
Step 102, task based access control relational graph provide the task management interface for supporting visualized operation;
Step 103, the operation in response to detecting in task management interface manage data management task.
Wherein, DAG refers to that any a line has direction, and the figure of loop is not present.
In one exemplary embodiment, step 101 may include: to establish original table and object table, wherein original table is to use In the table for the initial data that storage data are administered, object table is the table for target data needed for storing business;It will be from original The data improvement of data to target data is in turn divided into pretreatment stage, increment synchronization stage, incremental computations stage and melts The conjunction stage;According to original table, object table, pretreatment stage, increment synchronization stage, incremental computations stage and fusing stage, wound Build the operator of following four type: preconditioning operator, increment synchronization operator, incremental computations operator, fusion operator;Based on above-mentioned four The dependence between dependence and original table and preconditioning operator between the operator of seed type constructs task nexus Figure;Wherein, task nexus figure can be using original table as the directed acyclic graph of starting point, alternatively, with the corresponding data of original table Source is the directed acyclic graph of starting point.
Wherein, by establishing original table, the initial data that data source (for example, database or file) can be provided is stored Into the storage medium of data governing system local, administers and use so as to follow-up data.What object table can be provided according to user Business demand is created.
Wherein, original data processing can be divided into following four rank at the process (i.e. data improvement) of target data Section: pretreatment stage, increment synchronization stage, incremental computations stage and fusing stage.Wherein, pretreatment stage can be for original Beginning data carry out the stage of preliminary treatment, for example, carrying out the processing such as data volume statistics;The increment synchronization stage can be by data source Initial data be synchronized to data governing system local original table stage;The incremental computations stage can be to extract from original table Stage of the data to object table;Fusing stage can in the future from the incremental datas of multiple data sources and historical data press major key or Joint major key carries out the process of multivalue processing, duplicate removal and sequence.The input data and output data in above-mentioned each stage are arranged There is fixed data format, and above-mentioned each stage can carry out corresponding data processing according to predefined logic.It is controlled in data In the case that the initial data of reason is given, a corresponding operator can be created to any instance in above-mentioned each stage.
In one exemplary embodiment, it can be one-to-one relationship between preconditioning operator and increment synchronization operator, increase Measuring can be one-to-many relationship between synchronous operator and incremental computations operator, increment synchronization operator with merge between operator can be with For the relationship of multi-to-multi;It can be one-to-many relationship between original table and preconditioning operator.
In one exemplary embodiment, the task management interface that step 102 provides can show that complete task is closed in advance System's figure, alternatively, in response to the clicking operation detected in task management interface, from initial data (for example, data source or original Table) start successively display task nexus figure.However, the application does not limit this.
In one exemplary embodiment, step 103 may include: the operator in response to detecting in task management interface Selection operation determines operator set to be processed;Operation is executed in response to the management detected in task management interface, is handled The operator set.
Wherein, operator selection operation may include continuous page clicking operation, however, the application does not limit this. For example, in other embodiments, operator selection operation may include page frame selection operation.
Wherein, it may include: the clicking operation to the specific command shown in task management interface that management, which executes operation,.So And the application does not limit this.For example, in other embodiments, management execute operation may include page right click operation with And after right click the order of display interface selection operation clicking operation.
In one exemplary embodiment, operation is executed in response to the management detected in task management interface, processing is calculated Subclass may include at least one of:
Corresponding task addition order is operated when management executes, each operator in operator set is generated into corresponding data and is controlled Reason task, and task queue is added in the data management task of generation, wait scheduling to execute;
Corresponding task delete command is operated when management executes, the corresponding data of each operator in operator set are administered and are appointed Business is deleted from task queue;
Order is re-run when management executes the corresponding task of operation, in task queue, resets each calculation in operator set The state of the corresponding data management task of son, so that data management task is re-scheduled for execution.
Wherein, the executable management of data management task may include at least one of following: task addition, task are deleted It removes, task is re-run.Moreover, the embodiment of the present application support manages batch tasks accordingly.
In one exemplary embodiment, above-mentioned task management method can also include: the mission failure of configuration task queue Processing strategie;Wherein, mission failure processing strategie includes at least one of the following: that any data management task in task queue is held After row failure, suspend the data management task in task dispatching queue, until the data management task of the failure is merely re-executed Success;After any data management task in task queue executes failure, continue the data in task dispatching queue with the failure Management task does not have the follow-up data management task of dependence, until not schedulable data management task.Wherein it is possible to Before or after step 103, mission failure processing strategie is configured.However, the application does not limit this.
In one exemplary embodiment, in response to the operator selection operation detected in task management interface, determine to The operator set of processing may include: display data sources or original table in task management interface, in response in task management field The clicking operation to data source or original table detected on face, expansion display is dependent on clicked data source or original table The list of subsequent stage or multistage operator;Operator to be processed is selected from the list of expansion.
Task management method provided by the embodiments of the present application constructs task nexus figure, and base by starting point of initial data In task nexus figure, the task management interface for supporting visualized operation is provided, to realize the flexible management of data management task. For example, the operator set that execute operation can be chosen, and a key is realized in the task management interface for supporting visualized operation The batch of the corresponding data management task of operator set is added, deletes or re-run operation.In this way, provided in this embodiment Business management method can cope with complicated data and administer demand, to promote data governance efficiency.
Task management method provided by the embodiments of the present application is illustrated below by an exemplary embodiment.Its In, it is illustrated so that the task dispatcher in data governing system executes task management method as an example.The present exemplary embodiment mentions The task management method of confession may comprise steps of:
Step 1: importing initial data.Wherein, by the relevant information of disposition data source, data can be made to administer system System sends data access request to source data server according to these configuration informations.Each different data source can be used as data Source instance is recorded in data governing system.Several library tables is corresponded to below each data source, these library tables can be believed Breath is synchronized in data governing system, as the original table (Schema) of entire data governing system, wherein may include table The information such as Chinese name, English name, list of fields, field data types.It, can be by data source subsequently through the information of original table Raw data base of the initial data storage into local storage medium, as follow-up data improvement.
Step 2: building object table.In this step, it can be constructed on the basis of initial data according to business demand The model (referred to as object table) for the target data for needing to administer out.Entire data improvement can be regarded as substantially will be at initial data Manage into the process of target data.Wherein, business demand can be provided according to the actual situation by user.
Step 3: creation operator.
In the present exemplary embodiment, the whole flow process that data are administered can be divided into the following four stage: pretreatment rank Section, increment synchronization stage, incremental computations stage and fusing stage.Each stage can correspond to a type of operator, every type The operator of type has outputting and inputting for oneself fixed data format, and can handle data according to logic predetermined.Its In, in the case where given input value, corresponding operator can be created to the example in each stage.
Wherein, pretreatment stage, which refers to, carries out some preliminary pretreatments, such as statistics, connection processing etc. to initial data; The increment synchronization stage refers to the stage that data are synchronized to the local original table of data governing system from data source;The incremental computations stage Refer to the stage that data are drawn into object table from original table, wherein may include that data are converted, cleaned etc. with processing;Fusion Stage refer to by incremental data and historical data from multiple data sources by major key or combine major key carry out multivalue processing, duplicate removal and The process of sequence.Data after fusing stage (can be administered by data as the source data that upper layer analysis is applied Obtained target data).
In the present exemplary embodiment, the operator of following four type can be created: preconditioning operator, increment synchronization operator, Incremental computations operator and fusion operator.
Step 4: building DAG.
It, can be based between operator since there is specific dependences between the operator of above-mentioned four seed type Dependence constructs DAG.The exemplary diagram of Fig. 2 dependence between the operator in the embodiment of the present application.Wherein, in Fig. 2 Arrow direction indicates the direction of dependence between operator.As shown in Fig. 2, being one between preconditioning operator and increment synchronization operator One-to-one correspondence;It is one-to-many relationship between increment synchronization operator and incremental computations operator, this is because same original table Data may be drawn into multiple object tables;Increment synchronization with merge be between operator multi-to-multi relationship, this be by Caused by the difference of the target data difference and versions of data that are merged.
Based on above-mentioned analysis, the operator created in entire data being administered, which constitutes one, has multiple start nodes DAG.In the present exemplary embodiment, global DAG can be obtained using data source and original table as the preposition node of preconditioning operator (GlobalDAG).Fig. 3 is the exemplary diagram of the global DAG in the embodiment of the present application.Wherein, the arrow direction in Fig. 3 indicates operator Between dependence direction.Global DAG as shown in Figure 3 is using data source as starting point.As shown in figure 3, data source with it is original It is one-to-one relationship between table, is one-to-one relationship between original table and preconditioning operator.However, the application to this not It limits.In other embodiments, can be one-to-many relationship between data source and original table, original table and preconditioning operator it Between can be one-to-many relationship.
Step 5: providing the task management interface for supporting visualized operation.
In the present exemplary embodiment, based on the task management interface that GlobalDAG is provided, can with display data sources, according to The point selection operation of user, then successively expansion shows relevant original table and operator.For example, user can be from some data of display Source is set out, and sequentially finds and choose the operator (for example, increment synchronization operator, fusion operator etc.) dependent on the data source, and under Send out the administration order to selected operator.
For example, in task management interface, after user clicks certain data source S, unfolded display in task management interface The list T of original table below data source S;User continues to click certain original table t in list T, meeting in task management interface Be unfolded all incremental data version Vi in original table t (i=1,2 ... n) (if original table t is non-increment list, will be only One versions of data);Wherein, due to being one-to-one relationship between preconditioning operator and increment synchronization operator, Yong Hudian When hitting original table t, preconditioning operator list L1 and increment synchronization operator list L2 can also be unfolded in task management interface simultaneously.With Family can choose the increment synchronization operator in preconditioning operator and list L2 in list L1, and selected operator will be added into It has chosen in operator set.It similarly, will expansion in task management interface after user clicks the increment synchronization operator in list L2 Corresponding incremental computations operator list L3.User can choose the addition of the incremental computations operator in list L3 and choose operator set In.Similarly, after user clicks the incremental computations operator in list L3, corresponding fusion operator will be unfolded in task management interface List L4, user can choose the addition of the fusion operator in list L4 and choose in operator set.Above-mentioned user is in task management Operator selection operation on interface is only a kind of example.In other embodiments, can be directly displayed in task management interface with Data source is the task nexus figure (i.e. overall situation DAG) of starting point, and user directly can pass through clicking operation in task relational graph Corresponding operator addition is chosen to have chosen in operator set.However, the application does not limit this.
In the present exemplary embodiment, after corresponding operator is chosen and is added to and has chosen operator set, Yong Huke To trigger the processing to operator set has been chosen by clicking the administration order shown in task management interface.However, the application This is not limited.In other embodiments, user can select corresponding administration order by clicking menu bar, to trigger pair The processing of operator set is chosen.
For example, user can click addition order, the corresponding data management task of the operator in operator set will have been chosen It is added to task queue, scheduling is waited to execute;User can click delete command, corresponding by the operator in operator set has been chosen Data management task removed from task queue;User can click and re-run order, so that in task queue, resetting is The state for choosing the corresponding data management task of operator in operator set enables corresponding data management task by again Scheduling executes.
Wherein, after the task dispatcher of data governing system receives the addition order that user issues, addition can be executed Operation, at this point, task queue can initialize the relevant parameter (including service parameter and operating parameter) in the operator chosen, and Environmental variance, check logic etc. are added, the data management task that can be dispatched by task dispatcher is generated.
Wherein, task queue is the important component of the task dispatcher of data governing system, is responsible for that data is allowed to administer Task is carried out in the correct order.Fig. 4 is the global DAG in the embodiment of the present application and the relationship between task queue is shown It is intended to.All unfinished data management tasks in task queue (including do not start, fail, waiting data are administered and are appointed Business), all from having chosen operator set.The operator in operator set has been chosen from global DAG.As shown in figure 4, appointing The DAG that operator set corresponding to all unfinished data management tasks is constituted in business queue is the SubDAG (son of global DAG DAG).It should be noted that the Open circle in Fig. 4 indicates operator, filling the round of grid indicates data management task.
Step 6: configuring and starting task queue.
Wherein, after task queue is generated or updated, it can star task queue to dispatch the execution of data management task.? In one exemplary embodiment, two kinds of mission failure processing strategies are can be set in task dispatcher, to handle in task queue Data management task executes the case where failure.Wherein, mission failure processing strategie may include: mode one, in task queue Any one data management task execute failure after, pause dispatch the task queue in data management task (come failure Data management task after task will not be scheduled), until failure data management task re-execute success;Mode two, After any one data management task in task queue executes failure, continue the data management task of scheduling and the failure without according to The subsequent executable data management task for the relationship of relying in other words, will not until not schedulable data management task Suspend the scheduling to data management task.Two kinds of mission failure processing strategies are supported by setting task dispatcher, it can basis Dispatching requirement is flexibly switched, and administers demand to adapt to different data.
In an illustrative embodiments, the data management task that task dispatcher can while concurrently execute can also be set Quantity, so as to support not dependence data management task concurrently execution.
The present exemplary embodiment constructs the task nexus figure of complete set using data source as starting point, and is based on this Business relational graph, provides the task management interface for supporting visualized operation, appoints so that user quickly finds specific data and administers Business, or flexible management is carried out to data management task, for example, batch tasks are added, delete or re-run by a key.And And it can also flexibly switch mission failure processing strategie according to dispatching requirement.
Fig. 5 is the schematic diagram of the task management device during data provided by the embodiments of the present application are administered.As shown in figure 5, this The task management device that embodiment provides may include: task nexus figure building module 501, interface offer module 502 and appoint Business management module 503;Wherein, task nexus figure constructs module 501, suitable for constructing the initial data administered using data as starting point Task nexus figure, wherein task nexus figure is directed acyclic graph;Interface provides module 502, is suitable for task based access control relational graph, The task management interface for supporting visualized operation is provided;Task management module 503 is adapted for examining in task management interface The operation measured manages data management task.
In one exemplary embodiment, task nexus figure building module 501 may be adapted to be constructed in the following manner with number Initial data according to improvement is the task nexus figure of starting point: establishing original table and object table, wherein original table is for storing The table for the initial data that data are administered, object table are the table for target data needed for storing business;Will from initial data to The data improvement of target data is in turn divided into pretreatment stage, increment synchronization stage, incremental computations stage and fusing stage; According to pretreatment stage, increment synchronization stage, incremental computations stage and the fusion rank in original table, object table, data improvement Section creates the operator of following four type: preconditioning operator, increment synchronization operator, incremental computations operator, fusion operator;It is based on The dependence between dependence and original table and preconditioning operator between the operator of above-mentioned four seed type, building are appointed Business relational graph;Wherein, task nexus figure is the directed acyclic graph using original table as starting point, alternatively, with the corresponding number of original table It is the directed acyclic graph of starting point according to source.
In one exemplary embodiment, task management module 503 may be adapted in the following manner in response in task pipe The operation detected on reason interface, manages data management task: choosing in response to the operator detected in task management interface Operation, determines operator set to be processed;Operation is executed in response to the management detected in task management interface, handles operator Set.
Related description about task management device provided in this embodiment is referred to retouching for above-mentioned task management method It states, therefore is repeated no more in this.
Fig. 6 is the schematic diagram of terminal device provided by the embodiments of the present application.As shown in fig. 6, terminal provided in this embodiment Equipment (for example, the fixed terminals such as the mobile terminals such as smart phone, tablet computer or desktop computer), comprising: processor 601, Memory 602, display unit 603 and input unit 604;Wherein, display unit 603 connects processor 601, is suitable for display and appoints Business administration interface;Input unit 604 connects processor 601, the operation being adapted to detect in task management interface;Memory 602 is suitable Task management program in storing data improvement, the task management program realize above-described embodiment when being executed by processor 601 The step of task management method of offer, such as the step of task management method shown in FIG. 1.
It should be noted that the structure of terminal device shown in Fig. 6 does not constitute the restriction to terminal device, can wrap It includes than illustrating more or fewer components, perhaps combine certain components or different component layouts is provided.
Wherein, processor 601 can include but is not limited to microprocessor (MCU, Microcontroller Unit) or can The processing unit of programmed logic device (FPGA, Field Programmable Gate Array) etc..Memory 602 can be used for The software program and module for storing application software, such as the corresponding program instruction of task management method or mould in the present embodiment Block, the software program and module that processor 601 is stored in memory 602 by operation, thereby executing various function application And data processing, for example realize task management method provided in this embodiment.Memory 602 may include high speed random storage Device may also include nonvolatile memory, such as one or more magnetic storage device, flash memory or other are non-volatile solid State memory.In some instances, memory 602 may include the memory remotely located relative to processor 601, these are long-range Memory can pass through network connection to terminal device.The example of above-mentioned network include but is not limited to internet, intranet, Local area network, mobile radio communication and combinations thereof.
Wherein, input unit 604 may be adapted to the information for receiving input.Illustratively, input unit 604 may include touching Control panel (or being touch screen) and other input equipments (for example, mouse, keyboard, operating stick etc.).Display unit 603 can be with Suitable for showing the information of user's input or being supplied to the information of user.Display unit 603 may include display panel, for example, liquid Crystal display, Organic Light Emitting Diode etc..Illustratively, touch panel can cover on a display panel, when touch panel is examined After measuring touch operation on it or nearby, processor 601 is transferred to determine the type of touch event, is followed by subsequent processing device 601 provide corresponding visual output according to the type of touch event on a display panel.Illustratively, touch panel and display surface What plate can be used as two independent components to realize terminal device outputs and inputs function, alternatively, touch panel and display surface Plate, which can integrate, outputs and inputs function together to realize.
In addition, the embodiment of the present application also provides a kind of computer-readable medium, the task management being stored in data improvement The step of program, which realizes above-mentioned task management method when being executed by processor, such as step shown in FIG. 1 Suddenly.
It will appreciated by the skilled person that whole or certain steps, system, dress in method disclosed hereinabove Functional module/unit in setting may be implemented as software, firmware, hardware and its combination appropriate.In hardware embodiment, Division between the functional module/unit referred in the above description not necessarily corresponds to the division of physical assemblies;For example, one Physical assemblies can have multiple functions or a function or step and can be executed by several physical assemblies cooperations.Certain groups Part or all components may be implemented as by processor, such as the software that digital signal processor or microprocessor execute, or by It is embodied as hardware, or is implemented as integrated circuit, such as specific integrated circuit.Such software can be distributed in computer-readable On medium, computer-readable medium may include computer storage medium (or non-transitory medium) and communication media (or temporarily Property medium).As known to a person of ordinary skill in the art, term computer storage medium is included in for storing information (such as Computer readable instructions, data structure, program module or other data) any method or technique in the volatibility implemented and non- Volatibility, removable and nonremovable medium.Computer storage medium include but is not limited to RAM, ROM, EEPROM, flash memory or its His memory technology, CD-ROM, digital versatile disc (DVD) or other optical disc storages, magnetic holder, tape, disk storage or other Magnetic memory apparatus or any other medium that can be used for storing desired information and can be accessed by a computer.This Outside, known to a person of ordinary skill in the art to be, communication media generally comprises computer readable instructions, data structure, program mould Other data in the modulated data signal of block or such as carrier wave or other transmission mechanisms etc, and may include any information Delivery media.

Claims (10)

1. the task management method in a kind of data improvement characterized by comprising
The initial data administered using data is constructed as the task nexus figure of starting point, wherein the task nexus figure is oriented nothing Ring figure;
Based on the task nexus figure, the task management interface for supporting visualized operation is provided;
In response to the operation detected in the task management interface, data management task is managed.
2. the method according to claim 1, wherein the initial data administered using data that constructs is starting point Task nexus figure, comprising:
Establish original table and object table;Wherein, the original table is the table for storing the initial data of data improvement, the mesh Mark table is the table for target data needed for storing business;
Pretreatment stage, increment synchronization rank will be in turn divided into from the initial data to the improvement of the data of the target data Section, incremental computations stage and fusing stage;
According to the pretreatment stage in the original table, the object table, data improvement, increment synchronization stage, increment meter Calculation stage and fusing stage, create the operator of following four type: preconditioning operator, increment synchronization operator, incremental computations are calculated Son, fusion operator;
Between dependence and the original table and the preconditioning operator between operator based on above-mentioned four seed type Dependence constructs task nexus figure;Wherein, the task nexus figure is using the original table as the directed acyclic of starting point Figure, alternatively, using the corresponding data source of the original table as the directed acyclic graph of starting point.
3. according to the method described in claim 2, it is characterized in that, between the preconditioning operator and the increment synchronization operator It is one-to-many relationship, the increment between the increment synchronization operator and the incremental computations operator for one-to-one relationship Synchronize operator and the relationship merged between operator for multi-to-multi;It is a pair of between the original table and the preconditioning operator More relationships.
4. according to the method described in claim 2, it is characterized in that, described in response to being detected in the task management interface Operation, manage data management task, comprising:
In response to the operator selection operation detected in the task management interface, operator set to be processed is determined;
Operation is executed in response to the management detected in the task management interface, handles the operator set.
5. according to the method described in claim 4, it is characterized in that, described in response to being detected in the task management interface Management execute operation, handle the operator set, including at least one of:
Corresponding task addition order is operated when the management executes, each operator in the operator set is generated into corresponding number Task queue is added according to management task, and by the data management task of generation, scheduling is waited to execute;
Corresponding task delete command is operated when the management executes, the corresponding data of each operator in the operator set are controlled Reason task is deleted from the task queue;
Order is re-run when the management executes the corresponding task of operation, in the task queue, is reset in the operator set The corresponding data management task of each operator state, so that the data management task is re-scheduled for execution.
6. according to the method described in claim 5, it is characterized in that, the method also includes: configure the task queue appoint Business failure handling strategy;Wherein, the mission failure processing strategie includes at least one of the following:
After any data management task in the task queue executes failure, the data that pause is dispatched in the task queue are controlled Reason task, until the data management task of the failure re-executes success;
After any data management task in the task queue executes failure, continue to dispatch in the task queue with the mistake The data management task lost does not have the follow-up data management task of dependence, until not schedulable data management task.
7. according to the method described in claim 4, it is characterized in that, described in response to being detected in the task management interface Operator selection operation, determine operator set to be processed, comprising:
The data source or original table are shown in the task management interface, in response to detecting in the task management interface The clicking operation to the data source or original table arrived, expansion display is dependent on the subsequent of clicked data source or original table The list of one or more levels operator;Operator to be processed is selected from the list of expansion.
8. the task management device in a kind of data improvement characterized by comprising
Task nexus figure constructs module, suitable for constructing the initial data administered using data as the task nexus figure of starting point, wherein The task nexus figure is directed acyclic graph;
Interface provides module, is suitable for being based on the task nexus figure, provides the task management interface for supporting visualized operation;
Task management module is adapted for the operation detected in the task management interface, manages data management task.
9. a kind of terminal device characterized by comprising display unit, input unit, memory and processor;The display Unit connects the processor, is suitable for display task management interface;The input unit connects the processor, is adapted to detect for institute State the operation in task management interface;The memory is suitable for the task management program in storing data improvement, the task pipe The step of reason program realizes the task management method as described in any one of claims 1 to 7 when being executed by the processor.
10. a kind of computer-readable medium, which is characterized in that the task management program being stored in data improvement, the task The step of task management method as described in any one of claims 1 to 7 is realized when management program is executed by processor.
CN201811594698.9A 2018-12-25 2018-12-25 A kind of data administer in task management method and device Pending CN109814991A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811594698.9A CN109814991A (en) 2018-12-25 2018-12-25 A kind of data administer in task management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811594698.9A CN109814991A (en) 2018-12-25 2018-12-25 A kind of data administer in task management method and device

Publications (1)

Publication Number Publication Date
CN109814991A true CN109814991A (en) 2019-05-28

Family

ID=66602412

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811594698.9A Pending CN109814991A (en) 2018-12-25 2018-12-25 A kind of data administer in task management method and device

Country Status (1)

Country Link
CN (1) CN109814991A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110362362A (en) * 2019-07-24 2019-10-22 北京明略软件***有限公司 Method for scheduling task and device, storage medium, electronic device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090276449A1 (en) * 2008-04-30 2009-11-05 Business Objects, S.A. Apparatus and method to transform an extract transform and load (etl) task into a delta load task
CN105976158A (en) * 2016-04-26 2016-09-28 中国电子科技网络信息安全有限公司 Visual ETL flow management and scheduling monitoring method
CN106897411A (en) * 2017-02-20 2017-06-27 广东奡风科技股份有限公司 ETL system and its method based on Spark technologies
CN110232085A (en) * 2019-04-30 2019-09-13 中国科学院计算机网络信息中心 A kind of method of combination and system of big data ETL task

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090276449A1 (en) * 2008-04-30 2009-11-05 Business Objects, S.A. Apparatus and method to transform an extract transform and load (etl) task into a delta load task
CN105976158A (en) * 2016-04-26 2016-09-28 中国电子科技网络信息安全有限公司 Visual ETL flow management and scheduling monitoring method
CN106897411A (en) * 2017-02-20 2017-06-27 广东奡风科技股份有限公司 ETL system and its method based on Spark technologies
CN110232085A (en) * 2019-04-30 2019-09-13 中国科学院计算机网络信息中心 A kind of method of combination and system of big data ETL task

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵双瑞: "通用ETL工具的研究与开发", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110362362A (en) * 2019-07-24 2019-10-22 北京明略软件***有限公司 Method for scheduling task and device, storage medium, electronic device

Similar Documents

Publication Publication Date Title
US20210318851A1 (en) Systems and Methods for Dataset Merging using Flow Structures
US9659012B2 (en) Debugging framework for distributed ETL process with multi-language support
US10073867B2 (en) System and method for code generation from a directed acyclic graph using knowledge modules
US10216814B2 (en) Supporting combination of flow based ETL and entity relationship based ETL
CN109800226A (en) A kind of data administer in task management method and device
CN107506442B (en) Modeling method and device of model
US9519663B2 (en) Upgrading and migrating a database by a migration tool
US20190138532A1 (en) Load plan generation
US9507838B2 (en) Use of projector and selector component types for ETL map design
CN105389251A (en) Structured query language debugger
CN105528418B (en) A kind of design documentation generation method and device
US10466981B1 (en) System and method for generative programming in an integrated development environment (IDE)
CN111324610A (en) Data synchronization method and device
CN108388623B (en) ER relationship generation method and device, computer equipment and storage medium
CN105956087A (en) Data and code version management system and method
CN108427709B (en) Multi-source mass data processing system and method
CN110908793A (en) Long-time task execution method, device, equipment and readable storage medium
CN109800069A (en) A kind of method and device realizing data and administering
CN107463391A (en) Task processing method, device and equipment
CN109814991A (en) A kind of data administer in task management method and device
CN113741931B (en) Software upgrading method and device, electronic equipment and readable storage medium
Pastrana-Pardo et al. Approach to the Best Practices in Software Development Based on DevOps and SCRUM Used in Very Small Entities
CN112597121A (en) Logic script processing method and device, electronic equipment and storage medium
CN112367205A (en) Processing method and scheduling system for HTTP scheduling request
CN115460077B (en) Configuration information updating method and device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190528

RJ01 Rejection of invention patent application after publication