CN111045843B - Distributed data processing method with fault tolerance capability - Google Patents


Info

Publication number
CN111045843B
CN111045843B (application CN201911057644.3A)
Authority
CN
China
Prior art keywords
node
result
data
sub
cluster
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911057644.3A
Other languages
Chinese (zh)
Other versions
CN111045843A (en)
Inventor
谢在鹏
张基
李博文
叶保留
毛莺池
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN201911057644.3A priority Critical patent/CN111045843B/en
Publication of CN111045843A publication Critical patent/CN111045843A/en
Application granted granted Critical
Publication of CN111045843B publication Critical patent/CN111045843B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0709 Error or fault processing not based on redundancy, the processing taking place in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems
    • G06F11/0793 Remedial or corrective actions

Abstract

The invention discloses a distributed data processing method with fault tolerance capability. Each portion of the data is processed simultaneously by three mutually independent modules that implement the same function. Because the probability that two independent modules produce the same error is very small, the reliability of the system is greatly improved and the errors of a faulty module are masked.

Description

Distributed data processing method with fault tolerance capability
Technical Field
The invention relates to the field of distributed computing, in particular to a distributed data processing method with fault tolerance capability.
Background
The explosion of data volume in the big-data era has made distributed computing increasingly important. In many practical distributed systems, transient faults cause abnormal behavior in computer systems, and ever smaller transistors and higher hardware power densities make such faults more frequent. In some large distributed systems, 1%-2% of the nodes fail each day on average. Such error rates not only reduce the reliability of a distributed system but also degrade its performance, so providing effective fault tolerance in distributed systems is important.
Conventional fault-tolerance techniques in distributed systems include replication and coding; however, these techniques incur a high communication load when implementing fault tolerance. For example, because the data is distributed, coding-based fault tolerance requires multiple nodes in the cluster to cooperate, which causes a large amount of data transmission and therefore high communication overhead. In a MapReduce-based distributed computing framework, a task is roughly divided into three phases: Map, Shuffle and Reduce. First, in the Map phase, each node applies a predefined Map function to its local input data to obtain intermediate results. These intermediate results are then exchanged among the compute nodes, which use them as inputs to the Reduce function to compute the output. The high communication load of the Shuffle phase seriously limits the performance of distributed computing. Communication load is therefore a bottleneck for improving distributed-system performance, and the traditional schemes suffer from excessively high communication load.
Disclosure of Invention
To solve the above problems, the present invention provides a distributed data processing method with fault tolerance capability, comprising the following steps:
S10, dividing the N nodes into ⌊N/n⌋ sub-clusters and distributing the computing task over the ⌊N/n⌋ sub-clusters, so that each sub-cluster computes in parallel; the symbol ⌊·⌋ denotes rounding down, and n denotes the number of nodes in a sub-cluster;
S20, dividing the data set M into ⌊N/n⌋ partial data sets and sending them to the ⌊N/n⌋ sub-clusters respectively, so that the data volume of each sub-cluster is |M|/⌊N/n⌋; each sub-cluster divides its data set B into K' independent sub-data blocks, each denoted b_i with 1 ≤ i ≤ K', and each data block is replicated 3 times, producing 3K' data copies;
S30, each sub-cluster distributes the 3K' data copies to the n nodes of the current sub-cluster according to a set rule, so that the data blocks in each node's local data set are mutually distinct; for nodes node_k and node_m, if the subscripts satisfy k + m = n + 1, the intersection of their data sets is empty and their union is the full data set of the current sub-cluster; if k + m ≠ n + 1, node_k and node_m share exactly two identical redundant data blocks;
S50, each sub-cluster randomly selects a pair of check nodes node_k and node_m satisfying k + m = n + 1; the remaining nodes in the sub-cluster perform steps S51 and S52, which may run in parallel:
S51, each other node in the sub-cluster uses the CDC algorithm to encode the first intermediate results corresponding to the data it stores redundantly with node_k, obtaining a first encoding result, and sends the first encoding result to node_k;
S52, each other node in the sub-cluster uses the CDC algorithm to encode the second intermediate results corresponding to the data it stores redundantly with node_m, obtaining a second encoding result, and sends the second encoding result to node_m;
S60, after receiving all the first encoding results, node_k decodes the first encoding results obtained in S51 to get the first decoding result for each piece of data; after receiving all the second encoding results, node_m decodes the second encoding results obtained in S52 to get the second decoding results;
S70, node_k verifies the correctness of the first encoding result for each piece of data using the first decoding result and the first intermediate results, and if no first encoding result is correct, execution returns to step S50; likewise, node_m verifies the correctness of the second encoding result using the second decoding result and the second intermediate results, and if no second encoding result is correct, execution returns to step S50. If the number of correct encoding results among the first encoding results is at least 1 and the number of correct encoding results among the second encoding results is at least 1, node_k computes the correct result of the corresponding node from the correct first encoding results to obtain a first operation result, and node_m computes the correct result of the corresponding node from the correct second encoding results to obtain a second operation result;
and S90, each sub-cluster determines its final result according to the corresponding first operation result and second operation result.
In one embodiment, when each sub-cluster allocates the 3K' data copies to the n nodes of the current sub-cluster according to the set rule in step S30, the method further includes:
In each sub-cluster, each node is assigned 3K'/n data blocks, and the data set assigned to node_j is denoted B_j. If the subscripts of node_k and node_m satisfy k + m = n + 1, the intersection of their data sets is empty and their union is the full data set of the current sub-cluster; if k + m ≠ n + 1, node_k and node_m share exactly two identical redundant data blocks b_i, b_j, i.e. B_k ∩ B_m = {b_i, b_j}, and the redundant set is defined as R_{k,m} ≜ B_k ∩ B_m, where the symbol ≜ means "defined as".
In one embodiment, before step S50 the method further includes:
S40, each node applies a Map function G(·) to each data block of its local data set to obtain the intermediate results v_i^t = G(b_i), where i denotes the data block index and t denotes the node index; the Map function can be customized by the user for different computing tasks.
In one embodiment, n has a value of 6 and K' has a value of 8.
As one example, step S50 includes:
Each sub-cluster randomly selects a pair of check nodes node_k and node_m with k + m = 7, and the remaining four nodes of the sub-cluster are denoted node_s, node_n, node_p, node_q. Each of node_s, node_n, node_p, node_q uses CDC to encode the first intermediate results computed on it from the data set it stores redundantly with node_k (R_{s,k}, R_{n,k}, R_{p,k}, R_{q,k}), obtaining the encoding results u_{s,k}, u_{n,k}, u_{p,k}, u_{q,k} of the first intermediate results, i.e. the first encoding result, and sends the first encoding result to node_k. Likewise, node_s, node_n, node_p, node_q use CDC to encode the second intermediate results computed on them from the data sets R_{s,m}, R_{n,m}, R_{p,m}, R_{q,m} stored redundantly with node_m, obtaining the encoding results u_{s,m}, u_{n,m}, u_{p,m}, u_{q,m}, i.e. the second encoding result, and send the second encoding result to node_m.
As one example, step S70 includes:
If (v_2^s = v_2^k) ∧ (v_3^n = v_3^k) holds or (v_1^p = v_1^k) ∧ (v_4^q = v_4^k) holds, then at least 2 of node_k's first encoding results are correct and all of node_k's local intermediate results were computed correctly; the operation result of the current node's data set is computed through the Reduce function as r_k = H(v_1^k, v_2^k, v_3^k, v_4^k), giving the first operation result. If (v_6^s = v_6^m) ∧ (v_7^n = v_7^m) holds or (v_5^p = v_5^m) ∧ (v_8^q = v_8^m) holds, then at least 2 of node_m's second encoding results are correct and all of node_m's local intermediate results were computed correctly; the operation result of the current node's data set is computed through the Reduce function as r_m = H(v_5^m, v_6^m, v_7^m, v_8^m), giving the second operation result. Here v_2^s denotes the intermediate result of data block 2 on node_s; similarly v_1^k, v_2^k, v_3^k, v_4^k denote the intermediate results of the corresponding data blocks on node_k; v_3^n is the intermediate result of data block 3 on node_n, v_1^p that of data block 1 on node_p, v_4^q that of data block 4 on node_q, v_6^s that of data block 6 on node_s; v_5^m, v_6^m, v_7^m, v_8^m denote the intermediate results obtained by node_m in the Map phase on its own data blocks; v_7^n is the intermediate result of data block 7 on node_n, v_5^p that of data block 5 on node_p, and v_8^q that of data block 8 on node_q. H(·) denotes the function of the Reduce phase, whose effect is to combine multiple intermediate results into one, and the symbol ∧ denotes the logical AND operation.
If (v_2^s = v_2^k) ∧ (v_3^n = v_3^k) is false and (v_1^p = v_1^k) ∧ (v_4^q = v_4^k) is false, but not all four equalities fail, then at least one correct result exists among the first encoding results of node_k; similarly, if (v_6^s = v_6^m) ∧ (v_7^n = v_7^m) is false and (v_5^p = v_5^m) ∧ (v_8^q = v_8^m) is false, but not all four equalities fail, then at least one correct result exists among the second encoding results of node_m. If v_2^s ≠ v_2^k, v_3^n ≠ v_3^k, v_1^p ≠ v_1^k and v_4^q ≠ v_4^k, the first encoding result is judged incorrect and execution returns to step S50; likewise, if v_6^s ≠ v_6^m, v_7^n ≠ v_7^m, v_5^p ≠ v_5^m and v_8^q ≠ v_8^m, the second encoding result is judged incorrect and execution returns to step S50.
As one example, after step S70, when at least 1 of the first encoding results of node_k is correct and at least 1 of the second encoding results of node_m is correct, the method further includes:
If v_2^s = v_2^k, then u_{s,k} is verified successfully, u_{s,k} = u_{k,s}, v_1^k = v_1^s and v_2^k = v_2^s. Through v_3^p = u_{p,k} ⊕ v_1^k we obtain v_3^p, and through v_4^q = u_{q,k} ⊕ v_2^k we obtain v_4^q. If the judgment u_{n,k} = v_3^p ⊕ v_4^q holds, the error source is node_k's local data blocks while the first encoding results sent by the other nodes were received correctly, and node_k's correct result r_k = H(v_1^k, v_2^k, v_3^p, v_4^q) is obtained as the first operation result. Here u_{k,s} denotes the result of CDC-encoding the intermediate results corresponding to the data stored redundantly on node_k and node_s, and ⊕ denotes the logical exclusive OR.
If v_6^s = v_6^m, then u_{s,m} is verified successfully, u_{s,m} = u_{m,s}, v_6^m = v_6^s and v_5^m = v_5^s. Through v_7^p = u_{p,m} ⊕ v_5^m we obtain v_7^p, and through v_8^q = u_{q,m} ⊕ v_6^m we obtain v_8^q. If the judgment u_{n,m} = v_7^p ⊕ v_8^q holds, the error source is node_m's local data blocks while the second encoding results sent by the other nodes were received correctly, and node_m's correct result r_m = H(v_5^m, v_6^m, v_7^p, v_8^q) is obtained as the second operation result. Here u_{m,s} denotes the result of CDC-encoding the intermediate results corresponding to the data stored redundantly on node_m and node_s, and v_5^s denotes the intermediate result of data block 5 on node_s.
In the distributed data processing method with fault tolerance capability, the N nodes are divided into ⌊N/n⌋ sub-clusters, the computing task is distributed over the ⌊N/n⌋ sub-clusters, and the data set M is divided into ⌊N/n⌋ partial data sets sent to the ⌊N/n⌋ sub-clusters respectively. Each sub-cluster distributes its 3K' data copies to its n nodes according to the set rule, and then randomly selects a pair of check nodes node_k and node_m satisfying k + m = n + 1. The other nodes in the sub-cluster each use CDC to encode the first intermediate results computed on them from the data sets they store redundantly with node_k, obtaining the first encoding results, and send them to node_k; likewise, they use CDC to encode the second intermediate results computed from the data sets they store redundantly with node_m, obtaining the second encoding results, and send them to node_m. node_k receives and decodes the first encoding results to obtain the first decoding result, and node_m receives and decodes the second encoding results to obtain the second decoding result. node_k verifies the correctness of the first encoding results from the first decoding result and the first intermediate results, and if none is correct, the sub-cluster returns to randomly selecting a pair of check nodes satisfying k + m = n + 1; node_m likewise verifies the second encoding results from the second decoding result and the second intermediate results, and if none is correct, the sub-cluster returns to the same selection step. If at least one first encoding result and at least one second encoding result are correct, the correct result of the corresponding node is computed from the correct first encoding results to obtain the first operation result, and from the correct second encoding results to obtain the second operation result. Each sub-cluster then determines its final result from the corresponding first and second operation results. This distributed computation retains the low communication load of coded distributed computing while bringing backup maintenance and fault-recovery efficiency close to that of the replication technique, thereby ensuring high reliability of the distributed system while reducing the communication load.
Drawings
FIG. 1 is a flow diagram of a distributed data processing method with fault tolerance capabilities, according to an embodiment;
FIG. 2 is a diagram of various sub-clusters of one embodiment;
FIG. 3 is a model structure diagram of a sub-cluster of an embodiment;
FIG. 4 is a flow diagram of a distributed data processing method with fault tolerance capability according to another embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the application. The appearances of the phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It is explicitly and implicitly understood by one skilled in the art that the embodiments described herein can be combined with other embodiments.
Referring to FIG. 1, FIG. 1 is a flowchart of a distributed data processing method with fault tolerance capability according to an embodiment, including the following steps:
S10, dividing the N nodes into ⌊N/n⌋ sub-clusters and distributing the computing task over the ⌊N/n⌋ sub-clusters, so that each sub-cluster computes in parallel; the symbol ⌊·⌋ denotes rounding down, and n denotes the number of nodes in a sub-cluster.
Specifically, n may take the value 6; in this case every 6 of the N nodes form a group, each group constitutes a sub-cluster, and there are ⌊N/6⌋ sub-clusters in total. In one example, the sub-clusters may be as shown in FIG. 2. The sub-clusters are independent of each other, and the nodes within each sub-cluster may be physically distributed. The computing task is distributed over the ⌊N/6⌋ sub-clusters, and each sub-cluster computes in parallel. The 6 nodes in each sub-cluster are denoted {node_1, node_2, node_3, node_4, node_5, node_6}.
S20, dividing the data set M into ⌊N/n⌋ partial data sets and sending them to the ⌊N/n⌋ sub-clusters respectively, so that the data volume of each sub-cluster is |M|/⌊N/n⌋; each sub-cluster divides its data set B into K' independent sub-data blocks, each denoted b_i with 1 ≤ i ≤ K', and replicates each data block 3 times to produce 3K' data copies.
The value of K' may be 8. With n = 6, the data set M is divided into ⌊N/6⌋ partial data sets sent to the ⌊N/6⌋ sub-clusters respectively, so each sub-cluster holds |M|/⌊N/6⌋ data. Meanwhile, each sub-cluster divides its data set B into 8 independent sub-data blocks b_i, 1 ≤ i ≤ 8. Each data block is further replicated 3 times (triple modular redundancy), yielding 24 data copies. Triple modular redundancy (TMR) is a common fault-tolerance technique in software and hardware systems: three modules implementing the same function operate simultaneously, and the final output of the system is selected by a voting mechanism. Because the three modules are mutually independent, the probability that two modules produce the same error is very small, so the reliability of the system is greatly improved and the errors of a faulty module are masked.
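As an illustration (not part of the claimed method itself), the voting mechanism that TMR relies on can be sketched as follows; the function name `tmr_vote` and the integer outputs are hypothetical:

```python
# Minimal sketch of TMR majority voting: three independent replicas compute
# the same function, and the vote masks a single faulty replica's error.
from collections import Counter

def tmr_vote(replica_outputs):
    """Return the majority value among the three independent module outputs."""
    value, count = Counter(replica_outputs).most_common(1)[0]
    if count < 2:
        # No two replicas agree: more than one module has failed.
        raise RuntimeError("no majority among replicas")
    return value

# A single faulty replica (41) is masked by the vote.
assert tmr_vote([42, 42, 41]) == 42
```

The reliability gain comes from independence: the vote only fails when two of the three modules produce the same wrong value, which is far less likely than a single fault.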
S30, each sub-cluster distributes the 3K' data copies to the n nodes of the current sub-cluster according to the set rule, ensuring that the data blocks in each node's local data set are mutually distinct; for nodes node_k and node_m, if k + m = n + 1 the intersection of their data sets is empty and their union is the full data set of the current sub-cluster, and if k + m ≠ n + 1 they share exactly two identical redundant data blocks.
Specifically, with K' = 8 and n = 6, each sub-cluster allocates its 24 data copies to its 6 nodes, each node receiving 4 data blocks; the data set owned by a node is denoted B_t, 1 ≤ t ≤ 6. During allocation, the node data sets satisfy the following: the data blocks in each node's local data set are mutually distinct; for node_k and node_m, if k + m = 7 the intersection of their data sets is empty and their union is the full data set of the current sub-cluster; if k + m ≠ 7, node_k and node_m share exactly two identical redundant data blocks b_i, b_j, so that B_k ∩ B_m = {b_i, b_j}, with R_{k,m} defined as B_k ∩ B_m, where i and j denote data block indices. For example, B_1 = {b_1, b_2, b_3, b_4}, B_2 = {b_1, b_2, b_5, b_6}, B_3 = {b_1, b_3, b_5, b_7}, B_4 = {b_2, b_4, b_6, b_8}, B_5 = {b_3, b_4, b_7, b_8}, B_6 = {b_5, b_6, b_7, b_8}; then, for instance, R_{1,2} = {b_1, b_2}, while R_{1,6} is the empty set. Correspondingly, every other sub-cluster allocates its 24 data copies to its 6 nodes in the same way, with all node data sets satisfying the same requirements. In one example, the model structure of a sub-cluster may be as shown in FIG. 3, where each face of the solid figure represents a node and each vertex represents a data block; after the data copies are distributed to the nodes of the sub-cluster, the local data sets of the nodes are mutually distinct, and the intersection and union properties above hold for each pair node_k, node_m.
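The placement rule can be checked mechanically. The sketch below encodes the example sets B_1 … B_6, representing blocks b_1 … b_8 as the integers 1-8 (a hypothetical representation chosen for illustration), and verifies the disjointness, union and two-block-redundancy properties:

```python
# The n = 6, K' = 8 placement from the example: node t -> set of block indices.
B = {
    1: {1, 2, 3, 4},
    2: {1, 2, 5, 6},
    3: {1, 3, 5, 7},
    4: {2, 4, 6, 8},
    5: {3, 4, 7, 8},
    6: {5, 6, 7, 8},
}

def redundancy(k, m):
    """R_{k,m}: the data blocks stored on both node_k and node_m."""
    return B[k] & B[m]

for k in range(1, 7):
    for m in range(k + 1, 7):
        if k + m == 7:
            # Complementary pair: disjoint, and together covering all blocks.
            assert redundancy(k, m) == set()
            assert B[k] | B[m] == set(range(1, 9))
        else:
            # Any other pair shares exactly two redundant blocks.
            assert len(redundancy(k, m)) == 2

# Each block appears on exactly 3 nodes (triple replication -> 24 copies).
assert all(sum(b in B[t] for t in B) == 3 for b in range(1, 9))
```

The checks pass for the example placement, confirming that it simultaneously realizes triple replication and the pairwise redundancy that the CDC exchange of steps S50-S60 relies on.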
In an example, after step S30 the method further includes:
S40, each node applies a Map function G(·) to each data block of its local data set to obtain the intermediate results v_i^t = G(b_i), where i denotes the data block index and t denotes the node index; the Map function can be customized by the user for different computing tasks, so that each node obtains the intermediate results it needs.
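As a hedged illustration of step S40, the sketch below applies a made-up Map function G (here, summing the numbers in a block; any user-defined function would serve) to a node's local blocks, producing the intermediate results v_i^t:

```python
# Hypothetical Map function G for step S40: any user-defined per-block
# computation; summation stands in for a real task here.
def G(block):
    return sum(block)

# node_t's local data set: block index -> block contents (made-up data).
local_blocks = {1: [1, 2, 3], 2: [10, 20], 3: [5], 4: [4, 4]}

# v[i] models the intermediate result v_i^t for block b_i on node_t.
v = {i: G(b) for i, b in local_blocks.items()}
assert v == {1: 6, 2: 30, 3: 5, 4: 8}
```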
S50, each sub-cluster randomly selects a pair of check nodes node_k and node_m satisfying k + m = n + 1, and the remaining nodes in the sub-cluster perform steps S51 and S52, which may run in parallel:
S51, each other node in the sub-cluster uses the CDC algorithm to encode the first intermediate results corresponding to the data it stores redundantly with node_k, obtaining a first encoding result, and sends the first encoding result to node_k.
S52, each other node in the sub-cluster uses the CDC algorithm to encode the second intermediate results corresponding to the data it stores redundantly with node_m, obtaining a second encoding result, and sends the second encoding result to node_m.
CDC coding is a coding technique that uses redundancy to create coding opportunities. For example, node_1 sends the encoding result v_1^1 ⊕ v_2^1 to node_2; node_2 uses the intermediate result v_1^2 corresponding to its locally stored redundant data block b_1 to decode the coded packet and obtain the remaining intermediate result v_2^1 contained in it (⊕ denotes the exclusive-OR operation).
In particular, the above steps are performed for both node_k and node_m, so that both obtain the corresponding encoding results and intermediate results.
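The node_1/node_2 exchange just described can be sketched as follows; representing intermediate results as byte strings and XOR-ing them bytewise is an assumption made for illustration, since the description does not fix a value representation:

```python
# Sketch of one CDC exchange: two intermediate results packed into a single
# XOR packet, decoded at the receiver using its redundant copy.
def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

v1_node1 = b"\x12\x34"   # node_1's intermediate result for shared block b1
v2_node1 = b"\xab\xcd"   # node_1's intermediate result for block b2

# node_1 encodes both results into one packet (half the traffic of sending two).
u = xor(v1_node1, v2_node1)

# node_2 stores b1 redundantly, so its own Map output v_1^2 equals v_1^1;
# one XOR recovers the missing v_2^1 from the packet.
v1_node2 = v1_node1
v2_recovered = xor(u, v1_node2)
assert v2_recovered == v2_node1
```

This is the coding opportunity the redundancy creates: one packet carries two intermediate results, which is how the scheme keeps the Shuffle-phase communication load low.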
S60, after receiving all the first encoding results, node_k decodes the first encoding results obtained in S51 to get the first decoding result for each piece of data; after receiving all the second encoding results, node_m decodes the second encoding results obtained in S52 to get the second decoding results.
In this step, node_k and node_m decode the coded packets they receive (the first encoding results received by node_k and the second encoding results received by node_m) to obtain the intermediate results contained in them. node_k receives the coded packets u_{s,k}, u_{n,k}, u_{p,k}, u_{q,k}, where u_{s,k} denotes the encoding result sent by node_s to node_k, u_{n,k} that sent by node_n, u_{p,k} that sent by node_p, and u_{q,k} that sent by node_q; and node_k has locally computed the first intermediate results v_1^k, v_2^k, v_3^k, v_4^k. If the relations s + n = 7 and p + q = 7 hold, then since u_{s,k} = v_1^s ⊕ v_2^s, u_{n,k} = v_3^n ⊕ v_4^n, u_{p,k} = v_1^p ⊕ v_3^p and u_{q,k} = v_2^q ⊕ v_4^q, node_k performs the decoding steps v_2^s = u_{s,k} ⊕ v_1^k, v_3^n = u_{n,k} ⊕ v_4^k, v_1^p = u_{p,k} ⊕ v_3^k and v_4^q = u_{q,k} ⊕ v_2^k, to obtain v_2^s, v_3^n, v_1^p, v_4^q. node_m receives the coded packets u_{s,m}, u_{n,m}, u_{p,m}, u_{q,m}, where u_{s,m} denotes the encoding result sent by node_s to node_m, u_{n,m} that sent by node_n, u_{p,m} that sent by node_p, and u_{q,m} that sent by node_q; and node_m has locally computed the second intermediate results v_5^m, v_6^m, v_7^m, v_8^m. Since u_{s,m} = v_5^s ⊕ v_6^s, u_{n,m} = v_7^n ⊕ v_8^n, u_{p,m} = v_5^p ⊕ v_7^p and u_{q,m} = v_6^q ⊕ v_8^q, node_m performs the decoding steps v_6^s = u_{s,m} ⊕ v_5^m, v_7^n = u_{n,m} ⊕ v_8^m, v_5^p = u_{p,m} ⊕ v_7^m and v_8^q = u_{q,m} ⊕ v_6^m, to obtain v_6^s, v_7^n, v_5^p, v_8^q. Here v_1^s, v_2^s, v_5^s, v_6^s denote intermediate results stored locally on node_s; v_3^n, v_4^n, v_7^n, v_8^n those stored locally on node_n; v_1^p, v_3^p, v_5^p, v_7^p those stored locally on node_p; and v_2^q, v_4^q, v_6^q, v_8^q those stored locally on node_q.
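Under the XOR relations above, node_k's decoding step can be sketched with intermediate results modeled as small integers (a simplifying assumption for illustration):

```python
# Sketch of node_k's decoding in S60 for the fault-free case: every replica
# of a block computes the same Map output, modeled here as one integer per block.
v = {1: 0x11, 2: 0x22, 3: 0x33, 4: 0x44}  # true values for blocks b1..b4

# Coded packets received by node_k (s, n, p, q are the other four nodes):
u_sk = v[1] ^ v[2]   # node_s shares b1, b2 with node_k
u_nk = v[3] ^ v[4]   # node_n shares b3, b4
u_pk = v[1] ^ v[3]   # node_p shares b1, b3
u_qk = v[2] ^ v[4]   # node_q shares b2, b4

# node_k decodes each packet with one of its own local intermediate results.
v2_s = u_sk ^ v[1]
v3_n = u_nk ^ v[4]
v1_p = u_pk ^ v[3]
v4_q = u_qk ^ v[2]
assert (v2_s, v3_n, v1_p, v4_q) == (v[2], v[3], v[1], v[4])
```

When everything is fault-free, each decoded value matches node_k's own copy, which is exactly the condition the verification in S70 tests.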
S70, node_k verifies the correctness of the first encoding result for each piece of data using the first decoding result and the first intermediate results, and if no first encoding result is correct, execution returns to step S50; likewise, node_m verifies the correctness of the second encoding result using the second decoding result and the second intermediate results, and if no second encoding result is correct, execution returns to step S50. If at least one first encoding result and at least one second encoding result are correct, node_k computes the correct result of the corresponding node from the correct first encoding results to obtain the first operation result, and node_m computes the correct result of the corresponding node from the correct second encoding results to obtain the second operation result.
In this step, node_k and node_m each verify the correctness of the encoding results by comparing the intermediate results obtained by decoding (i.e. the corresponding decoding results) with the intermediate results corresponding to the local data set, and at the same time compute the correct intermediate results.
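The three-way outcome of this verification at node_k (at least two results correct, at least one correct, or retry from S50) can be sketched as follows; the function name and the returned labels are hypothetical:

```python
# Sketch of node_k's check in S70: compare the four decoded values against
# the locally computed intermediate results and classify the outcome.
def verify_first_result(local, decoded):
    """local: block index 1..4 -> local intermediate result v_i^k.
    decoded: the four values node_k recovered in S60.
    Returns ">=2 correct", ">=1 correct", or "retry" (go back to S50)."""
    a = decoded["v2_s"] == local[2] and decoded["v3_n"] == local[3]
    b = decoded["v1_p"] == local[1] and decoded["v4_q"] == local[4]
    if a or b:
        return ">=2 correct"          # all local results confirmed correct
    if (decoded["v2_s"] == local[2] or decoded["v3_n"] == local[3]
            or decoded["v1_p"] == local[1] or decoded["v4_q"] == local[4]):
        return ">=1 correct"          # partial agreement: localize the error
    return "retry"                    # nothing verifiable: reselect check nodes

local = {1: 10, 2: 20, 3: 30, 4: 40}
ok = {"v2_s": 20, "v3_n": 30, "v1_p": 10, "v4_q": 40}
assert verify_first_result(local, ok) == ">=2 correct"
bad = {"v2_s": 99, "v3_n": 98, "v1_p": 97, "v4_q": 96}
assert verify_first_result(local, bad) == "retry"
```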
And S90, determining the final result of the corresponding sub-cluster according to the corresponding first operation result and the second operation result by each sub-cluster.
In particular, when the nodekAnd a nodemCan pass the verification and obtain the correct operation result, i.e. the first operation result rkAnd a second operation result rmThen, each sub-cluster can obtain the final result r ═ H (r) of the corresponding sub-cluster through the Reduce functionk,rm)。
Specifically, the Map function and the Reduce function are the two functions of a MapReduce-based distributed computing framework; both can be customized by the user for the specific computing task so as to process the big data accordingly, while the distributed coded computing process keeps a low communication load.
The distributed data processing method with fault tolerance described above divides the N nodes into ⌊N/n⌋ sub-clusters, distributes the computing task over the ⌊N/n⌋ sub-clusters, and divides the data set M into ⌊N/n⌋ partial data sets that are sent to the ⌊N/n⌋ sub-clusters respectively. Each sub-cluster distributes 3K' pieces of data to the n nodes of the current sub-cluster according to a set rule, and then randomly selects a pair of check nodes node_k and node_m satisfying k + m = n + 1. The other nodes in the sub-cluster encode with CDC the first intermediate results computed at each node from the local data set and the data set redundantly stored with node_k, obtain the first encoding results, and send them to node_k; the other nodes in the sub-cluster likewise encode with CDC the second intermediate results computed at each node from the local data set and the data set redundantly stored with node_m, obtain the second encoding results, and send them to node_m. node_k receives the first encoding results and decodes them to obtain the first decoding results, and node_m receives the second encoding results and decodes them to obtain the second decoding results. node_k verifies the correctness of the first encoding results according to the first decoding results and the first intermediate results, and if a first encoding result is not correct, the sub-cluster returns to the step of randomly selecting a pair of check nodes node_k and node_m satisfying k + m = n + 1; or node_m verifies the correctness of the second encoding results according to the second decoding results and the second intermediate results, and if a second encoding result is not correct, the sub-cluster likewise returns to the selection step. If the number of correct encoding results among the first encoding results is greater than or equal to 1 and the number of correct encoding results among the second encoding results is greater than or equal to 1, the correct result of the corresponding node is computed from the correct first encoding results to obtain a first operation result, and the correct result of the corresponding node is computed from the correct second encoding results to obtain a second operation result. Each sub-cluster then determines its final result according to the corresponding first and second operation results. The distributed computation thus keeps the low communication load of distributed coded computing while making backup maintenance and fault-recovery efficiency close to that of the replica technique, thereby ensuring high reliability of the distributed system and reducing the communication load.
In one embodiment, in step S30, when each sub-cluster distributes the 3K' pieces of data to the n nodes of the current sub-cluster according to the set rule, the method further includes:
in each sub-cluster, each node is assigned 3K'/n pieces of data, and the data set assigned to each node is denoted B_j. For node_k and node_m, if the subscripts satisfy k + m = n + 1, the intersection of the data sets they own is empty and their union is the full set of the current sub-cluster's data set; if k + m ≠ n + 1, node_k and node_m share exactly two identical redundant pieces of data b_i, b_j, in which case B_k ∩ B_m = {b_i, b_j} and R_{k,m} ≜ B_k ∩ B_m, where the symbol ≜ means "defined as".
Specifically, when the value of K' is 8 and the value of n is 6, each sub-cluster distributes 24 pieces of data to the 6 nodes of the corresponding sub-cluster, each node is assigned 4 pieces of data, and the data set owned by a node is denoted B_j, 1 ≤ j ≤ 6. During distribution, the data sets among the nodes satisfy the following: the data blocks in each node's local data set are all distinct; if node_k and node_m satisfy k + m = 7, the intersection of the data sets they own is empty and the union is the full set of the current sub-cluster's data set; and if k + m ≠ 7, node_k and node_m share exactly two identical redundant pieces of data b_i, b_j, in which case B_k ∩ B_m = {b_i, b_j} and R_{k,m} = B_k ∩ B_m is defined, where i, j denote data block indices.
For example, B_1 = {b_1, b_2, b_3, b_4}, B_2 = {b_1, b_2, b_5, b_6}, B_3 = {b_1, b_3, b_5, b_7}, B_4 = {b_2, b_4, b_6, b_8}, B_5 = {b_3, b_4, b_7, b_8}, B_6 = {b_5, b_6, b_7, b_8}. Then R_{1,2} = {b_1, b_2}, while R_{1,6} is the empty set.
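The distribution rule can be checked mechanically. The following sketch (Python; block indices are plain integers and the assignment table is taken from the example above) verifies that complementary node pairs (k + m = 7) partition the data set while every other pair shares exactly two redundant blocks, and that each block is stored on exactly three nodes (triple modular redundancy):

```python
from itertools import combinations

n, K = 6, 8   # nodes per sub-cluster, number of distinct blocks (K' above)

# Block-index assignment from the example: B[j] is the set held by node j.
B = {1: {1, 2, 3, 4}, 2: {1, 2, 5, 6}, 3: {1, 3, 5, 7},
     4: {2, 4, 6, 8}, 5: {3, 4, 7, 8}, 6: {5, 6, 7, 8}}

def check_distribution(B, n, K):
    """Verify the set rule: a pair with k + m = n + 1 partitions the blocks
    (empty intersection, full union); any other pair shares exactly two."""
    full = set(range(1, K + 1))
    for k, m in combinations(B, 2):
        shared = B[k] & B[m]
        if k + m == n + 1:
            assert shared == set() and B[k] | B[m] == full
        else:
            assert len(shared) == 2   # R_{k,m} contains exactly two blocks
    return True

# Triple modular redundancy: every block is stored on exactly 3 nodes.
copies = [sum(i in Bj for Bj in B.values()) for i in range(1, K + 1)]
print(check_distribution(B, n, K), copies)  # True [3, 3, 3, 3, 3, 3, 3, 3]
```

The same check applies to any other assignment claimed to satisfy the rule of step S30.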
In one embodiment, before step S50, the method further includes:
S40, each node executes the Map function G(·) on each data block of its local data set to obtain the intermediate results v_i^t = G(b_i); where i denotes the data block sequence number, t denotes the node sequence number, and the Map function can be customized by the user for different computing tasks.
In this embodiment, each node of the sub-cluster executes the Map function G(b_i) on each data block b_i of its local data set and obtains the corresponding set of intermediate results {v_i^t | b_i ∈ B_t}, so that each node obtains the intermediate results it needs. Specifically, the Map function and the Reduce function are the functions of MapReduce, a computing model for parallel processing of big data: in the Map stage, the data set is distributed to the nodes to divide the task, each node computes over its own data set, the computation is represented by the function G(·), and the result of the computation is an intermediate result v. The Reduce stage merges the intermediate results produced by the Map stage: after the Map stage finishes, the intermediate results are sent to the Reduce node for the corresponding operation, yielding the required operation result.
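As a concrete illustration of the Map/Reduce pair, the sketch below assumes a hypothetical summation task: G computes a per-block partial sum and H merges the intermediate results. The function bodies and the sample data are stand-ins for whatever the user defines for the actual computing task; they are not specified by the method itself:

```python
# Hypothetical task: each data block is a list of numbers; the Map function G
# computes a per-block partial sum and the Reduce function H merges the
# intermediate results v into one final value.
def G(block):                 # Map: data block b_i -> intermediate result v_i
    return sum(block)

def H(*intermediates):        # Reduce: merge several intermediate results
    return sum(intermediates)

blocks = {1: [1, 2], 2: [3, 4], 3: [5], 4: [6, 7, 8]}   # hypothetical b_i
v = {i: G(b) for i, b in blocks.items()}   # Map stage at each node
result = H(*v.values())                    # Reduce stage merges the v_i
print(result)  # 36
```

Any pair of functions with this shape (per-block Map, merging Reduce) fits the framework described above.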
In one embodiment, n has a value of 6 and K' has a value of 8.
In one embodiment, step S50 includes:
each sub-cluster randomly selects a pair of check nodes node_k and node_m with k + m = 7, and the remaining four nodes in each sub-cluster are denoted node_s, node_n, node_p, node_q. Each of node_s, node_n, node_p, node_q encodes with CDC the first intermediate results computed at that node from its local data set and the data set R_{s,k}, R_{n,k}, R_{p,k}, R_{q,k} redundantly stored with node_k, obtains the encoding results u_{s,k}, u_{n,k}, u_{p,k}, u_{q,k} of the first intermediate results, i.e., the first encoding results, and sends the first encoding results to node_k. Each of node_s, node_n, node_p, node_q likewise encodes with CDC the second intermediate results computed at that node from its local data set and the data set R_{s,m}, R_{n,m}, R_{p,m}, R_{q,m} redundantly stored with node_m, obtains the encoding results u_{s,m}, u_{n,m}, u_{p,m}, u_{q,m}, i.e., the second encoding results, and sends the second encoding results to node_m.
The first intermediate results comprise the intermediate results determined by node_s, node_n, node_p, node_q from their local data sets and the data sets redundantly stored with node_k, and the first encoding results comprise the results obtained by node_s, node_n, node_p, node_q encoding the corresponding first intermediate results with CDC, where u_{s,k} denotes the first encoding result of node_s, u_{n,k} that of node_n, u_{p,k} that of node_p, and u_{q,k} that of node_q. The second intermediate results comprise the intermediate results determined by node_s, node_n, node_p, node_q from their local data sets and the data sets redundantly stored with node_m, and the second encoding results comprise the results obtained by encoding the corresponding second intermediate results with CDC, where u_{s,m}, u_{n,m}, u_{p,m}, u_{q,m} denote the second encoding results of node_s, node_n, node_p, node_q respectively.
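The CDC encoding used here can be sketched as follows, assuming (as in the worked example later in the description) that each sender XORs the two intermediate results of the blocks it redundantly stores with the check node, and modeling intermediate results as integers so that XOR applies directly; the concrete values are hypothetical:

```python
# Sketch of the CDC encoding step; intermediate results are modeled as
# integers so that the coding operation (bitwise XOR) applies directly.
def cdc_encode(v_sender, shared_blocks):
    """XOR the sender's intermediate results for the two data blocks it
    redundantly stores with the check node (the set R_{sender,check})."""
    i, j = sorted(shared_blocks)
    return v_sender[i] ^ v_sender[j]

# Example: node_s (node_2 in the embodiment) shares R_{2,1} = {b1, b2} with
# check node node_1; the intermediate-result values are hypothetical.
v_node2 = {1: 0x1A, 2: 0x2B, 5: 0x3C, 6: 0x4D}
u_2_1 = cdc_encode(v_node2, {1, 2})   # u_{2,1} = v_1^2 XOR v_2^2
print(hex(u_2_1))  # 0x31
```

Because each packet combines two intermediate results into one, each sender transmits one packet per check node instead of two raw results, which is where the communication saving of coded computation comes from.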
As one example, step S70 includes:
if (v_2^s = v_2^k) ∧ (v_3^n = v_3^k) holds or (v_1^p = v_1^k) ∧ (v_4^q = v_4^k) holds, it indicates that the number of correct encoding results among node_k's first encoding results is greater than or equal to 2 and that all intermediate results of node_k's local data set were computed correctly; the operation result of the current node's data set is then computed through the Reduce function as r_k = H(v_1^k, v_2^k, v_3^k, v_4^k), obtaining the first operation result. If (v_6^s = v_6^m) ∧ (v_7^n = v_7^m) holds or (v_5^p = v_5^m) ∧ (v_8^q = v_8^m) holds, it indicates that the number of correct encoding results among node_m's second encoding results is greater than or equal to 2 and that all intermediate results of node_m's local data set were computed correctly; the operation result of the current node's data set is then computed through the Reduce function as r_m = H(v_5^m, v_6^m, v_7^m, v_8^m), obtaining the second operation result. Here v_2^s denotes the intermediate result for data sequence number 2 on node_s; correspondingly, v_1^k, v_2^k, v_3^k, v_4^k all denote the intermediate results for the corresponding data sequence numbers on node_k; v_3^n denotes the intermediate result for data sequence number 3 on node_n, v_1^p that for sequence number 1 on node_p, v_4^q that for sequence number 4 on node_q, and v_6^s that for data sequence number (i.e., data block sequence number) 6 on node_s; likewise v_5^m, v_6^m, v_7^m, v_8^m all denote the intermediate results for the corresponding data sequence numbers on node_m, v_7^n that for sequence number 7 on node_n, v_5^p that for sequence number 5 on node_p, and v_8^q that for sequence number 8 on node_q. H(·) denotes the function of the Reduce stage, whose role is to merge multiple intermediate results into one, and the symbol ∧ denotes the logical AND operation. v_1^k, v_2^k, v_3^k, v_4^k all denote the intermediate results node_k obtains by computing over its own data blocks in the Map stage, and v_5^m, v_6^m, v_7^m, v_8^m all denote the intermediate results node_m obtains by computing over its own data blocks in the Map stage;
if (v_2^s = v_2^k) ∧ (v_3^n = v_3^k) is false and (v_1^p = v_1^k) ∧ (v_4^q = v_4^k) is false, suppose at least one correct encoding result still exists among the first encoding results, i.e., at least one of the 4 encoded data packets received by node_k verifies as correct, e.g., v_2^s = v_2^k, v_3^n ≠ v_3^k, v_1^p ≠ v_1^k, v_4^q ≠ v_4^k. If (v_6^s = v_6^m) ∧ (v_7^n = v_7^m) is false and (v_5^p = v_5^m) ∧ (v_8^q = v_8^m) is false, suppose at least one correct encoding result exists among the second encoding results, i.e., at least one of the 4 encoded data packets received by node_m verifies as correct, e.g., v_6^s = v_6^m, v_7^n ≠ v_7^m, v_5^p ≠ v_5^m, v_8^q ≠ v_8^m.
If v_2^s ≠ v_2^k, v_3^n ≠ v_3^k, v_1^p ≠ v_1^k, v_4^q ≠ v_4^k, the first encoding result is judged incorrect, meaning u_{s,k}, u_{n,k}, u_{p,k}, u_{q,k} all fail verification, and execution returns to step S50; and/or if v_6^s ≠ v_6^m, v_7^n ≠ v_7^m, v_5^p ≠ v_5^m, v_8^q ≠ v_8^m, the second encoding result is judged incorrect, meaning u_{s,m}, u_{n,m}, u_{p,m}, u_{q,m} all fail verification, and execution returns to step S50, so that the corresponding sub-cluster randomly selects a new pair of check nodes node_k and node_m satisfying k + m = n + 1 and performs the corresponding data processing again.
As one example, after step S70, when the number of correct encoding results among node_k's first encoding results is at least 1 and the number of correct encoding results among node_m's second encoding results is at least 1, the method further includes:
if v_2^s = v_2^k, it indicates that u_{s,k} verifies successfully; then u_{s,k} = u_{k,s}, v_1^k = v_1^s, v_2^k = v_2^s. Through v_3^p = u_{p,k} ⊕ v_1^k one obtains v_3^p, and through v_4^q = u_{q,k} ⊕ v_2^k one obtains v_4^q; it is then judged whether u_{n,k} = v_3^p ⊕ v_4^q holds. If it holds, it indicates that the error source is node_k's local data blocks while the first encoding results sent by the other nodes were received correctly by node_k, and node_k's correct result r_k = H(v_1^k, v_2^k, v_3^p, v_4^q) is obtained, i.e., the first operation result; where u_{k,s} denotes the result of encoding with CDC the intermediate results corresponding to the redundant data stored on node_k and node_s, and ⊕ denotes the logical exclusive-or;
if v_6^s = v_6^m, it indicates that u_{s,m} verifies successfully; then u_{s,m} = u_{m,s}, v_6^m = v_6^s, v_5^m = v_5^s. Through v_7^p = u_{p,m} ⊕ v_5^m one obtains v_7^p, and through v_8^q = u_{q,m} ⊕ v_6^m one obtains v_8^q; it is then judged whether u_{n,m} = v_7^p ⊕ v_8^q holds. If it holds, it indicates that the error source is node_m's local data blocks while the second encoding results sent by the other nodes were received correctly by node_m, and node_m's correct result r_m = H(v_5^m, v_6^m, v_7^p, v_8^q) is obtained, i.e., the second operation result; where u_{m,s} denotes the result of encoding with CDC the intermediate results corresponding to the redundant data stored on node_m and node_s, and v_5^s denotes the intermediate result for data sequence number 5 on node_s.
In this example, at node_k, since v_2^s = v_2^k, i.e., u_{s,k} verifies successfully, u_{s,k} = u_{k,s}, v_1^k = v_1^s, v_2^k = v_2^s. Then v_3^p is obtained through v_3^p = u_{p,k} ⊕ v_1^k and v_4^q through v_4^q = u_{q,k} ⊕ v_2^k. It is then judged whether u_{n,k} = v_3^p ⊕ v_4^q holds; if so, the error source is node_k's local data blocks, but the encoded data packets received from the other nodes are correct, and the correct result of the current node, r_k = H(v_1^k, v_2^k, v_3^p, v_4^q), i.e., the first operation result, can still be obtained; S90 is then further executed. Correspondingly, node_m can process its data in the same way to obtain r_m = H(v_5^m, v_6^m, v_7^p, v_8^q), determining the second operation result, and S90 is further executed.
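The verify-and-recover path at the check node can be sketched as below. The packet layout follows the embodiment (node_k holds blocks b_1 to b_4; the four senders' shared pairs are {b_1,b_2}, {b_1,b_3}, {b_2,b_4}, {b_3,b_4}), intermediate results are modeled as integers under the XOR-based encoding of the worked example, and all concrete values are hypothetical; only the branch described above (u_{s,k} verifies) is implemented:

```python
# Sketch of node_k's recovery path (integers stand in for intermediate
# results; block layout follows the embodiment: B_k = {b1, b2, b3, b4}).
def recover_at_check_node(v_local, u):
    """v_local: node_k's own Map results for blocks 1-4 (possibly partly wrong).
    u: received encodings keyed by sender role, with
       u['s'] = v1^v2, u['p'] = v1^v3, u['q'] = v2^v4, u['n'] = v3^v4."""
    # u_{s,k} verifies iff decoding it with the local v1 reproduces the local v2
    if u['s'] ^ v_local[1] == v_local[2]:
        v3 = u['p'] ^ v_local[1]          # v_3^p = u_{p,k} XOR v_1^k
        v4 = u['q'] ^ v_local[2]          # v_4^q = u_{q,k} XOR v_2^k
        assert u['n'] == v3 ^ v4          # consistency: u_{n,k} = v3 XOR v4
        return [v_local[1], v_local[2], v3, v4]
    return None  # would fall back to re-selecting check nodes (step S50)

# True intermediate results for blocks 1..4, with node_k having mis-computed
# blocks 3 and 4 locally:
v_k = {1: 11, 2: 22, 3: 99, 4: 98}        # local errors on blocks 3 and 4
u = {'s': 11 ^ 22, 'p': 11 ^ 33, 'q': 22 ^ 44, 'n': 33 ^ 44}
print(recover_at_check_node(v_k, u))  # [11, 22, 33, 44]
```

Note that the recovered list contains the correct values 33 and 44 even though node_k's own copies were wrong, which is exactly the fault-tolerance property claimed for this step.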
In one embodiment, it is assumed that in a distributed cluster containing N = 6 nodes, the amount of data to be processed is M. The execution of the distributed data processing method with fault tolerance described above is shown with reference to FIG. 4 and includes the following steps:
step 1: the 6 nodes are grouped into a sub-cluster, and the 6 nodes can be respectively used as { node1,node2,node3,node4,node5,node6Represents it.
Step 2: the data set M is divided into ⌊N/n⌋ = ⌊6/6⌋ = 1 subset, i.e., the data set M is sent to the current sub-cluster. The sub-cluster's data set M is partitioned into 8 independent sub-data blocks {b_1, b_2, b_3, b_4, b_5, b_6, b_7, b_8}. Each data block is then replicated 3 times (i.e., triple modular redundancy), yielding 24 pieces of data.
Step 3: the 24 pieces of data are distributed to the 6 nodes of the current sub-cluster according to the set rule, so that each node is assigned 4 pieces of data, with B_1 = {b_1, b_2, b_3, b_4}, B_2 = {b_1, b_2, b_5, b_6}, B_3 = {b_1, b_3, b_5, b_7}, B_4 = {b_2, b_4, b_6, b_8}, B_5 = {b_3, b_4, b_7, b_8}, B_6 = {b_5, b_6, b_7, b_8}.
Step 4: each node executes the Map function G(b_i) on each data block b_i of its local data set to obtain a corresponding set of intermediate results, denoted V_j, where j = 1, 2, 3, 4, 5 or 6; then V_1 = {v_1^1, v_2^1, v_3^1, v_4^1}, V_2 = {v_1^2, v_2^2, v_5^2, v_6^2}, V_3 = {v_1^3, v_3^3, v_5^3, v_7^3}, V_4 = {v_2^4, v_4^4, v_6^4, v_8^4}, V_5 = {v_3^5, v_4^5, v_7^5, v_8^5}, V_6 = {v_5^6, v_6^6, v_7^6, v_8^6}. To illustrate the fault-tolerance procedure in case of errors, suppose that the intermediate results v_3^1 and v_4^1 computed by node_1 in the Map stage are erroneous.
Step 5: a pair of check nodes node_1 and node_6 is randomly selected. The remaining four nodes {node_2, node_3, node_4, node_5} of the sub-cluster encode with CDC the intermediate results corresponding to the data sets R_{2,1}, R_{3,1}, R_{4,1}, R_{5,1} redundantly stored with node_1 and the data sets R_{2,6}, R_{3,6}, R_{4,6}, R_{5,6} redundantly stored with node_6, obtaining the corresponding encoding results u_{2,1}, u_{3,1}, u_{4,1}, u_{5,1}, which are sent to node_1, and u_{2,6}, u_{3,6}, u_{4,6}, u_{5,6}, which are sent to node_6.

The results obtained with CDC encoding in step 5 are u_{2,1} = v_1^2 ⊕ v_2^2, u_{3,1} = v_1^3 ⊕ v_3^3, u_{4,1} = v_2^4 ⊕ v_4^4, u_{5,1} = v_3^5 ⊕ v_4^5, and u_{2,6} = v_5^2 ⊕ v_6^2, u_{3,6} = v_5^3 ⊕ v_7^3, u_{4,6} = v_6^4 ⊕ v_8^4, u_{5,6} = v_7^5 ⊕ v_8^5.
Step 6: node_1 and node_6 decode the received encoded data packets to obtain intermediate results. node_1 receives u_{2,1}, u_{3,1}, u_{4,1}, u_{5,1} and, by decoding v_2^2 = u_{2,1} ⊕ v_1^1, v_1^3 = u_{3,1} ⊕ v_3^1, v_4^4 = u_{4,1} ⊕ v_2^1, v_3^5 = u_{5,1} ⊕ v_4^1, obtains v_2^2, v_3^5, v_1^3, v_4^4; node_6 proceeds likewise.
Step 7: node_1 verifies the correctness of the encoding results by comparing the intermediate results obtained by decoding with the intermediate results corresponding to its local data set, obtaining the correct intermediate results at the same time. Step 4 supposed that v_3^1 and v_4^1 were computed erroneously, so (v_2^2 = v_2^1) ∧ (v_3^5 = v_3^1) and (v_1^3 = v_1^1) ∧ (v_4^4 = v_4^1) are both false, while v_2^2 = v_2^1, v_3^5 ≠ v_3^1, v_1^3 ≠ v_1^1, v_4^4 ≠ v_4^1; step 8 is executed.
Step 8: since v_2^2 = v_2^1, i.e., u_{2,1} verifies successfully, v_1^1 = v_1^2 and v_2^1 = v_2^2. Then v_3^3 is obtained through v_3^3 = u_{3,1} ⊕ v_1^1, and v_4^4 through v_4^4 = u_{4,1} ⊕ v_2^1. Meanwhile, u_{5,1} = v_3^3 ⊕ v_4^4 holds, indicating that node_1's local data blocks were computed incorrectly while the received encoded data packets were computed correctly; the correct result of the current node, r_1 = H(v_1^1, v_2^1, v_3^3, v_4^4), can then be obtained, and step 9 is executed.
Step 9: node_1 and node_6 both pass verification and obtain the correct operation results r_1 and r_6; the final result of the current sub-cluster, r = H(r_1, r_6), is then obtained through the Reduce function.
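The nine steps above can be simulated end to end. The sketch below assumes a hypothetical task (each data block is a list of numbers, G = sum, H = sum of its arguments), uses the XOR-based encoding of step 5, injects Map-stage errors at node_1 exactly as supposed in step 4, and recovers the correct sub-cluster result; all data values are invented for illustration:

```python
# End-to-end sketch of the 6-node embodiment. Hypothetical task: each data
# block is a list of numbers, G = sum, H = sum of intermediate results.
B = {1: [1, 2, 3, 4], 2: [1, 2, 5, 6], 3: [1, 3, 5, 7],
     4: [2, 4, 6, 8], 5: [3, 4, 7, 8], 6: [5, 6, 7, 8]}   # blocks per node
data = {i: list(range(i, i + 3)) for i in range(1, 9)}     # blocks b1..b8

G = sum                        # Map function (per-block partial sum)
def H(vals):                   # Reduce function (merge intermediate results)
    return sum(vals)

# Map stage: v[j][i] = G(b_i) computed at node j; inject errors at node_1.
v = {j: {i: G(data[i]) for i in B[j]} for j in B}
v[1][3] += 1
v[1][4] += 1                   # node_1 mis-computes v_3^1 and v_4^1 (step 4)

def encode(j, check):          # u_{j,check}: XOR over the two shared blocks
    i1, i2 = sorted(set(B[j]) & set(B[check]))
    return v[j][i1] ^ v[j][i2]

def recover(check, senders):
    """Steps 6-8 at a check node whose blocks are (i1,i2,i3,i4) and whose
    senders s, p, q, n share {i1,i2}, {i1,i3}, {i2,i4}, {i3,i4} with it."""
    i1, i2, i3, i4 = sorted(B[check])
    s, p, q, n = senders
    if encode(s, check) ^ v[check][i1] == v[check][i2]:   # u_{s,.} verifies
        v3 = encode(p, check) ^ v[check][i1]              # recover v_{i3}
        v4 = encode(q, check) ^ v[check][i2]              # recover v_{i4}
        assert encode(n, check) == v3 ^ v4                # consistency check
        return H([v[check][i1], v[check][i2], v3, v4])
    raise RuntimeError("all encodings failed; re-select check nodes (S50)")

r1 = recover(1, (2, 3, 4, 5))  # node_1 recovers despite its local errors
r6 = recover(6, (2, 3, 4, 5))  # node_6 has no errors
r = H([r1, r6])                # step 9: final sub-cluster result
print(r == sum(G(b) for b in data.values()))  # True
```

The final comparison checks that the fault-tolerant path reproduces the result a fault-free MapReduce run would have produced on the same data.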
The triple modular redundancy fault-tolerant algorithm based on distributed coded computing provided in this embodiment keeps the low communication load of distributed coded computing while making backup maintenance and fault-recovery efficiency close to that of the replica technique, thereby ensuring high reliability of the distributed system under a low communication load.
The terms "comprising" and "having" and any variations thereof in the embodiments of the present application are intended to cover non-exclusive inclusions. For example, a process, method, apparatus, product, or device that comprises a list of steps or modules is not limited to the listed steps or modules but may alternatively include other steps or modules not listed or inherent to such process, method, product, or device.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (7)

1. A distributed data processing method with fault tolerance capability is characterized by comprising the following steps:
S10, dividing the N nodes into ⌊N/n⌋ sub-clusters to distribute the computing task over the ⌊N/n⌋ sub-clusters, each sub-cluster computing in parallel; wherein the symbol ⌊·⌋ denotes rounding down and n denotes the number of nodes in a sub-cluster;
S20, dividing the data set M into ⌊N/n⌋ partial data sets and sending them respectively to the ⌊N/n⌋ sub-clusters, so that the data volume of each sub-cluster is M/⌊N/n⌋; each sub-cluster divides its data set B into K' independent sub-data blocks, each data block being denoted b_i with 1 ≤ i ≤ K', and each data block is replicated in 3 copies, generating 3K' pieces of data;
S30, each sub-cluster distributes the 3K' pieces of data to the n nodes of the current sub-cluster according to a set rule, so that the data blocks in each node's local data set are all distinct; if node_k and node_m have subscripts satisfying k + m = n + 1, the intersection of the data sets they own is empty and the union is the full set of the current sub-cluster's data set, and if k + m ≠ n + 1, node_k and node_m have exactly two identical redundant pieces of data;
S50, each sub-cluster randomly selects a pair of check nodes node_k and node_m satisfying k + m = n + 1, and the remaining nodes in the sub-cluster perform the operations of steps S51 and S52, which two steps may be performed in parallel:
S51, each of the other nodes in the sub-cluster encodes, using the CDC algorithm, the first intermediate results corresponding to the data set stored locally and redundantly with node_k, obtains the first encoding results, and sends the first encoding results to node_k;
S52, each of the other nodes in the sub-cluster encodes, using the CDC algorithm, the second intermediate results corresponding to the data set stored locally and redundantly with node_m, obtains the second encoding results, and sends the second encoding results to node_m;
S60, after receiving all the first encoding results, node_k decodes the first encoding results obtained in S51 to obtain the first decoding result of each piece of data, and after receiving all the second encoding results, node_m decodes the second encoding results obtained in S52 to obtain the second decoding results;
S70, node_k verifies the correctness of the first encoding result of each piece of data according to the first decoding result and the first intermediate result, and if the first encoding result is not correct, execution returns to step S50; or node_m verifies the correctness of the second encoding result according to the second decoding result and the second intermediate result, and if the second encoding result is not correct, execution returns to step S50; if the number of correct encoding results among the first encoding results is greater than or equal to 1 and the number of correct encoding results among the second encoding results is greater than or equal to 1, node_k computes the correct result of the corresponding node according to the correct encoding results among the first encoding results to obtain a first operation result, and node_m computes the correct result of the corresponding node according to the correct encoding results among the second encoding results to obtain a second operation result;
and S90, each sub-cluster determines the final result of the corresponding sub-cluster according to the corresponding first operation result and second operation result.
2. The distributed data processing method with fault tolerance according to claim 1, wherein in step S30, when each sub-cluster distributes the 3K' pieces of data to the n nodes of the current sub-cluster according to the set rule, the method further comprises:

in each sub-cluster, each node is assigned 3K'/n pieces of data, and the data set assigned to each node is denoted B_j; if node_k and node_m have subscripts satisfying k + m = n + 1, the intersection of the data sets they own is empty and the union is the full set of the current sub-cluster's data set, and if k + m ≠ n + 1, node_k and node_m have exactly two identical redundant pieces of data b_i, b_j, in which case B_k ∩ B_m = {b_i, b_j} and R_{k,m} ≜ B_k ∩ B_m, where the symbol ≜ means "defined as".
3. The distributed data processing method with fault tolerance according to claim 2, wherein before step S50, the method further comprises:

S40, each node executes the Map function G(·) on each data block of its local data set to obtain the intermediate results v_i^t = G(b_i);

wherein i denotes the data block sequence number, t denotes the node sequence number, and the Map function can be customized by the user for different computing tasks.
4. The distributed data processing method with fault tolerance of claim 1, wherein the value of n is 6 and the value of K' is 8.
5. The distributed data processing method with fault tolerance capability according to claim 4, wherein the step S50 includes:
each sub-cluster randomly selects a pair of check nodes node_k and node_m with k + m = 7, and the remaining four nodes in each sub-cluster are denoted node_s, node_n, node_p, node_q; each of node_s, node_n, node_p, node_q encodes with CDC the first intermediate results computed at that node from its local data set and the data set R_{s,k}, R_{n,k}, R_{p,k}, R_{q,k} redundantly stored with node_k, obtains the encoding results u_{s,k}, u_{n,k}, u_{p,k}, u_{q,k} of the first intermediate results, i.e., the first encoding results, and sends the first encoding results to node_k; each of node_s, node_n, node_p, node_q likewise encodes with CDC the second intermediate results computed at that node from its local data set and the data set R_{s,m}, R_{n,m}, R_{p,m}, R_{q,m} redundantly stored with node_m, obtains the encoding results u_{s,m}, u_{n,m}, u_{p,m}, u_{q,m}, i.e., the second encoding results, and sends the second encoding results to node_m.
6. The distributed data processing method with fault tolerance capability according to claim 5, wherein step S70 includes:
if (v_2^s = v_2^k) ∧ (v_3^n = v_3^k) holds or (v_1^p = v_1^k) ∧ (v_4^q = v_4^k) holds, it indicates that the number of correct encoding results among node_k's first encoding results is greater than or equal to 2 and that all of node_k's local intermediate results were computed correctly, and the operation result of the current node's data set is computed through the Reduce function as r_k = H(v_1^k, v_2^k, v_3^k, v_4^k), obtaining the first operation result; if (v_6^s = v_6^m) ∧ (v_7^n = v_7^m) holds or (v_5^p = v_5^m) ∧ (v_8^q = v_8^m) holds, it indicates that the number of correct encoding results among node_m's second encoding results is greater than or equal to 2 and that all of node_m's local intermediate results were computed correctly, and the operation result of the current node's data set is computed through the Reduce function as r_m = H(v_5^m, v_6^m, v_7^m, v_8^m), obtaining the second operation result; wherein v_2^s denotes the intermediate result for data sequence number 2 on node_s; correspondingly, v_1^k, v_2^k, v_3^k, v_4^k all denote the intermediate results for the corresponding data sequence numbers on node_k; v_3^n denotes the intermediate result for data sequence number 3 on node_n, v_1^p that for sequence number 1 on node_p, v_4^q that for sequence number 4 on node_q, and v_6^s that for data sequence number 6 on node_s; likewise v_5^m, v_6^m, v_7^m, v_8^m all denote the intermediate results for the corresponding data sequence numbers on node_m, v_7^n that for sequence number 7 on node_n, v_5^p that for sequence number 5 on node_p, and v_8^q that for sequence number 8 on node_q; H(·) denotes a function of the Reduce stage, whose role is to merge a plurality of intermediate results into one, and the symbol ∧ denotes the logical AND operation; v_1^k, v_2^k, v_3^k, v_4^k all denote the intermediate results obtained by node_k computing over its own data blocks in the Map stage, and v_5^m, v_6^m, v_7^m, v_8^m all denote the intermediate results obtained by node_m computing over its own data blocks in the Map stage;
if (v_2^s = v_2^k) ∧ (v_3^n = v_3^k) is false and (v_1^p = v_1^k) ∧ (v_4^q = v_4^k) is false, it indicates that at least one correct encoding result exists among the first encoding results; if (v_6^s = v_6^m) ∧ (v_7^n = v_7^m) is false and (v_5^p = v_5^m) ∧ (v_8^q = v_8^m) is false, it indicates that at least one correct encoding result exists among the second encoding results; if v_2^s ≠ v_2^k, v_3^n ≠ v_3^k, v_1^p ≠ v_1^k, v_4^q ≠ v_4^k, the first encoding result is judged incorrect and execution returns to step S50, or if v_6^s ≠ v_6^m, v_7^n ≠ v_7^m, v_5^p ≠ v_5^m, v_8^q ≠ v_8^m, the second encoding result is judged incorrect and execution returns to step S50.
7. The distributed data processing method with fault tolerance according to claim 6, wherein after step S70, when the number of correct encoding results among node_k's first encoding results is at least 1 and the number of correct encoding results among node_m's second encoding results is at least 1, the method further comprises:
if v_2^s = v_2^k, it indicates that u_{s,k} verifies successfully, and then u_{s,k} = u_{k,s}, v_1^k = v_1^s, v_2^k = v_2^s; v_3^p is obtained through v_3^p = u_{p,k} ⊕ v_1^k and v_4^q through v_4^q = u_{q,k} ⊕ v_2^k; if it is judged that u_{n,k} = v_3^p ⊕ v_4^q holds, it indicates that the error source is node_k's local data blocks while the first encoding results sent by the other nodes were received correctly by node_k, and node_k's correct result r_k = H(v_1^k, v_2^k, v_3^p, v_4^q) is obtained, i.e., the first operation result; wherein u_{k,s} denotes the result of encoding with CDC the intermediate results corresponding to the redundant data stored on node_k and node_s, and ⊕ denotes the logical exclusive-or;
if v_6^s = v_6^m, it indicates that u_{s,m} verifies successfully, and then u_{s,m} = u_{m,s}, v_6^m = v_6^s, v_5^m = v_5^s; v_7^p is obtained through v_7^p = u_{p,m} ⊕ v_5^m and v_8^q through v_8^q = u_{q,m} ⊕ v_6^m; if it is judged that u_{n,m} = v_7^p ⊕ v_8^q holds, it indicates that the error source is node_m's local data blocks while the second encoding results sent by the other nodes were received correctly by node_m, and node_m's correct result r_m = H(v_5^m, v_6^m, v_7^p, v_8^q) is obtained, i.e., the second operation result; wherein u_{m,s} denotes the result of encoding with CDC the intermediate results corresponding to the redundant data stored on node_m and node_s, and v_5^s denotes the intermediate result for data sequence number 5 on node_s.
CN201911057644.3A 2019-11-01 2019-11-01 Distributed data processing method with fault tolerance capability Active CN111045843B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911057644.3A CN111045843B (en) 2019-11-01 2019-11-01 Distributed data processing method with fault tolerance capability


Publications (2)

Publication Number Publication Date
CN111045843A CN111045843A (en) 2020-04-21
CN111045843B true CN111045843B (en) 2021-09-28



Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111490795B (en) * 2020-05-25 2021-09-24 南京大学 Intermediate value length isomerism-oriented encoding MapReduce method
CN111858721B (en) * 2020-08-03 2023-07-21 南京大学 Distributed computing method based on priority coding
CN113434299B (en) * 2021-07-05 2024-02-06 广西师范大学 Coding distributed computing method based on MapReduce framework

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101834899A (en) * 2010-04-29 2010-09-15 Chengdu Information Technology Co., Ltd. of the Chinese Academy of Sciences Distributed adaptive coding and storing method
CN103064728A (en) * 2012-12-13 2013-04-24 河南工业大学 Fault-tolerant scheduling method of Map Reduce task
CN108052679A (en) * 2018-01-04 2018-05-18 焦点科技股份有限公司 A kind of Log Analysis System based on HADOOP
US10114581B1 (en) * 2016-12-27 2018-10-30 EMC IP Holding Company LLC Creating a virtual access point in time on an object based journal replication

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
A Fundamental Tradeoff Between Computation and Communication in Distributed Computing; LI Songze, MADDAH-ALI M A, YU Qian, et al.; IEEE Transactions on Information Theory; 2018-01-31; full text *

Also Published As

Publication number Publication date
CN111045843A (en) 2020-04-21

Similar Documents

Publication Publication Date Title
CN111045843B (en) Distributed data processing method with fault tolerance capability
US10594340B2 (en) Disaster recovery with consolidated erasure coding in geographically distributed setups
CN109643258B (en) Multi-node repair using high-rate minimal storage erase code
WO2020047707A1 (en) Data coding, decoding and repairing method for distributed storage system
CN109491835B (en) Data fault-tolerant method based on dynamic block code
CN107003933B (en) Method and device for constructing partial copy code and data restoration method thereof
KR20130012591A (en) State history storage for synchronizing redundant processors
CN102843212B (en) Coding and decoding processing method and device
US11467905B1 (en) Stripe merging method and system based on erasure codes
Esmaili et al. The core storage primitive: Cross-object redundancy for efficient data repair & access in erasure coded storage
CN109828723B (en) Distributed storage system and accurate data restoration method and device thereof
CN113541870A (en) Recovery optimization method for erasure code storage single node failure
CN107103360B (en) Reliable application distribution distributed genetic method based on mixed cloud
CN109344009B (en) Mobile cloud system fault-tolerant method based on hierarchical check points
CN110888761A (en) Fault-tolerant method based on active backup of key task part and stream processing platform
CN107153661A (en) A kind of storage, read method and its device of the data based on HDFS systems
WO2022246952A1 (en) Fault tolerance method and system based on multi-master-node master-slave distributed architecture
CN114281508A (en) Data batch-flow fusion offline calculation method
CN111459710B (en) Erasure code memory recovery method, device and memory system capable of sensing heat degree and risk
WO2018119976A1 (en) Efficient data layout optimization method for data warehouse system
CN111045853A (en) Method and device for improving erasure code recovery speed and background server
US10599520B2 (en) Meta-copysets for fault-tolerant data storage
CN112732203B (en) Regeneration code construction method, file reconstruction method and node repair method
CN110908835B (en) Data redundancy method and system supporting private label in distributed system
JP2012033169A (en) Method and device for supporting live check pointing, synchronization, and/or recovery using coding in backup system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant