CN110457161A

CN110457161A - A kind of efficiently highly reliable big data storage system, method, computer program

Info

Publication number: CN110457161A
Application number: CN201910681574.2A
Authority: CN
Inventors: 唐聃; 袁炜; 蔡红亮; 高燕; 刘善政; 曾琼
Original assignee: Chengdu University of Information Technology
Current assignee: Chengdu University of Information Technology
Priority date: 2019-07-26
Filing date: 2019-07-26
Publication date: 2019-11-15

Abstract

The invention belongs to information data processing technology field, a kind of efficiently highly reliable big data storage system, method, computer program are disclosed, the data for obtaining user and uploading are set according to configuration center；Array module calculates the array of coding；The array that array module calculates is read, is encoded according to array；Host node and from heartbeat message is sent between node mutually, reads the heartbeat delay time of configuration center；Parse failure node information；Carry out data recovery；The position of each initial data piecemeal is inquired according to the data ID to be downloaded；The setting of configuration center is read, the newborn node that the piecemeal memory node and data after being responsible for allocated code select when restoring；Configuration center, the various parameters being responsible in management system；Piecemeal after storage coding.The present invention provides a kind of storage efficiencies to be optimal, efficiently, high reliability, the big data storage solution easily extended.

Description

A kind of efficiently highly reliable big data storage system, method, computer program

Technical field

The invention belongs to information data processing technology fields more particularly to a kind of efficiently highly reliable big data to store system System, method, computer program.

Background technique

Currently, the immediate prior art:

As information technology is in the continuous development of various industries and field, data volume shows the trend of exponential growth, The rapid growth bring data storing reliability problem of data volume and the concurrent efficiency of data access are also following. Usually effective method be building by multiple back end (back end, which can be a PC or server etc., can be used as data The equipment of storage) composition distributed memory system.The sustainable growth of every profession and trade field storage data quantity leads to distributed storage The scale of system is increasing, and number of nodes is continuously increased.Existing some enterprises have possessed multiple more than 3000 nodes Storage system.There is the dispersibility of region and some characteristics of network, distributed storage due to distributed memory system System is also faced with many hardware and software failures, the test such as virus attack and natural calamity.These factors are likely to cause more A data node failure, so that loss of data.Therefore, how for distributed memory system the environment of a safety is provided, allows it With high reliability, data have high availability, become an important topic of distributed security storage at this stage.It is high Fault-tolerant ability is closely related with data reliability, becomes the indispensable technology of distributed memory system, so for dividing More fault-toleranr techniques of cloth storage system are most important to safeguards system data safety.

Currently, fault tolerant most commonly seen in distributed memory system is mainly more replication policies.The strategy It is data to be replicated n-1 copy to be respectively stored on n different nodes, to realize redundancy backup.Redundant data is n-1 secondary Notebook data can effectively restore data after occurring n-1 node while failure simultaneously.Most of commercial storage system uses Data reliability Enhancement Method be more fault-tolerant strategies.Famous distributed memory system GFS, hadoop uses this side Method.This method does not need special coding and restructing algorithm, and error resilience performance is preferable, but space utilization rate is low.If fault-tolerant energy When power is n-1, space utilization rate only has 1/n.In large-scale distributed storage system of today, with the promotion of fault-tolerant ability And the storage efficiency and ever-increasing update cost constantly declined becomes its great drawback.

Storage system fault-tolerance approach (abbreviation correcting and eleting codes strategy) based on correcting and eleting codes is that one kind is quite paid attention to by industry in recent years Enhancing storage system reliability method.Compared with more replication policies, the sharpest edges of correcting and eleting codes strategy are to protect In the case where demonstrate,proving fault-tolerant ability, it can be effectively reduced update cost, improve storage efficiency.Therefore, correcting and eleting codes strategy becomes and mentions The research hotspot of the reliability method of high distributed memory system.To using correcting and eleting codes strategy as the appearance of distributed memory system Mainly there are a Plank team of University of Tennessee and the Blaum team of IBM in wrong Mechanism Study, foreign countries, the country have Tsinghua University relax after Military team and Xu Yinlong team, Chinese University of Science and Technology etc..

Currently, the research for correcting and eleting codes strategy, mainly in two broad aspects: first is that RS (Reed-Solomon) correcting and eleting codes. RS correcting and eleting codes are the MDS codes that can entangle any mistake.Theoretically fault-tolerant ability is unrestricted for RS correcting and eleting codes, imitates with optimal storage Rate, but it calculates the operation being related in polynary finite field, and computation complexity is high (multiplying in especially polynary finite field), It is excessively huge that this is calculated as in large scale distributed system.To solve this problem, some scholars also further investigate this, and It is proposed some improved methods, wherein most typical method be by by polynary finite field operations be transformed on two element field carry out to Improve arithmetic speed.Other scholars also have outstanding contributions on improving polynary finite field operations efficiency, such as Plank proposition GF-complete improves the arithmetic speed in polynary finite field by modification instruction set.Second is that array code.The volume of array code Code process usually only uses simple binary system XOR operation, possesses operation efficiency height, realizes the advantages that process is simple.These are excellent Point is so that array code is highly suitable to be applied for large-scale distributed storage system.What array code was classified according to storage efficiency Words, can be divided into MDS code and non-MDS code.There are two fault-tolerant EVENODD codes and X code, three fault-tolerant star in typical MDS code Code.But there are no MDS array code of the fault-tolerant ability greater than 3 occur so far.In order to there is better fault-tolerant ability, work people Member designs some how fault-tolerant (fault-tolerant ability is greater than 3) non-MDS code, wherein more typical in the case where sacrificing certain storage efficiency Use Weaver code, Grid code etc..Although this kind of array code increases in fault-tolerant ability, is applied and be distributed Variety of problems can be faced in formula storage system, first is that band or stick limited amount system, need band or stick be prime number or That could be realized with the wired sexual intercourse of prime number, this be undoubtedly the expansion bands of distributed memory system are come with very big limitation, with point The expansibility of cloth storage system is not inconsistent.Second is that storage efficiency can be reduced with the increase of fault-tolerant number.Such as design one A to hold 10 wrong Weaver codes, its storage efficiency is less than 20%.So these problems cause array code in distribution Storage system field practicability is not high.

In conclusion problem of the existing technology is: fault-toleranr technique space utilization rate in current distributed memory system Low, storage efficiency is low, and poor reliability, scalability is poor, and practicability is not high.

Solve the difficulty of above-mentioned technical problem:

First is that band or stick limited amount system, need band or stick be prime number or with the wired sexual intercourse of prime number It is able to achieve, this, which is undoubtedly, carrys out very big limitation to the expansion bands of distributed memory system, the easy extension with distributed memory system Property is not inconsistent.Second is that storage efficiency can be reduced with the increase of fault-tolerant number.Such as design one can hold 10 it is wrong Weaver code, its storage efficiency is less than 20%.So these problems cause array code practical in distributed memory system field Property is not high.

Solve the meaning of above-mentioned technical problem:

First is that the numerical value in prime number is bigger, the interval between two adjacent prime numbers is also increasing.Dividing on a large scale When cloth storage system extends, if be still extended according to prime number, need to increase more nodes or dummy node, More nodes mean the increased costs of primary system extension, and more dummy nodes then need to waste more calculating, shadow Ring efficiency.It releases and the quantity of band and stick is limited, user can be allowed to be extended according to the demand of itself, not may require that increasing If more nodes and dummy node.

Second is that correcting and eleting codes used by the fault-toleranr technique of this system are MDS codes, storage efficiency is optimal, can be identical Memory space, store more effective data.

Summary of the invention

In view of the problems of the existing technology, the present invention provides a kind of efficiently highly reliable big data storage systems, side Method, computer program.

The invention is realized in this way a kind of efficiently highly reliable big data storage system includes:

The data for obtaining user and uploading are arranged according to configuration center, and transfer data to coding mould for data uploading module Block；

Array module calculates the array of coding according to the parameter of configuration center, and stores；

Coding module receives the data of data uploading module transmission, the array that array module calculates is read, according to array It is encoded；

Heart beat detection module, host node and from heartbeat message is sent between node mutually, the heartbeat for reading configuration center is prolonged The slow time, if it exceeds the time, then judge the node failure, and send information to data recovery module；

Data recovery module parses failure node information according to the feedback information of heartbeat inspecting module, reads in configuration The data of the heart restore relevant parameter, parsing failure node corresponding location information in an array, transmit to decoder module, into Row data are restored；

Decoder module obtains the failure node location information of data recovery module transmission, then carries out data recovery；

Data download module inquires the position of each initial data piecemeal according to the data ID to be downloaded, if each original Beginning deblocking, which exists, to be lost, then advanced row data are restored, if all existed, all initial data piecemeals are downloaded, It carries out being assembled into initial data；

Load balancing module reads the setting of configuration center, piecemeal memory node and data after being responsible for allocated code The newborn node selected when recovery；

Configuration center, the various parameters being responsible in management system；

Distributed memory system, the piecemeal after storing coding.

Further, the array module includes:

Parameter set unit is used for plan of establishment parameter, and parameter includes: initial data number of blocks n, verifies number of blocks m, has Confinement size GF (2^w)；

Matrix construction unit, for constructing finite field gf (2^w)And corresponding binary matrix and Cauchy matrix, and by Ke Western matrix-expand is at binary matrix；

Array Construction unit constructs array by binary matrix；

Operation optimizes unit, carries out operation statistics to entire array.

Further, the coding module includes:

Initial data loading unit takes out corresponding data according to array request from the file to be encoded；

Redundant data computing unit calculates the value of each replacement formula according to replacement list, goes out further according to array computation Redundant data.

Further, the decoder module includes；

Decoding unit, for according to Array Construction decoding matrix；

Data Computation Unit, for calculating the data for losing data recovery needs；

Data recovery unit, the data block that all loss data that are restored need.Further according to the data of these data blocks Exclusive or obtains corresponding loss data.

Another object of the present invention is to provide a kind of efficient height for efficiently highly reliable big data storage system can The big data storage method leaned on, the efficiently highly reliable big data storage method include that data upload process, data were restored Journey and data downloading process.

The array module of the efficiently highly reliable big data storage method realizes that process is as follows:

Step 1: plan of establishment parameter, parameter include: initial data number of blocks n, verify number of blocks m, finite field size GF (2^w)；

Step 2: building finite field gf (2^w) and corresponding binary matrix；

If root is primitive polynomial g (x)=x of α^a+x^b…+x^c+ 1, (a > b > ... > c) generates a finite field gf (2^w), i.e. α^a=α^b+…α^c+ 1, construction process is as follows:

α⁰=1

α¹=α

α²=α²

α^a=α^b+…+α^c+1

α^a+1=α^a× α=α^b+1+…+α^c+1+α

It is the multinomial element representation in finite field above.

Coefficient vector V (x)=(f of each finite field multinomial element f (x) is obtained again₀,f₁,…,f_w-1)；

Then, the corresponding binary matrix β (e) of building V (x).For arbitrary element e ∈ GF (2^w), β (e) be a w × The binary matrix of w, wherein i-th is classified as xⁱThe coefficient vector of emodp (x)；

Step 3: building Cauchy matrix；

From GF (2^w) m element and n element composition X={ x are chosen respectively₁,x₂,…,x_m, Y={ y₁,y₂,…,y_nStructure Cauchy matrix is built, the Cauchy matrix of building is as follows:

Step 4: Cauchy matrix is extended to binary matrix；

The corresponding binary matrix β (e) that element in Cauchy matrix obtained in step 3 is obtained according to step 2, The Cauchy matrix of m × n is extended to the binary matrix of mw × nw；

Binary matrix after being expanded is as follows:

Step 5: building array；

Firstly, each data block in n data block is cut into w parts, such as D₁,D₂,D₃,…,D_nIt is cut into D_1,1, D_1,2,…,D_1,w,D_2,1,…,D_n,w；

Obtain array:

Step 6: operation optimization

Operation statistics is carried out to entire array, in statistic array(i1, I2, i3 ∈ { 1,2 ..., n }, j1, j2, j3 ∈ { 1,2 ..., w }) number that occurs of pattern.It is more than primary formula by number Individually take out, according toThe sub- number of pattern multiplies 2,The sub- number of pattern multiply 3 after knot Fruit sorts from large to small；Array is replaced according to ranking results, is replaced successfully labeled as S_i, i is replacement serial number.Replacement is completed Alternate form sublist S and last optimization array Z2 are obtained afterwards；

If the parameter being arranged does not change, that array module need to only be executed once, obtain array Z1 and Z2, and replace Formula list S is changed, does not need to repeat；

The coding module of the efficiently highly reliable big data storage method realizes that process is as follows:

Step 1: load initial data

Nw parts of data are taken out from the file to be encoded according to array request, respectively correspond D in array_1,1,…,D_1,w, D_2,1,…,D_n,w；

Step 2: computing redundancy data

According to replacement list S, the value of each replacement formula is calculated.And then according to array Z2, calculate redundant digit According to completion coding；

The decoder module of the efficiently highly reliable big data storage method realizes that process is as follows:

Step 1: building decoding matrix

Firstly, being divided into (n+m) w block according to array Z1, so creation (n+m) w × (n+m) w matrix B, structure areI is the unit matrix of nw × nw, and O is the null matrix of nw × mw, and H is the check matrix of Z1；

Step 2 calculates and loses the data that data restore needs

It (1) is the loss of data of entire data column because when node failure, so when restoring data, according to loss Node create corresponding loss element list L；

(2) it loops through the element e lost in element list L and carries out (3) if e belongs to data element, if e belongs to In redundant elements, then the judgement of next element is carried out；

(3) r is the set for the row that e train value is 1 in check matrix H.If not having element in r, e row is set 0；If r In have element, that just from be chosen to remove in r lose element after the smallest a line of Hamming weight as r_l, element is corresponding in r Capable and e row successively with r_lXOR operation is carried out, finally by r_lSet 0；

(4) after data element operations all in L are complete, then the corresponding column of the redundant elements in L are all set 0；

Step 3 restores data；

After completing step 2, if the corresponding behavior r in corresponding matrix B of the element e in loss element list L_e, c_e It is in r_eThe set for the column that intermediate value is 1；c_eIn the corresponding data block of element be restore element e need data block；According to losing Element list L and matrix B are lost, the data block that all loss data that are restored need；It is different further according to the data of these data blocks Or obtain corresponding loss data.

Further, the data upload process specifically includes:

Data uploading module receives the data uploaded, and the data of upload are sent to coding module；

Array module reads the parameter in relation to array in configuration center；

Array module polls are under corresponding parameter, if having corresponding array, if it is not, according to parameter, meter Encoding array is calculated, and the array being finally calculated is sent to coding module, if existing, array is directly sent to volume Code module；

Coding module reads the parameter of the related coding of configuration center, and receive the initial data that data uploading module is sent with And the encoding array that array module is sent, cutting, pretreatment, coding meter then are carried out to initial data according to encoding array It calculates, obtains redundant data；

Load balancing module reads the related related parameter of load balancing in configuration center；

Each node load state of load balancing module Querying Distributed storage system；

Load balancing module chooses the memory node list of piecemeal after coding according to parameter and each node load state, will It is sent to coding module；

The piecemeal memory node list that coding module balancing received load module is chosen, then according to node listing, respectively Corresponding piecemeal is stored in distributed memory system.

Further, the data recovery procedure specifically includes:

Heart beat detection module moment control host node and from sending heartbeat message between node, if within the setting time, The heartbeat message of node is not received, then judges the node failure, and the information of the node failure is sent to data and restores mould Block；

Data recovery module reads the parameter that related data are restored in configuration center；

Data recovery module restores related parameter, parsing according to node failure information or piecemeal fail message and data Node or the piecemeal corresponding position in encoding array are lost out, send it to decoder module；

Load balancing module chooses newborn node listing, sends it to decoding according to parameter and each node load state Module；

Decoder module chooses decoding scheme, according to side according to node or the piecemeal corresponding position in encoding array is lost Case reads residue block data needed for HDFS distributed memory system；

Decoder module is decoded calculating according to remaining block data, obtains the data for losing piecemeal, equal according to load The newborn node listing that the module that weighs is chosen, the piecemeal recovered is stored in the newborn node of distributed memory system.

Further, the data downloading process specifically includes:

Data download module will if piecemeal is intact according to the state of the data ID to be downloaded inquiry each piecemeal of data Piecemeal assembles, and downloads to client, loses if there is piecemeal, then will lose blocking information and be sent to data recovery module；

Decoder module chooses decoding scheme, according to side according to node or the piecemeal corresponding position in encoding array is lost Case reads residue block data needed for distributed memory system；

Decoder module is decoded calculating according to remaining block data, obtains the data for losing piecemeal, equal according to load The newborn node listing that the module that weighs is chosen, the piecemeal recovered is stored in the newborn node of distributed memory system；

Data download module reads the data of each piecemeal, and block data is assembled into, downloads to client.

Another object of the present invention is to provide a kind of computer program, the computer program has for executing efficiently The program coding of highly reliable big data storage method, wherein the computer program is run on computers.

In conclusion advantages of the present invention and good effect are as follows:

Storage efficiency of the present invention is optimal: the correcting and eleting codes that the fault-toleranr technique of this system uses are MDS codes, are theoretically had There is optimal space utilization rate, fault-tolerant ability adds one to need to increase a redundant block.

The present invention is efficient: the correcting and eleting codes that the fault-toleranr technique of this system uses are turned by calculating the complicated finite field of RS code It changes simple XOR operation into, effectively increases code efficiency.

Reliability of the present invention: the correcting and eleting codes that the fault-toleranr technique of this system uses are transformed by RS code, with RS code, In Theoretically fault-tolerant ability is unrestricted.

The present invention easily extends: being directed to same efficient array code, the correcting and eleting codes that the fault-toleranr technique of this system uses are to item There is no limit do not need band or stick be prime number or can be realized as with the wired sexual intercourse of prime number for band or number of blocks.With It family can expansion system according to their own needs.

The present invention is low for fault-toleranr technique space utilization rate in current distributed memory system, low efficiency, poor reliability, expands The status of malleability difference provides a kind of storage efficiency and is optimal and (has MDS property), and efficiently, high reliability easily extends Big data solution.

Detailed description of the invention

Fig. 1 is efficiently highly reliable big data memory system architecture schematic diagram provided in an embodiment of the present invention.

Fig. 2 is data upload process schematic diagram provided in an embodiment of the present invention.

Fig. 3 is data recovery procedure provided in an embodiment of the present invention and data downloading process schematic diagram.

Fig. 4 is correcting and eleting codes of the present invention provided in an embodiment of the present invention and EVENODD code operation efficiency contrast schematic diagram.

Fig. 5 is the operation efficiency contrast schematic diagram of RA code of the present invention and Star code provided in an embodiment of the present invention.

Specific embodiment

In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to embodiments, to the present invention It is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not used to Limit the present invention.

The present invention is low for fault-toleranr technique space utilization rate in current distributed memory system, low efficiency, poor reliability, expands The status of malleability difference, provide a kind of storage efficiency be optimal and (there is MDS property), efficiently, high reliability, easily extend it is big Data solution.

As shown in Figure 1, efficiently highly reliable big data storage system provided in an embodiment of the present invention includes: that data upload mould Block 101, array module 102, coding module 103, heart beat detection module 104, data recovery module 105, decoder module 106, Data download module 107, load balancing module 108, configuration center 109, distributed memory system 110.

The data for obtaining user and uploading are arranged according to configuration center, and transfer data to coding for data uploading module 101 Module；

Array module 102 calculates the array of coding according to the parameter of configuration center, and stores；

Coding module 103 receives the data of data uploading module transmission, reads the array that array module calculates, according to Array is encoded；

Heart beat detection module 104, host node and from heartbeat message is sent between node mutually read the heartbeat of configuration center Delay time, if it exceeds the time, then judge the node failure, and send information to data recovery module；

Data recovery module 105 parses failure node information according to the feedback information of heartbeat inspecting module, and reading is matched The data for setting center restore relevant parameter, parse failure node corresponding location information in an array, transmit and give decoding mould Block carries out data recovery；

Decoder module 106 obtains the failure node location information of data recovery module transmission, then carries out data recovery；

Data download module 107 inquires the position of each initial data piecemeal according to the data ID to be downloaded, if each Initial data piecemeal, which exists, to be lost, then advanced row data are restored, will be under the downloading of all initial data piecemeals if all existed Come, carries out being assembled into initial data；

Load balancing module 108 reads the setting of configuration center, the piecemeal memory node after being responsible for allocated code, and The newborn node that data select when restoring；

Configuration center 109, the various parameters being responsible in management system；

Distributed memory system 110, the piecemeal after storing coding.

Further, array module 102 realizes that process is as follows:

Step 1: plan of establishment parameter, parameter include: initial data number of blocks n, verify number of blocks m, finite field size GF (2^w)。

Step 2: building finite field gf (2^w) and corresponding binary matrix.

α⁰=1

α¹=α

α²=α²

α^a=α^b+…+α^c+1

α^a+1=α^a× α=α^b+1+…+α^c+1+α

It is the multinomial element representation in finite field above.

Next, obtaining coefficient vector V (x)=(f of each finite field multinomial element f (x) again₀,f₁,…,f_w-1)。

Then, the corresponding binary matrix β (e) of building V (x).For arbitrary element e ∈ GF (2^w), β (e) be a w × The binary matrix of w, wherein i-th is classified as xⁱThe coefficient vector of emodp (x).

Step 3: building Cauchy matrix.

Step 4: Cauchy matrix is extended to binary matrix.

The corresponding binary matrix β (e) that element in Cauchy matrix obtained in step 3 is obtained according to step 2. The Cauchy matrix of m × n is extended to the binary matrix of mw × nw.

Binary matrix after being expanded is as follows:

Step 5: building array.

Firstly, each data block in n data block is cut into w parts, such as D₁,D₂,D₃,…,D_nIt is cut into D_1,1, D_1,2,…,D_1,w,D_2,1,…,D_n,w。

Obtain array:

Step 6: operation optimization

Operation statistics is carried out to entire array, in statistic array(i1, I2, i3 ∈ { 1,2 ..., n }, j1, j2, j3 ∈ { 1,2 ..., w }) number that occurs of pattern.It is more than primary formula by number Individually take out, according toThe sub- number of pattern multiplies 2,The sub- number of pattern multiply 3 after knot Fruit sorts from large to small.Array is replaced according to ranking results, is replaced successfully labeled as S_i, i is replacement serial number.Replacement is completed Alternate form sublist S and last optimization array Z2 are obtained afterwards.

If the parameter being arranged does not change, that array module need to only be executed once, obtain array Z1 and Z2, and replace Formula list S is changed, does not need to repeat.

Further, coding module 103 realizes that process is as follows:

Step 1: load initial data

Nw parts of data are taken out from the file to be encoded according to array request, respectively correspond D in array_1,1,…,D_1,w, D_2,1,…,D_n,w

Step 2: computing redundancy data

According to replacement list S, the value of each replacement formula is calculated.And then according to array Z2, calculate redundant digit According to completion coding.

Further, decoder module 106 realizes that process is as follows:

Step 1: building decoding matrix

Firstly, being divided into (n+m) w block according to array Z1, so creation (n+m) w × (n+m) w matrix B, structure areI is the unit matrix of nw × nw, and O is the null matrix of nw × mw, and H is the check matrix of Z1.

Step 2: calculating and lose the data that data restore needs

Step 2-1: being the loss of data of entire data column because when node failure, so when restoring data, according to The node of loss creates corresponding loss element list L.

Step 2-2: it loops through the element e lost in element list L and carries out step 2- if e belongs to data element 3, if e belongs to redundant elements, carry out the judgement of next element.

Step 2-3:r is the set for the row that e train value is 1 in check matrix H.If not having element in r, e row is set 0 (theoretically the element can not restore explanation).If there is element in r, that is just from the Hamming weight being chosen to remove after losing element in r The smallest a line is measured as r_l, in r the corresponding row of element and e row successively with r_lXOR operation is carried out, finally by r_lSet 0.

Step 2-4: 0 is all set after data element operations all in L are complete, then by the corresponding column of the redundant elements in L.

Step 3: restoring data

After completing step 2, if the corresponding behavior r in corresponding matrix B of the element e in loss element list L_e, c_eIt is In r_eThe set for the column that intermediate value is 1.c_eIn the corresponding data block of element be restore element e need data block.According to loss Element list L and matrix B, it is available to restore all data blocks losing data and needing.Further according to the data of these data blocks Exclusive or obtains corresponding loss data.

As shown in Fig. 2, data upload process provided in an embodiment of the present invention specifically includes:

Step 1: data uploading module receives the data uploaded, and the data of upload are sent to coding module.

Step 2: array module reads the parameter in relation to array in configuration center.

Step 3: array module polls are under corresponding parameter, if having corresponding array.If it is not, according to ginseng Number, calculation code array, and the array being finally calculated is sent to coding module.If existing, directly array is sent out Give coding module.

Step 4: coding module reads parameter of the configuration center in relation to coding, and receives the original of data uploading module transmission The encoding array that data and array module are sent.Then cutting is carried out to initial data according to encoding array, pre-processed, compiled Code calculates, and obtains redundant data.

Step 5: load balancing module reads the related related parameter of load balancing in configuration center.

Step 6: each node load state of load balancing module inquiry HDFS distributed memory system.

Step 7: load balancing module chooses the memory node column of piecemeal after coding according to parameter and each node load state Table sends it to coding module.

Step 8: then the piecemeal memory node list that coding module balancing received load module is chosen is arranged according to node Table stores corresponding piecemeal in HDFS distributed memory system respectively.

Wherein HDFS distributed memory system can be replaced with other distributed memory systems.

As shown in figure 3, data recovery procedure provided in an embodiment of the present invention specifically includes:

A1 step: the heart beat detection module moment controls host node and from heartbeat message is sent between node, if being arranged In time, the heartbeat message of node is not received, then judges the node failure.And the information of the node failure is sent to data Recovery module.

A2 step: data recovery module reads the parameter that related data are restored in configuration center.

A3 step: data recovery module restores related ginseng according to node failure information or piecemeal fail message and data Number parses and loses node or the piecemeal corresponding position in encoding array, sends it to decoder module.

A4 step: load balancing module reads the related related parameter of load balancing in configuration center.

A5 step: load balancing module is chosen newborn node listing, is sent to according to parameter and each node load state To decoder module.

A6 step: decoder module chooses decoding scheme according to node or the piecemeal corresponding position in encoding array is lost. According to scheme, residue block data needed for HDFS distributed memory system is read.

A7 step: decoder module is decoded calculating according to remaining block data, obtains the data for losing piecemeal.It presses According to the newborn node listing that load balancing module is chosen, the piecemeal recovered is stored in the newborn node of HDFS.

Data downloading process specifically includes:

B1 step: data download module inquires the state of each piecemeal of data according to the data ID to be downloaded, if piecemeal is complete It is good, then piecemeal is assembled, downloads to client.It is lost if there is piecemeal, then losing blocking information, to be sent to data extensive Multiple module.

B8 step: data download module reads the data of each piecemeal, and block data is assembled into, downloads to client.

Fig. 4 is n=5, the correcting and eleting codes and n=5 of the present invention of m=2 (data number of blocks is 5, fault-tolerant ability 2) EVENODD code, (each document No. 10 times removes most for the scramble time comparison encoded respectively to the file of 1MB-10MB Average value after big minimum value).Fig. 5 is n=5, the correcting and eleting codes of the present invention of m=3 (data number of blocks is 5, fault-tolerant ability 3) with The Star code of n=5, scramble time comparison (each document No. 10 times, the removing that the file of 1MB-10MB is encoded respectively Average value after maximin).

Correcting and eleting codes of the present invention are transformed by calculating complicated RS code, from two figures it can be seen that correcting and eleting codes of the present invention exist It is slightly above on operation efficiency with the EVENODD code and Star code in efficiently famous array code, and the appearance of correcting and eleting codes of the present invention There is no limit this has also further embodied the practicability of correcting and eleting codes of the present invention for wrong ability and stripe size.

The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.

Claims

1. a kind of efficiently highly reliable big data storage system, which is characterized in that the efficiently highly reliable big data stores system System includes:

The data for obtaining user and uploading are arranged according to configuration center, and transfer data to coding module for data uploading module；

Coding module receives the data of data uploading module transmission, reads the array that array module calculates, carries out according to array Coding；

Heart beat detection module, host node and from heartbeat message is sent between node mutually, when reading the heartbeat delay of configuration center Between, if it exceeds the time, then judge the node failure, and send information to data recovery module；

Data recovery module parses failure node information, reads configuration center according to the feedback information of heartbeat inspecting module Data restore relevant parameter, parse failure node corresponding location information in an array, transmit to decoder module, counted According to recovery；

Data download module inquires the position of each initial data piecemeal according to the data ID to be downloaded, if each original number Exist according to piecemeal and lose, then advanced row data are restored, if all existed, all initial data piecemeals are downloaded, and carry out It is assembled into initial data；

Load balancing module reads the setting of configuration center, and the piecemeal memory node and data after being responsible for allocated code restore When the newborn node that selects；

Distributed memory system, the piecemeal after storing coding.

2. efficiently highly reliable big data storage system as described in claim 1, which is characterized in that the array module packet It includes:

Parameter set unit is used for plan of establishment parameter, and parameter includes: initial data number of blocks n, verifies number of blocks m, finite field Size GF (2^w)；

Matrix construction unit, for constructing finite field gf (2^w) and corresponding binary matrix and Cauchy matrix, and by Cauchy's square Battle array is extended to binary matrix；

Array Construction unit constructs array by binary matrix；

Operation optimizes unit, carries out operation statistics to entire array.

3. efficiently highly reliable big data storage system as described in claim 1, which is characterized in that the coding module packet It includes:

Redundant data computing unit calculates the value of each replacement formula according to replacement list, goes out redundancy further according to array computation Data.

4. efficiently highly reliable big data storage system as described in claim 1, which is characterized in that the decoder module packet It includes；

Decoding unit, for according to Array Construction decoding matrix；

Data recovery unit, the data block that all loss data that are restored need；Further according to the data exclusive or of these data blocks Obtain corresponding loss data.

5. efficiently highly reliable big data storage system is efficient highly reliable described in a kind of operation claim 1-4 any one Big data storage method, which is characterized in that the efficiently highly reliable big data storage method includes data upload process, data Recovery process and data downloading process；

Step 2: building finite field gf (2^w) and corresponding binary matrix；

If root is primitive polynomial g (x)=x of α^a+x^b…+x^c+ 1, (a > b > ... > c) generates a finite field gf (2^w), i.e., α^a=α^b+…α^c+ 1, construction process is as follows:

α⁰=1

α¹=α

α²=α²

α^a=α^b+…+α^c+1

α^a+1=α^a× α=α^b+1+…+α^c+1+α

It is the multinomial element representation in finite field above；

Then, the corresponding binary matrix β (e) of building V (x), for arbitrary element e ∈ GF (2^w), β (e) is the two of a w × w System matrix, wherein i-th is classified as xⁱThe coefficient vector of e mod p (x)；

Step 3: building Cauchy matrix；

From GF (2^w) m element and n element composition X={ x are chosen respectively₁,x₂,…,x_m, Y={ y₁,y₂,…,y_nBuilding Ke The Cauchy matrix of western matrix, building is as follows:

Step 4: Cauchy matrix is extended to binary matrix；

The corresponding binary matrix β (e) that element in Cauchy matrix obtained in step 3 is obtained according to step 2, by m The Cauchy matrix of × n is extended to the binary matrix of mw × nw；

Binary matrix after being expanded is as follows:

Step 5: building array；

Firstly, each data block in n data block is cut into w parts, such as D₁,D₂,D₃,…,D_nIt is cut into D_1,1,D_1,2,…, D_1,w,D_2,1,…,D_n,w；

Obtain array:

Step 6: operation optimization

Operation statistics is carried out to entire array, in statistic array(i1,i2,i3 ∈ { 1,2 ..., n }, j1, j2, j3 ∈ { 1,2 ..., w }) number that occurs of pattern；It is more than that primary formula individually takes by number Out, according toThe sub- number of pattern multiplies 2,The sub- number of pattern multiply 3 after result from Small sequence is arrived greatly；Array is replaced according to ranking results, is replaced successfully labeled as S_i, i is replacement serial number, after the completion of replacement To alternate form sublist S and last optimization array Z2；

If the parameter being arranged does not change, that array module need to only be executed once, obtain array Z1 and Z2 and alternate form Sublist S does not need to repeat；

Step 1: load initial data

Nw parts of data are taken out from the file to be encoded according to array request, respectively correspond D in array_1,1,…,D_1,w,D_2,1,…, D_n,w；

Step 2: computing redundancy data

According to replacement list S, the value of each replacement formula is calculated；And then according to array Z2, redundant data is calculated, it is complete At coding；

Step 1: building decoding matrix

Step 2 calculates and loses the data that data restore needs

It (1) is the loss of data of entire data column because when node failure, so when restoring data, according to the section of loss Point creates corresponding loss element list L；

(2) loop through lose element list L in element e carry out (3) if e belongs to data element, if e belong to it is superfluous Remaining element then carries out the judgement of next element；

(3) r is the set for the row that e train value is 1 in check matrix H；If not having element in r, e row is set 0；If had in r Element, that is just from the smallest a line of Hamming weight being chosen to remove after losing element in r as r_l, the corresponding row of element and e in r Row successively with r_lXOR operation is carried out, finally by r_lSet 0；

Step 3 restores data；

After completing step 2, if the corresponding behavior r in corresponding matrix B of the element e in loss element list L_e, c_eBe r_eThe set for the column that intermediate value is 1；c_eIn the corresponding data block of element be restore element e need data block；It is first according to losing Plain list L and matrix B, the data block that all loss data that are restored need；It is obtained further according to the data exclusive or of these data blocks To corresponding loss data.

6. efficiently highly reliable big data storage method as claimed in claim 5, which is characterized in that the data upload process It specifically includes:

Array module polls are under corresponding parameter, if having corresponding array, if it is not, calculating and compiling according to parameter Code array, and the array being finally calculated is sent to coding module, if existing, array is directly sent to coding mould Block；

Coding module reads parameter of the configuration center in relation to coding, and receives the initial data and battle array of data uploading module transmission Then the encoding array that columnization module is sent carries out cutting, pretreatment to initial data according to encoding array, coding is calculated, obtained To redundant data；

Load balancing module is chosen the memory node list of piecemeal after coding, is sent out according to parameter and each node load state Give coding module；

The piecemeal memory node list that coding module balancing received load module is chosen stores respectively then according to node listing Corresponding piecemeal is in distributed memory system.

7. efficiently highly reliable big data storage method as claimed in claim 5, which is characterized in that the data recovery procedure It specifically includes:

The heart beat detection module moment controls host node and from heartbeat message is sent between node, if do not had within the setting time The heartbeat message of node is received, then judges the node failure, and the information of the node failure is sent to data recovery module；

Data recovery module restores related parameter according to node failure information or piecemeal fail message and data, parses and loses Disloyal point or piecemeal corresponding position in encoding array, send it to decoder module；

Load balancing module chooses newborn node listing, sends it to decoder module according to parameter and each node load state；

Decoder module chooses decoding scheme according to node or the piecemeal corresponding position in encoding array is lost, and according to scheme, reads Take residue block data needed for HDFS distributed memory system；

Decoder module is decoded calculating according to remaining block data, the data for losing piecemeal is obtained, according to load balancing mould The newborn node listing that block is chosen, the piecemeal recovered is stored in the newborn node of distributed memory system.

8. efficiently highly reliable big data storage method as claimed in claim 5, which is characterized in that the data downloading process It specifically includes:

Data download module inquires the state of each piecemeal of data according to the data ID to be downloaded, if piecemeal is intact, by piecemeal It assembles, downloads to client, lost if there is piecemeal, then will lose blocking information and be sent to data recovery module；

Decoder module chooses decoding scheme according to node or the piecemeal corresponding position in encoding array is lost, and according to scheme, reads Take residue block data needed for distributed memory system；

Decoder module is decoded calculating according to remaining block data, the data for losing piecemeal is obtained, according to load balancing mould The newborn node listing that block is chosen, the piecemeal recovered is stored in the newborn node of distributed memory system；

9. a kind of computer program, which is characterized in that the computer program has for any in perform claim requirement 5 to 8 The program coding of efficient highly reliable big data storage method described in, wherein the computer program is run on computers.

10. a kind of storage medium for being stored with computer program as claimed in claim 9.