CN111711671A

CN111711671A - Cloud storage method for efficient ciphertext file updating based on blind storage

Info

Publication number: CN111711671A
Application number: CN202010486723.2A
Authority: CN
Inventors: 路松峰; 刘奇; 方欣; 潘国阳
Original assignee: Shenzhen Huazhong University of Science and Technology Research Institute
Current assignee: Shenzhen Huazhong University of Science and Technology Research Institute
Priority date: 2020-06-01
Filing date: 2020-06-01
Publication date: 2020-09-25
Anticipated expiration: 2040-06-01
Also published as: CN111711671B

Abstract

The invention discloses a cloud storage method for efficient ciphertext file updating based on blind storage. The invention improves the traditional blind storage method for resisting the access mode leakage in the cloud environment, so that the environment is suitable for efficient and frequent updating of files. The method comprises the following steps: a storage construction algorithm suitable for the method; a file upload protocol; an incremental encryption scheme, a decryption scheme, for the file; a file download protocol; file update protocol method. The method can effectively improve the efficiency of frequent file updating operation of the ciphertext cloud storage, ensures the safe storage of data, does not expose any information to the server, and can be used for a distributed storage scheme.

Description

Cloud storage method for efficient ciphertext file updating based on blind storage

Technical Field

The invention relates to the technical field of internet, in particular to a cloud storage method for efficient ciphertext file updating based on blind storage.

Background

With the development of cloud computing, the data generation speed and the data circulation speed are obviously increased, and a big data era has come. The data has the characteristics of large scale, multiple data types, high speed requirement, low value density and the like, and is not stored in a centralized manner like a traditional information system any more, and new modes such as cloud computing, distributed storage and the like are widely adopted. As a third-party platform, cloud storage is an untrusted storage service, and the security of data may not be guaranteed. Malicious third parties or system administrators can eavesdrop or acquire sensitive data, data leakage can be caused by access control bugs, and data leakage can be caused by the fact that physical storage equipment is stolen or lost. The best method for solving the data security in the cloud storage environment is to encrypt the data, and in order to enable a user to use the cloud storage service with confidence, a cloud service provider provides various schemes for protecting the data security, but at present, the data leakage event of the cloud storage is frequent, and the data security is guaranteed to become a problem which needs to be solved in the cloud storage.

The Blind storage (Blind storage) is a mechanism for storing user data at a cloud server side and ensuring that the cloud server cannot acquire user data information, and solves the problem that an access mode is leaked by a remote server in the file access process. When the client and the server interact, the server does not know the specific content of the document blocks and the document to which each block belongs, and only needs to upload and download the document blocks and a small amount of matching operation, and a large amount of encryption and decryption calculation tasks are performed locally, so that the risk of information leakage is reduced, and a strong safety foundation is laid.

However, uploading the real blocks and the obfuscated blocks to the server together tends to increase the storage space of the server, and in the existing ciphertext file storage scheme, basic modification operations on files include adding, deleting and querying. Modification of an encrypted file in an original blind storage scheme can cause an avalanche effect to exist in decryption and encryption operations of the whole file, namely tiny changes of a plaintext or a secret key can cause great changes of ciphertext binary contents, and file processing and synchronization efficiency is reduced. The existing method does not deal with how to improve the synchronous updating efficiency.

Disclosure of Invention

The invention aims to provide a cloud storage method for efficiently updating a ciphertext file based on blind storage, aiming at the defects in the prior art.

In order to achieve the above object, the present invention provides a cloud storage method for efficient ciphertext file update based on blind storage, which comprises the following steps:

step S1: the server side constructs a specific form of storage space to save all file sets,

step S2: the client carries out variable-length block processing and encryption on a file to be uploaded based on file content to generate ciphertext data blocks with different sizes and random sequences, and the ciphertext data blocks are indexed according to the random sequences and stored in the storage space;

step S3: the client side retrieves a random sequence generated by the file identification and reads the ciphertext data blocks for multiple times to obtain all the ciphertext data blocks of the file from the storage space, and decrypts the ciphertext data blocks in the client side to restore the ciphertext data blocks into a complete file;

step S4: and modifying the downloaded file and uploading the modified downloaded file to the server side again.

Further, the step S1 specifically includes:

step 1.1, sending application information for constructing a storage space to a server through a client;

step 1.2, the server constructs a group n_SEach size is m_SA storage space S composed of data blocks with bit sizes;

and step 1.3, initializing all the nodes to be 0, and returning a creating result to the client.

Further, the step S2 specifically includes:

step 2.1, carrying out content-based variable-length blocking processing on each file f to convert the file f into data blocks with different sizes, wherein the number of the data blocks is size_f；

Step 2.2, constructing the unique identification id of the file f according to the information of the file f_fAccording to the unique identification id of the file_fGenerating seed sigma using a fully-defined-field pseudorandom function_fSeed σ_fInput pseudo-random generatorForming a random sequence;

step 2.3, based on the random sequence and the data block number size_fGenerating a random subset S_fWherein the subset S is random_fBelonging to a storage space, will S_fAs the first of the random sequences in step 2.2, the length is | S_fAn integer set of | where,

kappa is the minimum data block number accessed in each time in the storage, and α is the expansion coefficient of the storage;

step 2.4, in random subset S_fSize selection_fShortest prefixes of retrievable empty data blocks as subsets

Is the size of the data block number size_fClient follows subset for the first time

Establishing an index relationship between the increasing sequence of the retrieved numbers and the data blocks in step 2.1;

step 2.5, adding header information for each variable length block of the f of the file, generating a file key by using a pseudorandom function, and encrypting each data block by adopting an AES (advanced encryption Standard) symmetric encryption algorithm;

and 2.6, the client sends the encrypted data block to the server, and the server stores the encrypted data block into a storage position of the storage space according to the index relation established in the step 2.4.

Further, the step 2.1 specifically includes:

step 2.1.1, the client sets two integers d and r based on Rabin fingerprint algorithm, selects a fixed sliding window W, sets a size parameter of m bytes, and sets a maximum block length max_BMinimum block length min_B；

Step 2.1.2, reading the file f content of the client byte by byte into the sliding window W, and calculating the hash value R in the sliding window W according to the Rabin fingerprint algorithm_breakpoint(ii) a When R is_breakpointWhen mod d is equal to r, a breakpoint is reached, and a block is formed between two adjacent breakpoints;

step 2.1.3, the client records the block number i and the breakpoint position index of the current block_bSize of the block content_bAnd taking the hash value H (block) of the data in the block as the information of the current block, and repeating the step 2.1.2 until the whole file f is completely changed into data blocks with different sizes;

step 2.1.4, the client obtains the information of all the data blocks of the file f, and the size of the number of the data blocks_fAnd finishing the storage pretreatment of the file f.

Further, the step S3 specifically includes:

step 3.1, the client acquires the unique identification id of the file f according to the file information_fAccording to the unique identification id of the file f_fGenerating seed sigma using a fully-defined-field pseudorandom function_fSeed σ_fInputting a pseudo-random generator to obtain a random sequence;

step 3.2, the client generates a first integer set S with the length of kappa according to the random sequence_f ⁰Initiating a first round of interaction to the server;

step 3.3, the server side returns the ciphertext data block corresponding to the index to the client side;

step 3.4, the client decrypts the returned data block to find the first unique mark as id_fAs the first data block of the file f, and obtaining the block size of the file f from the header information of the first data block_f；

Step 3.5, the client analyzes the result

S indicating that the content of this file has been returned in its entirety in the first interaction_f ⁰]In the step (3), the second visit is not needed, and the step (3.8) is directly carried out, otherwise, the step (3.6) is carried out

Step 3.6, the client side performs size according to the random sequence and the number of the data blocks_fGenerating a random set S_fInitiating a second round of interaction to the server;

step 3.7, the server side returns the ciphertext data block corresponding to the index to the client side;

and 3.8, the client decrypts all the data blocks and integrates the data blocks into a complete file f according to the header information of the data blocks.

Further, the step S4 specifically includes the following steps:

step 4.1, the client modifies the file information of the downloaded and read file f into f';

step 4.2, for the modified file f ', the unique identifier of the file f' is still id_fAnd repeating the variable-length blocking processing based on the file content in the step 2.1 to convert the variable-length blocking processing into updated data blocks with different sizes, wherein the number of the updated data blocks is size_f′；

Step 4.3, the client side modifies the unique identification id of the file according to the modification_fAnd generating seed sigma by using a fully-defined-domain pseudo-random function_f′Inputting the seeds into a pseudo-random generator to obtain a new random sequence, and generating a new random sequence with a length of | S_f′Random subset S of integer set of |_f′；

Step 4.4, the client selects the random subset S_f′A random subset

The size is the updated data block number size_f′(ii) a In a random subset S_f′Size selection_f′Shortest prefixes of retrievable empty data blocks as subsets

Step 4.5, the client analyzes the data block quantity change of the variable-length block before and after updating the file, and compares the block information of the variable-length block algorithm based on the content of the file f and the modified file f' to form a corresponding table of the file block before and after the file block is updated;

step 4.6, forming the index information of the updated data block according to the new random sequence, updating the header information of all the updated data blocks, and re-encrypting the header information of all the updated data blocks;

and 4.7, sending and storing all updated ciphertext data blocks of the file to the server side according to the new index information.

Further, in the step 4.5:

if the number of the front data block and the rear data block is the same, a new available space set does not need to be searched

Replacing the updated inconsistent data blocks;

if the number of update data blocks increases, a new set of available space needs to be found

Replacing or storing the updated inconsistent data blocks into the new available space;

if the number of updated data blocks is reduced, a redundant set of available space is required

And clearing, namely replacing the updated inconsistent data blocks, and clearing redundant data blocks.

Has the advantages that: the invention improves the traditional blind storage method for resisting the access mode leakage in the cloud environment, so that the environment is suitable for efficient and frequent update of files, the efficiency of frequent update operation of the files by the ciphertext cloud storage can be effectively improved, the safe storage of data is ensured, no information is exposed to a server, and the method can be used for a distributed storage scheme.

Drawings

FIG. 1 is a schematic diagram of an architecture of a cloud storage method for efficient ciphertext file update based on blind storage;

FIG. 2 is a flowchart of the operation of a cloud storage method based on blind storage of efficient ciphertext file updates;

FIG. 3 is a flow diagram of a content-based variable length chunking process;

FIG. 4 is a flow diagram of selecting a file storage data block;

FIG. 5 is a flow diagram of encrypting and uploading a file;

FIG. 6 is a flow chart of a method of downloading a file;

FIG. 7 is a flow chart diagram of a method of updating a file.

Detailed Description

The present invention will be further illustrated with reference to the accompanying drawings and specific examples, which are carried out on the premise of the technical solution of the present invention, and it should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention.

As shown in fig. 1 and 2, the embodiment of the present invention comprises two parts of entities: one of the servers is a Client and the other is a Server or a cloud storage service provider. And the User stores the ciphertext file of the file f into the server through the client. The architecture diagram of the storage scheme of the present invention is shown in fig. 1, which shows the interaction relationship between the client and the server. Firstly, a user processes a file f through a client, then encrypts a processed file data block, stores the encrypted data block into a storage structure of a server, and can read a file block corresponding to a retrieved file from the server, combine the file blocks and read the file at the client. The protocol operations such as file uploading, file downloading, file updating and the like can be carried out between the client and the server.

The method comprises the following steps that firstly, a client processes a file f, each file is converted into a format of a plurality of variable-length data blocks, the data blocks have a header with a common length, the length of a data part is variable, and each data block is encrypted; storing all encrypted file data blocks on a storage structure, wherein the part only represents storage file blocks; thirdly, reading the file blocks corresponding to the retrieved files from the storage structure; fourthly, the read data blocks are decrypted and integrated to finally obtain complete file contents; fifthly, local updating operation is carried out on the original text file which is retrieved and downloaded by the user to generate a file, and a new ciphertext block is regenerated; and sixthly, incremental updating is carried out on the file before the storage updating of the server. Part of the program (r) represents the feature step S1, and a storage method is constructed. Representing an uploading protocol method of the file in the characteristic step S2; and fourthly, partially representing a downloading protocol method of the file in the characteristic step S3. Sixthly, part of the file update protocol method in the characteristic step S4 realizes the update protocol of the file in all the previous steps.

For convenience of the following description, the notation and system parameter settings used in the present invention are described herein:

h: a collision-tolerant hash function of the fully defined domain;

phi: a pseudo-random function;

Ψ: a pseudo-random function that fully defines a domain;

: a pseudo-random generator;

r: rabin fingerprint algorithm

n_S: number of data blocks of storage space S

m_S: size of each data block

d and r: determining two parameters of Rabin fingerprint

min_B: minimum number of bytes for variable length block

max_B: maximum number of bytes of variable length block

α: the inflation parameter in storage;

kappa: the minimum data block number accessed in each time in the storage;

specifically, the embodiment of the invention provides a cloud storage method for efficient ciphertext file updating based on blind storage, which is characterized by comprising the following steps:

step S1: the server side constructs a storage space in a specific form so as to save all file sets.

Step S1 specifically includes:

step 1.1, sending application information for constructing a storage space to a server side through a client side.

Step 1.2, the server constructs a group n_SEach size is m_SA storage space S consisting of bit-sized data blocks. Here the data block size is setMaximum block size max of variable blocks based on content by file_BTogether with the file block header information size.

Step S2: the client carries out variable-length block processing and encryption on files to be uploaded based on file contents to generate ciphertext data blocks with different sizes and random sequences, and the ciphertext data blocks are indexed according to the random sequences and stored in a storage space.

With reference to fig. 3 to 5, step S2 specifically includes:

step 2.1, carrying out content-based variable-length blocking processing on each file f to convert the file f into data blocks with different sizes, wherein the number of the data blocks is size_f。

Wherein, step 2.1 specifically includes:

Step 2.1.2, reading the file f content of the client byte by byte into the sliding window W, and calculating the hash value R in the sliding window W according to the Rabin fingerprint algorithm_breakpoint(ii) a When R is_breakpointWhen mod d is equal to r, a break point is reached, and a block is formed between two adjacent break points.

Step 2.1.3, the client records the block number i and the breakpoint position index of the current block_bSize of the block content_bAnd taking the hash value H (block) of the data in the block as the current block information, and repeating the step 2.1.2 until the whole file f is changed into the data blocks with different sizes.

Specifically, as shown in fig. 3, the steps are as follows:

if the reading byte of file is over, then go to the word b.

Reading a byte to a sliding window W; the Rabin fingerprint in the sliding window W is updated using the Rabin fingerprint algorithm.

③ calculating whether the distance between the current position and the previous breakpoint position is less than min_BIf yes, go to ①.

④ calculating whether the magnitude between the current position and the previous breakpoint position is greater than max_BIf yes, go to ⑦.

⑤ calculation of R_breakpointIf mod d is true, otherwise go to ①.

⑦ the last byte entering the sliding window is breakpoint, the current block is formed between the last byte and the previous breakpoint (the file byte start position 0 is the first breakpoint by default), the block number i and the breakpoint position index are used to index the current block_bSize of block content size_bThe hash value of the data in the block H (block) is taken as the current block information, and the sliding window is cleared and the process jumps to ①.

And (8) finishing reading the file, analyzing whether data exist in the current window, and otherwise, jumping to the capacitor (R).

And ninthly, forming a current block between the ending byte of the file and the previous breakpoint, namely the last block, and recording block information.

The reading of the file block in the r (r) is finished.

Step 2.2, the client constructs the unique identification id of the file according to the information of the file f_fObtaining a hash value H (id) corresponding to a unique file according to a collision-resistant hash function H of a fully defined domain_f) According to the unique identification id of the file_fGenerating a seed σ using a fully-defined-domain pseudorandom function Ψ_fWherein:

σ_f＝Ψ_kΨ(id_f)

mixing the seeds sigma_fAnd inputting the sequence into a pseudo-random generator to obtain a random sequence.

Step 2.3, based on the random sequence and the data block number size_fGenerating a random subset S_fWherein S is_fBelonging to a storage space, will S_fAs the first of the random sequences in step 2.2, the length is | S_fInteger set of |。

Specifically, wherein: i S_f| represents a random subset S_fThe calculation method is as follows:

where κ is the minimum number of data blocks accessed per time in storage, α is the expansion factor of storage, and S is calculated to overcome the conflict between probability of conflict and waste of storage space_fAs the sequence Λ [ sigma ]_f，|S_f|]E.g. Λ [ sigma, l]The sequence of integers is obtained by the following analysis method: using seed sigma as input parameter of pseudo-random generator to obtain a sufficiently long output, then [ n ]_S]In the range of (1), an integer sequence is analyzed; s_fRefers to the first i long distinguishable parameter in this sequence of integers. Detecting a random subset S_fSize of the number in the storage space S_fWhether each data block is empty; detection of set S of size kappa_f ⁰The number in (1) is at least one data block in the storage space S is empty; if either of the two detection conditions fails, the operation is terminated.

Is the size of the data block number size_fI.e. by

Client follows subset for the first time

The increasing order of the numbers retrieved in step 2.1 establishes an indexing relationship with the data blocks in step 2.1. Detection of

In the storage space SIf the detection condition is not passed, the operation is terminated; in a random subset S_fSize selection_fShortest prefixes of retrievable empty data blocks as subsets

The specific flow of implementing the selection of the file storage block is shown in fig. 4.

And 2.5, adding header information to each variable-length block of the f of the file, generating a file key by using a pseudorandom function, and encrypting each data block by adopting an AES (advanced encryption Standard) symmetric encryption algorithm.

The header information specifically includes: unique file identifier id of file_fHash value H (id) of file identification_f) The hash message identification code H (data) of the data content and the byte number of the last group; wherein:

the version number, initially 0, indicates that it has not been updated and is accumulated after updating. The number of the variable length blocks of the file, the first data block comprises the number of the data blocks after the file is converted, and the step of other blocks is always 0. The current block index, the next block index, and the client according to the subset

As index information of the data block, adding to the index of the current block and the index of the next block corresponding to the header of the block. And the hash message authentication code of the data content is used for verifying the integrity of the data. Last set of number of bytes: and for each variable-length block, encrypting by adopting a fixed packet symmetric encryption algorithm, filling the last group by adopting 0 byte, and recording the original byte number of the last group of the fixed packet at the head part so as to facilitate decryption and recovery.

The client user uses the pseudo-random function to generate a key to generate a unique key for the file, and the AES-256 symmetric encryption algorithm is adopted to encrypt each block of the file, wherein the ECB mode is adopted to facilitate concurrent computation of a plurality of blocks in the encryption and decryption processes, so that the encryption and decryption efficiency of the file is improved.

And 2.6, the client sends the encrypted data block to the server, and the server stores the encrypted data block into a storage position of the storage space according to the index relation established in the step 2.4. The specific flow is shown in fig. 5.

Step S3: the client side retrieves the random sequence generated by the file identification and reads the ciphertext data blocks for multiple times to obtain all the ciphertext data blocks of the file from the storage space, and decrypts the ciphertext data blocks in the client side to restore the ciphertext data blocks into the complete file. Specifically, as shown in fig. 6, step S3 specifically includes:

step 3.1, the client side obtains the unique identification id of the file f according to the information of the file f_fAccording to the unique identification id of the file f_fGenerating seed sigma using a fully-defined-field pseudorandom function_fSeed σ_fInput to a pseudo-random generator to obtain a random sequence, wherein,

step 3.2, the client generates an integer set S according to the random sequence_f ⁰And initiating a first round of interaction to the server. S_f ⁰As the sequence Λ [ sigma ]_f，κ]The method of analysis and step 2.3_fThe same is true. And 3.3, the server returns the ciphertext data block corresponding to the index to the client. Step 3.4, the client decrypts the returned data block to find the first unique mark as id_fAs the first data block of the file f, and obtaining the block size of the file f from the header information of the first data block_f。

Specifically, a client user generates a unique key for a file by using a pseudo-random function generation key, and decrypts each block by adopting an AES-256 symmetric encryption algorithm.

Finding the unique file identifier id of the file_fThe hash value of the file identifier is H (id)_f) If the data block is not found, the server side does not have the file, and the interaction is directly finished;if such a data block is found, this data block is the first data block of the file, and the block size of the file can be obtained from its header information_f。

Step 3.5, the client analyzes the result

S indicating that the content of this file has been returned in its entirety in the first interaction_f ⁰]And (5) directly entering a step 3.8 without performing second access, otherwise entering a step 3.6.

Step 3.6, the client side performs size according to the random sequence and the number of the data blocks_fGenerating a random set S_fAnd initiating a second round of interaction to the server. So that the client can all obtain the ciphertext blocks S of the remaining files_f-S_f ⁰(the relative complement is denoted here as S_fBut not to S_f ⁰). Step 3.7, the server side gathers the ciphertext data blocks (S [ S ]) corresponding to the indexes_f-S_f ⁰]) And returning the data to the client. And 3.8, the client decrypts all the data blocks and integrates the data blocks into a complete file f according to the header information of the data blocks.

Specifically, a client user generates a unique key for a file by using a pseudo-random function generation key, and decrypts each data block of the file by using an AES-256 symmetric encryption algorithm. Then according to the file unique identification of the file in the header information as id_fThe hash value of the file identifier is H (id)_f) Verifying whether the file is a data block of the file f; and integrating the block sequence number, the data block index and the next data block index in the header information into a complete file in sequence.

Step S4: and modifying the downloaded file and uploading the modified downloaded file to the server side again. The file updating protocol is realized on the basis of the download file protocol, the download file is modified, an incremental updating algorithm is realized, and the efficiency of synchronous updating of the ciphertext is improved.

As shown in fig. 7, step S4 specifically includes the following steps:

step 4.2, for the modified file f ', the unique identifier of the file f' is still id_fObtaining a hash value H (id) corresponding to a unique file according to a collision-resistant hash function H of a fully defined domain_f) (ii) a The variable-length blocking processing based on the file content in the step 2.1 is carried out again to convert the variable-length blocking processing into updated data blocks with different sizes, and the number of the updated data blocks is size_f′。

Step 4.3, the client side modifies the unique identification id of the file according to the modification_fAnd generating seed sigma by using a fully-defined-domain pseudo-random function_f′，

Mixing the seeds sigma_f′Inputting into a pseudo-random generator to obtain a new random sequence, and generating a first random sequence with length | S_f′Random subset S of integer set of |_f′(ii) a Will S_f′As the sequence Λ [ sigma ]_f，|S_f′|]Is selected. The method of analysis and S in step 2.3_fThe same; specifically, here | S_f′I always select the pseudo-random subset S of preceding and following files_f′And S_fMaximum value of the aggregate size.

Step 4.4, the client selects the random subset S_f′A random subset

The size is the updated data block number size_f′I.e. by

In a random subset S_f′Size selection_f′Shortest prefixes of retrievable empty data blocks as subsets

A file storage block is selected. Here, it is not necessary to determine whether the signal is null, and a specific flow is shown in fig. 4.

And 4.5, the client analyzes the data block quantity change of the variable-length blocks before and after updating the file, and compares the block information of the variable-length block algorithm based on the content of the file f and the modified file f' to form a corresponding table of the file blocks before and after the file block is updated.

Specifically, the modification of the file by the user includes three operations of adding, deleting and modifying. When an original file f is modified at a certain position, the file length-variable blocks of the modified file f' are changed, only the left and right blocks of the modified block of the original file f are changed, and other blocks are not changed, which specifically includes the following steps:

step 4.5.1, displaying the blocking condition of the initial file f, such as [ B ]₁，B₂，B₃，B₄，B₅]；

Step 4.5.2, e.g. for Block B₂The modification operation is carried out, and the result does not cause the break point breaking and is not less than the minimum length min of the variable-length blocks_BAnd not larger than the maximum length max of the variable length block_BThe result of the modified re-blocking may be [ B ]₁，B₆，B₃，B₄，B₅]Only block B₂Has been changed to B₆Other blocks are not changed;

step 4.5.3, e.g. for Block B₄A modification operation is carried out, which results in the generation of a new breakpoint that is not less than the minimum length min of the variable-length partition_BAnd not larger than the maximum length max of the variable length block_BThe result of the modified re-blocking may be [ B ]₁，B₆，B₃，B₇，B₈，B₅]Only block B₄Changed, newly added blocks become B₇，B₈Other blocks are not changed;

step 4.5.4, e.g., for block B₃Performing a modification operation, resulting in destruction of the original B₃Breakpoint and not less than minimum length min of variable length block_BAnd not larger than the maximum length max of the variable length block_BThe result of the modified re-blocking may be [ B ]₁，B₆，B₉，B₈，B₅]Into blocks B₃，B₇Changes are made, reducing blocking to B₉And other blocks are not changed.

In this way, the block information of the variable-length block algorithm based on the content of the source file and the new file is recorded, and the corresponding tables of the front file block and the rear file block are respectively recorded in the set of the unchanged blocks and the set of the changed blocks.

Specifically, if the number of the front and rear data blocks is the same, the size is obtained_f＝size_f′Without the need to find new sets of available space

That is at this time | S_f′|＝|S_fAnd replacing the inconsistent blocks of the updated data.

The method comprises the following specific steps:

obtaining the block information of the source file and the new file based on the content variable length block algorithm according to the step 4.5, such as the original block [ B ]₁，B₂，B₃，B₄，B₅]Modified block [ B ]₁，B₆，B₃，B₇，B₅]. The index position is also a random subset of the source file f without finding new storage space

The client side counts out a set of changed blocks { B }₂，B₄Is replaced by { B }₆，B₇}. Updating the index of the current block at the head of all the blocks according to the corresponding table of the change information of the data blocks, and updating the head index of the next block index; establishing an index of a corresponding block; that is, the client of this example only needs to connect B₆Encrypted and then replaced to B₂Index position of B, will₇Encrypted and then replaced to B₄The index position of (2).

If the number of update blocks increases, i.e. size_f＜size_f′It is necessary to find a new set of available space

That is at this time

A new data block storage space needs to be searched, and a block with inconsistent updated data is replaced or stored into a new available space.

The method comprises the following specific steps: obtaining the block information of the source file and the new file based on the content variable length block algorithm according to the step 4.5, such as the original block [ B ]₁，B₂，B₃，B₄，B₅]Modified block [ B ]₁，B₆，B₃，B₇，B₈，B₅](ii) a New storage space needs to be found to store the newly added encrypted blocks, and the index position is a random subset S of the modified file f_f′The client needs to randomly subset S from the server_f′(by definition at this time

) In the random sequence obtained from the seeds, find whether there is size_f′-size_fA null block added to the set of digital spaces

If the file is not found, the storage space is insufficient, and the storage space needs to be expanded to update the file; the client needs to count the set of changed blocks { B }₂，B₄Is replaced by { B }₆，B₇，B₈}; updating the number of the variable-length blocks of the file, updating the indexes of the current blocks at the heads of all the blocks according to the data block change information corresponding table, and updating the head indexes of the next blocks; establishing an index of a corresponding block; i.e. in this example B₆Alternative B₂，B₇Alternative B₄，B₈New data blocks found in the above are added.

If the number of updated data blocks is reduced, i.e. size_f＞size_f′Requiring a redundant set of available space

Is emptied, i.e. at this time

And replacing the updated inconsistent data blocks, and emptying redundant data blocks.

The method comprises the following specific steps: obtaining the block information of the source file and the new file based on the content variable length block algorithm according to the step 4.5, such as the original block [ B ]₁，B₂，B₃，B₄，B₅]Modified block [ B ]₁，B₂，B₆，B₅](ii) a Without finding new storage space, the index position is a random subset S of the modified file f_f′The client needs to randomly subset S from the server_f(by definition at this time

) In the random sequence obtained by seed search_f-size_f′Redundant blocks, which are emptied; the client needs to count the set of changed blocks { B }₃，B₄Is replaced by { B }₆}; updating the number of the variable-length blocks of the file, updating the indexes of the current blocks at the heads of all the blocks according to the data block change information corresponding table, and updating the head indexes of the next blocks; establishing an index of a corresponding block; i.e. in this example B₆Alternative B₃Clear B₄A block of data.

And 4.6, forming index information of the updated data block according to the new random sequence, updating all updated data block header information (the header information size is far smaller than the data part size, so that updating the data block can be ignored only when updating the part), increasing the version number in the header information of all the data blocks, re-encrypting all the data block headers, and encrypting all the data blocks in the set of the changed blocks.

The foregoing is only a preferred embodiment of the present invention, and it should be noted that other parts not specifically described are within the prior art or common general knowledge to those of ordinary skill in the art. Without departing from the principle of the invention, several improvements and modifications can be made, and these improvements and modifications should also be construed as the scope of the invention.

Claims

1. A cloud storage method for efficient ciphertext file updating based on blind storage is characterized by comprising the following steps:

2. The cloud storage method for efficient ciphertext file update based on blind storage according to claim 1, wherein the step S1 specifically includes:

3. The cloud storage method for efficient ciphertext file update based on blind storage according to claim 1, wherein the step S2 specifically includes:

step 2.1, forEach file f is subjected to variable-length blocking processing based on content to be converted into data blocks with different sizes, and the number of the data blocks is size_f；

Step 2.2, constructing the unique identification id of the file f according to the information of the file f_fAccording to the unique identification id of the file_fGenerating seed sigma using a fully-defined-field pseudorandom function_fSeed σ_fInputting a pseudo-random generator to obtain a random sequence;

4. The cloud storage method for efficient ciphertext file update based on blind storage according to claim 3, wherein the step 2.1 specifically comprises:

5. The cloud storage method for efficient ciphertext file update based on blind storage according to claim 1, wherein the step S3 specifically includes:

Step 3.5, the client analyzes the result

6. The cloud storage method for efficient ciphertext file update based on blind storage according to claim 5, wherein the step S4 specifically comprises the following steps:

Step 4.3, the client side modifies the unique identification id of the file according to the modification_fAnd generating seed sigma by using a fully-defined-domain pseudo-random function_f′Inputting the seed into a pseudo-random generatorObtaining a new random sequence, generating a new random sequence with a length of | S_f′Random subset S of integer set of |_f′；

Step 4.4, the client selects the random subset S_f′A random subset

7. The cloud storage method based on efficient ciphertext file update of blind storage according to claim 6, wherein in the step 4.5:

Replacing the updated inconsistent data blocks;