CN114840488A

CN114840488A - Distributed storage method, system and storage medium based on super-fusion structure

Info

Publication number: CN114840488A
Application number: CN202210778538.XA
Authority: CN
Inventors: 刘江; 龚立义; 郭军
Original assignee: Baike Data Technology Shenzhen Co ltd
Current assignee: Baike Data Technology Shenzhen Co ltd
Priority date: 2022-07-04
Filing date: 2022-07-04
Publication date: 2022-08-02
Anticipated expiration: 2042-07-04
Also published as: CN114840488B

Abstract

The invention discloses a distributed storage method, a distributed storage system and a storage medium based on a super-fusion structure, wherein the method comprises the following steps: acquiring data to be stored, and generating log statistical information according to the data to be stored; determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool or not according to the log statistical information, and integrating and marking data to be stored to obtain an integrated marking file when the same or similar file does not exist in the uniform resource pool; and splitting the integrated marked file through the commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk. The invention can automatically realize distributed storage of data, realize automatic resource allocation and realize high-efficiency communication.

Description

Distributed storage method, system and storage medium based on super-fusion structure

Technical Field

The invention relates to the technical field of data storage, in particular to a distributed storage method, a distributed storage system and a storage medium based on a super-fusion structure.

Background

Memory systems are one of the important components of computers. The memory system provides the ability to write and read information (programs and data) required for the operation of the computer, and realizes the information memory function of the computer. In a modern computer system, a multi-level storage architecture of a register, a high-speed cache, a main memory and an external memory is often adopted; the core of the computer storage system is a memory, which is a necessary memory device in the computer and used for storing programs and data; the internal memory (memory for short) mainly stores programs and data required by the current work of the computer, and comprises a Cache memory (Cache for short) and a main memory. The main memory elements currently used are semiconductor memories. The external memory (external memory for short) mainly has three implementations of magnetic memory, optical memory and semiconductor memory, and the storage medium includes hard disk, optical disk, magnetic tape and removable memory.

However, in the prior art, the storage of data is inefficient, and when data changes or needs to be updated, all data may need to be redistributed.

Thus, there is a need for improvements and enhancements in the art.

Disclosure of Invention

The technical problem to be solved by the present invention is to provide a distributed storage method based on a super-fusion structure, aiming at solving the problems that the efficiency of storing data is low and all data may need to be redistributed when the data changes or needs to be updated in the prior art.

In order to solve the technical problems, the technical scheme adopted by the invention is as follows:

in a first aspect, the present invention provides a distributed storage method based on a super-fusion structure, where the method includes:

acquiring data to be stored, temporarily storing the data to be stored, and generating log statistical information according to the data to be stored, wherein the log statistical information is used for reflecting attribute information in the data to be stored;

determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool or not according to the log statistical information, and integrating and marking the data to be stored to obtain an integrated marking file when determining that the file which is the same as or similar to the log statistical information does not exist in the uniform resource pool;

and splitting the integrated mark file through a commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk.

In one implementation, the generating log statistical information according to the data to be stored includes:

acquiring a file name, a keyword, a file size and a file type in the data to be stored;

and generating the log statistical information according to the file name, the keyword, the file size and the file type.

In an implementation manner, the determining whether a file identical or similar to the log statistical information exists in a preset uniform resource pool according to the log statistical information includes:

searching in the uniform resource pool according to the file name, the keyword, the file size and the file type in sequence, and determining candidate files respectively matched with the file name, the keyword, the file size and the file type in the uniform resource;

if the candidate files have files with the same file names, keywords, file sizes and file types, determining that the files with the same log statistical information exist in the uniform resource pool;

and if the candidate file does not have a file with the same file name, keyword, file size and file type, determining that the unified resource pool does not have a file with the same log statistical information.

carrying out similarity analysis on the file name, the keyword, the file size and the file type and existing files in the uniform resource pool in sequence;

if the similarity between the existing files and the file name, the keyword, the file size and the file type exceeds a threshold value, determining that the files similar to the log statistical information exist in the uniform resource pool;

if the similarity between the existing file and the file name, the keyword, the file size and the file type exceeds a threshold value, determining that the file similar to the log statistical information does not exist in the uniform resource pool.

In one implementation, the method further includes:

if the uniform resource pool has a file which is the same as or similar to the log statistical information, prompting a selection item, wherein the selection item comprises: replacing similar files, saving as new files or not saving files;

receiving an input instruction, determining a selection item corresponding to the instruction, and executing an operation corresponding to the selection item.

In one implementation, the splitting, by the commercial server, the integrated markup file to obtain a split file includes:

determining different locations of the integrated markup file through a compute node in the commercial server, and determining the same locations of the integrated markup file through a fusion node in the commercial server;

and splitting the integration mark file based on the same position and the different positions to obtain the split file.

In one implementation, the obtaining type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file in the target storage disk includes:

determining the type information of the split file based on the file type in the log statistics;

according to the type information, finding out the target storage disk with the same storage type as the type information from the uniform resource pool;

and storing the split file into the target storage disk.

In a second aspect, an embodiment of the present invention further provides a distributed storage system based on a super-fusion structure, where the system includes: the system comprises a super-fusion all-in-one machine, a commercial server connected with the super-fusion all-in-one machine and a uniform resource pool connected with the commercial server; wherein, super fuse all-in-one includes:

the log statistical information acquisition module is used for acquiring data to be stored, temporarily storing the data to be stored and generating log statistical information according to the data to be stored, wherein the log statistical information is used for reflecting attribute information in the data to be stored;

an integration mark file obtaining module, configured to determine whether a file that is the same as or similar to the log statistical information exists in a preset uniform resource pool according to the log statistical information, and when it is determined that the file that is the same as or similar to the log statistical information does not exist in the uniform resource pool, integrate and mark the data to be stored to obtain an integration mark file;

and the file splitting and storing module is used for splitting the integration mark file through a commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk.

In a third aspect, an embodiment of the present invention further provides a super-fusion all-in-one machine, where the super-fusion all-in-one machine includes a memory, a processor, and a distributed storage program based on a super-fusion structure, where the distributed storage program based on a super-fusion structure is stored in the memory and is executable on the processor, and when the processor executes the distributed storage program based on a super-fusion structure, the steps of the distributed storage method based on a super-fusion structure according to any one of the above schemes are implemented.

In a fourth aspect, the embodiment of the present invention further provides a computer-readable storage medium, where a distributed storage program based on a super-fusion structure is stored, and when being executed by a processor, the computer-readable storage medium implements the steps of the distributed storage method based on a super-fusion structure according to any one of the above schemes.

Has the advantages that: compared with the prior art, the invention provides a distributed storage method based on a super-fusion structure, which is characterized by acquiring data to be stored, temporarily storing the data to be stored, and generating log statistical information according to the data to be stored, wherein the log statistical information is used for reflecting attribute information in the data to be stored. And then, according to the log statistical information, determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool, and integrating and marking the data to be stored to obtain an integrated marking file when determining that the file which is the same as or similar to the log statistical information does not exist in the uniform resource pool. And finally, splitting the integration mark file through a commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk. The invention can automatically realize distributed storage of data and automatic resource allocation, the super-fusion structure has no setting of a master node and a slave node, each calculation/data node has the capability of bearing the function of the other calculation/data node, and the nodes complete mutual cooperation through an internal efficient distributed protocol to realize efficient communication.

Drawings

Fig. 1 is a flowchart of a specific implementation of a distributed storage method based on a super-fusion structure according to an embodiment of the present invention.

Fig. 2 is a schematic frame diagram of a distributed storage system based on a super-fusion structure according to an embodiment of the present invention.

Fig. 3 is a schematic block diagram of a super-fusion all-in-one machine in a distributed storage system based on a super-fusion structure according to an embodiment of the present invention.

Fig. 4 is a functional schematic diagram of the super-fusion all-in-one machine provided in the embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and effects of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

The embodiment provides a distributed storage method based on a super-fusion structure, and the method based on the embodiment can realize high-efficiency storage of data. In specific implementation, the embodiment acquires data to be stored, temporarily stores the data to be stored, and generates log statistical information according to the data to be stored, where the log statistical information is used to reflect attribute information in the data to be stored. And then, according to the log statistical information, determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool, and integrating and marking the data to be stored to obtain an integrated marking file when determining that the file which is the same as or similar to the log statistical information does not exist in the uniform resource pool. And finally, splitting the integration mark file through a commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk. The embodiment can automatically realize distributed storage of data and automatic resource allocation, the super-fusion structure is not provided with a master node and a slave node, each computing/data node has the capability of bearing the function of the other computing/data node, and the nodes complete mutual cooperation through an internal efficient distributed protocol to realize efficient communication.

Exemplary method

The distributed storage method based on the super-fusion structure can be applied to terminal equipment, the terminal equipment can be a super-fusion all-in-one machine, the super-fusion all-in-one machine cluster has very good elastic expansion capability, and in the system operation process, when nodes and hard disks are added or deleted, the super-fusion structure can achieve data optimization in the cluster, and automatic redistribution and equalization are achieved; the whole data migration and rebalancing process does not influence the access of the application to the data; in the redistribution and equalization process of all data, the system can ensure that only as little data as possible is needed to be redistributed, and all data in the system does not need to be adjusted and migrated, so that the stability and performance of the system are improved. Specifically, as shown in fig. 1, the distributed storage method based on the super-fusion structure of the present embodiment specifically includes the following steps:

step S100, obtaining data to be stored, temporarily storing the data to be stored, and generating log statistical information according to the data to be stored, wherein the log statistical information is used for reflecting attribute information in the data to be stored.

In this embodiment, as shown in fig. 2, the PC first uploads the data to be stored, and the data is received and temporarily stored by the hyper-fusion all-in-one machine. The PC terminal in the embodiment is connected with the plurality of super-fusion all-in-one machines through the protocol channel, the protocol channel adopts a TCP/IP protocol to realize data transmission, and the super-fusion all-in-one machines temporarily store the data to be stored after acquiring the data to be stored uploaded by the PC terminal through the TCP/IP protocol. And then, the super-fusion all-in-one machine generates log statistical information according to the data to be stored, wherein the log statistical information is used for reflecting attribute information in the data to be stored.

Specifically, the super-fusion all-in-one machine of this embodiment obtains a file name, a keyword, a file size, and a file type in the data to be stored, and then generates the log statistical information according to the file name, the keyword, the file size, and the file type.

Step S200, determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool or not according to the log statistical information, and integrating and marking the data to be stored to obtain an integrated marking file when determining that the file which is the same as or similar to the log statistical information does not exist in the uniform resource pool.

After the log statistical information is obtained, the super-fusion all-in-one machine of the embodiment can search in a preset uniform resource pool according to the log statistical information, and determine whether a file identical or similar to the log statistical information exists in the preset uniform resource pool. Specifically, the super-fusion all-in-one machine of this embodiment may sequentially search in the uniform resource pool according to the file name, the keyword, the file size, and the file type, and determine candidate files in the uniform resource that are respectively matched with the file name, the keyword, the file size, and the file type. And if the candidate files have files with the same file names, keywords, file sizes and file types, determining that the files with the same log statistical information exist in the uniform resource pool. And if the candidate file does not have a file with the same file name, keyword, file size and file type as the file name, keyword, file size and file type, determining that the unified resource pool does not have a file with the same log statistical information as the log statistical information. Or, the super-fusion all-in-one machine of this embodiment may further perform similarity analysis on the file name, the keyword, the file size, and the file type with existing files in the uniform resource pool in sequence. And if the similarity among the existing files, the file name, the keyword, the file size and the file type exceeds a threshold value, determining that the files similar to the log statistical information exist in the uniform resource pool. If the similarity between the existing file and the file name, the keyword, the file size and the file type exceeds a threshold value, determining that the file similar to the log statistical information does not exist in the uniform resource pool. And if the uniform resource pool has a file which is the same as or similar to the log statistical information, prompting a selection item, wherein the selection item comprises: and replacing the similar files, storing the similar files as new files or not storing the new files, then receiving an input instruction by the super-fusion all-in-one machine, determining a selection item corresponding to the instruction, and executing the operation corresponding to the selection item. Specifically, the super-fusion all-in-one machine of this embodiment may receive the instruction, and replace the uploaded similar file with the similar file in the uniform resource pool through the PC terminal according to the instruction, or save the similar file as a new file or not. And when the unified resource pool does not have a file which is the same as or similar to the log statistical information, judging the data to be stored as a new file, and integrating and marking the data to be stored to obtain an integrated marked file. The embodiment integrates and marks the data to be stored, so as to distinguish existing files, avoid confusion with the existing files, and be beneficial to better storing the data to be stored.

Step 300, splitting the integration mark file through a commercial server to obtain a split file, obtaining type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk.

In this embodiment, each super-fusion all-in-one machine is connected with the same resource pool through a commercial server, and after the integration mark file is obtained, the integration mark file can be split through the commercial server to obtain a split file. Then, the type information of the split file can be obtained, a target storage disk matched with the type information is selected from the uniform resource pool, and the split file is stored in the target storage disk. That is to say, in the embodiment, when the integration mark file is stored, the integration mark file is firstly split and then stored according to the type information, which is convenient for data management.

In one implementation, the step S300 specifically includes the following steps:

step S301, determining different positions of the integrated markup file through a computing node in the commercial server, and determining the same position of the integrated markup file through a fusion node in the commercial server;

step S302, splitting the integration mark file based on the same position and the different positions to obtain the split file.

In specific implementation, the commercial server in this embodiment includes a computing node and a fusion node, where the computing node is configured to determine different locations of the integrated markup file, and the fusion node is configured to determine the same locations of the integrated markup file. After the different locations and the same locations are determined, the embodiment may split the integrated markup file based on the same locations and the different locations to obtain the split file. In other words, in this embodiment, the same part of the integrated markup file is split into one file, and different parts of the integrated markup file are split into one file. In this embodiment, a resource pool is composed of a plurality of storage disks, and each storage disk is used for storing different types of data files. Therefore, after the split file is obtained, the type information of the split file can be further obtained, and then the split file is stored in the corresponding storage disk according to the type information, so that distributed storage is realized.

Specifically, the present embodiment may determine the type information of the split file based on the file type in the log statistical information. The log statistical information is obtained based on the file name, the keyword, the file size and the file type in the data to be stored, so the log statistical information comprises the file type. The split file is obtained by splitting an integration mark file formed by integrating marks of data to be stored, so that the type information of the split file can be determined after the file type is determined according to the log statistical information. Then, according to the type information, the target storage disk having the same storage type as the type information is found from the uniform resource pool; and finally, storing the split file into the target storage disk, so that different types of information can be stored into the corresponding storage disks in a distributed and ordered manner.

In an implementation manner, after the split file is stored in the target storage disk, the embodiment may perform an encryption operation on data in each storage disk in the uniform resource pool, and incorporate the identity information during encryption. Only after passing the identity authentication, the PC terminal can call the data in the uniform resource pool, thereby ensuring the security of the data.

The super fusion all-in-one machine in the embodiment adopts a distributed and shared-nothing design concept, data are stored in all nodes in a cluster in a distributed mode through a distributed algorithm, a data redundancy mode of a cross-node 2/3 copy can be achieved, and data reliability is greatly improved; the super-fusion architecture is not provided with a master node and a slave node, each computing/data node has the capability of bearing the function of the other computing/data node, and the nodes are mutually cooperated and communicated through an internal efficient distributed protocol. The super fusion all-in-one machine deploys calculation virtualization and distributed storage in the same server hardware, stores data on a local physical server aiming at applications with high I/O delay requirements such as virtualization and databases, reduces network overhead brought by traditional external shared storage (SAN/NAS), enables a user to set service levels of calculation and storage resources according to self needs, enables distribution of actual resources to be automatically completed by a management platform, and enables management to be easy and simple.

In summary, in this embodiment, first, data to be stored is obtained, the data to be stored is temporarily stored, and log statistical information is generated according to the data to be stored, where the log statistical information is used to reflect attribute information in the data to be stored. And then, according to the log statistical information, determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool, and integrating and marking the data to be stored to obtain an integrated marking file when determining that the file which is the same as or similar to the log statistical information does not exist in the uniform resource pool. And finally, splitting the integration mark file through a commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk. The embodiment can automatically realize distributed storage of data and automatic resource allocation, the super-fusion structure is not provided with a master node and a slave node, each computing/data node has the capability of bearing the function of the other computing/data node, and the nodes complete mutual cooperation through an internal efficient distributed protocol to realize efficient communication.

Exemplary System

Based on the above embodiment, the present invention further provides a distributed storage system based on a super-fusion structure, where the system includes: the system comprises a super-integration all-in-one machine, a commercial server connected with the super-integration all-in-one machine and a uniform resource pool connected with the commercial server. Wherein, as shown in fig. 3, the super-fusion all-in-one machine includes: a log statistical information acquisition module 10, an integration mark file acquisition module 20 and a file splitting and storing module 30. Specifically, the log statistical information obtaining module 10 in this embodiment is configured to obtain data to be stored, temporarily store the data to be stored, and generate log statistical information according to the data to be stored, where the log statistical information is used to reflect attribute information in the data to be stored. The integration mark file obtaining module 20 is configured to determine whether a file that is the same as or similar to the log statistical information exists in a preset uniform resource pool according to the log statistical information, and integrate and mark the data to be stored to obtain an integration mark file when it is determined that the file that is the same as or similar to the log statistical information does not exist in the uniform resource pool. The file splitting and storing module 30 is configured to split the integration markup file through a commercial server to obtain a split file, acquire type information of the split file, select a target storage disk matched with the type information from the uniform resource pool, and store the split file in the target storage disk.

In one implementation, the log statistical information obtaining module 10 includes:

the information acquisition unit is used for acquiring the file name, the keyword, the file size and the file type in the data to be stored;

and the information generating unit is used for generating the log statistical information according to the file name, the keyword, the file size and the file type.

In one implementation, the integration mark file obtaining module 20 includes:

the candidate matching unit is used for searching in the uniform resource pool according to the file name, the keyword, the file size and the file type in sequence and determining candidate files which are matched with the file name, the keyword, the file size and the file type in the uniform resource respectively;

the same judgment unit is used for determining that the file which is the same as the log statistical information exists in the uniform resource pool if the file which is the same as the file name, the keyword, the file size and the file type exists in the candidate files;

and a different judging unit, configured to determine that a file identical to the log statistical information does not exist in the uniform resource pool if a file identical to the file name, the keyword, the file size, and the file type does not exist in the candidate file.

In one implementation, the integration mark file obtaining module 20 includes:

the similarity analysis unit is used for carrying out similarity analysis on the file name, the keyword, the file size and the file type and existing files in the uniform resource pool in sequence;

a similarity judging unit, configured to determine that a file similar to the log statistical information exists in the uniform resource pool if similarity between the existing file and the file name, the keyword, the file size, and the file type exceeds a threshold;

and the dissimilarity judging unit is used for determining that the file similar to the log statistical information does not exist in the uniform resource pool if the similarity among the file name, the keyword, the file size and the file type does not exceed a threshold value in the existing file.

In one implementation, the system further includes:

a selection prompting module, configured to prompt a selection item if a file that is the same as or similar to the log statistical information exists in the uniform resource pool, where the selection item includes: replacing the similar files, saving the similar files as new files or not saving the files;

and the selection operation module is used for receiving an input instruction, determining a selection item corresponding to the instruction and executing the operation corresponding to the selection item.

In one implementation manner, the file splitting and storing module 30 further includes:

a file analysis unit for determining different locations of the integrated markup file through a computation node in the commercial server and determining the same locations of the integrated markup file through a fusion node in the commercial server;

and the file splitting unit is used for splitting the integration mark file based on the same position and the different positions to obtain the split file.

a type determining unit, configured to determine the type information of the split file based on the file type in the log statistical information;

the type analysis unit is used for finding out the target storage disk with the same storage type as the type information from the uniform resource pool according to the type information;

and the file storage unit is used for storing the split file into the target storage disk.

The working principle of each module in the distributed storage system based on the super-fusion structure of this embodiment is the same as the principle of each step in the above method embodiments, and details are not described here.

Based on the above embodiment, the invention also provides a super-fusion all-in-one machine, and a schematic block diagram of the super-fusion all-in-one machine can be shown in fig. 4. The super-integration all-in-one machine comprises a processor and a memory which are connected through a system bus, wherein the processor and the memory are arranged in a host. Wherein, the processor of the super-fusion all-in-one machine is used for providing calculation and control capability. The memory of the super-fusion all-in-one machine comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the super-convergence all-in-one machine is used for being connected and communicated with an external terminal through network communication. The computer program is executed by a processor to implement a distributed storage method based on a hyper-fusion architecture.

It will be understood by those skilled in the art that the schematic block diagram shown in fig. 4 is only a block diagram of a portion of the structure associated with the solution of the present invention, and does not constitute a limitation on the super-fusion unitary apparatus to which the solution of the present invention is applied, and a specific super-fusion unitary apparatus may include more or less components than those shown in the drawings, or may combine some components, or have a different arrangement of components.

In one embodiment, a super-fusion all-in-one machine is provided, where the super-fusion all-in-one machine includes a memory, a processor, and a distributed storage method program based on a super-fusion structure, where the distributed storage method program based on a super-fusion structure is stored in the memory and is executable on the processor, and when the processor executes the distributed storage method program based on a super-fusion structure, the following operation instructions are implemented:

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, operational databases, or other media used in embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), double-rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), synchronous link (Synchlink) DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).

In summary, the present invention discloses a distributed storage method, system and storage medium based on a super-fusion structure, the method includes: acquiring data to be stored, and generating log statistical information according to the data to be stored; determining whether a file which is the same as or similar to the log statistical information exists in a preset uniform resource pool or not according to the log statistical information, and integrating and marking data to be stored to obtain an integrated marking file when the same or similar file does not exist in the uniform resource pool; and splitting the integrated marked file through the commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk. The invention can automatically realize distributed storage of data, realize automatic resource allocation and realize high-efficiency communication.

Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A distributed storage method based on a super-fusion structure is characterized by comprising the following steps:

and splitting the integration mark file through a commercial server to obtain a split file, acquiring the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file into the target storage disk.

2. The distributed storage method based on the super-fusion structure according to claim 1, wherein the generating log statistical information according to the data to be stored comprises:

3. The distributed storage method based on the super-fusion structure according to claim 2, wherein the determining whether a file identical or similar to the log statistical information exists in a preset uniform resource pool according to the log statistical information comprises:

4. The distributed storage method based on the super-fusion structure according to claim 3, wherein the determining whether a file identical or similar to the log statistical information exists in a preset uniform resource pool according to the log statistical information comprises:

if the similarity between the existing file and the file name, the keyword, the file size and the file type exceeds a threshold value, determining that the file similar to the log statistical information does not exist in the unified resource pool.

5. The distributed storage method based on super-fusion structure according to claim 1, further comprising:

6. The distributed storage method based on the super-fusion structure according to claim 1, wherein the splitting the integration markup file by a commercial server to obtain a split file comprises:

7. The distributed storage method based on the super-fusion structure according to claim 2, wherein the obtaining of the type information of the split file, selecting a target storage disk matched with the type information from the uniform resource pool, and storing the split file in the target storage disk comprises:

and storing the split file into the target storage disk.

8. A distributed storage system based on a super-fusion structure, the system comprising: the system comprises a super-fusion all-in-one machine, a commercial server connected with the super-fusion all-in-one machine and a uniform resource pool connected with the commercial server; wherein, super fuse all-in-one includes:

9. A hyper-fusion all-in-one machine, which is characterized by comprising a memory, a processor and a hyper-fusion structure based distributed storage program stored in the memory and operable on the processor, wherein the processor implements the steps of the hyper-fusion structure based distributed storage method according to any one of claims 1 to 7 when executing the hyper-fusion structure based distributed storage program.

10. A computer-readable storage medium, wherein the computer-readable storage medium stores thereon a distributed storage program based on a super-fusion structure, and when the distributed storage program based on a super-fusion structure is executed by a processor, the steps of the distributed storage method based on a super-fusion structure according to any one of claims 1 to 7 are implemented.