CN112181729A - Virtual machine backup control device, system and method - Google Patents

Virtual machine backup control device, system and method Download PDF

Info

Publication number
CN112181729A
CN112181729A CN202011102166.6A CN202011102166A CN112181729A CN 112181729 A CN112181729 A CN 112181729A CN 202011102166 A CN202011102166 A CN 202011102166A CN 112181729 A CN112181729 A CN 112181729A
Authority
CN
China
Prior art keywords
disk
backup
virtual machine
list
backed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011102166.6A
Other languages
Chinese (zh)
Inventor
覃展恒
陈勇铨
倪国军
胡军擎
周华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Yingfang Software Co ltd
Original Assignee
Shanghai Yingfang Software Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Yingfang Software Co ltd filed Critical Shanghai Yingfang Software Co ltd
Priority to CN202011102166.6A priority Critical patent/CN112181729A/en
Publication of CN112181729A publication Critical patent/CN112181729A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608Saving storage space on storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0646Horizontal data movement in storage systems, i.e. moving data in between storage devices or systems
    • G06F3/0652Erasing, e.g. deleting, data cleaning, moving of data to a wastebasket
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • G06F3/0674Disk device
    • G06F3/0676Magnetic disk device
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45562Creating, deleting, cloning virtual machine instances

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a virtual machine backup control device, a system and a method, wherein the device comprises: the system comprises a backup virtual machine disk information acquisition unit, a backup virtual machine disk information acquisition unit and a backup virtual machine disk information acquisition unit, wherein the backup virtual machine disk information acquisition unit is used for acquiring backup virtual machine disk information to be backed up from a source virtualization platform host machine; a backed-up disk information acquiring unit for acquiring the disk information of the backed-up disk from the backup storage machine; the comparison unit is used for comparing the obtained backup virtual machine disk information with the backed-up disk information, searching whether a duplicate disk exists or not, deleting the duplicated disk equipment in the obtained backup virtual machine disk information if the duplicate disk is found, and forming a full backup list or an incremental backup list according to a comparison result; and the backup control unit is used for respectively issuing a backup designated disk task to the backup storage machine and issuing a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list so as to realize virtual machine backup.

Description

Virtual machine backup control device, system and method
Technical Field
The invention relates to the technical field of computer disaster recovery backup, in particular to a virtual machine backup control device, a virtual machine backup control system and a virtual machine backup control method for realizing virtual machine backup by comparing differences among multiple disk data.
Background
The cloud virtualization platform is a new concept derived and developed on the concept of virtualization technology, and refers to a method of cutting resources of one or more servers into a computing pool, a storage pool and a network pool by means of server resource cutting, and allocating the computing pool, the storage pool and the network pool together through resource management software for cooperative work. When the operation and maintenance personnel need to create the cloud server, the required resources can be flexibly distributed according to actual requirements, so that the hardware resources of the server can be more reasonably used, and the waste of the redundant computing capacity of the hardware is reduced.
People in daily life increasingly depend on data, so that high requirements are put on the reliability and safety of the data. In the process of server operation, the problems of data loss and data damage are often caused by hardware damage or misoperation of operation and maintenance personnel. And before the data is lost or damaged, the data is backed up at regular time, so that the loss caused by the data loss or damage can be reduced to a certain extent. Nowadays, more and more enterprises can back up business data (server databases, virtual machine systems, and the like) to provide guarantee for business continuity.
In the conventional virtual machine backup software, because there is no difference between multiple disk data in the backup process, many disks with the same data are not deduplicated, which results in repeated backup of the same data, and causes the following problems:
1. the backup speed is slow;
2. the backup occupies a large storage space;
3. the backup consumes excessive network bandwidth resources.
Disclosure of Invention
In order to overcome the defects in the prior art, the present invention provides a virtual machine backup control device, system and method, so as to solve the problems of slow backup speed, large storage space occupation and excessive network bandwidth resource consumption caused by repeated reading of redundant data in a large-scale virtual machine backup system.
To achieve the above object, the present invention provides a virtual machine backup control device, including:
the system comprises a backup virtual machine disk information acquisition unit, a backup virtual machine disk information acquisition unit and a backup virtual machine disk information acquisition unit, wherein the backup virtual machine disk information acquisition unit is used for acquiring backup virtual machine disk information to be backed up from a source virtualization platform host machine;
a backed-up disk information acquiring unit for acquiring the disk information of the backed-up disk from the backup storage machine;
the comparison unit is used for comparing the obtained backup virtual machine disk information with the backed-up disk information, searching whether a duplicate disk exists or not, deleting the duplicated disk equipment in the obtained backup virtual machine disk information if the duplicate disk is found, and forming a full backup list or an incremental backup list according to a comparison result;
and the backup control unit is used for respectively issuing a backup designated disk task to the backup storage machine and issuing a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list so as to realize virtual machine backup.
Preferably, the backup virtual machine disk information includes, but is not limited to, a virtual machine list, a disk list corresponding to each virtual machine, and metadata corresponding to each disk, where the metadata includes, but is not limited to, a disk identifier and a disk hash check value or CBT change block information.
Preferably, the disk information of the backed-up disk includes, but is not limited to, a backed-up disk list, and a disk identifier, a disk hash check value, or CBT change block information corresponding to each disk.
Preferably, the backup virtual machine disk information obtaining unit adds all disks in the obtained disk list to a disk backup task list, where the disk backup task list includes a disk identifier of each disk and a corresponding disk hash check value or CBT change block information.
Preferably, the comparing unit compares the disk identifiers of the disks in the disk backup task list in a traversal manner with the backed-up disk list one by one, and if a disk with the same disk identifier is not found in the backed-up disk list, adds the currently compared disk in the disk backup task list into a full backup list; if the same disk identifier exists in the backed-up disk list, for the disks with the same disk identifier, further judging whether the data of the disk file hash check value or the data of the CBT change block are consistent according to the disk identifier, if so, judging that the disk device is a duplicate disk device, deleting the corresponding disk device in the disk backup task list, and if not, adding the currently compared disk in the disk backup task list into an incremental backup list.
Preferably, after adding all disks in the disk list to the disk backup task list, the backup virtual machine disk information obtaining unit traverses the disk backup task list to find duplicate disk devices, and if there are duplicate disk devices, deletes the duplicate disk devices in the disk backup task list.
In order to achieve the above object, the present invention further provides a virtual machine backup control system, including:
the virtual machine backup control device is used for acquiring backup virtual machine disk information from a source virtualization platform host machine, acquiring backed-up disk information from a backup storage machine, deleting the duplicated disk equipment by traversing and searching disk equipment for whether duplicated disk identifiers exist in the backup virtual machine disk information and the backed-up disk information or not and comparing the consistency of the disk data, acquiring a full backup list and an incremental backup list according to a comparison result, issuing a backup designated disk task to the backup storage machine according to the full backup list and the incremental backup list and issuing a snapshot creation task to the designated disk to the source virtualization platform host machine;
the source virtualization platform host is used for providing backup virtual machine disk information for the virtual machine backup control device, creating a snapshot on a specified disk according to a specified disk creation snapshot task issued by the virtual machine backup control device to ensure consistency of master disk data in a backup process, and acquiring full/incremental data according to the disk snapshot and transmitting the full/incremental data to the backup storage machine;
and the backup storage machine is used for copying data of the corresponding disk sent by the host machine of the source virtualization platform according to the specified disk backup task issued by the virtual machine backup control device and sending the backed-up disk information to the virtual machine backup control device.
Preferably, after the source virtualization platform host obtains the full/incremental data according to the disk snapshot, a disk interface is called, and the disks on the whole chain are mapped into one block device; calling an open function of the C language, and mapping the block device into a disk address space on a file pointer; then calling a lead or fread function of the C language, reading data in a file pointer, and circularly reading to obtain all data; and calling a network transceiving library, and sending the read data to the backup storage machine.
Preferably, the backup storage machine calls a disk interface to create an empty disk; calling an open function of the C language, and mapping the block device into a disk address space on a file pointer; calling the pwrite or fwrite function of the C language and the network transceiving library, and circularly writing the data received by the network transceiving library into the disk.
In order to achieve the above object, the present invention further provides a virtual machine backup control method, including the following steps:
step S1, the virtual machine backup control device obtains the backup virtual machine disk information to be backed up from the source virtualization platform host machine;
step S2, the virtual machine backup control device obtains the disk information of the backed-up disk from the backup storage machine;
step S3, the virtual machine backup control device compares the backup virtual machine disk information obtained in step S1 with the backed-up disk information obtained in step S2 to find whether there is a duplicate disk, deletes the duplicate disk device in the obtained backup virtual machine disk information if the duplicate disk is found, and forms a full backup list or an incremental backup list according to the comparison result;
and step S4, respectively issuing a backup designated disk task to the backup storage machine and issuing a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list.
Compared with the prior art, the virtual machine backup control device, the virtual machine backup control system and the virtual machine backup control method realize virtual machine backup by comparing the difference among multiple disk data, so that the problems of low backup speed, large occupied storage space and excessive network bandwidth resource consumption caused by repeated reading of redundant data in a large-scale virtual machine backup system are solved.
Drawings
Fig. 1 is a schematic structural diagram of a virtual machine backup control device according to the present invention;
FIG. 2 is a system architecture diagram of a virtual machine backup control system according to the present invention;
FIG. 3 is a flowchart illustrating steps of a virtual machine backup control method according to the present invention;
fig. 4 is an architecture diagram of a virtual machine backup control system according to an embodiment of the present invention.
Detailed Description
Other advantages and capabilities of the present invention will be readily apparent to those skilled in the art from the present disclosure by describing the embodiments of the present invention with specific embodiments thereof in conjunction with the accompanying drawings. The invention is capable of other and different embodiments and its several details are capable of modification in various other respects, all without departing from the spirit and scope of the present invention.
Fig. 1 is a schematic structural diagram of a virtual machine backup control device according to the present invention. As shown in fig. 1, the present invention provides a virtual machine backup control apparatus, including:
a backup virtual machine disk information obtaining unit 101, configured to obtain backup virtual machine disk information to be backed up from a source virtualization platform host. Specifically, after acquiring tasks of one or more virtual machines of a backup source virtualization platform host, the backup virtual machine disk information acquiring unit 101 acquires a corresponding virtual machine list, a disk list corresponding to each virtual machine, and metadata corresponding to each disk from the source virtualization platform host, where in the present invention, the backup virtual machine disk information includes but is not limited to: a virtual machine list (1 or more virtual machines), a disk list (1 or more disks) corresponding to each virtual machine, and metadata corresponding to each disk, where the metadata at least includes a disk identifier including a universally Unique identifier uuid (universal Unique identifier) of the disk and a disk file identifier (generally, a random string serial number), and a disk Hash Check value calculated for the disk data by a Message Digest algorithm (Message Digest), or a Hash Check value calculated for the disk data by a Secure Hash Standard (Secure Hash Standard), or a Hash value calculated for the disk data by a Cyclic Redundancy Check (Cyclic Redundancy Check), and the metadata at least includes a disk Hash Check value or CBT (Changed Block Tracking) change Block information, and the like, for the mature prior art, it is not described herein in detail.
In the embodiment of the present invention, the backup virtual machine disk information obtaining unit 101 may add all disks in the obtained disk list to a disk backup task list. The disk backup task list comprises a disk identifier of each disk, a disk hash check value or CBT change block information.
Preferably, the backup virtual machine disk information obtaining unit 101, after adding all disks in the disk list to the disk backup task list, traverses the disk backup task list to find a duplicate disk device, and if there is a duplicate disk device, deletes the duplicate disk device in the disk backup task list. Specifically, the disk backup task list is traversed to find whether disk devices with the same disk identifier exist, and if yes, the disk devices are deleted from the disk backup task list. Specifically, if the disk a and the disk B are found to be duplicated by traversing the disk backup task list, the disk B is a duplicate disk of the disk a, and the disk B is deleted from the disk backup task list, which of course may also be regarded as a duplicate disk of the disk B, and the disk a is deleted from the disk backup task list, which is not limited by the present invention.
A backed-up disk information obtaining unit 102, configured to obtain disk information of a backed-up disk from a backup storage device. In the present invention, the disk information of the backed-up disk includes a backed-up disk list (1 or multiple disks) and a disk identifier or a disk hash check value or CBT change block information corresponding to each disk.
A comparing unit 103, configured to compare the information of the virtual machine disk obtained by the backup virtual machine disk information obtaining unit 101 with the information of the backup disk obtained by the backup disk information obtaining unit 102, if a duplicate disk is found, delete the duplicate disk device in the obtained virtual machine disk information of the backup, and if a duplicate disk is not found, form a full backup list or an incremental backup list according to the comparison result.
In the specific embodiment of the present invention, according to the disk backup task list obtained by the backup virtual machine disk information obtaining unit 101 and the backed-up disk list obtained by the backed-up disk information obtaining unit 102, the duplicate disk identifier and the disk data consistency comparison are searched in a traversal manner to search for duplicate disk devices in the disk backup task list and the backed-up disk list, and the found duplicate disk devices are deleted in the disk backup task list. Specifically, disk identifiers of each disk in the disk backup task list are traversed and compared with a backed-up disk list one by one, and if a disk with the same disk identifier is not found in the backed-up disk list, the currently compared disk in the disk backup task list is added into a full backup list; if the same disk identifier exists in the backed-up disk list, for the disks with the same disk identifier, further judging whether the data of the disk file hash check value or the CBT change block are consistent according to the disk file hash check value or the CBT change block, if so, judging that the disk device is a duplicate disk device, deleting the corresponding disk device in the disk backup task list, and if not, adding the currently compared disk in the disk backup task list into an incremental backup list.
And the backup control unit 104 is configured to respectively issue a backup designated disk task to the backup storage machine and issue a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list.
Specifically, the backup control unit 104 issues an instruction for creating an external snapshot for all disks in the full backup list and the incremental backup list to the source virtualization platform host, and issues an instruction for backing up all disk tasks in the full backup list and the incremental backup list to the backup storage machine, and the backup storage machine obtains full/incremental data of a specified disk from the source virtualization platform host to implement virtual machine backup.
Fig. 2 is a system architecture diagram of a virtual machine backup control system according to the present invention. As shown in fig. 2, the present invention provides a virtual machine backup control system, including:
the virtual machine backup control device 20 is configured to obtain backup virtual machine disk information from the source virtualization platform host 21, obtain backed-up disk information from the backup storage machine 22, delete duplicate disk devices by traversing and searching for disk devices that have duplicate disk identifiers in the backup virtual machine disk information and the backed-up disk information and comparing consistency of disk data, obtain a full backup list and an incremental backup list according to a comparison result, issue a backup designated disk task to the backup storage machine 22 according to the full backup list and the incremental backup list, and issue a snapshot creation task for the designated disk to the source virtualization platform host 21. Since the virtual machine backup control device 20 is described in detail above, it is not described herein.
The source virtualization platform host 21 is configured to provide backup virtual machine disk information for the virtual machine backup control device 20, create a snapshot for a specified disk according to a specified disk creation snapshot task issued by the virtual machine backup control device 20 to ensure consistency of master disk data in a backup process, and transmit disk data to the backup storage machine 22, for example, create a snapshot B for the specified disk a according to the specified disk creation snapshot task, and write new data into the snapshot B, so that the disk a becomes read-only, read data of the disk a, calculate a hash value of the disk a, and send the hash value to the backup storage machine 22.
Specifically, for the disk device in the full backup list, when reading the disk a, the disk a is cut into a plurality of small blocks, only one small block is read in sequence each time, a hash value is calculated for each small block, after reading, all data and the hash values of the small blocks are transmitted to the backup storage machine 22, and when transmission is completed, full backup is completed; for the disk device in the incremental backup list, the data of the disk a is read, the disk a is also cut into a plurality of small blocks, the hash value of each block is calculated, and the hash values are compared with the hash value of the last full backup in the backup storage machine 22 one by one, if the hash values are the same, the same data block does not need to be repeatedly backed up, and if the hash values are different, only different data blocks need to be backed up. It should be noted that the specific backup details are not within the range considered by the present invention, and the present invention only needs to consider the hash value of the entire disk device to perform disk-level deduplication.
Specifically, after obtaining disk data according to a disk snapshot, the source virtualization platform host 21 calls a disk interface, and maps disks on the entire chain (all disks in the full backup list and the incremental backup list form a chain) to a block device; then calling an open function of the C language, and mapping the block device into a disk address space on a file pointer; then calling the lead or fread function of the C language, reading the data in the file pointer, and circularly reading: reading a small part of data according to the Offset until all data are read; finally, the network transceiving library is called, and the read data is sent to the backup storage machine 22.
The backup storage device 22 is configured to copy data of a corresponding disk sent by the source virtualization platform host 21 according to a backup designation disk task issued by the virtual machine backup control device 20, and send backed-up disk information to the virtual machine backup control device 20.
Specifically, the backup storage machine 22 calls a disk interface to create an empty disk; then calling an open function of the C language, and mapping the block device into a disk address space on a file pointer; calling the pwrite or fwrite function of the C language and the network transceiving library, and circularly writing the data received by the network transceiving library into a disk: a small portion of data is written each time according to the offset amount until all of the data is written.
In an embodiment of the present invention, the disk interface includes, but is not limited to: a Qemu-img command line interface provided by the Qemu simulator; VDDK interface provided by VMware; a HyperDisk interface provided by HyperV; and other interfaces implemented by various manufacturers based on the interface packaging or secondary development.
Fig. 3 is a flowchart illustrating steps of a virtual machine backup control method according to the present invention. As shown in fig. 3, the method for controlling backup of a virtual machine according to the present invention includes the following steps:
step S1, the virtual machine backup control device obtains the backup virtual machine disk information to be backed up from the source virtualization platform host.
Specifically, after the virtual machine backup control device obtains tasks of one or more virtual machines of a backup source virtualization platform host, a corresponding virtual machine list, a disk list corresponding to each virtual machine, and metadata corresponding to each disk are obtained from the source virtualization platform host, that is, the backup virtual machine disk information includes but is not limited to: the virtual machine management system comprises a virtual machine list (1 or a plurality of virtual machines), a disk list (1 Block or a plurality of disks) corresponding to each virtual machine, and metadata corresponding to each disk, wherein the metadata comprises a disk identifier and a disk hash check value or CBT (Changed Block Tracking) change Block information. The disk identifier includes a universally Unique identifier uuid (universal Unique identifier) of the disk and a disk file identifier (generally, a random string serial number), and the Hash Check value refers to a Hash Check value calculated on disk data by using a Message Digest algorithm (Message Digest), or a Hash Check value calculated on disk data by using a Secure Hash Standard (Secure Hash Standard), or a Hash Check value calculated on disk data by using a Cyclic Redundancy Check (Cyclic Redundancy Check).
In the embodiment of the present invention, all disks in the obtained disk list may be added to a disk backup task list. The disk backup task list comprises a disk identifier of each disk and corresponding disk hash check value or CBT change block information.
Preferably, in step S1, after all disks in the disk list are added to the disk backup task list, the disk backup task list is traversed to find a duplicate disk device, and if there is a duplicate disk device, the duplicate disk device is deleted from the disk backup task list. Specifically, the disk backup task list is traversed to find whether a disk device with a disk identifier exists, and if so, the disk device is deleted from the disk backup task list.
In step S2, the virtual machine backup control device acquires the disk information of the backed-up disk from the backup storage device. In the present invention, the disk information of the backed-up disk includes a backed-up disk list (1 or multiple disks), and a disk identifier, a disk hash check value, or CBT change block information corresponding to each disk.
Step S3, the virtual machine backup control device compares the backup virtual machine disk information obtained in step S1 with the backed-up disk information obtained in step S2 to find whether there is a duplicate disk, deletes the duplicate disk device in the obtained backup virtual machine disk information if the duplicate disk is found, and forms a full backup list or an incremental backup list according to the comparison result if the duplicate disk is not found.
In the embodiment of the present invention, according to the disk backup task list obtained in step S1 and the backed-up disk list obtained in step S2, the duplicate disk identifier and the disk data consistency comparison are searched in a traversal manner to search for duplicate disk devices in the disk backup task list and the backed-up disk list, and the found duplicate disk devices are deleted in the disk backup task list. Specifically, disk identifiers of each disk in the disk backup task list are traversed and compared with a backed-up disk list one by one, and if a disk with the same disk identifier is not found in the backed-up disk list, the currently compared disk in the disk backup task list is added into a full backup list; if the same disk identifier exists in the backed-up disk list, for the disks with the same disk identifier, further judging whether the data of the disk file hash check value or the data of the CBT change block are consistent according to the disk identifier, if so, judging that the disk device is a duplicate disk device, deleting the corresponding disk device in the disk backup task list, and if not, adding the currently compared disk in the disk backup task list into an incremental backup list.
And step S4, respectively issuing a backup designated disk task to the backup storage machine and issuing a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list.
Specifically, the virtual machine backup control device issues an instruction for creating an external snapshot for all disks in the full backup list and the incremental backup list to the source virtualization platform host, and issues an instruction for backing up all disk tasks in the full backup list and the incremental backup list to the backup storage machine, and the backup storage machine obtains full/incremental data of a specified disk from the source virtualization platform host to realize virtual machine backup.
Examples
Fig. 4 is a system architecture diagram of a virtual machine backup control system according to an embodiment of the present invention. In this embodiment, the source virtualization platform host is called an agent module, the control device is a control module, the backup storage machine is a backup module, and the virtual machine backup process is as follows:
step 1: after the control module (backup control device) acquires the task of backing up one or more virtual machines from the user side, the corresponding virtual machine list and the disk list corresponding to each virtual machine are acquired from the agent module (host), and then the step 2 is carried out.
Step 2: and the control module (backup control device) adds all the disks in the disk list into the disk backup task list, traverses and searches for the duplicate disk device, and then enters step 3.
And step 3: and if the duplicate disk device exists, entering the step 4. If the disk device is not duplicated, step 5 is entered.
And 4, step 4: the control module (backup control means) deletes the duplicate disk device in the disk backup task list, and then proceeds to step 5.
And 5: the control module (backup control device) acquires the backed-up disk list from the backup module (backup storage machine), compares the two disk lists, traverses and searches for the duplicate disk device, and then enters step 6.
Step 6: if the disk device is duplicated, go to step 7. If the disk device is not duplicated, step 8 is entered.
And 7: and the control module (backup control device) judges whether the two data are consistent according to the hash check value of the disk file or the CBT change block, if so, the step 9 is carried out, and if not, the step 10 is carried out.
And 8: the backup module (backup control means) adds such a disk to the full backup list, and then proceeds to step 11.
And step 9: the control module (backup control means) deletes the duplicate disk device in the disk backup task list.
Step 10: the control module (backup control means) adds such a disk to the incremental backup list and then proceeds to step 11.
Step 11: the control module (backup control device) issues an instruction to the agent module (host) to create an external snapshot for all disks in the 2 lists, and then proceeds to step 12.
Step 12: the control module (backup control means) issues an instruction to the backup module (backup storage machine) to backup the disks in the 'full volume/incremental backup list', and then proceeds to step 13.
Step 13, the agent module (host) acquires the full/incremental data of the instruction disk based on the snapshot, transmits the full/incremental data to the backup module (backup storage machine), and enters step 14;
step 14: the backup module (backup storage) obtains the full/incremental data of the specified disk from the agent module (host), and then proceeds to step 15.
Step 15: step 14 is looped until the backup module (backup storage) retrieves all the data.
In summary, the virtual machine backup control device, system and method of the present invention implement virtual machine backup by comparing differences between multiple disk data, so as to solve the problems of low backup speed, large storage space occupation and excessive network bandwidth resource consumption caused by repeatedly reading redundant data in a large-scale virtual machine backup system.
The foregoing embodiments are merely illustrative of the principles and utilities of the present invention and are not intended to limit the invention. Modifications and variations can be made to the above-described embodiments by those skilled in the art without departing from the spirit and scope of the present invention. Therefore, the scope of the invention should be determined from the following claims.

Claims (10)

1. A virtual machine backup control apparatus comprising:
the system comprises a backup virtual machine disk information acquisition unit, a backup virtual machine disk information acquisition unit and a backup virtual machine disk information acquisition unit, wherein the backup virtual machine disk information acquisition unit is used for acquiring backup virtual machine disk information to be backed up from a source virtualization platform host machine;
a backed-up disk information acquiring unit for acquiring the disk information of the backed-up disk from the backup storage machine;
the comparison unit is used for comparing the obtained backup virtual machine disk information with the backed-up disk information, searching whether a duplicate disk exists or not, deleting the duplicated disk equipment in the obtained backup virtual machine disk information if the duplicate disk is found, and forming a full backup list or an incremental backup list according to a comparison result;
and the backup control unit is used for respectively issuing a backup designated disk task to the backup storage machine and issuing a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list so as to realize virtual machine backup.
2. The virtual machine backup control apparatus according to claim 1, wherein: the backup virtual machine disk information includes, but is not limited to, a virtual machine list, a disk list corresponding to each virtual machine, and metadata corresponding to each disk, where the metadata includes, but is not limited to, a disk identifier and a disk hash check value or CBT change block information.
3. The virtual machine backup control apparatus according to claim 2, wherein: the disk information of the backed-up disk includes, but is not limited to, a backed-up disk list, and a disk identifier, a disk hash check value, or CBT change block information corresponding to each disk.
4. The virtual machine backup control apparatus according to claim 3, wherein: and the backup virtual machine disk information acquisition unit adds all disks in the acquired disk list into a disk backup task list, wherein the disk backup task list comprises a disk identifier of each disk and a corresponding disk hash check value or CBT change block information.
5. The virtual machine backup control apparatus according to claim 4, wherein: the comparison unit compares the disk identifiers of the disks in the disk backup task list in a traversal mode with the backed-up disk list one by one, and if the disks with the same disk identifiers are not found in the backed-up disk list, the currently compared disks in the disk backup task list are added into a full backup list; if the same disk identifier exists in the backed-up disk list, for the disks with the same disk identifier, further judging whether the data of the disk file hash check value or the data of the CBT change block are consistent according to the disk identifier, if so, judging that the disk device is a duplicate disk device, deleting the corresponding disk device in the disk backup task list, and if not, adding the currently compared disk in the disk backup task list into an incremental backup list.
6. The virtual machine backup control apparatus according to claim 5, wherein: the backup virtual machine disk information acquisition unit traverses the disk backup task list to search for repeated disk equipment after all disks in the disk list are added into the disk backup task list, and deletes the repeated disk equipment in the disk backup task list if the repeated disk equipment exists.
7. A virtual machine backup control system, comprising:
the virtual machine backup control device is used for acquiring backup virtual machine disk information from a source virtualization platform host machine, acquiring backed-up disk information from a backup storage machine, deleting the duplicated disk equipment by traversing and searching disk equipment for whether duplicated disk identifiers exist in the backup virtual machine disk information and the backed-up disk information or not and comparing the consistency of the disk data, acquiring a full backup list and an incremental backup list according to a comparison result, issuing a backup designated disk task to the backup storage machine according to the full backup list and the incremental backup list and issuing a snapshot creation task to the designated disk to the source virtualization platform host machine;
the source virtualization platform host is used for providing backup virtual machine disk information for the virtual machine backup control device, creating a snapshot on a specified disk according to a specified disk creation snapshot task issued by the virtual machine backup control device to ensure consistency of master disk data in a backup process, and acquiring full/incremental data according to the disk snapshot and transmitting the full/incremental data to the backup storage machine;
and the backup storage machine is used for copying data of the corresponding disk sent by the host machine of the source virtualization platform according to the specified disk backup task issued by the virtual machine backup control device and sending the backed-up disk information to the virtual machine backup control device.
8. The virtual machine backup control system according to claim 7, wherein: the source virtualization platform host machine calls a disk interface after acquiring full/incremental data according to the disk snapshot, and maps the disks on the whole chain into a block device; calling an open function of the C language, and mapping the block device into a disk address space on a file pointer; then calling a lead or fread function of the C language, reading data in a file pointer, and circularly reading to obtain all data; and calling a network transceiving library, and sending the read data to the backup storage machine.
9. The virtual machine backup control system according to claim 8, wherein: the backup storage machine calls a disk interface to create an empty disk; calling an open function of the C language, and mapping the block device into a disk address space on a file pointer; calling the pwrite or fwrite function of the C language and the network transceiving library, and circularly writing the data received by the network transceiving library into the disk.
10. A virtual machine backup control method comprises the following steps:
step S1, the virtual machine backup control device obtains the backup virtual machine disk information to be backed up from the source virtualization platform host machine;
step S2, the virtual machine backup control device obtains the disk information of the backed-up disk from the backup storage machine;
step S3, the virtual machine backup control device compares the backup virtual machine disk information obtained in step S1 with the backed-up disk information obtained in step S2 to find whether there is a duplicate disk, deletes the duplicate disk device in the obtained backup virtual machine disk information if the duplicate disk is found, and forms a full backup list or an incremental backup list according to the comparison result;
and step S4, respectively issuing a backup designated disk task to the backup storage machine and issuing a snapshot creating task for the designated disk to the source virtualization platform host machine according to the full backup list and the incremental backup list.
CN202011102166.6A 2020-10-15 2020-10-15 Virtual machine backup control device, system and method Pending CN112181729A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011102166.6A CN112181729A (en) 2020-10-15 2020-10-15 Virtual machine backup control device, system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011102166.6A CN112181729A (en) 2020-10-15 2020-10-15 Virtual machine backup control device, system and method

Publications (1)

Publication Number Publication Date
CN112181729A true CN112181729A (en) 2021-01-05

Family

ID=73950330

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011102166.6A Pending CN112181729A (en) 2020-10-15 2020-10-15 Virtual machine backup control device, system and method

Country Status (1)

Country Link
CN (1) CN112181729A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115344423A (en) * 2022-08-01 2022-11-15 和元达信息科技有限公司 Backup method, mount recovery method, server and computer-readable storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103593263A (en) * 2013-11-20 2014-02-19 上海爱数软件有限公司 Hyper-v virtual machine backup method based on backup knowledge base and file attributes
CN103605588A (en) * 2013-11-21 2014-02-26 上海爱数软件有限公司 Disk backup method of virtue machine
CN105068856A (en) * 2015-07-16 2015-11-18 武汉噢易云计算有限公司 Mirror image snapshooting based on virtual machine backup system and backup method
US9535907B1 (en) * 2010-01-22 2017-01-03 Veritas Technologies Llc System and method for managing backup operations of virtual machines
CN107544871A (en) * 2017-07-21 2018-01-05 新华三云计算技术有限公司 A kind of disk backup method of virtue machine and device
CN109582443A (en) * 2018-12-06 2019-04-05 国网江西省电力有限公司信息通信分公司 Virtual machine standby system based on distributed storage technology
CN111399984A (en) * 2020-03-19 2020-07-10 上海英方软件股份有限公司 File recovery method and system based on virtual machine backup data
CN111651238A (en) * 2020-06-03 2020-09-11 上海英方软件股份有限公司 System and method for acquiring effective data in virtual machine protection

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9535907B1 (en) * 2010-01-22 2017-01-03 Veritas Technologies Llc System and method for managing backup operations of virtual machines
CN103593263A (en) * 2013-11-20 2014-02-19 上海爱数软件有限公司 Hyper-v virtual machine backup method based on backup knowledge base and file attributes
CN103605588A (en) * 2013-11-21 2014-02-26 上海爱数软件有限公司 Disk backup method of virtue machine
CN105068856A (en) * 2015-07-16 2015-11-18 武汉噢易云计算有限公司 Mirror image snapshooting based on virtual machine backup system and backup method
CN107544871A (en) * 2017-07-21 2018-01-05 新华三云计算技术有限公司 A kind of disk backup method of virtue machine and device
CN109582443A (en) * 2018-12-06 2019-04-05 国网江西省电力有限公司信息通信分公司 Virtual machine standby system based on distributed storage technology
CN111399984A (en) * 2020-03-19 2020-07-10 上海英方软件股份有限公司 File recovery method and system based on virtual machine backup data
CN111651238A (en) * 2020-06-03 2020-09-11 上海英方软件股份有限公司 System and method for acquiring effective data in virtual machine protection

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115344423A (en) * 2022-08-01 2022-11-15 和元达信息科技有限公司 Backup method, mount recovery method, server and computer-readable storage medium

Similar Documents

Publication Publication Date Title
US20210294510A1 (en) Deduplication replication in a distributed deduplication data storage system
US11550675B2 (en) Remote data replication method and system
US20210334002A1 (en) Redundant distributed data storage system
US20230251945A1 (en) Snapshot-based disaster recovery orchestration of virtual machine failover and failback operations
US10664352B2 (en) Live browsing of backed up data residing on cloned disks
US10831608B2 (en) Systems and methods for performing data management operations using snapshots
US11321291B2 (en) Persistent version control for data transfer between heterogeneous data stores
US8689047B2 (en) Virtual disk replication using log files
CN111078667B (en) Data migration method and related device
CN109144416B (en) Method and device for querying data
US9223797B2 (en) Reparse point replication
US10114703B2 (en) Flash copy for disaster recovery (DR) testing
CN115098299B (en) Backup method, disaster recovery method, device and equipment of virtual machine
CN104583966A (en) Backup and restore system for a deduplicated file system and corresponding server and method
CN110121694A (en) A kind of blog management method, server and Database Systems
CN112181729A (en) Virtual machine backup control device, system and method
CN110121712A (en) A kind of blog management method, server and Database Systems
US20230153010A1 (en) Pruning data segments stored in cloud storage to reclaim cloud storage space
CN105159790A (en) Data rescue method and file server
US11442815B2 (en) Coordinating backup configurations for a data protection environment implementing multiple types of replication

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210105

WD01 Invention patent application deemed withdrawn after publication