Backup and Restore
Prerequisites

Install Local PV ZFS version 1.0.0 or later before using Backup and Restore. Refer to the Local PV ZFS installation steps to install the driver.
| Project | Minimum Version |
|---|---|
| Local PV ZFS | 1.0.0+ |
| Velero | 1.5+ |
| Velero-Plugin | 3.6.0+ |
Note:
- To work with velero-plugin version 2.7.0 and above (which adds support for restore on encrypted zpools), the Local PV ZFS driver must be updated to at least version 1.5.0.
- With Velero versions v1.5.2 and v1.5.3, there is an issue where PVs are not cleaned up for restored volumes.
Setup
Install Velero Binary

Follow the steps in the Velero documentation to install the velero CLI.
Install Velero

Set up the credential file.
Install Velero using the command below:
If you want to use cloud storage like AWS-S3 buckets for storing backups, use a command like the following:
Install Velero version 1.5 or later for Local PV ZFS.
Deploy MinIO

Deploy MinIO to store the backup:
The above MinIO deployment uses a tmp directory inside the pod to store the data, so the backed-up data is lost when the pod restarts. To keep the data across restarts, change the above YAML to use persistent storage.
Check that the Velero pods are up and running.
Set Up the Local PV ZFS Velero Plugin

Install the Velero plugin for Local PV ZFS using the command below:
Install velero-plugin version 3.6.0 or later, which supports Local PV ZFS. Once the setup is done, backups and restores can be created.
Create Backup

Three kinds of backups can be created for Local PV ZFS. Let us go through them one by one:
Create the Full Backup

To take a full backup, create the VolumeSnapshotLocation as below:
The VolumeSnapshotLocation specifies where the snapshot should be stored. Here we have to provide the namespace that was set as the OPENEBS_NAMESPACE env while deploying Local PV ZFS. The Local PV ZFS operator YAMLs use "openebs" as the default value for the OPENEBS_NAMESPACE env. Verify the VolumeSnapshotLocation:
Now we can run the velero backup command with the above VolumeSnapshotLocation, and the Local PV ZFS plugin will take the full backup. Use the velero command below to create the full backup. All the namespaces to be backed up can be listed in comma-separated format in the --include-namespaces parameter.
We can check the backup status using the velero backup get command:
Once the Status is Completed, the backup has been taken successfully.
Create the Scheduled Full Backup

To create a scheduled full backup, create the VolumeSnapshotLocation in the same way as for the full backup above:
Update the above VolumeSnapshotLocation with the namespace and other fields accordingly. Verify the VolumeSnapshotLocation:
Now we can create a backup schedule using the above VolumeSnapshotLocation, and the Local PV ZFS plugin will take a full backup of the resources periodically. For example, to take a full backup every 5 minutes, we can create the schedule below:
Velero will start taking a backup of the namespaces mentioned in --include-namespaces every 5 minutes.
We can check the backup status using the velero backup get command:
Scheduled backups are named in the <schedule name>-<timestamp> format. Once the Status is Completed, the backup has been taken successfully. Velero then takes the next backup after 5 minutes and keeps doing so periodically.
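The naming format above can be picked apart with a few lines of Python, which may help when sorting or filtering backups by creation time. This is only a sketch: it assumes the timestamp is a 14-digit YYYYMMDDHHMMSS string, as in the backup name schd-20201012133010 used later in this document, and the helper name parse_scheduled_backup_name is made up for illustration.

```python
from datetime import datetime


def parse_scheduled_backup_name(name: str) -> tuple[str, datetime]:
    """Split '<schedule name>-<timestamp>' into the schedule name and a datetime."""
    # Split on the last hyphen so schedule names that contain hyphens still work.
    schedule, sep, stamp = name.rpartition("-")
    if not sep or not schedule:
        raise ValueError(f"not a scheduled backup name: {name!r}")
    # Assumption: the timestamp is 14 digits in YYYYMMDDHHMMSS form.
    return schedule, datetime.strptime(stamp, "%Y%m%d%H%M%S")


schedule, created = parse_scheduled_backup_name("schd-20201012133010")
print(schedule, created.isoformat())
```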
Create the Scheduled Incremental Backup

Incremental backup works for scheduled backups only. We can create the VolumeSnapshotLocation as below to create the incremental backup schedule:
Update the above VolumeSnapshotLocation with the namespace and other fields accordingly. Verify the VolumeSnapshotLocation:
Once a backup schedule is created using the above VolumeSnapshotLocation, the Local PV ZFS plugin will start taking incremental backups. Here, we have to provide the incrBackupCount parameter, which indicates how many incremental backups to keep before taking the next full backup. In the above case (incrBackupCount set to 3), the plugin creates a full backup first, then three incremental backups, then again a full backup followed by three incremental backups, and so on.
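The full/incremental cycle described above can be modelled in a few lines of Python. This is a sketch of the pattern as the text describes it, not the plugin's code; the function name backup_kind is hypothetical, and it assumes every scheduled backup completes, so the position in the sequence alone decides the kind.

```python
def backup_kind(sequence_number: int, incr_backup_count: int) -> str:
    """Return 'full' or 'incremental' for the n-th backup (1-based) of a schedule."""
    if sequence_number < 1 or incr_backup_count < 0:
        raise ValueError("sequence_number must be >= 1 and incr_backup_count >= 0")
    # One cycle is a full backup followed by incr_backup_count incremental backups.
    cycle_length = incr_backup_count + 1
    return "full" if (sequence_number - 1) % cycle_length == 0 else "incremental"


# With incrBackupCount = 3: backups 1, 5 and 9 are full, the rest are incremental.
kinds = [backup_kind(n, 3) for n in range(1, 10)]
print(kinds)
```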
For restore, the full backup and all the incremental backups in between must be available. Each incremental backup is linked to its previous backup, so this link must not be broken; otherwise the restore will fail.
Note that the incrBackupCount parameter defines how many incremental backups we want; it does not include the first full backup. While doing the restore, we only need to give the name of the backup we want to restore. The plugin identifies the incremental backup group, restores from the full backup, and keeps restoring the incremental backups up to the backup name provided in the restore command.
Now we can create a backup schedule using the above VolumeSnapshotLocation, and the Local PV ZFS plugin will take care of backing up the resources periodically. For example, to take an incremental backup every 5 minutes, we can create the schedule below:
Velero does not natively support incremental backups, so we have to set an appropriate ttl for the backups so that the complete incremental backup group is available for restore. For example, in the above case we are creating a schedule that takes a backup every 5 minutes, and the VolumeSnapshotLocation says to keep 3 incremental backups, so the ttl should be set to 5 min * (3 incr + 1 full) = 20 min or more. That way, the full backup and all the incremental backups are available for restore. If the ttl is not set correctly and the full backup gets deleted, we won't be able to use the incremental backups that depend on it, so make sure the correct ttl is set for the incremental backup schedule.
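The ttl arithmetic above generalizes to any schedule interval and incrBackupCount. The following Python sketch computes the lower bound only; the function name minimum_ttl is hypothetical, and in practice a larger ttl leaves some margin for backups that take time to complete.

```python
from datetime import timedelta


def minimum_ttl(interval: timedelta, incr_backup_count: int) -> timedelta:
    """Smallest ttl that keeps one full backup and all of its incrementals."""
    # interval * (incremental backups + the one full backup that starts the group)
    return interval * (incr_backup_count + 1)


ttl = minimum_ttl(timedelta(minutes=5), 3)
print(ttl)  # 0:20:00, i.e. 20 minutes
```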
We can check the backup status using the velero backup get command:
Explanation

Since we used incrBackupCount as 3 in the VolumeSnapshotLocation when creating the backups, the first backup will be a full backup and the next 3 backups will be incremental.
We do not need to know which backup is full and which is incremental. We can pick any backup in the list, and the plugin will find the corresponding full backup and restore from there all the way up to the backup name provided in the restore command. For example, if we want to restore schd-20201012133010, the plugin will restore in the order below:
It will stop at the 3rd backup, as we want to restore up to schd-20201012133010. For us, it will be as if we have restored the single backup schd-20201012133010, and we don't need to bother about whether a backup is incremental or full.
Suppose we want to restore schd-20201012134010 (the 5th backup). The plugin will restore only schd-20201012134010, as it is a full backup and we want to restore up to that point only.
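The restore order described above can be illustrated with a small Python sketch. It is a simplification under stated assumptions: the backup names are hypothetical placeholders, the list is complete and sorted oldest first, and the group is found by position arithmetic, whereas the plugin follows the links between backups.

```python
def restore_chain(backups: list[str], target: str, incr_backup_count: int) -> list[str]:
    """Return the backups to restore, in order, to reach `target`.

    `backups` must be sorted oldest first and start with a full backup.
    """
    index = backups.index(target)  # raises ValueError for an unknown backup
    cycle_length = incr_backup_count + 1
    # Step back to the full backup that starts the group containing the target.
    full_index = index - (index % cycle_length)
    return backups[full_index:index + 1]


# Hypothetical names standing in for six scheduled backups, oldest first.
backups = [f"schd-{n:02d}" for n in range(1, 7)]

third = restore_chain(backups, "schd-03", 3)
fifth = restore_chain(backups, "schd-05", 3)
print(third)  # ['schd-01', 'schd-02', 'schd-03']
print(fifth)  # ['schd-05']
```

The length of the returned list is also the number of backups that have to be applied, which is why a large incrBackupCount makes restores slower.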
Restore

We can restore the backup using the command below. A namespace mapping can be provided to restore into a different namespace. If no namespace mapping is provided, the backup is restored into the source namespace from which it was taken.
Now we can check the restore status:
Once the Status is Completed, we can check the pods in the destination namespace and verify that everything is up and running. We can also verify that the data has been restored.
Restore on a Different Node

Node affinity is set on the PV, and the ZFSVolume object has the original node name as the owner of the volume. If the original node is not present during the restore, the Pod will not reach the Running state. To handle this, we can use a Velero RestoreItemAction and create a config map that holds the node mapping, like below:
While doing the restore, the Local PV ZFS plugin will set the affinity on the PV as per the node mapping provided in the config map. In the above case, the PVs created on nodes pawan-old-node1 and pawan-old-node2 will be moved to pawan-new-node1 and pawan-new-node2 respectively.
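Conceptually, the node mapping is a lookup from the old node name to the new one. The Python sketch below shows that lookup with the node names from the example above; the function name target_node is hypothetical, and leaving unmapped nodes unchanged is an assumption of this sketch, not documented plugin behavior.

```python
# Old node name -> new node name, mirroring the data section of the config map.
node_mapping = {
    "pawan-old-node1": "pawan-new-node1",
    "pawan-old-node2": "pawan-new-node2",
}


def target_node(original_node: str, mapping: dict[str, str]) -> str:
    """Return the node a restored PV should be pinned to."""
    # Assumption: a node without an entry in the mapping keeps its original name.
    return mapping.get(original_node, original_node)


moved = target_node("pawan-old-node1", node_mapping)
unmapped = target_node("some-other-node", node_mapping)
print(moved, unmapped)
```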
Things to Consider

Once a VolumeSnapshotLocation has been created, we should never modify it; we should always create a new VolumeSnapshotLocation and use that instead. If it must be modified, clean up the old backups and schedules first, then modify it, and then create the backup or schedule again. Also, we should not switch the VolumeSnapshotLocation for a given scheduled backup; always create a new schedule if backups for the old schedule are present.
For incremental backups, the higher the value of incrBackupCount, the more time it takes to restore the volumes. So we should not keep a very high number of incremental backups.
Uninstall Velero

We can delete the Velero installation by using this command:
Reference

See the Velero documentation for all the supported commands and options for backup and restore.