Welcome to Freezer’s documentation

Freezer is a distributed backup restore and disaster recovery as a service platform. It is designed to be multi OS (Linux, Windows, OSX, BSD), focused on providing efficiency and flexibility for block based backups, file based incremental backups, point-in-time actions, jobs synchronization (i.e. backup synchronization over multiple nodes) and many other features. It is aimed at being useful for all environments, including large ephemeral Clouds.

Key Features

  • Backup your file system using point in time snapshot

  • Strong encryption supported: AES-256-CFB

  • Backup your file system tree directly (without volume snapshot)

  • Backup your journalled MongoDB directory tree using LVM snapshot to swift

  • Backup MySQL DB with LVM snapshot

  • Restore your data from a specific date automatically to your file system

  • Low storage consumption as the backup are uploaded as a stream

  • Flexible backup policy (incremental and differential)

  • Data is archived in GNU Tar format for file based incremental

  • Block based backup support (rsync)

  • Multiple compression algorithm support (zlib, bzip2, xz)

  • Remove old backup automatically according to the provided parameters

  • Multiple storage media support (Swift, local file system, ssh)

  • Flush kernel buffered memory to disk

  • Multi platform (Linux, Windows, *BSD, OSX)

  • Manage multiple jobs (i.e. multiple backups on the same node)

  • Synchronize backups and restore on multiple nodes

  • Web user interface integrated with OpenStack Horizon

  • Can execute scripts/commands before or after a job execution

Low Resources Requirement

Freezer is designed to reduce to the minimum I/O, CPU and Memory Usage. This is achieved by generating a data stream from tar (for archiving) and gzip (for compressing). Freezer segments the stream in a configurable chunk size (with the option –max-seg-size). The default segment size is 64MB, so it can be safely stored in memory, encrypted if the key is provided, and uploaded to Swift as a segment.

Multiple segments are sequentially uploaded using the Swift Manifest. All the segments are uploaded first, and then the Manifest file is uploaded too, so the data segments cannot be accessed directly. This ensures data consistency.

By keeping the segments small, in-memory, I/O usage is reduced. Also as there’s no need to store locally the final compressed archive (tar-gziped), no additional or dedicated storage is required for the backup execution. The only additional storage needed is the LVM snapshot size (set by default at 5GB). The lvm snapshot size can be set with the option –lvm-snapsize. It is important to not specify a too small snapshot size, because in case a quantity of data is being written to the source volume and consequently the lvm snapshot is filled up, then the data is corrupted.

If more memory is available for the backup process, the maximum segment size can be increased. This will speed up the process. Please note that the segments must be smaller then 5GB, since that is the maximum object size in the Swift server.

On the other hand, if a server has small memory availability, the –max-seg-size option can be set to lower values. The unit of this option is in bytes.

How the Incremental Works

The incremental backups is one of the most crucial features. The following basic logic happens when Freezer executes:

  1. Freezer starts the execution and checks if the provided backup name for the current node already exists in Swift.

  2. If the backup exists, then the Manifest file is retrieved. This is important as the Manifest file contains the information of the previous Freezer execution.

Freezer Components

Component

Description

Freezer Web UI

Web interface that interacts with the Freezer API to configure and change settings. It provides most of the features from the Freezer Agent CLI, advanced scheduler settings such as multi-node backup synchronization, metrics, and reporting.

Freezer Scheduler

A client side component, running on the node where the data backup is to be executed. It consists of a daemon that retrieves the data from the freezer API and executes jobs (i.e. backups, restore, admin actions, info actions, pre and/or post job scripts) by running the Freezer Agent. The metrics and exit codes returned by the freezer agent are captured and sent to the Freezer API. The scheduler manages the execution and synchronization of multiple jobs executed on a single or multiple nodes. The status of the execution of all the nodes is saved through the API. The Freezer scheduler takes care of uploading jobs to the API by reading job files on the file system. It also has its own configuration file where job session or other settings like the freezer API polling interval can be configured. The Freezer scheduler manages jobs, for more information about jobs please refer to: freezer_api/README.rst under JOB the sections

Freezer Agent

Multiprocessing Python software that runs on the client side, where the data backup is to be executed. It can be executed standalone or by the Freezer Scheduler. The Freezer Agent provides a flexible way to execute backup, restore and other actions on a running system. In order to provide flexibility in terms of data integrity, speed, performance, resources usage, etc the freezer agent offers a wide range of options to execute optimized backup according to the available resources as: # Segments size (the amount of memory used) # Queues size (optimize backups where I/O, bandwidth, memory or CPU is a constraint) # I/O Affinity and process priority (it can be used with real time I/O and maximum user level process priority) # Bandwidth limitation # Client side Encryption (AES-256-CFB) # Compression (multiple algorithms supported as zlib, bzip2, xz/lzma) # Parallel upload to pluggable storage media (i.e., upload backup to swift and to a remote node by ssh,

or upload to two or more independent swift instances with different credentials, etc)

# Execute file based incremental (like tar), block based incremental (like rsync algorithm) and differential based backup and restore # Multi-platform as it can be run on Linux, Windows, *BSD and OSX # Automatic removal of old backups

Freezer API

The API is used to store and provide metadata to the Freezer Web UI and to the Freezer Scheduler. Also the API is used to store session information for multi node backup synchronization. No workload data is stored in the API. For more information to the API please refer to: freezer_api/README.rst

DB

Backend used by the API to store and retrieve metrics, metadata sessions information, job status, etc. The DB support Elasticserch and Sqlalchemy.

Architecture

Freezer may seem complex at first look. But Following diagrams will give you better understanding of the architecture:

Service Architecture

_images/Service_Architecture_02.png

Agent Backup Work Flow with API

_images/freezer_agent_backup_api.png

Agent Backup Without API

_images/freezer_agent_backup.png

Freezer Scheduler with API

_images/freezer_scheduler_api.png

Freezer Job Session

_images/job_session.png

Freezer Dashboard

_images/freezer_dashboard.png

How to Scale

_images/freezer_scheduler_api_scale.png

How to Use

Install Guides

User Guides

Admin Guides

Dev Guides