Create your backups based on docker labels
Project description
Docker volume dump
A tool to help archive data from container running in a docker container.
- Database backup support for Postgresql, Mysql/Mariadb: it create a backup in the container through docker API, then retrieves data to save it in a proper place.
rsync
volume data: it introspect container and rsync all declaredlocal volume
andbind
mount points.
note: at the moment it's not possible to backup a database and rsync those volumes.
Usage
Using docker
docker run registry.gitlab.com/micro-entreprise/docker-volume-dump archive --help
For instance if you want to create dumps from different postgresql container in a docker swarm environment this would looks likes:
docker service create \
-d \
--mode global \
--name pgsql-dumps \
--restart-condition none \
--mount type=bind,src=/path-to-dump-storage/,dst=/backups \
--mount type=bind,src=/var/run/docker.sock,dst=/var/run/docker.sock \
--mount type=bind,src=/path-to-config-directory/,dst=/etc/archiver \
--network sentry_internal \
--with-registry-auth registry.gitlab.com/micro-entreprise/docker-volume-dump \
archive -f /etc/archiver/logging.json -r -s '{"label": ["docker-volume-dump.project='$PROJECT'","docker-volume-dump.environment='$ENVIRONMENT'"]}'
This script require access to the docker daemon to query docker labels.
It must be launched on each host using --mode global
.
Using python
pip install docker-volume-dump
archive --help
Configuration
The main idea is to tell the container how to manage its backups using docker labels, you can set following labels.
You can use a custom prefix if you like so using
ARCHIVER_LABEL_PREFIX
env variable. For instance if you setARCHIVER_LABEL_PREFIX=archiver
it would expect labels likesarchiver.isactive
instead of the defaultdocker-volume-dump.isactive
.
-
docker-volume-dump.driver: Optional (
pgsql
by default) kind of data to dump (could be one ofpgsql
,mysql
,rsync
). Only one value supported by container.Note:
mysql
driver is working for mariadb as well -
docker-volume-dump.isactive: Takes no value. Used by the default selector to determine if the Archiver backups are enabled on the container.
-
docker-volume-dump.project: A project name (the container name if not set)
-
docker-volume-dump.environment: An environment (staging, production, ...)
-
docker-volume-dump.prefix: A prefix for the dump file
Database labels (pgsql
/mysql
)
- docker-volume-dump.dbname: Required, the database name to dump.
- docker-volume-dump.username: DB role used to dump the database.
Required with
mysql
, fallback topostgres
if not set forpgsql
. - docker-volume-dump.password: DB password used to dump the db.
Required with
mysql
, not use withpgsql
driver
This will generate a file in a tree likes
<project>/[<environment>/]<prefix><dbname><date>
rsync
labels
note: I've chosen to rsync data first because tar/gzip rdiff-backup failed to compress data if other programs write content at the same time. My flow is something like data -> rsync -> rdiff-backup -> tar/gzip -> s3
- docker-volume-dump.rsync-params: params to add to rsync command
predifined (hardcoded) params are
rsync -avhP --delete
- docker-volume-dump.ro-volumes: If set to one of those values
["yes", "true", "t", "1"]
(not case sensitive) rsync read-only volumes as well for the given container.
This will generate a director per declared volume/bind
<project>/[<environment>/][<prefix>]<computed folder name>
Computed folder name is based on the path inside the container where
slash (/
) are replaced per dash (-
). ie:
- Project: test
- Environment: dev
- prefix:
rsynced_
- volume declare as
-v /some/path/on/host/machine:/path/to/data
- volume declare as
-v named-volume:/other/data
note: if archiver is running inside a container where host filesystem is mounted in
/hostfs
mind to use--host-fs /hostfs
option.
Would produce:
- /backups/test/dev/rsynced_path-to-data
- /backups/test/dev/rsynced_other-data
Roadmap
- pgsql/mysql: support multiple base per DBMS
- pgsql/mysql: if dbname is not provide retreive db list to detect the DB to dump
- wondering if the way use to query docker labels is compliant with k8s
- In swarm investigate to launch only once container (not on each hosts)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file docker-volume-dump-0.5.0.tar.gz
.
File metadata
- Download URL: docker-volume-dump-0.5.0.tar.gz
- Upload date:
- Size: 9.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b73af89432664dd1b74b4cede981d46657d46200fa7466129ca5bbe089a9a593 |
|
MD5 | be3a6bf66919b2f6f5769c887058a671 |
|
BLAKE2b-256 | e931fab20ed5e0eb74bc7bfcd5716282828780c67bc4c5a918f9a787278fa399 |
File details
Details for the file docker_volume_dump-0.5.0-py3-none-any.whl
.
File metadata
- Download URL: docker_volume_dump-0.5.0-py3-none-any.whl
- Upload date:
- Size: 8.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f08423687e3c4f14c57b2807f37be989590cf03d5df7e8c9c3c057e403f8e5c |
|
MD5 | 4dc052b20f2f27ef19ff33b320247b44 |
|
BLAKE2b-256 | e3b3041e547b74ab3bdf5502be130fbca7d8aee7c6c051890184a53f5e484a88 |