Cleanup old docker images to free up disk space and inodes
Project description
Docker Image Cleaner
A Python package (docker-image-cleaner
) and associated Docker image
(quay.io/jupyterhub/docker-image-cleaner
) to clean up old docker images when a
disk is running low on inodes or space.
The script has initially been developed to help installations of BinderHub clean up space on nodes as it otherwise can run out of space and stop being able to build now docker images.
Why?
Container images are one of the biggest consumers of disk space and inodes on kubernetes nodes. Kubernetes tries to make sure there is enough disk space on each node by garbage collecting unused container images and containers. Tuning this is important for binderhub installations, as many images are built and used only a couple times. However, on most managed kubernetes installations (like GKE, EKS, etc), we can not tune these parameters!
This script approximates the specific parts of the kubernetes container image garbage collection in a configurable way.
Requirements
- Only kubernetes nodes using the
docker
runtime are supported.containerd
orcri-o
container backends are not supported. - The script expects to run in a kubernetes
DaemonSet
, with/var/lib/docker
from the node mounted inside the container. This lets the script figure out how much disk space docker container images are actually using. - The
DaemonSet
should have aServiceAccount
attached that has permissions to talk to the kubernetes API and cordon / uncordon nodes. This makes sure new pods are not scheduled on to the node while image cleaning is happening, as it can take a while.
How does it work?
- Compute how much space
/var/lib/docker
directory (specified by thePATH_TO_CHECK
environment variable) is taking up. - If the disk space used is greater than the garbage collection trigger threshold
(specified by
IMAGE_GC_THRESHOLD_HIGH
), garbage collection is triggered. If not, the script just waits another 5 minutes (set byIMAGE_GC_INTERVAL
). - If garbage collection is triggered, the kubernetes node is first cordoned to prevent any new pods from being scheduled on it for the duration of the garbage collection.
- Unused container images are deleted one by one, starting with the biggest,
until the disk space used by
/var/lib/docker
falls below the garbage collection 'ok' threshold (specified byIMAGE_GC_THRESHOLD_LOW
). This low / high system makes sure we don't get too aggressive in cleaning the disk, as images being present on the node does speed up binderhub launches. - After the garbage collection is done, the kubernetes node is also uncordoned.
- When done, we wait another 5 minutes (set by
IMAGE_GC_INTERVAL
), and repeat the whole process.
Configuration options
Currently, environment variables are used to set configuration for now.
Env variable | Description | Default |
---|---|---|
PATH_TO_CHECK |
Path to /var/lib/docker directory used by the docker daemon |
/var/lib/docker |
IMAGE_GC_INTERVAL |
Amount of time (in seconds) to wait between checking if GC needs to be triggered | 300 |
IMAGE_GC_DELAY |
Amount of time (in seconds) to wait between deleting container images, so we don't DOS the docker API | 1 |
IMAGE_GC_THRESHOLD_TYPE |
Determine if GC should be triggered based on relative or absolute disk usage | relative |
IMAGE_GC_THRESHOLD_HIGH |
% or absolute disk space available (based on IMAGE_GC_THRESHOLD_TYPE ) when we start deleting container images |
80 |
IMAGE_GC_THRESHOLD_LOW |
% or absolute disk space available (based on IMAGE_GC_THRESHOLD_TYPE ) when we can stop deleting container images |
60 |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file docker-image-cleaner-1.0.0b1.tar.gz
.
File metadata
- Download URL: docker-image-cleaner-1.0.0b1.tar.gz
- Upload date:
- Size: 6.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 959d3cca7c03ff6ae402d174fee0bf78492f9fcdff5485f8ebe0dbcbd079d8e7 |
|
MD5 | f7cea26e15c3e42771c08a7bcbb3330a |
|
BLAKE2b-256 | bf550fd95f6eb546a6e6f30bd5bb0b872b7d4929254db20cf298a58a342bb7b1 |
File details
Details for the file docker_image_cleaner-1.0.0b1-py3-none-any.whl
.
File metadata
- Download URL: docker_image_cleaner-1.0.0b1-py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4b868718caaa23f000894a5d569957bb4e4304bac743de6ae54198e04b27b1f4 |
|
MD5 | ba3ed917bf1e5df954d71314fe7d8ffa |
|
BLAKE2b-256 | 8cd96c54c033702d04a03b8b1c2a89ea4472c074a3cbf489b85efff49d6785f3 |