prometheus-dirsize-exporter

Export directory size metrics efficiently.

Why?

When providing multi-user interactive computing services (with an HPC cluster or with JupyterHub), it's very helpful to know the home directory sizes of each user over time. However, as NFS is often used, running du constantly takes a long time, consumes IOPS that may be in short supply, and is plain inefficient.

This project provides a way to keep track of directory sizes with a budgeted amount of IOPS. You can ask it to take however much time it needs but not use more than 100 IOPS, and it will do that. We do not necessarily need very up-to-date directory size metrics, so this trade-off is acceptable.
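To make the idea of an IOPS budget concrete, here is a minimal sketch of a per-second budget. It is an illustration only and not necessarily how this exporter is implemented; the class name `IOPSBudget`, its `spend` method, and the injectable `clock` and `sleep` parameters are all hypothetical.

```python
import time


class IOPSBudget:
    """Allow at most `limit` operations per one-second window (illustrative only)."""

    def __init__(self, limit, clock=time.monotonic, sleep=time.sleep):
        self.limit = limit
        self._clock = clock    # injectable so the class can be tested without waiting
        self._sleep = sleep
        self._window_start = clock()
        self._used = 0

    def spend(self, n=1):
        """Record n operations, sleeping first if the current window is full."""
        now = self._clock()
        if now - self._window_start >= 1.0:
            # A new one-second window has started: reset the counter.
            self._window_start = now
            self._used = 0
        if self._used + n > self.limit:
            # Budget exhausted: wait out the rest of this window, then start a new one.
            self._sleep(max(0.0, 1.0 - (now - self._window_start)))
            self._window_start = self._clock()
            self._used = 0
        self._used += n
```

A directory walker would call `spend()` before each filesystem operation, so a scan takes as long as it needs while staying under the limit.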

Installation

Install the package from PyPI:

pip install prometheus-dirsize-exporter

Running

You can start the process from the command line:

dirsize-exporter <path-to-parent-directory> <iops-budget> <wait-time-in-minutes>

For example, to export metrics about directories under /home, using no more than 200 IO operations per second and updating every 60 minutes, you would run:

dirsize-exporter /home 200 60

You can check out the metrics by hitting http://localhost:8000. The port can be controlled via a --port argument.
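As a quick way to inspect the output programmatically, the sketch below fetches the endpoint and parses the Prometheus text exposition format into a dictionary. The helper names `parse_samples` and `fetch_samples` are hypothetical. The parser is deliberately simple: it assumes one sample per line with no trailing timestamp and no spaces inside label values.

```python
from urllib.request import urlopen


def parse_samples(text):
    """Map each series (metric name plus labels) to its float value."""
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        # Assumes "series value" with no trailing timestamp.
        series, value = line.rsplit(" ", 1)
        samples[series] = float(value)
    return samples


def fetch_samples(url="http://localhost:8000"):
    """Fetch and parse metrics from a running exporter (default port from above)."""
    with urlopen(url) as resp:
        return parse_samples(resp.read().decode("utf-8"))
```

With the exporter running, `fetch_samples()` returns every exported series. Pass a different URL if you changed the port with --port.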

Metrics recorded

The following metrics are recorded for all top-level subdirectories of the parent directory:

  • Total Size (in bytes)
  • Last Modified (including all the descendants)
  • Total Number of Entries (Files, directories & symlinks)
  • Processing Time required to gather this information. This is only reported if the --enable-detailed-processing-time-metric flag is passed, to prevent a possible explosion in the stored size of Prometheus metrics when collected. This information is also not particularly useful outside of debugging this exporter and, as it varies on each run, compresses poorly.
  • Last updated
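The first three metrics can be gathered in a single recursive walk. The sketch below shows one way to do that with `os.scandir`; it is not taken from the exporter's source. `DirStats` and `collect` are hypothetical names, and summing `st_size` over every entry (directories and symlinks included) is an assumption about what "total size" covers.

```python
import os
from dataclasses import dataclass


@dataclass
class DirStats:
    total_size: int = 0        # bytes, summed over every entry (assumption)
    latest_mtime: float = 0.0  # newest modification time among all descendants
    entry_count: int = 0       # files, directories and symlinks


def collect(path, stats=None):
    """Recursively accumulate size, latest mtime and entry count under path."""
    stats = stats if stats is not None else DirStats()
    with os.scandir(path) as entries:
        for entry in entries:
            # Do not follow symlinks: count the link itself, not its target.
            st = entry.stat(follow_symlinks=False)
            stats.total_size += st.st_size
            stats.latest_mtime = max(stats.latest_mtime, st.st_mtime)
            stats.entry_count += 1
            if entry.is_dir(follow_symlinks=False):
                collect(entry.path, stats)
    return stats
```

Calling `collect` once per top-level subdirectory of the parent directory would yield one `DirStats` per user directory. A budgeted version would additionally throttle each `scandir` and `stat` call.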

Limitations

  • As directory contents might change in the course of a single run as we wait for budgets to become available, information about a directory may not be exactly correct immediately.
  • Because we do not spread the IOPS through time, the IO usage is 'spiky' - all IOPS get done at the beginning of a second, and then it goes silent. If you have a big IOPS budget, this can cause performance degradation.
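The 'spiky' behaviour described in the second limitation can be pictured with a small model. `burst_schedule` mirrors the description above, where each second's budget is spent at the start of that second. `paced_schedule` is a hypothetical alternative that spreads operations evenly and is shown only for contrast; it is not something this exporter does. Both function names are made up for this sketch.

```python
def burst_schedule(ops, budget):
    """Time offsets in seconds when each second's budget is spent at its start."""
    return [float(i // budget) for i in range(ops)]


def paced_schedule(ops, budget):
    """Hypothetical alternative: the same operations spread evenly through time."""
    return [i / budget for i in range(ops)]


print(burst_schedule(5, 2))  # [0.0, 0.0, 1.0, 1.0, 2.0]
print(paced_schedule(4, 2))  # [0.0, 0.5, 1.0, 1.5]
```

With a large budget, the burst model issues many operations at the same instant, which is the source of the performance degradation mentioned above.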
