tensorboard-reducer

Reduce multiple TensorBoard runs to new event (or CSV) files

These details have not been verified by PyPI

Project links

Homepage

Project description

TensorBoard Reducer

For a similar project built for TensorFlow rather than PyTorch, see tensorboard-aggregator.

Compute reduced statistics (mean, std, min, max, median or any other numpy operation) of multiple TensorBoard run directories. This can be used e.g. when training multiple identical models (such as deep ensembles) to reduce the noise in their loss/accuracy/error curves and establish statistical significance in performance improvements. The aggregation results can be saved to disk either as new TensorBoard event files or as CSV.

Requires PyTorch and TensorBoard. No TensorFlow installation required.

Installation

pip install tensorboard-reducer

Usage

CLI

tb-reducer runs/of-your-model* -o output-dir -r mean,std,min,max

All positional CLI arguments are interpreted as input directories and expected to contain TensorBoard event files. These can be specified individually or with wildcards using shell expansion. You can check you're getting the right input directories by running echo runs/of-your-model* before passing them to tb-reducer.

Note: By default, TensorBoard Reducer expects event files to contain identical tags and equal number of steps for all scalars. If you trained one model for 300 epochs and another for 400 and/or recorded different sets of metrics (tags in TensorBoard lingo) for each of them, see CLI flags --lax-steps and --lax-tags to disable this safeguard.

Mean of 3 TensorBoard logs

In addition, tb-reducer has the following flags:

-o/--outpath (required): File or directory where to save output on disk. Will save as a CSV file if path ends in '.csv' extension or else as TensorBoard run directories, one for each reduction suffixed by the operation's name, e.g. 'outpath-mean', 'outpath-max', etc. If output format is CSV, a single file will be created with two-level header containing one column for each combination of tag and reduce operation. Tag names will be in top-level header, reduce op in second level. Hint: Use pandas.read_csv("path/to/file.csv", header=[0, 1], index_col=0) to read CSV data back into a multi-index dataframe.
-r/--reduce-ops (optional, default: mean): Comma-separated names of numpy reduction ops (mean, std, min, max, ...). Each reduction is written to a separate outpath suffixed by its op name. E.g. if outpath='reduced-run', the mean reduction will be written to 'reduced-run-mean'.
-f/--overwrite (optional, default: False): Whether to overwrite existing output directories/CSV files. For safety, the overwrite operation will abort with an error if the file/directory to overwrite is not a CSV and does not look like a TensorBoard run directory (i.e. does not start with 'events.out').
--lax-tags (optional, default: False): Allow different runs have to different sets of tags. In this mode, each tag reduction will run over as many runs as are available for a given tag, even if that's just one. Proceed with caution as not all tags will have the same statistics in downstream analysis.
--lax-steps (optional, default: False): Allow tags across different runs to have unequal numbers of steps. In this mode, each reduction will only use as many steps as are available in the shortest run (same behavior as zip(short_list, long_list) which stops when short_list is exhausted).
--handle-dup-steps (optional, default: None): How to handle duplicate values recorded for the same tag and step in a single run. One of 'keep-first', 'keep-last', 'mean'. 'keep-first/last' will keep the first/last occurrence of duplicate steps while 'mean' computes their mean. Default behavior is to raise AssertionError on duplicate steps.
--min-runs-per-step (optional, default: None): Minimum number of runs across which a given step must be recorded to be kept. Steps present across less runs are dropped. Only plays a role if lax_steps is true. Warning: Be aware that with this setting, you'll be reducing variable number of runs, however many recorded a value for a given step as long as there are at least --min-runs-per-step. In other words, the statistics of a reduction will change mid-run. Say you're plotting the mean of an error curve, the sample size of that mean will drop from, say, 10 down to 4 mid-plot if 4 of your models trained for longer than the rest. Be sure to remember when using this.
-v/--version (optional): Get the current version.

Python API

You can also import tensorboard_reducer into a Python script for more complex operations. A simple example that makes use of the full Python API (load_tb_events, reduce_events, write_csv, write_tb_events) to get you started:

from glob import glob

import tensorboard_reducer as tbr

in_dirs = glob("glob_pattern/of_directories_to_reduce*")
out_dir = "path/to/output_dir"
out_csv = "path/to/out.csv"
overwrite = False
reduce_ops = ("mean", "min", "max")

events_dict = tbr.load_tb_events(in_dirs)

n_scalars = len(events_dict)
n_steps, n_events = list(events_dict.values())[0].shape

print(
    f"Loaded {n_events} TensorBoard runs with {n_scalars} scalars and {n_steps} steps each"
)
print(", ".join(events_dict))

reduced_events = tbr.reduce_events(events_dict, reduce_ops)

for op in reduce_ops:
    print(f"Writing '{op}' reduction to '{out_dir}-{op}'")

tbr.write_tb_events(reduced_events, out_dir, overwrite)

print(f"Writing results to '{out_csv}'")

tbr.write_csv(reduced_events, out_csv, overwrite)

print("Reduction complete")

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.3.1

Sep 21, 2023

0.3.0

Feb 20, 2023

0.2.10

Sep 4, 2022

0.2.9

Jul 6, 2022

0.2.8

Jun 29, 2022

0.2.7

Mar 16, 2022

0.2.5

Jan 24, 2022

This version

0.2.4

Jan 8, 2022

0.2.3

Jun 9, 2021

0.2.2

Jun 6, 2021

0.2.1

Jun 1, 2021

0.2.0

Jun 1, 2021

0.1.6

Apr 23, 2021

0.1.5

Apr 21, 2021

0.1.4

Apr 12, 2021

0.1.3

Apr 5, 2021

0.1.2

Apr 5, 2021

0.1.1

Apr 4, 2021

0.1.0

Apr 4, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tensorboard-reducer-0.2.4.tar.gz (14.2 kB view details)

Uploaded Jan 8, 2022 Source

Built Distribution

tensorboard_reducer-0.2.4-py2.py3-none-any.whl (14.9 kB view details)

Uploaded Jan 8, 2022 Python 2 Python 3

File details

Details for the file tensorboard-reducer-0.2.4.tar.gz.

File metadata

Download URL: tensorboard-reducer-0.2.4.tar.gz
Upload date: Jan 8, 2022
Size: 14.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.10.1

File hashes

Hashes for tensorboard-reducer-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`9066aa6be315eb1282aafda6c52cd69ef30dfa1fde7e562f7b05f93db2718145`
MD5	`7bfa2623ca5fd10f170c4c1b91eab876`
BLAKE2b-256	`1004a496250e8b9927b0600617a648ab82ac5a7cbeba58661039c68141562bb5`

See more details on using hashes here.

File details

Details for the file tensorboard_reducer-0.2.4-py2.py3-none-any.whl.

File metadata

Download URL: tensorboard_reducer-0.2.4-py2.py3-none-any.whl
Upload date: Jan 8, 2022
Size: 14.9 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.10.1

File hashes

Hashes for tensorboard_reducer-0.2.4-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`7573df70cb3684b0473405e6389151ef0c197da523f2955100faf4c159e2c281`
MD5	`15dea7879ac8483c1d73221f2af4d739`
BLAKE2b-256	`6f2a74b91f7d55f0ac94032f162efec79f26bb8f814aac80f0f386b45727f604`