pyaml

PyYAML-based module to produce a bit more pretty and readable YAML-serialized data

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- Public Domain
Programming Language
- Python
- Python :: 3.8
Topic

Project description

PyYAML-based python module to produce a bit more pretty and human-readable YAML-serialized data.

This module is for serialization only, see ruamel.yaml module for literate YAML parsing (keeping track of comments, spacing, line/column numbers of values, etc).

(side-note: to dump stuff parsed by ruamel.yaml with this module, use only YAML(typ='safe') there)

It’s a small module, and for projects that only need part of its functionality, I’d recommend copy-pasting that part in, instead of adding a janky dependency.

Repository URLs:

Warning

The prime goal of this module is to produce human-readable output that can be easily diff’ed, manipulated and re-used, though occasional issues are possible.

So please do not rely on it to always produce output that deserializes to exactly what was exported. Use PyYAML directly for that (but maybe with options from the next section).
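For data that has to survive a round trip unchanged, a quick check along these lines can be done with PyYAML alone. This is a minimal sketch: the sample `data` value is made up for illustration, and the keyword arguments are the ones used in the next section.

```python
import yaml

# Hypothetical sample data, only for illustration
data = {'name': 'example', 'enabled': False, 'ports': [80, 443], 'note': None}

# Serialize with plain PyYAML, then parse the text back
text = yaml.safe_dump(data, allow_unicode=True, default_flow_style=False)
restored = yaml.safe_load(text)

# For simple data like this, the parsed result matches the input
assert restored == data
```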

What this module does and why

YAML is generally a nice and easy format to read if it was written by humans.

PyYAML can do a fairly decent job of making stuff readable, and the best combination of parameters for such output that I’ve seen so far is probably this one:

>>> import sys, yaml
>>> m = [123, 45.67, {1: None, 2: False}, 'some text']
>>> data = dict(a='asldnsa\nasldpáknsa\n', b='whatever text', ma=m, mb=m)
>>> yaml.safe_dump(data, sys.stdout, allow_unicode=True, default_flow_style=False)
a: 'asldnsa

  asldpáknsa

  '
b: whatever text
ma: &id001
- 123
- 45.67
- 1: null
  2: false
- some text
mb: *id001

pyaml (this module) tries to improve on that a bit, with the following tweaks:

Most human-friendly representation options in PyYAML (that I know of) are used as defaults.
Dump “null” values as empty values, if possible, which have the same meaning but reduce visual clutter and are easier to edit.
Dicts, sets, OrderedDicts, defaultdicts, namedtuples, enums, dataclasses, etc are represented as their safe YAML-compatible base (like int, list or mapping), with mappings key-sorted by default for more diff-friendly output.
Use shorter and simpler yes/no for booleans.
List items get indented, as they should be.

Attempt is made to pick more readable string representation styles, depending on the value, e.g.:

>>> yaml.safe_dump(cert, sys.stdout)
cert: '-----BEGIN CERTIFICATE-----

  MIIH3jCCBcagAwIBAgIJAJi7AjQ4Z87OMA0GCSqGSIb3DQEBCwUAMIHBMRcwFQYD

  VQQKFA52YWxlcm9uLm5vX2lzcDEeMBwGA1UECxMVQ2VydGlmaWNhdGUgQXV0aG9y
...

>>> pyaml.p(cert)
cert: |
  -----BEGIN CERTIFICATE-----
  MIIH3jCCBcagAwIBAgIJAJi7AjQ4Z87OMA0GCSqGSIb3DQEBCwUAMIHBMRcwFQYD
  VQQKFA52YWxlcm9uLm5vX2lzcDEeMBwGA1UECxMVQ2VydGlmaWNhdGUgQXV0aG9y
...

“force_embed” option (default=yes) to avoid having &id stuff scattered all over the output. Might be more useful to disable it in some specific cases though.
“&id” anchors, if used, get labels from the keys they get attached to, not just meaningless enumerators.

“string_val_style” option that applies only to strings that are values, not keys, e.g.:

>>> pyaml.p(data, string_val_style='"')
key: "value\nasldpáknsa\n"
>>> yaml.safe_dump(data, sys.stdout, allow_unicode=True, default_style='"')
"key": "value\nasldpáknsa\n"

Add vertical spacing (empty lines) between keys on different depths, to visually separate long YAML sections in the output and make it easier to navigate.
Discard end-of-document “…” indicators for simple values.

Result for the (rather meaningless) example above:

>>> pyaml.p(data, force_embed=False, vspacing=dict(split_lines=10))

a: |
  asldnsa
  asldpáknsa

b: whatever text

ma: &ma
  - 123
  - 45.67
  - 1:
    2: no
  - some text

mb: *ma

(force_embed=False enables deduplication with the &ma anchor, and vspacing is adjusted to split even this tiny output)

Extended example:

>>> pyaml.dump(data, vspacing=dict(split_lines=10))

destination:

  encoding:
    xz:
      enabled: yes
      min_size: 5120
      options:
      path_filter:
        - \.(gz|bz2|t[gb]z2?|xz|lzma|7z|zip|rar)$
        - \.(rpm|deb|iso)$
        - \.(jpe?g|gif|png|mov|avi|ogg|mkv|webm|mp[34g]|flv|flac|ape|pdf|djvu)$
        - \.(sqlite3?|fossil|fsl)$
        - \.git/objects/[0-9a-f]+/[0-9a-f]+$

  result:
    append_to_file:
    append_to_lafs_dir:
    print_to_stdout: yes

  url: http://localhost:3456/uri

filter:
  - /(CVS|RCS|SCCS|_darcs|\{arch\})/$
  - /\.(git|hg|bzr|svn|cvs)(/|ignore|attributes|tags)?$
  - /=(RELEASE-ID|meta-update|update)$

http:
  ca_certs_files: /etc/ssl/certs/ca-certificates.crt
  debug_requests: no
  request_pool_options:
    cachedConnectionTimeout: 600
    maxPersistentPerHost: 10
    retryAutomatically: yes

logging:

  formatters:
    basic:
      datefmt: '%Y-%m-%d %H:%M:%S'
      format: '%(asctime)s :: %(name)s :: %(levelname)s: %(message)s'

  handlers:
    console:
      class: logging.StreamHandler
      formatter: basic
      level: custom
      stream: ext://sys.stderr

  loggers:
    twisted:
      handlers:
        - console
      level: 0

  root:
    handlers:
      - console
    level: custom

Note that unless there are many moderately wide and deep trees of data, which are expected to be read and edited by people, it might be preferable to directly use PyYAML regardless, as it won’t introduce another (rather pointless in that case) dependency and a point of failure.
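For the case where PyYAML is used directly and only block-style multiline strings are wanted, a custom representer on a SafeDumper subclass is one common way to get them. This is a sketch and not this module's code; `ReadableDumper` and `represent_str` are names made up for the example.

```python
import yaml

class ReadableDumper(yaml.SafeDumper):
    """SafeDumper subclass, so the tweak below stays local to this dumper."""

def represent_str(dumper, value):
    # Request literal block style (|) for multiline strings, default style otherwise
    style = '|' if '\n' in value else None
    return dumper.represent_scalar('tag:yaml.org,2002:str', value, style=style)

ReadableDumper.add_representer(str, represent_str)

# Hypothetical sample data with one multiline and one single-line string
data = {'note': 'first line\nsecond line\n', 'name': 'example'}
text = yaml.dump(data, Dumper=ReadableDumper, default_flow_style=False, allow_unicode=True)
print(text)

# The output is still regular YAML that parses back to the same data
assert yaml.safe_load(text) == data
```

Subclassing keeps the representer off the global `yaml.SafeDumper`, so other code in the same process that calls `yaml.safe_dump` is not affected.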

Some Tricks

Pretty-print any yaml or json (yaml subset) file from the shell:

```
% python -m pyaml /path/to/some/file.yaml
% curl -s https://www.githubstatus.com/api/v2/summary.json | python -m pyaml
```

Process and replace json/yaml file in-place:
```
% python -m pyaml -r file-with-json.data
```

Easier “debug printf” for more complex data (all funcs below are aliases to same thing):

```
pyaml.p(stuff)
pyaml.pprint(my_data)
pyaml.pprint('----- HOW DOES THAT BREAKS!?!?', input_data, some_var, more_stuff)
pyaml.print(data, file=sys.stderr) # needs "from __future__ import print_function"
```

Force all string values to a certain style (see info on these in PyYAML docs):
```
pyaml.dump(many_weird_strings, string_val_style='|')
pyaml.dump(multiline_words, string_val_style='>')
pyaml.dump(no_want_quotes, string_val_style='plain')
```

Using pyaml.add_representer() (note *p*yaml) as suggested in this SO thread (or github-issue-7) should also work.

Control indent and width of the results:
```
pyaml.dump(wide_and_deep, indent=4, width=120)
```

These are actually keywords for PyYAML Emitter (passed to it from Dumper), see more info on these in PyYAML docs.

Dump multiple yaml documents into a file: pyaml.dump_all([data1, data2, data3], dst_file)

explicit_start=True is implied, unless explicit_start=False is passed.

Installation

It’s a regular Python 3.8+ module/package, published on PyPI (as pyaml).

Module uses PyYAML for processing of the actual YAML files and should pull it in as a dependency.

Dependency on the unidecode module is optional, and should only be necessary when the force_embed=False keyword is used and the serialized data contains same-id objects or recursion.

Using pip is how you generally install it, usually coupled with venv usage (which will also provide “pip” tool itself):

% pip install pyaml

Current-git version can be installed like this:

% pip install git+https://github.com/mk-fg/pretty-yaml

pip will default to installing into currently-active venv, then user’s home directory (under ~/.local/lib/python...), and maybe system-wide when running as root (only useful in specialized environments like docker containers).

There are many other python packaging tools - pipenv, poetry, pdm, etc - use whatever is most suitable for specific project/environment.

More general info on python packaging can be found at packaging.python.org.

When changing code, unit tests can be run with python -m unittest discover from the local repository checkout.

Release history

24.9.0

Sep 27, 2024

24.7.0

Jul 18, 2024

24.4.0

Apr 17, 2024

23.12.0

Dec 25, 2023

23.9.7

Sep 29, 2023

23.9.6

Sep 13, 2023

23.9.5

Sep 9, 2023

23.9.4

Sep 9, 2023

23.9.3

Sep 6, 2023

23.9.2

Sep 5, 2023

23.9.1

Sep 3, 2023

This version

23.9.0

Sep 3, 2023

23.7.0

Jul 6, 2023

23.5.9

May 11, 2023

23.5.8

May 6, 2023

23.5.7

May 5, 2023

23.5.6

May 5, 2023

23.5.5

May 5, 2023

21.10.1

Oct 9, 2021

21.8.3

Aug 8, 2021

21.8.2

Aug 8, 2021

20.4.0

Apr 2, 2020

20.3.1

Mar 9, 2020

20.3.0

Mar 9, 2020

19.12.0

Dec 7, 2019

19.4.1

Apr 17, 2019

19.4.0

Apr 17, 2019

18.11.0

Nov 19, 2018

17.12.1

Dec 23, 2017

17.12.0

Dec 23, 2017

17.10.0

Oct 8, 2017

17.8.0

Aug 17, 2017

17.7.2

Jul 28, 2017

16.12.2

Dec 11, 2016

16.12.1

Dec 8, 2016

16.12.0

Dec 8, 2016

16.11.4

Nov 12, 2016

16.11.3

Nov 12, 2016

16.11.0

Nov 2, 2016

16.9.0

Sep 10, 2016

15.8.2

Aug 30, 2015

15.8.0

Aug 30, 2015

15.6.3

Jun 29, 2015

15.6.2

Jun 29, 2015

15.5.7

May 19, 2015

15.5.6

May 19, 2015

15.5.5

May 19, 2015

15.5.4

May 19, 2015

15.5.3

May 19, 2015

15.5.2

May 4, 2015

15.5.1

May 4, 2015

15.5.0

May 2, 2015

15.4.0

Apr 27, 2015

15.03.1

Mar 20, 2015

15.03.0

Mar 20, 2015

15.02.1

Feb 15, 2015

15.02.0

Feb 15, 2015

14.12.10

Dec 3, 2014

14.11.3

Nov 10, 2014

14.11.2

Nov 10, 2014

14.05.7

May 28, 2014

14.05.6

May 20, 2014

14.05.5

May 20, 2014

14.05.3

May 20, 2014

14.05.2

May 6, 2014

14.04.3

Apr 8, 2014

14.04.2

Apr 8, 2014

13.12.0

Dec 20, 2013

13.07.1

Jul 29, 2013

13.07.0

Jul 3, 2013

13.05.2

May 22, 2013

13.01.0

Jan 17, 2013

12.12.5

Dec 14, 2012

12.12.4

Dec 14, 2012

12.12.3

Dec 14, 2012

Download files

Source Distribution

pyaml-23.9.0.tar.gz (21.1 kB)

Uploaded Sep 3, 2023 Source

Built Distribution

pyaml-23.9.0-py3-none-any.whl (18.0 kB)

Uploaded Sep 3, 2023 Python 3

Hashes for pyaml-23.9.0.tar.gz
Algorithm	Hash digest
SHA256	`c6bf6d278ea9f467ac0b1a7203f48975c07f77553b25f506600bd2f5741e6c48`
MD5	`01aaca2810c6897e49f56fcf9cd93378`
BLAKE2b-256	`738a492c17b61e62732beb3ad564d02f27740270eed116ece950dc1fc66389d2`

Hashes for pyaml-23.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e1d88be0b8198fcd1cfc528f0324b18d8a98a99fbc3288a103a7c3b5bfc6a925`
MD5	`496669bb4840f676dbe310b9849ddcf0`
BLAKE2b-256	`dd063d0a6de9de92b2f024214b7c74452c9d436e6fe764845d7239c8544135dd`
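
A downloaded file can be checked against the SHA256 digests listed above using only the standard library. A minimal sketch: `sha256_of` is a made-up helper name, and the path in the usage comment assumes the wheel was saved to the current directory.

```python
import hashlib

def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, 'rb') as src:
        for chunk in iter(lambda: src.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()

# Digest for pyaml-23.9.0-py3-none-any.whl, copied from the table above
EXPECTED_WHL = 'e1d88be0b8198fcd1cfc528f0324b18d8a98a99fbc3288a103a7c3b5bfc6a925'

# Usage, once the file has been downloaded:
# assert sha256_of('pyaml-23.9.0-py3-none-any.whl') == EXPECTED_WHL
```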