Skip to main content

A package to provide pathlib like access to zip & tar archives.

Project description

archive-path

Build Status codecov.io PyPI version Conda Version

A package to provide pathlib like access to zip & tar archives.

Installation

$ pip install archive-path

Usage

For reading zip (ZipPath) or tar (TarPath) files:

from archive_path import TarPath, ZipPath

path = TarPath("path/to/file.tar.gz", mode="r:gz")

sub_path = path / "folder" / "file.txt"
assert sub_path.filepath == "path/to/file.tar.gz"
assert sub_path.at == "folder/file.txt"
assert sub_path.exists() and sub_path.is_file()
assert sub_path.parent.is_dir()
content = sub_path.read_text()

for sub_path in path.iterdir():
    print(sub_path)

For writing files, you should use within a context manager, or directly call the close method:

with TarPath("path/to/file.tar.gz", mode="w:gz") as path:

    (path / "new_file.txt").write_text("hallo world")
    # there are also some features equivalent to shutil
    (path / "other_file.txt").putfile("path/to/external_file.txt")
    (path / "other_folder").puttree("path/to/external_folder", pattern="**/*")

Note that archive formats do not allow to overwrite existing files (they will raise a FileExistsError).

For performant access to single files:

from archive_path import read_file_in_tar, read_file_in_zip

content = read_file_in_tar("path/to/file.tar.gz", "file.txt", encoding="utf8")

These methods allow for faster access to files (using less RAM) in archives containing 1000's of files. This is because, the archive's file index is only read until the path is found (discarding non-matches), rather than the standard tarfile/zipfile approach that is to read the entire index into memory first.

Windows compatibility

Paths within the archives are always read and written as being / delimited. This means that the package works on Windows, but will not be compatible with archives written outside this package with \\ path delimiters.

Development

See CONTRIBUTING.md for details on how to contribute to this package.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

archive-path-0.3.1.tar.gz (18.1 kB view details)

Uploaded Source

Built Distribution

archive_path-0.3.1-py3-none-any.whl (17.9 kB view details)

Uploaded Python 3

File details

Details for the file archive-path-0.3.1.tar.gz.

File metadata

  • Download URL: archive-path-0.3.1.tar.gz
  • Upload date:
  • Size: 18.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.26.0

File hashes

Hashes for archive-path-0.3.1.tar.gz
Algorithm Hash digest
SHA256 be1891ffb0f03dc6efd4c812f7635251a45ccae647d684adacdb813afa6d5ae9
MD5 9b89b0b837e4628a549fe62c8d192f86
BLAKE2b-256 95454740b97d92d5957e8248d51a76d171668f4b18ac9faa04f5ff34a2351513

See more details on using hashes here.

Provenance

File details

Details for the file archive_path-0.3.1-py3-none-any.whl.

File metadata

File hashes

Hashes for archive_path-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b7f57714eccb9bd89c6b8c045c6676324fe803ec488a3f5d1324646109040f4d
MD5 536c133848b8c14a223ee59e6c252c07
BLAKE2b-256 8b830752c65e866b38055d1453bcae8668cba0e30ed624da4c3246b3340bd3a5

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page