conda-package-streaming

An efficient library to read from new and old format .conda and .tar.bz2 conda packages.

Download conda metadata from packages without transferring the entire file. Get metadata from local .tar.bz2 packages without reading the entire file.

Uses an enhanced version of pip's lazy_wheel to fetch a file out of a .conda package with no more than 3 range requests, and usually 2.
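To illustrate why so few range requests are enough, here is a minimal standard-library sketch, not the library's implementation. It assumes a .conda file is a zip archive whose inner archives are stored uncompressed; the member names, the `CountingReader` class and both helper functions are hypothetical stand-ins. Reading one small member from a seekable file touches only the zip directory at the end of the file plus that member's bytes.

```python
import io
import zipfile


class CountingReader(io.BytesIO):
    """Seekable in-memory file that counts the bytes actually read.

    Stands in for a remote file whose reads would become HTTP range requests.
    """

    def __init__(self, data):
        super().__init__(data)
        self.bytes_read = 0

    def read(self, size=-1):
        chunk = super().read(size)
        self.bytes_read += len(chunk)
        return chunk


def build_demo_conda():
    """Build a zip shaped like a .conda file (member names are illustrative)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_STORED) as zf:
        zf.writestr("pkg-demo.tar.zst", bytes(1_000_000))  # large payload
        zf.writestr("info-demo.tar.zst", b"small metadata archive")
    return buf.getvalue()


def read_info_component(fileobj):
    """Return the bytes of the first member whose name starts with 'info-'."""
    with zipfile.ZipFile(fileobj) as zf:
        name = next(n for n in zf.namelist() if n.startswith("info-"))
        return zf.read(name)


data = build_demo_conda()
reader = CountingReader(data)
info = read_info_component(reader)
print(f"read {reader.bytes_read} of {len(data)} bytes")
```

Over HTTP, each cluster of reads (the end of the file, then the wanted member) could be served by one range request, which is consistent with the "usually 2" figure above.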

Uses tar = tarfile.open(fileobj=...) to stream remote .tar.bz2 packages. Closes the HTTP request once the desired files have been seen.
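The streaming approach can be sketched with the standard library alone. This is an illustration under stated assumptions, not the library's code: `CountingStream`, `build_package` and `first_member` are hypothetical names, and an in-memory buffer stands in for the HTTP response body. Opening the archive in tarfile's stream mode (`"r|bz2"`) only needs a `read()` method, and returning from the loop early leaves the rest of the stream unread.

```python
import io
import json
import os
import tarfile


class CountingStream:
    """Read-only, non-seekable stream that counts bytes consumed."""

    def __init__(self, data):
        self._buf = io.BytesIO(data)
        self.bytes_read = 0

    def read(self, size=-1):
        chunk = self._buf.read(size)
        self.bytes_read += len(chunk)
        return chunk


def build_package():
    """Build a demo .tar.bz2 with a small metadata file and a large payload."""
    members = [
        ("info/index.json", json.dumps({"name": "demo"}).encode()),
        ("lib/big.bin", os.urandom(1_000_000)),  # incompressible filler
    ]
    buf = io.BytesIO()
    # compresslevel=1 keeps bz2 blocks small so early members decode sooner
    with tarfile.open(fileobj=buf, mode="w:bz2", compresslevel=1) as tar:
        for name, payload in members:
            info = tarfile.TarInfo(name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    return buf.getvalue()


def first_member(stream, wanted):
    """Stream the archive and return the wanted member's bytes, or None."""
    with tarfile.open(fileobj=stream, mode="r|bz2") as tar:
        for member in tar:
            if member.name == wanted:
                return tar.extractfile(member).read()
    return None


data = build_package()
stream = CountingStream(data)
index_json = json.loads(first_member(stream, "info/index.json"))
print(index_json, f"consumed {stream.bytes_read} of {len(data)} bytes")
```

How much is saved depends on where the wanted file sits in the archive and on the bz2 block size, since a block must be read completely before any of its contents can be decoded.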

Quickstart

The basic API yields (tarfile, member) tuples from conda files, because the tarfile object is needed to extract each member. Note that for the .tar.bz2 format, stream_conda_info and stream_conda_component yield all members, not just those under info/, while for the .conda format they yield only members of the requested inner archive. In both cases the caller decides when to stop reading.

From a url,

import json

from conda_package_streaming.url import stream_conda_info

# url = (ends with .conda or .tar.bz2)
for tar, member in stream_conda_info(url):
    if member.name == "info/index.json":
        index_json = json.load(tar.extractfile(member))
        break

From s3,

import json

import boto3

from conda_package_streaming.s3 import stream_conda_info

client = boto3.client("s3")
# bucket = (name of the s3 bucket)
# key = (ends with .conda or .tar.bz2)
for tar, member in stream_conda_info(client, bucket, key):
    if member.name == "info/index.json":
        index_json = json.load(tar.extractfile(member))
        break

From a filename,

import json

from conda_package_streaming import package_streaming

# filename = (ends with .conda or .tar.bz2)
for tar, member in package_streaming.stream_conda_info(filename):
    if member.name == "info/index.json":
        index_json = json.load(tar.extractfile(member))
        break

From a file-like object,

import json
from contextlib import closing

from conda_package_streaming.package_streaming import stream_conda_component
from conda_package_streaming.url import conda_reader_for_url

# url = (ends with .conda or .tar.bz2)
filename, conda = conda_reader_for_url(url)

# file object must be seekable for `.conda` format, but merely readable for `.tar.bz2`
with closing(conda):
    for tar, member in stream_conda_component(filename, conda, component="info"):
        if member.name == "info/index.json":
            index_json = json.load(tar.extractfile(member))
            break
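Because every variant above yields (tarfile, member) tuples, the stop condition can be factored into a small helper that collects several metadata files in one pass. The sketch below is an assumption-laden illustration: `read_members` is a hypothetical helper that is not part of the library, and `demo_pairs` is a stand-in generator over an in-memory tar so the example runs without a real package.

```python
import io
import json
import tarfile


def read_members(pairs, wanted):
    """Collect the bytes of each wanted member, stopping once all are found."""
    remaining = set(wanted)
    found = {}
    for tar, member in pairs:
        if member.name in remaining:
            # Extract immediately: a streamed tar can only be read in order
            found[member.name] = tar.extractfile(member).read()
            remaining.discard(member.name)
            if not remaining:
                break
    return found


def build_demo_tar():
    """Build an uncompressed tar with two metadata files and one payload."""
    members = [
        ("info/index.json", json.dumps({"name": "demo"}).encode()),
        ("info/about.json", json.dumps({"license": "MIT"}).encode()),
        ("lib/data.bin", bytes(4096)),
    ]
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, payload in members:
            info = tarfile.TarInfo(name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    return buf.getvalue()


def demo_pairs(data):
    """Stand-in for a streaming function: yields (tar, member) tuples."""
    with tarfile.open(fileobj=io.BytesIO(data), mode="r|") as tar:
        for member in tar:
            yield tar, member


found = read_members(
    demo_pairs(build_demo_tar()), ["info/index.json", "info/about.json"]
)
print(sorted(found))
```

With the real library, the first argument would presumably be the iterator returned by one of the stream_conda_info calls shown above.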

If you need the entire package, download it first and use the file-based APIs. The URL-based APIs are more efficient if you only need to access package metadata.

Package goals

  • Extract conda packages (both formats)

  • Easy to install from pypi or conda

  • Do the least amount of I/O possible (no temporary files, transfer partial packages)

  • Open files from the network / standard HTTP / s3

  • Continue using conda-package-handling to create .conda packages

Generating documentation

The documentation is written in Markdown and uses the furo theme. It requires a recent version of mdit-py-plugins.

pip install "conda-package-streaming[docs]"

One time: sphinx-apidoc -o docs .
