Skip to main content

Read and download STAC assets across platforms and providers

Project description

stac-asset

GitHub Workflow Status Read the Docs PyPI

Download STAC Assets using a variety of authentication schemes.

Installation

pip install stac-asset

To use the command-line interface (CLI):

pip install 'stac-asset[cli]'

Usage

API

Here's how to download a STAC Item and all of its assets to a local directory using the top-level function. The correct client will be guessed from the assets' href. Each asset's href will be updated to point to the local file.

import stac_asset

href = "https://raw.githubusercontent.com/radiantearth/stac-spec/master/examples/simple-item.json"
await stac_asset.download_item_from_href(href, ".")

CLI

To download an item using the command line:

stac-asset download https://raw.githubusercontent.com/radiantearth/stac-spec/master/examples/simple-item.json

To download all assets from the results of a pystac-client search, and save the item collection to a file named item-collection.json:

stac-client search https://planetarycomputer.microsoft.com/api/stac/v1 -c landsat-c2-l2 --max-items 1 | \
    stac-asset download > item-collection.json

If you'd like to only download certain assets, e.g. a preview image, you can use the include -i flag:

stac-client search https://planetarycomputer.microsoft.com/api/stac/v1 -c landsat-c2-l2 --max-items 1 | \
    stac-asset download -i rendered_preview -q

If you do a lot of downloads, you may want an alias:

alias stac-download="stac-asset download"

See the documentation for more examples and complete API documentation.

Clients

This library comes with several clients, each tailored for a specific data provider model and authentication scheme. Some clients require some setup before use; they are called out in this table, and the details are provided below.

Name Description Notes
HttpClient Simple HTTP client without any authentication
S3Client Simple S3 client Use requester_pays=True in the client initializer to enable access to requester pays buckets, e.g. USGS landsat's public AWS archive
FilesystemClient Moves files from place to place on a local filesystem Mostly used for testing
PlanetaryComputerClient Signs urls with the Planetary Computer Authentication API No additional setup required, works out of the box
UsgsErosClient Uses a token-based authentication workflow to download data, e.g. landsat, from USGS EROS Requires creation of a personal access token, see section below
EarthdataClient Uses a token-based authentication to download data, from some Earthdata providers, e.g. DAACs Requires creation of a personal access token, see section below

S3Client

To use the requester_pays option, you need to configure your AWS credentials. See the AWS documentation for instructions.

USGS EROS

The USGS EROS system, which hosts landsat data, requires a personal access token to download assets. Here's how to create and use your personal access token with stac-asset:

  1. Create a new personal access token
  2. Set two environment variables:
    • USGS_EROS_USERNAME to your username (found in the top right of the web UI)
    • USGS_EROS_PAT to your personal access token
  3. Use UsgsErosClient.default() to create a new client.

You can also provide your username and password to the UsgsErosClient.login method.

Earthdata

You'll need a personal access token.

  1. Create a new personal access token by going to https://urs.earthdata.nasa.gov/profile and then clicking "Generate Token" (you'll need to log in).
  2. Set an enviornment variable named EARTHDATA_PAT to your token.
  3. Use EarthdataClient.default() to create a new client.

You can also provide your token directly to EarthdataClient.login().

Design goals

As determined during a meeting at the Element 84 offices (formerly Azavea offices) on 2023-05-24.

  • async-first
  • Allow range requests
  • Download functionality
  • Update STAC items to point to new hrefs on download
  • Allow byte-stream access
  • Protocols:
    • http
    • s3
      • requestor pays
      • custom endpoint
    • custom authentication
      • Planetary Computer
      • USGS EROS
      • NASA
  • Copy directly from source to destination ("skip local")
  • Add new assets to an item
  • Update an existing asset
  • Delete assets
  • Templated paths on download
  • (possible) Support the file extension's local path
  • Checksum validation and creation
  • CLI

Versioning

This project does its best to adhere to semantic versioning. Any module, class, constant, or function that does not begin with a _ is considered part of our public API for versioning purposes. Our command-line interface (CLI) is NOT considered part of our public API, and may change in breaking ways at any time. If you need stability promises, use our API.

Contributing

Use Github issues to report bugs and request new features. Use Github pull requests to fix bugs and propose new features.

Developing

Clone, install with the dev dependencies, and install pre-commit:

git clone git@github.com:gadomski/stac-asset.git
cd stac-asset
pip install '.[dev]'
pre-commit install

Testing

All network-touching tests are disabled by default, because we can't use pytest-vcr (https://github.com/kevin1024/vcrpy/issues/597), and repeatedly hitting the network during testing and CI is bad behavior. To enable network-touching tests:

pytest --network-access

Some tests are client-specific and need your environment to be configured correctly. See each client's documentation for instructions on setting up your environment for each client. If your environment is not configured for a certain client, that client's tests are skipped.

Docs

Install the documentation dependencies:

pip install -e '.[docs]'

Then, build the docs:

make -C docs html && open docs/_build/html/index.html

It can be handy to use sphinx-autobuild if you're doing a lot of doc work:

pip install sphinx-autobuild
sphinx-autobuild docs docs/_build/html

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stac-asset-0.0.4.tar.gz (24.0 kB view details)

Uploaded Source

Built Distribution

stac_asset-0.0.4-py3-none-any.whl (24.4 kB view details)

Uploaded Python 3

File details

Details for the file stac-asset-0.0.4.tar.gz.

File metadata

  • Download URL: stac-asset-0.0.4.tar.gz
  • Upload date:
  • Size: 24.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for stac-asset-0.0.4.tar.gz
Algorithm Hash digest
SHA256 98da8424e93709e3f8c1089c73eb7703f6d1cdaee71157a3fd739d25a306c98f
MD5 6717ea2aade7b0a5c4475bae8218be1f
BLAKE2b-256 bcaa667854a49af6723cfffc0da5ca2d373769ba55cf73e9a7d2a0687bbc21c1

See more details on using hashes here.

File details

Details for the file stac_asset-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: stac_asset-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 24.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.4

File hashes

Hashes for stac_asset-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 6ac688ba2e89594fef1038833ab30fddaaa7b74bed57f28a2126e9ac87535a68
MD5 4d9e85e769a7e5adb79eba1cf5561a87
BLAKE2b-256 c75a3ec51a95ca37c4344fa7a496cfddfd647fc513d7cbd82f1aef575c687a88

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page