Skip to main content

STAR file reading and writing in python.

Project description

starfile

Build Status PyPI version PyPI pyversions DOI

starfile is a Python implementation of the STAR file format designed principally for compatibility with RELION format STAR files.

It allows STAR files to be created and opened easily using a very simple API, exposing data blocks as pandas DataFrame objects.

This library aims to allow users and developers to read and write STAR files in Python as easily as possible as well as to encourage further analysis of data within the scientific Python (SciPy) ecosystem.

You can use it interactively to inspect/explore files or in scripts and larger software packages to provide basic STAR file I/O functions.

The STAR file: a new format for electronic dataframes transfer and archiving
J. Chem. Inf. Comput. Sci. 1991, 31, 2, 326–333
Publication Date: May 1, 1991
https://doi.org/10.1021/ci00002a020

Features

  • Easy to install and use
  • Simple API for reading of STAR files as pandas DataFrame objects
  • Simple API for writing of STAR files from pandas DataFrame objects

Installation

Installation is available directly from the Python package index

pip install starfile

Currently python >= 3.8 is supported. You can check your python version with

python -V

We recommend installing into a virtual environment for use in your projects.

Usage

Reading STAR files

To open a STAR file

>>> import starfile
>>> df = starfile.read('tests/dataframes/one_loop.star')
>>> df
      rlnCoordinateX  rlnCoordinateY  ...  rlnAngleTilt rlnAnglePsi
0           1572.444        1084.500  ...             0           0
1           1507.500        1104.357  ...             0           0
2           1512.432         973.500  ...             0           0
3           1560.385        1063.500  ...             0           0
4           1537.500        1060.500  ...             0           0
              ...             ...  ...           ...         ...
1360        1078.500         796.500  ...             0           0
1361        1075.500         784.500  ...             0           0
1362        1080.531         796.500  ...             0           0
1363        1045.992         737.411  ...             0           0
1364        1053.530         745.500  ...             0           0

[1365 rows x 12 columns]
  • Each block will return either a DataFrame for a loop_ block, or a dict for a simple block
  • A STAR file with a single block simply returns that DataFrame or dict, while a STAR file containing multiple blocks will return a dict of entries, with keys determined by the data_* titles
  • If you would like to always return a dict of entries even for single-block STAR files, you can use the always_dict=True keyword argument

Writing STAR files

DataFrame objects (or dicts or lists of dataframes) can be written to STAR files using starfile.write

>>> starfile.write(df, 'tests/dataframes/cars.star')

Produces a STAR file which looks like

# Created by the starfile python package (version 0.1) on 18/06/2020 13:26:32

data_cars

loop_
_Brand #1
_Price #2
Honda_Civic	22000
Toyota_Corolla	25000
Ford_Focus	27000
Audi_A4	        35000
  • floating point format can be specified by the float_format keyword argument (default %.6f)
  • data block headers are of format data_<key> where key is the dictionary key if a dict is passed, df.name if a DataFrame or list of DataFrames is passed

Interactive usage

You can also use starfile as an interactive command line tool for quick and dirty data analysis. This functionality can be installed using pip:

pip install starfile[cli]

Note that with certain shells (e.g. zsh) you may need to use pip install 'starfile[cli]' to avoid unwanted shell expansion of the square bracket syntax.

You can then call starfile <my_file>.star to be dropped into an interactive Python console with access to your data.

starfile tests/data/loop_block.star
Python 3.9.6 (default, Aug 18 2021, 12:38:10) 
Type 'copyright', 'credits' or 'license' for more information
IPython 7.27.0 -- An enhanced Interactive Python. Type '?' for help.

=== Starfile ===
    - access your data with `star`
    - write it out with `write(...)`

Both matplotlib and seaborn are available for plotting your data in this interactive shell.

License

The project is released under the BSD 3-Clause License

Testing

The project is tested using pytest. To run tests, simply run pytest in the tests directory

Known Issues

  • Cannot handle more than one loop in a data block as found in mmCIF files, please use GEMMI in these cases

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

starfile-0.5.3.tar.gz (30.2 kB view details)

Uploaded Source

Built Distribution

starfile-0.5.3-py3-none-any.whl (30.9 kB view details)

Uploaded Python 3

File details

Details for the file starfile-0.5.3.tar.gz.

File metadata

  • Download URL: starfile-0.5.3.tar.gz
  • Upload date:
  • Size: 30.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.12.1

File hashes

Hashes for starfile-0.5.3.tar.gz
Algorithm Hash digest
SHA256 923a05e93612546f3071d8afa2b1673a143836ed519b73ba5dad3c323b9c761b
MD5 597e70c82ecaafd8a5d30ee902d4d862
BLAKE2b-256 452bfe2099646dbca4198d76011ca5278fb7cfbee4f8c495d8a1179a25c1de3c

See more details on using hashes here.

File details

Details for the file starfile-0.5.3-py3-none-any.whl.

File metadata

  • Download URL: starfile-0.5.3-py3-none-any.whl
  • Upload date:
  • Size: 30.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.12.1

File hashes

Hashes for starfile-0.5.3-py3-none-any.whl
Algorithm Hash digest
SHA256 e9dd8fa68e95e5523876ef5f46f32011d8e793bbd5511c079dcda32e3df86f7e
MD5 ac8a01846f9b0df7079e3e9c7a38bea8
BLAKE2b-256 dd4a5cf0af799a955ac677172b4bf83878cf9e9d84effe71f95ae167896a2a5e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page