tabulator

Consistent interface for stream reading and writing tabular data (csv/xls/json/etc)

Project description

# tabulator-py

[![Travis](https://img.shields.io/travis/frictionlessdata/tabulator-py/master.svg)](https://travis-ci.org/frictionlessdata/tabulator-py)
[![Coveralls](http://img.shields.io/coveralls/frictionlessdata/tabulator-py.svg?branch=master)](https://coveralls.io/r/frictionlessdata/tabulator-py?branch=master)
[![PyPi](https://img.shields.io/pypi/v/tabulator.svg)](https://pypi-hypernode.com/pypi/tabulator)
[![SemVer](https://img.shields.io/badge/versions-SemVer-brightgreen.svg)](http://semver.org/)
[![Gitter](https://img.shields.io/gitter/room/frictionlessdata/chat.svg)](https://gitter.im/frictionlessdata/chat)

Consistent interface for stream reading and writing tabular data (csv/xls/json/etc).

> Release `v0.10` changes the `exceptions` module in a way that is NOT backward-compatible.

## Features

- supports various formats: csv/tsv/xls/xlsx/json/ndjson/ods/gsheet/native/etc
- reads data from variables, filesystem or Internet
- streams data instead of using a lot of memory
- processes data via simple user processors
- saves data using the same interface

## Getting Started

### Installation

To get started:

```
$ pip install tabulator
```

### Example

Open tabular stream from csv source:

```python
from tabulator import Stream

with Stream('path.csv', headers=1) as stream:
    print(stream.headers)  # prints the headers taken from row 1
    for row in stream:
        print(row)  # prints each row as a list of values
```

### Stream

`Stream` takes the `source` argument in the following form:

```
<scheme>://path/to/file.<format>
```
It then uses the corresponding `Loader` and `Parser` to open the source and start iterating over the tabular stream. The `scheme` and `format` can also be passed explicitly as constructor arguments. To force Tabulator to open the table with a specific encoding, pass the `encoding` argument.
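
To make the `<scheme>://path/to/file.<format>` convention concrete, here is a minimal sketch of how a scheme and a format could be inferred from a source string. The helper `detect_scheme_and_format` and its `file` default for plain paths are assumptions made for illustration (Python 3, standard library only); this is not Tabulator's actual detection code.

```python
import os
from urllib.parse import urlparse


def detect_scheme_and_format(source):
    """Guess (scheme, format) from a source string.

    Hypothetical helper for illustration only; not part of tabulator.
    """
    parsed = urlparse(source)
    # Assumption: a source without an explicit scheme is a local file.
    scheme = parsed.scheme.lower() or 'file'
    # The format is the file extension without its leading dot, if any.
    extension = os.path.splitext(parsed.path)[1]
    file_format = extension[1:].lower() or None
    return scheme, file_format


print(detect_scheme_and_format('http://example.com/source.xls'))  # ('http', 'xls')
print(detect_scheme_and_format('path.csv'))  # ('file', 'csv')
```

When a guess like this would be wrong (for example, a URL without a file extension), passing `scheme` and `format` explicitly to `Stream` avoids relying on detection.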

The example above uses a context manager to call `stream.open()` on enter and `stream.close()` on exit. While the stream is open:
- the stream can be iterated like a file-like object, returning row by row
- the stream can be iterated manually with the `iter(keyed/extended)` method
- the stream can be read into memory with the `read(keyed/extended)` method, optionally capped by a row count `limit`
- headers can be accessed via the `headers` property
- a sample of rows can be accessed via the `sample` property
- the stream pointer can be set back to the start via the `reset` method
- the stream can be saved to the filesystem using the `save` method
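
The row shapes mentioned in the list (plain, keyed, extended) can be illustrated without opening a real source. The sketch below is plain Python, not Tabulator code: the functions `to_extended_rows` and `to_keyed_rows` are hypothetical, and numbering rows from 1 is an assumption made only for this example.

```python
def to_extended_rows(headers, rows, first_number=1):
    """Wrap plain rows as (number, headers, row) tuples.

    Assumption: rows are numbered consecutively from `first_number`.
    """
    for number, row in enumerate(rows, start=first_number):
        yield (number, headers, row)


def to_keyed_rows(extended_rows):
    """Turn extended rows into dicts mapping each header to its value."""
    for number, headers, row in extended_rows:
        yield dict(zip(headers, row))


headers = ['id', 'name']
rows = [[1, 'english'], [2, '中国人']]

extended = list(to_extended_rows(headers, rows))
keyed = list(to_keyed_rows(extended))

print(extended[0])  # (1, ['id', 'name'], [1, 'english'])
print(keyed[0])
```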

A more expanded example is presented below:

```python
from tabulator import Stream

def skip_even_rows(extended_rows):
    # Custom processor: keep only rows with an odd row number
    for number, headers, row in extended_rows:
        if number % 2:
            yield (number, headers, row)

stream = Stream('http://example.com/source.xls',
    headers=1, encoding='utf-8', sample_size=1000,
    post_parse=[skip_even_rows], sheet=1)
stream.open()
print(stream.sample)  # will print sample
print(stream.headers)  # will print headers list
print(stream.read(limit=10))  # will print 10 rows
stream.reset()
for keyed_row in stream.iter(keyed=True):
    print(keyed_row)  # will print row dict
for extended_row in stream.iter(extended=True):
    print(extended_row)  # will print (number, headers, row)
stream.reset()
stream.save('target.csv')
stream.close()
```
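
Because a processor such as `skip_even_rows` is just a generator over `(number, headers, row)` tuples, it can be tried on in-memory data before being passed to `post_parse`. The sketch below chains two processors by hand. The `apply_processors` helper and the `strip_strings` processor are hypothetical names introduced for this illustration, and the chaining order shown is an assumption rather than a description of Tabulator's internals.

```python
def skip_even_rows(extended_rows):
    # Keep only rows with an odd row number.
    for number, headers, row in extended_rows:
        if number % 2:
            yield (number, headers, row)


def strip_strings(extended_rows):
    # Hypothetical processor: trim whitespace around string values.
    for number, headers, row in extended_rows:
        cleaned = [v.strip() if isinstance(v, str) else v for v in row]
        yield (number, headers, cleaned)


def apply_processors(extended_rows, processors):
    # Assumption: each processor wraps the output of the previous one.
    for processor in processors:
        extended_rows = processor(extended_rows)
    return extended_rows


source = [
    (1, ['id', 'name'], ['1', ' english ']),
    (2, ['id', 'name'], ['2', ' 中国人 ']),
    (3, ['id', 'name'], ['3', ' deutsch ']),
]

result = list(apply_processors(iter(source), [skip_even_rows, strip_strings]))
for extended_row in result:
    print(extended_row)
```

Since every step is a generator, rows flow through the chain one at a time, which fits the streaming approach described in the feature list.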

For the full list of options, see https://github.com/frictionlessdata/tabulator-py/blob/master/tabulator/stream.py#L17

### CLI

> This is a provisional API excluded from SemVer. If you use it as part of another program, please pin a concrete `tabulator` version in your requirements file.

The library ships with a simple CLI to read tabular data:

```bash
$ tabulator
Usage: cli.py [OPTIONS] SOURCE

Options:
  --headers INTEGER
  --scheme TEXT
  --format TEXT
  --encoding TEXT
  --limit INTEGER
  --help             Show this message and exit.
```

Shell usage example:

```bash
$ tabulator data/table.csv
id, name
1, english
2, 中国人
```

## API Reference

### Snapshot

```
Stream(source,
       headers=None,
       scheme=None,
       format=None,
       encoding=None,
       sample_size=None,
       post_parse=None,
       **options)
    closed/open/close/reset
    headers -> list
    sample -> rows
    iter(keyed/extended=False) -> (generator) (keyed/extended)row[]
    read(keyed/extended=False, limit=None) -> (keyed/extended)row[]
    save(target, format=None, encoding=None, **options)
exceptions
~cli
```

### Detailed

- [Docstrings](https://github.com/frictionlessdata/tabulator-py/tree/master/tabulator)
- [Changelog](https://github.com/frictionlessdata/tabulator-py/commits/master)

## Contributing

Please read the contribution guideline:

[How to Contribute](CONTRIBUTING.md)

Thanks!

Project details

These details have been verified by PyPI

Maintainers

brew callmealien okfn pwalsh

Download files

Source Distribution

tabulator-0.12.1.tar.gz (16.7 kB)

Uploaded Dec 6, 2016 Source

Built Distribution

tabulator-0.12.1-py2.py3-none-any.whl (34.4 kB)

Uploaded Dec 6, 2016 Python 2 Python 3

File details

Details for the file tabulator-0.12.1.tar.gz.

File metadata

Download URL: tabulator-0.12.1.tar.gz
Upload date: Dec 6, 2016
Size: 16.7 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for tabulator-0.12.1.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `d141b588b8e941cf46f44198b2a6b4745f039fb41d3db51089af9caf45af21f6` |
| MD5 | `87c425f11ab1002a79dc974cdf019dad` |
| BLAKE2b-256 | `5dd718c1524a023d6849de0306557706873e68243ac95d526200df192dea9246` |

File details

Details for the file tabulator-0.12.1-py2.py3-none-any.whl.

File metadata

Download URL: tabulator-0.12.1-py2.py3-none-any.whl
Upload date: Dec 6, 2016
Size: 34.4 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No

File hashes

Hashes for tabulator-0.12.1-py2.py3-none-any.whl

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `c4a31549713fc11e5e08c698cf348bb963a4f3641417d5b738f4675f853e9393` |
| MD5 | `64b021b816e5a2219b693cfb7ecd1dcc` |
| BLAKE2b-256 | `6f3898d7a6710393bfc809a1ee66a1780c1a90f1f507c4bc67a06db4c87fb017` |