Skip to main content

Concurrent appendable key-value storage

Project description

Minimal key-value byte storage with appendable values

Partd stores data. Partd arranges data in key-value pairs where the values are just raw bytes. We can append data onto the end of existing values. You can get the entire value associated to a list of keys.

API

Partd supports four operations

  1. create: Create a partd dataset:

    >>> create('/path/to/partd-file/')
  2. put: Append key-byte pairs to dataset:

    >>> put('/path/to/partd-file/', {'x': b'Hello ', 'y': b'123'})
    >>> put('/path/to/partd-file/', {'x': b'world!', 'y': b'456'})
  3. get: Get all bytes associated to a set of keys:

    >>> get('/path/to/partd-file/', ['y', 'x'])
    [b'123456', b'Hello world!']
  4. destroy: Destroy partd dataset:

    >>> destroy('/path/to/partd-file')

That’s it.

There is no in-memory state.

Implementations

The reference implementation uses file-based locks. This works surprisingly well as long as you don’t do many small writes

If you do many small writes then you probably want to cache in memory; this is hard to do in parallel while also maintaining consistency. For this we have a centralized server (see partd.zmq) that caches data in memory and writes only large chunks to disk when necessary.

Encodings

partd.pickle and partd.numpy handle encoding/decoding pickled lists and numpy arrays with the same partd get/put interface. They compose well with the zmq solution.

Future plans

Things that will happen if our plan is perfect (it isn’t!)

  1. These functions can be implemented in other, non-file-based ways, perhaps with a distributed system that peer-to-peer communication and stuff.

  2. We’ll build interfaces around partd that handle dataframes, numpy arrays, pickled lists, etc..

  3. Compression

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

partd-0.1.0.tar.gz (5.1 kB view details)

Uploaded Source

File details

Details for the file partd-0.1.0.tar.gz.

File metadata

  • Download URL: partd-0.1.0.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for partd-0.1.0.tar.gz
Algorithm Hash digest
SHA256 295da6c94be50f60d9b0454a2e43142e242bb2031664f5fe4020e605547241e4
MD5 1a34ff0453e73c0775a3150990b2a141
BLAKE2b-256 711afbde24a968980ad218a120e57ee665170345c23de30498183e73bfa8f4e4

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page