Skip to main content

Open-source data platform for biology.

Project description

Stars codecov pypi

LaminDB - Open-source data platform for biology

Public beta: Close to having converged a stable API, but some breaking changes might still occur.

Here is an [intro video](https://www.youtube.com/watch?v=DtJ9KnqWA8Q) to guide beta testing.

LaminDB is a Python library to manage data & analyses related to biology:

  • Query, validate & link data batches using biological registries & ontologies.
  • Track & query data lineage across pipelines, notebooks & app uploads.
  • Manage features & labels schema-less or schema-full.
  • Collaborate across a mesh of LaminDB instances.

If you want a UI: LaminApp is built on LaminDB. If LaminDB ~ git, LaminApp ~ GitHub.

(Enterprise features for LaminApp, support, integration tests & schemas are available on a paid plan - in your or our infrastructure.)

Quickstart

Run pip install 'lamindb[jupyter]' and lamin signup <email> on the command line (more info).

Init a LaminDB instance with local or cloud default storage like you'd init a git repository:

$ lamin init --storage ./mydata   # or s3://my-bucket, gs://my-bucket

Validate & register a DataFrame that comes with basic metadata:

import lamindb as ln
import pandas as pd

ln.track()  # track run context in a notebook

# save target feature names in Feature registry
features = ln.Feature.from_values(["feature1", "feature2", "perturbation"])
ln.save(features)

# receive a batch of data
df = pd.DataFrame(
    {"feature1": [1, 2, 3], "feature2": [3, 4, 5], "perturbation": ["pert1", "pert2", "pert1"]}
)

# validate features & create a Dataset object
dataset = ln.Dataset.from_df(df, name="Dataset 1")
dataset.save()  # save/upload dataset

Search, query, and load a DataFrame:

ln.Dataset.search("dataset 1")  # run a search

# run a query (under the hood, you have the full power of SQL to query)
dataset = ln.Dataset.filter(name__contains="set 1").one()

df = dataset.load()

Documentation

Read the docs.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lamindb-0.51.1.tar.gz (253.4 kB view details)

Uploaded Source

Built Distribution

lamindb-0.51.1-py3-none-any.whl (84.1 kB view details)

Uploaded Python 3

File details

Details for the file lamindb-0.51.1.tar.gz.

File metadata

  • Download URL: lamindb-0.51.1.tar.gz
  • Upload date:
  • Size: 253.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.31.0

File hashes

Hashes for lamindb-0.51.1.tar.gz
Algorithm Hash digest
SHA256 960fea4fc8c1c77e8068629c3409504d5b9a44097cd190efeadfa5a661d8abf6
MD5 1cce5fae47dc6976c8b0339297c92352
BLAKE2b-256 3991695e9b7471590fe2ecf7e1692efaa3f1e94111c34c6d6af3e5c91f002f44

See more details on using hashes here.

Provenance

File details

Details for the file lamindb-0.51.1-py3-none-any.whl.

File metadata

  • Download URL: lamindb-0.51.1-py3-none-any.whl
  • Upload date:
  • Size: 84.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.31.0

File hashes

Hashes for lamindb-0.51.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ebc1b6179f903753ca6c4591caa936e7cb7213a74732e09a9e79aaef618a3538
MD5 43adb867db6435dee20ad1dba5edd842
BLAKE2b-256 96e2a44b319eeae710b81cd7634f4ab2c163b59a00792e113afa67946dd067cc

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page