Open-source data platform for biology.
Project description
LaminDB - Open-source data platform for biology
Public beta: Close to having converged a stable API, but some breaking changes might still occur.
Here is an [intro video](https://www.youtube.com/watch?v=DtJ9KnqWA8Q) to guide beta testing.
LaminDB is a Python library to manage data & analyses related to biology:
- Query, validate & link data batches using biological registries & ontologies.
- Track & query data lineage across pipelines, notebooks & app uploads.
- Manage features & labels schema-less or schema-full.
- Collaborate across a mesh of LaminDB instances.
If you want a UI: LaminApp is built on LaminDB. If LaminDB ~ git, LaminApp ~ GitHub.
(Enterprise features for LaminApp, support, integration tests & schemas are available on a paid plan - in your or our infrastructure.)
Quickstart
Run pip install 'lamindb[jupyter]'
and lamin signup <email>
on the command line (more info).
Init a LaminDB instance with local or cloud default storage like you'd init a git repository:
$ lamin init --storage ./mydata # or s3://my-bucket, gs://my-bucket
Validate & register a DataFrame
that comes with basic metadata:
import lamindb as ln
import pandas as pd
ln.track() # track run context in a notebook
# save target feature names in Feature registry
features = ln.Feature.from_values(["feature1", "feature2", "perturbation"])
ln.save(features)
# receive a batch of data
df = pd.DataFrame(
{"feature1": [1, 2, 3], "feature2": [3, 4, 5], "perturbation": ["pert1", "pert2", "pert1"]}
)
# validate features & create a Dataset object
dataset = ln.Dataset.from_df(df, name="Dataset 1")
dataset.save() # save/upload dataset
Search, query, and load a DataFrame
:
ln.Dataset.search("dataset 1") # run a search
# run a query (under the hood, you have the full power of SQL to query)
dataset = ln.Dataset.filter(name__contains="set 1").one()
df = dataset.load()
Documentation
Read the docs.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file lamindb-0.51.1.tar.gz
.
File metadata
- Download URL: lamindb-0.51.1.tar.gz
- Upload date:
- Size: 253.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-requests/2.31.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 960fea4fc8c1c77e8068629c3409504d5b9a44097cd190efeadfa5a661d8abf6 |
|
MD5 | 1cce5fae47dc6976c8b0339297c92352 |
|
BLAKE2b-256 | 3991695e9b7471590fe2ecf7e1692efaa3f1e94111c34c6d6af3e5c91f002f44 |
Provenance
File details
Details for the file lamindb-0.51.1-py3-none-any.whl
.
File metadata
- Download URL: lamindb-0.51.1-py3-none-any.whl
- Upload date:
- Size: 84.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-requests/2.31.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ebc1b6179f903753ca6c4591caa936e7cb7213a74732e09a9e79aaef618a3538 |
|
MD5 | 43adb867db6435dee20ad1dba5edd842 |
|
BLAKE2b-256 | 96e2a44b319eeae710b81cd7634f4ab2c163b59a00792e113afa67946dd067cc |