Manage and automate datasets for data science projects.

Dataset Manager

Manage and automate the datasets for your project with YAML files.

Create a YAML file (for example, name.yaml) in your dataset directory with the following content:

name: your_dataset_name
src: https://raw.githubusercontent.com/pcsanwald/kaggle-titanic/master/train.csv
description: this dataset is a test dataset
format: csv

name: the name used to reference the dataset.

src: the location of the dataset.

description: a description of the dataset, so you can remember later what it contains.

format: the pandas reader to use, following the read_<format> naming described here: https://pandas.pydata.org/pandas-docs/stable/reference/io.html. For example, csv selects read_csv.
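The read_<format> convention can be sketched as a name lookup on the pandas module. This is only an illustration of the convention, not the library's actual code: the helpers reader_name and load_with_pandas are hypothetical names introduced here.

```python
def reader_name(fmt):
    """Build the pandas reader function name for a `format` value."""
    return f"read_{fmt}"


def load_with_pandas(src, fmt):
    """Hypothetical loader: call pandas.read_<fmt>(src)."""
    # Imported lazily so reader_name can be used without pandas installed.
    import pandas as pd

    reader = getattr(pd, reader_name(fmt), None)
    if reader is None:
        raise ValueError(f"pandas has no reader named {reader_name(fmt)!r}")
    return reader(src)


print(reader_name("csv"))      # read_csv
print(reader_name("parquet"))  # read_parquet
```

With pandas installed, load_with_pandas(src, "csv") would be equivalent to calling pandas.read_csv(src).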

Each dataset is described by one YAML file inside the dataset directory.
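If you have many datasets, the YAML files can be generated from a script. The sketch below uses only the standard library; write_dataset_config is a hypothetical helper, not part of dataset_manager, and it assumes the values are simple strings that need no YAML quoting.

```python
import tempfile
from pathlib import Path


def write_dataset_config(directory, name, src, description, fmt):
    """Write <name>.yaml into `directory` and return its path."""
    directory = Path(directory)
    directory.mkdir(parents=True, exist_ok=True)
    path = directory / f"{name}.yaml"
    lines = [
        f"name: {name}",
        f"src: {src}",
        f"description: {description}",
        f"format: {fmt}",
    ]
    path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return path


# Demo in a temporary directory so nothing is left behind.
with tempfile.TemporaryDirectory() as tmp:
    config_path = write_dataset_config(
        tmp,
        name="your_dataset_name",
        src="https://raw.githubusercontent.com/pcsanwald/kaggle-titanic/master/train.csv",
        description="this dataset is a test dataset",
        fmt="csv",
    )
    print(config_path.read_text(encoding="utf-8"))
```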

List all Datasets

from dataset_manager import DatasetManager

# dataset_path: path to the directory containing your dataset YAML files
manager = DatasetManager(dataset_path)

manager.list_datasets()  # returns a list of all datasets found in dataset_path
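Since each dataset is one YAML file in the dataset directory, listing datasets amounts to discovering those files. The following is a minimal standard-library sketch of that idea, not the library's implementation; find_dataset_files is a hypothetical name, and it assumes the files use the .yaml extension.

```python
import tempfile
from pathlib import Path


def find_dataset_files(dataset_path):
    """Return the YAML files directly inside `dataset_path`, sorted by path."""
    return sorted(Path(dataset_path).glob("*.yaml"))


# Demo: two dataset files and one unrelated file in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    for filename in ("titanic.yaml", "iris.yaml", "notes.txt"):
        (Path(tmp) / filename).write_text("name: placeholder\n", encoding="utf-8")
    print([p.name for p in find_dataset_files(tmp)])  # ['iris.yaml', 'titanic.yaml']
```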

Get one Dataset

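from dataset_manager import DatasetManager

# dataset_path: path to the directory containing your dataset YAML files
manager = DatasetManager(dataset_path)

# name: the value of the name field in the dataset's YAML file
df = manager.get_dataset(name)  # returns the dataset as a pandas DataFrame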
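Looking a dataset up by name means matching the name field across the YAML files. The sketch below illustrates that lookup under stated assumptions: parse_flat_yaml and find_dataset_config are hypothetical helpers, and the parser only handles flat "key: value" lines like the example file above. A real implementation would use a proper YAML library.

```python
import tempfile
from pathlib import Path


def parse_flat_yaml(text):
    """Parse flat `key: value` lines into a dict (not a full YAML parser)."""
    config = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split at the first colon only, so URLs in values stay intact.
        key, sep, value = line.partition(":")
        if sep:
            config[key.strip()] = value.strip()
    return config


def find_dataset_config(dataset_path, name):
    """Return the config dict whose `name` field equals `name`."""
    for path in sorted(Path(dataset_path).glob("*.yaml")):
        config = parse_flat_yaml(path.read_text(encoding="utf-8"))
        if config.get("name") == name:
            return config
    raise KeyError(f"no dataset named {name!r} in {dataset_path}")


with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "titanic.yaml").write_text(
        "name: titanic\nsrc: https://example.com/train.csv\nformat: csv\n",
        encoding="utf-8",
    )
    config = find_dataset_config(tmp, "titanic")
    print(config["src"])     # https://example.com/train.csv
    print(config["format"])  # csv
```

The src and format values found this way are what a loader would then pass to the matching pandas read_<format> function.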
