Skip to main content

Schema resources for the National Microbiome Data Collaborative (NMDC)

Project description

National Microbiome Data Collaborative Schema

PyPI - License PyPI version

The NMDC is a multi-organizational effort to integrate microbiome data across diverse areas in medicine, agriculture, bioenergy, and the environment. This integrated platform facilitates comprehensive discovery of and access to multidisciplinary microbiome data in order to unlock new possibilities with microbiome data science.

This repository mainly defines a LinkML schema for managing metadata from the National Microbiome Data Collaborative (NMDC).

Repository Contents Overview

Some products that are maintained, and tasks orchestrated within this repository are:

  • Maintenance of LinkML YAML that specifies the NMDC Schema
  • Makefile targets for converting the schema from it's native LinkML YAML format to other artifact like JSON Schema
  • Build, deployment and distribution of the schema as a PyPI package
  • Automatic publishing of refreshed documentation upon change to the schema, accessible here

Background

The NMDC Introduction to metadata and ontologies primer provides some the context for this project.

See also these slides describing the schema.

Maintaining the Schema

See MAINTAINERS.md for instructions on maintaining and updating the schema.

NMDC metadata downloads

See https://github.com/microbiomedata/nmdc-runtime/#data-exports

Ecosystem Diagram

flowchart TD
    subgraph nmdc-schema repo
    ly([NMDC LinkML YAML files])
    lg(generated artifacts)
    ly-.make all.->lg
    end
    subgraph Data Validation
    click ly href "https://github.com/microbiomedata/nmdc-schema/tree/main/src/schema" _top
    d[(Some data)]
    v[[Validation process]]
    v--Has input-->d
    v--Has input-->ly
    end
    subgraph MIxS
    m([MIxS Schema])
    end
    subgraph SubmissionPortal
    sppg[(Postgres)]
    spa[Portal API]
    sppg<-->spa
    click spa href "https://data.dev.microbiomedata.org/docs" _top
    ps[Pydantic schema]
    end
    subgraph MongoDB
    mc[(Collections)]
    ms[Implicit schema]
    ma[Search API]
    mc<-->ma
    click ma href "https://api.dev.microbiomedata.org/docs" _top
    end
    mc --Ingest--> sppg
    subgraph DH Template Prep
    saf[sheets_and_friends repo]
    sps([Submission Portal Schema])
    dhjs[Data Harmoizer JS, etc.]
    saf-->sps-->dhjs
    end
    dhjs-->SubmissionPortal
    subgraph DataMapping
    sa[sample-annotator repo]
    end
    spa-->sa-..->ma
    ly-..->ps
    sj[some json]
    ly-..->sj-..->MongoDB-..->ps

Project details


Release history Release notifications | RSS feed

This version

7.4.7

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nmdc_schema-7.4.7.tar.gz (227.5 kB view details)

Uploaded Source

Built Distribution

nmdc_schema-7.4.7-py3-none-any.whl (234.8 kB view details)

Uploaded Python 3

File details

Details for the file nmdc_schema-7.4.7.tar.gz.

File metadata

  • Download URL: nmdc_schema-7.4.7.tar.gz
  • Upload date:
  • Size: 227.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for nmdc_schema-7.4.7.tar.gz
Algorithm Hash digest
SHA256 23434c3dd03cec04f149f9ecfa55882abe6cf2d42b7e744906daefd83c7c4cb7
MD5 d873e14365532a3f3c227e0cc9471113
BLAKE2b-256 5bffd6d6aba6dad07f32cce26432d141797f1823d64387a341fe771e7e9e4e70

See more details on using hashes here.

Provenance

File details

Details for the file nmdc_schema-7.4.7-py3-none-any.whl.

File metadata

  • Download URL: nmdc_schema-7.4.7-py3-none-any.whl
  • Upload date:
  • Size: 234.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for nmdc_schema-7.4.7-py3-none-any.whl
Algorithm Hash digest
SHA256 21391b79a5bfbbd0ea4bc632150d33eb3d907fef8fa63a63a48c909ad746d0f4
MD5 f552983b77360111d9194bfb1d312c9f
BLAKE2b-256 e3e246ca3e01e1db1c0a09b87e64e1ddb73912d399bbf2b55ac809680488e140

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page