SeqRepo REST Service
Project description
seqrepo-rest-api
Provides SeqRepo and GA4GH RefGet REST interfaces to biological sequences and sequence metadata from an existing seqrepo sequence repository.
Important changes
Breaking Change (July 2022): SEQREPO_DIR is now ignored. You must pass the seqrepo instance directory explicitly. See examples below.
Description
Specific, named biological sequences provide the reference and coordinate sysstem for communicating variation and consequential phenotypic changes. Several databases of sequences exist, with significant overlap, all using distinct names. Furthermore, these systems are often difficult to install locally.
Clients refer to sequences and metadata using familiar identifiers, such as NM_000551.3 or GRCh38:1, or any of several hash-based identifiers. The interface supports fast slicing of arbitrary regions of large sequences.
A "fully-qualified" identifier includes a namespace to disambiguate accessions (e.g., "1" in GRCh37 and GRCh38). If the namespace is provided, seqrepo uses it as-is. If the namespace is not provided and the unqualified identifier refers to a unique sequence, it is returned; otherwise, ambiguous identifiers will raise an error.
SeqRepo favors identifiers from identifiers.org whenever available. Examples include refseq and ensembl.
This repository is the REST interface only. The underlying data is provided by seqrepo.
This repository also implements the GA4GH refget (v1) protocol at
<baseurl>/refget/
.
Released under the Apache License, 2.0.
Links: Issues | Docker image
Citation
Hart RK, Prlić A (2020)
SeqRepo: A system for managing local collections of biological sequences.
PLoS ONE 15(12): e0239883. https://doi.org/10.1371/journal.pone.0239883
Examples
OpenAPI docs
The REST interface is implemented with OpenAPI. Current and interactive documentation is available at the base url for the endpoint.
Fetch Sequence
Fetch sequence by an accession:
$ curl -f http://0.0.0.0:5000/seqrepo/1/sequence/NP_001274413.1
MERSFVWLSCLDSDSCNLTFRLGEVESHACSPSLLWNLLTQYLPPGAGHILRTYNFPVLSCVSSCHLIGGKMPEN
Or not:
$ curl -f http://0.0.0.0:5000/seqrepo/1/sequence/bogus
curl: (22) The requested URL returned error: 404 NOT FOUND
Popular digests are also available:
$ curl -f http://0.0.0.0:5000/seqrepo/1/sequence/MD5:d52770ec477d0c9ee01fa034aff62cb4
MERSFVWLSCLDSDSCNLTFRLGEVESHACSPSLLWNLLTQYLPPGAGHILRTYNFPVLSCVSSCHLIGGKMPEN
With range:
# 👉 Seqrepo uses interbase coordinates.
$ curl -f "http://0.0.0.0:5000/seqrepo/1/sequence/NP_001274413.1?start=5&end=10"
VWLSC
Fetch Metadata
$ curl -f "http://0.0.0.0:5000/seqrepo/1/metadata/GRCh38:1"
{
"added": "2016-08-27T21:17:00Z",
"aliases": [
"GRCh38:1",
"GRCh38:chr1",
"GRCh38.p1:1",
"GRCh38.p1:chr1",
⋮
"GRCh38.p9:chr1",
"MD5:6aef897c3d6ff0c78aff06ac189178dd",
"refseq:NC_000001.11",
"SEGUID:FCUd6VJ6uikS/VWLbhGdVmj2rOA",
"SHA1:14251de9527aba2912fd558b6e119d5668f6ace0",
"sha512t24u:Ya6Rs7DHhDeg7YaOSg1EoNi3U_nQ9SvO",
"ga4gh:SQ.Ya6Rs7DHhDeg7YaOSg1EoNi3U_nQ9SvO"
],
"alphabet": "ACGMNRT",
"length": 248956422
}
Development
$ make devready
$ source venv/bin/activate
Running a local instance
Once installed as above, you should be able to:
$ seqrepo-rest-service /usr/local/share/seqrepo/2021-01-29
The navigate to the URL shown in the console output.
Building and running a docker image
A docker image can be built with this repo or pulled from docker hub. In either case, the container requires an existing local seqrepo sequence repository.
To build a docker image in this repo:
make docker-image
This will create biocommons/seqrepo-rest-service:lastest, like this:
$ docker images
REPOSITORY TAG IMAGE ID CREATED SIZE
biocommons/seqrepo-rest-service latest ad9ca051c5c9 2 minutes ago 627MB
This docker image is periodically pushed to docker hub.
Invoke the docker image like this this:
docker run \
--name seqrepo-rest-service \
--detach --rm -p 5000:5000 \
-v /usr/local/share/seqrepo/2021-01-29:/mnt/seqrepo \
biocommons/seqrepo-rest-service \
seqrepo-rest-service /mnt/seqrepo
You should then be able to fetch a test sequence like this:
$ curl 'http://127.0.0.1:5000/seqrepo/1/sequence/refseq:NM_000551.3?end=20'
CCTCGCCTCCGTTACAACGG
If things aren't working, check the logs with docker logs -f seqrepo-rest-service
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file seqrepo-rest-service-0.2.2.tar.gz
.
File metadata
- Download URL: seqrepo-rest-service-0.2.2.tar.gz
- Upload date:
- Size: 266.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a6dd0ad3beea4445bae885dcfd188d70afa18dddd22830c262256be60738364a |
|
MD5 | 9d484b18259256dcef43c6482cfa2aaa |
|
BLAKE2b-256 | 0e464d01480f6b3cd4a166603901621d9bbc206a7ad4f5e3f56e5231d5557634 |
File details
Details for the file seqrepo_rest_service-0.2.2-py3-none-any.whl
.
File metadata
- Download URL: seqrepo_rest_service-0.2.2-py3-none-any.whl
- Upload date:
- Size: 21.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5131e37f117f93ee05c174c28a15271853ff36fb5892402a6c2111430f17bae2 |
|
MD5 | 07279c8850a82617934f234f123a7135 |
|
BLAKE2b-256 | f38de1eef15cad26930047cba583ccfea4e348f8580ffabae7e00db18b296c8e |