A flask blueprint providing an API for accessing and searching an ElasticSearch index created from source datapackages
Project description
apies
apies is a flask blueprint providing an API for accessing and searching an ElasticSearch index created from source datapackages.
endpoints
/get
/search/count
/search/<doctypes>
configuration
Flask configuration for this blueprint:
from apies import apies_blueprint
import elasticsearch
app.register_blueprint(
apies_blueprint(['path/to/datapackage.json', Package(), ...],
elasticsearch.Elasticsearch(...),
'index-to-search-in',
document_doctype='document',
dont_highlight=['fields', 'not.to', 'highlight'],
text_field_rules=lambda schema_field: [], # list of tuples: ('exact'/'inexact'/'natural', <field-name>)
multi_match_type='most_fields',
multi_match_operator='and'),
url_prefix='/search/'
)
local development
You can start a local development server by following these steps:
-
Install Dependencies:
a. Install Docker locally
b. Install Python dependencies:
$ pip install dataflows datapackage-pipelines-elasticsearch $ pip install -e .
-
Go to the
sample/
directory -
Start ElasticSearch locally:
$ ./start_elasticsearch.sh
This script will wait and poll the server until it's up and running. You can test it yourself by running:
$ curl -s http://localhost:9200 { "name" : "DTsRT6T", "cluster_name" : "elasticsearch", "cluster_uuid" : "QnLVHaOYTkmJZzkCG3Hong", "version" : { "number" : "5.5.2", "build_hash" : "b2f0c09", "build_date" : "2017-08-14T12:33:14.154Z", "build_snapshot" : false, "lucene_version" : "6.6.0" }, "tagline" : "You Know, for Search" }
-
Load data into the database
$ python load_fixtures.py
You can test that data was loaded:
$ curl -s http://localhost:9200/jobs/_count?pretty { "count" : 3516, "_shards" : { "total" : 5, "successful" : 5, "failed" : 0 } }
-
Start the sample server
$ python server.py * Serving Flask app "server" (lazy loading) * Environment: production WARNING: Do not use the development server in a production environment. Use a production WSGI server instead. * Debug mode: off * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
-
Now you can hit the server's endpoints, for example:
$ curl -s 'localhost:5000/api/search/jobs?q=engineering&size=2' | jq 127.0.0.1 - - [26/Jun/2019 10:45:31] "GET /api/search/jobs?q=engineering&size=2 HTTP/1.1" 200 - { "search_counts": { "_current": { "total_overall": 617 } }, "search_results": [ { "score": 18.812, "source": { "# Of Positions": "5", "Additional Information": "TO BE APPOINTED TO ANY CIVIL <em>ENGINEERING</em> POSITION IN BRIDGES, CANDIDATES MUST POSSESS ONE YEAR OF CIVIL <em>ENGINEERING</em> EXPERIENCE IN BRIDGE DESIGN, BRIDGE CONSTRUCTION, BRIDGE MAINTENANCE OR BRIDGE INSPECTION.", "Agency": "DEPARTMENT OF TRANSPORTATION", "Business Title": "Civil Engineer 2", "Civil Service Title": "CIVIL ENGINEER", "Division/Work Unit": "<em>Engineering</em> Review & Support", ... }
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for apies-0.1.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 462c36f00f45ef4f8029b09a302abcfb9350321d22e1bec91357c1624db6b624 |
|
MD5 | 1af3005d6eb645df39fb573897c6e9a4 |
|
BLAKE2b-256 | 6e991ea89385126b5f0c7307e025e4614c565cf1a3f3cb9a383c283865205099 |