A flask blueprint providing an API for accessing and searching an ElasticSearch index created from source datapackages

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3.6
Topic
- Software Development :: Libraries :: Python Modules

Project description

apies

PyPI - Python Version

apies is a flask blueprint providing an API for accessing and searching an ElasticSearch index created from source datapackages.

endpoints

`/get`

`/search/count`

`/search/

`download/<doctypes>`

Downloads search results in either csv, xls or xlsx format.

Query parameters that can be send:

types_formatted: The type of the documents to search
search_term: The Elastic search query
size: Number of hits to return
offset: Whether or not term offsets should be returned
filters: What offset to use for the pagination
dont_highlight:
from_date: If there should be a date range applied to the search, and from what date
to_date: If there should be a date range applied to the search, and until what date
order:
file_format: The format of the file to be returned, either 'csv', 'xls' or 'xlsx'. If not passed the file format will be xlsx
file_name: The name of the file to be returned, by default the name will be 'search_results'
column_mapping: If the columns should get a different name then in the original data, a column map can be send, for example:

{
  "עיר": "address.city",
  "תקציב": "details.budget"
}

For example, get a csv file with column mapping:

http://localhost:5000/api/download/jobs?q=engineering&size=2&file_format=csv&file_name=my_results&column_mapping={%22mispar%22:%22Job%20ID%22}

Or get an xslx file without column mapping:

http://localhost:5000/api/download/jobs?q=engineering&size=2&file_format=xlsx&file_name=my_results

configuration

Flask configuration for this blueprint:

    from apies import apies_blueprint
    import elasticsearch

    app.register_blueprint(
        apies_blueprint(['path/to/datapackage.json', Package(), ...],
                        elasticsearch.Elasticsearch(...), 
                        {'doc-type-1': 'index-for-doc-type-1', ...}, 
                        'index-for-documents',
                        dont_highlight=['fields', 'not.to', 'highlight'],
                        text_field_rules=lambda schema_field: [], # list of tuples: ('exact'/'inexact'/'natural', <field-name>)
                        multi_match_type='most_fields',
                        multi_match_operator='and'),
        url_prefix='/search/'
    )

local development

You can start a local development server by following these steps:

Install Dependencies:

a. Install Docker locally

b. Install Python dependencies:
```
$ pip install dataflows dataflows-elasticsearch
$ pip install -e .
```
Go to the sample/ directory

Start ElasticSearch locally:

$ ./start_elasticsearch.sh

This script will wait and poll the server until it's up and running. You can test it yourself by running:

$ curl -s http://localhost:9200
     {
     "name" : "99cd2db44924",
     "cluster_name" : "docker-cluster",
     "cluster_uuid" : "nF9fuwRyRYSzyQrcH9RCnA",
     "version" : {
         "number" : "7.4.2",
         "build_flavor" : "default",
         "build_type" : "docker",
         "build_hash" : "2f90bbf7b93631e52bafb59b3b049cb44ec25e96",
         "build_date" : "2019-10-28T20:40:44.881551Z",
         "build_snapshot" : false,
         "lucene_version" : "8.2.0",
         "minimum_wire_compatibility_version" : "6.8.0",
         "minimum_index_compatibility_version" : "6.0.0-beta1"
     },
     "tagline" : "You Know, for Search"
     }

Load data into the database

$ DATAFLOWS_ELASTICSEARCH=localhost:9200 python load_fixtures.py

You can test that data was loaded:

$ curl -s http://localhost:9200/jobs-job/_count?pretty
 {
     "count" : 1757,
     "_shards" : {
         "total" : 1,
         "successful" : 1,
         "skipped" : 0,
         "failed" : 0
     }
 }

Start the sample server

$ python server.py 
 * Serving Flask app "server" (lazy loading)
 * Environment: production
 WARNING: Do not use the development server in a production environment.
 Use a production WSGI server instead.
 * Debug mode: off
 * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)

Now you can hit the server's endpoints, for example:

     $ curl -s 'localhost:5000/api/search/jobs?q=engineering&size=2' | jq
     127.0.0.1 - - [26/Jun/2019 10:45:31] "GET /api/search/jobs?q=engineering&size=2 HTTP/1.1" 200 -
     {
         "search_counts": {
             "_current": {
             "total_overall": 617
             }
         },
         "search_results": [
             {
             "score": 18.812,
             "source": {
                 "# Of Positions": "5",
                 "Additional Information": "TO BE APPOINTED TO ANY CIVIL <em>ENGINEERING</em> POSITION IN BRIDGES, CANDIDATES MUST POSSESS ONE YEAR OF CIVIL <em>ENGINEERING</em> EXPERIENCE IN BRIDGE DESIGN, BRIDGE CONSTRUCTION, BRIDGE MAINTENANCE OR BRIDGE INSPECTION.",
                 "Agency": "DEPARTMENT OF TRANSPORTATION",
                 "Business Title": "Civil Engineer 2",
                 "Civil Service Title": "CIVIL ENGINEER",
                 "Division/Work Unit": "<em>Engineering</em> Review & Support",
         ...
     }

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3.6
Topic
- Software Development :: Libraries :: Python Modules

Release history Release notifications | RSS feed

1.11.0

Apr 18, 2024

1.10.1

Nov 13, 2023

1.10.0

Nov 12, 2023

1.9.1

Feb 12, 2023

1.9.0

Feb 12, 2023

1.8.2

Dec 26, 2022

1.8.1

Dec 26, 2022

1.8.0

Dec 26, 2022

1.7.10

Dec 21, 2022

1.7.9

Nov 13, 2022

1.7.8

Nov 6, 2022

1.7.7

Oct 26, 2022

1.7.6

Oct 26, 2022

1.7.5

Oct 26, 2022

1.7.4

Oct 25, 2022

1.7.3

Oct 25, 2022

1.7.2

Oct 25, 2022

1.7.1

Oct 24, 2022

1.7.0

Oct 24, 2022

1.6.8

Sep 28, 2022

1.6.7

Sep 27, 2022

1.6.6

Sep 27, 2022

1.6.5

Sep 27, 2022

1.6.4

Sep 27, 2022

1.6.3

Sep 27, 2022

1.6.2

Sep 27, 2022

1.6.1

Sep 27, 2022

1.6.0

Sep 27, 2022

1.5.9

Jul 22, 2022

1.5.8

Jul 22, 2022

1.5.7

Mar 27, 2022

1.5.6

Mar 27, 2022

1.5.5

Nov 25, 2021

1.5.4

Nov 25, 2021

This version

1.5.3

Nov 25, 2021

1.5.2

Nov 15, 2021

1.5.1

Nov 11, 2021

1.5.0

Nov 3, 2021

1.4.1

Oct 8, 2021

1.4.0

Sep 26, 2021

1.3.6

Apr 26, 2021

1.3.5

Apr 26, 2021

1.3.4

Apr 19, 2021

1.3.3

Apr 19, 2021

1.3.2

Mar 11, 2021

1.3.1

Feb 15, 2021

1.3.0

Feb 15, 2021

1.2.2

Feb 5, 2021

1.2.1

Feb 5, 2021

1.2.0

Feb 4, 2021

1.1.10

Mar 11, 2020

1.1.9

Feb 15, 2020

1.1.8

Dec 27, 2019

1.1.7

Dec 26, 2019

1.1.6

Dec 25, 2019

1.1.5

Dec 25, 2019

1.1.4

Dec 25, 2019

1.1.3

Dec 25, 2019

1.1.2

Dec 25, 2019

1.1.1

Dec 25, 2019

1.1.0

Dec 25, 2019

1.0.1

Nov 26, 2019

1.0.0

Nov 26, 2019

0.1.1

Sep 21, 2019

0.1.0

Sep 18, 2019

0.0.23

Sep 15, 2019

0.0.22

Jul 30, 2019

0.0.21

Jul 30, 2019

0.0.20

Jul 22, 2019

0.0.19

Jul 14, 2019

0.0.18

Jun 26, 2019

0.0.17

Jun 26, 2019

0.0.16

May 20, 2019

0.0.13

Apr 10, 2019

0.0.12

Apr 10, 2019

0.0.11

Apr 8, 2019

0.0.10

Apr 4, 2019

0.0.9

Jan 5, 2019

0.0.8

Jan 5, 2019

0.0.7

Dec 26, 2018

0.0.6

Dec 26, 2018

0.0.5

Dec 26, 2018

0.0.4

Dec 26, 2018

0.0.3

Dec 11, 2018

0.0.2

Dec 8, 2018

0.0.1

Dec 7, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

apies-1.5.3.tar.gz (16.4 kB view hashes)

Uploaded Nov 25, 2021 Source

Built Distribution

apies-1.5.3-py2.py3-none-any.whl (15.3 kB view hashes)

Uploaded Nov 25, 2021 Python 2 Python 3

Hashes for apies-1.5.3.tar.gz

Hashes for apies-1.5.3.tar.gz
Algorithm	Hash digest
SHA256	`0bcb9e1c350d0dfb7fcfc7302683a00a8f1ab16b5ac5d1d0496d45bfbe4c4759`
MD5	`4c5347deb5d9b09d245cdae606218aee`
BLAKE2b-256	`0214e57bb72a0d2bedbd3933d137f700a45bf96f7f91e91870dd673bb6efa728`

Hashes for apies-1.5.3-py2.py3-none-any.whl

Hashes for apies-1.5.3-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`6e78da660a8ad7fc7119e6018e9b83d000fe96f4308e1ae5a8dcc4c0460a493a`
MD5	`87572811a28179a3258d1a0e1323547a`
BLAKE2b-256	`1846d8e532d2aa1ba35d6c8024385ddc5ebc2b783f605ac4b5fed0356c411e00`