Skip to main content

Record to Record Recommendations for Invenio.

Project description

https://travis-ci.org/CERNDocumentServer/record-recommender.svg?branch=master

The Record Recommender creates recommendations for Invenio. By reading page views and downloads from Elasticsearch calculating the recommendations and storing them in Redis to be retrieve and displayed by Invenio.

Usage

The workflow to generate the recommendations is:

  1. recommender fetch 24 to cache the last 24 weeks of page views and downloads from Elasticsearch.

  2. recommender profiles 24 generate the user profiles from the page views and downloads. For the not logged in users profiles based on the ip-address and user agent are created.

  3. recommender build 50 calculates the recommendations using 50 processes and stores them in the specified Redis server.

Alternative the recommendations can be automatically be fetched, the profiles generated and the recommendations calculated all this with one command:

recommender update_recommender 24 50 for the last 24 weeks and using 50 processes.

Configuration

The configuration file is expected to be in /etc/record_recommender.yml otherwise the path to the config file can be defined by using the command line option --config_path.

# Record-Recommender configuration.

elasticsearch:
es_index: ['index-2014', 'index-2015', 'index-2016']
es_user: user
es_password:
es_host: localhost
es_port: 443

recommendation_version: 1

# Sentry connection string
sentry:

redis:
host: localhost
port: 6379
db: 0
prefix: 'Reco_1::'

cache:
base_path: cache/
cache_file_prefix: ''

logging:
version: 1
disable_existing_loggers: False
formatters:
    simple:
    format: '%(asctime)s - %(name)s - %(levelname)s - %(message)s'
handlers:
    console:
    class: logging.StreamHandler
    level: INFO
    formatter: simple
    stream: ext://sys.stdout
    sentry:
    class: raven.handlers.logging.SentryHandler
    level: WARN
    dsn:
    file:
    class : logging.FileHandler
    formatter: simple
    level: DEBUG
    filename: log_recommender.log
loggers:
    record_recommender:
    level: DEBUG
    handlers: [console, file, sentry]
    propagate: no
    # elasticsearch:
    #   level: WARN
    #   handlers: [console]
    #   propagate: no
root:
    level: ERROR
    handlers: [console, file, sentry]

Additional to the configuration options found in the config file environment variables, which overwrite the ones from the config file can be set.

  • RECOMMENDER_ES_PASSWORD to set the Elasticsearch password.

  • RECOMMENDER_SENTRY to set the Sentry connection string.

Command line

Usage: recommender [OPTIONS] COMMAND [ARGS]...

Record-Recommender command line version.

Options:
-v, --verbose           Enables verbose mode.
-c, --config_path TEXT  Path to the configuration file.
--help                  Show this message and exit.

Commands:
debug               Debug the application and recommender.
fetch               Fetch newest PageViews and Downloads.
build               Calculate all recommendations.
profiles            Number of weeks to build.
update_recommender  Download and build the recommendations.

Debugging the Recommendations

As first step look into the created user profiles in the defined cache folder.

To explore the graph with all loaded data use the recommender debug command to get a interactive python shell.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

record-recommender-0.0.2.tar.gz (25.6 kB view details)

Uploaded Source

File details

Details for the file record-recommender-0.0.2.tar.gz.

File metadata

File hashes

Hashes for record-recommender-0.0.2.tar.gz
Algorithm Hash digest
SHA256 cc9cf2131ec1ad3d5b0ab4a6acd6a9d7b9386b516b9886bfa7fb23eebfd57e1b
MD5 a19dfdb81c00cb9d538ec06036a4dbd7
BLAKE2b-256 3267e09ad681936b1a4025c20854a5f6524f9d2f46a2e4f98ee63a87e9867228

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page