
An attempt at predicting common types in CSVs about Italian people and places using the spaCy NLP tool.

Project description


This package attempts to predict common data types in CSVs about Italian people and places, combining ensemble heuristics with random forests and the spaCy NLP tool.

How do I install this package?

Install the system dependencies and run the setup script first, then install the package with pip:

apt-get update -y
apt-get install -qyy apt-utils build-essential software-properties-common locales locales-all curl autoconf automake libtool python-dev pkg-config
curl https://raw.githubusercontent.com/LucaCappelletti94/italian_csv_type_prediction/master/setup.sh | sh

pip install italian_csv_type_prediction

Tests Coverage

Since different coverage tools sometimes report slightly different results, here are three of them: Coveralls, SonarCloud and Code Climate.

Usage examples

To predict the types of a list of values, you can use:

from italian_csv_type_prediction import predict_types

predictions = predict_types([
    # list of words to predict goes here
])
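
Since the package targets CSVs, a common first step is turning a CSV into one list of values per column, which can then be handed to a predictor one column at a time. The following is only a minimal sketch using the standard library: the helper names `read_columns` and `predict_per_column` and the sample data are hypothetical and not part of this package, and the predictor is passed in as a generic callable because the return format of `predict_types` is not documented here.

```python
import csv
import io
from typing import Callable, Dict, List


def read_columns(csv_text: str) -> Dict[str, List[str]]:
    """Return a mapping from each header name to the list of its cell values."""
    reader = csv.DictReader(io.StringIO(csv_text))
    columns: Dict[str, List[str]] = {name: [] for name in (reader.fieldnames or [])}
    for row in reader:
        for name in columns:
            # Missing cells come back as None; normalise them to empty strings.
            columns[name].append(row.get(name) or "")
    return columns


def predict_per_column(
    csv_text: str, predictor: Callable[[List[str]], object]
) -> Dict[str, object]:
    """Apply `predictor` to every column and collect the results by header name."""
    return {
        name: predictor(values)
        for name, values in read_columns(csv_text).items()
    }


# Hypothetical sample data: a header row followed by two records.
sample = "nome,comune\nMario,Milano\nLucia,Roma\n"
columns = read_columns(sample)
```

With the package installed, `predict_types` could presumably be passed as `predictor`, for example `predict_per_column(sample, predict_types)`; this assumes it accepts a plain list of strings, as in the snippet above.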

Currently supported types

We currently support the following types:

TODO

Implementation notes

To train the spaCy model on a GPU, see: https://mc.ai/spacy-training-using-gpu/


