Skip to main content

Attempt at predicting common types in CSVs about Italian people and places using Spacy NLP tool.

Project description

Travis CI build SonarCloud Quality SonarCloud Maintainability Codacy Maintainability Maintainability Pypi project Pypi total project downloads

This package is an attempt at predicting common types in CSVs about Italian people and places using ensemble heuristics with Decision Random Forests and Spacy NLP tool.

How do I install this package?

As usual, just download it using pip:

apt-get update -y
apt-get install -qyy apt-utils build-essential software-properties-common locales locales-all curl autoconf automake libtool python-dev pkg-config
curl https://raw.githubusercontent.com/LucaCappelletti94/italian_csv_type_prediction/master/setup.sh | sh

pip install italian_csv_type_prediction

Tests Coverage

Since some software handling coverages sometimes get slightly different results, here’s three of them:

Coveralls Coverage SonarCloud Coverage Code Climate Coverate

Usage examples

To get the typization of a list of data you can use:

from italian_csv_type_prediction import predict_types

predictions = predict_types([
    #list of words to predict goes here
])

Currently supported types

We currently support the following types:

TODO

Implementation notes

To train on GPU: https://mc.ai/spacy-training-using-gpu/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

italian_csv_type_prediction-1.1.56.tar.gz (4.4 MB view details)

Uploaded Source

File details

Details for the file italian_csv_type_prediction-1.1.56.tar.gz.

File metadata

  • Download URL: italian_csv_type_prediction-1.1.56.tar.gz
  • Upload date:
  • Size: 4.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.7.6

File hashes

Hashes for italian_csv_type_prediction-1.1.56.tar.gz
Algorithm Hash digest
SHA256 1d5211b615524e902cfbf0683a18bf4e4b580030da8b1828a213d8dd495387ac
MD5 b59e7888c7112ea0a38b7aefff4f02ca
BLAKE2b-256 2db2be263ed87be5690d68289abd6d504d69eeff08d9af0478aeb0260c2e798f

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page