
An attempt at predicting common types in CSVs about Italian people and places using the spaCy NLP tool.

Project description


This package is an attempt at predicting common types in CSVs about Italian people and places, combining ensemble heuristics with random forests and the spaCy NLP tool.

How do I install this package?

On Debian or Ubuntu, first install the system dependencies and run the project's setup script, then install the package with pip:

apt-get update -y
apt-get install -qyy apt-utils build-essential software-properties-common locales locales-all curl autoconf automake libtool python-dev pkg-config
curl https://raw.githubusercontent.com/LucaCappelletti94/italian_csv_type_prediction/master/setup.sh | sh

pip install italian_csv_type_prediction

Tests Coverage

Because coverage tools sometimes report slightly different results, here are three of them:

Coveralls Coverage, SonarCloud Coverage, Code Climate Coverage

Usage examples

To predict the types of a list of values, use:

from italian_csv_type_prediction import predict_types

predictions = predict_types([
    # list of words to predict goes here
])
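The call above takes a plain list, so a CSV column has to be extracted first. The following is a minimal sketch of that step using only the standard library; `read_column` and `predict_column` are hypothetical helper names, not part of the package, and the sample data is invented for illustration. The return format of `predict_types` is not documented here, so the sketch simply passes through whatever the predictor returns.

```python
import csv
import io
from typing import Any, Callable, List


def read_column(csv_text: str, column: str) -> List[str]:
    """Return the values of one named column from CSV text (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row[column] for row in reader]


def predict_column(
    csv_text: str,
    column: str,
    predictor: Callable[[List[str]], Any],
) -> Any:
    """Extract a column and hand it to a predictor such as predict_types.

    The predictor is injected so this sketch runs without the package;
    its return value is passed through unchanged.
    """
    return predictor(read_column(csv_text, column))


# Invented sample data, for illustration only.
sample = "name,city\nMario Rossi,Milano\nLucia Bianchi,Roma\n"
cities = read_column(sample, "city")

# With the package installed, the call would presumably look like this:
# from italian_csv_type_prediction import predict_types
# predictions = predict_column(sample, "city", predict_types)
```

Injecting the predictor keeps the CSV handling testable on its own and makes it easy to swap in `predict_types` once the package and its dependencies are installed.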

Currently supported types

We currently support the following types:

TODO

Implementation notes

To train on a GPU, see: https://mc.ai/spacy-training-using-gpu/


Source Distribution

italian_csv_type_prediction-1.1.57.tar.gz (4.4 MB)

File metadata

  • Size: 4.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.7.6

File hashes

Algorithm    Hash digest
SHA256       836c8c02d19dedc1ac2846fa1914a9b329a1cbb9ce6753fd0514fcbedf57d379
MD5          a6d10a7a3b65635b776fc1fb98cc0055
BLAKE2b-256  c63b8a01c545d603a9f601af8d4f6b8b01094e424a8576ace9505c5f7432fe2b
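
A downloaded copy of the source distribution can be checked against the SHA256 digest listed above. The following is a minimal sketch using Python's standard `hashlib`; the helper names and the local file path are assumptions for illustration.

```python
import hashlib

# SHA256 digest copied from the hash list above.
EXPECTED_SHA256 = "836c8c02d19dedc1ac2846fa1914a9b329a1cbb9ce6753fd0514fcbedf57d379"


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()


# Hypothetical usage, assuming the archive was saved in the current directory:
# print(matches_expected("italian_csv_type_prediction-1.1.57.tar.gz"))
```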
