Skip to main content

Natural language processing support for Pandas dataframes.

Project description

Natural language processing support for Pandas DataFrames.

Documentation Status

Text Extensions for Pandas adds extension types to Pandas DataFrames for representing natural language data, plus a library of functions for working with these extension types.

Features

SpanArray: A Pandas extension type for spans of text

  • Connect features with regions of a document
  • Visualize the internal data of your NLP application
  • Analyze the accuracy of your models
  • Combine the results of multiple models

TensorArray: A Pandas extension type for tensors

  • Represent BERT embeddings in a Pandas series
  • Store logits and other feature vectors in a Pandas series
  • Store an entire time series in each cell of a Pandas series

Pandas front-ends for popular NLP toolkits

Documentation

For examples of how to use the library, take a look at the notebooks in this directory.

API documentation can be found at https://text-extensions-for-pandas.readthedocs.io/en/latest/

Source Code

The source code for Text Extensions for Pandas is available at https://github.com/CODAIT/text-extensions-for-pandas.

We welcome code and documentation contributions! See the README file for more information on contributing.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text_extensions_for_pandas-0.1a1.tar.gz (92.7 kB view details)

Uploaded Source

Built Distribution

text_extensions_for_pandas-0.1a1-py3-none-any.whl (123.5 kB view details)

Uploaded Python 3

File details

Details for the file text_extensions_for_pandas-0.1a1.tar.gz.

File metadata

  • Download URL: text_extensions_for_pandas-0.1a1.tar.gz
  • Upload date:
  • Size: 92.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.7

File hashes

Hashes for text_extensions_for_pandas-0.1a1.tar.gz
Algorithm Hash digest
SHA256 28e2cb8b1b237d596984db5421ea141b474412d28c2bb5d8faa50a98e980d6ff
MD5 0e1aa46001801c9b2ac4badbf76ebe8b
BLAKE2b-256 44002926bc7e19d9bb06801097f36fe92df6690ac43c72c743afbbe55f5be45f

See more details on using hashes here.

File details

Details for the file text_extensions_for_pandas-0.1a1-py3-none-any.whl.

File metadata

  • Download URL: text_extensions_for_pandas-0.1a1-py3-none-any.whl
  • Upload date:
  • Size: 123.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.7

File hashes

Hashes for text_extensions_for_pandas-0.1a1-py3-none-any.whl
Algorithm Hash digest
SHA256 c1aaf542778dc3d0b727ca25a58996f4df3bb789d601b3d8d3a21a383964b613
MD5 399e3b1befea9d0cad142803ec8dcc55
BLAKE2b-256 cc2398f62c5ca172c42ed6911bb639604d1b4bc9e91eef6eb24e925a1eec7ea9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page