Skip to main content

The Classical Language Toolkit

Project description

circleci pypi twitter discord

The Classical Language Toolkit (CLTK) is a Python library offering natural language processing (NLP) for pre-modern languages.

Installation

For the CLTK’s latest version:

$ pip install cltk

For more information, see Installation docs or, to install from source, Development.

Pre-1.0 software remains available on the branch v0.1.x and docs at https://legacy.cltk.org. Install it with pip install "cltk<1.0".

Documentation

Documentation at https://docs.cltk.org.

Citation

When using the CLTK, please cite the following publication, including the DOI:

Johnson, Kyle P., Patrick J. Burns, John Stewart, Todd Cook, Clément Besnier, and William J. B. Mattingly. “The Classical Language Toolkit: An NLP Framework for Pre-Modern Languages.” In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, pp. 20-29. 2021. 10.18653/v1/2021.acl-demo.3

The complete BibTeX entry:

@inproceedings{johnson-etal-2021-classical,
    title = "The {C}lassical {L}anguage {T}oolkit: {A}n {NLP} Framework for Pre-Modern Languages",
    author = "Johnson, Kyle P.  and
      Burns, Patrick J.  and
      Stewart, John  and
      Cook, Todd  and
      Besnier, Cl{\'e}ment  and
      Mattingly, William J. B.",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.acl-demo.3",
    doi = "10.18653/v1/2021.acl-demo.3",
    pages = "20--29",
    abstract = "This paper announces version 1.0 of the Classical Language Toolkit (CLTK), an NLP framework for pre-modern languages. The vast majority of NLP, its algorithms and software, is created with assumptions particular to living languages, thus neglecting certain important characteristics of largely non-spoken historical languages. Further, scholars of pre-modern languages often have different goals than those of living-language researchers. To fill this void, the CLTK adapts ideas from several leading NLP frameworks to create a novel software architecture that satisfies the unique needs of pre-modern languages and their researchers. Its centerpiece is a modular processing pipeline that balances the competing demands of algorithmic diversity with pre-configured defaults. The CLTK currently provides pipelines, including models, for almost 20 languages.",
}

License

Copyright (c) 2014-2024 Kyle P. Johnson under the MIT License.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cltk-1.3.0.tar.gz (625.3 kB view details)

Uploaded Source

Built Distribution

cltk-1.3.0-py3-none-any.whl (695.7 kB view details)

Uploaded Python 3

File details

Details for the file cltk-1.3.0.tar.gz.

File metadata

  • Download URL: cltk-1.3.0.tar.gz
  • Upload date:
  • Size: 625.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.12 Linux/6.5.0-28-generic

File hashes

Hashes for cltk-1.3.0.tar.gz
Algorithm Hash digest
SHA256 8c0c6f4e8508a37df71d25500d857250163760ffa6d519e53467a572f92da7ab
MD5 ce833136a8be692f12613e1e85548cc1
BLAKE2b-256 9fb8a9376a8f0ab7e3c2a28a7a5445dde6a1c170d60c8b2ef11ca6a0bfad6259

See more details on using hashes here.

File details

Details for the file cltk-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: cltk-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 695.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.12 Linux/6.5.0-28-generic

File hashes

Hashes for cltk-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b2f87b8d75322374c9ace878702b7af7b9cfac4716fcd81c6718c6acd64b51a8
MD5 d1275b21ff88aee0bcba094e13423672
BLAKE2b-256 19275e7430805a8c1e9f6c38871a0f51614447763868b8073be81801c77b6c13

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page