Python API for CJK datasets. Part of the cihai project.

Project description

cihai - United front to provide open, accessible, and standardized access to CJK data

In development

Tool

  • Single tool for interfacing with CJK data; compare with cjklib.

  • Python API for programmatically interfacing with the data.

  • Compatible with Python 2.7, 3.3+, and PyPy/PyPy3.

  • Designed against a robust test suite. See Travis Builds and Revision History.

  • Supports Unihan. Support for character decomposition and dictionaries (CEDict) is upcoming.

  • Extensible. For new data sets, read more about how you can extend cihai to support new datapackages-compatible datasets.

  • For more, see internals for design philosophy.

Workgroup and Standardization

  • Find undigitized data sets relating to CJK

  • Clarify and negotiate license details of data sets; see permissively licensing your dataset.

  • Create standardized, consistent packages for all data sets

  • Maintain aforementioned datasets

  • Continue to improve current infrastructure and packages while seeking out rare and undigitized CJK data for preservation and access

Troubleshooting

Python 2.7 and UCS

To use cihai on Python 2.7, Python must be built with UCS4 support via --enable-unicode=ucs4. You can test for UCS4 with:

>>> import sys
>>> sys.maxunicode > 0xffff
True

Most packaged and bundled Python distributions are already built with UCS4 (such as Ubuntu’s system Python). On Python 3.3 and later this distinction no longer exists, so no action is needed.
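The build type matters because many rare CJK characters sit outside the Basic Multilingual Plane. The following is a minimal sketch of the same check wrapped in a function, with a sample character to show the effect. The names is_wide_unicode_build and SAMPLE are hypothetical helpers for illustration and are not part of cihai.

```python
import sys


def is_wide_unicode_build():
    """Return True when code points above U+FFFF are stored as one character."""
    return sys.maxunicode > 0xFFFF


# U+20000 is the first code point of CJK Unified Ideographs Extension B,
# which lies outside the Basic Multilingual Plane.
SAMPLE = u"\U00020000"

if __name__ == "__main__":
    # On a narrow (UCS2) Python 2.7 build the sample is stored as a
    # surrogate pair, so len() reports 2 instead of 1.
    print("wide build: {0}".format(is_wide_unicode_build()))
    print("len(U+20000): {0}".format(len(SAMPLE)))
```

On Python 3.3 and later the function always returns True and the sample has length 1.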

Python support: Python 2.7, >= 3.3, pypy
Source: https://github.com/cihai/cihai
Docs: https://cihai.git-pull.com
Changelog: https://cihai.git-pull.com/en/latest/history.html
API: https://cihai.git-pull.com/en/latest/api.html
Issues: https://github.com/cihai/cihai/issues
Travis: https://travis-ci.org/cihai/cihai
Test coverage: https://coveralls.io/r/cihai/cihai
pypi: https://pypi-hypernode.com/pypi/cihai
OpenHub: https://www.openhub.net/p/cihai
License: BSD.

git repo

$ git clone https://github.com/cihai/cihai.git

install stable

$ pip install cihai

install dev

$ git clone https://github.com/cihai/cihai.git cihai
$ cd ./cihai
$ virtualenv .env
$ source .env/bin/activate
$ pip install -e .

tests

$ python setup.py test


