Skip to main content

Identification and conversion functions for Chinese text processing

Project description

https://badge.fury.io/py/dragonmapper.png https://travis-ci.org/tsroten/dragonmapper.png?branch=develop

Dragon Mapper is a Python library that provides identification and conversion functions for Chinese text processing.

Features

  • Convert between Chinese characters, Pinyin, Zhuyin, and the International Phonetic Alphabet.

  • Identify a string as Traditional or Simplified Chinese, Pinyin, Zhuyin, or the International Phonetic Alphabet.

>>> s = '我是一个美国人。'
>>> dragonmapper.hanzi.is_simplified(s)
True
>>> dragonmapper.hanzi.to_pinyin(s)
'wǒshìyīgèměiguórén。'
>>> dragonmapper.hanzi.to_pinyin(s, all_readings=True)
'[wǒ][shì/shi/tí][yī][gè/ge/gě/gàn][měi][guó][rén/ren]。'
>>> s = 'Wǒ shì yīgè měiguórén.'
>>> dragonmapper.transcriptions.is_pinyin(s)
True
>>> dragonmapper.transcriptions.pinyin_to_zhuyin(s)
'ㄨㄛˇ ㄕˋ ㄧ ㄍㄜˋ ㄇㄟˇ ㄍㄨㄛˊ ㄖㄣˊ.'
>>> dragonmapper.transcriptions.pinyin_to_ipa(s)
'wɔ˧˩˧ ʂɨ˥˩ i˥ kɤ˥˩ meɪ˧˩˧ kwɔ˧˥ ʐən˧˥.'

Getting Started

Change Log

0.2.4 (2015-04-08)

  • Fixes #8. Adds re.UNICODE to transcription conversion.

  • Fixes misformatted readings for certain characters.

  • Fixes #7. Fixes incorrect Unihan Database readings for the ‘ou’ vowel combinations.

0.2.3 (2014-04-28)

  • Fixes #6. Adds -r suffix syllable to transcription mapping data.

0.2.2 (2014-04-28)

  • Fixes a capitalization bug related to #5.

0.2.1 (2014-04-28)

  • Reformats README.rst.

  • Renames change log file to *.rst.

  • Adds authors and contributing files.

  • Sets up Travis CI.

  • Adds version to __init__.py.

  • Fixes #5. Make accented_to_numbered() add apostrophes when needed.

  • Fixes #4. Fixes numbered_to_accented() handling of 'v' vowel.

  • Fixes #3. Changes IndexError exception handlers to KeyError.

  • Fixes #2. Fixes accented_to_numbered() with uppercase accented vowel.

0.2.0 (2014-04-14)

  • Fixes typo in is_pinyin.

  • Adds is_pinyin_compatible() and is_zhuyin_compatible() functions.

  • Removes code for identifying Hanzi and incorporates Hanzi Identifier library.

  • Removes Sphinx viewcode extension.

  • Adds Python 3.4 environment to tox configuration.

  • Fixes typo in setup.py. Fixes #1.

0.1.0 (2014-02-17)

  • Initial release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dragonmapper-0.2.5.tar.gz (1.4 MB view details)

Uploaded Source

Built Distribution

dragonmapper-0.2.5-py2.py3-none-any.whl (1.4 MB view details)

Uploaded Python 2 Python 3

File details

Details for the file dragonmapper-0.2.5.tar.gz.

File metadata

File hashes

Hashes for dragonmapper-0.2.5.tar.gz
Algorithm Hash digest
SHA256 28d351d1ea5177617c2e24650587563d00acec0ff8e1ddad54c7c124c4cd99e5
MD5 403611420f62202d9c1b3e816f260273
BLAKE2b-256 58bac2c5580c220bc6ea4fc31007c31c76eaa1167323d4791e8a15231920f06f

See more details on using hashes here.

File details

Details for the file dragonmapper-0.2.5-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for dragonmapper-0.2.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 2b024e3e818ad4ffaf59b2db7f52d4a17a5b41385f65df8a3a8cfb7d98688055
MD5 03ccd232802e30bb2d3daa9029cb18f7
BLAKE2b-256 cb275ae439aedf18f12259cbade19f8348af36128b0685b40b1320e28617068e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page