Identification and conversion functions for Chinese text processing

Dragon Mapper is a Python library that provides identification and conversion functions for Chinese text processing.

Features

  • Convert between Chinese characters, Pinyin, Zhuyin, and the International Phonetic Alphabet.

  • Identify a string as Traditional or Simplified Chinese, Pinyin, Zhuyin, or the International Phonetic Alphabet.

>>> import dragonmapper.hanzi
>>> import dragonmapper.transcriptions
>>> s = '我是一个美国人。'
>>> dragonmapper.hanzi.is_simplified(s)
True
>>> dragonmapper.hanzi.to_pinyin(s)
'wǒshìyīgèměiguórén。'
>>> dragonmapper.hanzi.to_pinyin(s, all_readings=True)
'[wǒ][shì/shi/tí][yī][gè/ge/gě/gàn][měi][guó][rén/ren]。'
>>> s = 'Wǒ shì yīgè měiguórén.'
>>> dragonmapper.transcriptions.is_pinyin(s)
True
>>> dragonmapper.transcriptions.pinyin_to_zhuyin(s)
'ㄨㄛˇ ㄕˋ ㄧ ㄍㄜˋ ㄇㄟˇ ㄍㄨㄛˊ ㄖㄣˊ.'
>>> dragonmapper.transcriptions.pinyin_to_ipa(s)
'wɔ˧˩˧ ʂɨ˥˩ i˥ kɤ˥˩ meɪ˧˩˧ kwɔ˧˥ ʐən˧˥.'
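
Pinyin can be written with tone numbers (wo3) or with tone marks (wǒ), and the change log refers to numbered_to_accented() and accented_to_numbered() for converting between the two. The following is a minimal standalone sketch of the numbered-to-accented direction, using the usual tone-mark placement rules. It is not Dragon Mapper's implementation: TONE_MARKS, mark_syllable(), and this version of numbered_to_accented() are illustrative names defined here, and the sketch assumes lowercase vowels.

```python
import re

# Accented forms of each vowel, indexed by tone number 1-4 (illustrative table).
TONE_MARKS = {
    "a": "āáǎà",
    "e": "ēéěè",
    "i": "īíǐì",
    "o": "ōóǒò",
    "u": "ūúǔù",
    "ü": "ǖǘǚǜ",
}


def mark_syllable(syllable):
    """Convert one numbered syllable (e.g. 'guo2') to accented form."""
    # Leave anything without a trailing tone number 1-5 unchanged.
    if not syllable or syllable[-1] not in "12345":
        return syllable
    body, tone = syllable[:-1], int(syllable[-1])
    # 'u:' and 'v' are common ASCII stand-ins for 'ü'.
    body = body.replace("u:", "ü").replace("v", "ü")
    if tone == 5:
        return body  # neutral tone carries no mark
    # Placement rules: 'a' or 'e' takes the mark; in 'ou' the 'o' does;
    # otherwise the last vowel in the syllable does.
    if "a" in body:
        idx = body.index("a")
    elif "e" in body:
        idx = body.index("e")
    elif "ou" in body:
        idx = body.index("o")
    else:
        vowels = [i for i, ch in enumerate(body) if ch in TONE_MARKS]
        if not vowels:
            return body
        idx = vowels[-1]
    marked = TONE_MARKS[body[idx]][tone - 1]
    return body[:idx] + marked + body[idx + 1:]


def numbered_to_accented(text):
    """Convert every numbered syllable found in text."""
    # A syllable is a run of letters (plus ':' for 'u:') ending in 1-5.
    return re.sub(r"[A-Za-zü:]+[1-5]",
                  lambda m: mark_syllable(m.group(0)), text)


print(numbered_to_accented("wo3 shi4 yi1ge4 mei3guo2ren2."))
```

Because the pattern treats any run of letters followed by a digit from 1 to 5 as a syllable, this sketch would also rewrite non-Pinyin tokens such as identifiers ending in a digit; a real converter needs a table of valid syllables to avoid that.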

Getting Started

Change Log

0.2.6 (2016-05-23)

  • Fixes reading for 女. Fixes #10.

  • Adds a note about Unicode strings for Python 2 users.

  • Bumps required hanzidentifier version.

  • Fixes umlaut on “l” consonant. Fixes #14.

0.2.5 (2015-08-06)

  • Fixes #9. Uses io.open() in setup.py with UTF-8 encoding.

0.2.4 (2015-04-08)

  • Fixes #8. Adds re.UNICODE to transcription conversion.

  • Fixes misformatted readings for certain characters.

  • Fixes #7. Fixes incorrect Unihan Database readings for the ‘ou’ vowel combinations.

0.2.3 (2014-04-28)

  • Fixes #6. Adds -r suffix syllable to transcription mapping data.

0.2.2 (2014-04-28)

  • Fixes a capitalization bug related to #5.

0.2.1 (2014-04-28)

  • Reformats README.rst.

  • Renames change log file to *.rst.

  • Adds authors and contributing files.

  • Sets up Travis CI.

  • Adds version to __init__.py.

  • Fixes #5. Makes accented_to_numbered() add apostrophes when needed.

  • Fixes #4. Fixes numbered_to_accented() handling of 'v' vowel.

  • Fixes #3. Changes IndexError exception handlers to KeyError.

  • Fixes #2. Fixes accented_to_numbered() with uppercase accented vowel.

0.2.0 (2014-04-14)

  • Fixes typo in is_pinyin.

  • Adds is_pinyin_compatible() and is_zhuyin_compatible() functions.

  • Removes code for identifying Hanzi and incorporates Hanzi Identifier library.

  • Removes Sphinx viewcode extension.

  • Adds Python 3.4 environment to tox configuration.

  • Fixes typo in setup.py. Fixes #1.

0.1.0 (2014-02-17)

  • Initial release.

Download files

Source Distribution

dragonmapper-0.2.6.tar.gz (1.4 MB)

Uploaded Source

Built Distribution

dragonmapper-0.2.6-py2.py3-none-any.whl (1.4 MB)

Uploaded Python 2 Python 3

File details

Details for the file dragonmapper-0.2.6.tar.gz.

File hashes

Hashes for dragonmapper-0.2.6.tar.gz
Algorithm Hash digest
SHA256 f2584fb3b29df442907cf7422893becbb8c8429d9f6611f0ba111bdfec495ec9
MD5 4e7bf74caeaec710a36ede5df01159fe
BLAKE2b-256 d9ec2d7ba67efa5ce88c91db08b8fd1fc65f3bab5619c4768705b3b5c3bbed15

File details

Details for the file dragonmapper-0.2.6-py2.py3-none-any.whl.

File hashes

Hashes for dragonmapper-0.2.6-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 21e0324ca05d553645887515202349e24427700f4c86d250d85d32b958d4bc31
MD5 3953d447a43ad3d8d96fbd24999f7e38
BLAKE2b-256 41d7cd3eda4bd4b3fdc19440209e5f9682bad817daa21879db5771ad6a8ae5d8
