Conversion between Traditional and Simplified Chinese

Open Chinese Convert 開放中文轉換

Introduction 介紹

Open Chinese Convert (OpenCC, 開放中文轉換) is an open-source project for conversion between Traditional Chinese, Simplified Chinese and Japanese Kanji (Shinjitai). It supports character-level and phrase-level conversion, character variant conversion, and regional idiom conversion among Mainland China, Taiwan and Hong Kong. It is not a translation tool between Mandarin and Cantonese or other languages.

Discussion (Telegram): https://t.me/open_chinese_convert

Features 特點

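  • Strictly distinguishes "one Simplified character to many Traditional characters" from "one Simplified character to many variant forms".
  • Fully compatible with variant characters, which can be substituted dynamically.
  • Entries that map one Simplified character to several Traditional characters are strictly reviewed, following the principle "split wherever possible rather than merge" (能分則不合).
  • Supports conversion of variant characters and regional idioms among Mainland China, Taiwan and Hong Kong, such as 「裏」 and 「裡」, or 「鼠標」 and 「滑鼠」.
  • Dictionaries and the library are fully separated, so dictionaries can be freely modified, imported and extended.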
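
To illustrate why phrase-level conversion matters for one-to-many characters, here is a minimal sketch of greedy longest-match conversion with a character-level fallback. It is not OpenCC's implementation: the two dictionaries are tiny hypothetical samples, and the function name `convert` is made up for this example.

```python
# Toy dictionaries for illustration only; OpenCC ships far larger ones.
PHRASES = {"头发": "頭髮", "发展": "發展"}          # phrase-level entries
CHARS = {"头": "頭", "发": "發", "汉": "漢"}        # character-level fallback


def convert(text, phrases=PHRASES, chars=CHARS):
    """Convert text by trying the longest phrase first, then single characters."""
    max_len = max(map(len, phrases), default=1)
    out = []
    i = 0
    while i < len(text):
        # Try phrases from the longest candidate down to length 2.
        for n in range(min(max_len, len(text) - i), 1, -1):
            piece = text[i:i + n]
            if piece in phrases:
                out.append(phrases[piece])
                i += n
                break
        else:
            # No phrase matched here: convert one character, or keep it as-is.
            ch = text[i]
            out.append(chars.get(ch, ch))
            i += 1
    return "".join(out)


print(convert("头发"))  # the phrase entry selects 髮 rather than the default 發
print(convert("发展"))
```

The Simplified character 发 corresponds to both 發 and 髮, so a character table alone has to pick one default; the phrase table is what lets 头发 come out with 髮.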

Installation 安裝

See Download.

Usage 使用

Online demo 線上轉換展示

Warning: This is NOT an API. You will be banned if you make calls programmatically.

https://opencc.byvoid.com/

Node.js

npm install opencc

JavaScript

const OpenCC = require('opencc');
const converter = new OpenCC('s2t.json');
converter.convertPromise("汉字").then(converted => {
  console.log(converted);  // 漢字
});

TypeScript

import { OpenCC } from 'opencc';

async function main() {
  const converter: OpenCC = new OpenCC('s2t.json');
  const result: string = await converter.convertPromise('汉字');
  console.log(result);  // 漢字
}

main();

See demo.js and ts-demo.ts.

Python

pip install opencc (Windows, Linux, Mac)

import opencc
converter = opencc.OpenCC('s2t.json')
converter.convert('汉字')  # 漢字
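
The Benchmark section of this README reports initialization times of roughly 25 ms for `s2t`, so it can be worth constructing a converter once and reusing it. The following is a minimal sketch of that idea, not part of the OpenCC API: `get_converter`, `convert_all` and the `factory` parameter are hypothetical names introduced here, and the default factory assumes the `opencc` package is installed.

```python
from functools import lru_cache


def _default_factory(config):
    # Imported lazily so the helpers below can be used with a stand-in factory.
    import opencc  # assumes `pip install opencc`
    return opencc.OpenCC(config)


@lru_cache(maxsize=None)
def get_converter(config="s2t.json", factory=_default_factory):
    """Build one converter per (config, factory) pair and cache it."""
    return factory(config)


def convert_all(lines, config="s2t.json", factory=_default_factory):
    """Convert many strings while constructing the converter at most once."""
    converter = get_converter(config, factory)
    return [converter.convert(line) for line in lines]
```

With the real package installed, `convert_all(['汉字', '鼠标'])` would reuse a single `opencc.OpenCC('s2t.json')` instance across both strings.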

C++

#include "opencc.h"

int main() {
  const opencc::SimpleConverter converter("s2t.json");
  converter.Convert("汉字");  // 漢字
  return 0;
}

C

#include <string.h>

#include "opencc.h"

int main() {
  opencc_t opencc = opencc_open("s2t.json");
  const char* input = "汉字";
  char* converted = opencc_convert_utf8(opencc, input, strlen(input));  // 漢字
  opencc_convert_utf8_free(converted);
  opencc_close(opencc);
  return 0;
}

Document 文檔: https://byvoid.github.io/OpenCC/

Command Line

  • opencc --help
  • opencc_dict --help
  • opencc_phrase_extract --help

Others (Unofficial)

Configurations 配置文件

Default configuration files

  • s2t.json Simplified Chinese to Traditional Chinese
  • t2s.json Traditional Chinese to Simplified Chinese
  • s2tw.json Simplified Chinese to Traditional Chinese (Taiwan Standard)
  • tw2s.json Traditional Chinese (Taiwan Standard) to Simplified Chinese
  • s2hk.json Simplified Chinese to Traditional Chinese (Hong Kong variant)
  • hk2s.json Traditional Chinese (Hong Kong variant) to Simplified Chinese
  • s2twp.json Simplified Chinese to Traditional Chinese (Taiwan Standard) with Taiwanese idioms
  • tw2sp.json Traditional Chinese (Taiwan Standard) to Simplified Chinese with Mainland Chinese idioms
  • t2tw.json Traditional Chinese (OpenCC Standard) to Taiwan Standard
  • hk2t.json Traditional Chinese (Hong Kong variant) to Traditional Chinese (OpenCC Standard)
  • t2hk.json Traditional Chinese (OpenCC Standard) to Hong Kong variant
  • t2jp.json Traditional Chinese Characters (OpenCC Standard, Kyūjitai) to New Japanese Kanji (Shinjitai)
  • jp2t.json New Japanese Kanji (Shinjitai) to Traditional Chinese Characters (OpenCC Standard, Kyūjitai)
  • tw2t.json Traditional Chinese (Taiwan Standard) to Traditional Chinese (OpenCC Standard)
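
The preset names above follow a `<source>2<target>` pattern, with a trailing `p` on the two presets that also convert regional idioms. As a small sketch of that naming scheme, the helper below builds a file name and checks it against the listed presets. The function `preset_config` and its parameters are hypothetical and not part of OpenCC.

```python
# Base names of the preset configurations listed in this section.
PRESETS = frozenset({
    "s2t", "t2s", "s2tw", "tw2s", "s2hk", "hk2s", "s2twp", "tw2sp",
    "t2tw", "hk2t", "t2hk", "t2jp", "jp2t", "tw2t",
})


def preset_config(source, target, idioms=False):
    """Return the preset file name for a source/target pair, e.g. 's2twp.json'.

    source and target are the short codes used in the file names:
    's', 't', 'tw', 'hk' or 'jp'. Raises ValueError if no such preset is listed.
    """
    base = f"{source}2{target}{'p' if idioms else ''}"
    if base not in PRESETS:
        raise ValueError(f"no preset configuration named {base}.json")
    return base + ".json"


print(preset_config("s", "tw", idioms=True))  # s2twp.json
```

Pairs that are not listed, such as Hong Kong variant directly to Taiwan Standard, raise an error rather than guessing a file name.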

Build 編譯

Build with CMake

Linux & Mac OS X

g++ 4.6+ or clang 3.2+ is required.

make

Windows Visual Studio:

build.cmd

Test 測試

Linux & Mac OS X

make test

Windows Visual Studio:

test.cmd

Benchmark 基準測試

make benchmark

Example results (from GitHub CI):

1: ------------------------------------------------------------------
1: Benchmark                        Time             CPU   Iterations
1: ------------------------------------------------------------------
1: BM_Initialization/hk2s        1.56 ms         1.56 ms          442
1: BM_Initialization/hk2t       0.144 ms        0.144 ms         4878
1: BM_Initialization/jp2t       0.260 ms        0.260 ms         2604
1: BM_Initialization/s2hk        23.8 ms         23.8 ms           29
1: BM_Initialization/s2t         25.6 ms         25.6 ms           28
1: BM_Initialization/s2tw        24.0 ms         23.9 ms           30
1: BM_Initialization/s2twp       24.6 ms         24.6 ms           28
1: BM_Initialization/t2hk       0.052 ms        0.052 ms        12897
1: BM_Initialization/t2jp       0.141 ms        0.141 ms         5012
1: BM_Initialization/t2s         1.30 ms         1.30 ms          540
1: BM_Initialization/tw2s        1.39 ms         1.39 ms          529
1: BM_Initialization/tw2sp       1.69 ms         1.69 ms          426
1: BM_Initialization/tw2t       0.089 ms        0.089 ms         7707
1: BM_Convert2M                   582 ms          582 ms            1
1: BM_Convert/100                1.07 ms         1.07 ms          636
1: BM_Convert/1000               11.0 ms         11.0 ms           67
1: BM_Convert/10000               113 ms          113 ms            6
1: BM_Convert/100000             1176 ms         1176 ms            1
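
One way to read the `BM_Convert` rows is to divide each reported time by its size argument. The sketch below does that arithmetic with the figures copied from the table above. The source does not say what the size argument counts, so "unit" here is an assumption, and the helper name is hypothetical.

```python
# (size argument, reported time in ms), copied from the BM_Convert rows above.
BM_CONVERT = [(100, 1.07), (1000, 11.0), (10000, 113.0), (100000, 1176.0)]


def per_unit_microseconds(rows):
    """Return (size, microseconds per unit) for each benchmark row."""
    return [(size, ms * 1000.0 / size) for size, ms in rows]


for size, micros in per_unit_microseconds(BM_CONVERT):
    print(f"BM_Convert/{size}: {micros:.2f} us per unit")
```

The per-unit cost stays between about 10.7 and 11.8 microseconds across three orders of magnitude, which suggests conversion time in this run grows roughly linearly with input size.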

Projects using OpenCC 使用 OpenCC 的項目

License 許可協議

Apache License 2.0

Third Party Library 第三方庫

All these libraries are statically linked by default.

Change History 版本歷史

Links 相關鏈接

Contributors 貢獻者

Please feel free to update this list if you have contributed to OpenCC.

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

  • OpenCC-1.1.6-cp311-cp311-manylinux1_x86_64.whl (778.2 kB), CPython 3.11
  • OpenCC-1.1.6-cp311-cp311-macosx_10_9_x86_64.whl (739.5 kB), CPython 3.11, macOS 10.9+ x86-64
  • OpenCC-1.1.6-cp310-cp310-manylinux1_x86_64.whl (778.3 kB), CPython 3.10
  • OpenCC-1.1.6-cp310-cp310-macosx_10_9_x86_64.whl (739.5 kB), CPython 3.10, macOS 10.9+ x86-64
  • OpenCC-1.1.6-cp39-cp39-manylinux1_x86_64.whl (778.3 kB), CPython 3.9
  • OpenCC-1.1.6-cp39-cp39-macosx_10_9_x86_64.whl (739.6 kB), CPython 3.9, macOS 10.9+ x86-64
  • OpenCC-1.1.6-cp38-cp38-manylinux1_x86_64.whl (778.2 kB), CPython 3.8
  • OpenCC-1.1.6-cp38-cp38-macosx_10_9_x86_64.whl (739.5 kB), CPython 3.8, macOS 10.9+ x86-64
  • OpenCC-1.1.6-cp37-cp37m-manylinux1_x86_64.whl (778.7 kB), CPython 3.7m
  • OpenCC-1.1.6-cp37-cp37m-macosx_10_9_x86_64.whl (739.7 kB), CPython 3.7m, macOS 10.9+ x86-64

File details

Hashes for OpenCC-1.1.6-cp311-cp311-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 d0537a0068166e9bf391db4c85dbc9099866fd29ec0a8b3d2abb8cf10e8f50df
MD5 e5adac520fb0df61ae50491d09c14025
BLAKE2b-256 c0f604601bf9dc0b7849de597e7455b9788d34b0d251f3891bf1d7c9ff6ce37f

Hashes for OpenCC-1.1.6-cp311-cp311-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 f84fdf2a5abd97171edc477a884f248b5f78b1269020d5b82482ad34d7692fcc
MD5 e7baa1f3fda8bdd6f980518a4ace0c4f
BLAKE2b-256 cf1e02a6c79358f08c83f4272f9454b2c614e1dc0a67c6f1018496a926600d0d

Hashes for OpenCC-1.1.6-cp310-cp310-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 68fa39f8aa1c471f03969d7e7dec13d8f60d30168a895808a51ac4325f88eb95
MD5 82815e671e7776c9c4292a1153553e56
BLAKE2b-256 15d32e5c61f218e51a0f767b05e257f52e53b9ecafe8ef84541e8765a42c64f0

Hashes for OpenCC-1.1.6-cp310-cp310-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 ab82f6b908eedda8f998946ff8ef3632015c345923b035931be87fa0c038357c
MD5 847aed6b16d2c7c53c17c66e248bd25a
BLAKE2b-256 d7b63e388cb70c79cf01abff805f8f4073c8ee524676a1c61de6ebc66a06abec

Hashes for OpenCC-1.1.6-cp39-cp39-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 e1a51039319d4d420e6809accfb231e808533d38d526d27846c1979052face40
MD5 8decec2206d7b1500cafe3bc0748bc19
BLAKE2b-256 0f6c90180be085e27fe322fd26983cf8e947dd328fdd5d81e67b761cc695578a

Hashes for OpenCC-1.1.6-cp39-cp39-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 62b3ff13fbf7a92601f5df0052aa1cb948c27b5bb669b3b9ccecaad2d59f588a
MD5 32809c390202e403bb89ae6c8f915e65
BLAKE2b-256 007b2ec16e820442b492921260027a2d814b62bedb1f88a580a56191904071f1

Hashes for OpenCC-1.1.6-cp38-cp38-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 0e7fccb562051be44824809b985ba22dc54fe4ea165d75f1904e9e274b29f0a5
MD5 f86d0dd0db5d805a761ed05328080420
BLAKE2b-256 8628d53665fcd0956a9a4dafcc3164cea428956ca922869f5b90a1a8cb4baa1a

Hashes for OpenCC-1.1.6-cp38-cp38-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 9f5f6a779839b2f4d63102814c7c74b262ae9542fa9e05c686f493400926ddfa
MD5 f696286ea1b50640588a955b07210780
BLAKE2b-256 bd33a60feb772f0d61b09047eedf3027e5fe50b1e405df86c1dee8f8e6aadcd6

Hashes for OpenCC-1.1.6-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 60047946f94e304e36c95a3288cd78007dd5e26a73fdbf9d810697362dce8ab8
MD5 7397026d2f7e525847f908436d7fd4f5
BLAKE2b-256 4313d53733f996be712788ab5e081f6049ec4ce91685ed1eb64eb2a5543ba545

Hashes for OpenCC-1.1.6-cp37-cp37m-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 12e91c7ddb2a7e2c3e96a1c2b7378049f49209ea2bedaf9ea5deb56d9d11f186
MD5 774564f0862d22c3ddae931f604af515
BLAKE2b-256 c44813e47a69b0d15488f5c3c6035c2da49efa2a1e268b49f22233bee9b75413
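
A downloaded wheel can be checked against the SHA256 digests listed above. The following is a minimal sketch using only the Python standard library; the function names and the local file path in the usage note are assumptions for illustration.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, assuming the first wheel was saved to the current directory, `matches_sha256("OpenCC-1.1.6-cp311-cp311-manylinux1_x86_64.whl", "d0537a0068166e9bf391db4c85dbc9099866fd29ec0a8b3d2abb8cf10e8f50df")` should return `True` for an intact download.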
