#kanji #hanzi #converter

kanji_hanzi_converter

Translate Japanese Kanji to Chinese Hanzi

2 unstable releases

0.2.0 Oct 8, 2023
0.1.0 Oct 7, 2023

#199 in Internationalization (i18n)

MIT license

85KB
160 lines

kanji_hanzi_converter

Translate Japanese Kanji to Chinese Hanzi (simplified and traditional)

The chinese character mapping table is attributed to:

http://lotus.kuee.kyoto-u.ac.jp/~chu/pubdb/LREC2012/kanji_mapping_table.txt

Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi:
Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
In Proceedings of the Eighth Conference on International Language Resources and Evaluation
(LREC2012), pp.2149-2152, Istanbul, Turkey, (2012.5).

Since the mapping contains questionable entries, there's an additional filtering based on a japanese kanji list (2310 entries).

TODO

Filter simplified chinese entries based on GB2312 list in repo.


lib.rs:

Converts between Japanese Kanji Characters, Traditional Chinese Characters and Simplified Chinese Characters.

Note that there's some abiguity between the converting

E.g. "壹" in traditional chinese can be "一" or "壹" in Japanese Kanji.

Another Example for ambiguity are those entries from the underlying dataset. The correct entry is typically line 5055, the line 3365 is strange. Line Japanese Traditional Simplified 3365 學 學 学 5055 学 學 学 5383 斈 學 学

For that reason, only japanese characters that are also in the kanji list (2310 characters) are considered.

Dependencies