11 releases (breaking)

0.9.0 Mar 3, 2019
0.7.0 Feb 7, 2018
0.6.0 Sep 22, 2017
0.4.0 Jun 23, 2017

#431 in Internationalization (i18n)

Download history: between 208 and 486 downloads per week from 2024-07-28 to 2024-11-10.

1,133 downloads per month
Used in 23 crates (10 directly)

MIT/Apache

660KB
5.5K SLoC

UNIC — Unicode Character Database


This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.

UCD is a UNIC super-crate composed of smaller crates, each providing data for a specific area. Dependent crates can therefore pull in only the data they need instead of importing all UCD data.

Crates

Here's a list of components (available or planned) for this super-crate:

  • version: The Unicode Version of UCD data.

  • common: Common properties, such as Alphabetic, White_Space, Control, and Numeric.

  • age: Age property.

  • bidi: Bidirectional properties. (Hebrew, Arabic, ...)

  • block: Block properties.

  • case: Letter Case properties.

  • category: General_Category property.

  • hangul: Hangul Syllable Composition & Decomposition.

  • ident: Identifier properties.

  • name: Name property.

  • normal: Normalization properties.

  • segment: Segmentation properties.

  • ea-width: East Asian Width properties.

  • joining: Cursive joining properties. (Arabic, Syriac, ...)

  • numeric: Other character numeric properties.

  • script: Script properties.

See http://unicode.org/reports/tr44/#Property_List_Table for the complete list of properties defined in UCD. Eventually, all of these properties will be available under unic-ucd.
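
To give a feel for the kind of data and algorithms these components cover, here is a minimal sketch of the arithmetic behind Hangul syllable composition and decomposition, as specified in the Unicode Standard. It uses only the Rust standard library. The function names `decompose_syllable` and `compose_syllable` are hypothetical and chosen for illustration; they are not the API of the hangul component.

```rust
// Constants from the Unicode Standard's Hangul syllable algorithm.
const S_BASE: u32 = 0xAC00; // first precomposed syllable
const L_BASE: u32 = 0x1100; // first leading consonant
const V_BASE: u32 = 0x1161; // first vowel
const T_BASE: u32 = 0x11A7; // one before the first trailing consonant
const L_COUNT: u32 = 19;
const V_COUNT: u32 = 21;
const T_COUNT: u32 = 28;
const N_COUNT: u32 = V_COUNT * T_COUNT; // 588
const S_COUNT: u32 = L_COUNT * N_COUNT; // 11172

/// Hypothetical helper: split a precomposed syllable into (leading, vowel, trailing).
/// Returns None if `s` is not a precomposed Hangul syllable.
fn decompose_syllable(s: char) -> Option<(char, char, Option<char>)> {
    let s_index = (s as u32).checked_sub(S_BASE)?;
    if s_index >= S_COUNT {
        return None;
    }
    let l = char::from_u32(L_BASE + s_index / N_COUNT)?;
    let v = char::from_u32(V_BASE + (s_index % N_COUNT) / T_COUNT)?;
    let t_index = s_index % T_COUNT;
    // A trailing index of 0 means the syllable has no trailing consonant.
    let t = if t_index == 0 {
        None
    } else {
        char::from_u32(T_BASE + t_index)
    };
    Some((l, v, t))
}

/// Hypothetical helper: the inverse of `decompose_syllable`.
/// Returns None if any argument is outside its conjoining jamo range.
fn compose_syllable(l: char, v: char, t: Option<char>) -> Option<char> {
    let l_index = (l as u32).checked_sub(L_BASE).filter(|&i| i < L_COUNT)?;
    let v_index = (v as u32).checked_sub(V_BASE).filter(|&i| i < V_COUNT)?;
    let t_index = match t {
        None => 0,
        Some(t) => (t as u32)
            .checked_sub(T_BASE)
            .filter(|&i| i > 0 && i < T_COUNT)?,
    };
    char::from_u32(S_BASE + (l_index * V_COUNT + v_index) * T_COUNT + t_index)
}

fn main() {
    // U+D55C decomposes into U+1112, U+1161, U+11AB.
    let parts = decompose_syllable('\u{D55C}');
    assert_eq!(parts, Some(('\u{1112}', '\u{1161}', Some('\u{11AB}'))));

    // Composing the same jamo yields the original syllable.
    let syllable = compose_syllable('\u{1112}', '\u{1161}', Some('\u{11AB}'));
    assert_eq!(syllable, Some('\u{D55C}'));
}
```

Because Hangul syllables are laid out arithmetically, this part of the standard needs no lookup tables. Most other properties in the list above depend on generated data tables instead.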

Dependencies