9 stable releases

1.5.0 May 28, 2024
1.4.0 Nov 16, 2023
1.3.2 Oct 3, 2023
1.3.0 Sep 23, 2023
1.0.0-beta1 Aug 5, 2022

#14 in Internationalization (i18n)

2,457,012 downloads per month
Used in 114 crates (23 directly)

Unicode-3.0

1MB
17K SLoC

icu_locid_transform

Canonicalization of locale identifiers based on CLDR data.

This module is published as its own crate (icu_locid_transform) and as part of the icu crate. See the latter for more details on the ICU4X project.

It currently supports locale canonicalization based upon the canonicalization algorithm from UTS #35: Unicode LDML 3. LocaleId Canonicalization, as well as the minimize and maximize likely subtags algorithms as described in UTS #35: Unicode LDML 3. Likely Subtags.
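As a rough illustration of what alias-based canonicalization involves, here is a standalone, std-only Rust sketch. It is not this crate's implementation: ToyLocale, ToyResult, LANGUAGE_ALIASES and canonicalize are hypothetical names invented for the sketch, and the alias table is a hand-picked handful of entries rather than CLDR data. It only replaces a couple of deprecated language codes, folds the hepburn-heploc variant pair into alalc97, and sorts variants.

```rust
/// Outcome of a transformation (hypothetical stand-in for the crate's `TransformResult`).
#[derive(Debug, PartialEq, Eq)]
enum ToyResult {
    Modified,
    Unmodified,
}

/// Minimal locale model for this sketch; not the `icu::locid::Locale` type.
#[derive(Debug, Clone, PartialEq, Eq)]
struct ToyLocale {
    language: String,
    script: Option<String>,
    region: Option<String>,
    variants: Vec<String>,
}

/// Hand-picked deprecated language codes and their replacements (not the full CLDR set).
const LANGUAGE_ALIASES: &[(&str, &str)] = &[("iw", "he"), ("in", "id")];

impl ToyLocale {
    /// Splits `lang[-Script][-REGION][-variant...]` by subtag shape.
    /// Assumes well-formed, correctly cased input; extensions are not handled.
    fn parse(tag: &str) -> ToyLocale {
        let mut parts = tag.split('-');
        let language = parts.next().unwrap_or("und").to_string();
        let mut script: Option<String> = None;
        let mut region: Option<String> = None;
        let mut variants: Vec<String> = Vec::new();
        for part in parts {
            let alpha = part.chars().all(|c| c.is_ascii_alphabetic());
            let digits = part.chars().all(|c| c.is_ascii_digit());
            // Script and region may only appear before any region or variant.
            let leading = region.is_none() && variants.is_empty();
            if part.len() == 4 && alpha && leading && script.is_none() {
                script = Some(part.to_string());
            } else if ((part.len() == 2 && alpha) || (part.len() == 3 && digits)) && leading {
                region = Some(part.to_string());
            } else {
                variants.push(part.to_string());
            }
        }
        ToyLocale { language, script, region, variants }
    }

    /// Serializes the subtags back into a hyphen-separated tag.
    fn to_tag(&self) -> String {
        let mut out = self.language.clone();
        for sub in self.script.iter().chain(self.region.iter()).chain(self.variants.iter()) {
            out.push('-');
            out.push_str(sub);
        }
        out
    }
}

/// Rewrites `locale` in place and reports whether anything changed.
fn canonicalize(locale: &mut ToyLocale) -> ToyResult {
    let before = locale.clone();

    // Replace deprecated language codes with their preferred form.
    for (old, new) in LANGUAGE_ALIASES.iter() {
        if locale.language.as_str() == *old {
            locale.language = new.to_string();
        }
    }

    // Fold the variant pair `hepburn` + `heploc` into the single variant `alalc97`.
    let has_both = ["hepburn", "heploc"]
        .iter()
        .all(|v| locale.variants.iter().any(|x| x.as_str() == *v));
    if has_both {
        locale.variants.retain(|v| v.as_str() != "hepburn" && v.as_str() != "heploc");
        locale.variants.push("alalc97".to_string());
    }

    // Canonical form lists variants in alphabetical order.
    locale.variants.sort();

    if *locale == before { ToyResult::Unmodified } else { ToyResult::Modified }
}
```

Under these assumptions, parsing "ja-Latn-fonipa-hepburn-heploc" with ToyLocale::parse and passing it to canonicalize gives "ja-Latn-alalc97-fonipa", the same result as the LocaleCanonicalizer example in the Examples section. The real crate drives these rewrites from CLDR alias data and covers many more cases.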

The maximize method potentially updates a passed-in locale in place, depending on the results of running the 'Add Likely Subtags' algorithm from UTS #35: Unicode LDML 3. Likely Subtags.

The minimize method potentially updates a passed-in locale in place, depending on the results of running the 'Remove Likely Subtags' algorithm from UTS #35: Unicode LDML 3. Likely Subtags.
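To make the two likely-subtags algorithms more concrete, here is a standalone, std-only Rust sketch of the general idea. It is not the crate's implementation: Tag, Outcome, LIKELY, lookup, maximize and minimize are hypothetical names for this sketch, the table holds only a few entries copied by hand, and the lookup order is a simplification that ignores "und" and other special cases in UTS #35.

```rust
/// Outcome of a transformation (hypothetical stand-in for the crate's `TransformResult`).
#[derive(Debug, PartialEq, Eq)]
enum Outcome {
    Modified,
    Unmodified,
}

/// Minimal language/script/region triple; not the `icu::locid::Locale` type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Tag {
    language: &'static str,
    script: Option<&'static str>,
    region: Option<&'static str>,
}

type Key = (&'static str, Option<&'static str>, Option<&'static str>);

/// Tiny hand-written excerpt of likely-subtags data: key -> (likely script, likely region).
const LIKELY: &[(Key, (&'static str, &'static str))] = &[
    (("zh", None, None), ("Hans", "CN")),
    (("zh", None, Some("TW")), ("Hant", "TW")),
    (("zh", Some("Hant"), None), ("Hant", "TW")),
    (("en", None, None), ("Latn", "US")),
    (("ja", None, None), ("Jpan", "JP")),
];

/// Finds the table entry whose key matches exactly, if any.
fn lookup(
    language: &'static str,
    script: Option<&'static str>,
    region: Option<&'static str>,
) -> Option<(&'static str, &'static str)> {
    LIKELY
        .iter()
        .find(|(key, _)| key.0 == language && key.1 == script && key.2 == region)
        .map(|(_, likely)| *likely)
}

/// 'Add Likely Subtags': fill in missing script/region from the best matching entry.
fn maximize(tag: &mut Tag) -> Outcome {
    // Try the most specific key first, then progressively drop fields.
    let candidates = [
        (tag.script, tag.region),
        (None, tag.region),
        (tag.script, None),
        (None, None),
    ];
    for &(script, region) in candidates.iter() {
        if let Some((likely_script, likely_region)) = lookup(tag.language, script, region) {
            let before = *tag;
            // Subtags already present on the input always win over the table.
            tag.script = tag.script.or(Some(likely_script));
            tag.region = tag.region.or(Some(likely_region));
            return if *tag == before { Outcome::Unmodified } else { Outcome::Modified };
        }
    }
    Outcome::Unmodified
}

/// 'Remove Likely Subtags': pick the shortest tag that maximizes back to the same result.
fn minimize(tag: &mut Tag) -> Outcome {
    let before = *tag;
    let mut max = *tag;
    maximize(&mut max);
    let trials = [
        Tag { language: max.language, script: None, region: None },
        Tag { language: max.language, script: None, region: max.region },
        Tag { language: max.language, script: max.script, region: None },
    ];
    // Fall back to the maximized form when no shorter tag round-trips.
    *tag = max;
    for trial in trials.iter() {
        let mut expanded = *trial;
        maximize(&mut expanded);
        if expanded == max {
            *tag = *trial;
            break;
        }
    }
    if *tag == before { Outcome::Unmodified } else { Outcome::Modified }
}
```

With this toy table, maximize turns zh-CN into zh-Hans-CN and leaves zh-Hant-TW unchanged, while minimize turns zh-Hans-CN into zh and leaves zh unchanged. Both functions mutate their argument and report whether it changed, mirroring the in-place style of the LocaleExpander methods.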

Examples

// Canonicalize a locale identifier in place.
use icu::locid::Locale;
use icu::locid_transform::{LocaleCanonicalizer, TransformResult};

let lc = LocaleCanonicalizer::new();

let mut locale: Locale = "ja-Latn-fonipa-hepburn-heploc"
    .parse()
    .expect("parse failed");
assert_eq!(lc.canonicalize(&mut locale), TransformResult::Modified);
assert_eq!(locale, "ja-Latn-alalc97-fonipa".parse::<Locale>().unwrap());

// Maximize locales in place ('Add Likely Subtags').
use icu::locid::locale;
use icu::locid_transform::{LocaleExpander, TransformResult};

let lc = LocaleExpander::new();

let mut locale = locale!("zh-CN");
assert_eq!(lc.maximize(&mut locale), TransformResult::Modified);
assert_eq!(locale, locale!("zh-Hans-CN"));

let mut locale = locale!("zh-Hant-TW");
assert_eq!(lc.maximize(&mut locale), TransformResult::Unmodified);
assert_eq!(locale, locale!("zh-Hant-TW"));

// Minimize locales in place ('Remove Likely Subtags').
use icu::locid::locale;
use icu::locid_transform::{LocaleExpander, TransformResult};

let lc = LocaleExpander::new();

let mut locale = locale!("zh-Hans-CN");
assert_eq!(lc.minimize(&mut locale), TransformResult::Modified);
assert_eq!(locale, locale!("zh"));

let mut locale = locale!("zh");
assert_eq!(lc.minimize(&mut locale), TransformResult::Unmodified);
assert_eq!(locale, locale!("zh"));

More Information

For more information on development, authorship, contributing, etc., please visit the ICU4X home page.

Dependencies

~0.5–1.2MB
~24K SLoC