#unicode-normalization #unicode #normalization #unicode-text #decomposition #unicode-characters #recomposition

unicode-normalization-alignments

This crate provides functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

1 unstable release

Uses old Rust 2015

0.1.12 Dec 30, 2019

#1084 in Text processing

Download history 49790/week @ 2024-11-17 49342/week @ 2024-11-24 55827/week @ 2024-12-01 53361/week @ 2024-12-08 52268/week @ 2024-12-15 23660/week @ 2024-12-22 28912/week @ 2024-12-29 53375/week @ 2025-01-05 57492/week @ 2025-01-12 50773/week @ 2025-01-19 59553/week @ 2025-01-26 61678/week @ 2025-02-02 66189/week @ 2025-02-09 65574/week @ 2025-02-16 79467/week @ 2025-02-23 71664/week @ 2025-03-02

287,384 downloads per month
Used in 148 crates (4 directly)

MIT/Apache

495KB
24K SLoC

unicode-normalization-alignments

Build Status Docs

This is a forked version of unicode-normalization wich provides alignment information during normalization.

Unicode character composition and decomposition utilities as described in Unicode Standard Annex #15.

This crate requires Rust 1.36+.

extern crate unicode_normalization_alignments;

use unicode_normalization_alignments::char::compose;
use unicode_normalization_alignments::UnicodeNormalization;

fn main() {
	assert_eq!(compose('A','\u{30a}'), Some('Å'));

	let s = "ÅΩ";
	let c = s.nfc().map(|(c, diff)| {
		match diff {
			0 => println!("Nothing changed here"),
			1 => println!("New character"),
			_ => println!("{} characters were removed", diff),
		}

		c
	}).collect::<String>();
	assert_eq!(c, "ÅΩ");
}

crates.io

You can use this package in your project by adding the following to your Cargo.toml:

[dependencies]
unicode-normalization-alignments = "0.1.12"

Dependencies