#utf-8 #string #unicode #validation #simd #unicode-characters

no-std simdutf

Unicode validation and transcoding at billions of characters per second

17 releases

0.5.1 Sep 1, 2024
0.4.18 Aug 11, 2024
0.4.17 May 2, 2024
0.4.16 Jan 1, 2024
0.3.0 Jul 27, 2022

#387 in Text processing

Download history 8/week @ 2024-07-06 45/week @ 2024-07-13 100/week @ 2024-07-20 46/week @ 2024-07-27 6/week @ 2024-08-03 125/week @ 2024-08-10 10/week @ 2024-08-17 2/week @ 2024-08-24 239/week @ 2024-08-31 11/week @ 2024-09-07 54/week @ 2024-09-14 3/week @ 2024-09-21 141/week @ 2024-09-28 29/week @ 2024-10-05 29/week @ 2024-10-12

199 downloads per month

MIT license

1.5MB
30K SLoC

C++ 29K SLoC // 0.2% comments Rust 718 SLoC // 0.0% comments

simdutf

Latest Version Documentation License

Unicode validation and transcoding at billions of characters per second.

This crate is the Rust binding of simdutf.

Documentation: https://docs.rs/simdutf


lib.rs:

Unicode validation and transcoding at billions of characters per second.

This crate is the Rust binding of simdutf.

Compilation

This crate works out of the box as long as you have a C++11-compatible toolchain installed correctly.

simdutf links C++ standard library, which adds a dynamic linking dependency.

For more details, see simdutf documentation and cc documentation.

Here is an example for local benchmark:

export RUSTFLAGS='-C target-cpu=native'
export CXXFLAGS='-march=native'
cargo build --release

Dependencies