6 releases (3 breaking)

0.7.0 Oct 16, 2023
0.6.1 Dec 3, 2022
0.6.0 Jun 15, 2022
0.5.1 Mar 1, 2022
0.4.0 Apr 28, 2021

#948 in Machine learning

Download history 13/week @ 2024-07-20 15/week @ 2024-07-27 2/week @ 2024-08-03 11/week @ 2024-08-10 64/week @ 2024-08-17 28/week @ 2024-08-24 5/week @ 2024-08-31 21/week @ 2024-09-07 19/week @ 2024-09-14 46/week @ 2024-09-21 55/week @ 2024-09-28 44/week @ 2024-10-05 42/week @ 2024-10-12 81/week @ 2024-10-19 89/week @ 2024-10-26 54/week @ 2024-11-02

269 downloads per month
Used in 2 crates

MIT/Apache

305KB
6K SLoC

Preprocessing

The Big Picture

linfa-preprocessing is a crate in the linfa ecosystem, an effort to create a toolkit for classical Machine Learning implemented in pure Rust, akin to Python's scikit-learn.

Current state

linfa-preprocessing provides a pure Rust implementation of:

  • Standard scaling
  • Min-max scaling
  • Max Abs Scaling
  • Normalization
  • Count vectorization
  • TfIdf vectorization
  • Whitening

Examples

There are various usage examples in the examples/ directory. To run, use:

$ cargo run --release --example count_vectorization
$ cargo run --release --example tfidf_vectorization
$ cargo run --release --example scaling
$ cargo run --release --example whitening

BLAS/Lapack backend

See this section to enable an external BLAS/LAPACK backend.

License

Dual-licensed to be compatible with the Rust project.

Licensed under the Apache License, Version 2.0 http://www.apache.org/licenses/LICENSE-2.0 or the MIT license http://opensource.org/licenses/MIT, at your option. This file may not be copied, modified, or distributed except according to those terms.

Dependencies

~10–21MB
~349K SLoC