9 releases

Uses old Rust 2015

0.4.0 May 2, 2022
0.3.5 Feb 4, 2022
0.3.4 Dec 28, 2021
0.3.2 Aug 18, 2020
0.2.1 Nov 6, 2017

#951 in Text processing

Download history 42/week @ 2024-08-24 54/week @ 2024-08-31 26/week @ 2024-09-07 27/week @ 2024-09-14 89/week @ 2024-09-21 40/week @ 2024-09-28 25/week @ 2024-10-05 67/week @ 2024-10-12 162/week @ 2024-10-19 136/week @ 2024-10-26 38/week @ 2024-11-02 190/week @ 2024-11-09 84/week @ 2024-11-16 33/week @ 2024-11-23 34/week @ 2024-11-30 96/week @ 2024-12-07

262 downloads per month
Used in 4 crates

MIT license

38KB
583 lines

This crate provides fuzzy search/string matching using N-grams.

This implementation is character-based, rather than word based, matching solely based on string similarity.

Licensed under the MIT license.

Documentation

https://docs.rs/ngrammatic/latest/ngrammatic/

Installation

This crate is published on crates.io.

To use it, add this to your Cargo.toml:

[dependencies]
ngrammatic = "0.3.4"

Usage

To do fuzzy matching, build up your corpus of valid symbols like this:

use ngrammatic::{CorpusBuilder, Pad};

let mut corpus = CorpusBuilder::new()
    .arity(2)
    .pad_full(Pad::Auto)
    .finish();

// Build up the list of known words
corpus.add_text("pie");
corpus.add_text("animal");
corpus.add_text("tomato");
corpus.add_text("seven");
corpus.add_text("carbon");

// Now we can try an unknown/misspelled word, and find a similar match
// in the corpus
let word = String::from("tomacco");
if let Some(top_result) = corpus.search(word, 0.25).first() {
    if top_result.similarity > 0.99 {
        println!("{}", top_result.text);
    } else {
        println!("{} (did you mean {}? [{:.0}% match])",
                 word,
                 top_result.text,
                 top_result.similarity * 100.0);
    }
} else {
    println!("🗙 {}", word);
}

No runtime deps