2 releases

0.0.2 Nov 5, 2024
0.0.1 Oct 24, 2024

#425 in Text processing

Download history 113/week @ 2024-10-23 66/week @ 2024-10-30 99/week @ 2024-11-06 426/week @ 2024-11-13 361/week @ 2024-11-20 137/week @ 2024-11-27 153/week @ 2024-12-04 132/week @ 2024-12-11 187/week @ 2024-12-18 60/week @ 2024-12-25 105/week @ 2025-01-01 113/week @ 2025-01-08

479 downloads per month

MIT license

15MB
177K SLoC

common-words-all

Most common words sorted by ngram frequency.

Available in the following languages:

  • Chinese
  • English
  • French
  • German
  • Hebrew
  • Italian
  • Russian
  • Spanish

Available ngram sizes:

  • 1
  • 2
  • 3
  • 4
  • 5

Usage

Get top 10 english ngrams:

let top = get_top(Language::English, 10, NgramSize::One);

Examples

Simple

You can specify features of language (english) and ngram size (one)

cargo run --example simple --no-default-features -F english -F one --release

Data

Dataset version 20200217 from Google Books

License

MIT

© 2024, Eugene Hauptmann

No runtime deps