-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
similar
A diff library for Rust
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
const_format
Compile-time string formatting
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
pulldown-cmark
A pull parser for CommonMark
-
ropey
A fast and robust text rope for Rust
-
lazy-regex
lazy static regular expressions checked at compile time
-
unicode-bidi
Unicode Bidirectional Algorithm
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
termimad
Markdown Renderer for the Terminal
-
convert_case
Convert strings into any case
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
html2text
Render HTML as plain text
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
lopdf
PDF document manipulation
-
diff
An LCS based slice and string diffing implementation
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
mdbook
Creates a book from markdown files
-
prettydiff
Side-by-side diff for two files
-
fuzzy-matcher
Fuzzy Matching Library
-
stringprep
algorithm
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
roff
ROFF (man page format) generation library
-
unicase
A case-insensitive wrapper around strings
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
chardetng
A character encoding detector for legacy Web content
-
hyperlink
Very fast link checker for CI
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
slug
Convert a unicode string to a slug
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
const-str
compile-time string operations
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
charabia
detect the language, tokenize the text and normalize the tokens
-
tailspin
A log file highlighter
-
google-dialogflow3
A complete library to interact with Dialogflow (protocol v3)
-
regex-syntax
A regular expression parser
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
text_io
really simple to use panicking input functions
-
synoptic
low-level, syntax highlighting library with unicode support
-
regress
A regular expression engine targeting EcmaScript syntax
-
stfu8
Sorta Text Format in UTF-8
-
text-size
Newtypes for text offsets
-
diffy
Tools for finding and manipulating differences between files
-
lindera
A morphological analysis library
-
matchers
Regex matching on character and byte streams
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
str_indices
Count and convert between indexing schemes on string slices
-
printpdf
writing PDF files
-
epub-builder
generating EPUB files
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
minimizer
Minimize files to find minimal test case
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
unindent
Remove a column of leading whitespace from a string
-
entities
raw data needed to convert to and from HTML entities
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
shell-escape
Escape characters that may have a special meaning in a shell
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
ascii
ASCII-only equivalents to
char
,str
andString
-
mdcat
cat for markdown: Show markdown documents in terminals
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
sd
An intuitive find & replace CLI
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
hck
A sharp cut(1) clone
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
lindera-cli
A morphological analysis command line interface
-
pad
padding strings at runtime
-
aho-corasick-unsafe
Fast multiple substring searching
-
prefix
A customizable pretty printer for FIX messages
-
shell-toy
A "fortune | cowsay" implementation in Rust, i.e. a nice little toy to liven up your shell.
-
rustyline-async
A minimal readline with multiline and async support
-
font-types
Scalar types used in fonts
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
scraps
A static site generator that builds a wiki from a set of markdown files
-
llmvm-core
The core application for llmvm
-
languagetool-rust
LanguageTool API bindings in Rust
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
pdf-extract
extract content from pdfs
-
vesti
A preprocessor that compiles into LaTeX
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
console_static_text
Logging for text that should stay in the same place in a console
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
chewing
(酷音) intelligent Zhuyin input method
-
supply-chain-trust-example-crate-000022
Fast multiple substring searching
-
clipboard-rs
Cross-platform clipboard API (text | image | rich text | html | files | monitoring changes) | 跨平台剪贴板 API(文本|图片|富文本|html|文件|监听变化) Windows,MacOS,Linux
-
cow-utils
Copy-on-write string utilities for Rust
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
near-facsimile
Find similar or identical text files in a directory
-
ncount
A word count tool intended to derive useful stats from markdown
-
difflib
Port of Python's difflib library to Rust
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
vaporetto
pointwise prediction based tokenizer
-
any_ascii
Unicode to ASCII transliteration
-
wildcard
matching
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
mdbook-quiz
Interactive quizzes for your mdBook
-
skyspell
Fast and handy spell checker for the command line
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
fm
Non-backtracking fuzzy text matcher
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
unicode-ccc
Unicode Canonical Combining Class detection
-
rapidfuzz
rapid fuzzy string matching library
-
molybdenum
Recursive search and replace CLI application
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
hypher
separates words into syllables
-
regex-cursor
regex fork that can search discontiguous haystacks
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
vidyut-prakriya
A Sanskrit word generator
-
collclean
Clean up collaboration commands in LaTeX files
-
unicode-reverse
Unicode-aware in-place string reversal
-
stylin
Convert markdown to pandoc markdown with custom styles
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
vi
An input method library for vietnamese IME
-
tantivy-analysis-contrib
A set of analysis components for Tantivy
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
trans-epub
Translate EPUB with CLI
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
tibco_ems
A high level API for the Tibco EMS
-
mupdf
Safe Rust wrapper to MuPDF
-
mdbook-epub
An EPUB renderer for mdbook
-
uwc
Counts things in unicode text files
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
quixote
Quizzes and tests in Markdown
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
nvl-cli
A program to download webnovels
-
htmd-cli
The command line tool for htmd
-
prop-check-rs
A Property-based testing Library in Rust
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
norad
Read and write Unified Font Object files
-
sk-skimmer
Fuzzy Finder in rust!
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
focaccia
no_std implementation of Unicode case folding comparisons
-
mktoc
Generate Table of Contents from Markdown files
-
qpdf
Rust bindings to QPDF C++ library
-
yffi
Bindings for the Yrs native C foreign function interface
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
igrepper
The interactive grepper
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
autotex
Continuously compile TeX and LaTeX
-
regex-literal
delimited regular expression literals
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
textra
type less, write more
-
tantivy-jieba
that bridges between tantivy and jieba-rs
-
unicode_categories
Query Unicode category membership for chars
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
mdbook-pandoc
A mdbook backend that outsources most of the rendering process to pandoc
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
bbd
Binary Braille Dump
-
pray
A tui tool for preparing a prompt to the llms
-
tbll
tbll outputs data in tabular format
-
percent-encoding
Percent encoding and decoding
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
frawk
an efficient Awk-like language
-
frida-build
Rust bindings for Frida
-
nu_plugin_emoji
a nushell plugin called emoji
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
basic-text
Basic Text strings and I/O streams
-
termfactor
is your terminal sick, mid or sus? test your terminal features & find out
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
jayce
tokenizer 🌌
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
sqdj
sqdj shortens delimited data
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
emojic
Emoji constants
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
yara-x
A pure Rust implementation of YARA
-
fontkit
font loading and indexing
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
rustyink
Blazing fast static site generator
-
retryable-result
repeatedly try on some recoverable errors until too many or a fatal error
-
lexical-sort
Sort Unicode strings lexically
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
serpl
terminal UI for search and replace, ala VS Code
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
arrow-row
Arrow row format
-
see-cat
A cute cat(1)
-
rutie
The tie between Ruby and Rust
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, German, Italian or French into their digit representation
-
fasttext
binding
-
supply-chain-trust-example-crate-000021
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
minix
straightforward minifier for JavaScript and CSS files, developed using Rust. This tool efficiently reduces the size of JS and CSS files by removing unnecessary characters, whitespace…
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
sile
Simon’s Improved Layout Engine
-
hh-highlight
hh: The text highlighter. Read from stdin, and highlight selected words printed to stdout
-
fish-printf
printf implementation, based on musl
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
unicode-width-16
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ
,ŋ
andê
. -
gst-plugin-textahead
GStreamer Plugin for displaying upcoming text buffers ahead of time
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
egui-dropdown
An actual dropdown list for egui
-
mini_markdown
Small markdown renderer
-
libreofficekit
in Rust
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
terminal-clipboard
a minimal cross-platform clipboard
-
dcsv
Dyanmic csv reader,writer,editor
-
uclanr
A random word picker that gives you actually useful words
-
typed-oid
Typed Object IDs
-
as3_parser
ActionScript 3 parser
-
array_tool
Helper methods for processing collections
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
oxford_join
Join string slices with Oxford Commas!
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
filenamify
Convert a string to a valid filename
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
words-count
Count the words and characters, with or without whitespaces
-
ferris-says
flavored replacement for the classic cowsay
-
deliminator
Universal code documentation generator
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
detone
Decompose Vietnamese tone marks
-
arf-strings
Encoding and decoding for ARF strings
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactor
sub-command. Compile BibTeX files to legacythebibliography
TeX code using thecompile
sub-command… -
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
trpl
A support crate for The Rust Programming Language book
-
shwild
(for Rust)
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
wit-bindgen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
advreader
Split text file into text sequences, strings and (line) comments
-
termdiff
Write a diff with color codes to a string
-
mini-openai
An OpenAI API client with minimal dependencies
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
dialogi
A dialog parser
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
mdbook-linkcheck2
A backend for
mdbook
which will check your links for you -
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
numeric-sort
A zero-allocation, human-readable sorting library
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
abbreviation_extractor
extracting abbreviations from text
-
asciigraph
Ascii Graph
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
mkwebsite
build websites using markdown
-
dptran
run DeepL translations on command line written by Rust
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
mdbook-angular
mdbook renderer to run angular code samples
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
kbremap
Custom keyboard layouts for windows
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
kathoey
text feminization using open corpus linguistics data
-
unidown
Convert Markdown to Unicode
-
svgbob
Transform your ascii diagrams into happy little SVG
-
ib-pinyin
一个高性能拼音匹配库
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
minspan
a package for determining the minimum span of one vector within another
-
avt
asciinema virtual terminal
-
text-to-ascii-art
program to convert text to ASCII art
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
obmrs
As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
iepub
epub、mobi电子书读写
-
dprint-plugin-markdown
Markdown formatter for dprint
-
iirs
Inverted repeats finder
-
mdopen
Preview markdown files in a browser
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
colonnade
format tabular data for display
-
pager
pipe your output through an external pager
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
extract_anchors
Утилита для извлечения из исходных кодов всех помеченных отрывков
-
picodiff
Tiny GUI app to compare text easily
-
wordcut-engine
Word segmentation/breaking library
-
text-editing
string with utilities for editing
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that doesn't support Arabic script
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
linebreak
breaking a given text into lines within a specified width
-
regex-charclass
Manipulate and convert regex character classes
-
secular
No Diacr!
-
whitespace-sifter
Sift duplicate whitespaces away!
-
zspell
Native Rust library for spellchecking
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
lorem-ipsum
Quickly generate placeholder test
-
yamd
Markdown Document (flavour)
-
aki-mcolor
mark up text with color
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
dmos
Djot HTML renderer with advanced features
-
gosyn
Golang syntax parser in Rust
-
substring
method for string types
-
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
dmntk-gendoc
DMNTK | Documentation generator
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
gh_page_tool
A github gh-pages tool for static blog site
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
diary-cli
A powerful cli for documenting and keeping a diary
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
svgc
svgc
(previously SvgCompress) is a tool for compressing SVG files by removing unnecessary whitespace, comments, metadata and some other data. It also supports optimization with SVGO and compression into SVGZ. -
just-enough-emojis
text to emoji cli
-
ColorShell
A small crate for coloring text for rust
-
mandown
Markdown to groff (man page) converter
-
merge_pdf
Merge PDF files in a directory
-
supply-chain-trust-example-crate-000098
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
nu-utils
Nushell utility functions
-
indent
Functions for indenting multiline strings
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
pprint
Flexible and lightweight pretty printing library for Rust
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
kelp
A convert tool for Japanese
-
utilprint
easily using colors or unicode in your terminal printing
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
sourceannot
render snippets of source code with annotations
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
dnd-character
A Dungeons and Dragons character generator
-
jawk
JSON AWK
-
inlet_manifold
A general purpose highlighting library
-
betacode
conversion
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
smol-symbol
compile-time globally unique, 25-character, 128-bit symbols similar to the Symbol type in Crystal Lang
-
hangeul
Korean alphabet manipulation library
-
presenterm
A terminal slideshow presentation tool
-
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
common-words-all
Most common words sorted by ngram frequency
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
typo_checker
Searches for typos of English words and returns matches or similar words.(英単語のタイポを検索し、合致もしくは類似する単語を返却します)
-
lookbook
Component preview framework for Dioxus
-
dbxcase
Dropbox-compatible case-folding algorithm
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
asciidork-eval
Asciidork eval
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
bump-bin
Increments version with semver specification
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
ragtime
Easy Retrieval Augmented Generation
-
choco
markup language for dialogue systems
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
stylish-ansi
stylish
helpers for writing styles as ANSI escape codes -
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
mdbook-typst-pdf
mdbook typst pdf backend
-
tree-sitter-stack-graphs
Create stack graphs using tree-sitter parsers
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
armnod
random string generator
-
nu_plugin_regex
nu plugin to search text with regex
-
rust-beam
A LaTeX slide generator you can write in faster than beamer
-
wikidump
parsing Mediawiki XML dumps
-
orly
Download O'Reilly books as EPUB
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
wikipedia_prosesize
Count Wikipedia prose size
-
semsimian
Sematic similarity calculations for ontologies implemented in Rust
-
bogrep
Full-text search for bookmarks from multiple browsers
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
soundex-rs
that calculates soundex value
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
patiencediff
algorithm
-
mtc-token-healing
Token healing implementation
-
chocodye
changing the color of the chocobos' plumage in Final Fantasy XIV
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
quranize
Encoding transliterations into Quran forms
-
dodo-zh
operate on pinyin and cedict.u8 file
-
unicount
Alphabetic counter supporting unicode
-
mdi
markdown include
-
char-ranges
Iterate chars and their start and end byte positions
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
ultra-nlp
A NLP library
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
natural
Pure rust library for natural language processing
-
bin-rs
Binary Reader from multi source
-
zipcodes
Query US zipcodes without SQLite
-
pandoc_types
Rust port of pandoc-types
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
fetch-catnip
fetch displaying system information and a cute cat
-
datadriven
Rewritable table-driven testing
-
prolangkit
A set of language-agnostic utilities for langdev
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
harper-core
The language checker for developers
-
cloc
Count, or compute differences of, lines of source code and comments
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
indent_write
Write adapters to add line indentation
-
lowcharts
draw low-resolution graphs in terminal
-
typst-ansi-hl
highlights your Typst code using ANSI escape sequences
-
json_to_table
pretty print JSON as a table
-
ssbm_utils
A collection of useful tools for working with data from Super Smash Bros Melee
-
gspell
Rust bindings for gspell
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
human_regex
A regex library for humans
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
stop-words
Common stop words in many languages
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
data-streams
Extension traits for reading and writing data with streams
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
simple-ssg
Plain and simple static site generator for Djot and Markdown light markup languages
-
asciimath-unicode
Convert asciimath to unicode
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
grammalecte_client
Grammalecte HTTP client
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
cargo-context-ranger
Quickly grab your rust context for building LLM prompts to help
-
tfon
Bitmap font parsing / conversion
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
abbreviator
abbreviating long words
-
paltoquet
rule-based general-purpose tokenizers
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
bce
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
smeagol-wiki
A personal wiki webserver. Work in progress.
-
message_segment_calculator
package to calculate SMS message segments
-
rk-utils
A collection of utility functions and data structures for rust
-
slicestring
slicing Strings
-
chord3
Create pdf songbooks from chopro source
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
dekor
styling and character repository in Rust
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
rustblocks
tetromino based falling blocks game
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
syllabize-es
Syllabize Spanish text, and much more
-
tracery
Text-expansion library
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
uwl
A management stream for bytes and characters
-
loc
Count lines of code (cloc) fast
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantra
framework -
mdbook-keeper
An improved testing experience for mdbook
-
readability
Port of arc90's readability project to rust
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
mdsh
Markdown shell pre-processor
-
metatron
core library
-
mitex-parser
Parser for MiTeX
-
utf16_lit
macro_rules to make utf-16 literals
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
portmanteau
create portmanteaux
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
hydroper_source_text
Source text containing line locations
-
lorem-rustum
lib for generating lorem-ipsum with a rusty fleur
-
simple_peg
A command-line peg parser implemented by Rust
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
mdbook_rash
Binary to create doc from
rash
code -
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
url_encor
A lightweight library to encode and decode special characters in urls
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
streampager
pager for command output or large files
-
escrit
learning languages by reading texts
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
zh_num
Convert ASCII numbers and zh words
-
yozuk
Chatbot for Programmers
-
sbert
Sentence Bert (SBert)
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
tre-regex
Rust safe bindings to the TRE regex module
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
tower_allowed_hosts
tower allowed hosts layer
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
angr
analyse ngrams in text files
-
kataru
YAML-based dialogue engine for game development
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
strip-tags
Strip HTML and PHP tags from strings
-
pukram2html
converting Pukram-formatted text to HTML
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
brzozowski
Work with regular expressions using Brzozowski derivatives. "
-
easy_random
Generate random data easily with easy_random :)
-
rs-line-count
counts number of lines(' ')
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
eternaltwin_constants
Hammerfest store implementation
-
slugify
Macro for flexible slug generation
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
ik-rs
chinese segment, ik-analyzer for rust
-
srake
RAKE Algoritm
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
fluxcap
time expression parsing inspired on duckling
-
mathemascii
AsciiMath parsing and rendering
-
reason-shell
Reason: A Shell for Research Papers
-
malachi
A domain specific pattern matching language made for defining bot commands
-
tu
CLI tool to convert a natural language date/time string to UTC
-
lindera-dictionary
A morphological analysis library
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
htmd
A turndown.js inspired HTML to Markdown converter
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
cindex
CSV indexing library
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
latinrs
Rust package that transliterates text and characters into thier latin form
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
verify_string
verifying strings
-
term_grid
formatting strings into a grid layout
-
gregex
Regex solver utilizing NFA
-
kanbanrs
A basic kanban task manager written with egui
-
cogni
Unix native interface for LLMs
-
sixbit
Small packed strings
-
r-matrix
Rust port of cmatrix
-
google-fonts
Download and cache TTF fonts from Google
-
image-to-ascii
Converts images and gifs to ascii art
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
hebrew_unicode_script
A lightweight library to check if a hebrew character belongs to certain collections
-
aki-gsub
substitude text command, replace via regex
-
screen_printer
displaying rectangular blocks of text to a terminal
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
bigstr
A command-line tool to make string BIG
-
tremor-kv
A logstash inspured key value extractor
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
tnipv-lint
lints for tnipv, the Telcoin Network Improvement Proposal validator
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
whichlicense_detection
detect licenses used by the WhichLicense project
-
supply-chain-trust-example-crate-000066
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
lisho
personal link shortener with no external dependencies in under 200 lines of Rust
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
formatjson
Formats JSON files
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
quake_text
Utils for Quake strings and characters
-
text_lines
Information about lines of text in a string
-
docket
markdown to HTML documentation rendering
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
linkcheck2
extracting and validating links
-
isbn
handling ISBNs
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
text_trees
textual output for tree-like structures
-
rofi-mode
High-level library for creating Rofi plugins and custom modes
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
blitztext
fast keyword extraction and replacement in strings
-
flowistry_ide
Information Flow in the IDE for Rust
-
owoify_rs
port of mohan-cao's owoify-js is released under MIT license, which is also the same license of owoify-js. This simple library will turn any string into nonsensical babyspeak similar…
-
wantora
wantora工具
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
man
Generate structured man pages
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
utf8-io
Traits and types for UTF-8 I/O
-
mdbook-check-missing-md
A backend for
mdbook
which will find Markdowns you forgot on SUMMARY.md -
libcoerced
serde_json and generic text content matching, verification and coercion
-
thesaurus
An offline thesaurus library for Rust
-
neuralassimilator
fine-tuning Language Learning Models (LLMs) from unstructured text
-
text-parsing
Hierarchical text processing preserving char position info
-
encoding-next
Character encoding support for Rust
-
mdbook-tools
A collection of tools for mdbook
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
antex
Styled text and tree in terminal
-
bubble-bath
Small and quick HTML sanitizer
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
my-notes
note taking app for taking hierarchical notes in markdown
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
easy-regex
Make long regular expressions like pseudocodes
-
rulet
figlet implementation
-
tgrep
Toy grep that honors .gitignore
-
markov-text
creating a small markov model for text generation
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
argentum_user_business
Business layer of user component
-
tzgrep
grep tar.gz
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
text-tokenizer
Custom text tokenizer
-
anslatortray
translate from English to Pig Latin!
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
tazui
terminal calculator
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
regex_generate
Use regular expressions to generate text
-
hmd
Custom Markdown Engine for my personal blog
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
rsrpp
project for research paper pdf
-
somedoc
A very simple document model and markup generator
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
pullup
Convert between markup formats
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
supply-chain-trust-example-crate-000037
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
mdrss
generating RSS feeds from markdown files
-
tergo-formatter
Formatter for tergo
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
minify-html-common
Common code and data for minify-html*
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
aho-corasick
Fast multiple substring searching
-
markov
A generic markov chain implementation in Rust
-
codespan_preprocessed
Beautiful diagnostic reporting for M4 (or cpp) preprocessed text files
-
slow-scan-print
Text is printed at fixed intervals by character or by line, and its name is inspired by SSTV (Slow Scan TV)
-
character-set
High performance
set.contains(char)
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
charx
A replacement for char::is_ascii*
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
marker
finding issues in CommonMark documents
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
worcher
full-text search for static websites
-
case
A set of letter case string helpers
-
chunkr
A fast and quick chunking library for rust
-
roe
Unicode case conversion
-
rpgtools
Collection of utilities to help run tabletop role-playing games
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
wcount
CLI word counting tool
-
to-binary
Converts From Hex, UTF-8, Vector of Bytes, or a Byte Slice to a Binary String And Implements Many Useful Functions For Binary Strings
-
uwu-rs
uwuifying library
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
emoji_converter
Converts text to emojis
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
utils_rust
这是一个用于各种实用功能的 Rust 库
-
instant-segment
Fast English word segmentation
-
elden-ring-saver
ansi2
-
gender_guesser
that guesses the gender of a given name
-
rsnltk
Rust-based Natural Language Toolkit
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
Person_Parser
Rust parser
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
flashtext2
The FlashText algorithm implemented in Rust
-
tagsearch
Filter plaintext files based on @keyword tags
-
textspan
Text span utility
-
visa
(Virtual Instrument Software Architecture) Wrapper which allows concurrent and dynamic switching between visa implementations from different vendors
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
unflow
DSL to convert design to code
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
rusty_tools
useful tools for rust
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
transportations_library
Knowledge stack for transportation
-
align
aligning text
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
kitty_table
Very small and barebones library for printing tables of data
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
hebrew
alephbet primatives and parsing library for rust
-
console-utilities
handling console input and output utilities
-
tangler
Extracts code blocks from Markdown documents
-
dnd_spellbook_maker
making pdf documents of spells that ressemble 5th edition D&D official source book spell descriptions
-
wz
Count words, fast
-
rdfoothills-mime
collection of mostly small, RDF related utilities, including conversion between different serialization formats, using external tools. This part provides RDF serialization format types (plus HTML)…
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
ncase
Enforce a case style
-
jp_utils
Utils for working with Japanese text
-
gdnative-doc
Documentation tool for gdnative
-
pho
PHOnetic alphabet renderer
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
rep-grep
wgrep/write-grep CLI
-
galm
pattern matching library
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
notedown_ast
Notedown Abstract Syntax Tree
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
cheetah
An extremely fast and flexible static site generator
-
reword
some utility functions for human-readable formatting of words
-
lister-cli
Lister: Navigate Markdown Lists
-
lodestone
A website wrapper for FFXIV's lodestone
-
aqkanji2koe
wrapper for Rust
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
rust-regex-dsl
Regular expression DSL
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
quickmd
Quickly preview a markdown file
-
ultra
Cryptanalysis of the Engima
-
xi-rope
A generic rope data structure built on top of B-Trees
-
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
sttx
belt for transforming speech-to-text data
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
markov_str
Markov Chain implementation optimized for text generation
-
justcsv
CSV-file reader/writer
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
etch
Not just a text formatter, don't mark it down, etch it
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
cosmic-text
Pure Rust multi-line text handling
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
czv
performing CSV-related operations for data engineering and analysis
-
text-colorizer
Transitionary package
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
wool
Preview Github Markdown Offline
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
yinzhe9
喵喵隐者9
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
text_unit
Newtypes for text offsets
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
nih_log
An opinionated yet flexible logger catering to the needs of the NIH-plug plugin framework
-
unicode-casing
Titlecase helper function on characters
-
punycode
Functions to decode and encode Punycode
-
nfa_regex
NFA regex engine for text processing
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
squidge
shortens delimited data
-
markdown-gen
generating Markdown files
-
clima
A minimal Markdown reader in the terminal
-
dispnet-hash
Hash Library used for Dispnet
-
ungoliant
The pipeline for the OSCAR corpus
-
langsan
sanitizing language model input and output
-
cyrconv
A funny faux cyrillic character mapper
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
strsplit
Split a string slice using a delimiter of your choice
-
uwuify
fastest text uwuifier in the west
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
rasciigraph
function to plot ascii graphs
-
vidyut-lipi
A Sanskrit transliterator
-
txt_to_md
Command converting from a txt file to a markdown file
-
founder
The package provides a font toolbox
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
dtex
Better TeX
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
mdbook-fs-summary
Summary generator for mdbook
-
display_bytes
Human-readable display of byte sequences
-
minigrepsada
Test
-
plsfix
Text cleaner upper
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
markdown-table
Creating markdown tables with Rust!
-
mitex
TeX2Typst converter
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
hitori
Generic compile-time regular expressions
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
asciidork-opts
Asciidork opts
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
prototty_render
Traits for defining a renderer and a view
-
ps-str
String transcoding library
-
unicode_names2_generator
Generates the perfect-hash function used by
unicode_names2
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
recase
Changes the convention case of input text
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
texcore
Create LaTeX documents using native Rust types
-
weggli-ruleset
Ruleset managment/matching/reporting for weggli patterns
-
character_frequency
counting character frequencies in a string concurrently
-
cellumina
configure and run cellular automata
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
fast_aug
Fast data augmentation for text
-
imperative
Check for imperative mood in text
-
code-splitter
Split code into semantic chunks using tree-sitter
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
rusty-x
A snippet manager in Rust
-
twitter-text
in Rust
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
story-dl
Story web scraping
-
hexstring
handling hexadecimal string
-
bk-tree
A Rust BK-tree implementation
-
akai
Next-generation, functional and simple ID system
-
detect-newline-style
Determine a string's preferred newline character
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
sayit
String replacements using regex
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
searcher_txt
A copy of grep that i mate to train my rust skills
-
pulldown_mdbook
A pull parser for mdBook
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
strfry
randomize a string
-
markovish
Markov chain implementation for text generation
-
mdbook-force-relative-links
An
mdbook
pre-processor to transform all local links to relative ones -
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
ucfirst
Uppercase the first letter of a string
-
kas-text
Text layout and font management
-
godotdoc
documentation generator for GdScript
-
ansi-width
Calculate the width of a string when printed to the terminal
-
deepl-rustls
DeepL API - RusTLS Fork
-
bge
Rust interface for BGE Small English Embedding Library
-
br-pdf
PDF Invoice Processing
-
grep-reader
short text for crates.io
-
subject-classifier
classifying a commit by it's subject
-
mdbook-najan
Preprocessor for the Najan mdBook
-
find_unicode
Find Unicode characters, the easy way!
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
quickner-core
A fast and simple NER tool
-
khat
A cat clone, nothing more nothing less
-
adobe-cmap-parser
parse Adobe CMap files
-
chisel-lexers
Chisel backend lexers/scanners
-
path2regex
Express style path to RegExp utility
-
veg
Flexible tables
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
dequote
Remove nested quotes around text
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
bstr
A string type that is not required to be valid UTF-8
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
slugify-rs
generate slugs from strings
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
speech-synthesis
Common traits for speech synthesis
-
write16
A UTF-16 analog of the Write trait
-
qpprint
console printing/formatting
-
faster-chars-count
counting length of chars faster than Chars::count()
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
aki-mline
match line, regex text filter like a grep of linux command
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
neo-mime
Strongly Typed Mimes
-
hyphenation_commons
Proemial code for the
hyphenation
library -
qm-role-build
Role builder from markdown tables
-
vec-string-to-static-str
providing utilities for converting vectors of
String
s into vectors of&'static str
-
mdbook_incremental_preview
Incrementally preview mdBook, patching changes instead of making full rebuilds
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
markdown-toc
Markdown Table of Contents generator
-
rust-texas
generate latex documents
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
streamparser
Adapt simple stream parsers for different types of streams
-
boss
Baseball Open Source Software
-
mdcat-http-reqwest
Reqwest-based HTTP resource handler for mdcat
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
reddish
A Rust utility library
-
gregex-logic
Logic for the gregex crate
-
markdown-extract
Extract sections of a markdown file
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
fiat-lux
Offline terminal-accessible Bible
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
asciimath-text-renderer
Render asciimath in terminal
-
regexgrep
ripgrep tool that suports regular expressions
-
sortuniq
Find or count unique values in an input stream
-
invisible_unicode
finding invisible unicode characters
-
synox
Program synthesis of string transformations from input-output examples
-
cur
that will hunt for your regular expression
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
censor
text profanity filter
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
ranting
Linguistic formatting placeholder extensions for rust
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
runestr-pancjkv
rune
-based Pan-CJKV support -
notegraf
Core library for building a graph-oriented notebook
-
tiniestsegmenter
Compact Japanese segmenter
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
split-every
Split for every n occurrences of a pattern iteratively!
-
emoji
Every emoji, their metadata, and localized annotations
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
mdbook-mdinclude
Preprocessor for mdBook that better handles including markdown files
-
zhlint
A linting tool for Chinese text content
-
rascii_art
Advanced ASCII Art Generator
-
refac
Transform some text given a generic natural language prompt
-
ob
A Blog and RSS system written in Rust
-
fast_symspell
Spelling correction & Fuzzy search
-
chardet
rust version of chardet
-
haoxue-dict
Chinese dictionary and word segmenter
-
encoded-words
Encoded Words for usage in MIME headers
-
regex-split
split_inclusive for the regex crate
-
ftrace
trace files and paths
-
no-comment
Remove rust-style line and block comments from a char iterator
-
grammateus
facilitate working with Ancient Greek words
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
text_layout
Text layout algorithms
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
sauropod
Stellular Shared Resources
-
char_reader
Safely read wild streams as chars or lines
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
string-utility
substring implementation (with the RangeBound trait), keep string before or after a specified string, etc
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
sedregex
Sed-like regex library
-
caser
Change text between PascalCase, camelCase, and snake_case
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
catalog-of-markdown
Generate the catalog of markdown file
-
irg-kvariants
wrapper around kvariant from hfhchan/irg
-
in_rainbows_printer
Prints some In Rainbows-style (the Radiohead album) text to your termial
-
spellabet
Convert characters into spelling alphabet code words
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
snake_case_converter
convert strings to snake case
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
slack_update
app to set Slack status, emoji and photo
-
strizer
minimal and fast library for text tokenization
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
nibi
static site generator. (WIP, implementing)
-
mdtranslation
prepare multi-lingual Markdown documents
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
mdzk
Plain text Zettelkasten based on mdBook
-
uwu_cli
uwuifying the terminal
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
tradukisto
Kinda useful natural language translation library and utility
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
pulldown_typst
A pull parser for Typst markup
-
bzzz
Bee-Speak Translator
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
jcalendar
Japanese Calendar for Rust
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
hello_lib
Demonstrate Generics Function
-
umlauts
text transformation of german umlauts
-
text_distance
A collection of approximate string matching algorithms
-
loading
Show loading in terminal
-
repgrep
An interactive command line replacer for
ripgrep
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
gimme
Pull useful data out of your clipboard
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
genpdf
User-friendly PDF generator written in pure Rust
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
rustrings
Strings manipulation for Rust
-
mdbook-indexing
mdbook preprocessor for index generation
-
csvsc
Build processing chains for CSV files
-
str-block
Remove the initial newline and common indentation from string literals
-
texting
string helpers
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
enum-ts
TypeScript Enum pattern matcher codegen
-
common_substrings
Finding all common strings
-
redpatterns
a list of patterns for scanners 📟
-
charclass
define and modify unicode character classes
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
any2utf8
Convert any enncoding to utf-8
-
stylish-core
The semver-stable subset of
stylish
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
rl-sys
Native bindings to libreadline
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
bureau
extracting dependency licenses, used by nertsio
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
translitrs
Transliteration utility for Serbian language
-
ryaspeller
lib for searching typos in text, files and websites
-
mitex-spec
Specification Library for MiTeX
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
sql-builder
SQL code generator
-
unicode-vo
Unicode vertical orientation detection
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
ucd-raw
Uninterpreted access to the unicode UCD
-
cfasttext-sys
fastText ffi binding
-
ferrissay
cowsay
-
groop
Outline-aware grep with syntax highlighting
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
correct_word
A No brainer 'did you mean' library for Rust
-
loro-internal
Loro internal library. Do not use it directly as it's not stable.
-
hi-doc
Diagnostics formatting library
-
unindenter
unindent text
-
buss-protocol
Bussin binary protocol
-
enso-lazy-reader
An efficient buffered reader
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
pdf_composer_definitions
PDF Composer definitions crate
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
pinot
Fast, high-fidelity OpenType parser
-
rust_baht_text
Convert number to Thai Baht text
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
group-similar
Group similar values based on Jaro-Winkler distance
-
grace-cli
CLI tool for processing files and strings
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
skyspell_kak
skyspell - kakoune integration
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
recursive-file-loader
recursively load files via references in the files
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
afrim-memory
handle of sequential codes easier for an input method
-
uwubot
discord bot for uwuifying text
-
mask-text
mask text with multiple masking options
-
hyper-old-types
HTTP types from hyper 0.11.x
-
clippers
Cross-platform clipboard management library
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
scatternotes
A cli application to manage unstructured notes
-
looking-glass
reflection & type-erasure library for Rust
-
extract-words
Extracts words from text without allocation
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
wordpieces
Split tokens into word pieces
-
macro_colors
colorful printing macros
-
json-peek
Amature JSON parser library designed for my specific need
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
text-diff
text diffing and assertion library
-
aki-mcycle
mark up text with cycling color
-
spoilers
High-level Rust bindings for CTranslate2
-
help_crafter
help message generator without hussle
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
wattle
tools for rust
-
lithe
A Slim template engine by using Pest
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
is-vowel
Heuristically test whether a character is a vowel letter
-
naromat
Convert text to narou novel format
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
tabwriter
Elastic tabstops
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
nmd-core
Official NMD compiler core
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
xconv
A high-performance batch file encoding conversion tool
-
vape
full width aesthetics
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
korean_regex
Regex extension for Hangeul analysis
-
neardup
near-duplicate matching
-
event_parser
Rust NLP library for parsing English natural language into icalendar events
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
snekdown
A parser for the custom snekdown markdown syntax
-
minigre_base
text file search tool
-
encoding_c
C API for encoding_rs
-
igpay-atinlay
Translate text to Pig Latin
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
mdbook-plugin-utils
mdBook plugins
-
harper
implement SSGs
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
spongemock
CLI-tool for random text capitalization
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
line_adjustment
Line adjustment example
-
timeblok
A language for event scheduling in plain text
-
text_to_emoji
Convert text to emoji
-
kanpyo
Japanese Morphological Analyzer
-
stardict
Rust implement stardict
-
kirjat-rs
prices for finnish textbooks from multiple stores
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
confusables
around Unicode confusables/homoglyphs
-
trie-match
Fast match macro
-
markov_strings
A simplistic Markov chain text generator
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
veryfi
Module for communicating with the Veryfi OCR API
-
bpmf_py
A Bopomofo and Pinyin library
-
stylish-stringlike
API for string-like objects that have styles applied
-
cha-rs
Extract specific characters from an input
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
shear
trimming excess contents from things
-
runanum
Существительные с правильными окончаниями после чисел
-
collapse
Trim and collapse consecutive whitespace to a single space
-
strip_markdown
remove markdown syntax from markdown files
-
playin-cards
♠ Library for French-suited playing cards
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
matchpick
Find and replace multi-lines using a match-case
-
uiuifree-normalize
uiuifree text normalize
-
rscx-mdx
MDX support for RSCx
-
text_maid
Text maid
-
indentation
Formatter
-
text_searcher
text searcher tool
-
rcut
replacement for GNU cut that supports UTF-8
-
wcrs
GNU wc in Rust
-
kanpyo-dict
Dictionary Library for Kanpyo
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
mdx
in Rust
-
csv_to_table
pretty print CSV as a table
-
md2gemtext
for converting Markdown into gemtext
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
utf-cli
A UTF code viewer
-
parattice
Recursive paraphrase lattice generator
-
toml-test-data
TOML test cases
-
charmap
one-to-(none/one/many) character mapping
-
utfx
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
bocu1
BOCU-1 compressed unicode encoding
-
summary
Extract the sentences which best summarize a document
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
genex
Text-expansion library
-
unicode_escape
decoding escape sequences in strings
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
twjsrs
(De)serialize TiddlyWiki tiddlers to and from their JSON format
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
ferris-finder
Unicode character finder with fuzzy search
-
assert-text
the testing macro tools
-
alpha-counter
Alphabetic counter
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
sauron-markdown
parsing markdown into sauron node
-
rustplexity
bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text
-
mdbook-chapter-number
A mdBook preprocessor that adds chapter numbers to the each page header
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
small-bwt
BWT construction in small space
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
pra
Print Random ASCII
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
cyrla
two-way conversion between latin and cyrillic script
-
floem-cosmic-text
Unofficial cosmic-text fork for Floem
-
linkcheck
extracting and validating links
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
mdbook-twiki
twiki backend for mdbook
-
amongify
A very ඞ sus ඞ program
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
base256_lib
Base256 encoder/decoder
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
rdfoothills-conversion
collection of mostly small, RDF related utilities, including conversion between different serialization formats, using external tools. This part allows to convert between…
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
strange
A static website generator
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
html-compare
compare html files
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
interslavic
in rust
-
mdbook-chapter-list
mdbook preprocessor for listing subchapters
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
hina
:]
-
text-utils
Text utils for unescaping and align
-
yozuk-core-skillset
Set of default Yozuk skills
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
arabic-script
An expressive API for the characters of the Arabic script
-
grepox
Minimalist's grep written in Rust
-
unfurl
expanding links in text
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
delay_writer
Wraps a writer and delays its output after each newline
-
uniwhat
Display the unicode characters text
-
shutil
Shell utility helper library
-
verba
working with Latin words
-
mutf8
Basic support for mutf8 strings
-
pink-runtime
A very minimal replecement based DSL, intended for math
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
basic_lib_for_me
just a basic lib that i will use usually
-
console-traits
Basic console traits, such as setting char position and writing strings
-
imatree
command line tool to generate aesthetically pleasing (and sometimes not so pleasing) images of whatever string you want
-
bnf_sampler
that uses recursive descent algorithm to ensure tokens produced by a large language model follow a Backus Naur Form schema
-
wkhtmlapp
Convert html to pdf or image
-
ocrmypdf-rs
A sdk for the ocrmypdf command line tool
-
str_overlap
Methods for finding the overlap between two string slices
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
askama-markdown-cmark
Askama filter for markdown, using pulldown-cmark
-
pathmut
Command line utility for extracting, removing, and replacing components from path strings
-
polyhorn-ios
Polyhorn for iOS
-
ascii_converter
converting between different ascii representations
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
scalar-value-string
Scalar Value String
-
spacebar
An anti-plagiarism tool based on null width characters
-
rex-regextract
extracts key value pairs out of text
-
vidyut-kosha
A Sanskrit key-value store
-
afrim-translator
Manage the predication system of the afrim input method
-
mudders
Generating Lexicographically-Evenly-Spaced Strings, or: Mudder.js in Rust
-
wcounter
Give the word and count the appearance
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
rst_parser
a reStructuredText parser
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
arbitrator
Format text based on a set of rules and regexes
-
staple
powerful static site generator
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
opt2doc
generate documentation from Rust source code
-
termwrap
Wrap Unicode text with ANSI color codes
-
mdict
Rust implement mdict
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
markdown2unicode
Converter from markdown notation to unicode characters
-
boxy
Declarative builder for Unicode box-drawing characters
-
learning_fromdocs_minigrep
Project made on the path toward learning Rust from its docs. Completely personal and to test how to upload to crates.io.
-
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
termbook
behind the
termbook-cli
-
glyphana
Quickly find, inspect & collect unicode glyps
-
jtxt
A JavaScript syntax text processing tool, an awk alternative
-
hunspell-rs
Rust bindings to the Hunspell library
-
marcus
An experimental Markdown parser written in Rust
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
syntastica-themes
A collection of themes for syntastica
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
bbcode-tagger
BBCode tree parser and tagger
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
sc_decode
Special character decoding
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
transcript
A transcriber for European scripts
-
vibrato
viterbi-based accelerated tokenizer
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
glyph_brush_layout
Text layout for ab_glyph
-
shoebill
A Wadler/Leijen style pretty-printer
-
remake
writing maintainable regex and managing symbol soup
-
uchr
translate char values to the characters themselves, and vice versa
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
utf8-command
UTF-8 encoded
std::process::Command
output -
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
trim
trim whitespaces from files
-
zw
encoding and decoding text using zero-width characters
-
goodname
assist you with cool naming of your methods and software
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
readwise
wrapper for the Readwise API
-
mdbook-footnote
mdbook preprocessor for footnotes
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
stardict_wrapper
Rust implement stardict
-
skribo
low-level text layout
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
ipa-translate
translating between IPA and ASCII text
-
inslice
A command-line utility for filtering text input by columns and rows
-
braille_pics
producing text-art pictures using Braille characters
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
mdbook-unlink
A mdBook backend that validates local links
-
morc
Dead simple, minimal markdown generator library written in Rust
-
rammer
A Baysian Text Classifier for Spam/Ham
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
matrix_display
display a matrix in command line with pretty unicode separators
-
kincaid
A word statistics library in Rust
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
matchable
Check if text is matching against string or regex in one API
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
genkit
A common generator kit for static site generator
-
ssml-parser
parsing speech sythnesis markup language
-
quill_delta_pdf
Convert Quill Delta to PDF
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
pulldown-cmark-fork
A pull parser for CommonMark
-
stringsext
find multi-byte-encoded strings in binary data
-
aki-xtee
copy standard input to each files and standard output
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
libxdiff
Rust bindings for the libxdiff C library
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
struckdown
A structured markdown / commonmark library for Rust
-
html_to_markdown
Convert HTML to Markdown
-
indoc
Indented document literals
-
trans-case
Transform case
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
jput
puts and putc on unicode-width align for Rust
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
text_styled
Trait to style String and &str
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
shq
sh quoting tool and library
-
env2toml
Convert env vars to toml text
-
owoify
Text owoification library
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
sigalign-impl
implementations for core
-
libflagup
Display a country's flag as an emoji
-
mpdf
writing PDF files
-
basic-text-internals
Basic Text string literal implementation details
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
twitter_text_conformance
twitter-text in Rust
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
genere
randomization of text respecting grammatical gender of sentences
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
timfmt
A small utility for formatting code as Tim likes it
-
icu_casemapping
Unicode case mapping algorithms
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
md-include
include any file in markdown files
-
uwildmat
wildmat/uwildmat matching
-
moscato
Outline scaler for OpenType glyphs
-
fmt-extra
Extra formaters not shipped in the standard library
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
corpus-count
Util to count words and character ngrams in a corpus
-
text2art
translate text to ascii art
-
llmvm-outsource-lib
outsource backend for llmvm
-
ctf-brute
Brute-force utilities for Rust
-
irssi-sys
Automatically generated bindings to irssi
-
syllable
counter for use with reading level calculations
-
zummi
fun lib that produces spoonerisms
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
casaubon
A feature-poor static site generator
-
untex
Understand and manipulate TeX files with ease
-
mdplayscript
An extension of Markdown for play scripts
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
marko
Programmtically format text with Markdown syntax
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
textos
Texts, strings, formatting, unicode…
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
mul
Bengali stemmer
-
ogrep
searching in indentation-structured texts
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
pdf-min
Very minimal crate for writing PDFs
-
ripmors
encoding and decoding international Morse code and several variants
-
m_lexer
extensible regular expressions based lexer
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
pretok
A string pre-tokenizer for C-like syntaxes
-
sparklet
small flashcards library
-
uecho
The unicode of the echo command
-
mdbook-rust
Enhanced Rust support for MDBook
-
vndb_rs
Bare-bones VNDB api
-
random_access_unicode
Provide random access to the unicode characters in a file
-
ayda
Ask your Documents Anything. A tool for querying your documents with a large language model.
-
crypto-invert
Unicode Upside-Down Mapping
-
mqtt-analyzer
Help analyzing MQTT messages
-
topfew
CLI to find high frequency occurrences in structured text files
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
cermic
Print .png images in your terminal as ascii art!
-
ezemoji
Catigoryized Emoji's
-
slidedeck
Create an HTML slide deck from Markdown
-
unicode-canvas
creating text base drawing
-
ngrams
Generate n-grams from sequences
-
ergrep
grep strings within a line from a text file
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
supply-chain-trust-example-crate-000068
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
detect-indent
Detect the indentation of code
-
like
A SQL like style pattern matching
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
august
& program for converting HTML to plain text
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
justify
plaintext while handling Unicode gracefully
-
modeling
tools to analysis different languages by Ctags
-
yozuk-helper-english
English NLP utilities for Yozuk
-
mdbook-translation
prepare multi-lingual mdBook books
-
string-overlap
A helper crate for "layering" ASCII art
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
read_chars
An iterator over characters read from some I/O source
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
korrektor
work with Uzbek language text processing
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
case_style
Converts from and to different case styles
-
fmty
Composable
core::fmt
utilities -
chisel-parsers
Chisel parser front ends
-
unicode_font
Convert unicode characters between fonts
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
hunspell-sys
Bindings to the hunspell C API
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
pygmentize
wrapper for syntax highlighting
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
spdkit-surface
short text for crates.io
-
mocker
transforming text with uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
aki-stats
output the statistics of text, like a wc of linux command
-
shallow
long text
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
subscript-compiler
A modern LaTeX rendition
-
bible-reference-rs
Extract Bible references from plain text
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
framework
detector for different frameworks in one projects
-
ascii-rs
Process image into colored-ascii image
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
scoundrel
An engine for working with ASCII roguelikes
-
regex-automata
Automata construction and matching using regular expressions
-
caseless
Unicode caseless matching
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
pcre2
High level wrapper library for PCRE2
-
xhtmlchardet
Character set detection for XML and HTML
-
tergo-tokenizer
R language tokenizer
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
wildflower
Wildcard matching against strings
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
single_source
Generate code files from snippets in md tutorial files
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
crustword
Crusty Crosswords
-
trigram
Trigram-based string similarity for fuzzy matching
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
moenarchbook
Creates a book from markdown files
-
scripter
A screenplay compiler
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
text-to-json
Convert text to json in rust
-
stringsort
Pathological sorting of string characters
-
corg
A cog-like tool
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
file-search
File indexing and search
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
inform
indentation formatter
-
sydney
Vim-like, Command-line Gemini Client
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
token-read
reading whitespace delimited files intended for competitive programming
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
varcon-core
Varcon-relevant data structures
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
llmvm-core-lib
llmvm core application
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
gqlog
👾 filter your json logs with graphql 👾
-
catmark
Console printer for CommonMark
-
romulus
a stream editor like sed
-
konbaato
Conversions between different Japanese scripts
-
awabi
A morphological analyzer using mecab dictionary
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
presciidoc
Preprocessing AsciiDoc for other tools
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
yitizi
異體字查詢 Get variant Chinese characters
-
latin1str
Windows-1252 string types
-
ucd-util
A small utility library for working with the Unicode character database
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
trevordmiller
Personal CLI
-
isixhosa
help process text in isiXhosa
-
pdf_seekers
parser and information extractor from PDF documents based on keyword search functionality (powered by Rust)
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
utf
UTF-8
-
flw
Process text via configurable tasks
-
mdbook-fishextract
A mdbook preprocessor which handles mermaid graphs, offline, requires
mmdc
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
encoding_c_mem
C API for encoding_rs::mem
-
ddvm
Document to Document Virtual Machine
-
kvu
The simplest command line tool to manage key-value pair lines
-
uniquewords-rs
Count the frequencies of words in text file(s)
-
anagrambot
find anagrams of words
-
mdbook-tagger
A mdbook preprocessor to collect and build tag data in Json format
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
ae-renderable
handling individual text characters that can be rendered on the screen. Intended for use in game development.
-
mdbook-open-gh-issue
mdbook preprocessor to add a open-on-github link on every page
-
leven
Make your own blog!
-
falcom-sjis
Falcom-compatibile Shift JIS implementation
-
static_format
Format strings with no runtime overhead
-
word_filter
A Word Filter for filtering text
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
mdbook-newday
Command line utility to add a new line for today to the top of an mdbook SUMMARY.md book
-
mors
converting morsecode to text and reverse
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
platina
parameterized golden testing library
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
pencil_case
but effective library for writing GUI in bevy
-
blockcounter
Counts the blocks in a stream
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
pillar
small tool to format lines into columns
-
rescue-blanket
Escape values while they are being formatted
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
mojibake
Encode/Decode bytes as emoji base2048
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
pascii
image => text
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
mdbook-post
A CLI for add post to mdbook
-
demoji
Remove all emojis from a string
-
wordfreq-model
Model loaders for wordfreq-rs
-
caribon
A repetition detector program and library
-
literate
programming tool that extracts code written in your Markdown files
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
skyspell_core
skyspell core library
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
bareun_rs
Bareun is a Korean Morphological analyzer for Rust
-
furigana
Map furigana to a word given its reading
-
tweak
when/then clauses to run
-
saurus
Markdown to LaTeX Transpiler
-
difference
text diffing and assertion library
-
is_printable
Determine whether a given text-based value is printable
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
bebop-lang
LISP based Markdown preprocessor for creating html documents!
-
typos-dict
Source Code Spelling Correction
-
intname
Full English name for any integer of any primitive integer type
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
cw
Count Words, a fast wc clone
-
ansi-cut
cutting a string while preserving colors
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
am-parse
converting asciimath to other formats
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
esl01-drawdag
Parse an ASCII DAG into parent relations
-
mdtransform
program that renders Markdown as HTML for websites
-
minigrep-yogie
A demo Rust to grep some word from rust-lang.com
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
mdbook-morsels
Morsels plugin for Mdbook
-
patterns
no_std compatible pattern scan library using simd
-
ced
Dead easy csv editor
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
ascii_tree
generates ascii trees
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
informa
font compiler for digital typography
-
rustex
auto-generated LaTeX files in Rust
-
ptero-cli
A text steganography CLI tool for Social Media
-
yarner-lib
build plugins for the Literate Programming tool Yarner
-
tre-regex-sys
Rust bindgen bindings to the TRE regex module
-
string_morph
string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.
-
csvre
replacing data in CSV columns with regular expressions
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
local-encoding
encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
ascii-engine
A fast ascii engine written in rust!
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
textblocks
working with text blocks
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
nb2fr
Convert integer number to literal French text
-
markdown-composer
composing markdown documents
-
ccase
Command line interface to convert strings into any case
-
t9
Easily create port number from words and vice versa
-
gret
command line tool to search for patterns and show matches in a tree structure
-
seven_seg
Seven-segment digital display for terminal
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
gulpeaseindex
Calculate Gulpease index for a given text and language
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
lines_lossy
extension to
BufRead
with a functionlines_lossy
that works likeBufRead::lines
but with lossy UTF-8 decoding -
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
comment-strip
Remove comments out of text files
-
markdown-it-tasklist
A markdown-it plugin for parsing GFM tasklists
-
vidyut-chandas
A Sanskrit metrical classifier
-
dr
Command-line data file processing in Rust
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
route-pattern
A parser and matcher for route patterns
-
demoji-rs
demoji implementation for Rust
-
document_tree
reStructuredText’s DocumentTree representation
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
const_format_proc_macros
detail of the
const_format
crate -
rustextile
Textile markup language parser for Rust
-
cutters
Rule based sentence segmentation library
-
repub
convert markdown documents to epub
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
zp
Copy the contents of the source file or the std output buffer to the clipboard, allowing users to easily paste the contents into another file or program
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
h_hangul
Korean Characters
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
bos_books_codes
that handles 3-character Bible Books Codes
-
markdown-it-gfm
A markdown-it plugin for parsing definition lists
-
aklat
create books from markdown files (like Gitbook)
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
scie
research about how to build simple code identify engine for different languages
-
lindera-filter
Character and token filters for Lindera
-
fsays
flavored replacement for the classic cowsay
-
bbd-lib
Binary Braille Dump
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
vaporetto_rules
Rule-base filters for Vaporetto
-
react-html
react html components for frender
-
cli-animate
Enrich your Rust CLIs with stunning, easy-to-implement animations
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
github-slugger
A slugger for GitHub headings
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
md_converter
My markdown to html converter
-
great-tokenizer
A tokenizer
-
hemtt-tokens
A token library for hemtt
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
minigrep5
grep implementation in Rust
-
morse-nostd
A nostd version of the morse crate
-
highlights
Convert kindle highlights into different formats
-
trunc8
Truncate text to a specific line length, based on a number of parameters
-
charasay
The future of cowsay 🐮! Colorful characters saying something 🗨️
-
norm-email
strip email provider defined behaviour from email addresses
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
ron_to_table
pretty print RON as a table
-
dhoni
converting Bengali text into their phonetic counterpart
-
every-range
Iterator for interleaving included and excluded ranges
-
rnltk
Natural Language Toolkit for Rust
-
pdf_composer_base
PDF Composer base functionality crate
-
regex-cli-test
A command line tool for testing against regex patterns
-
noneifempty
Convert T to None if empty
-
deliminator-core
Universal code documentation generator
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
retest
Command-line regular expression tester
-
text_alignment
Align your text in Rust in the CLI
-
paperoni
A web article downloader
-
transcoding_rs
Converts text encoding the easy and efficient way
-
conllu-utils
working with the CoNLL-U dependency format
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
minbpe
Port of Andrej Karpathy's minbpe to Rust
-
cattocol
Combine two text into one text as columns
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
morph-rs
Dictionary Morphologizer for Russian language
-
syntax-error
Syntax error report
-
xim-ctext
compound text en/decoder
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
jp-location-relation
隣接する市区町村の一覧を取得
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
syllarust
quickly counting syllables
-
publish_demo_node
This program picks a random christmas emoji for you
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
smoldown
Native Rust library for parsing Markdown
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
datex
package to automatically process your data and make it look pretty in LaTeX
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
hema_minigrep
grep clone
-
prettify-markdown
Format Markdown at the speed of Rust
-
readable-readability
Really fast readability
-
minigrep_desonglll
grep implementation from The Rust Programing Book
-
strings
String utilities, including an unbalanced Rope
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
grep-rezza
An app for search text in file
-
aki-txpr-macro
the more easy to use libaki-*
-
encoding-index-korean
Index tables for Korean character encodings
-
yozuk-model
NLP model generator for Yozuk
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
nib
static site generator
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
hline
a grep-like tool that highlights lines in files
-
ammonia
HTML Sanitization
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
varcon
Source Code Spelling Correction
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
os_display
Display strings in a safe platform-appropriate way
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
lindera-decompress
A morphological analysis library
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
beary
augment Bear, the note-taking app
-
crudite
A JSON CRDT
-
escaped-delimiter
Iterator of delimited slices with escape characters
-
fancy_print
Easily print animated ASCII text to the terminal!
-
html2runes
An HTML to Text converter
-
mepple
English dictionary as a library
-
iconv-compat-win-sys
iconv bindings for Rust
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
adauguet_minigrep
First program from the Rust Book
-
charless_minigrep
A program for searching for phrases in a text and printing the lines containing it
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
tfidf-summarizer
Basic tf-idf compute for documents
-
esc
Escape characters in strings
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
simple_csv
CSV parsing implementation
-
askama-filters
Extra template filters for Askama
-
encoding8
various 8-bit encodings
-
sola
A 16-bit stack machine and forth language
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
trexter
Text progression tracking library
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
static_table
creates pretty tables at compiler time
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
zalgo-text
A command line tool for generating zalgo text
-
strainer
Command-line tool for finding duplicate lines in text files
-
re2
Wrapper for the re2 C++ regex library
-
hsk
Return HSK Level for Simplified Chinese Characters
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
lyn
A scanner useful for creating recursive descent parsers
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
twemoji-rs
A word-cloud image generation crate
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
latex-to-html
Latex to html converter
-
fwuffgrep
Basic implementation of a grep command written in rust
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
bookrafter
This repository contains code related to bookrafter rendering
-
xsv
A high performance CSV command line toolkit
-
textcat
detect text categories. It can be used to detect the language of a given text
-
ucd-parse
parsing data files in the Unicode character database
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
top-english-words
retrieve top words from the English language
-
aqua_parse
A parser library
-
anagram
A collection of anagram utility functions
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
psa
PSA(Project structure analysis) is a analyzer for analysis project struct
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
japhonex
Japanese phone number checker for Rust
-
unic-bidi
UNIC — Unicode Bidirectional Algorithm
-
exit-with-code
A CLI which returns the exit code of your choice
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
ttf_word_wrap
Wraps text based on character width
-
kryptor
command-line cryptography tool
-
codes-iso-15924
This package contains an implementation of the ISO 15924 Codes for the representation of names of scripts specification
-
traktorhtml2txt
Does what it says, convert an Traktor set list HTML export to text
-
grepy
search files
-
math_test
Generates math questions for elementary school kids
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
aki-json-pick
The json pick out command
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
stardict_lib
Rust implement stardict
-
rsplitter
Splitter is a tool to split words from text that contains no spaces
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
spyglass
Search engine for documents, inspired by bioinformatics
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
corpus-preproc
A preprocessor for text and HTML corpora
-
ezk-internal
ezk internal util
-
diff_match_patch
Diff Match Patch is a high-performance library in Rust-lang that manipulates plain text
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
fingers
a finger client library
-
indexrs
inefficient multi-language search index
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
lindera-tokenizer
A morphological analysis library
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
html_to_pdf_lib
converting HTML to PDF
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
hoedown
bindings for the Hoedown markdown processor
-
texrender
Thin wrapper around running
latexmk
to render LaTeX documents. Also supports generating Tex documents. -
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
kth-lines
Command line tool for filtering stdin lines that just work
-
tex
The νTeX typesetting engine
-
pdf_form
programatically filling out pdf forms
-
encoding-next-index-japanese
Index tables for Japanese character encodings
-
jp_inflections
A small crate to conjugate japanese words
-
geml
Generator-orientated ML parser
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
conv-rs
CLI converter for multi extensions
-
slicer
that slices string slices into smaller string slices
-
spellcheck
A spell-checker based on the statistical algorithm described by Peter Norvig in http://norvig.com/spell-correct.html
-
font-index
Static index for system fonts
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
minigrep_maybe
Just another minigrep from Rust book
-
dismej
convert text to discord emojis
-
newline-converter
Newline byte converter library
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
nib-cli
A cli for a yet another static site generator Nib
-
try_publish_for_demo
This program picks a random christmas emoji for you
-
luaparse
A Lua 5.3 parser
-
wordninja
port of the Word Ninja English word splitting library
-
deface
Lightweight markup to HTML converter
-
libgrep-rs
searching through text
-
milligrep
Custom simplified implementation of grep
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
strng
Owned UTF-8 string
-
markx
markdown parser
-
combos
Print all permutations of a word list
-
contractions
expand contractions in English
-
case-conv
Faster case conversion crate
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
mdlint-cli
tbd
-
text-tables
A terminal/text table prettifier with no dependencies
-
decline-word
Choose word form based on given number
-
latexify
Shared definition for turn a rust object into latex code
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
hex_table
Format a byte slice into a hex table
-
ruby-parser
A parser for the Ruby language
-
equt-md-ext
Extend event iterator
-
json-event-parser-witespace
A json event parser with whitespace
-
scrambler
command line tool to scramble letters
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
rigrep
grep from Rust Book
-
charjpoet
Charj Poet is a API for write to
.cj
language -
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
vextractor
vextractor-cli
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
mystem
Wrapper around Yandex Mystem for Rust
-
cjieba-sys
unsafe ffi to cppjieba
-
text-sanitizer
convert text to plain ASCII text
-
epubparse
Parse epub and convert to text-only Book structure
-
backslash
parsing escape characters
-
mdbook-trace
A traceable document preprocessor for mdbook
-
clparser
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format. (fork marcaddeo/clparse)
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
uniaxe
replace Unicode letters with Ascii equivalents
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
panduck-latex
Use panduck to generate XeLaTeX
-
the-book-tui
A TUI application to load, read, and search The Rust Book
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
stopwords
from popular text processing frameworks
-
grep-searcher
Fast line oriented regex searching as a library
-
djot
Djot parser written in pure Rust
-
japanese-ruby-filter
Japanese ruby notation parser
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
goya
morphological analyzer for Rust and WebAssembly
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
typos-vars
Source Code Spelling Correction
-
encoding
Character encoding support for Rust
-
p4d-mdproof
Markdown to PDF converter
-
conveyance
A stop-gap CLI for conveyancing
-
simple-text-pattern
Matching simple text patterns
-
convert_encoding
Convert encoding of text files in batch
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
txtframe
Creates a frame for text
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
simplearrayhash
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
rep-cli
Replace text file in bulk
-
bgrep
grep tailored to handle binary patterns and files
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
scannedpdf
A fast, low memory footprint, simple images to PDF crate
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
transliterate1234
UTF-8 to ASCII transliteration
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
pseudolocalize
Pseudolocalization crate for Rust
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
lindera-compress
A morphological analysis library
-
node_rust_demo
This program picks a random christmas emoji for you
-
encoding-index-japanese
Index tables for Japanese character encodings
-
unidecode
pure ASCII transliterations of Unicode strings
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
asciifolding
ascii folding library
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
base1112031
way to represent numbers using all Unicode characters except ASCII control characters
-
pdf_forms
programatically filling out pdf forms
-
ascii_help
help you quickly convert ASCII codes
-
tectonic_engine_xdvipdfmx
The
xdvipdfmx
program as a reusable crate -
basen
Convert binary data to ASCII with a variety of supported bases
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
pdf2
PDF 2.0 Rust library
-
html_to_epub
A command line converts .html file to .epub file
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
jx
An interactive JSON explorer for the command line
-
pcre2-sys
Low level bindings to PCRE2
-
debug_canvas
Easily print complicated debug strings
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
lithe-cli
A cli of lithe
-
emojicons
Parse :emoji: notation to unicode representation
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
docstring
manipulating and parsing documentation strings
-
lindera-analyzer
A morphological analysis library
-
flag_by_code
Easiest way to get an emoji flag from BCP-47 language tags
-
tex-rs
create latex documents in Rust
-
ascii_set
Fast membership of ASCII character classes
-
toml_to_table
pretty print TOML as a table
-
writedown
format parser
-
minigrep_linqi
grep implementation from The Rust Programing Book
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
testing_table
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
rustyword
An anagram finder
-
replace-all
Cli to quickly replace occurences of a word in a file
-
mdtable-cli
that makes creating tables in markdown much easier!
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
owlz
"Owlz" ascii emoojis, created randomly or by design
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
txt_processor
A little library for text processing
-
publish_demo_node_1
This program picks a random christmas emoji for you
-
case_convert
Converts the first letter of a Rust String to uppercase
-
mr-regex
Minimalist ASCII Regex-engine with 300 lines of Rust
-
fountain
Parser and formatter for the Fountain screenplay markup language
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
globe
Customizable ASCII globe generator
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
swappy
An anagram generator
-
scenic
Presentation tool that utilizes LaTex Beamer package
-
md2pdf
A small utility to convert markdown files to pdf exploiting tectonic
-
ascii-read
BufRead
-like methods for reading into anAsciiString
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
forming
lightweight architecture as code language. 架构描述语言
-
overlap
shows overlap text in files
-
pdfutil
PDF document manipulation
-
tpaint
Terminal based unicode and ascii diagrams and drawings
-
chardetng_c
C bindings for chardetng
-
rustascii
Display Rust in ASCII
-
borderrs
Add stylish borders around your text and datastructures
-
asciify
converting images to a readable format on the command line
-
pdf-annotations-converter
Converts annotations found in PDF files to different formats
-
ascii-to-hex
A small, simple library to converting an ASCII text string into its hexadecimal equivalent
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
jpreprocess-core
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
shift_or_euc
Detects among the Japanese legacy encodings
-
wkhtmltox-sys
FFI bindings to wkhtmltox
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s