-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of `char` and `str` types according to Unicode Standard Annex #11 rules
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
similar
A diff library for Rust
-
const_format
Compile-time string formatting
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
heck
case conversion library
-
convert_case
Convert strings into any case
-
ropey
A fast and robust text rope for Rust
-
lazy-regex
lazy static regular expressions checked at compile time
-
pulldown-cmark
A pull parser for CommonMark
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Supports Emoji and Chinese.
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libraries. Both `u16` and `u32` string types are provided, including support for UTF-16 and UTF-32…
-
emojis
✨ Look up emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
termimad
Markdown Renderer for the Terminal
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
ammonia
HTML Sanitization
-
lopdf
PDF document manipulation
-
html2text
Render HTML as plain text
-
mdbook
Creates a book from markdown files
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
pretty
Wadler-style pretty-printing combinators in Rust
-
printpdf
reading and writing PDF files
-
chardetng
A character encoding detector for legacy Web content
-
prettydiff
Side-by-side diff for two files
-
fuzzy-matcher
Fuzzy Matching Library
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
unicode-bidi
Unicode Bidirectional Algorithm
-
regress
A regular expression engine targeting EcmaScript syntax
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
stringprep
algorithm
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
lindera
A morphological analysis library
-
garde
Validation library
-
charabia
detect the language, tokenize the text and normalize the tokens
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
synoptic
low-level, syntax highlighting library with unicode support
-
const-str
compile-time string operations
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
nucleo
plug and play high performance fuzzy matcher
-
unicode-script
exposes the Unicode `Script` and `Script_Extension` properties from UAX #24
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
diffy
Tools for finding and manipulating differences between files
-
inlinable_string
The `inlinable_string` crate provides the `InlinableString` type – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and the `StringExt` trait…
-
glob-match
An extremely fast glob matcher
-
barcoders
A barcode-encoding library
-
charset
Character encoding decoding for email
-
roff
ROFF (man page format) generation library
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
autocorrect
A linter and formatter to help you improve copywriting by correcting spaces, words, and punctuation between CJK (Chinese, Japanese, Korean)
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
smartcat
Putting a brain behind `cat`. CLI interface to bring language models in the Unix ecosystem 🐈‍⬛
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
ferris-says
flavored replacement for the classic cowsay
-
line-index
Maps flat `TextSize` offsets to/from `(line, column)` representation
-
ascii
ASCII-only equivalents to `char`, `str` and `String`
-
typstyle
The CLI for Typstyle
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
stringsext
find multi-byte-encoded strings in binary data
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
entities
raw data needed to convert to and from HTML entities
-
kapow
Template processor
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
uncomment
A cli tool to remove comments from code. Supports multiple languages.
-
str_indices
Count and convert between indexing schemes on string slices
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
epub-builder
generating EPUB files
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
icu_pattern
ICU pattern utilities
-
regex-cursor
regex fork that can search discontiguous haystacks
-
text-size
Newtypes for text offsets
-
hgrep
grep tool with human-friendly search output. This is similar to the `-C` option of the `grep` command, but its output is enhanced with syntax highlighting focusing on human readable outputs.
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
matchers
Regex matching on character and byte streams
-
hck
A sharp cut(1) clone
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
pluralizer
Rust package to pluralize or singularize any word based on a count, inspired by the pluralize NPM package
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
wildcard
matching
-
mkrs
Build automation tool
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
bundle_repo
Pack a local or remote Git Repository to XML for LLM Consumption
-
font-types
Scalar types used in fonts
-
mdxjs
Compile MDX to JavaScript in Rust
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
languagetool-rust
LanguageTool API bindings in Rust
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
pdf-extract
extract content from pdfs
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
trivet
Parser Library
-
stop-words
Common stop words in many languages
-
vaporetto
pointwise prediction based tokenizer
-
picovoice
Rust SDK for Picovoice's voice recognition platform
-
difflib
Port of Python's difflib library to Rust
-
netidx
Secure, fast, pub/sub messaging
-
unindent
Remove a column of leading whitespace from a string
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
llmvm-core
The core application for llmvm
-
cargo-spellcheck
Checks all doc comments for spelling mistakes
-
tossicat
A library that converts a given Korean postposition (tosi/josa) into the appropriate form for the given word
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
thoth-note
note-taking app written in Rust
-
mdbook-preprocessor-graphql-playground
A preprocessor for mdbook to add GraphQL playgrounds
-
blurhash
A pure Rust implementation of blurhash
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
hyperlink
Very fast link checker for CI
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
cow-utils
Copy-on-write string utilities for Rust
-
any_ascii
Unicode to ASCII transliteration
-
unicode-reverse
Unicode-aware in-place string reversal
-
slug
Convert a unicode string to a slug
-
omekasy
Decorate alphanumeric characters in your input with various fonts; special characters in Unicode
-
norad
Read and write Unified Font Object files
-
text_io
really simple to use panicking input functions
-
allms
One Library to rule them aLLMs
-
diff-match-patch-rs
The fastest implementation of Myers' diff algorithm to perform the operations required for synchronizing plain text
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
zawk
An efficient Awk-like language implementation in Rust with stdlib
-
aho-corasick-unsafe
Fast multiple substring searching
-
create_broken_files
Create broken files from other ones
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
hypher
separates words into syllables
-
collclean
Clean up collaboration commands in LaTeX files
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary patterns
-
near-facsimile
Find similar or identical text files in a directory
-
mdcat
cat for markdown: Show markdown documents in terminals
-
unicode-ccc
Unicode Canonical Combining Class detection
-
srgn
A grep-like tool which understands source code syntax and allows for manipulation in addition to search
-
uwc
Counts things in unicode text files
-
precis-profiles
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266
-
sd
An intuitive find & replace CLI
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
jx
An interactive JSON explorer for the command line
-
mdbook-epub
An EPUB renderer for mdbook
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
pad
padding strings at runtime
-
igrepper
The interactive grepper
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
sile
Simon’s Improved Layout Engine
-
mdbook-combiner
combine mdbook summaries from multiple sources into one mdbook
-
skyspell
Fast and handy spell checker for the command line
-
regex-literal
delimited regular expression literals
-
asciidork-cli
Asciidork CLI
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
tbll
tbll outputs data in tabular format
-
molybdenum
Recursive search and replace CLI application
-
moonwave
generating documentation from comments in Lua source code
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
stylin
Convert markdown to pandoc markdown with custom styles
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
fm
Non-backtracking fuzzy text matcher
-
reword
some utility functions for human-readable formatting of words
-
vidyut-prakriya
A Sanskrit word generator
-
nvl-cli
A program to download webnovels
-
arf-strings
Encoding and decoding for ARF strings
-
qpdf
Rust bindings to QPDF C++ library
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and computes total income and total tax paid according to chosen tax residency (currency)
-
ncount
A word count tool intended to derive useful stats from markdown
-
gibberish-or-not
Figure out if text is gibberish or not
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
mdbook-quiz
Interactive quizzes for your mdBook
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
console_static_text
Logging for text that should stay in the same place in a console
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
htmd-cli
The command line tool for htmd
-
pathmut
Command line utility for manipulating path strings
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the `refactor` sub-command. Compile BibTeX files to legacy `thebibliography` TeX code using the `compile` sub-command…
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
fasttext
binding
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
rsrpp-cli
project for research paper pdf
-
mdfried
A markdown viewer for the terminal that renders images and big headers
-
byteyarn
hyper-compact strings
-
say-rust
command-line tool which is an alternative to echo
-
autumnus
Syntax highlighter powered by Tree-sitter and Neovim themes
-
dom_smoothie
extracting relevant content from web pages
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
wordcut-engine
Word segmentation/breaking library
-
texted
A markdown blog platform that keeps you in control of your data
-
repgrep
An interactive command line replacer for `ripgrep`
-
morse_n_s
Test program that plays Morse code "N"s using Rust and CPAL, inspired by its use in historical aviation communications, including transmissions by Amelia Earhart
-
dprint-plugin-jupyter
Formats code blocks in Jupyter notebooks
-
site
fast, opinionated static site generator
-
yara-x-parser
A parsing library for YARA rules
-
document_tree
reStructuredText’s DocumentTree representation
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
quixote
Quizzes and tests in Markdown
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
unicode-truncate
Unicode-aware algorithm to pad or truncate `str` in terms of displayed width
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
harper-core
The language checker for developers
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
pager
pipe your output through an external pager
-
zhconv
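Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata. Converts among Simplified, Traditional, and regional (cross-strait, Singapore/Malaysia) Chinese vocabulary, based on MediaWiki and OpenCC word conversion…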
-
codetypo-dict
Source Code Spelling Correction
-
dccmd-rs
A command line client for DRACOON
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
COXave
Instruments for codings
-
dmos
Djot HTML renderer with advanced features
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
charasay
The future of cowsay 🐮! Colorful characters saying something 🗨️
-
duvet
A requirements traceability tool
-
instant-segment
Fast English word segmentation
-
character_converter
Turn Traditional Chinese script to Simplified Chinese script and vice-versa and tokenize
-
dptran
run DeepL translations on the command line, written in Rust
-
newline-converter
Newline byte converter library
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
vlazba
Lojban words generator and analyzer
-
kalosm-learning
A simplified machine learning library for building off of pretrained models
-
addbib
An app to add linked bibliographies to markdown files
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
libharu_ng
Easily generate PDFs from your Rust app
-
quickmd
Quickly preview a markdown file
-
percent-encoding
Percent encoding and decoding
-
stfu8
Sorta Text Format in UTF-8
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
mdbook-theme
A preprocessor and a backend to configure the theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the official ace editor
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
rst
a reStructuredText parser and renderer for the command line
-
regexml
XPath compatible regex engine
-
babel
Map Rust enums to different LLM providers' model strings
-
cskk
A kana-to-kanji conversion library using the SKK (Simple Kana Kanji henkan) method, intended for use through the C ABI
-
sapling-streampager
streampager is a pager for command output or large files
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
picodiff
Tiny GUI app to compare text easily
-
secular
No Diacr!
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
rapidfuzz
rapid fuzzy string matching library
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
plsfix
Text cleaner upper
-
minimo
terminal ui library combining a lot of things from here and there and making it slightly easier to play with
-
text-to-ascii-art
program to convert text to ASCII art
-
nmd
Official NMD CLI and compiler
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as `ẑ`, `ŋ` and `ê`.
-
emojic
Emoji constants
-
herring-automata
Automata construction for Herring
-
vader-sentimental
A faster Rust version of the original Python VaderSentiment analysis tool
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
ascii_help
help you quickly convert ASCII codes
-
quranize
Encoding transliterations into Quran forms
-
clipcount
Counting words from the clipboard content
-
mandown
Markdown to groff (man page) converter
-
pks
Welcome! Please see https://github.com/alexevanczuk/packs for more information!
-
dmntk-gendoc
DMNTK | Documentation generator
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
replaxe
A command-line tool to replace text in files with easy patterns
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
mdbook-open-on-gh
mdbook preprocessor to add an open-on-github link on every page
-
iepub
Reading and writing EPUB and MOBI e-books
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
notion2html
Convert Notion pages to HTML
-
arrow-string
String kernels for arrow arrays
-
textra
type less, write more
-
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
autotex
Continuously compile TeX and LaTeX
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
furigana
Map furigana to a word given its reading
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
asciimath-unicode
Convert asciimath to unicode
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
cai
The fastest CLI tool for prompting LLMs
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
gh-emoji
Convert `:emoji:` to Unicode using GitHub’s emoji names
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
mdbook-angular
mdbook renderer to run angular code samples
-
wcc
my own version of wc for personal use
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
oxford_join
Join string slices with Oxford Commas!
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
hauchiwa
Incredibly flexible static site generator library with incremental rebuilds and cached image optimization
-
dodo-zh
operate on pinyin and cedict.u8 file
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
frawk
an efficient Awk-like language
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
vesti
A preprocessor that compiles into LaTeX
-
epcmanager
EPC text tool for RFID
-
dprint-plugin-markdown
Markdown formatter for dprint
-
simple-sds-sbwt
A fork of simple-sds used in the sbwt crate
-
lll_toon
convert JPG images to ASCII art with colors
-
soundex-rs
that calculates soundex value
-
mdbook-tailor
mdbook preprocessor for image-tailor
-
anda_engine
Agents engine for Anda -- an AI agent framework built with Rust, powered by ICP and TEEs
-
cargo-sync-rdme
Cargo subcommand to synchronize README with crate documentation
-
termdiff
Write a diff with color codes to a string
-
reggy
friendly, resumable regular expressions for text analytics
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
mdmodels
generate models, code and schemas from markdown files
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
unidoc
Unite all Markdown
-
line-span
Find line ranges and jump between next and previous lines
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
charx
A replacement for char::is_ascii*
-
dnd-character
A Dungeons and Dragons character generator
-
sk-skimmer
Fuzzy Finder in rust!
-
deindent
A command line utility and Rust library to format overly-indented text
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
mdbook-pdf-headless_chrome
Control Chrome programmatically
-
domrs
Document builder and serializer
-
asciidork-parser
Asciidork parser
-
textgrid
working with PRAAT .TextGrid files with parsing, writing, manipulation, and history tracking modules for TextGrid data
-
kathoey
text feminization using open corpus linguistics data
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
bogrep
Full-text search for bookmarks from multiple browsers
-
aki-mcolor
mark up text with color
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
colored_text
adding colors and styles to terminal text
-
indent
Functions for indenting multiline strings
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
md-tui
A terminal markdown viewer
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
repvar
A tiny CLI tool that replaces variables of the style `${KEY}` in text with their respective value. It can also be used as a rust library
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
ib-pinyin
A high-performance pinyin matching library
-
bnf_sampler
that uses recursive descent algorithm to ensure tokens produced by a large language model follow a Backus Naur Form schema
-
whitespace-sifter
Sift duplicate whitespaces away!
-
typedoc_json_to_md
Converts TypeDoc JSON generated by typedoc-json-parser to Markdown
-
portmanteau
create portmanteaux
-
koji
An interactive CLI for creating conventional commits
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
regex-charclass
Manipulate and convert regex character classes
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
bashdoc
generating documentation/help menu for user defined bash functions
-
html-compare
compare html files
-
extract_anchors
A utility for extracting all marked fragments from source code
-
see-cat
A cute cat(1)
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
shell2batch
Converts simple basic shell scripts to windows batch scripts
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
unidown
Convert Markdown to Unicode
-
reflexo-typst
Bridge Typst to Web Rendering, with power of typst
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
gulagcleaner_rs
Ad removal tool for PDFs
-
mdbook_ls
mdBook Language Server
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
bbd
Binary Braille Dump
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
frizbee
SIMD smith waterman for absurdly fast fuzzy matching, similar algorithm to FZF/FZY
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
chewing-cli
Tools of the Chewing (酷音) intelligent Zhuyin input method
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
rawcode
Implements a simple as-is encoding format
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
minspan
a package for determining the minimum span of one vector within another
-
text-editing
string with utilities for editing
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
bullet_stream
Bulletproof printing for bullet point text
-
substring
method for string types
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
hh-highlight
hh: The text highlighter. Read from stdin, and highlight selected words printed to stdout
-
fix-name-case
CLI tool to convert variable and function names to snake_case
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
runi
a CLI tool to generate unicode fonts
-
armnod
random string generator
-
mktoc
Generate Table of Contents from Markdown files
-
numeric-sort
A zero-allocation, human-readable sorting library
-
tish
A tiny, fast shell with not so tiny features
-
vidyut-lipi
A Sanskrit transliterator
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
utf16string
String types to work directly with UTF-16 encoded strings
-
rust_file_encode_mode_convert
A Rust library for detecting a file's encoding. Supports multiple encodings, including GBK, GBK2312, UTF8, UTF16LE, UTF16BE, UTF8+BOM, and UTF32.
-
cliche
Dead simple static site generator
-
substudy
Language-learning tools for working with parallel, bilingual subtitles and media files
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
rwalk
A blazing fast web directory scanner
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
codepage
Mapping between Windows code page numbers and encoding_rs character encodings
-
gst-plugin-regex
GStreamer Regular Expression Plugin
-
n_gram
training n-gram language models
-
mdtrans
Markdown parser and transformer using pest.rs, focused on flexibility to a project’s needs
-
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
whatwg-datetime
parsing the datetime microsyntax, as defined by the WHATWG HTML Standard
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
reason-shell
Reason: A Shell for Research Papers
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
pray
A TUI tool for preparing a prompt for LLMs
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
xml_magic
A reasonably fast XML formatter
-
rins_markdown_parser
Markdown parser written in Rust
-
kas-text
Text layout and font management
-
mdlink
Auto-convert HTTP links for your favorite services into nice Markdown links
-
nu_plugin_regex
nu plugin to search text with regex
-
indent_write
Write adapters to add line indentation
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
wildcard_ex
extended wildcards that allow VB-like specifications
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
hanconv
Convert between Chinese character variants
-
colonnade
format tabular data for display
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
mdbook-tocjs
An mdbook preprocessor which adds extra JS and CSS files for ToC hydration
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
mdbook-nice
An mdbook plugin to add nice CSS to your book
-
mdbook-linkcheck2
A backend for mdbook which will check your links for you
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
timug
Created for building a personal blog. Timug has its limits, but it fulfills the purposes for which it was created.
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
libchai
Optimization algorithm for Chinese character encoding
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
mdbook-llms-txt-tools
convert mdbook to llmstxt.org format
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
word-tally
Output a tally of the number of times unique words appear in source input
-
strloin
copy on write slices of a string
-
filename-refactor
Command to refactor file names
-
mini_markdown
Small markdown renderer
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
orly
Download O'Reilly books as EPUB
-
html-auto-p
function like wpautop in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags.
-
sqdj
sqdj shortens delimited data
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuation and words to fit a line width
-
sanitation
developing memory-safe programs while detecting and capturing possibly malicious bytes
-
inlet_manifold
A general purpose highlighting library
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
maxlen
Length-bounded string and slice/vector
-
agentai
designed to simplify the creation of AI agents
-
mdsh
Markdown shell pre-processor
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
tergo-formatter
Formatter for tergo
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
mtc-token-healing
Token healing implementation
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
utf16_lit
macro_rules to make utf-16 literals
-
jayce
tokenizer 🌌
-
ncase
Enforce a case style
-
clima
A minimal Markdown reader in the terminal
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curly quotes, except within code blocks or code spans
-
keyphrases
Rapid Automatic Keyword Extraction (RAKE) implementation in Rust
-
catalog-of-markdown
Generate a catalog for a Markdown file
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
linebreak
breaking a given text into lines within a specified width
-
fast_symspell
Spelling correction & Fuzzy search
-
utilprint
easily using colors or unicode in your terminal printing
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
lexical-sort
Sort Unicode strings lexically
-
iregex
Intermediate representation for Regular Expressions
-
lintestor
Linux package tester
-
kanaria
functions such as hiragana, katakana, half-width and full-width mutual conversion and discrimination
-
gosyn
Golang syntax parser in Rust
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
furze
finite state transducers (fst) written in Rust
-
erebus
A CLI message generation library
-
itex
Initialize a LaTeX project inside a folder instantly
-
cbfr
A buffer that runs on the stack, focusing on performance and speed
-
cli_app_capo
CLI application with Unix-like tools
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
cloc
Count, or compute differences of, lines of source code and comments
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
linkcheck2
extracting and validating links
-
dcsv
Dynamic CSV reader, writer, editor
-
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
gigagei
random quote fetching console utility
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
ean-rs
generating and validating EAN barcodes
-
uwurs
UwUify your strings with uwurs!
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
adobe-cmap-parser
parse Adobe CMap files
-
htmd
A turndown.js inspired HTML to Markdown converter
-
imstr
Cheaply clonable and slicable immutable strings
-
galm
pattern matching library
-
csv_to_table
pretty print CSV as a table
-
huggingface/tokenizers-python
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
-
crlify
A std::io::Write wrapper that replaces \n with \r\n on Windows
-
human_regex
A regex library for humans
-
yy1
Tiny utility to convert KiCad centroid files into Neoden YY1 pick and place machine format
-
readability
Port of arc90's readability project to rust
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
ethan-rs-wc
ethan-rs-wc (erwc) counts words, lines, characters, and bytes. Like the wc command, but more accurate and faster. Text can also be read from standard input.
-
shwild
(for Rust)
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
crowbook-text-processing
some utility functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, French typographic rules)
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
commit_crafter
AI-powered tool for generating Git commit messages
-
like
SQL LIKE-style pattern matching
-
libannict
Client library for the Annict API
-
morse_code_parser
A Morse code parser and decoder implemented in Rust
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
magic_string_search
string search library to rank strings based on their similarity to a query string
-
lowcharts
draw low-resolution graphs in terminal
-
simple-ssg
Plain and simple static site generator for Djot and Markdown light markup languages
-
pandoc_types
Rust port of pandoc-types
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
ungoliant
The pipeline for the OSCAR corpus
-
asoiaf-api
Rust wrapper for the api of ice and fire
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
asciit
A compact and visually appealing ASCII table for your terminal, featuring colored numbers and letters
-
paltoquet
rule-based general-purpose tokenizers
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
buf-min
Minimal utf-8 safe buffer traits
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
mdopen
Preview markdown files in a browser
-
subtitler
parsing and generating subtitles
-
llmvm-codeassist
An LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
syllabize-es
Syllabize Spanish text, and much more
-
parse-wiki-text-2
Parse wiki text from Mediawiki into a tree of elements
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
docket
markdown to HTML documentation rendering
-
forbidden-bands
8-bit string handling library
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
array_tool
Helper methods for processing collections
-
words-count
Count the words and characters, with or without whitespaces
-
chord3
Create pdf songbooks from chopro source
-
epub2mdbook
convert EPUB files to MDBook format
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
vi
An input method library for Vietnamese IME
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
cqtool
converting between CQ strings and message segment arrays
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
codetypo-vars
Source Code Spelling Correction
-
correct_word
A no-brainer 'did you mean' library for Rust
-
encoding-next
Character encoding support for Rust
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
sgrep
grep util for those too lazy to remember many command line options
-
figlet-comment
quickly create banners to use as comments
-
files-to-text-cli
Files to text cli
-
etch
Not just a text formatter, don't mark it down, etch it
-
four-char-code
A string of 4 ascii chars represented by an u32
-
streampager
pager for command output or large files
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
scrunchy
echo alt
-
aki-gsub
substitute text command, replace via regex
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
unicode-intervals
Search for Unicode code point intervals by including/excluding categories, ranges, and custom character sets
-
unicode_names2_generator
Generates the perfect-hash function used by unicode_names2
-
trust_pdf
Verifies signed PDFs against the originals, checking for sneaky modifications
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
uclanr
A random word picker that gives you actually useful words
-
watch-files
A dead simple file system watcher
-
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
serpl
terminal UI for search and replace, à la VS Code
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
bubble-bath
Small and quick HTML sanitizer
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
regex_generate
Use regular expressions to generate text
-
mdbook-pagetoc
An mdbook plugin that provides a table of contents for each page
-
unified-diff
GNU unified diff format
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
shaperglot
Test font files for OpenType language support
-
ratskin
A wrapper around termimad that produces ratatui widgets from markdown
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
ldd_md_parse
simple Markdown to HTML tool
-
slugify-rs
generate slugs from strings
-
mdbook-fs-summary
Summary generator for mdbook
-
unicode-matching
match Unicode open/close brackets
-
context-notation
Featherweight semantic notation for text
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
sancus
open-source tool that extracts third-party license information from a deployment-ready application
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
afrim-translator
Manage the predication system of the afrim input method
-
libretranslate
A wrapper for the LibreTranslate web API
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
wikidump
parsing Mediawiki XML dumps
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
ens-normalize-rs
Ethereum Name Service (ENS) name normalization
-
named_entity_parsing
Named entity parser. Used in Rusev to parse a list of tokens into a list of entities.
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
gh_page_tool
A GitHub gh-pages tool for static blog sites
-
gannot
A small library with limited functionality focused on genome annotation
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
tokenizations
alignments library
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
diffy-imara
Tools for finding and manipulating differences between files
-
hangeul
Korean alphabet manipulation library
-
dnd_spellbook_maker
making pdf documents of 5th edition D&D spells that are formatted like D&D source books
-
recipe_parser_andrewromanyk
A parser for food recipes. Divides recipes into: name, description, ingredients and instructions.
-
scraps_libs
A static site generator that makes it easy to write internal links (Wiki links) in Markdown files
-
case
A set of letter case string helpers
-
metatron
core library
-
iregex-syntax
Common syntax for regular expressions
-
wikipedia_prosesize
Count Wikipedia prose size
-
simple-logging
logger for the log facade
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
svgbob
Transform your ascii diagrams into happy little SVG
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
pulldown_mdbook
A pull parser for mdBook
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
fr_alebref_libbrefdata
BrefData library
-
libanubhav
management system written in Rust
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
rsrusl
A really simple useful library ported to Rust
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
minos-codex
Minos Codex is a tool for detecting and identifying secrets in a string
-
zipcodes
Query US zipcodes without SQLite
-
unicount
Alphabetic counter supporting unicode
-
block-id
generating opaque, unique, and short string values from (unsigned) integers
-
iregex-automata
Finite automata definitions for the iregex crate
-
enc-check
inspect utf-8 and utf-16 character encodings
-
mdi
markdown include
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
chronofold
A conflict-free replicated data structure (a.k.a CRDT) for versioned text
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
split-every
Split for every n occurrences of a pattern iteratively!
-
tracery
Text-expansion library
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
emoji
Every emoji, their metadata, and localized annotations
-
parse_wiki_text
Parse wiki text from Mediawiki into a tree of elements
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
fakelish
English-like word generator; port of Ryo Ota's fakelish
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
markov_str
Markov Chain implementation optimized for text generation
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
mazer-core
A minimal, simple math markup language that compiles to HTML, written in Rust
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
fimdoc
Friendship is Magic Document, converts Markdown into FIMFiction BBCode
-
vidyut-kosha
A Sanskrit key-value store
-
caseless
Unicode caseless matching
-
binatime
A binary clock in the terminal
-
gspell
Rust bindings for gspell
-
mdbook-github-authors
mdbook preprocessor to display Github profiles of authors of a page
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
enma
serving anime and manga information 📦
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
chat_commands_parcer
the theme of the parser is command-like input in environments such as the chats of different social apps (Discord, Telegram). It parses users' input commands in the format: /command_name|identifier (arguments)…
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
enum-ts
TypeScript Enum pattern matcher codegen
-
vcsq_lib
vcsq (vcsQuery): Version Control System (VCS) Querying CLI
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
caer
Tools for making things on the web
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
mini-openai
An OpenAI API client with minimal dependencies
-
unixstring
An FFI-friendly null-terminated byte string
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
adulting
A program to print one rule at a time from The 25 Principles for Adult Behavior: John Perry Barlow
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
xee-interpreter
Interpreter for XPath and XSLT
-
ps-str
String transcoding library
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
typeline_ext_utils
operators for typeline
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
uwl
A management stream for bytes and characters
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
thesaurus
An offline thesaurus library for Rust
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
matrix_display
display a matrix in command line with pretty unicode separators
-
utf8streamreader
lookahead iterator on an utf8 byte stream
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
bigstr
A command-line tool to make string BIG
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
fast-str
A flexible, easy-to-use, immutable, efficient String replacement for Rust
-
html_to_epub
A command-line tool that converts .html files to .epub files
-
mdbook-linkcheck
A backend for mdbook which will check your links for you
-
source-cache
Query the cache of raw text
-
koto_regex
A Koto library for working with regular expressions
-
vibrato
viterbi-based accelerated tokenizer
-
gregex
Regex solver utilizing NFA
-
yozuk
Chatbot for Programmers
-
timeblok
A language for event scheduling in plain text
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
ascii-img-cli
Command-line tool for using ascii-img
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
unicode_extension
Don't use this crate
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
bump-bin
Increments version with semver specification
-
modeling
tools to analyze different languages using Ctags
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
rustpython-parser-vendored
RustPython parser vendored third-party crates
-
spanned
string processing with file/line/col information and the regular rust str API
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
sedregex
Sed-like regex library
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
filenamify
Convert a string to a valid filename
-
easy-regex
Make long regular expressions read like pseudocode
-
prescript
parsing and executing Prescript scripts
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
font-map
Macros and utilities for parsing font files
-
abbreviation_extractor
extracting abbreviations from text
-
rust-base62
base62 encode/decode, no dependencies on other libraries
-
project_tstings
test project
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
rust-texas
generate latex documents
-
text-tokenizer
Custom text tokenizer
-
xi-rope
A generic rope data structure built on top of B-Trees
-
trpl
A support crate for The Rust Programming Language book
-
committer
git commit message generator
-
zoitei
alphabet conversions
-
qpprint
console printing/formatting
-
aho-corasick
Fast multiple substring searching
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
deliminator
Universal code documentation generator
-
glimpse
A blazingly fast tool for peeking at codebases. Perfect for loading your codebase into an LLM's context.
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of rsonpath.
-
readarr
API docs
-
vndb-api
Fully Functional Visual Novel Database (VNDB) HTTPS API Wrapper
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
sbert
Sentence Bert (SBert)
-
markdown-extract
Extract sections of a markdown file
-
flxy
Full-text searching and scoring of strings
-
IndicScriptSwap
help transliterate between various Indic scripts. It is not ready yet and has many issues. If you encounter any issues, please contact me (https://github.com/mssrprad/transliterate-ferris/tree/cli or pradyumna…
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
gerber-types
Types and code generation for Gerber files (RS-274X)
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
abbreviator
abbreviating long words
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
ohnomore
Transformations for TüBa-D/Z lemmas
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
ob
A Blog and RSS system written in Rust
-
tre-regex
Rust safe bindings to the TRE regex module
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
diff-man
diff utility lib
-
tagsearch
Filter plaintext files based on @keyword tags
-
loc
Count lines of code (cloc) fast
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
bitutils2
A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
heart-strings
Quickly get random heart emojis to copy!
-
viterbi_pos_tagger
A part-of-speech (POS) tagger using the Viterbi algorithm
-
betacode
conversion
-
kproc
Knowledge Processing library
-
pdf-sign
extract signed date from pdf file
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
lookbook
Component preview framework for Dioxus
-
replace_cli
copy contents of a given file to another
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
pprint
Flexible and lightweight pretty printing library for Rust
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
image-to-ascii
Converts images and gifs to ascii art
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
regex-chunker
Iterate over the data in a Read type in a regular-expression-delimited way
-
pukram2html
converting Pukram-formatted text to HTML
-
spf
.spf (Simple Pixel Font) file parsing, and useful APIs to go alongside
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
antex
Styled text and tree in terminal
-
character-set
High performance set.contains(char)
-
skyspell_kak
skyspell - kakoune integration
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
doccy
brace based markup language
-
leptos-markdown
A component which can render Markdown as HTML elements in Leptos
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
hns
Human numeric sorting program: does what sort -h is supposed to do!
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
ascii-hangman
customizable Hangman game with ASCII-art rewarding for children (desktop version)
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
butterkups-minigrep
Mini grep utility; very weak application, use grep instead
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
hashmoji_generator
Code generation tool for hashmoji
-
emoji-commit
Make your git logs beautiful and readable with the help of emojis
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
choco
markup language for dialogue systems
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
bstr
A string type that is not required to be valid UTF-8
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
matchpick
Find and replace multi-lines using a match-case
-
marcus
An experimental Markdown parser written in Rust
-
chinese-numerals
Convert numbers to Chinese numerals
-
grammalecte_client
Grammalecte HTTP client
-
rsnltk
Rust-based Natural Language Toolkit
-
unflow
DSL to convert design to code
-
quickner-core
A fast and simple NER tool
-
tantivy-jieba
that bridges between tantivy and jieba-rs
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
ipa-translate
translating between IPA and ASCII text
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot
-
hcompress
HCompress Library
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
nfa_regex
NFA regex engine for text processing
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
tinyvec_string
tinyvec based string types
-
merge3
merge tool for three-way merges
-
just-enough-emojis
text to emoji cli
-
rk-utils
A collection of utility functions and data structures for rust
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
man
Generate structured man pages
-
symspell
Spelling correction & Fuzzy search
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
mdplayscript
An extension of Markdown for play scripts
-
slicestring
slicing Strings
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
rcut
replacement for GNU cut that supports UTF-8
-
lithe
A Slim template engine by using Pest
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
hexroll3-scroll
HEXROLL3 Scroll - the sandbox content generator
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
wtf8-rs
WTF-8 encoding
-
texcore
Create LaTeX documents using native Rust types
-
zalgo-codec-common
Convert an ASCII text string into a single unicode grapheme cluster and back
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
bcdown
Bilibili manga downloader, written in Rust; supports EPUB, PDF, and ZIP formats
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
realhydroper-utf16
Work with UTF-16 in Rust
-
moto
motivated automation
-
character_frequency
counting character frequencies in a string concurrently
-
supercat
A syntax highlighting alternative to cat
-
unicode-box-drawing
Unicode box-drawing characters
-
basalt-core
core functionality for Basalt TUI application
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
ragzilla
providing tools for RAG (Retrieval-Augmented Generation) pipelines
-
mdzk
Plain text Zettelkasten based on mdBook
-
display_bytes
Human-readable display of byte sequences
-
anslatortray
translate from English to Pig Latin!
-
alphabet_detector
Natural language alphabet detection library
-
gdnative-doc
Documentation tool for gdnative
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
unicodeit-cli
The command line interface to unicodeit
-
mitex-spec-gen
Guard to generate specification files for dependent crates
-
file-expert
Expert system for recognizing source code files, similar to GitHub/linguist
-
rmw-utf8
Short-text compression algorithm for UTF-8 (optimized for Chinese, developed in the Rust programming language)
-
tgrep
Toy grep that honors .gitignore
-
fsays
flavored replacement for the classic cowsay
-
regex-split
split_inclusive for the regex crate
-
unicode-language
detect language coverage given a list of codepoints
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
cosmic-text
Pure Rust multi-line text handling
-
recase
Changes the convention case of input text
-
org-rust-exporter
exporter for org mode documents parsed with org-rust-parser
-
mecab
Safe Rust wrapper for MeCab, a Japanese-language part-of-speech and morphological analyzer library
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
text_lines
Information about lines of text in a string
-
garde-fr
Validation library
-
conventional_commits_next_version_lib
A tooling and language agnostic library to calculate the next semantic version based on the Conventional Commits since the prior version. Supports monorepos.
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
mazer-cli
A minimal, simple math markup language that compiles to HTML, written in Rust
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
bureau
extracting dependency licenses, used by nertsio
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
rust-regex-dsl
Regular expression DSL
-
cli_app_capo15
CLI application with Unix-like tools
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
mdbook-plugin-utils
mdBook plugins
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
detone
Decompose Vietnamese tone marks
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
natural
Pure rust library for natural language processing
-
mdbook-variables
mdBook preprocessor to resolve variables configured in book.toml
-
fast_trie
A memory efficient trie library
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
squidge
shortens delimited data
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
hydroper_source_text
Source text containing line locations
-
opstr
‘Operate on strings’ command line utility
-
lorem-rustum
lib for generating lorem-ipsum with a rusty fleur
-
darkdown
A darkdown (our own markup language) parser written in Rust
-
baste64
A base64 codec
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
m2h
Convert Markdown to HTML with syntax highlighting
-
lindera-filter
Character and token filters for Lindera
-
interpolate
form of string interpolation
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
mdbook-najan
Preprocessor for the Najan mdBook
-
mdbook-indexing
mdbook preprocessor for index generation
-
formatjson
Formats JSON files
-
onig_sys
onig_sys crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…
-
ultra-nlp
A NLP library
-
string-simple
containing some simple string utilities that I use in my other projects
-
str-utils
some traits to extend types which implement AsRef<[u8]> or AsRef<str>
-
unicode-casing
Titlecase helper function on characters
-
interslavic
in rust
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
twitter-text
in Rust
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
recs_lib
A confusing library for encrypting files and text
-
hmd
Custom Markdown Engine for my personal blog
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
icemelter
minimize files that trigger internal compiler errors (ICEs)
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
google-fonts
Download and cache TTF fonts from Google
-
beautify
your terminal
-
qm-role-build
Role builder from markdown tables
-
nu_plugin_emoji
a nushell plugin called emoji
-
gstring
String with support for Unicode graphemes
-
when
'When' parses natural language date/time and produces computer friendly output structures
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
hyphenation_commons
Proemial code for the hyphenation library
-
cheetah
An extremely fast and flexible static site generator
-
chinese
language nlp tools
-
mask-text
mask text with multiple masking options
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
deinflect
japanese deinflection
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
imperative
Check for imperative mood in text
-
tu
CLI tool to convert a natural language date/time string to UTC
-
rand-hira
CLI tool to generate random hiragana characters
-
aki-mcycle
mark up text with cycling color
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a String's case (snake_case, PascalCase, ...)
-
tectonic_engine_bibtex
The bibtex program as a reusable crate
-
rizzer
Fuzzy matching tool to find string similarity
-
rep-grep
wgrep/write-grep CLI
-
r4d
Text oriented macro processor
-
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
enpsrlib
English Phrase Structure Rules library
-
story-dl
Story web scraping
-
dekor
styling and character repository in Rust
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
xmlwriter
streaming XML writer
-
rusile
components for the SILE typesetter
-
nugine-rust-utils
Nugine's personal Rust utilities
-
sayit
String replacements using regex
-
common-words-all
Most common words sorted by ngram frequency
-
zummi
fun lib that produces spoonerisms
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
tzgrep
grep tar.gz
-
crate-starter
starter
-
pygmentize
wrapper for syntax highlighting
-
less
pager utility for displaying file contents or piped input, with dynamic scrolling and search functionality
-
texc-latex
Contains LaTeX templates for TeXCreate
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
apple-notes-exporter
CLI tool for exporting Apple Notes to Markdown
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
schmfy
Schmfication library
-
pdfcr
render a codebase to a pdf
-
jot_note
Jot Note Taking CLI
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
rustyink
Blazing fast static site generator
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
dr
Command-line data file processing in Rust
-
unicode-width-16
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
fbihtax
CLI tool to help manage tax payments in FBiH (Bosnia and Herzegovina Federation)
-
markdown-toc
Markdown Table of Contents generator
-
text_unit
Newtypes for text offsets
-
simple_peg
A command-line peg parser implemented by Rust
-
iocutil
IoC utilities for malware researchers
-
md-designer
A CLI tool for creating design docs in Markdown
-
nesty
Generate code with human readable indentation
-
subject-classifier
classifying a commit by its subject
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
align
aligning text
-
cow-rewrite
Rewrite copy-on-write types copying only when it's necessary
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
haoxue-dict
Chinese dictionary and word segmenter
-
guarding
guardians for code, architecture, and layering. Guarding is an architecture guard DSL based on ArchUnit.
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
inslice
A command-line utility for filtering text input by columns and rows
-
codetypo-cli
Source Code Spelling Correction
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
path2regex
Express style path to RegExp utility
-
mago-docblock
Analyzes PHP docblocks to extract annotations, tags, and documentation comments, aiding tools that rely on inline documentation
-
codegenrs
Moving code-gen out of build.rs
-
pi_ucd
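Unicode character functions: get the language range a character belongs to and, as needed for text layout, determine whether a character is a single-word character or a letter character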
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
yazi-prebuild
Used to place the pre-built assets of yazi (https://github.com/sxyazi/yazi)
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
runestr-pancjkv
rune-based Pan-CJKV support
-
pillar
small tool to format lines into columns
-
mdlc
Markdown Link Checker. Find broken web and local links.
-
table_to_html
interface to convert a tabled::Table into an HTML table (<table>)
-
mapm
A set of tools, with command line and graphical interfaces, used to build exams typeset in LaTeX
-
stringsort
Pathological sorting of string characters
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
refac
Transform some text given a generic natural language prompt
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
langsan
sanitizing language model input and output
-
rjoin
joining CSV data on command line
-
char-ranges
Iterate chars and their start and end byte positions
-
zw
encoding and decoding text using zero-width characters
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
spider_scraper
A css scraper using html5ever
-
angr
analyse ngrams in text files
-
edit
Open a file in the default text editor
-
globber
Extended glob matching library
-
rust_hero
Rust assistant that utilizes NLP to enhance the quality of rust code
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
incredimo
just another font for your terminal
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
hina
:]
-
natural_syntax
Natural language syntax highlighting
-
hebrew
alephbet primitives and parsing library for Rust
-
xhtmlchardet
Character set detection for XML and HTML
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
parser-cli
Command-line interface for extracting text from various file formats
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
oppenheimer
Hierarchical listboards for your terminal
-
case-conv
Faster case conversion crate
-
tantivy-tokenizer-api
Tokenizer API of tantivy
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
clipboard_macos
obtain access to the macOS clipboard
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
mdbook-check-missing-md
A backend for mdbook which will find Markdown files you forgot in SUMMARY.md
-
synox
Program synthesis of string transformations from input-output examples
-
quake_text
Utils for Quake strings and characters
-
strip-tags
Strip HTML and PHP tags from strings
-
unic-ucd-version
UNIC — Unicode Character Database — Version
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
ascii-img
Convert images to ASCII
-
text-colorizer
Transitionary package
-
hitori
Generic compile-time regular expressions
-
ranting
Linguistic formatting placeholder extensions for rust
-
magic_string
magic string
-
xconv
A high-performance batch file encoding conversion tool
-
translitrs
Transliteration utility for Serbian language
-
zhlint
A linting tool for Chinese text content
-
topfew
CLI to find high frequency occurrences in structured text files
-
reg_match
A match style regex tool
-
cur
that will hunt for your regular expression
-
quartz_commands
Generates a parser at compile-time for handling commands similar in structure to those of Minecraft
-
texting
string helpers
-
text_layout
Text layout algorithms
-
chunkr
A fast and quick chunking library for rust
-
notegraf
Core library for building a graph-oriented notebook
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
num2en
For converting integer and decimal numbers into English cardinal or ordinal number words
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
readability-rs
Port of arc90's readability project to rust
-
nkl
Nuclear Kernel Library
-
recursive-file-loader
recursively load files via references in the files
-
text-utils
Text utils for unescaping and align
-
markflowy
A Markdown Editor App
-
comment-strip
Remove comments out of text files
-
warkov-wordgen
Use markov-chains to generate new items
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
findtext_doc
Search text in Document
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
backslash
parsing escape characters
-
sluggify
slug or clean URL generator for Rust. With default settings, you will get a hyphenated, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
textcat
detect text categories. It can be used to detect the language of a given text
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
mdbook-davids_cooking
A preprocessor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
caser
Change text between PascalCase, camelCase, and snake_case
-
rmemo
Tools for taking notes fast on the CLI
-
sortuniq
Find or count unique values in an input stream
-
csvsc
Build processing chains for CSV files
-
extract-words
Extracts words from text without allocation
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
arabic-script
An expressive API for the characters of the Arabic script
-
lister-cli
Lister: Navigate Markdown Lists
-
braille_pics
producing text-art pictures using Braille characters
-
xuexi
parse a sentence into words for multiple languages
-
campfire
A tiny static site generator, greatly inspired by Zola
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
wn-parser
parser for WordNet database files
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
aki-mline
match line, regex text filter like a grep of linux command
-
slack_update
app to set Slack status, emoji and photo
-
mdbook-keeper
An improved testing experience for mdbook
-
char_reader
Safely read wild streams as chars or lines
-
castle_shared_parser
Castle Shared Parser
-
scie
research about how to build simple code identify engine for different languages
-
raw_str
type which contains a byte slice that is believed to contain UTF-8 string data, but which has not been validated
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
chromalog
A customizable logger with dynamic color coding and file logging
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
is-vowel
Heuristically test whether a character is a vowel letter
-
polars_arrow_rvsry99dx
Apache Arrow
-
august
& program for converting HTML to plain text
-
hoedown
bindings for the Hoedown markdown processor
-
umlauts
text transformation of german umlauts
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
yeah
CLI tool to return the vendor name for a given MAC address
-
awabi
A morphological analyzer using mecab dictionary
-
sixbit
Small packed strings
-
markx
markdown parser
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
cha-rs
Extract specific characters from an input
-
repa
Peak Performance Pattern Seeker
-
glyphana
Quickly find, inspect & collect Unicode glyphs
-
ccase
Command line interface to convert strings into any case
-
pinot
Fast, high-fidelity OpenType parser
-
shapdf
Create Shapes into PDF
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
transition-table
transition table utilities for keyword parser
-
mojibake
Encode/Decode bytes as emoji base2048
-
onepage
static site generator
-
markov_strings
A simplistic Markov chain text generator
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
yinzhe9
Meow Meow Hermit 9 (喵喵隐者9)
-
pdf-min
Very minimal crate for writing PDFs
-
extract-strings
Extract ascii strings from files
-
wcount
CLI word counting tool
-
mepple
English dictionary as a library
-
typeline
Efficient, Type-Safe Pipeline Processor
-
mdast2minimad
converting markdown AST to minimad texts
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
tectonic_engine_spx2html
The Tectonic engine that converts SPX output to HTML
-
markov-text
creating a small markov model for text generation
-
html_to_markdown
Convert HTML to Markdown
-
encoded-words
Encoded Words for usage in MIME headers
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
english
language decliner
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
concatenator
Add two pieces of text together
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
hydroperfox-sourcetext
Source text containing line locations
-
committed
Nitpicking commit history since beabf39
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
japhonex
Japanese phone number checker for Rust
-
help_crafter
help message generator without hassle
-
amongify
A very ඞ sus ඞ program
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
linkedin-headlines
LinkedIn Headlines Generator, a Rust library to generate high-quality headlines for LinkedIn posts or profiles
-
delay_writer
Wraps a writer and delays its output after each newline
-
mdrss
generating RSS feeds from markdown files
-
fast_aug
Fast data augmentation for text
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
eytzinger
implements the "eytzinger" (aka BFS) array layout
-
searcher_txt
A copy of grep that I made to show that I'm bad at Rust
-
kth-lines
Command line tool for filtering stdin lines that just work
-
find_unicode
Find Unicode characters, the easy way!
-
ucd-raw
Uninterpreted access to the unicode UCD
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
text_magic
string manipulation, including reversing strings and checking if strings are palindromes
-
flashtext2
The FlashText algorithm implemented in Rust
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
hello_lib
Demonstrate Generics Function
-
trigram
Trigram-based string similarity for fuzzy matching
-
irssi-sys
Automatically generated bindings to irssi
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
carbon-dump
-
whitespace
Encode arbitrary data as whitespace and vice versa
-
readput
Fast and easy stdin input parsing for competitive programming in rust
-
even_bigger_s
Better String Literal
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
keyvalues-parser
A parser/renderer for vdf text
-
wcounter
Give the word and count the appearance
-
slidedeck
Create an HTML slide deck from Markdown
-
engish
A language utility for sampling letters and building words
-
neardup
near-duplicate matching
-
jcalendar
Japanese Calendar for Rust
-
kvu
The simplest command line tool to manage key-value pair lines
-
invisible_unicode
finding invisible unicode characters
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
jp-location-relation
Get a list of adjacent municipalities (cities, wards, towns, and villages)
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
lindera-cli
A morphological analysis command line interface
-
bk-tree
A Rust BK-tree implementation
-
hunspell-sys
Bindings to the hunspell C API
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
trunc8
Truncate text to a specific line length, based on a number of parameters
-
oneparse
An all-in-one simple parser framework
-
octor
rmd combines all readmes into one
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
mdtransform
program that renders Markdown as HTML for websites
-
is-digit
Detect decimal digit in char or first char of the str and String
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
markdown-composer
composing markdown documents
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
itex2mml
Convert LaTeX-formatted source (for example $a+b$) into MathML (bindings to https://golem.ph.utexas.edu/~distler/blog/itex2MML.html)
-
sol-lang
Sol toolchain and library
-
pulldown_typst
A pull parser for Typst markup
-
md-to-html
CLI tool to convert Markdown files to HTML
-
esc
Escape characters in strings
-
ftrace
trace files and paths
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
render_as_tree
visualizing tree data structures via text
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
ik-rs
Chinese word segmentation, ik-analyzer for Rust
-
mdbook-twiki
twiki backend for mdbook
-
clt
efficiently creating command line applications
-
tame-webpurify
client for the WebPurify REST API
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
summary
Extract the sentences which best summarize a document
-
strip_markdown
remove markdown syntax from markdown files
-
ansi-width
Calculate the width of a string when printed to the terminal
-
koelner-phonetik
koelner_phonetik, or Cologne phonetics, is a phonetic algorithm like Soundex, but specialized for German words
-
rulet
figlet implementation
-
yzb64
Ytrizja base-64 specialization
-
bge
Rust interface for BGE Small English Embedding Library
-
ascii-rs
Process image into colored-ascii image
-
korean_regex
Regex extension for Hangeul analysis
-
rust_baht_text
Convert number to Thai Baht text
-
ainu-utils
A collection of utilities for the Ainu language
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
vape
full width aesthetics
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
wcrs
GNU wc in Rust
-
mime_4
Strongly Typed Mimes
-
xim-ctext
compound text en/decoder
-
mdbook-tools
A collection of tools for mdbook
-
mdbook-docslab
mdBook preprocessor for interactive code with docslab
-
uwubot
discord bot for uwuifying text
-
case01-rs
This program outputs 0 and 1 with a probability of one in two. It's like the two sides of a coin.
-
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
cofe
tiny string similarity crate
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
genpdf
User-friendly PDF generator written in pure Rust
-
markdown-it-latex
Allows for the insertion of math in Markdown documents using LaTeX
-
lithe-cli
A cli of lithe
-
romulus
a stream editor like sed
-
string_py
aims to make the String type as easy to use as the str type in python
-
twitter_text_parser
Parser for twitter-text in Rust
-
utils_rust
A Rust library for various utility functions
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
solr_post
command line tool and library to post files in a directory to a Solr server
-
wattle
tools for rust
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
macro_colors
colorful printing macros
-
detect-indent
Detect the indentation of code
-
text_alignment
Align your text in Rust in the CLI
-
timfmt
A small utility for formatting code as Tim likes it
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
str_overlap
Methods for finding the overlap between two string slices
-
group-similar
Group similar values based on Jaro-Winkler distance
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
cabocha
Safe Rust wrapper for CaboCha, a Japanese-language dependency structure analyzer library
-
spyglass
Search engine for documents, inspired by bioinformatics
-
unindenter
unindent text
-
sourcepawn_lsp
Language Server implementation for the SourcePawn programming language
-
rainbow-text
Write text in rainbow colors
-
rust_pie_ob
a pie orderbook library for rust
-
libgrep-rs
searching through text
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
ipsae-core
markdown parser for DIY lover
-
rvim
A text editor in rust
-
logseq
Handle Logseq Markdown files in Rust
-
mdbook-fix-cjk-spacing
mdbook preprocessor that fixes CJK line breaks
-
segtok
Sentence segmentation and word tokenization tools
-
notedown_ast
Notedown Abstract Syntax Tree
-
utfx
-
rex-regextract
extracts key value pairs out of text
-
phonet
A CLI tool and library to validate phonotactic patterns for constructed languages
-
cyrla
two-way conversion between latin and cyrillic script
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
termbook-cli
termbook is a command-line tool to build mdbook’s while executing bash codeblocks and collecting their output to become part of the mdbook
-
chunk_norris
splitting large text into smaller batches for LLM input
-
hunspell-rs
Rust bindings to the Hunspell library
-
mdtranslation
prepare multi-lingual Markdown documents
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausible fake data
-
ucf
A universal code formatter
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
genkit
A common generator kit for static site generator
-
uxmlwriter
A tiny drop-in replacement for xmlwriter
-
ragtime
Easy Retrieval Augmented Generation
-
regex-automata
Automata construction and matching using regular expressions
-
pulldown-cmark-fork
A pull parser for CommonMark
-
ctrl-z
A composable reader to treat 0x1A as an end-of-file marker
-
besida
Language for defining branching dialogue
-
termwrap
Wrap Unicode text with ANSI color codes
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
mdlint-cli
tbd
-
minigrep_maybe
Just another minigrep from Rust book
-
conventional_commits_linter_lib
A tooling and language agnostic library to lint Git commits against the Conventional Commits specification
-
markovish
Markov chain implementation for text generation
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
merge-whitespace-utils
Procedural macros for merging whitespace in const contexts
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
stylish-stringlike
API for string-like objects that have styles applied
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
mdbook-force-relative-links
An mdbook pre-processor to transform all local links to relative ones
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
rustextile
Textile markup language parser for Rust
-
wordninja
port of the Word Ninja English word splitting library
-
mdx
in Rust
-
vec-string-to-static-str
providing utilities for converting vectors of Strings into vectors of &'static str
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
utf-cli
A UTF code viewer
-
md2gemtext
for converting Markdown into gemtext
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
morse-nostd
A nostd version of the morse crate
-
node-emoji
Convert :emoji: to Unicode using GitHub’s and EmojiDB’s emoji names
-
afrim-memory
handle of sequential codes easier for an input method
-
mdbook-image-size
An mdbook preprocessor which supports image size syntax
-
spongebobizer
Command-line utility that outputs its stdin, converted to 'sPonGeBoB cAsE', and a library to support it
-
encoding-next-index-singlebyte
Index tables for various single-byte character encodings
-
humnum
Human numeric sorting program: does what sort -h is supposed to do!
-
citation
manipulate citation files
-
text-template
Small template engine for use with plain text (e.g. creating text email), not intended for HTML.
-
hyper-old-types
HTTP types from hyper 0.11.x
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
api_key
Generate API keys in Rust; supports base32, base62, string, uuid4, uuid5
-
redpatterns
a list of patterns for scanners 📟
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
typos-dict
Source Code Spelling Correction
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
translation-api-cn
Some useful structs for calling Chinese translation API cloud services. A helper tool for the bilingual cmdline tool.
-
hyphenator
segmenting words into syllables
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
latex-to-html
Latex to html converter
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
catdream
Sleeping cat dreams your text
-
rdg
Random data generator for the command line
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
repub
convert markdown documents to epub
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
sttx
belt for transforming speech-to-text data
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
encoding_c
C API for encoding_rs
-
rss4mdbook
an RSS generator for mdBook as a CLI tool; exports RSS.xml to the path you want
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
fwuffgrep
Basic implementation of a grep command written in rust
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
ucfirst
Uppercase the first letter of a string
-
texoder
A text stream which can encode/decode text in several encoding formats
-
code-tour
Enhanced example-based learning, i.e. awesome examples user experience
-
sarcasm
tExT creation and validation library
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
mdbook-to-example
Turns an mdbook book into a Rust example
-
cvicenie_2
Cvicenie 2
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
markdown-includes
Include other documents, table of content, or rust-doc in Markdown using a simple template system
-
indoc
Indented document literals
-
markdown-linkify
Markdown preprocessor for substituting link shorthands with valid links according to configurable regexes and custom substitution implementations
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
ucd-util
A small utility library for working with the Unicode character database
-
simple_bencode
bencode encoder and decoder that uses neither rustc-serialize nor Serde. Instead, it serializes from / deserializes to a tree using a 4-branch enum.
-
auk_markdown
Markdown support for Auk
-
codex
Human-friendly notation for Unicode symbols
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
tergo-tokenizer
R language tokenizer
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
annatto
Converts linguistic data formats based on the graphANNIS data model as intermediate representation and can apply consistency tests
-
transcript
A transcriber for European scripts
-
igpay-atinlay
Translate text to Pig Latin
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university or college
-
text_distance
A collection of approximate string matching algorithms
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
prr
Mailing list style code reviews for github
-
trie-match
Fast match macro
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
jp_utils
Utils for working with Japanese text
-
uecho
The unicode of the echo command
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
mdify
A CLI tool that translates md files to html while keeping project structure
-
wxf-converter
Transform yaml, json, pkl files to wolfram
-
svgrep
A grep-like utility for separated-values files written in Rust
-
mdbook-typst-pdf
mdbook typst pdf backend
-
search-in-terminal
A terminal-based search tool
-
llm-tui
A Terminal User Interface (TUI) for interacting with Large Language Models (LLMs) using llm-cli
-
moon-phases
Fast command-line application to show the moon phase
-
marker
finding issues in CommonMark documents
-
gnu-echo-rs
A rewrite of the echo GNU core utility in rust
-
txt_to_md
Command converting from a txt file to a markdown file
-
strings
String utilities, including an unbalanced Rope
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
arg_input
ARGF-style input handling for Rust
-
jput
puts and putc on unicode-width align for Rust
-
base256_lib
Base256 encoder/decoder
-
markdown-table
Creating markdown tables with Rust!
-
encoding8
various 8-bit encodings
-
text-to-png
way to render text to a png image with basic options
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
text-diff
text diffing and assertion library
-
fmty
Composable core::fmt utilities
-
kincaid
A word statistics library in Rust
-
mdbook-bib
mdbook plugin for loading and presenting a bibliography in BibLaTeX format in your books and citing its references
-
mdbook-footnote
mdbook preprocessor for footnotes
-
ligotab
Format delimited data with lightweight markup
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
mdbook-chart
A preprocessor and CLI tool for mdbook to show charts using C3.js
-
wz
Count words, fast
-
uwu_cli
uwuifying the terminal
-
toml_to_table
pretty print TOML as a table
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
liwe
IWE core library
-
uniwhat
Display the unicode characters text
-
unicode_escape
decoding escape sequences in strings
-
rst_renderer
a reStructuredText renderer
-
syllable
counter for use with reading level calculations
-
grace-cli
CLI tool for processing files and strings
-
strainer
Command-line tool for finding duplicate lines in text files
-
findtext_sheet
Search text in SpreadSheet
-
gimme
Pull useful data out of your clipboard
-
asciir
Print ASCII table/values
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
kudubot-bindings
Rust Bindings for the kudubot framework
-
h_hangul
Korean Characters
-
remake
writing maintainable regex and managing symbol soup
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
uwu-rs
uwuifying library
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
md-inc
Include files in Markdown docs
-
shoebill
A Wadler/Leijen style pretty-printer
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
yagenerator
Application that uses the tinytemplate engine to generate text files. If you have a set of structured data and need to generate a bunch of arbitrary types of files from it, this tool can help you save some time.
-
rut
A small UTF-8 parsing library for applications that need to parse individual chars
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
writedown-html
Writedown HTML backend
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
bbx
A robust, performant BBCode pull parser
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
pdf_composer_definitions
PDF Composer definitions crate
-
lindera-decompress
A morphological analysis library
-
txttyp
Formatted string typewriter
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
sauron-md
parsing markdown into sauron node
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
adbook
Creates a book from AsciiDoc files
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
paperoni
A web article downloader
-
utf8_reader
A UTF-8 reader that reads UTF-8 characters from objects that implement the Read trait
-
code-splitter
Split code into semantic chunks using tree-sitter
-
equt-md-ext
Extend event iterator
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
is_printable
Determine whether a given text-based value is printable
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
jmdict
The free Japanese dictionary database JMdict, pre-digested for use in Rust libs and apps
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
tnil
Parsing, glossing, and generating utilities for New Ithkuil
-
genex
Text-expansion library
-
minigrepsada
Test
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
dtex
Better TeX
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
basic-text-internals
Basic Text string literal implementation details
-
marko
Programmatically format text with Markdown syntax
-
ptero-cli
A text steganography CLI tool for Social Media
-
string_morph
string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.
-
ascii_converter
converting between different ascii representations
-
docfmt
A document formatter using Handlebars templates
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
re2
Wrapper for the re2 C++ regex library
-
bpmf_py
A Bopomofo and Pinyin library
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
linetime
command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.
-
prettify-markdown
Format Markdown at the speed of Rust
-
xee-xpath
XPath 3.1 library API
-
blitztext
fast keyword extraction and replacement in strings
-
gret
command line tool to search for patterns and show matches in a tree structure
-
mdbook-unlink
A mdBook backend that validates local links
-
skribo
low-level text layout
-
moenarchbook
Creates a book from markdown files
-
goodname
assist you with cool naming of your methods and software
-
combos
Print all permutations of a word list
-
string_manip_rust
Demo of managing projects
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
html2runes
An HTML to Text converter
-
split_exact
splitting strings into arrays of slices
-
string-overlap
A helper crate for "layering" ASCII art
-
yozuk-helper-english
English NLP utilities for Yozuk
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
rustysozluk
Eksi Sozluk API wrapper
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
set_mtime
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
scripter
A screenplay compiler
-
yozuk-core-skillset
Set of default Yozuk skills
-
ontodev_valve
A lightweight validation engine written in rust
-
indent_tokenizer
Generate tokens based on indentation
-
cindex
CSV indexing library
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
moscato
Outline scaler for OpenType glyphs
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
beemovie-cli
Bee Movie CLI Application
-
charmap
one-to-(none/one/many) character mapping
-
p4d-mdproof
Markdown to PDF converter
-
trans-epub
Translate EPUB with CLI
-
marmite
easiest static site generator
-
prettythanks
frontend to dtolnay/prettyplease library
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
minimizer
Minimize files to find minimal test case
-
spellabet
Convert characters into spelling alphabet code words
-
chisel-lexers
Chisel backend lexers/scanners
-
framework
detector for different frameworks in one project
-
unicode_categories
Query Unicode category membership for chars
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
futf
Handling fragments of UTF-8
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
parattice
Recursive paraphrase lattice generator
-
mapm-cli
The command-line implementation of mapm
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
ream
Data language for building maintainable social science datasets
-
shelldon
your new Rust-powered buddy with GPT features!
-
libxdiff
Rust bindings for the libxdiff C library
-
stardict_lib
Rust implementation of StarDict
-
rustex
auto-generated LaTeX files in Rust
-
groop
Outline-aware grep with syntax highlighting
-
auto_correct
provide auto correct suggestions. Currently supporting EN-US.
-
tgo
Heterogeneous data type transition; it's safe, lightweight and fast
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
mul
Bengali stemmer
-
merge_pdf
Merge PDF files in a directory
-
lindera-dictionary
A morphological analysis library
-
epubparse
Parse epub and convert to text-only Book structure
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
github-slugger
A slugger for GitHub headings
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
unicode-vo
Unicode vertical orientation detection
-
djot
Djot parser written in pure Rust
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
grep-reader
short text for crates.io
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
chat-gpt-lib-rs
interacting with OpenAI's ChatGPT API, providing a simple interface to make API requests and handle responses
-
remove-markdown-links
Turns [example](https://example.com) into example. That’s it
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
vidyut-chandas
A Sanskrit metrical classifier
-
pylyzer_core
pylyzer core
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
syllarust
quickly counting syllables
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
deepphonemizer
G2P model (inference only)
-
hms
Command line utility to help with storing text (eg. code snips)
-
text-to-json
Convert text to json in rust
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifying filenames
-
highlights
Convert kindle highlights into different formats
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
find-typos
Find typos in your project
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
unidok
A powerful, readable, easy-to-learn markup language
-
pencil_case
but effective library for writing GUI in bevy
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
trans-case
Transform case
-
scraps
A static site generator that makes it easy to write internal links (Wiki links) in Markdown files
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
pocky
A framework for building your own static site generator
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
tradukisto
Kinda useful natural language translation library and utility
-
sparklet
small flashcards library
-
unicode_font
Convert unicode characters between fonts
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
static_table
creates pretty tables at compile time
-
typeline_ext_csv
csv parsing and serialization for typeline
-
caribon
A repetition detector program and library
-
unicode-canvas
creating text-based drawings
-
retest
Command-line regular expression tester
-
untex
Understand and manipulate TeX files with ease
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
synterm
making beautiful REPLs and shells with fish-like, as-you-type syntax highlighting
-
mdbook-fishextract
An mdbook preprocessor which handles mermaid graphs, offline, requires mmdc
-
polka
A dot language parser for Rust; based on Parsing Expression Grammar (PEG) using the excellent pest crate as the underpinning
-
utf
UTF-8
-
case_converter
Case Converter
-
naromat
Convert text to narou novel format
-
struckdown
A structured markdown / commonmark library for Rust
-
csvre
replacing data in CSV columns with regular expressions
-
cw
Count Words, a fast wc clone
-
escaped-delimiter
Iterator of delimited slices with escape characters
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
askama-filters
Extra template filters for Askama
-
rml
A machine learning library in Rust
-
utf8-command
UTF-8 encoded std::process::Command output
-
lines
Utilities for iterating readers efficiently line-by-line
-
grammateus
facilitate working with Ancient Greek words
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
aki-stats
output the statistics of text, like the Linux wc command
-
dom-content-extraction
Content extraction via text density paper
-
books_description_parser
A Rust-based parser to extract book details from structured markdown-like text and output them in formats like JSON or Rust structs for further processing
-
color-convert
Supports converting between RGB, RGBA, HEX, HSL, HSLA, HSV, and CMYK; written in Rust
-
yitizi
異體字查詢 Get variant Chinese characters
-
mdbook-open-gh-issue
mdbook preprocessor to add an open-on-github link on every page
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
br-pdf
PDF Invoice Processing
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
seven_seg
Seven-segment digital display for terminal
-
mdbook-translation
prepare multi-lingual mdBook books
-
dvi2html
converter
-
corpus-preproc
A preprocessor for text and HTML corpora
-
write16
A UTF-16 analog of the Write trait
-
cpp_to_rust
Automatic generator of C++ library wrappers
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
difference
text diffing and assertion library
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
iasthk
Harvard-Kyoto to IAST conversion
-
deface
Lightweight markup to HTML converter
-
vtext
NLP with Rust
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
e_book_sync_library
Synchronize e-books with your local e-library
-
ron_to_table
pretty print RON as a table
-
term-rustdoc
A TUI for Rust docs
-
hsk
Return HSK Level for Simplified Chinese Characters
-
rnltk
Natural Language Toolkit for Rust
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
oxcomm
using Google Translate on the fly
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
heckmv
A basic case-conversion renaming CLI tool
-
findtext_textfile
Search text in text file
-
md-include
include any file in markdown files
-
stardict_wrapper
Rust implementation of StarDict
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
pcre2
High level wrapper library for PCRE2
-
linkcheck
extracting and validating links
-
note-mark
A markdown parser
-
scrambler
command line tool to scramble letters
-
latex
An ergonomic library for programmatically generating LaTeX documents and reports
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
ryaspeller
lib for searching typos in text, files and websites
-
any2utf8
Convert any encoding to utf-8
-
smoldown
Native Rust library for parsing Markdown
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
xlsxwriter
Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
gregex-logic
Logic for the gregex crate
-
intname
Full English name for any integer of any primitive integer type
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
anagrambot
find anagrams of words
-
mmseg
Chinese word segmentation algorithm MMSEG in Rust
-
spinne-core
Core logic for spinne
-
strange
A static website generator
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
aklat
create books from markdown files (like Gitbook)
-
rust_book_code
The code of rust book
-
aki-xtee
copy standard input to each files and standard output
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
beemovie
Bee Movie crate
-
word_filter
A Word Filter for filtering text
-
md_parser_wasm
A markdown parser written in Rust and compiled to WebAssembly
-
boxy
Declarative builder for Unicode box-drawing characters
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
korrektor
work with Uzbek language text processing
-
bookrafter
This repository contains code related to bookrafter rendering
-
ezemoji
Categorized emojis
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
snake_case_converter
convert strings to snake case
-
bos_books_codes
that handles 3-character Bible Books Codes
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
inclip
terminal utility for clipboard interaction and convenience
-
forming
lightweight architecture as code language. 架构描述语言
-
veryfi
Module for communicating with the Veryfi OCR API
-
bbd-lib
Binary Braille Dump
-
subscript-compiler
A modern LaTeX rendition
-
google_material_symbols
Google Material Design Icon Font
-
hashtag-regex
regex matching hashtags according to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
sprinkles
Randomly colors input text and outputs it to the terminal
-
rangle
An Org-Tangle alternative
-
carlo-latex
A LaTeX emitter for the simple interpreted programming language Carlo
-
bbcode-tagger
BBCode tree parser and tagger
-
sauron-markdown
parsing markdown into sauron node
-
blockcounter
Counts the blocks in a stream
-
ed_join
Implementation of the Ed-Join algorithm for string similarity join
-
quill_delta_pdf
Convert Quill Delta to PDF
-
odict
A blazingly-fast dictionary file format for human languages
-
encoding-next-index-korean
Index tables for Korean character encodings
-
czv
performing CSV-related operations for data engineering and analysis
-
file-search
File indexing and search
-
cattocol
Combine two text into one text as columns
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
once-cell-regex
just gives you the regex macro from the once_cell docs!
-
lindera-ko-dic
A Korean morphological dictionary for ko-dic
-
kilo
small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
panduck
Conversion tool by rust, inspired by pandoc
-
indentation
Formatter
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
lines_lossy
extension to BufRead with a function lines_lossy that works like BufRead::lines but with lossy UTF-8 decoding
-
markdown2unicode
Converter from markdown notation to unicode characters
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
md_converter
My markdown to html converter
-
base100
Encode your data into emoji
-
mdbook-all-the-markdowns
Render all markdown files in a given folder structure
-
ucd-parse
parsing data files in the Unicode character database
-
top-english-words
retrieve top words from the English language
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
zuk
Yozuk command-line interface
-
wfst4str
Python library based on rustfst for manipulating strings with wFSTs
-
arbitrator
Format text based on a set of rules and regexes
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
ogrep
searching in indentation-structured texts
-
text-sanitizer
convert text to plain ASCII text
-
kanpyo
Japanese Morphological Analyzer
-
tokengeex
efficient tokenizer for code based on UnigramLM and TokenMonster
-
wordnet
Read a wordnet dictionary in Rust
-
crypto-invert
Unicode Upside-Down Mapping
-
unicount-lib
Alphabetic counter supporting unicode
-
tb_normalization
normalization of UTF-8 strings; strips diacritics from Vietnamese and some other languages
-
single_source
Generate code files from snippets in md tutorial files
-
pra
Print Random ASCII
-
code-span
Add additional information to code characters
-
gqlog
👾 filter your json logs with graphql 👾
-
encoding_c_mem
C API for encoding_rs::mem
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
mdbook-mdinclude
Preprocessor for mdBook that better handles including markdown files
-
kanpyo-dict
Dictionary Library for Kanpyo
-
lindera-compress
A morphological analysis library
-
dhoni
converting Bengali text into its phonetic counterpart
-
textocx
Tex code to Office MathML
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
skyspell_core
skyspell core library
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
alpha-counter-cli
Alphabetic counter
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
icu-data
International Components for Unicode (ICU) data in Rust structures
-
ttf_word_wrap
Wraps text based on character width
-
ascii_tree
generates ascii trees
-
sitdown
Static site generator
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
cozo-ce
A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
contractions
expand contractions in English
-
ngrams
Generate n-grams from sequences
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
crustword
Crusty Crosswords
-
mupdf-sys
Rust FFI binding to MuPDF
-
bocu1
BOCU-1 compressed unicode encoding
-
df_cp437
Decoder for CP437 to UTF-8
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
demoji
Remove all emojis from a string
-
soundchange
implementing sound change algorithms in Rust
-
ssml-parser
parsing speech synthesis markup language
-
nib
static site generator
-
yozuk-model
NLP model generator for Yozuk
-
color-char
Character with color information
-
const_format_proc_macros
detail of the const_format crate
-
owlz
"Owlz" ascii emoojis, created randomly or by design
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
rustyword
An anagram finder
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
assert-text
the testing macro tools
-
typst-kit
Common utilities for Typst tooling
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
markdown-heading-id
Filter for pulldown-cmark which converts headings with custom ID
-
nib-cli
A CLI for Nib, yet another static site generator
-
anagram
A collection of anagram utility functions
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
typeline_ext_sqlite
sqlite integration for typeline
-
minigrep_bakedspacetime
Minimal Rust implementation of grep based on The Book
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
lingua-slovak-language-model
The Slovak language model for Lingua, an accurate natural language detection library
-
fribidi
bindings for Rust
-
ankiding
Creating Anki-Flashcards within Markdown!
-
text_to_emoji
Convert text to emoji
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
rusty_code_code_for_book
my book_rusty code
-
varcon-core
Varcon-relevant data structures
-
simple-word-count
word count function that tries to match the results of the Microsoft Office Word application
-
fzy-tmux
A rewrite of fzy's fzy-tmux script in Rust
-
fuzzy_match_flex
Fuzzy matching library based on the popular FuzzyWuzzy library for python
-
text-tables
A terminal/text table prettifier with no dependencies
-
stamd
Webservice for working with stand-off annotations on text (STAM)
-
aki-txpr-macro
the more easy to use libaki-*
-
cdoc-parser
A markdown-based document parser with custom syntax
-
doc-sync
A CLI to convert rust documentation to markdown files and then back to rust documentation
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
caseformat
Power flow case data format
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
decline-word
Choose word form based on given number
-
html_to_pdf_lib
converting HTML to PDF
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
spongedown
Converts markdown to html with svgbob support
-
rg
You don’t want this crate - you want the ripgrep crate
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
alith-core
Alith core package
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
ddvm
Document to Document Virtual Machine
-
emojicons
Parse :emoji: notation to unicode representation
-
ascii-webcam
A webcam that visualizes its output as ASCII art directly in the terminal
-
llmvm-core-lib
llmvm core application
-
mdtable-cli
that makes creating tables in markdown much easier!
-
charjpoet
Charj Poet is an API for writing .cj language code
-
naveengrep
command line tool similar to grep
-
mdbook-tagger
A mdbook preprocessor to collect and build tag data in Json format
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
pdf_form
programmatically filling out pdf forms
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
textos
Texts, strings, formatting, unicode…
-
naming_clt
Extract and convert the naming format (case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching, replacing...) on relative files
-
chisel-parsers
Chisel parser front ends
-
poetry-book
Create a poetry book in latex, starting from plain text
-
md-dir-builder
Webserver for serving all markdown files in a directory
-
pdf_composer_base
PDF Composer base functionality crate
-
maybe_utf8
Byte container optionally encoded as UTF-8
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
lindera-assets
A helper crate to fetch assets and build dictionary for lindera
-
aki-json-pick
The json pick out command
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
ced
Dead easy csv editor
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using ucd-generate
-
tinysegmenter
Compact Japanese tokenizer
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
goya
morphological analyzer for Rust and WebAssembly
-
yarner-lib
build plugins for the Literate Programming tool Yarner
-
morc
Dead simple, minimal markdown generator library written in Rust
-
encoding
Character encoding support for Rust
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
wz-conf
Configuration options for wz
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
presciidoc
Preprocessing AsciiDoc for other tools
-
flw
Process text via configurable tasks
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
xsv
A high performance CSV command line toolkit
-
ascii_utils
handle ASCII characters
-
strizer
minimal and fast library for text tokenization
-
bgrep
grep tailored to handle binary patterns and files
-
indexable_str
creating indexable strings
-
fuzzysearchrs
Fuzzy search for finding strings in string with levenshtein distance
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
tashkil
A lightweight library for removing Arabic diacritics
-
catmark
Console printer for CommonMark
-
termbook
behind the termbook-cli
-
norm-email
strip email provider defined behaviour from email addresses
-
beary
augment Bear, the note-taking app
-
mdbook-post
A CLI for adding posts to mdbook
-
docstring
manipulating and parsing documentation strings
-
tadm
A collection of algorithms and data structures written out while reading The Algorithm Design Manual book
-
unidecode
pure ASCII transliterations of Unicode strings
-
ewts-c
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)
-
rslint_errors
Pretty error reporting library based on codespan-reporting built for the RSLint project
-
llmvm-outsource-lib
outsource backend for llmvm
-
asciifolding
ascii folding library
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
static_format
Format strings with no runtime overhead
-
json-peek
Amateur JSON parser library designed for my specific need
-
my_parser_kma_andurusenko_group_7
\<> _ <>\
-
lingua-greek-language-model
The Modern Greek language model for Lingua, an accurate natural language detection library
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
bookbinder_latex
Produce latex and pdf books
-
unic-common
UNIC — Common Utilities
-
align_text
Aligns lines in a block of text within a number of columns
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
tex
The νTeX typesetting engine
-
tiniestsegmenter
Compact Japanese segmenter
-
lingua-czech-language-model
The Czech language model for Lingua, an accurate natural language detection library
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
cjieba-sys
unsafe ffi to cppjieba
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
tfidf-summarizer
Basic tf-idf compute for documents
-
dd
a clone of the unix coreutil dd
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
minigrep_ddesreum
My version of the grep command using Rust inspired by The Rust Programming Book
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
chanoma
Character normalization library. A library for string normalization processing.
-
rustrawi
Rust port of the original PHP Sastrawi
-
emoji_converter
Converts text to emojis
-
token-counter
wc for tokens: count tokens in files with HF Tokenizers
-
txtframe
Creates a frame for text
-
mime-rs
A text processing framework, inspired by Emacs lisp and keyboard macros
-
lingua-vietnamese-language-model
The Vietnamese language model for Lingua, an accurate natural language detection library
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
gbx
GBX (Grundbuch-Exchange) file format
-
masker
Mask patterns in data
-
openlibrary-rs
A wrapper around openlibrary's Web API
-
japanese-ruby-filter
Japanese ruby notation parser
-
genpdfi
User-friendly PDF generator written in pure Rust
-
jp_inflections
A small crate to conjugate japanese words
-
ascii_set
Fast membership of ASCII character classes
-
uniaxe
replace Unicode letters with Ascii equivalents
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
indexrs
inefficient multi-language search index
-
pdf_forms
programmatically filling out pdf forms
-
suffix
arrays
-
vaporetto_rules
Rule-base filters for Vaporetto
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
phonics
Phonetic spelling algorithms in Rust
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
conveyance
A stop-gap CLI for conveyancing
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
STKLR
STKLR (pronounced 'stickler') is a CLI tool that automatically links functions, enums, structs, traits etc. in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
asciify
converting images to a readable format on the command line
-
ruby-parser
A parser for the Ruby language
-
ascii-engine
A fast ascii engine written in rust!
-
tabled
An easy to use library for pretty printing tables of Rust structs and enums
-
overlap
shows overlap text in files
-
lindera-py
Python binding for Lindera
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
transliterate1234
UTF-8 to ASCII transliteration
-
rustascii
Display Rust in ASCII
-
typeline_ext_http
http(s) tooling for typeline
-
mdbook-chapter-number
An mdBook preprocessor that adds chapter numbers to each page header
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
writedown
format parser
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
findtext_pdf
Search text in PDF
-
simplearrayhash
-
trexter
Text progression tracking library
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
morsels_lang_chinese
Chinese tokenizer for morsels
-
ascii-to-hex
A small, simple library for converting an ASCII text string into its hexadecimal equivalent
-
txt_processor
A little library for text processing
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
borderrs
Add stylish borders around your text and datastructures
-
lindera-analyzer
A morphological analysis library
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
wkhtmltox-sys
FFI bindings to wkhtmltox
-
slicer
that slices string slices into smaller string slices
-
typos-cli
Source Code Spelling Correction
-
llmvm-chat
An llmvm frontend that acts as a CLI chat interface
-
pdfutil
PDF document manipulation
-
string-sections
Build tool for Leptos
-
cutters
Rule based sentence segmentation library
-
bookbinder
Produce books in various formats from markdown, with some understanding of structural semantics and rendering options
-
latin1str
Windows-1252 string types
-
panduck-latex
Use panduck to generate XeLaTeX
-
swappy
An anagram generator
-
fooz
fuzzy based string search
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
md2pdf
A small utility to convert markdown files to pdf exploiting tectonic
-
ascii-read
BufRead-like methods for reading into an AsciiString
-
convert_encoding
Convert encoding of text files in batch
-
dumbfuzz
dumb library for fuzzy search
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
mathml-latex
Convert between MathML and LaTeX
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
tectonic_bundles
Tectonic "bundle" (support file collection) implementations
-
lindera-cc-cedict
A Chinese morphological dictionary for CC-CEDICT
-
jieba-macros
jieba-rs proc-macro
-
tectonic_dep_support
Support for finding third-party libraries using either pkg-config or vcpkg
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
fountain
Parser and formatter for the Fountain screenplay markup language
-
mtoc
Generate a table of contents for markdown files
-
json_to_table
pretty print JSON as a table
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
latexify
Shared definitions for turning a Rust object into LaTeX code
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
typeline_ext_python
python integration for typeline
-
guarding_parser
Guarding is a guardian for code, architecture, and layering. Guarding provides an architecture guard DSL based on ArchUnit.
-
grep-printer
grep crate's Sink trait that provides standard printing of search results, similar to grep itself
-
tectonic_status_base
Basic types for reporting status messages to a user
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extension
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
rust-jieba
Rust binding to cppjieba
-
utf8-rfc2279
UTF-8 according to the obsolete RFC2279
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
unic-idna-punycode
UNIC — Implementation of Punycode (RFC 3492) algorithm
-
ascii_code_finder
find ascii code of a character or get a character by its ascii code
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
font-map-core
Core font-parsing capabilities for font-map
-
grep-searcher
Fast line oriented regex searching as a library
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
scenic
Presentation tool that utilizes the LaTeX Beamer package
-
lingua-polish-language-model
The Polish language model for Lingua, an accurate natural language detection library
-
lipsum-cn
Pseudo-Chinese lorem ipsum generator
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
indentation_flattener
From indented input, generate plain output with indentation PUSH and POP codes
-
grep-matcher
A trait for regular expressions, with a focus on line oriented search
-
lingua-swedish-language-model
The Swedish language model for Lingua, an accurate natural language detection library
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
shift_or_euc
Detects among the Japanese legacy encodings
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
chardetng_c
C bindings for chardetng
-
lindera-tokenizer
A morphological analysis library
-
lingua-albanian-language-model
The Albanian language model for Lingua, an accurate natural language detection library
-
lingua-indonesian-language-model
The Indonesian language model for Lingua, an accurate natural language detection library
-
lingua-georgian-language-model
The Georgian language model for Lingua, an accurate natural language detection library