-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
similar
A diff library for Rust
-
const_format
Compile-time string formatting
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
lazy-regex
lazy static regular expressions checked at compile time
-
ropey
A fast and robust text rope for Rust
-
pulldown-cmark
A pull parser for CommonMark
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
unicode-bidi
Unicode Bidirectional Algorithm
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
convert_case
Convert strings into any case
-
termimad
Markdown Renderer for the Terminal
-
html2text
Render HTML as plain text
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
unicase
A case-insensitive wrapper around strings
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
diff
An LCS based slice and string diffing implementation
-
prettydiff
Side-by-side diff for two files
-
lopdf
PDF document manipulation
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
chardetng
A character encoding detector for legacy Web content
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
fuzzy-matcher
Fuzzy Matching Library
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
stringprep
algorithm
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
roff
ROFF (man page format) generation library
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
mdbook
Creates a book from markdown files
-
const-str
compile-time string operations
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
hyperlink
Very fast link checker for CI
-
charabia
detect the language, tokenize the text and normalize the tokens
-
ascii
ASCII-only equivalents to
char
,str
andString
-
text_io
really simple to use panicking input functions
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
slug
Convert a unicode string to a slug
-
google-dialogflow3
A complete library to interact with Dialogflow (protocol v3)
-
regex-syntax
A regular expression parser
-
tailspin
A log file highlighter
-
synoptic
low-level, syntax highlighting library with unicode support
-
lindera-cli
A morphological analysis command line interface
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
printpdf
writing PDF files
-
diffy
Tools for finding and manipulating differences between files
-
regress
A regular expression engine targeting EcmaScript syntax
-
minimizer
Minimize files to find minimal test case
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
str_indices
Count and convert between indexing schemes on string slices
-
matchers
Regex matching on character and byte streams
-
text-size
Newtypes for text offsets
-
shell-escape
Escape characters that may have a special meaning in a shell
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
epub-builder
generating EPUB files
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
ferris-says
flavored replacement for the classic cowsay
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
unindent
Remove a column of leading whitespace from a string
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
wildcard
matching
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
font-types
Scalar types used in fonts
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
mdcat
cat for markdown: Show markdown documents in terminals
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
stfu8
Sorta Text Format in UTF-8
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
sd
An intuitive find & replace CLI
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
hck
A sharp cut(1) clone
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
lindera
A morphological analysis library
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
llmvm-core
The core application for llmvm
-
shell-toy
A "fortune | cowsay" implementation in Rust, i.e. a nice little toy to liven up your shell.
-
aho-corasick-unsafe
Fast multiple substring searching
-
vesti
A preprocessor that compiles into LaTeX
-
pad
padding strings at runtime
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
netidx
Secure, fast, pub/sub messaging
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
prefix
A customizable pretty printer for FIX messages
-
rustyline-async
A minimal readline with multiline and async support
-
pdf-extract
extract content from pdfs
-
languagetool-rust
LanguageTool API bindings in Rust
-
chewing
(酷音) intelligent Zhuyin input method
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
clipboard-rs
Cross-platform clipboard API (text | image | rich text | html | files | monitoring changes) | 跨平台剪贴板 API(文本|图片|富文本|html|文件|监听变化) Windows,MacOS,Linux
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
any_ascii
Unicode to ASCII transliteration
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
cow-utils
Copy-on-write string utilities for Rust
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
near-facsimile
Find similar or identical text files in a directory
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
console_static_text
Logging for text that should stay in the same place in a console
-
ncount
A word count tool intended to derive useful stats from markdown
-
difflib
Port of Python's difflib library to Rust
-
wordcut-engine
Word segmentation/breaking library
-
vaporetto
pointwise prediction based tokenizer
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
fm
Non-backtracking fuzzy text matcher
-
fiat-lux
Offline terminal-accessible Bible
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
skyspell
Fast and handy spell checker for the command line
-
scraps
A static site generator that builds a wiki from a set of markdown files
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
savvy
R extension interface
-
regex-cursor
regex fork that can search discontiguous haystacks
-
molybdenum
Recursive search and replace CLI application
-
hypher
separates words into syllables
-
unicode-reverse
Unicode-aware in-place string reversal
-
tantivy-analysis-contrib
A set of analysis components for Tantivy
-
igrepper
The interactive grepper
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
rapidfuzz
rapid fuzzy string matching library
-
vi
An input method library for vietnamese IME
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
stylin
Convert markdown to pandoc markdown with custom styles
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
mupdf
Safe Rust wrapper to MuPDF
-
jayce
tokenizer 🌌
-
entities
raw data needed to convert to and from HTML entities
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
mdbook-quiz
Interactive quizzes for your mdBook
-
mdbook-epub
An EPUB renderer for mdbook
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
nvl-cli
A program to download webnovels
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
notion2html
Convert Notion pages to HTML
-
regex-literal
delimited regular expression literals
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
quixote
Quizzes and tests in Markdown
-
trans-epub
Translate EPUB with CLI
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
collclean
Clean up collaboration commands in LaTeX files
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
minix
straightforward minifier for JavaScript and CSS files, developed using Rust. This tool efficiently reduces the size of JS and CSS files by removing unnecessary characters, whitespace…
-
supply-chain-trust-example-crate-000022
Fast multiple substring searching
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
prop-check-rs
A Property-based testing Library in Rust
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
tibco_ems
A high level API for the Tibco EMS
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
qpdf
Rust bindings to QPDF C++ library
-
termfactor
is your terminal sick, mid or sus? test your terminal features & find out
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
htmd-cli
The command line tool for htmd
-
focaccia
no_std implementation of Unicode case folding comparisons
-
ident_case
applying case rules to Rust identifiers
-
svgbob
Transform your ascii diagrams into happy little SVG
-
uwc
Counts things in unicode text files
-
unicode_categories
Query Unicode category membership for chars
-
tantivy-jieba
that bridges between tantivy and jieba-rs
-
yffi
Bindings for the Yrs native C foreign function interface
-
sk-skimmer
Fuzzy Finder in rust!
-
bbd
Binary Braille Dump
-
percent-encoding
Percent encoding and decoding
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
b2c2-parser
b2c2におけるBASICコードのパーサー?
-
frawk
an efficient Awk-like language
-
vidyut-prakriya
A Sanskrit word generator
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
frida-build
Rust bindings for Frida
-
newslookout
A web scraping platform built for news scanning, using LLMs for text processing, powered by Rust
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
basic-text
Basic Text strings and I/O streams
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
sile
Simon’s Improved Layout Engine
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
zspell
Native Rust library for spellchecking
-
norad
Read and write Unified Font Object files
-
rustyink
Blazing fast static site generator
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
fontkit
font loading and indexing
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
emojic
Emoji constants
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
lexical-sort
Sort Unicode strings lexically
-
arrow-row
Arrow row format
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
mktoc
Generate Table of Contents from Markdown files
-
fasttext
binding
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
serpl
terminal UI for search and replace, ala VS Code
-
unicode-ccc
Unicode Canonical Combining Class detection
-
autotex
Continuously compile TeX and LaTeX
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
nu_plugin_emoji
a nushell plugin called emoji
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
tbll
tbll outputs data in tabular format
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
supply-chain-trust-example-crate-000021
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, German, Italian or French into their digit representation
-
heart-strings
Quickly get random heart emojis to copy!
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
hh-highlight
hh: The text highlighter. Read from stdin, and highlight selected words printed to stdout
-
fish-printf
printf implementation, based on musl
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
mdbook-pandoc
A mdbook backend that outsources most of the rendering process to pandoc
-
gst-plugin-textahead
GStreamer Plugin for displaying upcoming text buffers ahead of time
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ
,ŋ
andê
. -
egui-dropdown
An actual dropdown list for egui
-
array_tool
Helper methods for processing collections
-
figlet-comment
quickly create banner to use as comments
-
abbreviation_extractor
extracting abbreviations from text
-
dcsv
Dyanmic csv reader,writer,editor
-
numeric-sort
A zero-allocation, human-readable sorting library
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
oxford_join
Join string slices with Oxford Commas!
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
libreofficekit
in Rust
-
dom-content-extraction
Content extraction via text density paper
-
typed-oid
Typed Object IDs
-
filenamify
Convert a string to a valid filename
-
terminal-clipboard
a minimal cross-platform clipboard
-
kelp
A convert tool for Japanese
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
extract_anchors
Утилита для извлечения из исходных кодов всех помеченных отрывков
-
lindera-dictionary
A morphological analysis library
-
mini_markdown
Small markdown renderer
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
chocodye
changing the color of the chocobos' plumage in Final Fantasy XIV
-
retryable-result
repeatedly try on some recoverable errors until too many or a fatal error
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
rutie
The tie between Ruby and Rust
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
obmrs
As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…
-
uclanr
A random word picker that gives you actually useful words
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
whitespace-sifter
Sift duplicate whitespaces away!
-
ib-pinyin
一个高性能拼音匹配库
-
colonnade
format tabular data for display
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
detone
Decompose Vietnamese tone marks
-
wit-bindgen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
harfbuzz_rs_now
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
advreader
Split text file into text sequences, strings and (line) comments
-
dptran
run DeepL translations on command line written by Rust
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
termdiff
Write a diff with color codes to a string
-
minspan
a package for determining the minimum span of one vector within another
-
dialogi
A dialog parser
-
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
mdbook-linkcheck2
A backend for
mdbook
which will check your links for you -
arf-strings
Encoding and decoding for ARF strings
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactor
sub-command. Compile BibTeX files to legacythebibliography
TeX code using thecompile
sub-command… -
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
text-to-ascii-art
program to convert text to ASCII art
-
dprint-plugin-markdown
Markdown formatter for dprint
-
sourceannot
render snippets of source code with annotations
-
asciigraph
Ascii Graph
-
secular
No Diacr!
-
unicode-width-16
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
shwild
(for Rust)
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
iepub
epub、mobi电子书读写
-
words-count
Count the words and characters, with or without whitespaces
-
mdbook-angular
mdbook renderer to run angular code samples
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
unidown
Convert Markdown to Unicode
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
kathoey
text feminization using open corpus linguistics data
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
as3_parser
ActionScript 3 parser
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
sqdj
sqdj shortens delimited data
-
mdopen
Preview markdown files in a browser
-
pager
pipe your output through an external pager
-
just-enough-emojis
text to emoji cli
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
pray
A tui tool for preparing a prompt to the llms
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
nu-utils
Nushell utility functions
-
url_parser
URL Parser is a Rust parser developed to parse URLs into structured components such as scheme, domain, path, query and fragment
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
pprint
Flexible and lightweight pretty printing library for Rust
-
text-editing
string with utilities for editing
-
linebreak
breaking a given text into lines within a specified width
-
regex-charclass
Manipulate and convert regex character classes
-
dmos
Djot HTML renderer with advanced features
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
mdbook-typst-pdf
mdbook typst pdf backend
-
html_parser_tarasenko
Базовий HTML-парсер на Rust з використанням Pest
-
lorem-ipsum
Quickly generate placeholder test
-
common-words-all
Most common words sorted by ngram frequency
-
substring
method for string types
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
mini-openai
An OpenAI API client with minimal dependencies
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
supply-chain-trust-example-crate-000098
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that doesn't support Arabic script
-
gh_page_tool
A github gh-pages tool for static blog site
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
uwurs
UwUify your strings with uwurs!
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
diary-cli
A powerful cli for documenting and keeping a diary
-
svgc
svgc
(previously SvgCompress) is a tool for compressing SVG files by removing unnecessary whitespace, comments, metadata and some other data. It also supports optimization with SVGO and compression into SVGZ. -
indent
Functions for indenting multiline strings
-
codepack
A *lightning-fast* tool that transforms a directory into a single, LLM-friendly text file, perfect for easy processing and analysis
-
mandown
Markdown to groff (man page) converter
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
json_to_table
pretty print JSON as a table
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
ColorShell
A small crate for coloring text for rust
-
deliminator
Universal code documentation generator
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
iirs
Inverted repeats finder
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
semsimian
Sematic similarity calculations for ontologies implemented in Rust
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
armnod
random string generator
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
hidden_watermark
Hidden Watermark in Rust
-
inlet_manifold
A general purpose highlighting library
-
utilprint
easily using colors or unicode in your terminal printing
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
mkwebsite
build websites using markdown
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
picodiff
Tiny GUI app to compare text easily
-
fetch-catnip
fetch displaying system information and a cute cat
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
zipcodes
Query US zipcodes without SQLite
-
ragtime
Easy Retrieval Augmented Generation
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
dbxcase
Dropbox-compatible case-folding algorithm
-
hebrew_unicode_script
A lightweight library to check if a hebrew character belongs to certain collections
-
orly
Download O'Reilly books as EPUB
-
avt
asciinema virtual terminal
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
md-tui
A terminal markdown viewer
-
tremor-kv
A logstash inspured key value extractor
-
merge_pdf
Merge PDF files in a directory
-
smol-symbol
compile-time globally unique, 25-character, 128-bit symbols similar to the Symbol type in Crystal Lang
-
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
asciidork-eval
Asciidork eval
-
natural
Pure rust library for natural language processing
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
lookbook
Component preview framework for Dioxus
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
textra
type less, write more
-
bump-bin
Increments version with semver specification
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
stylish-ansi
stylish
helpers for writing styles as ANSI escape codes -
pink_accents
Replacement of patterns in string to simulate speech accents
-
wikipedia_prosesize
Count Wikipedia prose size
-
betacode
conversion
-
pandoc_types
Rust port of pandoc-types
-
iregex
Intermediate representation for Regular Expressions
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
kbremap
Custom keyboard layouts for windows
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
typo_checker
Searches for typos of English words and returns matches or similar words.(英単語のタイポを検索し、合致もしくは類似する単語を返却します)
-
bobo_html_parser
parser of html markdown
-
yamd
Markdown Document (flavour)
-
nu_plugin_regex
nu plugin to search text with regex
-
aki-gsub
substitude text command, replace via regex
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
hangeul
Korean alphabet manipulation library
-
text-parsing
Hierarchical text processing preserving char position info
-
yara-x
A pure Rust implementation of YARA
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
soundex-rs
that calculates soundex value
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
kbo
Spectral Burrows-Wheeler transform accelerated local alignment search
-
mtc-token-healing
Token healing implementation
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
choco
markup language for dialogue systems
-
dodo-zh
operate on pinyin and cedict.u8 file
-
rins_markdown_parser
markdown parser written on Rust
-
unicount
Alphabetic counter supporting unicode
-
char-ranges
Iterate chars and their start and end byte positions
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
lowcharts
draw low-resolution graphs in terminal
-
data-streams
Extension traits for reading and writing data with streams
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
catalog-of-markdown
Generate the catalog of markdown file
-
bin-rs
Binary Reader from multi source
-
dnd-character
A Dungeons and Dragons character generator
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
quranize
Encoding transliterations into Quran forms
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
rust-beam
A LaTeX slide generator you can write in faster than beamer
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
cloc
Count, or compute differences of, lines of source code and comments
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
ssbm_utils
A collection of useful tools for working with data from Super Smash Bros Melee
-
mdi
markdown include
-
gosyn
Golang syntax parser in Rust
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
harper-core
The language checker for developers
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
bce
-
b2c2-tokenizer
b2c2のBASICコードのトーカナイザー?
-
slicestring
slicing Strings
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
paltoquet
rule-based general-purpose tokenizers
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
stop-words
Common stop words in many languages
-
presenterm
A terminal slideshow presentation tool
-
grammalecte_client
Grammalecte HTTP client
-
ultra-nlp
A NLP library
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
bogrep
Full-text search for bookmarks from multiple browsers
-
trpl
A support crate for The Rust Programming Language book
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
chord3
Create pdf songbooks from chopro source
-
mdsh
Markdown shell pre-processor
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
human_regex
A regex library for humans
-
wikidump
parsing Mediawiki XML dumps
-
escrit
learning languages by reading texts
-
message_segment_calculator
package to calculate SMS message segments
-
utf16_lit
macro_rules to make utf-16 literals
-
tfon
Bitmap font parsing / conversion
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
csv_to_table
pretty print CSV as a table
-
asciimath-unicode
Convert asciimath to unicode
-
string-simple
containing some simple string utilities that I use in my other projects
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
lorem-rustum
lib for generating lorem-ipsum with a rusty fleur
-
syllabize-es
Syllabize Spanish text, and much more
-
tracery
Text-expansion library
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
uwl
A management stream for bytes and characters
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
loc
Count lines of code (cloc) fast
-
transportations_library
Knowledge stack for transportation
-
url_encor
A lightweight library to encode and decode special characters in urls
-
mdbook-keeper
An improved testing experience for mdbook
-
indent_write
Write adapters to add line indentation
-
rst_parser
a reStructuredText parser
-
prolangkit
A set of language-agnostic utilities for langdev
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
tree-sitter-stack-graphs
Create stack graphs using tree-sitter parsers
-
mdbook_rash
Binary to create doc from
rash
code -
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
typst-ansi-hl
highlights your Typst code using ANSI escape sequences
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
aki-mcolor
mark up text with color
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
tre-regex
Rust safe bindings to the TRE regex module
-
portmanteau
create portmanteaux
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
hydroper_source_text
Source text containing line locations
-
simple_peg
A command-line peg parser implemented by Rust
-
utf8path
String-friendly path type
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
zh_num
Convert ASCII numbers and zh words
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantra
framework -
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
dekor
styling and character repository in Rust
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
streampager
pager for command output or large files
-
eternaltwin_constants
Hammerfest store implementation
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
tower_allowed_hosts
tower allowed hosts layer
-
sapling-renderdag
Render a DAG into ASCII or Unicode text
-
kataru
YAML-based dialogue engine for game development
-
strip-tags
Strip HTML and PHP tags from strings
-
jawk
JSON AWK
-
yozuk
Chatbot for Programmers
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
easy_random
Generate random data easily with easy_random :)
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
address_book
Інструмент командного рядка для парсингу телефонних номерів, ідентифікаторів, дат та неправильних…
-
sbert
Sentence Bert (SBert)
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
text-tokenizer
Custom text tokenizer
-
metatron
core library
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
srake
RAKE Algoritm
-
mathemascii
AsciiMath parsing and rendering
-
malachi
A domain specific pattern matching language made for defining bot commands
-
sixbit
Small packed strings
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
reason-shell
Reason: A Shell for Research Papers
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
gspell
Rust bindings for gspell
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
fluxcap
time expression parsing inspired on duckling
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
mitex-parser
Parser for MiTeX
-
tergo-formatter
Formatter for tergo
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
rustblocks
tetromino based falling blocks game
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
angr
analyse ngrams in text files
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
supply-chain-trust-example-crate-000066
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
kanbanrs
A basic kanban task manager written with egui
-
ik-rs
chinese segment, ik-analyzer for rust
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
etch
Not just a text formatter, don't mark it down, etch it
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
text_trees
textual output for tree-like structures
-
image-to-ascii
Converts images and gifs to ascii art
-
cindex
CSV indexing library
-
lisho
personal link shortener with no external dependencies in under 200 lines of Rust
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
man
Generate structured man pages
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
slugify
Macro for flexible slug generation
-
readability
Port of arc90's readability project to rust
-
linkcheck2
extracting and validating links
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
wiki_corpus_grammar
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
verify_string
verifying strings
-
cogni
Unix native interface for LLMs
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
mdbook-tools
A collection of tools for mdbook
-
tu
CLI tool to convert a natural language date/time string to UTC
-
brzozowski
Work with regular expressions using Brzozowski derivatives. "
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
libcoerced
serde_json and generic text content matching, verification and coercion
-
text_lines
Information about lines of text in a string
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
regex_generate
Use regular expressions to generate text
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
latinrs
Rust package that transliterates text and characters into thier latin form
-
blitztext
fast keyword extraction and replacement in strings
-
owoify_rs
port of mohan-cao's owoify-js is released under MIT license, which is also the same license of owoify-js. This simple library will turn any string into nonsensical babyspeak similar…
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
bigstr
A command-line tool to make string BIG
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
google-fonts
Download and cache TTF fonts from Google
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
gregex
Regex solver utilizing NFA
-
docket
markdown to HTML documentation rendering
-
thesaurus
An offline thesaurus library for Rust
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
formatjson
Formats JSON files
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
tnipv-lint
lints for tnipv, the Telcoin Network Improvement Proposal validator
-
rasciigraph
function to plot ascii graphs
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
htmd
A turndown.js inspired HTML to Markdown converter
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
rofi-mode
High-level library for creating Rofi plugins and custom modes
-
easy-regex
Make long regular expressions like pseudocodes
-
markov-text
creating a small markov model for text generation
-
tgrep
Toy grep that honors .gitignore
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
marker
finding issues in CommonMark documents
-
argentum_user_business
Business layer of user component
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
tzgrep
grep tar.gz
-
quake_text
Utils for Quake strings and characters
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
encoding-next
Character encoding support for Rust
-
jawohl
Complete JSON documents from a stream of JSON tokens
-
charx
A replacement for char::is_ascii*
-
case
A set of letter case string helpers
-
justcsv
CSV-file reader/writer
-
anslatortray
translate from English to Pig Latin!
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
screen_printer
displaying rectangular blocks of text to a terminal
-
flowistry_ide
Information Flow in the IDE for Rust
-
mdbook-fs-summary
Summary generator for mdbook
-
hmd
Custom Markdown Engine for my personal blog
-
r-matrix
Rust port of cmatrix
-
term_grid
formatting strings into a grid layout
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
isbn
handling ISBNs
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
aho-corasick
Fast multiple substring searching
-
markov
A generic markov chain implementation in Rust
-
codespan_preprocessed
Beautiful diagnostic reporting for M4 (or cpp) preprocessed text files
-
utf8-io
Traits and types for UTF-8 I/O
-
b2c2-casl2
b2c2におけるCASL2の雑なパーサー?
-
mdbook-check-missing-md
A backend for
mdbook
which will find Markdowns you forgot on SUMMARY.md -
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
antex
Styled text and tree in terminal
-
textspan
Text span utility
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
character-set
High performance
set.contains(char)
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
pukram2html
converting Pukram-formatted text to HTML
-
czv
performing CSV-related operations for data engineering and analysis
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
mdrss
generating RSS feeds from markdown files
-
supply-chain-trust-example-crate-000037
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
worcher
full-text search for static websites
-
rpgtools
Collection of utilities to help run tabletop role-playing games
-
rs-line-count
counts number of lines(' ')
-
to-binary
Converts From Hex, UTF-8, Vector of Bytes, or a Byte Slice to a Binary String And Implements Many Useful Functions For Binary Strings
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
slow-scan-print
Text is printed at fixed intervals by character or by line, and its name is inspired by SSTV (Slow Scan TV)
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
somedoc
A very simple document model and markup generator
-
lodestone
A website wrapper for FFXIV's lodestone
-
pullup
Convert between markup formats
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
elden-ring-saver
ansi2
-
gender_guesser
that guesses the gender of a given name
-
bubble-bath
Small and quick HTML sanitizer
-
indented_text_writer
IndentedTextWriter
-
rsnltk
Rust-based Natural Language Toolkit
-
Person_Parser
Rust parser
-
forbidden-bands
8-bit string handling library
-
console-utilities
handling console input and output utilities
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
visa
(Virtual Instrument Software Architecture) Wrapper which allows concurrent and dynamic switching between visa implementations from different vendors
-
unflow
DSL to convert design to code
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
markov_str
Markov Chain implementation optimized for text generation
-
roe
Unicode case conversion
-
cheetah
An extremely fast and flexible static site generator
-
align
aligning text
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
xconv
A high-performance batch file encoding conversion tool
-
instant-segment
Fast English word segmentation
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
isixhosa
help process text in isiXhosa
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
chunkr
A fast and quick chunking library for rust
-
wz
Count words, fast
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
my_parser_rrin
My parser
-
squidge
shortens delimited data
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
utils_rust
这是一个用于各种实用功能的 Rust 库
-
wcount
CLI word counting tool
-
gdnative-doc
Documentation tool for gdnative
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
cosmic-text
Pure Rust multi-line text handling
-
minify-html-common
Common code and data for minify-html*
-
aqkanji2koe
wrapper for Rust
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
searcher_txt
A copy of grep that i mate to train my rust skills
-
rust-regex-dsl
Regular expression DSL
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
dnd_spellbook_maker
making pdf documents of spells that ressemble 5th edition D&D official source book spell descriptions
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
xi-rope
A generic rope data structure built on top of B-Trees
-
sttx
belt for transforming speech-to-text data
-
ultra
Cryptanalysis of the Engima
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
reword
some utility functions for human-readable formatting of words
-
tagsearch
Filter plaintext files based on @keyword tags
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
pho
PHOnetic alphabet renderer
-
text-colorizer
Transitionary package
-
ncase
Enforce a case style
-
sapling-streampager
streampager is a pager for command output or large files
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
unicode-casing
Titlecase helper function on characters
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
ungoliant
The pipeline for the OSCAR corpus
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
testing_table
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
notedown_ast
Notedown Abstract Syntax Tree
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
wool
Preview Github Markdown Offline
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
tangler
Extracts code blocks from Markdown documents
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
text_unit
Newtypes for text offsets
-
nih_log
An opinionated yet flexible logger catering to the needs of the NIH-plug plugin framework
-
quickmd
Quickly preview a markdown file
-
punycode
Functions to decode and encode Punycode
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
lister-cli
Lister: Navigate Markdown Lists
-
markdown-gen
generating Markdown files
-
jp_utils
Utils for working with Japanese text
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
dispnet-hash
Hash Library used for Dispnet
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
cyrconv
A funny faux cyrillic character mapper
-
document_tree
reStructuredText’s DocumentTree representation
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
strsplit
Split a string slice using a delimiter of your choice
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
dtex
Better TeX
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
abbreviator
abbreviating long words
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
flashtext2
The FlashText algorithm implemented in Rust
-
markdown-table
Creating markdown tables with Rust!
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
yinzhe9
喵喵隐者9
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
hitori
Generic compile-time regular expressions
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
neuralassimilator
fine-tuning Language Learning Models (LLMs) from unstructured text
-
mitex-spec
Specification Library for MiTeX
-
asciidork-opts
Asciidork opts
-
galm
pattern matching library
-
clima
A minimal Markdown reader in the terminal
-
prototty_render
Traits for defining a renderer and a view
-
ps-str
String transcoding library
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
unicode_names2_generator
Generates the perfect-hash function used by
unicode_names2
-
recase
Changes the convention case of input text
-
texcore
Create LaTeX documents using native Rust types
-
rep-grep
wgrep/write-grep CLI
-
display_bytes
Human-readable display of byte sequences
-
uwuify
fastest text uwuifier in the west
-
hebrew
alephbet primatives and parsing library for rust
-
bk-tree
A Rust BK-tree implementation
-
fast_aug
Fast data augmentation for text
-
plsfix
Text cleaner upper
-
imperative
Check for imperative mood in text
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
rusty-x
A snippet manager in Rust
-
mdbook-mdinclude
Preprocessor for mdBook that better handles including markdown files
-
p4d-mdproof
Markdown to PDF converter
-
story-dl
Story web scraping
-
hexstring
handling hexadecimal string
-
koji
An interactive CLI for creating conventional commits
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
akai
Next-generation, functional and simple ID system
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
rusty_tools
useful tools for rust
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
veg
Flexible tables
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
pulldown_mdbook
A pull parser for mdBook
-
txt_to_md
Command converting from a txt file to a markdown file
-
markovish
Markov chain implementation for text generation
-
wantora
wantora工具
-
nfa_regex
NFA regex engine for text processing
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
cellumina
configure and run cellular automata
-
adobe-cmap-parser
parse Adobe CMap files
-
markdown-toc
Markdown Table of Contents generator
-
code-splitter
Split code into semantic chunks using tree-sitter
-
character_frequency
counting character frequencies in a string concurrently
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
godotdoc
documentation generator for GdScript
-
twitter-text
in Rust
-
slugify-rs
generate slugs from strings
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
founder
The package provides a font toolbox
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
br-pdf
PDF Invoice Processing
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
subject-classifier
classifying a commit by it's subject
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
khat
A cat clone, nothing more nothing less
-
path2regex
Express style path to RegExp utility
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
qm-role-build
Role builder from markdown tables
-
vec-string-to-static-str
providing utilities for converting vectors of
String
s into vectors of&'static str
-
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
detect-newline-style
Determine a string's preferred newline character
-
pdfrust
PDF parser
-
bstr
A string type that is not required to be valid UTF-8
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
sayit
String replacements using regex
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
speech-synthesis
Common traits for speech synthesis
-
rust-texas
generate latex documents
-
faster-chars-count
counting length of chars faster than Chars::count()
-
ansi-width
Calculate the width of a string when printed to the terminal
-
bge
Rust interface for BGE Small English Embedding Library
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
find_unicode
Find Unicode characters, the easy way!
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
hyphenation_commons
Proemial code for the
hyphenation
library -
neo-mime
Strongly Typed Mimes
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
mdbook-najan
Preprocessor for the Najan mdBook
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
rascii_art
Advanced ASCII Art Generator
-
mdbook_incremental_preview
Incrementally preview mdBook, patching changes instead of making full rebuilds
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
iver
-
ob
A Blog and RSS system written in Rust
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
mdbook-force-relative-links
An
mdbook
pre-processor to transform all local links to relative ones -
streamparser
Adapt simple stream parsers for different types of streams
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
aki-mline
match line, regex text filter like a grep of linux command
-
kas-text
Text layout and font management
-
ftrace
trace files and paths
-
boss
Baseball Open Source Software
-
reddish
A Rust utility library
-
censor
text profanity filter
-
quickner-core
A fast and simple NER tool
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
synox
Program synthesis of string transformations from input-output examples
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
cur
that will hunt for your regular expression
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
deepl-rustls
DeepL API - RusTLS Fork
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
ranting
Linguistic formatting placeholder extensions for rust
-
runestr-pancjkv
rune
-based Pan-CJKV support -
notegraf
Core library for building a graph-oriented notebook
-
emoji
Every emoji, their metadata, and localized annotations
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
weggli-ruleset
Ruleset managment/matching/reporting for weggli patterns
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
ryaspeller
lib for searching typos in text, files and websites
-
split-every
Split for every n occurrences of a pattern iteratively!
-
uwu-rs
uwuifying library
-
mdcat-http-reqwest
Reqwest-based HTTP resource handler for mdcat
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
repgrep
An interactive command line replacer for
ripgrep
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
strfry
randomize a string
-
mdtranslation
prepare multi-lingual Markdown documents
-
regex-split
split_inclusive for the regex crate
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
grammateus
facilitate working with Ancient Greek words
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
testcall
companinon crate to bintest, implements test facilities
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
invisible_unicode
finding invisible unicode characters
-
haoxue-dict
Chinese dictionary and word segmenter
-
markdown-extract
Extract sections of a markdown file
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
sauropod
Stellular Shared Resources
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
grep-reader
short text for crates.io
-
char_reader
Safely read wild streams as chars or lines
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
string-utility
substring implementation (with the RangeBound trait), keep string before or after a specified string, etc
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
sortuniq
Find or count unique values in an input stream
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
regexgrep
ripgrep tool that suports regular expressions
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
unicode-vo
Unicode vertical orientation detection
-
skyspell_kak
skyspell - kakoune integration
-
langsan
sanitizing language model input and output
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
castle_shared_parser
Castle Shared Parser
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
refac
Transform some text given a generic natural language prompt
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
strizer
minimal and fast library for text tokenization
-
nibi
static site generator. (WIP, implementing)
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
dequote
Remove nested quotes around text
-
sapling-drawdag
Parse an ASCII DAG into graph edges
-
sedregex
Sed-like regex library
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
enum-ts
TypeScript Enum pattern matcher codegen
-
tradukisto
Kinda useful natural language translation library and utility
-
pulldown_typst
A pull parser for Typst markup
-
bzzz
Bee-Speak Translator
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
text_to_emoji
Convert text to emoji
-
hello_lib
Demonstrate Generics Function
-
ucfirst
Uppercase the first letter of a string
-
in_rainbows_printer
Prints some In Rainbows-style (the Radiohead album) text to your termial
-
umlauts
text transformation of german umlauts
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
encoded-words
Encoded Words for usage in MIME headers
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
gimme
Pull useful data out of your clipboard
-
scatternotes
A cli application to manage unstructured notes
-
snake_case_converter
convert strings to snake case
-
spellabet
Convert characters into spelling alphabet code words
-
jcalendar
Japanese Calendar for Rust
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
irg-kvariants
wrapper around kvariant from hfhchan/irg
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
fast_symspell
Spelling correction & Fuzzy search
-
csvsc
Build processing chains for CSV files
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
explore
terminal file explorer
-
texting
string helpers
-
mdzk
Plain text Zettelkasten based on mdBook
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
common_substrings
Finding all common strings
-
redpatterns
a list of patterns for scanners 📟
-
charclass
define and modify unicode character classes
-
qpprint
console printing/formatting
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
zhlint
A linting tool for Chinese text content
-
chisel-lexers
Chisel backend lexers/scanners
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
rl-sys
Native bindings to libreadline
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
asciimath-text-renderer
Render asciimath in terminal
-
bureau
extracting dependency licenses, used by nertsio
-
loading
Show loading in terminal
-
chardet
rust version of chardet
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
mdbook-indexing
mdbook preprocessor for index generation
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
ucd-raw
Uninterpreted access to the unicode UCD
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
cfasttext-sys
fastText ffi binding
-
ferrissay
cowsay
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
hi-doc
Diagnostics formatting library
-
unindenter
unindent text
-
caser
Change text between PascalCase, camelCase, and snake_case
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
rustrings
Strings manipulation for Rust
-
rulet
figlet implementation
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
help_crafter
help message generator without hussle
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
write16
A UTF-16 analog of the Write trait
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
pdf_composer_definitions
PDF Composer definitions crate
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
grace-cli
CLI tool for processing files and strings
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
minigrepsada
Test
-
enso-lazy-reader
An efficient buffered reader
-
uwu_cli
uwuifying the terminal
-
text_layout
Text layout algorithms
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
uwubot
discord bot for uwuifying text
-
no-comment
Remove rust-style line and block comments from a char iterator
-
mask-text
mask text with multiple masking options
-
hyper-old-types
HTTP types from hyper 0.11.x
-
wattle
tools for rust
-
nmd-core
Official NMD compiler core
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
wordpieces
Split tokens into word pieces
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
text-diff
text diffing and assertion library
-
translitrs
Transliteration utility for Serbian language
-
json-peek
Amature JSON parser library designed for my specific need
-
tiniestsegmenter
Compact Japanese segmenter
-
aki-mcycle
mark up text with cycling color
-
any2utf8
Convert any enncoding to utf-8
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
is-vowel
Heuristically test whether a character is a vowel letter
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
tabwriter
Elastic tabstops
-
slack_update
app to set Slack status, emoji and photo
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
buss-protocol
Bussin binary protocol
-
clippers
Cross-platform clipboard management library
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
markov_strings
A simplistic Markov chain text generator
-
afrim-memory
handle of sequential codes easier for an input method
-
vape
full width aesthetics
-
utfx
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
event_parser
Rust NLP library for parsing English natural language into icalendar events
-
macro_colors
colorful printing macros
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
stylish-core
The semver-stable subset of
stylish
-
encoding_c
C API for encoding_rs
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
pinot
Fast, high-fidelity OpenType parser
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
mdbook-plugin-utils
mdBook plugins
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
b2c2-flag
b2c2のコマンドライン引数を保持するだけのやつ
-
stardict
Rust implement stardict
-
emoji_converter
Converts text to emojis
-
looking-glass
reflection & type-erasure library for Rust
-
rust_baht_text
Convert number to Thai Baht text
-
naromat
Convert text to narou novel format
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
rcut
replacement for GNU cut that supports UTF-8
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
line_adjustment
Line adjustment example
-
vidyut-kosha
A Sanskrit key-value store
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
kirjat-rs
prices for finnish textbooks from multiple stores
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
goodname
assist you with cool naming of your methods and software
-
charmap
one-to-(none/one/many) character mapping
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
snekdown
A parser for the custom snekdown markdown syntax
-
korean_regex
Regex extension for Hangeul analysis
-
confusables
around Unicode confusables/homoglyphs
-
trie-match
Fast match macro
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
extract-words
Extracts words from text without allocation
-
veryfi
Module for communicating with the Veryfi OCR API
-
igpay-atinlay
Translate text to Pig Latin
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
lithe
A Slim template engine by using Pest
-
interslavic
in rust
-
collapse
Trim and collapse consecutive whitespace to a single space
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
group-similar
Group similar values based on Jaro-Winkler distance
-
playin-cards
♠ Library for French-suited playing cards
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
linkcheck
extracting and validating links
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
uiuifree-normalize
uiuifree text normalize
-
rscx-mdx
MDX support for RSCx
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
text_maid
Text maid
-
timeblok
A language for event scheduling in plain text
-
text_searcher
text searcher tool
-
wcrs
GNU wc in Rust
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
polyhorn-ios
Polyhorn for iOS
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
grepox
Minimalist's grep written in Rust
-
mdx
in Rust
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
asciir
Print ASCII table/values
-
utf-cli
A UTF code viewer
-
parattice
Recursive paraphrase lattice generator
-
toml-test-data
TOML test cases
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
bocu1
BOCU-1 compressed unicode encoding
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
spoilers
High-level Rust bindings for CTranslate2
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
genex
Text-expansion library
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
twjsrs
(De)serialize TiddlyWiki tiddlers to and from their JSON format
-
gregex-logic
Logic for the gregex crate
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
ferris-finder
Unicode character finder with fuzzy search
-
assert-text
the testing macro tools
-
spongemock
CLI-tool for random text capitalization
-
kanpyo-dict
Dictionary Library for Kanpyo
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
strange
A static website generator
-
small-bwt
BWT construction in small space
-
text-utils
Text utils for unescaping and align
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
stylish-stringlike
API for string-like objects that have styles applied
-
unfurl
expanding links in text
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
matchpick
Find and replace multi-lines using a match-case
-
harper
implement SSGs
-
md2gemtext
for converting Markdown into gemtext
-
indentation
Formatter
-
mitex
TeX2Typst converter
-
minigre_base
text file search tool
-
cha-rs
Extract specific characters from an input
-
mdbook-twiki
twiki backend for mdbook
-
genpdf
User-friendly PDF generator written in pure Rust
-
shear
trimming excess contents from things
-
str_overlap
Methods for finding the overlap between two string slices
-
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
base256_lib
Base256 encoder/decoder
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
neardup
near-duplicate matching
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
kanpyo
Japanese Morphological Analyzer
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
hina
:]
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
yozuk-core-skillset
Set of default Yozuk skills
-
runanum
Существительные с правильными окончаниями после чисел
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
mudders
Generating Lexicographically-Evenly-Spaced Strings, or: Mudder.js in Rust
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
delay_writer
Wraps a writer and delays its output after each newline
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
shutil
Shell utility helper library
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
mrdocument
Automatic PDF transcription and classification via OpenAI
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
pink-runtime
A very minimal replecement based DSL, intended for math
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
stardict_wrapper
Rust implement stardict
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
basic_lib_for_me
just a basic lib that i will use usually
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
amongify
A very ඞ sus ඞ program
-
console-traits
Basic console traits, such as setting char position and writing strings
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
imatree
command line tool to generate aesthetically pleasing (and sometimes not so pleasing) images of whatever string you want
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
bnf_sampler
that uses recursive descent algorithm to ensure tokens produced by a large language model follow a Backus Naur Form schema
-
cyrla
two-way conversion between latin and cyrillic script
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
sauron-markdown
parsing markdown into sauron node
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
pathmut
Command line utility for extracting, removing, and replacing components from path strings
-
learning_fromdocs_minigrep
Project made on the path toward learning Rust from its docs. Completely personal and to test how to upload to crates.io.
-
html-compare
compare html files
-
markdown2unicode
Converter from markdown notation to unicode characters
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
arabic-script
An expressive API for the characters of the Arabic script
-
wcounter
Give the word and count the appearance
-
unicode_escape
decoding escape sequences in strings
-
recursive-file-loader
recursively load files via references in the files
-
mutf8
Basic support for mutf8 strings
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
mdbook-chapter-list
mdbook preprocessor for listing subchapters
-
staple
powerful static site generator
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
ocrmypdf-rs
A sdk for the ocrmypdf command line tool
-
opt2doc
generate documentation from Rust source code
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
strip_markdown
remove markdown syntax from markdown files
-
matrix_display
display a matrix in command line with pretty unicode separators
-
mdict
Rust implement mdict
-
transcript
A transcriber for European scripts
-
floem-cosmic-text
Unofficial cosmic-text fork for Floem
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
uchr
translate char values to the characters themselves, and vice versa
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
spacebar
An anti-plagiarism tool based on null width characters
-
summary
Extract the sentences which best summarize a document
-
afrim-translator
Manage the predication system of the afrim input method
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
termbook
behind the
termbook-cli
-
uniwhat
Display the unicode characters text
-
datadriven
Rewritable table-driven testing
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
rex-regextract
extracts key value pairs out of text
-
termwrap
Wrap Unicode text with ANSI color codes
-
bpmf_py
A Bopomofo and Pinyin library
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
alpha-counter
Alphabetic counter
-
arbitrator
Format text based on a set of rules and regexes
-
ucd-util
A small utility library for working with the Unicode character database
-
shoebill
A Wadler/Leijen style pretty-printer
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
glyphana
Quickly find, inspect & collect unicode glyps
-
utf8-command
UTF-8 encoded
std::process::Command
output -
boxy
Declarative builder for Unicode box-drawing characters
-
trim
trim whitespaces from files
-
marcus
An experimental Markdown parser written in Rust
-
zw
encoding and decoding text using zero-width characters
-
presciidoc
Preprocessing AsciiDoc for other tools
-
readwise
wrapper for the Readwise API
-
mdbook-footnote
mdbook preprocessor for footnotes
-
aki-xtee
copy standard input to each files and standard output
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
pra
Print Random ASCII
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
hunspell-rs
Rust bindings to the Hunspell library
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
sc_decode
Special character decoding
-
leptos-markdown
A component which can render markdown as html element in leptos
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
verba
working with Latin words
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
mdbook-unlink
A mdBook backend that validates local links
-
str-block
Remove the initial newline and common indentation from string literals
-
rammer
A Baysian Text Classifier for Spam/Ham
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
unidok
A powerful, readable, easy-to-learn markup language
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
scalar-value-string
Scalar Value String
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
kincaid
A word statistics library in Rust
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
genkit
A common generator kit for static site generator
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
inslice
A command-line utility for filtering text input by columns and rows
-
remake
writing maintainable regex and managing symbol soup
-
braille_pics
producing text-art pictures using Braille characters
-
yara-x-parser
A parsing library for YARA rules
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
syntastica-themes
A collection of themes for syntastica
-
trans-case
Transform case
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
skribo
low-level text layout
-
ascii_converter
converting between different ascii representations
-
jput
puts and putc on unicode-width align for Rust
-
jtxt
A JavaScript syntax text processing tool, an awk alternative
-
syllable
counter for use with reading level calculations
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
stringsext
find multi-byte-encoded strings in binary data
-
matchable
Check if text is matching against string or regex in one API
-
vibrato
viterbi-based accelerated tokenizer
-
pulldown-cmark-fork
A pull parser for CommonMark
-
glyph_brush_layout
Text layout for ab_glyph
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
genere
randomization of text respecting grammatical gender of sentences
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
owoify
Text owoification library
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
indoc
Indented document literals
-
libxdiff
Rust bindings for the libxdiff C library
-
struckdown
A structured markdown / commonmark library for Rust
-
askama-markdown-cmark
Askama filter for markdown, using pulldown-cmark
-
basic-text-internals
Basic Text string literal implementation details
-
sigalign-impl
implementations for core
-
mpdf
writing PDF files
-
md-localizer
Localize markdown with remote links
-
casaubon
A feature-poor static site generator
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
uecho
The unicode of the echo command
-
text_styled
Trait to style String and &str
-
timfmt
A small utility for formatting code as Tim likes it
-
moscato
Outline scaler for OpenType glyphs
-
b2c2-common
b2c2の共通部分
-
morc
Dead simple, minimal markdown generator library written in Rust
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
pdfgen
PDF rendering library
-
ron_to_table
pretty print RON as a table
-
ipa-translate
translating between IPA and ASCII text
-
zummi
fun lib that produces spoonerisms
-
twitter_text_conformance
twitter-text in Rust
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
minigrepns
A mini version of famous grep application that searches texts on files
-
vndb_rs
Bare-bones VNDB api
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
icu_casemapping
Unicode case mapping algorithms
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
marko
Programmtically format text with Markdown syntax
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
uwildmat
wildmat/uwildmat matching
-
textos
Texts, strings, formatting, unicode…
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
mul
Bengali stemmer
-
md-include
include any file in markdown files
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
ctf-brute
Brute-force utilities for Rust
-
llmvm-outsource-lib
outsource backend for llmvm
-
detect-indent
Detect the indentation of code
-
sparklet
small flashcards library
-
mdplayscript
An extension of Markdown for play scripts
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
ogrep
searching in indentation-structured texts
-
fmt-extra
Extra formaters not shipped in the standard library
-
random_access_unicode
Provide random access to the unicode characters in a file
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
cutters
Rule based sentence segmentation library
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
irssi-sys
Automatically generated bindings to irssi
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
quill_delta_pdf
Convert Quill Delta to PDF
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
pretok
A string pre-tokenizer for C-like syntaxes
-
text2art
translate text to ascii art
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
fmty
Composable
core::fmt
utilities -
modeling
tools to analysis different languages by Ctags
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
mdbook-rust
Enhanced Rust support for MDBook
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
mdbook-translation
prepare multi-lingual mdBook books
-
ssml-parser
parsing speech sythnesis markup language
-
code-span
Add additional infomation to code character
-
mqtt-analyzer
Help analyzing MQTT messages
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
m_lexer
extensible regular expressions based lexer
-
rusile
components for the SILE typesetter
-
sydney
Vim-like, Command-line Gemini Client
-
ngrams
Generate n-grams from sequences
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
aki-stats
output the statistics of text, like a wc of linux command
-
unicode-canvas
creating text base drawing
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
regex-automata
Automata construction and matching using regular expressions
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
env2toml
Convert env vars to toml text
-
yozuk-helper-english
English NLP utilities for Yozuk
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
slidedeck
Create an HTML slide deck from Markdown
-
libflagup
Display a country's flag as an emoji
-
spdkit-surface
short text for crates.io
-
ergrep
grep strings within a line from a text file
-
bbcode-tagger
BBCode tree parser and tagger
-
like
A SQL like style pattern matching
-
llmvm-core-lib
llmvm core application
-
shallow
long text
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
supply-chain-trust-example-crate-000068
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
wildflower
Wildcard matching against strings
-
cermic
Print .png images in your terminal as ascii art!
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
mdbook-newday
Command line utility to add a new line for today to the top of an mdbook SUMMARY.md book
-
dr
Command-line data file processing in Rust
-
text_alignment
Align your text in Rust in the CLI
-
noneifempty
Convert T to None if empty
-
hoedown
bindings for the Hoedown markdown processor