Text processing

regex

regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.

v1.11.1 19.5M no-std #regex #regex-engine #regex-automata #automata #parser
textwrap

word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.

v0.16.2 7.1M #text-formatting #hyphenation #typesetting #unicode-text #wrap
encoding_rs

A Gecko-oriented implementation of the Encoding Standard

v0.8.35 10.4M no-std #unicode #charset #web #standard #big5-hanzi-encode #encoder
similar

A diff library for Rust

v2.7.0 4.8M #unified-diff #difference #patience #change #diff
fancy-regex

regexes, supporting a relatively rich set of features, including backreferences and look-around

v0.14.0 4.9M no-std #regex #backreferences #performance #re
heck

case conversion library

v0.5.0 26.0M no-std #camel-case #snake-case #unicode
const_format

Compile-time string formatting

v0.2.34 3.3M no-std #formatting #concat #no-std #macro #arguments
unicode-normalization

functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

v0.1.24 9.8M no-std #unicode-normalization #recomposition #unicode-text #decomposition #text #unicode #normalization
convert_case

Convert strings into any case

v0.8.0 10.4M #casing #camel-case #title-case #case-converter #string #boundaries #case
unicode-segmentation

Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules

v1.12.0 8.8M no-std #unicode-segmentation #unicode-text #word #unicode #boundary #grapheme #text
ropey

A fast and robust text rope for Rust

v2.0.0-alpha.2 138K #rope #edit #buffer #text-edit #text
lazy-regex

lazy static regular expressions checked at compile time

v3.4.1 1.1M no-std #regex #lazy-evaluation #lazy-regex #macro #static #lazy-static-regex
pulldown-cmark

A pull parser for CommonMark

v0.13.0 1.4M bin+lib #common-mark #markdown #pulldown-cmark #parser
unicase

A case-insensitive wrapper around strings

v2.8.1 7.2M no-std #case-insensitive #case-folding #no-std #lower-case
deunicode

Convert Unicode strings to pure ASCII by intelligently transliterating them. Supports Emoji and Chinese.

v1.6.1 1.0M no-std #emoji #transliteration #ascii #unicode #unidecode
scraper

HTML parsing and querying with CSS selectors

v0.23.1 411K bin+lib #css-selectors #web-scraping #selector #web-page #scraping #css
unicode-bidi

Unicode Bidirectional Algorithm

v0.3.18 7.0M no-std #text-layout #bidi #rtl #unicode-text #unicode #unicode-text-layout #browser #text
rustybuzz

A complete harfbuzz shaping algorithm port to Rust

v0.20.1 329K no-std #text-shaping #opentype #true-type #shaping
html2text

Render HTML as plain text

v0.14.3 74K #text-html #text #html #html-text
emojis

✨ Look up emoji in O(1) time, access metadata and GitHub shortcodes, iterate over all emoji, and more!

v0.6.4 69K no-std #emoji #gemoji #unicode #github
ammonia

HTML Sanitization

v4.0.0 219K #input-validation #security #xss #html #sanitization
lopdf

PDF document manipulation

v0.36.0 113K #pdf #editing #merge #manipulation #operand
termimad

Markdown Renderer for the Terminal

v0.31.3 59K #tui #markdown-renderer #markdown-parser #terminal #applications
widestring

wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libraries. Both u16 and u32 string types are provided, including support for UTF-16 and UTF-32…

v1.2.0 1.9M no-std #utf-16 #winapi #wide-string #utf-32
mdbook

Creates a book from markdown files

v0.4.48 167K bin+lib #book #markdown #render-markdown #rust-book #gitbook
lngcnv

linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna

v1.10.1 4.0K app #linguistics #phonetic #spelling #language #speech #text-processing
strip-ansi-escapes

Strip ANSI escape sequences from byte streams

v0.2.1 802K #ansi-term #ansi-escapes #strip-ansi #ansi-escaping #ansi-terminal #terminal #writer
prettydiff

Side-by-side diff for two files

v0.8.0 118K #diff #change #text #compare #word
fuzzy-matcher

Fuzzy Matching Library

v0.3.7 330K #text-search #fuzzy-search #fuzzy-matching #match #search #text
unicode-general-category

Fast lookup of the Unicode General Category property for char

v1.0.0 939K no-std #unicode #no-std #category #general
regress

A regular expression engine targeting EcmaScript syntax

v0.10.3 1.0M no-std #regex #syntax #regress
linkify

Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.

v0.10.0 55K #link #url #web #text
pulldown-cmark-to-cmark

Convert pulldown-cmark Events back to the string they were parsed from

v21.0.0 260K #markdown-converter #common-mark #pulldown-cmark #markdown #render
text-splitter

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

v0.25.1 33K #nlp #artificial-intelligence #tokenize #split
finl_unicode

handling Unicode functionality for finl (categories and grapheme segmentation)

v1.3.0 562K #unicode-segmentation #unicode #grapheme #identification #segmentation #cluster
lindera

A morphological analysis library

v0.41.0 29K #morphological-analysis #tokenize #library #multilingual #dictionary #analysis
printpdf

reading and writing PDF files

v0.8.2 14K #pdf #gui #pdf-generation #wkhtmltopdf #graphics
onig

Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.

v6.4.0 618K #regex #oniguruma #onig #bindings #source
garde

Validation library

v0.22.0 45K #validation #garde #framework #ascii #rules #string #length #derive #valid
titlecase

Capitalize text according to a style defined by John Gruber for Daring Fireball

v3.5.0 44K bin+lib #title-case #capitalization #style #wasm #capitalisation #case #title
font-kit

A cross-platform font loading library

v0.14.2 442K #font #loader #font-kit #name #postscript #back-end #rasterize-glyph
charabia

detect the language, tokenize the text and normalize the tokens

v0.9.3 17K #tokenize #language #normalize #document #segmenter #tokenizer
roff

ROFF (man page format) generation library

v0.2.2 257K #roff #page #name #italic #bit #description #synopsis
unicode-script

exposes the Unicode Script and Script_Extension properties from UAX #24

v0.5.7 519K #script #unicode #scripting-language #unicode-text #text
synoptic

low-level, syntax highlighting library with unicode support

v2.2.9 26K #unicode #rules #text-processing #comments #language
const-str

compile-time string operations

v0.6.2 310K no-std #proc-macro #string #operation #const #constant
unescaper

Unescape strings with escape sequences written out as literal characters

v0.1.5 618K #escaping #string #unescaper
Inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.11.4 864K #inflection #pluralize #snake-case #snake #camel
diff

An LCS based slice and string diffing implementation

v0.1.13 3.9M #diff #bar
nucleo

plug and play high performance fuzzy matcher

v0.5.0 13K #fuzzy-matching #fuzzy-search #matcher #fzf
mkrs

Build automation tool

v0.23.1 app #target #config #readability #mode #output #run #processing #targets-dependencies #documentation #default
os_display

Display strings in a safe platform-appropriate way

v0.1.4 50K no-std #shell #terminal #shell-terminal #terminal-text
diffy

Tools for finding and manipulating differences between files

v0.4.2 241K #diff #merge #patch
edit

Open a file in the default text editor

v0.1.5 26K #editing #text-editing #text-editors #cross-platform #cli
chardetng

A character encoding detector for legacy Web content

v0.1.17 137K #unicode #charset #web #content #operation
stringsext

find multi-byte-encoded strings in binary data

v2.3.5 470 app #unicode #string-search #stringsext #forensics #getreu
inlinable_string

inlinable_string crate provides the InlinableString type – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and the StringExt trait…

v0.1.15 532K no-std #inline #inlinable #string #10
hyperlink

Very fast link checker for CI

v0.1.44 2.1K app #link-checker #link-checking #validation #broken-link-finder #linter #ci #action
smartcat

Putting a brain behind cat. A CLI to bring language models into the Unix ecosystem 🐈‍⬛

v2.2.0 app #chatgpt #artificial-intelligence #cat #pipe #workflow #task
cruet

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.15.0 25K #inflection #pluralize #camel-case #camel #snake
line-index

Maps flat TextSize offsets to/from (line, column) representation

v0.1.2 60K #line-index #ide #index #language-server
wana_kana

checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji

v4.0.0 11K bin+lib #hiragana #japanese #katakana #romaji #kana
uuhelp_parser

A collection of functions to parse the markdown code of help files

v0.0.30 22K #coreutils #parser #name #text #info #gnu-coreutils
str_indices

Count and convert between indexing schemes on string slices

v0.4.4 148K no-std #string #utf-16 #indices #no-std #text
whyq

jq wrapper

v0.10.2 600 app #jq #yaml #toml #yq
mdxjs

Compile MDX to JavaScript in Rust

v0.3.4 1.5K #mdx #markdown #compile #javascript #gfm
xan

The CSV magician

v0.49.0 1.4K app #csv-tsv #csv #tsv #aggregation #terminal #magician #column #format #row #cli
ferris-says

A Rust-flavored replacement for the classic cowsay

v0.3.2 6.4K #cowsay #rustaceans #print #ferris #fsays
unicode_names2

Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…

v1.3.0 150K no-std #unicode #unicode-text #unicode-names2 #text
stringzilla

Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances

v3.12.4 800 sys no-std #string-search #hash #information-retrieval #search #sorting #operation
autocorrect

A linter and formatter to help you improve copywriting by correcting spaces, words, and punctuation between CJK (Chinese, Japanese, Korean)

v2.13.3 500 #linter #spell-check #lint #cjk #copywriting
entities

raw data needed to convert to and from HTML entities

v1.0.2-rc.1 93K #html-entities #escaping #character #html #character-escaping
ascii

ASCII-only equivalents to char, str and String

v1.1.0 1.7M no-std #ascii #libstd #ascii-string
blockwatch

Linter that tracks changes between dependent blocks of code

v0.1.13 950 app #blockwatch #block #ignored #file #language #how #action #supported-languages
jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

v0.7.2 38K #nlp #chinese #segmenation #chinese-word-segmentation
mdbook-katex

mdBook preprocessor rendering LaTeX equations to HTML

v0.9.3 2.4K bin+lib #mdbook #katex #latex #delimiter
google-translate2-cli

A complete library to interact with Translate (protocol v2)

v6.0.0+20170525 1.1K app #translation #google-translate #google #google-cli #translate #cli #api
epub-builder

generating EPUB files

v0.8.0 1.1K #epub #epub-builder #builder #default #toc-element #epub-content #zip-library
unicode-case-mapping

Fast lowercase, uppercase, and titlecase mapping for characters

v1.0.0 129K #case-mapping #title-case #upper-case #lower-case #unicode #unicode-characters #character
ncount

A word count tool intended to derive useful stats from markdown

v0.7.2 2.2K app #word-count #novel #text
mdbook-pdf

A backend for mdBook written in Rust for generating PDFs based on headless Chrome and the Chrome DevTools Protocol

v0.1.11 330 app #mdbook #pdf #rust-book #book #protocols
two_percent

Fuzzy Finder in rust!

v0.12.14 190 bin+lib #menu #skim #utilities #fuzzy #mode
scraps

static site generator based on Markdown files written with simple Wiki-link notation. It can be used primarily for personal or team knowledge management.

v0.21.5 700 app #static-site-generator #wiki #markdown
prema

convert markdown to html

v0.1.6 550 app #html #directory #markdown #themes #basic #footer #file #command
decancer

Removes common Unicode confusables/homoglyphs from strings

v3.2.8 10K #homoglyphs #security #confusable #unicode #moderation #character #applications
regex-cursor

regex fork that can search discontiguous haystacks

v0.1.5 5.0K #nfa-automata #dfa-automata #regex-automata #regex
hck

A sharp cut(1) clone

v0.11.4 100 bin+lib #regex #delimiter #compression #column #text #cli #literals #decompression #text-processing
llmvm-core

The core application for llmvm

v1.1.3 300 app #artificial-intelligence #llm #llmvm #back-end #template #thread #preset #workspace #api-bindings
apisnip

A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️

v1.4.56 1.1K app #openapi #tui #swagger
pdf-extract

extract content from pdfs

v0.9.0 15K #pdf #pdf2txt #pdf2text #text
prop-check-rs

A Property-based testing Library in Rust

v0.0.911 4.5K bin+lib #property-testing #property-based #testing #prop #machine #gens
hgrep

grep tool with human-friendly search output. Similar to the -C option of the grep command, but with output enhanced by syntax highlighting and focused on human readability.

v0.3.8 bin+lib #syntax-highlighting #grep #search #ripgrep #bat #directory
matchers

Regex matching on character and byte streams

v0.2.0 9.0M #regex #pattern-match #streaming #matcher
unicode-blocks

contains a list of all unicode blocks and provides some functions to search across them

v0.1.9 49K no-std #unicode-characters #block #unicode #cjk #character
unicode-xid

Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31

v0.2.6 10.3M no-std #xid #unicode #unicode-text #documentation #text
unindent

Remove a column of leading whitespace from a string

v0.2.4 5.7M #multi-line #literals #heredoc #string-literal #nowdoc #string #multiline
omekasy

Decorate alphanumeric characters in your input with various fonts, using special characters in Unicode

v1.3.1 460 app #script #omekasy #unicode #emoji #bold-italic #blackboard #monospace #sans #bold-script #bold-fraktur
mdbook-admonish

A preprocessor for mdbook to add Material Design admonishments

v1.19.0 3.5K bin+lib #mdbook #material-design #ui-design #markdown #material #ui
arrow-string

String kernels for arrow arrays

v55.0.0 2.0M #arrow-arrays #kernel #regex #array #parquet #arrow
bfom

Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?

v0.1.47 1.5K app #markdown #bfom #wrong
unicode_categories

Query Unicode category membership for chars

v0.1.1 3.3M #unicode #unicode-categories #char
cargodisttest

💬 a CLI for learning to distribute CLIs in rust

v0.20.5 1.4K app #cargodisttest #world
rust-stemmers

some popular snowball stemming algorithms

v1.2.0 488K #nlp #information-retrieval #stemming #algorithm
font-types

Scalar types used in fonts

v0.8.4 161K no-std #typography #font-types #font
regex-syntax

A regular expression parser

v0.8.5 30.2M no-std #regex-engine #ast #regex-automata #parser #dfa
aki-resort

sort lines of text. You can use regex to specify the KEY.

v0.1.25 1.4K bin+lib #sorting #aki-resort #filter #numeric #text #month #version #time #mar #jan
text_io

really simple to use panicking input functions

v0.1.13 24K #read-line #io #scanf #scan #io-read
trivet

Parser Library

v3.0.0 700 #recursive-descent #json-parser #parser #json
wildcard

Wildcard matching

v0.3.0 63K no-std #wildcard #matching #no-std #text-processing
netidx

Secure, fast, pub/sub messaging

v0.27.3 #pub-sub #kerberos #networking #distributed
stop-words

Common stop words in many languages

v0.8.1 7.2K #nlp #stop-words #localization #word
stringcase

Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on

v0.4.0 30K #camel-case #snake-case #kebab-case #pascal-case #kebab
difflib

Port of Python's difflib library to Rust

v0.4.0 2.9M #difflib #unified-diff #diff #text #differs
dmos-cli

Djot HTML renderer with advanced features - CLI

v0.5.1 390 app #syntax-highlighting #djot #cli #file #stdin #highlighting #anchor #syntax #emoji
languagetool-rust

LanguageTool API bindings in Rust

v2.1.5 160 bin+lib #language-tool #rust #client-server #language #docker #api-wrapper
html2md

binary to convert simple html documents into markdown

v0.2.15 9.6K bin+lib #markdown-converter #html-markdown-converter #render-markdown #html #list #underline
xi-unicode

Unicode utilities useful for text editing, including a line breaking iterator

v0.3.0 128K #unicode #utf-8 #xi-unicode
unicode-id

Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31

v0.3.5 953K no-std #unicode-id #unicode #tr31 #unicode-text #text
schemat

A code formatter for Scheme, Lisp, and any S-expressions

v0.3.2 170 nightly app #scheme #s-expr #format
uncased

Case-preserving, ASCII case-insensitive, no_std string types

v0.9.10 1.0M no-std #case-insensitive #ascii #case-preserving #no-std
bundle_repo

Pack a local or remote Git Repository to XML for LLM Consumption

v0.6.0 440 app #artificial-intelligence #git #tokenize #llm #cli
rumdl

A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)

v0.0.36 2.1K bin+lib #linter #documentation #markdown #markdown-linter #issue #error
tossicat

Library that converts a given tossi (Korean postpositional particle, josa) into the form appropriate for the word it is attached to

v0.6.1 750 #hangul #hangeul #library #라이브러리 #함수
mdbook-yapp

A mdBook preprocessor for simple text replacements

v1.1.3 750 app #mdbook-preprocessor #mdbook #mdbook-pre-processor #replace #text-replacement #pattern #config #text
slice-command

slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation

v0.4.2 550 app #slice #command-line-tool #slice-command #text
wezterm-bidi

The Unicode Bidi Algorithm (UBA)

v0.2.3 59K #terminal #bidi #uba #terminal-emulator #status
vaporetto

pointwise prediction based tokenizer

v0.6.5 1.4K no-std #japanese #tokenize #morphological-analysis #morphological
file-organiser

Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range

v0.1.8 420 app #file-extension #directory #file-organiser
stylin

Convert markdown to pandoc markdown with custom styles

v0.9.3 bin+lib #name #style #stylin #paragraph #pipeline #block #caption #spans #title #webp
stam

powerful library for dealing with stand-off annotations on text. This is the Rust library.

v0.16.5 #annotations #nlp #linguistics #standoff #text-processing #annotation
cow-utils

Copy-on-write string utilities for Rust

v0.1.3 110K no-std #copy-on-write #text #cow-utils #string #cow
any_ascii

Unicode to ASCII transliteration

v0.3.2 158K no-std bin+lib #transliteration #emoji #ascii #unidecode #unicode
skyspell

Fast and handy spell checker for the command line

v4.0.0 1.0K bin+lib #spell-check #dictionary #interface #identifier #action #list
lindera-tantivy

Lindera Tokenizer for Tantivy

v0.41.0 4.4K #tantivy #tokenize #lindera #tokenizer
sliceslice

A fast implementation of single-pattern substring search using SIMD acceleration

v0.4.3 36K #simd #text-search #substring-search #string-search #search #single #text #string
mdbook-typst

An mdBook backend to output Typst markup, pdf, png, or svg

v0.1.7 app #typst #mdbook #svg #config
newdoc

Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation

v2.18.3 2.9K bin+lib #documentation-generator #asciidoc #redhat #documentation #documentation-tool
zawk

An efficient Awk-like language implemented in Rust, with a standard library

v0.5.25 app #awk #csv-tsv #stdlib #etl #csv #tsv
diff-match-patch-rs

The fastest implementation of Myers' diff algorithm to perform the operations required for synchronizing plain text

v0.4.1 3.7K #diff-match-patch #patch #diff #text-synchronization #match
nom-unicode

Unicode extensions for Nom

v0.4.0 13K no-std #nom #nom-unicode #unicode #content #alpha0
gemini-map

A command-line tool to run files in parallel through Google Gemini

v0.1.2 300 app #gemini-map #gemini #pdf #split-pdf #html
charset

Character encoding decoding for email

v0.1.5 219K #charset #utf-7 #email #unicode
hyphenation

Knuth-Liang hyphenation for a variety of languages

v0.8.4 12K #typesetting #hyphenation #language #unicode-segmentation #text
tkrar

Count frequency of words in a file or a directory

v0.3.0 340 app #cli #directory #word-count #word #format #character #stdin #stop-words #pattern #count
vesti

A preprocessor that compiles into LaTeX

v0.15.0 2.8K app #latex #transpiler #document #coprime #file
mdbook-catppuccin

🎊 Soothing pastel theme for mdBook

v3.0.0 app #mdbook #plugin #pre-processor #markdown #catppuccin
collclean

Clean up collaboration commands in LaTeX files

v0.4.2 340 app #collclean #bob #run #accusam #amet
tiefdownconverter

A CLI tool to manage and convert Markdown-based projects

v0.7.0 290 app #pandoc #document-conversion #markdown
pad

padding strings at runtime

v0.1.6 134K #run-time #pad #character #alignment #pad-str
tau-engine

A document tagging library

v1.14.1 480 #rule-engine #tau #tags #rules #search #identifier
quixote

Quizzes and tests in Markdown

v0.6.4 bin+lib #quiz #markdown #quixote #zes
near-facsimile

Find similar or identical text files in a directory

v1.0.8 bin+lib #duplicates #compare #similarity #similar
hypher

separates words into syllables

v0.1.5 16K no-std #syllable #hyphenation #language
nu_plugin_regex

nu plugin to search text with regex

v0.12.0 170 app #regex #plugin #nu-plugin-regex #groups #a-c
trans-epub

Translate EPUB with CLI

v0.0.14 130 app #epub #trans-epub #epub-translator
unicode-reverse

Unicode-aware in-place string reversal

v1.0.9 98K no-std #grapheme-cluster #reverse #unicode #grapheme #no-std #string
unicode-bidi-mirroring

Unicode Bidi Mirroring property detection

v0.4.0 203K #unicode #mirroring #detect #detection
presenterm

A terminal slideshow presentation tool

v0.12.0 470 app #presentation #terminal #slide #slideshow #markdown #markdown-slides #tool
topiary-cli

CLI app for Topiary, the universal code formatter

v0.6.0 300 app #code-formatter #tree-sitter #text #formatter #cli
chewing

(酷音) intelligent Zhuyin input method

v0.9.1 #bopomofo #chewing #pinyin #layout #im
norad

Read and write Unified Font Object files

v0.15.0 650 #ufo #font #font-glyph #graphics
allms

One Library to rule them aLLMs

v0.17.1 150 #openai #anthropic #mistral #assistant #api-bindings #gemini
sile

Simon’s Improved Layout Engine

v0.15.12 410 bin+lib #typesetting #tex #pdf-generation #publish #typesetting-system #engine #lua
sigrs

Interactive grep (for streaming)

v0.1.4 340 app #grep #streaming #tui #sig #interactive #keymap
uwc

Counts things in unicode text files

v1.0.8 app #word-count #unicode #wc #unicode-word-count #testing-fixtures #cluster
idna

IDNA (Internationalizing Domain Names in Applications) and Punycode

v1.0.3 17.6M no-std #idna #http #web #no-std
rsrpp-cli

project for research paper pdf

v1.0.12 490 app #rsrpp #rsrpp-cli #pdf
nmd

Official NMD CLI and compiler

v1.4.3 1.2K app #markdown #compiler #nmd #markdown-rendering #document #markdown-parser #markdown-editor #markdown-viewer #markdown-converter #dossier
graphannis

new backend implementation of the ANNIS linguistic search and visualization system

v3.7.1 550 #graph-annis #search-engine #linguistic-analysis
igrepper

The interactive grepper

v1.3.5 370 app #editor #grepper #igrepper
mdbook-epub

An EPUB renderer for mdbook

v0.4.44 180 bin+lib #mdbook #epub #markdown #documentation
wchar

Procedural macros for compile time UTF-16 and UTF-32 wide strings

v0.11.1 10K #utf-16 #wide-string #utf-32 #wchar-t #unicode
regex-literal

delimited regular expression literals

v1.3.2 500 #regex #literals #serialization #delimiter
sd

An intuitive find & replace CLI

v1.0.0 15K app #find-replace #regex #sed
htmd-cli

The command line tool for htmd

v0.3.4 470 app #markdown-converter #html-markdown-converter #html-converter #render-markdown #html
mdbook-environment

A preprocessor for MdBook for working with environment variables

v0.0.4 300 app #mdbook #environment #pre-processor #environments
molybdenum

Recursive search and replace CLI application

v0.1.10 550 bin+lib #applications #search-pattern #molybdenum #folder
boreal

evaluate YARA rules, used to scan bytes for textual and binary patterns

v0.9.0 270 #string-matching #scan #rules #yara #pattern-matching #yara-parser #reliability #module #default
line-ending

Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.

v1.5.1 6.8K #line-ending #ending #platform
matcher_rs

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust

v0.5.7 100 #multi #nlp #string-matching #string-search #search-pattern #text-search #text #config
asciidork-cli

Asciidork CLI

v0.19.0 310 app #asciidork #asciidork-cli #image #cli
promptify

A plaintext directory formatting tool for interacting with LLMs on the command line

v0.1.6 290 app #llm #promptify #glob
probly-search

A lightweight full-text search engine with a fully customizable scoring function

v2.0.1 1.0K #search-index #search-query #bm25 #document #query
unicode-joining-type

Fast lookup of the Unicode Joining Type and Joining Group properties

v1.0.0 59K no-std #unicode #joining #shaping #no-std #arabic
moonwave

generating documentation from comments in Lua source code

v1.3.0 bin+lib #documentation #moonwave #name #json #visually #docs
smeagol-wiki

A personal wiki webserver. Work in progress.

v0.4.10 850 app #wiki #page #store #gollum
mdcat

cat for markdown: Show markdown documents in terminals

v2.7.1 500 bin+lib #markdown #terminal #cat #less
mdbook-embedify

mdbook preprocessor plugin that allows you to embed apps in your book, such as YouTube, CodePen, and others

v0.2.12-rc.3 270 app #mdbook-plugins #mdbook #plugin #embed #pre-processor #app #mdbook-embed-plugin
mdbook-pandoc

A pandoc-powered mdbook backend

v0.10.1 1.3K bin+lib #pandoc #mdbook #latex #pdf #book #back-end #config
tree-sitter-stack-graphs-typescript

Stack graphs definition for TypeScript & TSX using tree-sitter-typescript

v0.4.0 bin+lib #stack-graphs #tree-sitter #typescript #tsx
mdbook-combiner

combine mdbook summaries from multiple sources into one mdbook

v0.1.17 110 app #mdbook #book #markdown #combiner
string-offsets

Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines

v0.2.0 3.5K #character #utf-16 #line #position #unicode
etradeTaxReturnHelper

Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and computes total income and total tax paid according to the chosen tax residency (currency)

v0.7.4 2.8K bin+lib #document #revolut #etrade #documents #currency
arf-strings

Encoding and decoding for ARF strings

v0.7.3 230 #string #arf #arf-strings #nul-escaped-portion
nvl-cli

A program to download webnovels

v0.1.4 app #webnovels #fiction #nvl-cli
reword

some utility functions for human-readable formatting of words

v7.0.1 4.8K #reword #rogstadkjærnet #olsson
tbll

tbll outputs data in tabular format

v1.1.0 app #table #tbll #cli #format #overview #eof
fm

Non-backtracking fuzzy text matcher

v0.4.0 18K #regex #matcher #fm #matching
minimizer

Minimize files to find minimal test case

v2.0.3 1.1K app #minimizer #case #strategies
uuid25

25-digit case-insensitive UUID encoding

v0.3.5 210 no-std #string-representation #uuid25 #encoding
document_tree

reStructuredText’s DocumentTree representation

v0.4.2 4.8K #document-tree #representation #tree #image #testing
asciisavers

A small collection of ascii screensavers

v0.3.8 app #screen-savers #ball #dvd #toasters
mdbook-quiz

Interactive quizzes for your mdBook

v0.3.10 app #mdbook #markdown #learning #toml #quiz
ricat

A Rust-based implementation of the classic UNIX cat command

v0.4.5 1.4K app #cat #command-line-tool #system-tools #text-processing #file
wrapr

wrap your code for ai

v0.1.8 380 app #tui #artificial-intelligence #clipboard #file-utility #file
lindera-ko-dic-builder

A Korean morphological dictionary builder for ko-dic

v0.32.3 16K #dictionary #builder #korean #ko-dic #morphological
obsidian-export

associated CLI program to export an Obsidian vault to regular Markdown

v25.3.0 270 bin+lib #markdown #obsidian #export #exporter #obsidian-vault #front-matter #syntax
heatseeker

A fast, robust, and portable fuzzy finder

v1.7.2 app #heatseeker #interrupt #power-shell
latex-thebib

Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the refactor sub-command. Compile BibTeX files to legacy thebibliography TeX code using the compile sub-command…

v0.3.4 360 app #bibliography #latex #thebibliography #2022
fiat-lux

Offline terminal-accessible Bible

v0.3.9 600 app #bible #fiat-lux #dat #database #resources #com
indefinite

Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel

v0.1.9 5.5K #article #noun #grammar #an #a
freetype-rs

Bindings for FreeType font library

v0.38.0 32K #font-glyph #font #glyph
bashdoc

generating documentation/help menu for user defined bash functions

v0.6.0 1.0K app #documentation #bash #output #delimiter #color #void #zshrc #command-line-tool #docs #below
yake-rust

Yake (Yet Another Keyword Extractor) in Rust

v1.0.3 #nlp #extractor #keyword #sentence #terms #frequently #stop-words
yara-x-parser

A parsing library for YARA rules

v0.14.0 3.3K #yara #parser #yara-x-parser #yara-x
wordcut-engine

Word segmentation/breaking library

v1.1.9 #nlp #library #split #engine #wordcut #path #load-dict #algorithm #text #cluster
unicode_titlecase

add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings

v2.4.0 330 #title-case #unicode #locale #char #casing #to-titlecase #string
scru64

Sortable, Clock-based, Realm-specifically Unique identifier

v2.0.1 no-std #identifier #scru64 #text #bit #integer #base36 #4261
simple-string-patterns

Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library

v0.3.17 #string-matching #expression #character #set #string #bounds-builder #enums #floats
utf16_iter

Iterator by char over potentially-invalid UTF-16 in &[u16]

v1.0.5 10.8M #utf-16 #unicode #iterator
whitespace-sifter

Sift duplicate whitespaces away!

v2.3.4 500 #white-space #duplicates #string #sifter #text
mdbook-graphviz

mdbook preprocessor to add graphviz support

v0.2.1 850 app #mdbook #graphviz #svg #file
iepub

Reading and writing EPUB and MOBI e-books

v0.8.2 460 #ebook #epub #mobi #azw
levenshtein_automata

Creates Levenshtein Automata in an efficient manner

v0.2.1 433K #levenshtein-automata #automata #levenshtein #levenshtein-distance #fuzzy
line-numbers

Find line numbers in strings by byte offsets, quickly

v0.4.0 1.3K #line-numbers #numbers #line #quickly
byteyarn

hyper-compact strings

v0.5.1 10K #string #binary-string #binary #text
qpdf

Rust bindings to QPDF C++ library

v0.3.4 800 #pdf #qpdf #object #x86-64 #arm64
rustc-literal-escaper

code to unescape string literals

v0.0.2 41K #rustc-literal-escaper #literals #unicode
fuzzy-muff

Fuzzy Matching Library

v0.4.8 190 #fuzzy-search #text-search #fuzzy-matching #match #search #text
spellbook

A spellchecking library compatible with Hunspell dictionaries

v0.3.2 1.2K no-std #spell-check #dictionary #no-std #language #hunspell #nuspell #spell-checking #suggestions #spellcheck #practice
svgdx-pandoc

pandoc filter for svgdx codeblocks in Markdown

v0.4.0 400 app #svg #pandoc #diagram #svgdx
repgrep

An interactive command line replacer for ripgrep

v0.16.0 app #find-replace #ripgrep #grep #regex #utf-8
mdbook-mermaid

mdbook preprocessor to add mermaid support

v0.15.0 7.5K bin+lib #mdbook #mdbook-mermaid #mdbook-plugins #mermaid
aneubeck-daachorse

Daachorse: Double-Array Aho-Corasick

v1.1.1 4.5K no-std #aho-corasick #text-search #search #double-array #multi #pattern-matching #text #arxiv
roman-numerals-rs

Manipulate well-formed Roman numerals

v3.1.0 9.6K no-std #roman-numeral #numeral #roman-numerals #roman
aki-mcolor

mark up text with color

v0.1.32 1.3K bin+lib #ansi #filter #aki-mcolor #text #color
rhai-autodocs

Custom documentation generator for the Rhai scripting language

v0.8.0 #documentation #scripting-language #scripting-engine #arg
mdfried

A markdown viewer for the terminal that renders images and big headers

v0.10.0 1.4K app #header #mdfried #markdown
epub

support the reading of epub files

v2.1.2 1.2K #ebook #epub #archive #epub-doc #cover
mdbook-tailor

mdbook preprocessor for image-tailor

v0.8.2 1.6K bin+lib #mdbook #mdbook-plugins #image #tailor #webp
zhconv

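Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata. Converts Simplified, Traditional, and regional (Mainland, Taiwan, Singapore, and Malaysia) Chinese terms, based on MediaWiki and OpenCC word conversion…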

v0.3.1 120 #chinese #mediawiki #localization #conversion #open-cc #variant #ruleset
unicode-truncate

Unicode-aware algorithm to pad or truncate str in terms of displayed width

v2.0.0 403K no-std #unicode #truncate #pad #unicode-text #unicode-width #text #width
affinidi-messaging-text-client

Affinidi Messaging SDK

v0.10.3 600 app #affinidi #ssi #client
instant-segment

Fast English word segmentation

v0.11.1 #nlp #segmentation #segmenter
character_converter

Turn Traditional Chinese script to Simplified Chinese script and vice-versa and tokenize

v2.1.5 1.0K #chinese #hanzi #localization #traditional #simplified #convert
vmks-exam-generator

CLI program for pseudo-randomly generating different variants of an embedded programming exam

v1.3.1 bin+lib #generator #exam #question-bank #bank #questions #group #segment
frizbee

Fast fuzzy matching via SIMD smith waterman, similar algorithm to FZF/FZY

v0.3.0 4.5K nightly #haystack #fzf-fzy #frizbee #algorithm
sgrep

grep util for those too lazy to remember many command line options

v1.0.5 250 app #sgrep #search #directory #insensitive #bash
snakit

Command-line tool that recursively renames all files and folders within a specified directory to snake_case

v0.1.1 app #snake-case #snakit #verbose
clipcount

Counting words from the clipboard content

v1.0.7 270 app #word-count #clipboard #text-clipboard #words #count
textwrap-cli

Command line interface for textwrap

v0.2.3 app #textwrap #cli #textwrap-cli #input
lexicmp

comparing and sorting strings lexicographically and naturally

v0.2.0 19K #emoji #transliteration #unicode #lexicographical #sorting #iterator
lipsum

lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…

v0.9.1 34K #markov-chain #typography #text #random
adrs

Architectural Decision Record command line tool

v0.3.0 120 app #record #adrs #adr
asimov-dataset-cli

ASIMOV Dataset Command-Line Interface (CLI)

v25.0.0-dev.3 440 bin+lib #artificial-intelligence #cli #asimov
grok

popular java & ruby grok library which allows easy text and log file processing with composable patterns

v2.0.0 23K #grok #processing #information #alias
pathmut

Command line utility for manipulating path strings

v0.7.0 130 nightly bin+lib #file-extension #file-path #component #string #utility #prefix #stem #name #default #extension
inkjet

A batteries-included syntax highlighting library for Rust, based on tree-sitter

v0.11.1 650 #syntax-highlighting #tree-sitter #highlight
substudy

Language-learning tools for working with parallel, bilingual subtitles and media files

v0.5.2 550 bin+lib #srt #substudy #text #progress
mdbook-theme

A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the official ace editor

v0.1.6 bin+lib #themes #mdbook #rust-book #ace #markdown #book #pre-processor
say-rust

command-line tool which is an alternative to echo

v1.0.1 240 app #say #say-rust #text
htop

HTML to PDF converter

v0.2.0 app #headless-chrome #html-converter #pdf #html #converter
rst

a reStructuredText parser and renderer for the command line

v0.4.2 100 app #restructuredtext #renderer #file-format #line
charasay

The future of cowsay 🐮! Colorful characters saying something 🗨️

v3.3.0 550 bin+lib #ansi-term #cowsay #ansi-escapes #character #print #terminal #ansi-art
mdbook-alerts

mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book

v0.7.0 2.6K bin+lib #mdbook-preprocessor #mdbook #alert #mdbook-pre-processor #markdown #github
precis-tools

Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)

v0.1.9 14K #internationalization #precis #compare #enforcement #preparation #comparison
kelp

A convert tool for Japanese

v0.6.0 800 bin+lib #full-width #half-width #katakana #hiragana #cli #convert
vidyut-lipi

A Sanskrit transliterator

v0.2.0 170 bin+lib #sanskrit #transliterator #scheme #nlp
tabprinter

creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.

v0.2.1 130 #formatting #printing #style #table #output #alignment #processing #amiga #cell #color
uncomment

A cli tool to remove comments from code. Supports multiple languages.

v1.0.4 200 app #cli #pre-commit #comments #python #file
regexml

XPath compatible regex engine

v0.2.1 600 #xpath #xml-schema #engine #regex #xml
yeslogic-ucd-generate

A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables

v0.7.0 app #unicode-characters #fst #unicode #character #generate #table
trprvr

TRanslate PRogress VieweR

v0.2.0 260 app #viewe-r #trprvr
mut-str

A toolkit for working with mutable string slices (&mut str)

v1.1.0-alpha.2 500 no-std #slice #string #mutability #str-ext #index
hauchiwa

Incredibly flexible static site generator library with incremental rebuilds and cached image optimization

v0.4.0 260 #ssg #optimization #style-sheet #rebuild #system
crankshaft-config

Configuration facilities for Crankshaft

v0.1.0 3.9K #crankshaft #crankshaft-config #task #bioinformatics
nanohtml2text

A zero-dependency library to convert HTML to plain text

v0.2.1 1.4K bin+lib #text-html #text #html #string #html-text #parser
stfu8

Sorta Text Format in UTF-8

v0.2.7 73K #unicode #utf-8 #repr #invalid #unicode-text #binary #text
unicode-security

Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules

v0.1.2 213K #security #unicode #unicode-text #mechanism #text
addbib

An app to add linked bibliographies to markdown files

v0.1.0 150 app #bibliography #citation #markdown #automation #documentation
text2num

Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation

v2.6.0 #nlp #words-to-numbers #ordinal #text2digits
thoth-note

note-taking app written in Rust

v0.1.1 app #note-taking #tui #markdown #rust #note
fuzzt

Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.

v0.3.1 9.0K #levenshtein #string-similarity #jaro-winkler #hamming #jaro
plsfix

Text cleaner upper

v0.1.8 #upper #plsfix #print
codetypo-vars

Source Code Spelling Correction

v0.9.1 130 #spelling #codetypo #variables #pr #correction #spell-check #development #development-tools
rapidfuzz

rapid fuzzy string matching library

v0.5.0 14K #levenshtein #string-similarity #levenshtein-distance #hamming #jaro #similarity
sapling-streampager

streampager is a pager for command output or large files

v0.11.0 17K #pager #less #more #sapling
case_insensitive_hashmap

A HashMap that uses case-insensitive strings as keys

v1.0.1 1.6K #hash-map #case-folding #case-insensitive #unicase
madato

command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown

v0.7.0 5.0K bin+lib #markdown #yaml #excel #csv
llguidance

Super-fast Structured Outputs

v0.7.15 1.6K #output #llguidance #grammar #outputs
rust-persian-tools

Official Rust implementation of Persian Tools

v1.1.4 #persian #localization #farsi #iran #tool #text-processing
asciimath-unicode

Convert asciimath to unicode

v0.1.4 300 bin+lib #unicode #asciimath #asciimath-unicode #binary
jx

An interactive JSON explorer for the command line

v0.5.0 app #interactive #explorer #interactive-cli #json
vader-sentimental

A faster Rust version of the original Python VaderSentiment analysis tool

v0.1.2 bin+lib #sentiment-analysis #nlp #vader-sentimental #text-analysis #lol #content #analyse
jetscii

A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes

v0.5.3 48K #ascii #string-search #byte #simd #ascii-text #search #ascii-string #string #character
colornames

An enum of color names, with a catchall RGB variant

v0.0.6 #color #variant #colornames #insensitive #value
ipset_lookup

ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…

v0.4.8 480 bin+lib #threat-intel #blocklists #threat-feed #cti
bogrep

Full-text search for bookmarks from multiple browsers

v0.10.1 bin+lib #full-text-search #bookmarks #grep #browser #source #cli
herring-automata

Automata construction for Herring

v0.1.3 #dfa-automata #nfa-automata #herring #dfa #automata
mdmodels

generate models, code and schemas from markdown files

v0.2.3 150 bin+lib #markdown #python #define #model #typescript #codegen #golang
hyphertool

Hyphertool is a command-line tool for syllabification and hyphenisation

v0.3.0 170 app #nlp #hyphenation #syllabification #dehyphenation #ver-wer-ken
galm

pattern matching library

v0.3.1 140 #sorting #kanji #matching #cli #start
quranize

Encoding transliterations into Quran forms

v1.0.0 2.0K #quran #quranize #suffix-tree #text
swift-check

High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)

v0.2.1 600 no-std #search #simd #validation #no-alloc
typst-ansi-hl

highlights your Typst code using ANSI escape sequences

v0.4.0 1.9K #typst #ansi #discord #syntax
patchkit

parsing and manipulating patch files

v0.2.1 5.9K #patch #patch-file #patchkit #hunk-line #parse-patch
tesseract-rs

Rust bindings for Tesseract OCR with optional built-in compilation

v0.1.19 130 sys #ocr #computer-vision #text-recognition #compilation #build
COXave

Instruments for codings

v1.0.8 1.4K #utf-16 #utf-8 #utf-32 #ascii #encoding
rustdoc-stripper

manipulate rustdoc comments

v0.1.19 500 bin+lib #documentation #rustdoc #comments #strip #tool #docs
picodiff

Tiny GUI app to compare text easily

v0.9.4 250 app #diff #productivity #text #compare
zet

zet finds the union, intersection, set difference, etc of files considered as sets of lines

v2.0.1 300 bin+lib #set-operations #uniq #zet #intersection #set #union
rust_string_utils

String utilities for rust based on org.apache.commons.lang3

v0.1.20 1.4K #ignore-case #byte-array #delimiter #lang3 #status #utilities
uast

Unicode Aware Saṃskṛta Transliteration in Rust 🦀

v6.0.1 2.1K bin+lib #transliteration #iast #uast
ripsecrets

A command-line tool to prevent committing secret keys into your source code

v0.1.9 150 bin+lib #security #secret #ripsecrets #pre-commit #search
tantivy-common

common traits and utility functions used by multiple tantivy subcrates

v0.9.0 389K #search-engine #tantivy #search-indexing #tokenize #sub-crate #document #tokenizer
rzozowski

A regex crate using Brzozowski derivatives

v0.1.1 100 #derivative #regex #brzozowski #star #class
autumnus

Syntax highlighter powered by Tree-sitter and Neovim themes

v0.2.0 370 bin+lib #syntax-highlighting #tree-sitter #highlighter-coloring #style-sheet #syntax-coloring #highlighter
babel

Provide Rust enums for Groq, SambaNova, Openrouter's llm model names

v0.0.10 700 #babel #model #provider #response #call #string #interface #conversation
unicode-properties

Query character Unicode properties according to UAX #44 and UTR #51

v0.1.3 3.0M no-std #unicode-properties #unicode #unicode-text #no-alloc #emoji #unicode-emoji #text #general-category #unicode-general-category
mdbook-d2

D2 diagram generator plugin for MdBook

v0.3.3 190 bin+lib #mdbook #markdown #d2 #common-mark
word-tally

Output a tally of the number of times unique words appear in source input

v0.19.0 480 bin+lib #word-count #tally #word-tally #cli #word #words #count
unic-ucd-ident

UNIC — Unicode Character Database — Identifier Properties

v0.9.0 357K #internationalization #character-property #unic #unicode #unicode-text #unicode-characters #locale-data #text
mandown

Markdown to groff (man page) converter

v1.1.0 2.5K bin+lib #manpage #troff #markdown #roff
mdbook-cmdrun

mdbook preprocessor to run arbitrary commands

v0.7.1 700 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #runcmd #command #cmdrun
vlazba

Lojban words generator and analyzer

v0.7.14 500 bin+lib #nlp #lojban #conlang #analyzer #algorithm #jvozba #weighting #generator
text-to-ascii-art

program to convert text to ASCII art

v0.1.10 310 bin+lib #ascii-art #art #string #text-ascii-art
mdlib

A beautiful markdown note-taking application

v0.1.1 app #notes #web-apps #markdown #knowledge-base #wiki
rustkorean

processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…

v1.1.2 #korean #character #hangul #jongseong
harfbuzz_rs

A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust

v2.0.1 750 #harfbuzz #text-layout #shaping #ffi #textlayout
furigana

Map furigana to a word given its reading

v0.1.10 240 #japanese #furigana #物ものの
gh-emoji

Convert :emoji: to Unicode using GitHub’s emoji names

v1.0.8 5.9K #emoji #unicode #github #markdown #convert
hashmoji_generator

Code generation tool for hashmoji

v0.1.2 190 bin+lib #generator #hashmoji #version
dprint-plugin-markdown

Markdown formatter for dprint

v0.18.0 4.3K #dprint-plugin #dprint #dprint-plugin-markdown
mfmt

Meta formatter library

v0.3.10 400 nightly #formatter #language #mfmt #line #format
emojic

Emoji constants

v0.4.1 950 no-std #gender #pair #tone #emoji #unicode
pretty-xmlish

Pretty print XML-ish data with unicode art

v0.1.13 3.6K #sql #printing #art #data
oxford_join

Join string slices with Oxford Commas!

v0.5.0 160 no-std #list #join #grammar #comma #string
autotex

Continuously compile TeX and LaTeX

v1.4.1 390 app #latex #tex-engine #autotex #pdf #manual
kas-text

Text layout and font management

v0.7.0 150 #bidi #text-rendering #shaping #management #harfbuzz #glyph #emoticon #correctly #processing #navigation
see-cat

A cute cat(1)

v0.8.1 500 app #syntax-highlighting #cat #viewer #terminal #markdown
figlet-comment

quickly create banner to use as comments

v0.4.0 380 app #comments #figlet #figlet-comment #stdout #clipboard
ast-grep-language

Search and Rewrite code at large scale using precise AST pattern

v0.37.0 6.5K #search-pattern #codemod #rewrite #ast #pattern #search
capitalize

Change first character to upper case and the rest to lower case, and other common alternatives

v0.3.4 850 #title-case #capitalize #change #string #alternative
textalyzer

Analyze key metrics like number of words, readability, and complexity of any kind of text

v0.3.0 bin+lib #nlp #text #analysis #complexity #duplications
the_rock

A command line King James bible viewer

v0.9.2 app #viewer #bible #rock
sublime_fuzzy

Fuzzy matching algorithm based on Sublime Text's string search

v0.7.0 64K bin+lib #fuzzy-search #match #search #text-search #text
rustic_print

A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.

v0.2.1 260 #text-styling #formatted-output #cli-enhancement #rust-library #console-printing #table
stam-tools

Command-line tools for working with stand-off annotations on text (STAM)

v0.9.2 bin+lib #nlp #annotations #linguistics #standoff #alignment #text-processing #annotation
nlpo3

Thai natural language processing library, with Python and Node bindings

v1.4.0 420 #nlp #tokenize #thai #word-segmentation #tokenizer
ticker-sniffer

extracting multiple stock ticker symbols from a text document

v0.1.0-alpha9 bin+lib #ticker #extract #sniffer #progress
quagga

CLI tool that combines multiple text files into a single prompt suitable for Large Language Models

v0.1.3 130 bin+lib #llm #text #cli #directory #txt #node-modules #size #part #quagga-ignore #clipboard
mdbook-angular

mdbook renderer to run angular code samples

v0.4.0 900 bin+lib #mdbook #angular #sample #ts #action #block #flags
what-rs

Identify what something is! A pyWhat reimplementation in Rust

v0.4.1 290 app #regex #nlp #identifier
misanthropy

An interface to the Anthropic API

v0.0.7 120
libharu_ng

Easily generate PDFs from your Rust app

v1.0.10 sys #pdf #libharu #api-bindings #generator #haru
creature_feature

Composable n-gram combinators that are ergonomic and bare-metal fast

v0.1.7 bin+lib #nlp #ngrams #min-hash #ml #performance #featurization #book #hash #hashed-a
unitil

A tool that converts EUC-JP full-width tildes to wave dashes

v0.2.1 app #unitil #tool #に変換する #euc #jpの全角チル #を波ダッシュ #全角チルダを #txt #変換の
pulldown-cmark-toc

Generate a table of contents from a Markdown document

v0.7.0 #markdown #toc #pulldown-cmark #common-mark #github
frawk

an efficient Awk-like language

v0.4.8 app #awk #csv-tsv #csv #tsv
mdbook-codeblocks

A mdbook preprocessor to prepend customizable vignette to code blocks

v0.1.21 140 app #mdbook-preprocessor #mdbook #mdbook-pre-processor #code-block
timug

It was created for personal blog creation. Timug has its limits, but it fulfills the purposes for which it was created.

v0.1.3 110 app #blog #static-site-generator #markdown #static-page #page-generator #generator
dcsv

Dynamic CSV reader, writer, editor

v0.3.4-beta.2 170 #csv #editor #value #text-processing #cli #writer
pager

pipe your output through an external pager

v0.16.1 10K #pager #less #more #friends
clima

A minimal Markdown reader in the terminal

v1.1.1 800 app #markdown #terminal #md #termimad #skin
mdka

HTML to Markdown converter

v1.4.6 1.4K bin+lib #markdown-parser #convert-html #render-markdown #html
unidown

Convert Markdown to Unicode

v0.8.2 bin+lib #unicode #style #unidown #table #heading
mdbook-private

An mdbook preprocessor that controls visibility of private chapters and sections within them

v0.2.3 bin+lib #mdbook #private #section
float-pretty-print

Format f64 for showing to user, not for serialisation

v0.1.1 623K #pretty-print #human-readable #float #string-representation #serialization #format
in_definite

Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.

v1.0.0 4.6K #nlp #english #grammar #text
hlight

dedicated to delivering exceptional syntax highlighting capabilities

v0.0.10 310 #syntax-highlighting #syntax-set #hlight #highlighting #theme-set #syntax #file #start
fastn-jdebug

fastn: Full-stack Web Development Made Easy

v0.1.1 #fastn-jdebug #static-site-generator #fastn #language #json #markdown #component #ftd #io-fastn #ft-code-repo
four-char-code

A string of 4 ascii chars represented by an u32

v2.3.0 4.1K no-std #u32 #four-char-code #four #widipedia #no-std
ib-pinyin

A high-performance pinyin matching library

v0.2.5 #pinyin #cjk #unicode #chinese #py #个高性能拼 #匹配库
sk-skimmer

Fuzzy Finder in rust!

v0.13.6 250 bin+lib #menu #fuzzy-finder #skim #utilities #mode #fzf #sk #fuzzy
secular

No Diacr!

v1.0.1 2.2K #unicode-normalization #diacritics #secular #diacr #normalization #unicode
reggy

friendly, resumable regular expressions for text analytics

v0.0.6 220 #nlp #analytics #reggy #regex
deindent

A command line utility and Rust library to format overly-indented text

v1.0.1 250 bin+lib #indentation #formatter #deindent #indent #clipboard
seshat-unicode

A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.

v0.3.1 750 #unicode #unicode-version #seshat-unicode #properties #normalization #standard
nu-utils

Nushell utility functions

v0.103.0 16K bin+lib #nu-shell #shell #nu-utils
pukram-formatting

A type to represent the formatting of the pukram markup language

v0.2.1 140 #text-formatting #pukram #markup
mdbook_rash

Binary to create doc from rash code

v2.9.9 900 bin+lib #rash #mdbook #shell #ansible #container #docker
diagnostic

Pretty diagnostic report

v0.6.4 #diagnostics #diagnostics-report #ansi-colors
indent

Functions for indenting multiline strings

v0.1.1 119K #indentation #multi-line #string #multiline
kathoey

text feminization using open corpus linguistics data

v1.1.5 #nlp #russian #text-feminization #data
inlet_manifold

A general purpose highlighting library

v0.2.0 550 #regex #highlighting #tailspin #default
pandoc

API that wraps calls to the pandoc 2.x executable

v0.8.11 1.7K #markdown #latex #pandoc #executable #instructions
diacritics

Remove diacritics from letters, for example when standardizing input for a search

v0.2.2 800 #diacritics #text-search #normalize #search #text
mktoc

Generate Table of Contents from Markdown files

v4.0.0 bin+lib #toc #markdown #generator #command-line-tool #table-of-contents #min-depth
cskk

A kana-kanji conversion library using the SKK (Simple Kana Kanji henkan) method, intended for use through the C ABI

v3.1.4 1.0K #skk #input-methods #henkan #html #version #方式のかな
charx

A replacement for char::is_ascii*

v1.1.0 #charx #is-ascii #build
easy_reader

easily navigating forward, backward or randomly through the lines of huge files

v0.5.2 7.1K #file-reader #backward #line #reverse #random
iotext_rs

IoText data protocol

v0.5.0 220 bin+lib #data #iot #protocols #com #crc #bieli #timestamp
scrunch

full-text-searching compression

v0.8.0 bin+lib #scrunch #element #zero #status
dmos

Djot HTML renderer with advanced features

v0.5.1 400 #syntax-highlighting #djot #dmos #highlighting #syntax #anchor #emoji
mdbook-callouts

mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book

v0.2.2 bin+lib #mdbook-preprocessor #mdbook #obsidian #mdbook-pre-processor #callouts #markdown
scanix

search a text or pattern in files. A fast and lightweight text tool.

v0.5.1 800 app #scanix #config #production
unbom

Remove UTF-8 BOM from files

v0.2.2 app #unbom #utf-8 #txt
string-patterns

Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…

v0.3.9 #mode #methods #string-patterns #string #pair #enums
hi-doc-jumprope

fast rope (fancy string) library built on top of Skiplists - hi-doc fork

v1.2.0 #fork #string #hi-doc-jumprope #jump-rope
nu_plugin_emoji

a nushell plugin called emoji

v0.12.0 190 app #emoji #plugin #nu-shell #unicode
rust-ai

A collection of 3rd-party AI APIs for Rust

v0.1.22 bin+lib #openai #openai-api #azure-api #ai-api #azure #com #xxxxxxxxxxxxxxxx #sk #westus #xxxxxxxxxx
vi

An input method library for vietnamese IME

v0.7.0 #input-methods #vi #vietnamese #vietnamese-language #ime #vni
string-auto-indent

Normalizes multi-line string indentation while preserving platform-specific line endings

v0.1.2 #auto-indent #auto #indent
regex-charclass

Manipulate and convert regex character classes

v1.0.3 280 #regex #complement #difference #intersection #union
nib-cli

A CLI for yet another static site generator, Nib

v0.0.3 220 app #cli #text #nib #config
glyph_brush_layout

Text layout for ab_glyph

v0.2.4 54K #text-layout #ab-glyph #font-rendering #true-type
shell2batch

Converts simple basic shell scripts to windows batch scripts

v0.4.5 38K #shell #batch-file #batch #scripting #convert
mdbook-pikchr

A mdbook preprocessor to render pikchr code blocks as images in your book

v0.1.9 500 app #mdbook #markdown #pikchr #pic #md
ctreg

Compile-time regular expressions the way they were always meant to be

v1.0.3 #regex #ctreg #expression #greeting
agentai

designed to simplify the creation of AI agents

v0.1.3 490 #chatgpt #agent #generative-ai #gemini
px-wsdom-ts-convert

wsdom crate

v0.0.3 150 bin+lib #wsdom #convert #px-wsdom-ts-convert #demo #live-everything
async-utf8-decoder

Convert AsyncRead to incremental UTF8 string stream

v1.0.0 #async #async-stream #utf-8 #utf8-decoder
mdbook-aquascope

Interactive Aquascope editor for your mdBook

v0.3.5 bin+lib #mdbook #aquascope #interpreter
html-compare

compare html files

v0.1.4 290 #compare #mrml #mjml #extension
flowquad

that helps you build UI stuff with Macroquad

v1.1.2 290 #text-input #toggle #button #label #container #macroquad
svgbob_cli

Transform your ascii diagrams into happy little SVG

v0.7.6 1.7K app #svg #bob #ascii #convert
stroka

Small String optimization

v1.0.0-beta.5 240 no-std #string #optimization #stroka #str
charname

Incredibly simple library that just gives you the Unicode name for a character

v1.16.0 270 #charname #modification #code-point
cai

The fastest CLI tool for prompting LLMs

v0.10.0 bin+lib #artificial-intelligence #llm #openai #gpt #cli #ml
mdbook-tocjs

A mdbook preprocessor which adds extra js and css file for ToC hydration

v0.1.4 450 bin+lib #mdbook #toc #css #wing #theme-dir #save-dir
quicksilverx

easy to use grep clone

v0.1.0 130 app #quicksilverx #clone
veg

Flexible tables

v0.5.5 210 #table #veg #colored #define #anyhow
turtlefmt

Auto-formatter for RDF Turtle

v0.1.2 120 bin+lib #turtle #turtlefmt #format
mdbook-open-on-gh

mdbook preprocessor to add an open-on-github link on every page

v2.4.3 130 bin+lib #mdbook #page #mdbook-open-on-gh #open-on-gh
catalog-of-markdown

Generate the catalog of markdown file

v0.1.6 350 bin+lib #markdown #catalog #catalog-of-markdown #title
substring

method for string types

v1.4.5 122K no-std #substring #string #slice
reflexo-typst

Bridge Typst to Web Rendering, with power of typst

v0.5.5-rc7 1.6K #typst #browser #reflexo #server-side-rendering #wasm #ts
mdbook-nice

A mdbook plugin to add nice css to your book

v0.1.0 app #mdbook-nice #nice #book
gen-mdbook-summary

generate SUMMARY.md for mdbook project

v0.0.5 160 app #mdbook #summary #md #file #ignore #tool
regexnight

Command-line tool to print syntax-highlighted versions of regular expressions and spot errors

v0.2.0 140 bin+lib #error #regex #regexnight #right #light
v_escape

The simd optimized escaping code

v0.18.0 18K #simd #escaping #html-escape
krafna

terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax

v0.5.6 bin+lib #markdown #obsidian #sql #json #query #execute #field
armnod

random string generator

v0.10.0 bin+lib #random #random-string #armnod
fimdoc

Friendship is Magic Document, converts Markdown into FIMFiction BBCode

v0.6.1 1.5K bin+lib #fimdoc #bbcode #fanfiction #fiction #mlp #pony #document
mdbook-linkcheck2

A backend for mdbook which will check your links for you

v0.9.1 bin+lib #mdbook #mdbook-linkcheck2 #http-header #link-check2
ncase

Enforce a case style

v0.3.2 490 bin+lib #style #convert-text #pascal-case #case #convert #word
rake

Rapid Automatic Keyword Extraction (RAKE) algorithm

v0.3.6 #algorithm #keyword #rake #text-processing
mnm

Mnemonic sentences for BitTorrent info-hashes

v1.0.1 app #info-hashes #mnm #mnemonic #word #rationale #glick #complaisantly #definissent #tuilleries #pilotin
mdbook-chess

An mdbook preprocessing plugin to generate chess boards

v0.2.2 app #chess #mdbook #chess-board #markdown
ewts

Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)

v0.1.3 #converter #ewts #tibetan #localization #transliteration #lib #symbols
langram

Natural language detection library

v0.1.0 #nlp #detect #detect-language #recognise
cronus_spec

The definitions for cronus API spec

v0.4.1 500 #specification #typescript #cronus #documentation #async #clean-architecture #openapi #axum #async-trait
minspan

a package for determining the minimum span of one vector within another

v0.1.2 2.9K #another #minspan
md-tui

A terminal markdown viewer

v0.8.7 bin+lib #tui #markdown-viewer #viewer #markdown #action
extract_anchors

Utility for extracting all marked fragments from source code

v0.1.3 290 app #extract-anchors #anchor #отрывк #publish
minimo

terminal ui library combining a lot of things from here and there and making it slightly easier to play with

v0.5.42 170 #terminal #terminal-colors #printing #cli #color #terminal-color
md-ulb-pwrap

Markdown paragraph wrapper using Unicode Line Breaking Algorithm

v0.1.2 340 #markdown #unicode #md-ulb-pwrap
utf8_iter

Iterator by char over potentially-invalid UTF-8 in &[u8]

v1.0.4 11.0M #iterator #utf-8 #utf-8-encoding #unicode
n_gram

training n-gram language models

v0.1.12 700 #ngrams #language-model #lm #simple #corpus #eos
rsonpath-lib

Blazing fast JSONPath query engine powered by SIMD. Core library of rsonpath.

v0.9.4 200 #json-query #json-path #simd-json #search #simd
mdbook-tabs

mdBook plugin for rendering content in tabs

v0.2.3 1.4K bin+lib #tabs #mdbook #mdbook-tabs
utf64

encode utf-8 strings into utf-64, and decode them back

v1.0.2 200 #unicode #string #traits #unicode-text #utility #text
mdbook-hints

mdBook preprocessor to add hover hints to your book

v0.1.5 bin+lib #mdbook-preprocessor #mdbook #tooltip #hint #mdbook-pre-processor
simstring_rust

A native Rust implementation of the SimString algorithm

v0.1.2 310 #string-matching #nlp #simstring #cpmerge #unicode #algorithm #hash-db #measure #ngrams
lister-cli

Lister: Navigate Markdown Lists

v0.1.4 120 bin+lib #list #markdown #ui
puppet-fmt

Automatic code formatter for puppet manifests

v0.1.2 app #manifest #puppet-fmt #string #alignment #white-space #manifests #output
rust_file_encode_mode_convert

A Rust library for detecting the encoding of files. Supports GBK, GBK2312, UTF8, UTF16LE, UTF16BE, UTF8+BOM, UTF32, and other encodings.

v11.45.14 bin+lib #charset #unicode #convert #等多种编码格
mdbook-external-links

Open external links inside your mdBooks in a different tab

v0.1.2 app #mdbook-plugins #mdbook #link #external #tabs #mdbook-preprocessor
loki_text

advanced string manipulation with pattern searching and replacement capabilities

v0.1.4 220 #text #search #base64
hyperscan

bindings for Rust with Multiple Pattern and Streaming Scan

v0.3.2 5.3K #hyperscan #regex #streaming #scan #run-time
to_markdown_table

An easy way to format any data structure into a Markdown table

v0.1.5 9.3K #markdown-tables #table-row #markdown
rustash

CLI tool to manage your notes

v0.3.1 500 app #notes #list #rustash #index
treegrep

A pattern matcher frontend or backend which displays results in a tree

v0.1.4 270 app #regex #tree-search #grep #search-tree #search #back-end
textpod

Local, web-based notetaking app inspired by 'One Big Text File' idea

v0.1.5 app #file #textpod #attachment #markdown #copy
indent_write

Write adapters to add line indentation

v2.2.0 343K no-std #indentation #write #indent-write
jayce

tokenizer 🌌

v12.1.0 1.1K #tokenize #tokenizer #jayce #occurs #found #source #once-lock #follow
cliche

Dead simple static site generator

v1.2.0 330 app #static-site-generator #markdown #routing #style-sheet #content
libchai

Chinese character encoding optimization algorithm

v0.2.4 270 bin+lib #libchai #汉字编码输入 #汉字编码优化 #案优化算法 #的图形界面来 #赖来使用 #项目中安装为 #字自动拆分系 #后者可以通过 #以及基于退火
giff

Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…

v0.2.1 260 app #git-diff #diff #cmd #git #head
reason-shell

Reason: A Shell for Research Papers

v0.3.10 app #academic-paper #research #paper #title #command-line
eliza

natural language processing program developed by Joseph Weizenbaum in 1966

v2.0.1 bin+lib #nlp #chat-bot #linguistics #weizenbaum
doxygen-bindgen

Converts Doxygen comments into Rustdoc markdown

v0.1.3 180 #markdown #doxygen-bindgen #doxygen #build-dependencies
filename-refactor

Command to refactor file names

v0.2.1 160 app #name #translation #character #subcommand #names #f2h
encoding-next

Character encoding support for Rust

v0.3.0 1.1K #unicode #charset #character-encoding #iso-8859-1
botanical-latin

Decliner / conjugator / inflector for classical / botanical Latin

v0.0.7 170 #latin #botanical-latin #botanical
replaxe

A command-line tool to replace text in files with easy patterns

v0.1.1 app #replace #text-replacement #text #pattern #command-line
fcowsay

working with cowsay

v2.0.0 120 #cowsay #fcowsay #animalsay #animal
rustclock

a stopwatch or timer cli made in rust

v0.2.2 app #minutes #rustclock #clock #minuttes
rst_renderer

a reStructuredText renderer

v0.4.2 4.7K #renderer #restructuredtext #right #standalone
dialogi

A dialog parser

v0.3.4 230 #parser #format #dialogi #header
zp

Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program

v1.2.1 360 bin+lib #copy #cmd #copy-to-clipboard #daemon
hat-splitter

HAT splitter

v0.1.9 650 #splitter #hat #hat-splitter
wit-bindgen-markdown

Markdown generator for WIT and the component model, typically used through the wit-bindgen-cli crate

v0.41.0 490 #wasi #wit-bindgen #java #testing
dcss-api

A DCSS Webtile API for Rust

v0.2.1 #api #games #dcss-api #game #webtile
rewrite

Safely rewrite file contents from stdin, even when file is open as an input

v1.0.0 app #redirect #in-place #rewrite #sponge #command-line-tool
text-editing

string with utilities for editing

v0.2.2 #text-editing #editing #text-line
mdsh

Markdown shell pre-processor

v0.7.0 bin+lib #markdown #shell #pre-processor #extension #pre-commit-hooks #pre-processing
codetypo-dict

Source Code Spelling Correction

v0.12.7 140 #spell-check #spelling #codetypo #development-tools #monorepo #text-processing #correction
libretranslate

A wrapper for the LibreTranslate web API

v0.5.2 #translation #language #libretranslate #api
wikidot-normalize

provide Wikidot-compatible string normalization

v0.12.0 10K #wikidot #normalization #slug #normal
gulagcleaner_rs

Ad removal tool for PDFs

v0.15.6 #pdf #gulagcleaner #wuolah #studocu #stucleaner
esri_ascii_grid

reading ESRI Ascii Grid .asc files

v0.4.6 120 #grid #raster #ascii #esri #asc
aki-gsub

substitute text command, replace via regex

v0.1.38 1.3K bin+lib #aki-gsub #filter #text
strloin

copy on write slices of a string

v0.3.0 360 #copy-on-write #slice #strloin #string #cow
conststr

Constant strings

v0.3.1 450 no-std #conststr #string #from-str
ascii_help

help you quickly convert ASCII codes

v1.2.1 230 app #ascii #codes #character
ponsic-winsafe

The dependency of the ponsic crate

v1.0.0 120 #ponsic-winsafe #ponsic #winsafe #风格封装 #的一部分
vidyut-kosha

A Sanskrit key-value store

v0.2.0 160 #sanskrit #store #lexicon #nlp
runi

a CLI tool to generate unicode fonts

v0.1.4 app #font #generator #unicode #unicode-font-generator #cli #abcdef
huggingface/tokenizers-python

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

GitHub 0.21.2-dev.0 393K #tokenize #nlp #tokenizer #bpe #production #gpt #transformer #bert #tokenizers
slack-blocks-render

Slack blocks render is a Rust library to render Slack blocks as Markdown

v0.4.1 360 #slack #render-markdown #html
piet-cosmic-text

A text layout engine for piet based on cosmic-text

v0.3.4 no-std #cosmic-text #piet #piet-cosmic-text
fmtt

A diff-friendly text formatter that breaks lines at sensible punctuation and words to fit a line width

v0.8.0 110 bin+lib #formatter #fmtt #paragraph #text #body #figure #content #pdf
ADA_Standards

helps you run checks on your Ada projects; especially useful for building scripts that check conformity to coding standards

v0.2.0 170 #string-parser #analysis #code #ada #string
askalono-cli

detect the contents of license files

v0.5.0 app #open-source-licensing #licensing #askalono #text
poppler-sys-rs

Low-level (FFI) bindings for poppler-glib

v0.24.0 3.2K sys #ffi #poppler #poppler-glib
colonnade

format tabular data for display

v1.3.3 #text-alignment #alignment #table #wrap #text #justify
unicode-canonical-combining-class

Fast lookup of the Canonical Combining Class property

v1.0.0 4.3K no-std #combining #class #canonical #unicode #unicode-properties #no-std
keyphrases

Rapid Automatic Keyword Extraction (RAKE) implementation in Rust

v0.3.3 200 #nlp #extract #keyphrases #keyword #rake
bobo_html_parser

parser of html markdown

v0.1.1 bin+lib #html-parser #pest-parser #pest #bobo #tags #html #markdown #structure #rules #grammar
text_utils_s

edit arrays (for example, delete duplicates in an array) and clean strings

v0.1.5 270 #deduplicate #string #regex #unique #collection
lodestone

A website wrapper for FFXIV's lodestone

v0.5.0 380 #lodestone #ffxiv #search #datacenter #profile #api-bindings
string_wizard

manipulate string like a wizard

v0.0.26 #wizard #string-wizard #chunks
rustdoc-md

Convert Rust documentation JSON into clean, organized Markdown files

v0.1.0 280 bin+lib #converter #api #documentation #rustdoc #markdown #item
cesu8

Convert to and from CESU-8 encoding (similar to UTF-8)

v1.1.0 2.9M #utf-8 #cesu8 #character-encoding
portmanteau

create portmanteaux

v0.2.2 130 #portmanteau #word #vowel #portmanteaux
asciidoctor-client

A kludge to improve the performance of static site generators that use asciidoc through its cli

v0.4.3 260 bin+lib #asciidoctor #client-server #asciidoctor-client #hugo #cli
fast_symspell

Spelling correction & Fuzzy search

v0.1.10 bin+lib #spell-check #symspell #edit-distance #verbosity #sym-spell #strategy #spell-checking #spellcheck
broken-md-links

A command-line tool and library to detect broken links in Markdown files

v2.1.1 bin+lib #broken-links #link #broken-md-links #output
tform

format plain text into well-structured Markdown or HTML

v0.1.1 #convert #markdown #streaming #formatter #conversion #config #io
razy-importer

lazy_importer

v0.3.4 390 #obfuscation #reverse-engineering #malware #lazy-importer #anti-reversing #static-analysis
cedarwood

efficiently-updatable double-array trie in Rust (ported from cedar)

v0.4.6 39K #trie #string-search #cedar #text-search #search #string #text
nucleo-matcher

plug and play high performance fuzzy matcher

v0.3.1 30K #fuzzy-matching #fuzzy-search #nucleo #matcher #fzf #text-processing #performance
duvet

A requirements traceability tool

v0.4.1 850 bin+lib #tool #duvet #testing #report #start #phase
colored_text

adding colors and styles to terminal text

v0.3.0 #ansi-term #terminal-colors #ansi-terminal #text-formatting #terminal
mdbook-pagebreaks

A mdbook preprocessor to insert page breaks when rendering to HTML

v0.3.1 120 app #mdbook #html #break #pagebreaks #title #markdown #io
hebrew_unicode_script

A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections

v2.0.0 800 no-std #hebrew #unicode-text #utf-8 #unicode-characters #no-std #collection
iregex

Intermediate representation for Regular Expressions

v0.1.3 120 #iregex
mdbook-curly-quotes

mdBook preprocessor that replaces straight quotes with curly quotes, except within code blocks or code spans

v0.4.37 260 app #mdbook #markdown #quote
physis

Interact with XIV game data

v0.3.0 #modding #ffxiv #ffxiv-modding #final-fantasy-xiv
crowbook-text-processing

utility functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, French typographic rules)

v1.1.1 100 bin+lib #html-escaping #rules #text #escaping #meaning
utf16_lit

macro_rules to make utf-16 literals

v2.0.2 117K #utf-16 #utf16-lit #lit
kitoken

Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization

v0.10.1 1.3K no-std #tokenize #unigram #bpe #nlp #wordpiece #tokenizer
deliminator

Universal code documentation generator

v0.3.1 bin+lib #deliminator #list #tags #versatile #repl #bash #md #n-a #txt
crate2bib-cli

A CLI tool for the crate2bib crate

v0.3.2 app #crate2bib #crate2bib-cli #crate2-bib #entry
ean-rs

generating and validating EAN barcodes

v0.2.2 #barcode #ean #ean-rs #codes
overlap-chunk

splitting text into chunks of specified size with adjustable overlap percentage

v0.0.3 130 bin+lib #chunking #overlap #text #size
lindera-unidic-builder

A Japanese morphological dictionary builder for UniDic

v0.32.3 16K #japanese #builder #dictionary #morphological #unidic
lexical-sort

Sort Unicode strings lexically

v0.3.1 133K no-std #transliteration #sorting #unicode #no-std #lexicographical
mdbook-langtabs

An mdbook preprocessor that adds language tabs for code blocks

v0.1.1 280 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #tabs #markdown #language #block
mdbook-pdf-headless_chrome

Control Chrome programmatically

v0.1.14 310 #headless-chrome #mdbook #chrome #programmatically
erebus

A CLI message generation library

v0.1.8 #erebus #panic
mdbook-ai-pocket-reference

mdbook preprocessor for the ai-pocket-reference project

v0.1.3 220 bin+lib #reference #artificial-intelligence #pocket #ai-pocket-reference #tabs
moto

motivated automation

v0.2.29 1.0K bin+lib #automation #run-time #moto #variables #task #block
sixbit

Small packed strings

v0.5.0 #unicode #string #small #text
inflections

High performance inflection transformation library for changing properties of words like the case

v1.1.1 888K #camel-case #inflection #traits #inflect #case
kashida

Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.

v0.1.1 490 #justification #arabic #text
substr-iterator

Substring extractor based on characters without allocation

v0.1.3 290 no-std #allocation #no-alloc #iterator #string
fix-name-case

CLI tool to convert variable and function names to snake_case

v1.3.0 140 app #snake-case #refactoring #name
orly

Download O'Reilly books as EPUB

v0.1.7 bin+lib #epub #orly #oreilly-books-downloader #cargo #recommended
str_inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.12.0 24K #inflection #pluralize #snake-case #snake #camel
linkcheck2

extracting and validating links

v0.8.0 #link-checker #linkcheck #validation #link #check
unicode-display-width

Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string

v0.3.0 2.1K #unicode-width #east-asian-width #unicode #wcwidth #wcswidth #width
loc

Count lines of code (cloc) fast

v0.5.0 420 bin+lib #loc #language #seconds #ci #inspect #bench #src #sh
itex

Initialize a LaTeX project inside a folder instantly

v1.3.5 app #latex #settings #instantly #folder #system #template
cbfr

A buffer that runs on the stack, focusing on performance and speed

v0.1.6 bin+lib #string #buffer #text #byte
gigagei

random quote fetching console utility

v0.1.2 150 app #gigagei #utility #text
rust-regex-dsl-creator

Regular expression DSL derive macros

v0.1.8 310 bin+lib #regex #dsl #derive
writings

The Bahá’í Sacred Writings for use in Rust projects and APIs

v0.1.0 #writings #api #section #source
owned_str

Provide a stack-allocated String for no-std or const environments

v0.1.2 130 #string #owned #owned-str #push-str #unsized-str #environement #hello #buff #world
pixt

Terminal Based Cross Platform Image Viewer

v1.1.1 app #viewer #pixt #character #renderer #help #path
aho-corasick-unsafe

Fast multiple substring searching

v0.0.4 no-std #text-search #aho-corasick #string-search #search-pattern #multi #text #pattern #string
rgon

A command-line tool written in Rust that searches for a query string within a file

v1.0.1 bin+lib #search #rgon #txt
tergo-formatter

Formatter for tergo

v0.2.10 #formatter #tergo #tergo-formatter
mdlink

Auto-convert HTTP links for your favorite services into nice Markdown links

v0.2.12 1.4K app #link #mdlink #links
prompt-input

lightweight library for user input prompts in Rust, designed to make input handling straightforward

v1.0.0 #user-input #prompt #cli-input #cli #user
readability

Port of arc90's readability project to rust

v0.3.0 9.0K #readability #port #extractor #toml
turn-uppercase

Small command to uppercase text in command line and copy to clipboard

v0.1.1 app #clipboard #upper-case #turn-uppercase
gliclass-rs

Inference engine for GLiClass models

v0.9.0 120 #classification #nlp #model
iregex-syntax

Common syntax for regular expressions

v0.1.3 120 #regex #iregex-syntax #syntax
matcher_py

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust

v0.5.7 130 #nlp #text-classification
pink_accents

Replacement of patterns in string to simulate speech accents

v0.0.6 bin+lib #accents #replace #rules #text #format
flx-rs

Rewrite emacs-flx in Rust for dynamic modules

v0.2.1 380 #fuzzy-search #string-search #module #search #string
linoleum

but ergonomic line editor

v3.0.1 450 #linoleum #break #edit-result
ultra-nlp

A NLP library

v0.8.0 190 #nlp #ultra #ultra-nlp #ngrams #extract #word #char
split-every

Split for every n occurrences of a pattern iteratively!

v3.1.0 280 #occurrence #split #iterator #string-matching #pattern #string
tree-sitter-stack-graphs-python

Stack graphs definition for Python using tree-sitter-python

v0.3.0 bin+lib #tree-sitter #stack-graphs #python #tree-sitter-python
unclog

allows you to build your changelog from a collection of independent files. This helps prevent annoying and unnecessary merge conflicts when collaborating on shared codebases.

v0.7.3 bin+lib #changelog #git #markdown #config
until_needle

An extension to the BufRead trait that allows reading until a specified pattern (needle) is found

v0.2.0 130 #needle #io #cursor #until-needle-read
cnv

Command-line tool to convert between units of measurement

v0.8.0 500 bin+lib #measurement #cnv #convert #km
org-rust-exporter

exporter for org mode documents parsed with org-rust-parser

v0.1.8 #document #exporter #documents #org-rust-parser
mylibrary_

my personal library

v1.2.7 240 #mylibrary #regex #algorithm
maybe-regex

Wrapper for strings that may be either a regex or a plain-text string

v0.2.1 260 #string #utility #regex
spc-core

A command-line tool for processing and analyzing data from SPC files

v0.1.0 120 #reserved #spc #units
pandoc_types

Rust port of pandoc-types

v0.6.0 110 #pandoc #pandoc-types #inline
furze

finite state transducers (fst) written in Rust

v0.1.1 800 #search-engine #fst #builder
paltoquet

rule-based general-purpose tokenizers

v0.11.0 1.2K #paltoquet
cloc

Count, or compute differences of, lines of source code and comments

v0.6.2 app #cloc #secs #e-g #multi
repvar

A tiny CLI tool that replaces variables of the style ${KEY} in text with their respective value. It can also be used as a rust library

v0.14.1 bin+lib #replace #variables #command-line-tool #unix-style #text-processing
like

SQL LIKE-style pattern matching

v0.3.1 2.7K #pattern-matching #escaping #like #i-like
dnd-character

A Dungeons and Dragons character generator

v0.14.20 2.0K #generator #character #dnd
strinject

Inject text from somewhere else into given text

v0.2.0 150 #string #automation #text #inject #marker #download
libanubhav

management system written in Rust

v0.2.1 #book #libanubhav #books #id #system #exit #language #nichols #martin
ragit-korean

korean tokenizer for ragit

v0.3.5 170 bin+lib #korean #ragit #ragit-korean
notmecab

tokenizing text with mecab dictionaries. Not a mecab wrapper.

v0.5.1 #notmecab #これ #いけ
keep-a-changelog

generating and manipulating CHANGELOG.md files that use the Keep A Changelog format

v0.1.4 650 #keep-a-changelog #changelog #keep
r-matrix

Rust port of cmatrix

v0.2.7 bin+lib #matrix #r-matrix #rmatrix
utf58

High-tech encoding of the Unicode space in one quibble and up to 3 bytes

v0.1.1 300 #utf58 #byte #utf-58
ahtml-from-markdown

Convert Markdown to ahtml HTML element trees

v0.1.0 100 #markdown #tree #ahtml-from-markdown #website
redpen-linter

Rust linter

v0.4.0 250 nightly app #linter #lint #stabilization #default #case
casile

The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry

v0.14.4 bin+lib #pdf-generation #publish #sile #typesetting #wizardry #book #status #setup #settings
raylib_interactive

An interactive library for Raylib

v0.1.5 130 #raylib #button #checkbox #interactive
vectorscan-rs

Ergonomic bindings to the Vectorscan high-performance regex library

v0.0.5 330 #regex #ffi #bindings #text
escrit

learning languages by reading texts

v0.2.2 app #language-learning #language #text #input
filecheck

writing tests for utilities that read text files and produce text output

v0.5.0 70K #testing #filecheck #directive #variables #output #num #test #testing-tools #num-d
syllabize-es

Syllabize Spanish text, and much more

v0.5.2 nightly bin+lib #spanish #syllable #syllabize #syllabize-spanish-text #text
parse-wiki-text-2

Parse wiki text from Mediawiki into a tree of elements

v0.2.0 #text-parser #element #node #elements
epcmanager

EPC text tool for RFID

v0.1.0 app #rfid #ascii #epc
textwrap-macros

procedural macros to use textwrap utilities at compile time

v0.3.0 3.7K no-std #typesetting #text-formatting #wrap #macro
array_tool

Helper methods for processing collections

v1.0.3 27K #vector #substitution #grapheme #unique #string #collection
okkhor

English to Bangla phonetic conversion following the 'Avro' rules

v0.5.2 290 #rules #okkhor
chord3

Create pdf songbooks from chopro source

v0.3.4 app #guitar #music #lyrics #mandolin #chopro
jaaj-rs

Blazingly 🔥 fast 🚀 and memory safe ✨ JaaJ implementation in Rust 🦀

v0.1.0 app #jaaj #font-rendering #jaaj-rs #font #rendering
lowcharts

draw low-resolution graphs in terminal

v0.5.8 18K bin+lib #plot #grep #troubleshooting #console #graph #text
asimov-cli

ASIMOV Command-Line Interface (CLI)

v25.0.0-dev.4 140 bin+lib #asimov #artificial-intelligence #asimov-cli #cli #ai
jawk

JSON AWK

v0.1.15 950 bin+lib #awk #jawk #array
pray

A TUI tool for preparing a prompt for LLMs

v1.5.0 900 app #llm #tui #clipboard #text-processing
normalize-line-endings

Takes an iterator over chars and returns a new iterator with all line endings (\r, \n, or \r\n) as \n

v0.3.0 1.8M #line-ending #normalize #ending #char
fast_whitespace_collapse

Collapse consecutive spaces and tabs into a single space using SIMD

v0.1.0 #white-space #collapse #simd
ripjson

A fast and lean way to grep in JSON files

v0.9.11 app #json #grep #ripjson
pragmatic-segmenter

Rust port of pySBD v3.1.0

v0.1.3 #nlp #segmentation #sentence #boundary #sbd
mdopen

Preview markdown files in a browser

v0.5.0 app #markdown #browser #common-mark #tiny-http
bpetok

CLI for tokenizing text input using Byte Pair Encoding (BPE)

v0.1.2 app #tokenize #bpe #text-tokenizer #text #cli
src2md

Turn source code into a Markdown document with syntax highlighting, or extract it back

v0.1.4 220 bin+lib #markdown #extract #documentation #code
windot

emoji picker

v0.2.2 300 app #emoji #picker #clipboard #gtk
twars-url2md

A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion

v1.4.2 220 bin+lib #markdown-converter #render-markdown #html-converter #html-markdown-converter #html #web #scenario
alphabet_detector

Natural language alphabet detection library

v0.3.0 130 bin+lib #language #word #split #text #match
htmd

A turndown.js inspired HTML to Markdown converter

v0.1.6 7.0K #markdown-converter #html #html-markdown-converter #render-markdown #js #handler
tag2upload-service-manager

Debian tag2upload service manager

v0.1.1 bin+lib #manager #service-manager #service
string-replace-all

String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns

v0.2.1 #regex #string #string-replace-all
sqdj

sqdj shortens delimited data

v0.2.3 app #shortener #delimited #delimited-data #cli #scala
scatternotes

A cli application to manage unstructured notes

v0.1.4 230 app #notes #scatternotes #tags
bbd

Binary Braille Dump

v0.3.2 100 app #dump #character #style #wrapping #stdin #output #nrbt
textmate-scope-selector-peg

Textmate scope selector implementation as a PEG (parser grammar) in Rust

v2.0.0 130 #peg #selector #textmate #grammar
minigrep_jeck

minigrep is a grep clone that takes a query and searches for the query in the file; with added support for regex

v0.1.1 bin+lib #mini-grep #minigrep-jeck #jeck
xenon-lexer

The Xenon compiler's lexer

v0.3.0-alpha-0 600 #programming-language #lexer #language #xenon #xenon-language-lexer #programming
cli_app_capo

CLI application with Unix-like tools

v0.1.2 app #command-line-tool #unix #cli-app-capo #cli-tool
santoka

Translations of 668 of Taneda Santoka's free-verse haiku

v1.0.2 #haiku-poetry #poetry #literature #dataset #haiku #japan #translator
uklatn

Ukrainian Cyrillic transliteration to Latin script

v1.20.0 #transliteration #ukraine #romanization #script #2010 #2021
docket

markdown to HTML documentation rendering

v0.7.1 app #static-site-generator #markdown #docket #rendering
mdbook-merjong

A preprocessor for mdbook to add merjong support

v0.1.1 190 bin+lib #mdbook #merjong #mdbook-merjong #mdbook-plugins
ndef-rs

NDEF (NFC Data Exchange Format) parser and generator in Rust

v0.2.2 210 #mime #ndef-record #ndef-rs #text-payload #array #scratch #api
slugify

Macro for flexible slug generation

v0.1.0 31K #slugify #slug #macro #generation #separator
ethan-rs-wc

ethan-rs-wc (erwc) counts words, lines, characters, and bytes. It is like the wc command, but aims to be more accurate and faster. Text can also be read from standard input.

v0.1.1 bin+lib #wc #statistics #txt #erwc
yara-x-fmt

A code-formatting library for YARA rules

v0.14.0 140 #yara-x #yara #rules
chamkho

Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line

v1.4.3 110 app #nlp #thai #lao #library #text
huski-auxies

Auxiliaries for huski implementation

v1.0.5 160 #huski-auxies #huski #auxies
rusty-dawg

building and querying Directed Acyclic Word Graphs (DAWGs) and Compacted DAWGs (CDAWGs) for efficient string indexing and searching

v0.2.2 bin+lib #cdawg #dawg #rusty-dawg #bindings
uniquewords-rs

Count the frequencies of words in text file(s) or stdin

v0.9.1 650 app #nlp #file #stdin #pre-processor
xml_magic

A reasonably fast XML formatter

v1.0.0 app #xml #cli #formatter #file #style
unicode-intervals

Search for Unicode code point intervals by including/excluding categories, ranges, and custom character sets

v0.2.0 #code-point #interval #unicode #unicode-category #lowercase-letter #include-characters #max-codepoint
adobe-cmap-parser

parse Adobe CMap files

v0.4.1 45K #postscript #parser #pdf #cmap #font
lingua-english-language-model

The English language model for Lingua, an accurate natural language detection library

v1.2.0 14K #language-recognition #lingua #language-detection #nlp
fea-rs

Tools for working with Adobe OpenType Feature files

v0.19.0 bin+lib #opentype #font #validation #compilation #parser
analiticcl

approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation

v0.4.8 bin+lib #spelling-correction #nlp #linguistics #levenshtein #spell-check #text-processing
re_case

Case conversions, the way Rerun likes them

v0.23.0-rc.2 39K #multimodal #re-run #re-case #robotics #cpp #visualization #python
webspeeddial

A dial system for websites

v1.0.0 130 bin+lib #website #webspeeddial #bookmarks #com #gentoo #forms #fzf #dmenu #lars-zauberer #wofi
naming_utils

generating naming conventions, pluralizing words, and REST API paths in Rust

v0.1.1 100 #naming #pluralize #case-conversion #utility #path
readability-liveboat

Port of arc90's readability project to rust, updated for use with liveboat

v0.3.4 450 #readability #readability-liveboat #liveboat #readability-rs
aws-smt-strings

manipulating SMT-LIB strings and regular expressions

v0.4.0 200 #regex #smt-lib #string #smt
lexi-matic

A Lexer Library

v0.1.1 #lexer #regex #lexi-matic #eq
unicode-language

detect language coverage given a list of codepoints

v2.0.3 100 #unicode #language #points #range
bump-bin

Increments version with semver specification

v0.4.3 bin+lib #version-bump #semver #minor-version #cli
ainu-utils

A collection of utilities for the Ainu language

v0.4.0 150 #language #ainu #ainu-utils
vyder_std

Standard library for vyder

v0.3.4 #vyder #vyder-std #std
gregex

Regex solver utilizing NFA

v0.7.2 600 #regex-automata #nfa-automata #regex #nfa
mdbook-toc

mdbook preprocessor to add Table of Contents

v0.14.2 3.1K bin+lib #content #mdbook #toc #contents
gh_page_tool

A GitHub gh-pages tool for static blog sites

v0.4.0 app #page #publish #site
tre-regex

Rust safe bindings to the TRE regex module

v0.4.1 #regex #safe-bindings #tre #api-bindings
daffy

small file comparison tool, uses Levenshtein distance to compare files

v0.2.1 app #daffy #distance
trust_pdf

Verifies signed PDFs against the originals, checking for sneaky modifications

v3.0.1 650 #pdf #document #security
context-notation

Featherweight semantic notation for text

v0.1.4 130 #text #context #notation
tantivy-stemmers

A collection of Tantivy stemmer tokenizers

v0.4.0 750 #tantivy #tokenize #stemmer #algorithm #tokenizer
enma

serving anime and manga information 📦

v0.9.2 #web-scraping #manga #anime #otaku #rust #scraper
iconv-native

A lightweight text encoding converter based on platform native API or libiconv

v0.1.0 100 #iconv #unicode #wasm
asciidork-backend

Asciidork backend

v0.18.2 190 #back-end #asciidork #asciidork-backend
unicodeit

Converts LaTeX to Unicode (rust port)

v0.2.0 #unicodeit #port #latex
analyst

A command line tool that supports quick browsing of csv data

v0.1.0 bin+lib #analyst #00 #age
streampager

pager for command output or large files

v0.10.3 700 bin+lib #pager #less #more #config #indicator
cheat_checker

Detects similarities between sets of files

v2.7.0 app #checker #encoding #cheat-checker
mdbook-plugin-utils

mdBook plugins

v0.2.3 800 #mdbook-plugin-utils #plugin #mdbook-plugins
mdbook-trunk

mdBook plugin which bundles packages using Trunk and includes them as iframes

v0.2.3 330 bin+lib #trunk #mdbook-trunk #mdbook #web
mdbook-plantuml

A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams

v0.8.0 750 bin+lib #plant-uml #mdbook #markdown #diagram #common-mark
mdbook-kroki-preprocessor

render kroki diagrams from files or code blocks in mdbook

v0.2.0 app #mdbook #diagram #kroki #proprocessor
mj_minigrep

Welcome to mj minigrep project

v0.1.6 200 bin+lib #mj-minigrep #mini-grep #create
ik-rs

Chinese word segmentation, ik-analyzer for Rust

v0.7.0 #information-retrieval #tantivy #ik-analyzer #search
rust-beam

A LaTeX slide generator you can write in faster than beamer

v0.3.0 app #beam #beamer #slide #title
rs-tool

A command-line tool to perform reservoir sampling on a file or a stream

v0.1.1 app #statistics #logging #sampling #reservoir #stream #sample
chewing-cli

Tools of the Chewing (酷音) intelligent Zhuyin input method

v0.9.1 app #chewing-cli #chewing #dictionary
runiq

An efficient way to filter duplicate lines from input, à la uniq

v2.0.0 bin+lib #filtering #unique #logging #algorithm
lemmeknow

Identify any mysterious text or analyze strings from a file

v0.8.0 800 bin+lib #cryptography #security #regex #forensics #identify
dirgrab-lib

Core library for dirgrab: concatenates file contents from directories, respecting Git context

v0.2.0 360 #dirgrab #dirgrab-lib #gitignore #target-path
text_trees

textual output for tree-like structures

v0.1.2 9.1K #tree #tree-node #formatter
searcher_txt

A copy of grep that I made to show that I'm bad at Rust

v1.2.6 bin+lib #grep #txt #search #cli #case
regex_generate

Use regular expressions to generate text

v0.2.3 700 #regex #text-generation #regex-text-generation #generation
named_entity_parsing

Named entity parser. Used in Rusev to parse a list of tokens into a list of entities.

v0.4.0 140 #ner #nlp #seq-eval
text_lines

Information about lines of text in a string

v0.6.0 73K #text-lines #line #text
adulting

A program to print one rule at a time from The 25 Principles for Adult Behavior: John Perry Barlow

v0.3.0 360 app #adulting
egg-mode-text

Text parsing for Twitter: character counting, hashtag/mention extraction

v1.15.1 #twitter #extract #egg-mode-text #twitter-text #character-count #entities #length #individually #23
speki-cli

cli version of speki

v0.1.5 410 app #speki #speki-cli #cli
bwrap

A fast, lightweight, embedded systems-friendly library for wrapping text

v1.3.0 5.3K no-std #wrap #no-std #formatting #line-feed #80-column #80-column-formatting #language
markdown_converter

html to markdown converter and flavored markdown to discord markdown converter

v0.3.4 #markdown-converter #converter #arguments
mdbook-linkcheck

A backend for mdbook which will check your links for you

v0.7.7 3.7K bin+lib #mdbook #mdbook-linkcheck #linkcheck #book
mdi

markdown include

v0.0.39 bin+lib #markdown #mdi #md
tremor-kv

A Logstash-inspired key-value extractor

v0.6.2
mathml-core

MathML type definitions

v0.1.7 #mathml #math-parser #math #define #latex #convert #expression
tgrep

Toy grep that honors .gitignore

v1.6.10 bin+lib #gitignore #grep #search-pattern #pattern
nucleo-ui

TUI wrapper around the nucleo fuzzy matching crate

v0.1.6 420 bin+lib #nucleo #nucleo-ui #finder #note
manchu-converter

Converts transcribed Manchu text to Manchu script with Manchu alphabet

v0.4.0 #manchu #manchu-converter #converter
bt-echo

implementation of the echo command-line utility

v0.1.1 app #echo #bt-echo #io #sequence #string
sm-search

way of searching through text - for people who are too lazy to use Regex

v0.1.3 260 bin+lib #regex #sm-search #sm
langsan

sanitizing language model input and output

v0.0.10 #language-model #input-validation #language #model
mttf

working with TrueType fonts. Most parts are zero-allocation.

v0.1.7 140 #mttf
html-linter

An HTML linting library for checking HTML structure and semantics

v0.1.1 #linter #semantic #html-linter #text-content #rules #pattern #compound
rust_readability

A package to assess the complexity of texts using a variety of readability formulas

v0.2.0 170 #nlp #readability #flesch-kincaid #coleman-liau #lix #rix #write #txt #string
mdbook-fs-summary

Summary generator for mdbook

v0.2.1 130 app #mdbook #summary #markdown #static
forbidden-bands

8-bit string handling library

v0.2.3 #c64 #ascii #8-bit #unicode #string #ascii-text
whichlicense_detection

detect licenses used by the WhichLicense project

v6.0.0 180 bin+lib #detect #detection #whichlicense-detection #text
fr_alebref_libbrefdata

BrefData library

v0.4.1 210 #fr-alebref-libbrefdata #libbrefdata #alebref
csml_interpreter

The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots

v1.11.2 #chat-bot #programming-language #csml #interpreter
CLI_Project_Scott_Coakley

CLI Project in Rust

v0.1.0 app #cli #scott #coakley
regexy

lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex

v0.2.0 #regex #regexy #text #is-match
parse2csv

parse a log file by regex and output it to stdout as CSV

v0.2.0 app #csv #regex #parse2csv #run
typope

Pedantic source code checker for orthotypography mistakes and other typographical errors

v0.4.0 120 bin+lib #typography #spelling #pedantic #language #error
minos-codex

Minos Codex is a tool for detecting and identifying secrets in a string

v0.0.32 100 bin+lib #string #codex #secret
mdbook-github-authors

mdbook preprocessor to display GitHub profiles of authors of a page

v0.1.0 bin+lib #mdbook-github-authors #author #github #contributors #page #chapter #github-authors #user-name
message_segment_calculator

package to calculate SMS message segments

v0.1.1 bin+lib #sms #message #calculator #ucs-2 #header #twilio #sms-messages
cqtool

converting between CQ strings and message segment arrays

v0.1.0 #cqtool #array #可以完成cq字 #串与消息段数 #之间的 #arrays #将消息转为消 #段数组格式 #将消息转为cq #符串格式
spanned

string processing with file/line/col information and the regular rust str API

v0.3.0 34K #api #spanned #u8
hanconv

Convert between Chinese characters variants

v0.3.4 150 bin+lib #simplified-chinese #traditional-chinese #chinese #utf-8 #cli
mdbook-cat-prep

a preprocessor for mdbook which provides teacher, subject, material and tag functionality

v1.0.9 bin+lib #wiki #mdbook #education #cat #materiálů
wikipedia_prosesize

Count Wikipedia prose size

v0.3.0-rc.2 340 #wikipedia #mediawiki #size #prosesize
cocomo

(Constructive Cost Model) CLI utility and library

v0.10.2 bin+lib #tokei #scc #cloc #loc #sloc #month #arguments
iregex-automata

Finite automata definitions for the iregex crate

v0.1.3 130 #regex-automata #nfa-automata #dfa-automata #automata #regex
bk-tree

A Rust BK-tree implementation

v0.5.0 5.8K #bk-tree #fuzzy-search #levenshtein #search #metrics
nlf

A CLI to append newline characters (LF) at the end of text file

v0.2.0 app #nlf #简体中文 #txt
svgbob

Transform your ascii diagrams into happy little SVG

v0.7.6 4.3K #svg #diagram #ascii #bob #text
filenamify

Convert a string to a valid filename

v0.1.2 2.4K #filename #normalize #filenamify #text-processing
mdbook-dtmo

Creates a book from markdown files with added plugins

v0.15.2 app #markdown #rust-book #gitbook #book #plugin
unified-diff

GNU unified diff format

v0.2.1 32K bin+lib #unified-diff #format #package #toml
inflector-plus

Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.11.7 #inflection #pluralize #snake-case #snake #camel
sre-engine

A low-level implementation of Python's SRE regex engine

v0.4.3 550 #regex #sre #engine
zipcodes

Query US zipcodes without SQLite

v0.3.4 400 #zipcode #filter #sqlite #state #us
prescript

parsing and executing Prescript scripts

v0.1.1 #prescript #font #comments #structure #ni-pdf
auto-regex

Automagically finds a regex that best matches an example and a sample list

v0.1.3 #regex #dataset #auto-regex #text #string #filter
mdbook_fork4ls

Fork of mdBook for mdBook_LS

v0.4.45 1.2K bin+lib #mdbook #rust-book #gitbook #markdown #book
mazer-core

A minimal, simple math markup language that compiles to HTML, written in Rust

v0.12.0 #mazer-core #mazer
xpath-cli

Evaluate XPath selectors on XML or HTML documents

v1.2.0 app #html #xml #xpath-cli #document #documents
commit_crafter

AI powered tool for Git commit message generator

v0.1.5 bin+lib #nlp #commit-message #git #productivity #artificial-intelligence
mdbook-llms-txt-tools

convert mdbook to llmstxt.org format

v0.1.1 app #mdbook #llm #converter #documentation
clipboard-substitutor

CLI tool to monitor clipboard changes and perform operations based on the contents

v0.7.8 app #clipboard #content #text-clipboard
summary-rs

A summary library for lithium battery and sodium ion battery

v0.1.2 130 #docx #battery #summary
hxd

configurable and dependency-free hexdump library

v0.1.1 190 #endian #hexd #endianness #normal #hexd-options-builder #relative-offset #upper-case #spacing
tfidf-text-summarizer

extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary

v0.0.3 #text-summarization #nlp #tf-idf #summary
bukvalno

A cli tool for converting images to ascii art

v0.3.0 bin+lib #ascii-art #art #image
clippy-to-md

cli tool to convert clippy json reports to markdown files

v0.1.0 app #clippy #clippy-to-md
gspell

Rust bindings for gspell

v0.7.0 #gnome #gspell #gtk
interslavic

in rust

v0.2.1 #interslavic #language #gender #stems #csv #com
case

A set of letter case string helpers

v1.0.0 63K #ascii #alphabet #camel #snake #ascii-text #helper #ascii-string #string
lindera-cc-cedict-builder

A Chinese morphological dictionary builder for CC-CEDICT

v0.32.3 16K #cc-cedict #builder #dictionary #morphological #chinese
kbnf-regex-automata

A forked version of regex-automata for kbnf

v0.4.10 no-std #nfa-automata #dfa-automata #regex-automata #regex #dfa
rust-tfidf

calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents

v1.1.1 #tf-idf #document #text-document #statistics #documents
eddie

Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler

v0.4.2 #levenshtein #string-similarity #edit-distance #hamming #jaro #text
enum-ts

TypeScript Enum pattern matcher codegen

v0.2.6 app #typescript #modeling #enums #mvvm #match #pattern-match #matcher
fontconfig

Safe, higher-level wrapper around the Fontconfig library

v0.9.0 1.4K #fontconfig #font #search #wrapper
include-doc

Include examples in your Rustdocs

v0.2.2 600 #rustdoc #documentation #example #source-file
buf-min

Minimal utf-8 safe buffer traits

v0.7.1 9.7K #buffer #traits #buf-min
mdbook-presentation-preprocessor

A preprocessor for utilizing an MDBook as slides for a presentation

v0.3.1 app #pre-processor #mdbook #presentation #rust-book #markdown #gitbook
unicode-matching

match Unicode open/close brackets

v0.5.4 #brackets #unicode #txt #find-matching
mdbook-ocirun

mdbook preprocessor to run arbitrary commands and code snippets inside containers

v0.2.1 130 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #container #ocirun #snippets
srx

A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation

v0.1.4 3.0K #srx
mdbook-quiz-schema

Schema for quizzes used in mdbook-quiz

v0.3.10 bin+lib #mdbook #mdbook-quiz #schema #learning #markdown #toml #tracing
char-ranges

Iterate chars and their start and end byte positions

v0.1.2 no-std #position #range #text #char #double-ended #no-std
stringmatch

Allow the use of regular expressions or strings wherever you need string comparison

v0.4.0 18K #regex #compare #string-comparison #comparison #string
koto_regex

A Koto library for working with regular expressions

v0.15.3 180 #koto #regex #scripting-language #expression
ironsmith-parser

Transforms Smithy 2.0 IDL files into an abstract syntax tree

v0.1.0 #text #ironsmith #tree
pulldown-html-ext

Extended HTML rendering capabilities for pulldown-cmark

v0.5.0 230 #element #block #class #pulldown-cmark #rendering #writer #config #highlighting #mapping #control
ipynb-to-md

Convert Jupyter Notebooks to Markdown files

v0.2.0 app #jupyter-notebook #markdown #convert #notebook #jupyter
cglue-bindgen

cleanup cbindgen headers for CGlue

v0.3.0 250 app #c-glue #abi #ffi #cbindgen
statisk

opinionated static site generator

v0.2.4 140 app #assets #statisk #generator
simple-logging

logger for the log facade

v2.0.2 9.1K #logging #simple #facade #logger
vocalolyrics

Lyrics scraper, primarily for Vocaloid content. By default, atwiki is used as the source. We plan to make other sources selectable, but that is not currently possible

v0.2.4 #lyrics #vocalolyrics #testing
truncate_string_at_whitespace

Truncate a &str at the closest whitespace to a specified length with unicode safety

v1.0.3 120 #white-space #truncate #string #safety #truncate-text
samvadsetu

LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…

v0.1.2 #llm #ollama #large-language-model #gemini
unicode-ellipsis

truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long

v0.3.0 5.1K #ellipsis #unicode #unicode-text #word #string #text
rst_parser

a reStructuredText parser

v0.4.2 370 #restructuredtext #rst-parser #parser
midstring

Create a string between two other strings, that is lexicographically halfway between them

v0.1.3 bin+lib #lexicographically #lexical #midstring #string #sorting #aan
gchemol-parser

Text parsing made simple

v0.5.1 750 #text-parser #combinator #gchemol #text-reader #line
stringutil

A collection of useful string utilities

v0.1.0 100 #string-utilities #string #tool #utilities
linurgy

Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers

v0.6.0 #newlines #text-editors #text #ending #crlf #stream #newline
ps-str

String transcoding library

v0.1.0-2 150 #ps-str #str
jpreprocess

Japanese text preprocessor for Text-to-Speech applications (OpenJTalk rewritten in Rust)

v0.12.0 290 bin+lib #text-to-speech #open-j-talk #library
owlz

"Owlz" ascii emojis, created randomly or by design

v0.1.2 130 #emoji #owls #random #generation
asciidork-parser

Asciidork parser

v0.19.0 320 #asciidork #parser #asciidork-parser
rust-texas

generate latex documents

v0.3.6 490 #latex #pdf-generation #texas #pdf
pulldown_typst

A pull parser for Typst markup

v0.3.7 #typst #markup #pulldown-typst
markov_str

Markov Chain implementation optimized for text generation

v0.3.0 #markov-chain #text-generation #string #markov
csv_to_table

pretty print CSV as a table

v0.7.0 300 #pretty-table #csv #table-generator #table
rig-dyn

A dynamic client-provider abstraction framework for Rust applications on top of rig-core

v0.3.0 280 #provider #traits #rig-dyn #rig-core #discovery #communication
summavy-ownedbytes

Expose data as static slice

v0.5.0 #slice #search-engine #tantivy
ngrammatic

Character-oriented ngram generator and fuzzy matching library

v0.4.0 110 #ngrams #shingles #fuzzy #pad
fyi_ansi

Compile-time ANSI formatting macros for FYI

v2.0.2 290 #ansi #csi #ansi-csi
rusty_regex

A minimalistic regex engine in Rust using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)

v0.2.0 220 bin+lib #regex #digits #string #character #underscore #default #position #alternative
thesaurus

An offline thesaurus library for Rust

v0.5.2 #synonyms #thesaurus #thesaurus-rs
litime

A command line tool to display the current time ish with a literature quote

v3.3.0 bin+lib #quote #literature #time
glk

Bindings for the Glk I/O interface for hosting interactive fiction interpreters

v0.2.0 #glk #glulx #blorb #interpreter #if #associated
timeblok

A language for event scheduling in plain text

v0.5.0 170 #calendar #dsl #productivity #planning #ics #text
tectonic

A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.

v0.15.0 750 bin+lib #typesetting #latex #tex #font
gaze

small utility library with the goal of making it easier to scan/lex text and collections

v0.5.0 #unicode-segmentation #gaze #preface
fast-str

A flexible, easy-to-use, immutable, efficient String replacement for Rust

v1.0.0 #string #serialize #serde #constant #serde-serialize
block-list

A minimalist hosts-based tool for managing block lists and ad-blocking

v1.1.4 app #block-list #privacy #host #ads #block #privacy-tools
enc-check

inspect utf-8 and utf-16 character encodings

v0.2.1 130 app #utf-8 #unicode #inspect
spezilinter

spezifisch's linter for different file formats, linting for weirdly specific stuff

v1.1.2 bin+lib #markdown #spezilinter #once
unicode_extension

Don't use this crate

v0.4.0 210 #unicode #string #extension
tectonic_bridge_core

Exposing core backend APIs to the Tectonic C/C++ code

v0.4.1 550 sys #typesetting #tex #tectonic #xetex #texlive #path
abbreviation_extractor

extracting abbreviations from text

v0.1.4 #nlp #abbreviation #extractor #biomedical #text-processing
mecab

Safe Rust wrapper for mecab, a Japanese-language part-of-speech and morphological analyzer library

v0.1.6 #japanese #analyzer #mecab #libmecab #morphological
vndb_tags_get

convert VNDB tag list (JSON to markdown)

v1.2.1 app #markdown #tags #vndb
archive-pdf-urls

Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine

v0.5.1 150 app #archive #pdf #url #machine
binatime

A binary clock in the terminal

v1.0.1 app #binatime
textgridde-rs

dealing with Praat TextGrid files. MIT licensed.

v0.1.5 #linguistics #phonetic #file-format #praat #textgrid
caseless

Unicode caseless matching

v0.2.2 79K #caseless #matching #rust-caseless
randem

Print a random emoji optionally with the given string as seed

v0.1.2 240 bin+lib #printing #seed #randem
metatron

core library

v1.1.1 600 #report-generation #template-engine #data-reporting #text-report #pdf
modeling

tools to analyze different languages with Ctags

v0.6.2 bin+lib #ctags #modeling #plant-uml #golang #visualization #java #opt #model-driven-development
coinflip_animation

coinflip animation in the terminal, as a screensaver or just simply to look at

v0.2.1 240 app #animation #coinflip #coinflip-animation
substring-replace

developer-friendly methods to manipulate strings with character indices

v0.2.2 #replace #substring #substring-replace #methods #string
stur

functions for working with strings

v0.1.1 #stur #string
diffy-imara

Tools for finding and manipulating differences between files

v0.3.2 #diff #patch #merge
uclanr

A random word picker that gives you actually useful words

v2.1.0 app #word #uclanr #amount #words
litua

Read a text document, receive its tree in Lua and manipulate it before representing it as string

v2.0.0 bin+lib #document-generation #markup #lua #content-tree
termdiff

Write a diff with color codes to a string

v3.1.4 130 #diff #terminal #text #text-comparison
easy-regex

Make long regular expressions read like pseudocode

v0.11.7 #regex #multi-language #meta #easy #readable
choco

markup language for dialogue systems

v0.2.2 #system #graph #syntax #text
grammalecte_client

Grammalecte HTTP client

v0.1.5 300 #spell-check #grammalecte #grammalecte-client #client #spell-checking
mdbook_header_footer

mdBook preprocessor to prepend header and append footer to certain chapters

v0.0.2 bin+lib #mdbook #chapter #header #header-footer
aki-xcat

concatenate files that are plain, gzip, xz and zstd

v0.1.36 1.5K bin+lib #filter #text #lz4
rsrusl

A really simple useful library ported to Rust

v0.1.5 250 bin+lib #standard #simple #rusl #useful
glimpse

A blazingly fast tool for peeking at codebases. Perfect for loading your codebase into an LLM's context.

v0.7.0 app #tokenize #directory #back-end #viewing #structure #depth #processing #model #config #counting
spongebob

convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe

v2.0.1 440 bin+lib #spongebob #world #text-processing #wo-rld #wo-rl-d
gte-rs

Text embedding and re-ranking pipelines

v0.9.1 #nlp #text-embeddings #reranking #pipeline #model
vibrato

viterbi-based accelerated tokenizer

v0.5.2 1.1K #japanese #morphological-analysis #tokenize #morphological
clarifai_grpc

The official Clarifai gRPC Rust client

v8.0.0 #deep-learning #grpc-client #artificial-intelligence #image-recognition #computer-vision #clarifai #neural-network
skyspell_kak

skyspell - kakoune integration

v3.0.1 700 bin+lib #spell-check #kakoune #skyspell
riimut

Transform Latin letters to runes & vice versa

v1.2.1 #futhark #runes #younger-futhark #futhorc #elder-futhark #medieval-futhork #staveless-futhark #staveless-runes #transform
csmlinterpreter

The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily

v0.3.2 #language-interpreter #chat-bot #csml #events #interpreter
antex

Styled text and tree in terminal

v0.0.8 bin+lib #text #terminal #styled #ansi #tree #ascii
markdown-extract

Extract sections of a markdown file

v2.0.0 100 bin+lib #markdown #extract #markdown-extract #changelog
IndicScriptSwap

help transliterate between various Indic scripts. It is not ready yet and has many issues. If you encounter any issues, please contact me (https://github.com/mssrprad/transliterate-ferris/tree/cli or pradyumna…

v0.6.0 bin+lib #script #com #indicscriptswap #transliterate-ferris
mdbook-pagetoc

An mdbook plugin that provides a table of contents for each page

v0.2.0 650 bin+lib #toc #content #mdbook #pagetoc #table #mdbook-table-contents #contents
pulldown_mdbook

A pull parser for mdBook

v0.3.2 #mdbook #pulldown-mdbook #pull-down
wiki-tui

easy to use Wikipedia Text User Interface

v0.9.1 100 bin+lib #tui #wikipedia #wikipedia-api #wikipedia-tui
midpad

Command line utility to pad texts

v1.1.1 110 bin+lib #midpad
irg-kvariants

wrapper around kvariant from hfhchan/irg

v0.1.1 16K #kvariants #irg-kvariants #charabia #hfhchan-irg
ohnomore

Transformations for TüBa-D/Z lemmas

v0.5.0 #lemma #lemmatization #transformation
minigrep_nc

An implementation of grep in Rust

v0.1.0 110 bin+lib #minigrep-nc #mini-grep
taboc

A table of contents generator for markdown documents

v0.2.105 bin+lib #taboc #pijul #fossil
tracery

Text-expansion library

v0.2.1 #tracery #text #random #macro #string #execute
ing2ynab

cleans up ing.com.au transactions for YNAB

v0.0.5 app #ynab #ing2ynab #ofx #notes
tokenizations

alignments library

v0.4.2 850 #nlp #algorithm #tokenizations #text #io-tokenizations #python
freesia

some string operators

v0.1.2 #freesia #trim-whitespace #upper-case
pulldown-cmark-mdcat

Render pulldown-cmark events to TTY

v2.7.1 1.4K #pulldown-cmark #markdown #cat #cmark #less
phonet

A CLI tool and library to validate phonotactic patterns for constructed languages

v1.0.2 bin+lib #language #regex #conlang #phone #phoner #lang #statement
strs_tools

Tools to manipulate strings

v0.18.0 340 no-std #wtools #split #strs-tools #general-purpose #sample
split-identifier

Rust package that provides functions to split programmatic identifiers according to case conventions

v0.1.0 #split #identifier #split-identifier #package
markdown-it-footnotes

Creates footnotes and lists of footnotes in Markdown documents

v0.1.0 #footnotes #markdown-it-footnotes #markdown #reference #foo #bar
sbert

Sentence Bert (SBert)

v0.4.1 bin+lib #nlp #transformer #bert #embedding
aho-corasick

Fast multiple substring searching

v1.1.3 19.3M no-std #aho-corasick #text-search #string-search #search-pattern #multi #text #pattern #string #search
unicount

Alphabetic counter supporting unicode

v0.1.2 app #unicode #unicount #separator #english-lower #cv #ct #ac
const_unit_poc

Proof of Concept: Physical units through const generics

v1.1.3 nightly #const-generics #generics #const-unit-poc #message #checked #evaluatable #cm
typeline_ext_utils

operators for typeline

v0.1.0 #pipeline #shell #tl #stream
pprint

Flexible and lightweight pretty printing library for Rust

v0.2.2 #pretty-print #pretty #rust #documentation #printing #model
lingua-spanish-language-model

The Spanish language model for Lingua, an accurate natural language detection library

v1.2.0 11K #language
merge-whitespace

Procedural macros for merging whitespace in const contexts

v1.1.0 150 macro #white-space #proc-macro #graphql #context
budoux

Rust port of BudouX (machine learning powered line break organizer tool)

v0.1.1 #budoux #budou-x #budou-x-rs
committer

git commit message generator

v0.11.1 app #generator #committer
bard

Creates PDF and HTML songbooks out of easy-to-write Markdown sources

v2.0.1 bin+lib #music #markdown #tex #songbook #songwriting #em #go
mdbook-treesitter

mdBook preprocessor for html adding tree-sitter highlighting support

v1.0.0 130 bin+lib #tree-sitter #mdbook #mdbook-treesitter #javascript
emoji

Every emoji, their metadata, and localized annotations

v0.2.1 850 #emoji #man #annotations #variant #glyph #name #classification #version #language
viterbi_pos_tagger

A part-of-speech (POS) tagger using the Viterbi algorithm

v0.1.0 bin+lib #nlp #pos #tagger #part-of-speech
vortilo

Analyzes the grammar of Esperanto sentences

v0.1.1 #vortilo
asciidork-eval

Asciidork eval

v0.18.2 310 #asciidork #eval #asciidork-eval
kurtbuilds_regex

Wraps the regex library to also provide macros

v0.1.1 #regex #macro #kurtbuilds-regex
unaccent

remove accents from strings, inspired by PostgreSQL's unaccent extension

v0.1.1 300 #unicode-normalization #diacritics #string-utils #text-processing #normalization #unicode
rosie

Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search

v0.1.1 #regex #rosie #fsa #pattern-matching
xee-ir

Xee intermediate representation and compilation to bytecode

v0.1.4 550 #xslt #xpath #xee #xml #bytecode
man

Generate structured man pages

v0.3.0 3.7K #manpage #flags #output #com #author #short #long #note #name #status
text-parsing

Hierarchical text processing preserving char position info

v0.6.6 140 #text-parsing #info #parser
mdbook-spec

An mdBook preprocessor to help with the Rust specification

v0.1.1 bin+lib #specification #mdbook #mdbook-spec
purlu

A full-text search engine

v1.0.0 #purlu #english #cute #index #object
kl-hyphenate

Knuth-Liang hyphenation for a variety of languages

v0.7.3 #typesetting #language #unicode-segmentation #text #segmentation #normalization
mdbook-yml-header

mdBook preprocessor for removing yml header

v0.1.4 app #mdbook-preprocessor #mdbook #markdown #mdbook-pre-processor #rust-book #book
utf8streamreader

lookahead iterator on a UTF-8 byte stream

v0.1.0 #utf8streamreader #utf8-reader #stream
regex-filtered

Efficiently check an input against a large number of patterns

v0.2.0 2.3K #regex #multiple #filtered #prefilter #filtered-re2 #filter
advent-ocr

Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters

v0.1.5 500 #advent-of-code #ascii #ocr
ob

A Blog and RSS system written in Rust

v1.0.10 app #rss #static-site-generator #blog
sample-std

Sampler definitions and implementations for st

v0.2.1 18K #random #st #sample-std #strategies
knock-knock

CLI tool for obtaining and outputting domain name information in an easy-to-read format

v0.3.2 260 app #knock #information #knock-knock #format #cross-platform #source
lookbook

Component preview framework for Dioxus

v0.2.0-alpha.1 #dioxus #preview #component
etch

Not just a text formatter, don't mark it down, etch it

v0.4.2 13K bin+lib #etch #word #css
xi-rope

A generic rope data structure built on top of B-Trees

v0.3.0 140 #rope #data-structures #text-editing #editor
goofy-animals

Generate a name in adjective-adjective-animal form

v0.0.2 no-std bin+lib #random #no-std #naming #random-generator #forms #generator
surt-rs

Sort-friendly URI Reordering Transform (SURT)

v0.1.3 260 bin+lib #web-archiving #archive #normalization #url #generate-surt
bfom-lib

Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?

v0.1.52 1.4K #markdown #bfom #bfom-lib #wrong #useage
font-map

Macros and utilities for parsing font files

v0.2.9 130 #font #svg #true-type-font #macro #preview #api-bindings
notion2html

Convert Notion pages to HTML

v1.0.1 app #render-markdown #notion #html #markdown
mmsearch

A command-line tool for finding characters in text files. Only UTF-8 encoded files are supported

v0.1.1 bin+lib #mmsearch #个从文本文 #中查找字符的 #只支持utf8编码 #它允许用户搜 #文本文件内容 #是一个命令行 #这是一个简单 #doc查看
mini-openai

An OpenAI API client with minimal dependencies

v0.1.2 #chatgpt #artificial-intelligence #llm #ollama #server #openai
plagiarismbasic_lib

Basic plagiarism checker written in Rust

v1.2.0 #lib #plagiarism #string-similarity #wip
markdown2pdf

Create PDF with Markdown files (a md to pdf transpiler)

v0.1.3 230 bin+lib #pdf #markdown #md #markdown-to-pdf
unescape

Unescapes strings with escape sequences written out as literal characters

v0.1.0 156K #escaping #unicode #string
mdbook-svgbob

SvgBob mdbook preprocessor which swaps code-blocks with neat SVG

v0.2.1 1.1K app #mdbook #svg #bob #markdown
typos-cli

Source Code Spelling Correction

v1.31.1 14K bin+lib #spell-check #spelling #typo #correction #code-quality #pr #monorepo #cli
lindera-ipadic-neologd-builder

A Japanese morphological dictionary builder for IPADIC NEologd

v0.32.3 15K #japanese #builder #dictionary #ipadic #neologd
regex-chunker

Iterate over the data in a Read type in a regular-expression-delimited way

v0.3.0 bin+lib #regex #iterator #chunking #read #btree-map
gaoya

Locality Sensitive Hashing Data Structures

v0.2.0 1.0K #min-hash #lsh #simhash #dedup #document #structures #neardup #locality-sensitive-hashing #search #fx-hash-set
rascii_art

Advanced ASCII Art Generator

v0.4.5 500 bin+lib #ascii-art #generator #art #image #filename #img2ascii #ascii #charset
boreal-cli

CLI utility to run boreal, a YARA rules engine

v0.9.0 480 app #string-matching #yara #boreal #scan #pattern-matching #engine #yara-scanner
linebreak

breaking a given text into lines within a specified width

v0.3.1 #line-break #wrap #character #version #line #break
morse_code_parser

A Morse code parser and decoder implemented in Rust

v0.1.2 bin+lib #morse #rust #parser
rsrpp

project for research paper pdf

v1.0.12 700 #rsrpp #parser #field
sanitizer

A collection of methods and macros to sanitize struct fields

v0.1.6 7.2K #validation #sanitizer #e164 #trim #case
diff-man

diff utility lib

v0.1.7 320 bin+lib #diff #lib #diff-man
dicexp

A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")

v1.1.1 bin+lib #ttrpg #dice #random #roll-dice #interpreter #2d8-5
bilingual

A command-line tool for translating Markdown by calling Chinese cloud translation API services

v0.1.3 bin+lib #bilingual #api #文本 #文件的 #tags #小牛 #腾讯 #使用翻译云服 #百度 #文件也包含很
marcus

An experimental Markdown parser written in Rust

v0.1.2 #marcus #glob
mdbook-infisearch

InfiSearch plugin for Mdbook

v0.10.1 app #mdbook #infisearch #search #static-site #jamstack #wasm #javascript
abbreviator

abbreviating long words

v0.1.9 #word #abbreviator #abbreviate #words
google_translate_request

Google translate request to a specific endpoint

v1.0.0 #translation #google-translate #endpoint
rsnltk

Rust-based Natural Language Toolkit

v0.1.3 bin+lib #nlp #semantic #nltk #stanza #text-analysis
hns

Human numeric sorting program — does what sort -h is supposed to do!

v0.2.0 app #stdout #stdio #human-numeric-sort #coreutils #stdin #numeric-sorting
display_bytes

Human-readable display of byte sequences

v0.2.1 550 #pretty-print #display #pretty #byte
natural

Pure rust library for natural language processing

v0.5.0 21K #soundex #tokenize #tf-idf #ngrams #phonetic #classification #padding #distance #serde #inflector
ipa-translate

translating between IPA and ASCII text

v0.2.0 #text-translation #ipa #ipa-translate #text
palmdoc-compression

Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77

v0.3.1 180 #lz77 #compression #palmdoc #kindle #decompression
dingtalk

Robot util: send text/markdown/link messages using a DingTalk (钉钉) robot

v2.0.2 #ding-talk #robot #sdk #钉钉机器人 #message #com-doc #card #action #btn
chromalog

A customizable logger with dynamic color coding and file logging

v0.0.2 #logging #log-file #console-logger #colored #console #file
leptos-markdown

A component that renders Markdown as an HTML element in Leptos

v0.1.0 app #leptos #leptos-markdown #markdown
nfa_regex

NFA regex engine for text processing

v1.0.1 #nfa #regex #nfa-regex
hmd

Custom Markdown Engine for my personal blog

v0.4.13 #ssg #markdown #md #web
easy_io

Fast and dead-simple IO for competitive programming in Rust

v0.3.0 #competitive-programming #io #input-reader #output-writer
formatjson

Formats JSON files

v0.3.1 1.0K bin+lib #json #formatting #formatting-json #formatter #speed
realhydroper-utf16

Work with UTF-16 in Rust

v1.1.0 #utf-16 #string #realhydroper-utf16
fsays

flavored replacement for the classic cowsay

v0.3.0 app #cowsay #rustaceans #fsays #print #ferris
heart-strings

Quickly get random heart emojis to copy!

v1.0.0 app #emoji #fun #heart #copy #revolving-hearts-cupid #gift-heart-heartpulse #cupid-heartpulse
anslatortray

translate from English to Pig Latin!

v0.5.0 bin+lib #text-translation #translator #localization #latin #pig
sastrawi

stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian

v0.1.1 #librian #sastrawi #indonesia
r4d

Text oriented macro processor

v3.1.0 210 bin+lib #processor #macro #rad #text-processing #cli
cli_app_capo15

CLI application with Unix-like tools

v0.1.1 app #command-line-tool #unix #cli
treebender

An HDPSG inspired symbolic NLP library for Rust

v0.1.1 bin+lib #nlp #parser #earley #hdpsg #syntax #earley-parser
avatarsay

Beautiful quotes from Avatar: The Last Airbender

v0.1.3 app #terminal #quote #airbender #shell #kitty #wezterm #iterm
scraps_libs

Scraps is a static site generator based on Markdown files written with simple Wiki-link notation. It can be used primarily for personal or team knowledge management.

v0.21.5 700 #scraps #tags #libs #markdown #static-site-generator #personal-knowledge-management
dedent

Procedural macro for stripping indentation from multi-line string literals

v0.1.1 3.1K macro #indentation #proc-macro #formatting #string-formatting
convert_string

A trait to convert Strings to safe non-keywords and/or convert a String's case (snake_case, PascalCase, ...)

v0.2.0 140 #reserved #formatting #string-formatting #keyword #case
wtf8-rs

WTF-8 encoding

v1.1.0 #unicode #wtf8-rs #wtf8
texcore

Create LaTeX documents using native Rust types

v0.7.2 nightly #latex #element #texcore
matchpick

Find and replace multi-lines using a match-case

v0.2.1 bin+lib #match-case #file #matchpick #stdin
ohos-ime

Bindings to the inputmethod API of OpenHarmony

v0.2.0 6.0K #harmony-os #open-harmony #input-methods
catsay-AK

A catsay cli

v0.1.0 app #catsay #catsay-ak #dead
meep

pasting service

v1.0.1 nightly app #pastebin #pasting #pastebin-service #command-line
mdbook-auto-gen-summary

A preprocessor and cli tool for mdbook to auto generate summary

v0.1.10 490 app #mdbook #summary #markdown #md
porigon

Lightweight FST-based autocompleter library, targeting WebAssembly and data stored in-memory

v0.4.0 #in-memory #porigon #searchable #use-case
uwurs

UwUify your strings with uwurs!

v0.3.4 #character #uwurs #transformation #mapping #emoji #probability
csv-groupby

execute a sql-like group-by on arbitrary text or csv files

v0.10.0 app #csv #regex #report #sql #text
nu_plugin_clipboard

A nushell plugin to copy text into clipboard or get text from it

v0.102.0 app #nu-shell #clipboard #clipboard-manager #copy #nu-plugin #plugin #json
justify

plaintext while handling Unicode gracefully

v0.1.3 bin+lib #justification #paragraph #justify #gracefully #text
syllarust

quickly counting syllables

v0.2.0 110 #nlp #syllable #rayon #text #language
epub2mdbook

convert EPUB files to MDBook format

v0.15.0 bin+lib #ebook #epub #mdbook #converter
graphannis-capi

C-API to the ANNIS linguistic search and visualization system

v3.7.0 190 #c-api #search-engine #graph-annis
nib

static site generator

v0.0.8 550 #text #cli #nib #generator
vectorscan

wrapper for Vectorscan

v0.1.0 #vectorscan #hyperscan #database
rust-regex-dsl

Regular expression DSL

v0.1.8 #dsl #regex #rust-regex-dsl #why
analyse-json

CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents

v0.6.1 bin+lib #ndjson #json #content #file-path #object #processing #cli #explode-arrays #inspect-arrays #glob
rusile

components for the SILE typesetter

v0.15.12 370 #sile #pdf-generation #rusile #typesetter #tex #typesetting-system #lua #run
mdzk

Plain text Zettelkasten based on mdBook

v0.5.2 bin+lib #mdbook #zettelkasten #notes #markdown
fancy-regex-fork-pb

A custom fork of the fancy-regex crate. You probably don't want to use this.

v0.3.2 #regex #backtracking #fancy-regex
easy_random

Generate random data easily with easy_random :)

v0.2.5 270 #random #alphabet #string-matching #string #generator #underscore
patiencediff

algorithm

v0.2.0 150 bin+lib #unified-diff #algorithm #patiencediff
ferret

A trigram-based tool for detecting similarity in groups of text documents or program code

v1.1.1 bin+lib #similarity #code #plagiarism #document #collusion #text-document #count
parascope

Weggli ruleset scanner for source code and binaries

v0.1.1 app #binary-analysis #binaries #ida #rules #input #weggli
file-expert

Expert system for recognizing source code files, similar to GitHub/linguist

v1.1.0 bin+lib #expert-system #source-code #linguist #linguist-heuristics
yozuk

Chatbot for Programmers

v0.22.11 140 #chat-bot #telegram-bot #yozuk #programmers #command-line-tool #nlp #development-tools
annotated-string

String with ability to annotate (format) its individual fragments

v0.2.1 #fragment #hi-doc #annotated
ruby_inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.0.10 3.7K #inflection #pluralize #snake-case #snake #camel
readable-regex

Regex made for humans. Wrapper to build regexes in a verbose style.

v0.1.0-alpha1 #regex #readable-regex #lazy-evaluation #01 #12 #31 #why #human
rjoin

joining CSV data on command line

v0.2.0 bin+lib #join #rjoin #field
tengwar

Transliterate text into J.R.R. Tolkien's Tengwar.

v1.1.0 bin+lib #unicode #tengwar #quenya #sindarin #unicode-text #text
unicode-casing

Titlecase helper function on characters

v0.1.0 7.8K #unicode-casing #unicode #casing
pithy

Ultra-fast, spookily accurate text summarizer that works on any language

v0.1.7 bin+lib #nlp #text-summarization #summarize #summarization #text
rk-utils

A collection of utility functions and data structures for rust

v0.2.2 #topological-sorting #trie #string-processing #longest-match #node
none-shall-pass

Artifact for GitHub Action to validate hyperlinks in all markdown files

v0.2.3 app #documentation #pass #none-shall-pass #arguments #page #marketplaces
bstr

A string type that is not required to be valid UTF-8

v1.12.0 8.7M no-std #byte-string #utf-8 #unicode #text #string
recase

Changes the convention case of input text

v0.3.0 130 #camel-case #snake-case #pascal-case #case #conventions #text
google_translator

Custom Google Translator

v0.2.3 bin+lib #translator #language #translation
tpt

Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command

v0.3.0 bin+lib #cat #wc #cli
uo_rst_parser

fork of rst_parser with fixes for upstream-ontologist

v0.4.3 4.4K #restructuredtext #upstream-ontologist #parser
supercat

A syntax highlighting alternative to cat

v0.1.0 app #syntax-highlighting #cat #tree-sitter #cli #engine #numbers
letter-sequence

A method to create a sequence displayed as uppercase or lowercase letters, or digits

v2.1.0 bin+lib #sequence #letter #letter-sequence #sequence-builder #try-from #output
ewts-cli

Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)

v0.1.3 app #converter #ewts #tibetan #symbols #localization #cli
unic-ucd-age

UNIC — Unicode Character Database — Age

v0.9.0 8.4K #age #internationalization #character-property #unicode #unicode-text #text
acorns

Generate an AsciiDoc release notes document from tracking tickets

v1.0.0 1.1K bin+lib #release-notes #asciidoc #documentation #redhat
basalt-core

core functionality for Basalt TUI application

v0.2.2 200 #obsidian #basalt-core #markdown #applications #text #ratatui
escape-bytes

Escapes bytes that are not printable ASCII characters

v0.1.1 13K no-std #escaping #byte #escape-bytes
pulldown-cmark-escape

An escape library for HTML created in the pulldown-cmark project

v0.11.0 455K #html-escaping #render-markdown #common-mark #html
ttaw

talking to a wall, a piecemeal natural language processing library

v0.3.0 #nlp #cmudict #rhyme #alliteration #double-metahone
match-pinyin-with-hanzi

Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.

v0.1.4 #pinyin #hanzi #match-pinyin-with-hanzi
codetypo-cli

Source Code Spelling Correction

v1.30.2 120 bin+lib #spell-check #codetypo #spelling #development-tools #monorepo #correction #pr
fast_trie

A memory efficient trie library

v0.1.4 #string-matching #library #efficient #serde #trie
lindera-decompress

A morphological analysis library

v0.32.3 16K #morphological-analysis #library #tokenize #decompression #multilingual #analysis #morphological
hyper-static-server

friendly library to build static servers with hyper HTTP server

v0.5.1 #server #static #hyper
poriborton

Interconversion between Unicode and various Bengali ANSI encodings

v0.2.3 210 #bengali #unicode #bijoy #ascii #ansi
quickner-core

A fast and simple NER tool

v0.0.1-alpha.20 190 #nlp #ner #config #named-entity
eaverdeja-minigrep

minigrep from chapter 12 of the Rust lang book

v0.1.1 230 bin+lib #mini-grep #eaverdeja-minigrep #book
html-auto-p

function like wpautop in WordPress. It uses a group of regex replaces to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags.

v0.2.4 #paragraph #br #wpautop #autop #html
gstring

String with support for Unicode graphemes

v0.9.1 110 #grapheme #gstring #g-string #string #301
unicodeit-cli

The command line interface to unicodeit

v0.2.0 app #unicode #latex #unicodeit #math
jfmt

command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.

v1.2.1 app #json #formatter #cli #auto-formatter #space
vidyut-prakriya

A Sanskrit word generator

v0.2.0 #sanskrit #nlp #generator #error
next-pagefind

Pagefind for next.js non output export applications. Fully crawl and index your app in one command.

v0.1.4 app #pagefind #next-pagefind #js
royal_road_archiver

An archival program and library for the webnovel site RoyalRoad

v1.0.3 bin+lib #road #royal #webnovel
latex2mathml

Convert LaTeX equations to MathML

v0.2.3 850 #mathml #latex #convert-html #display-style
odict

A blazingly-fast dictionary file format for human languages

v2.4.0 470 #dictionary #file-format #linguistics #language #language-learning
crowbook

Render a Markdown book in HTML, PDF or Epub

v0.16.1 bin+lib #epub #book #markdown #pdf #html #fiction #latex
string-overlap

A helper crate for "layering" ASCII art

v1.0.0 #overlap #string #ascii-art #layer
mask-text

mask text with multiple masking options

v0.1.2 3.6K #mask #mask-text #thanks
engish

A language utility for sampling letters and building words

v0.2.0 #word #english #language #words
rnltk

Natural Language Toolkit for Rust

v0.4.0 #nlp #stemming #sentiment #language
advanced_string_generator

A command-line tool for generating strings based on customizable regex patterns

v0.1.2 bin+lib #regex #generator #rust #cli #pattern
unveil-rs

Unveil Rs is a tool to create presentations from markdown files

v0.1.2-alpha1 bin+lib #css #unveil #js #markdown #slide #reveal
flxy

Full-text searching and scoring of strings

v0.1.19 750 #fuzzy-search #search #string-search #emacs #string
jira-clean

clean up Jira task description that is an output of jira-cli tool

v0.1.2 app #jira #clean #description #issue
lorem-rustum

lib for generating lorem-ipsum with a rusty fleur

v0.0.5 #fleur #lorem-rustum #rustum #start
datatroll

a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files

v0.1.3 #csv #data-science #datatroll #pagination
xmlwriter

streaming XML writer

v0.1.0 290K #xml-writer #xml #svg
gazetta-render-ext

A static site generator framework. Extra render code.

v0.3.0 #static-site #gazetta #blog #framework #assets #website #format
byteutils

that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…

v0.1.0 #byte-string #vec #library #utilities #operation #byte #string
salign

Align and prettify comments in asm files

v1.0.1 app #assembly #salign #asm #arguments
hydroper_source_text

Source text containing line locations

v1.0.3 #text #source #hydroper-source-text
blitztext

fast keyword extraction and replacement in strings

v0.1.1 bin+lib #fuzzy-search #aho-corasick #search #keyword #trie
text-tokenizer

Custom text tokenizer

v0.6.2 600 #tokenize #text-tokenizer #tokenizer
salvation-cosmic-text

Pure Rust multi-line text handling

v0.12.0 no-std #font-rendering #shaping #text-layout
loe

Very fast and yet another line ending (CRLF <-> LF) converter written in Rust

v0.3.0 bin+lib #newlines #lf #eol #crlf
untanglr

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies

v1.1.0 bin+lib #split #string #untanglr #dog #og #thequickbrownfox #umpedoverthelazy #text-processing
encoding-next-index-singlebyte

Index tables for various single-byte character encodings

v1.20180106.0 1.1K #encoding-next #index #cp437 #iso-8859-1 #encoding
regex-split

split_inclusive for the regex crate

v0.1.0 3.8K #regex #regex-split #split-inclusive #day
google-fonts

Download and cache TTF fonts from Google

v0.1.5 #webp #true-type #graphics #font #api-bindings
mdbook-najan

Preprocessor for the Najan mdBook

v0.3.1 210 bin+lib #mdbook #najan #mdbook-najan
csv2ndjson-lite

A little tool to convert a csv to a valid ndjson/json-stream

v0.2.0 app #json-stream #array #csv2ndjson-lite #ndjson-json-stream #arrays #duration
analogize

log analyzer

v0.6.0 bin+lib #analogize #analyzer #status
geoipsed

Inline decoration of IPv4 and IPv6 address geolocations

v0.1.3 app #ip-geolocation #dfir #regex #geolocations #logging #metadata
mdplayscript

An extension of Markdown for play scripts

v0.6.0 #play #pulldown-cmark #markdown #script
cesu8-str

CESU-8 and Java CESU-8 string validation and manipulation

v1.2.1 390 no-std #cesu8 #utf-8 #validation
shopping-parser

A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units

v0.1.1 bin+lib #shopping #parser #cli-parser #rust
typship

A cli for typst packages

v0.4.1 bin+lib #typship #publish #init #notice #tool #help
globber

Extended glob matching library

v0.1.3 #extended #glob #matching #wildcard #range
opstr

‘Operate on strings’ command line utility

v1.1.0 bin+lib #unicode #string #cli #output #default #letter
record-query

doing record analysis and transformation

v1.0.4 bin+lib #query #javascript #record #command-line-tool
panda-re-sys

The official *-sys library for interfacing with PANDA (Platform for Architecture-Neutral Dynamic Analysis)

v0.8.0 260 sys #properties #panda #analysis
slicestring

slicing Strings

v0.3.3 #substring #slice #cut #string #string-slicing
tagalyzer

A CLI tool to gather statistics on collections of plaintext-adjacent files

v0.3.0 bin+lib #tags #statistics #word-analysis #writing-analysis #processing
harfbuzz

Rust bindings to the HarfBuzz text shaping engine

v0.6.0 1.3K no-std #opentype #font-shaping #unicode #unicode-text #font #shaping
roan-engine

The core engine for the Roan project

v0.1.6 100 #engine #roan-engine #roan
punycode

Functions to decode and encode Punycode

v0.4.1 52K bin+lib #punycode #rfc-3492 #assert-eq
lspt

Language Server Protocol (LSP) types made easy

v0.2.0 210 #lsp #proposed #documentation
english

language decliner

v0.0.3 #english #decliner #linguistics #nlp #conjugator #inflector
kakasi

Romanize hiragana, katakana and kanji (Japanese text)

v0.1.0 500 bin+lib #hiragana #kanji #romaji #is-japanese
oxcomm

using Google Translate on the fly

v0.1.2 #text-translation #google-translate #google #text #language #translation
khaiii-rs

Bindings to Kakao Hangul Analyzer III (khaiii) for parsing and analyzing Korean text

v0.1.4 #korean #khaiii #api-bindings #version
hunspell-rs

Rust bindings to the Hunspell library

v0.4.0 4.9K #spell-check #hunspell #hunspell-rs #spell-checking #spellcheck
deinflect

Japanese deinflection

v0.1.4 #deinflection #deinflect #deinflections #string
md-bakery

Markdown Bakery CLI app

v1.2.0 app #bakery #md-bakery #derive #debugging #source #hash-map #end #snippet-b #snippet-a
str-utils

some traits to extend types which implement AsRef<[u8]> or AsRef<str>

v0.1.7 550 no-std #ascii #string #ascii-text #starts-with #caseless #ends-with
markdown-toc

Markdown Table of Contents generator

v0.2.0 bin+lib #toc #markdown #generator #header #table-of-contents #link
reedy

A terminal-based RSS reader with a clean TUI interface

v0.1.4 bin+lib #rss #tui #rss-feed #rss-reader #feed-reader #reader
yara-x-cli

A command-line interface for YARA-X

v0.14.0 130 app #yara #yara-x #yara-x-cli
mdbook-metadata

mdBook preprocessor to parse markdown metadata

v0.1.1 app #mdbook #pre-processor #metadata
cargo-markdown

Local crates.io readme development server with ultra-fast hot reloading goodness

v1.0.3 app #cargo-subcommand #mockups #readme #cli
zummi

fun lib that produces spoonerisms

v0.1.2 bin+lib #spoonerisms #zummi #horld #world
afrim-translator

Manage the predication system of the afrim input method

v0.2.1 #autocomplete #input-methods #afrim #translator #predication #engine #ime #predicate #auto-complete
enpsrlib

English Phrase Structure Rules library

v0.1.0 #phrase #structure #english #linguistics #psr
aki-mline

match line, a regex text filter like the Linux grep command

v0.1.32 1.3K bin+lib #filter #text #aki-mline
baidu_trans

Baidu Translate API

v0.7.5 #translation #baidu #language #translate #百度翻译api
computergeneration

compgen but all wrong

v0.2.0 app #computergeneration #information #lower-case #wrong #sensitive #auto #pattern
asimov-sdk

ASIMOV Software Development Kit (SDK) for Rust

v24.0.0-dev.22 no-std #artificial-intelligence #asimov #sdk
nesty

Generate code with human readable indentation

v0.2.0 1.8K #nesty #indentation #world #produce #newlines #if-expr #crlf #newline
forestrie-builder

Build a trie and convert it to a TokenStream

v0.3.1 600 #forestrie #builder #forestrie-builder #token-stream
serbian-cyrillic-latin-conversion

Serbian Cyrillic to Latin and Latin to Cyrillic conversion library

v1.0.2 #cyrillic #latin #serbian
roxy_markdown_parser

Roxy plugin for parsing Markdown

v0.1.2 #markdown-parser #roxy-markdown-parser #roxy #markdown
perm-text

curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents

v1.0.4 app #quote #apostrophes #curly #text
derek-minigrep

grep clone

v0.1.1 bin+lib #derek-minigrep #clone #mini-grep
schmfy

Schmfication library

v0.3.0 360 #schmfy #everything #non-alphabetical
marktask

A CLI tool for parsing and manipulating Markdown tasks

v0.2.0 bin+lib #task #markdown #todo #tasks
onig_sys

onig_sys crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…

v69.8.1 619K sys #regex #bindings #oniguruma
wildcard_ex

extended wildcards that allow VB-like specifications

v0.1.2 bin+lib #wildcard #string-matching #pattern #ex
roman_numerals_fn

A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.

v1.0.0 #roman-numeral #roman-numerals #roman #numeral
c6o-obsidian-export

associated CLI program to export an Obsidian vault to regular Markdown

v21.9.0 bin+lib #obsidian #markdown #export #front-matter #groenen #embed #notes #bot #commit #version
zoitei

alphabet conversions

v0.1.0 #synthesis #zoitei #convert #conversion #conversions
imperative

Check for imperative mood in text

v1.0.6 10K #imperative #word #text #description #contribute #word-list
indented_text_writer

IndentedTextWriter

v0.4.0 450 #indent #text-writer #indented-text-writer #i32 #string #write-line
st7789_rs

A driver and graphics library for st7789 displays, primarily used on a Raspberry Pi

v0.1.5 310 #st7789-rs #display #st7789 #pi #lcd #foundation #eventually #computer-microcontroller #devices
just-enough-emojis

text to emoji cli

v2.0.0 app #emoji #cli #text
unicode-vo

Unicode vertical orientation detection

v0.1.0 191K #unicode #detect #unicode-vo #detection
markdown-viewer

Support preview of markdown files

v0.1.0 #markdown-viewer #viewer #markdown
rizzer

Fuzzy matching tool to find string similarity

v0.2.0 #score #similarity #rizzer #description
lingua-german-language-model

The German language model for Lingua, an accurate natural language detection library

v1.2.0 12K #language-recognition #lingua #language-detection #nlp
ascii-hangman-backend

customizable Hangman game with ASCII-art rewarding for children (backend)

v5.7.2 #back-end #ascii-hangman #ascii-art #hangman-game #children #kids
squidge

shortens delimited data

v0.2.3 #shortener #delimited #delimited-data #shorten-line #config
translation-api-cn

Some useful structs for calling Chinese translation api cloud services. A helper tool for bilingual cmdline tool.

v0.1.3 #translation #api-bindings #tencent #bilingual #10 #文件的
maudit

Framework for generating static websites

v0.2.0 #maudit #string
topfew

CLI to find high frequency occurrences in structured text files

v0.2.3 bin+lib #field #cli #topfew
fiberplane-markdown

convert Fiberplane Notebooks to and from Markdown

v1.0.0-beta.14 700 #markdown #notebook #fiberplane #convert
string-simple

containing some simple string utilities that I use in my other projects

v0.1.0 #utility #string #text
rfsee-tf-idf

TF-IDF implementation for rfsee

v0.1.0 #tf-idf #neovim-plugin #rfsee #nvim #index #regex
text-to-png

way to render text to a png image with basic options

v0.2.0 550 #font-rendering #png #text-rendering #svg #rendering
subject-classifier

classifying a commit by its subject

v0.4.2 #changelog #subject #classification
wool

Preview GitHub Markdown Offline

v0.1.3 bin+lib #offline #markdown-preview #markdown #offline-github-markdown-preview #md
common-words-all

Most common words sorted by ngram frequency

v0.0.2 #word #ngrams #english #chinese #french #german #hebrew #russian #spanish #data
rtss

A command-line tool to annotate stdout/stderr with elapsed times

v0.6.2 bin+lib #timestamp #filter #command-line-tool
textgrid

working with PRAAT .TextGrid files with parsing, writing, manipulation, and history tracking modules for TextGrid data

v0.1.0 110 #text-grid #textgrid #interval #format #merge
bbcode-tagger

BBCode tree parser and tagger

v0.2.0 #tagger #bb-code #bbcode-tagger
ry

yaml searching

v0.1.1 bin+lib #yaml #search #string-matching #yq #matching #value #path #array #node
nanoid-dictionary

Popular alphabets for use with nanoid

v0.4.3 700 #nano-id #nanoid-dictionary #nolookalikes
translitrs

Transliteration utility for Serbian language

v0.2.2 bin+lib #transliteration #latin #cyrillic #pandoc #filter #text
akiaki

A good old fashioned wiki engine with a flat-file database

v0.0.3 app #wiki #networking #fast-cgi #server
quewuigrep

grep-like tool written in Rust

v0.1.1 bin+lib #search #case-insensitive #case-sensitive
path2regex

Express style path to RegExp utility

v0.0.4 #routing #express #regex
pomsky-bin

Compile pomsky expressions, a new regular expression language

v0.11.0 bin+lib #regex #pomsky #language
fip

Field Parser, roughly emulating "awk '{print $<field-number>}'"

v1.0.2 app #fip #field-number #find-nth-field
unicode-normalization-alignments

functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

v0.1.12 304K #unicode-normalization #alignment #recomposition #unicode-text #text #decomposition #unicode #normalization
magic_string_rain

magic string

v0.3.5 7.2K #magic #rain #string #magic-string #napi
inslice

A command-line utility for filtering text input by columns and rows

v1.1.0 bin+lib #row #column #inslice #colslc #rowslc
icemelter

minimize files that trigger internal compiler errors (ICEs)

v0.3.2 app #ice #rustc #debugging-tool #github #mcve #cargo-bisect-rustc #report
strcursor

string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries

v0.2.5 1.7K #string #unicode #cursor #grapheme
pi_ucd

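Unicode character functions: get the language range a character falls in and, as needed for text layout, determine whether a character is a single-word character or an alphabetic character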

v0.1.0 #pi #unicode #unicode字符函 #判断字符是否 #字符 #单字字符或字 #获得字符的 #extension #symbols #forms
mdbook-keeper

An improved testing experience for mdbook

v0.5.0 bin+lib #mdbook #book #keeper #done #skeptic
simple_peg

A command-line PEG parser implemented in Rust

v0.3.0 app #peg #compile #peg-parser
sejong

Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.

v0.1.5 #hangul #korean #localization #input
arabic-script

An expressive API for the characters of the Arabic script

v0.1.0 #unicode #arabic #arabic-script
runestr-pancjkv

rune-based Pan-CJKV support

v0.1.1 #runestr-pancjkv #pan-cjkv-region #pancjkv #rune-string
text_unit

Newtypes for text offsets

v0.1.10 2.0K #offset #text #text-unit
udp-logger-rs

Log macro for log's kv-unstable backend and a UDP socket logger

v0.1.4 #logging #key-value #udp #kv
mastodon-async-entities

Types for (de)serializing entities from the Mastodon API; part of mastodon-async

v1.1.0 210 #mastodon #mastodon-async #async #openssl
java_string

Java strings, tolerant of invalid UTF-16 encoding

v0.1.2 #utf-16 #string-encoding #java #server #minecraft
wyrcan-todo

A todo manager for managing todotxt based files

v0.1.6 app #wyrcan-todo #wyrcan
sms_splitter

An SMS message splitter and part calculator with support for GSM and Unicode

v0.1.9 #unicode #splitter #sms #gsm #split-sms
search-in-terminal

A terminal-based search tool

v0.1.3 bin+lib #cs #search #terminal #search-engine
hitori

Generic compile-time regular expressions

v0.2.3 300 no-std #regex #expression #hitori #expressions
somedoc

A very simple document model and markup generator

v0.2.10 310 #documentation #model #markdown-flavor #writer
blackboxmc_java

BlackboxMC bindings for java.util

v0.5.1 100 #blackbox-mc #utilities #java
quake

knowledge management tool for geeks

v0.5.0 app #dashboard #knowledge #link #knowledge-graph #markdown #content-management #transflow #search #knowledge-management
frontmatter

A Fairly Trivial Wrapper for yaml-rust to Extract Frontmatter from a String Slice

v0.4.0 360 #front-matter #slice #unstable
cow-rewrite

Rewrite copy-on-write types, copying only when it's necessary

v0.1.0 #cow #rewrite #neccessary
peppergrep

grep utility written following the 12th chapter of the Rust book. Some little modifications were made.

v0.1.1 bin+lib #peppergrep #attend-case
enso-lazy-reader

An efficient buffered reader

v0.2.0 nightly #reader #utf #enso-lazy-reader #read
divvunspell-bin

Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support

v1.0.0 app #spell-check #divvunspell #divvunspell-bin #fst #hfst-ospell
pdfcr

render a codebase to a pdf

v1.3.0 190 app #pdf #pdfcr #file #title #required
wordfreq

port of wordfreq for looking up the frequencies of words in many languages

v0.2.3 #nlp #wordfreq #word-freq
human_regex

A regex library for humans

v0.3.0 #human-readable #regex #end #character #flags #repeat
tree-sitter-stack-graphs-java

Stack graphs for the Java programming language

v0.5.0 bin+lib #tree-sitter #stack-graphs #java
bookgrep

Basic grep equivalent, minor mods to Chapter 12

v0.1.3 bin+lib #bookgrep
ccase

Command line interface to convert strings into any case

v0.4.1 bin+lib #casing #title-case #string #case #pattern #png #boundaries
mdbook-gitbook

mdBook preprocessor to properly render GitBook specific syntax

v1.0.3 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #gitbook #markdown #git-book #syntax
pillar

small tool to format lines into columns

v0.1.2 app #table-column #column #padding #pad #tabs
markdown-extract-cli

Extract sections of a markdown file with a regular expression

v2.1.0 app #markdown #extract #markdown-extract-cli #expression #welcome #md
story-dl

Story web scraping

v0.6.0 bin+lib #epub #story #fanfiction #scraping #site #note
boringascii

Strings that can only be constructed to contain non-whitespace, non-control ASCII characters

v1.0.0 #boringascii #signature #cryptography
bottomify

Fantastic (maybe) CLI for translating between bottom and human-readable text

v1.2.0 160 bin+lib #bottomify #bottom #unicode #text #why
rsmorphy

Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)

v0.4.0 #inflection #russian #pluralize #nlp #ukrainian
correct_word

A No brainer 'did you mean' library for Rust

v0.2.0 220 #levenshtein #correct-word #word #did-you-mean #algorithm #text-processing #word-correction
tet_rs

A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust

v0.3.1 #hci #text-entry #text-entry-benchmark #distribution
kana-converter

converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)

v0.1.2 #kana #full-width #half-width #byte-conversion
ucd

Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…

v0.1.1 6.4K #ucd #unicode-characters #unicode #unicode-text #code-point #character #text
toktkn

a minimal byte-pair encoding tokenizer implementation

v0.1.2 340 #nlp #tokenize #pyo3 #maturin #python
crawdad

ChaRActer-Wise Double-Array Dictionary

v0.4.0 1.3K no-std #trie #double-array-trie #double-array #search #text-search #text
mario_minigrep

first project for minigrep

v0.1.1 bin+lib #mini-grep #mario-minigrep #mario
rusty-axml

A parser for Android AXML files

v0.2.0 bin+lib #axml #name #rusty-axml #status
asciit

A compact and visually appealing ASCII table for your terminal, featuring colored numbers and letters

v1.0.1 110 app #ascii-table #ascii #terminal #terminal-app
spf

.spf (Simple Pixel Font) file parsing, and useful APIs to go alongside

v0.4.1 #spf #parser #surface #language
pukram2html

converting Pukram-formatted text to HTML

v0.3.0 bin+lib #pukram #text-processing #markup #html
soft-ascii-string

char/str/string wrappers which add a "is-ascii" soft constraint

v1.1.0 #ascii #safe-strings #ascii-text #constraints #safe #ascii-string #bug #string #logging #constraint
fenx

parsing and handling FEN and algebraic chess notations

v0.1.1 #chess #chess-board #fenx #fen #notation
text-utils

Text utils for unescaping and aligning

v0.4.3 1.2K #utilities #urlencode #text-alignment #interface #escaping #text
libflagup

Display a country's flag as an emoji

v0.0.8 #libflagup #country #quiz #flagup #homebrew
mitex-spec-gen

Guard to generate specification files for dependent crates

v0.2.4 #latex #specification #typst #math #mi-tex #io-mitex #wasm
nlprule

A fast, low-resource Natural Language Processing and Error Correction library

v0.6.4 2.1K bin+lib #nlp #spell-check #grammar #spelling #text
tzgrep

grep tar.gz

v0.2.0 180 bin+lib #tar #grep #gz
tantivy-object-store

A tantivy Directory implementation against object stores (S3, GCS, etc.)

v0.1.0 #full-text-search #search-engine #object-store #search
little_boxes

Adds boxes around stdin. Optionally adds a title

v1.8.0 app #boxes #title #little-boxes #command-line
apple-notes-exporter

CLI tool for exporting Apple Notes to Markdown

v0.1.0 bin+lib #notes #markdown #apple #export #attachment
unicode-width-16

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

v0.1.0 1.1K no-std #unicode-width #unicode-text #unicode #width #text
committed

Nitpicking commit history since beabf39

v1.1.7 240 bin+lib #git #development #styling #beabf39 #pre-commit #logging #processing
mdbook-bib

mdbook plugin for loading and presenting a bibliography in BibLaTeX format in your books and citing its references

v0.0.6 bin+lib #bibliography #mdbook #pre-processor #plugin #bib
uapi-version

Compare versions according to the UAPI Version Format Specification

v0.4.0 110 #uapi #systemd #version #specification
hebrew

alephbet primitives and parsing library for Rust

v0.1.1 #hebrew #nikkud #dot #end
scie

research on how to build a simple code identification engine for different languages

v0.1.0 app #scie #testing #fs
fuzzywuzzy

A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy

v0.0.2 1.6K #utility #string #text #ratio #process
texting

string helpers

v0.0.7 #string #nlp #texting #text #helper #str
words-count

Count the words and characters, with or without whitespaces

v0.1.6 2.9K no-std #word-count #utf-8 #letter #character #word
mdbook-check-missing-md

A backend for mdbook which will find Markdown files you forgot to list in SUMMARY.md

v0.1.1 bin+lib #mdbook #mdbook-check-missing-md #check
crate-starter

starter

v1.1.0 app #library #example #rust #starter
ptero-cli

A text steganography CLI tool for Social Media

v0.4.2 bin+lib #steganography #encoding-decoding #text #media
regexgrep

ripgrep tool that supports regular expressions

v1.0.3 app #regex #expression #regexgrep #sensativity
profane-rs

Check Messages For Profanity/Swearing

v0.0.4 110 #profane-rs #profane
yazi-prebuild

Used to place the pre-built assets of yazi (https://github.com/sxyazi/yazi)

v0.1.2 2.2K bin+lib #yazi #prebuild #yazi-prebuild #build-deps
emojicon

Find emoji by using emoticons and GitHub or Bengali emoji names

v0.4.0 210 #emoji #unicode #emoticon #bengali #gemoji
notegraf

Core library for building a graph-oriented notebook

v0.1.1 #note-taking #notebook #notegraf #markdown
idna-cli

Encode/decode Unicode domain names to/from IDNA ASCII

v0.2.2 bin+lib #ascii #domain #idna #csv
just-run

Convenience crate for executing system commands with the expectation of successful termination and UTF-8 encoded output, for basic straightforward command execution scenarios

v0.1.0 #success #just-run #variant
markflowy

A Markdown Editor App

v0.7.5 bin+lib #tauri #markflowy #alpha #macos-app #windows-app #reactjs #chatgpt #简体中文 #linux-app #language
left-pad

left-padding for strings

v1.0.1 7.2K #padding #left-pad #pad
commentator

Source code comments extractor binary and SDK

v0.2.3 bin+lib #sdk #commentator #java
align

aligning text

v1.0.0 app #text-alignment #alignment #text
mdbook-skill-tree

mdbook plugin to show roadmaps

v3.0.0 app #skill-tree #skill #mdbook #roadmaps
hyphenation_commons

Proemial code for the hyphenation library

v0.8.4 12K #hyphenation #hyphenation-commons #common
recursive-file-loader

recursively load files via references in the files

v1.0.3 #recursion #loader #recursive-file-loader
mapm

A set of tools, with command line and graphical interfaces, used to build exams typeset in LaTeX

v7.0.0 #latex #problem #mapm
zspell-cli

Command line interface for the ZSpell spellchecking library

v0.5.5 260 app #spell-check #spelling #dictionary #cli #interface #spellcheck
techlead

CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project

v0.2.0 app #chatgpt #openai-api #openai #client #gpt #api-client
Grepulous

An attempt to make a grep like command

v0.1.0 app #grepulous #power
reg_match

A match style regex tool

v0.1.0 #regex #reg-match #macro #match
incredimo

just another font for your terminal

v0.1.17 bin+lib #banner #terminal #art #ascii-art
gripx

a cooler alt for grep built in rust

v0.1.0 bin+lib #gripx #learning-by-doing
sortuniq

Find or count unique values in an input stream

v0.2.0 app #stream #sortuniq #local #film #stage #helena #carroll #television #winifred #november
stringutils

A collection of various and (hopefully) useful String utility functions

v0.0.3 #stringutils #byte-array #rust-stringutils
prism-js

rust bindings for prism.js syntax highlighting library

v0.1.2 #prism-js #js #highlight #punctuation #spans #prism-rs
harfbuzz-sys

Rust bindings to the HarfBuzz text shaping engine

v0.6.1 17K sys #opentype #font-shaping #harfbuzz #unicode-text #shaping #unicode #font #opentype-font
image-to-ascii

Converts images and gifs to ascii art

v0.7.0 100 bin+lib #ascii-art #art #character #font #gif #alphabet #image-path
typst-ts-core

Core function of Typst.ts

v0.5.0-rc6 1.5K #typst #ts #typst-ts-core #wasm #browser
glyph-names

Mapping of characters to glyph names according to the Adobe Glyph List Specification

v0.2.0 3.4K #font-glyph #name #glyph #font #specification #agl-specification #glyph-name
ranting

Linguistic formatting placeholder extensions for rust

v0.2.1 #placeholder #inflection #noun #verb #indefinite-article
ltxcut

formats a table-like stream into a LaTeX-table

v0.1.1 app #latex-table #ltxcut #field #delimiter #wrap-lines #wrap-fields #escape-fields #line #testing #csv
mdbook-summary-generate

A mdbook preprocessor to generate SUMMARY.md from a directory structure

v0.1.2 app #mdbook #summary #mdbook-summary-generate #structure #path
strange

A static website generator

v0.9.0 app #website-generator #markdown #static-website #static #website #generator #web
haseo

diff command line made simple

v0.1.5 bin+lib #haseo #js
mdbook-embed

A preprocessor that simplifies embedded URL

v0.2.0 app #mdbook #markdown #url #mdbook-plugins
cklein

High-level safe bindings to the Klein scripting language

v0.1.0 #cklein #parameters #print #run #fizzbuzz #numbers
detect-indent

Detect the indentation of code

v0.1.0 1.3K #indent #detect-indent #detect
seq2xypic

Turn a text sequence diagram into a LaTeX xypic diagram

v0.1.1 app #seq2xypic #xypic
chunkr

A fast and quick chunking library for rust

v0.1.17 #chunking #chunkr #yourself
cnpj

Brazilian CNPJ parsing, validating and formatting library

v0.2.2 no-std #cnpj #brazil #brasil #numbers #valid
rep-grep

wgrep/write-grep CLI

v0.0.7 app #find-replace #regex #grep #sed
minigrep_dqy

A mini grep-like command, as on Linux

v0.1.0 bin+lib #linux #mini-grep #minigrep-dqy
revstr

Simply reverses strings

v1.0.2 app #string #revstr
campfire

A tiny static site generator, greatly inspired by Zola

v1.1.0 app #static-site-generator #campfire #gardens #why #stream #obsidian #campfires
imagecli

A command line image processing tool

v0.2.1 bin+lib #guide #image #imagecli
character_text_splitter

splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter

v0.1.3 #character-text-splitter #splitter #chunks
chardet

rust version of chardet

v0.2.4 11K #chardet #language #utf-8 #confidence
kvarn-chute

A Markdown converter designed to use the Kvarn templating engine

v0.4.0 bin+lib #kvarn #template #markdown #kvarn-extension
markdown-formatter

Flavored Markdown (ZH) content formatter

v0.0.13 bin+lib #formatter #markdown-formatter #markdown
rupantor

A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic

v0.3.0 #avro #bengali #avro-phonetic #bangla
mdbook-davids_cooking

A preprocessor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs

v0.1.3 app #mdbook #davids-cooking #preprocesor
spare

colorful format iterable

v0.0.3 #iterable #spare #compatibility
guy

Take your terminal to Flavortown

v1.0.1 app #guy #flavortown
mdbook-rustviz

An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects

v0.2.0 app #pre-processor #rustviz #mdbook-rustviz #project
polars_arrow_rvsry99dx

Apache Arrow

v0.17.1 nightly #arrow #analytics #duration #environment #database
bubble-bath

Small and quick HTML sanitizer

v0.2.1 #security #input-validation #xss #html
uwu-rs

uwuifying library

v1.0.0 #owo #uwu #web #version
pst

publish posts to Micro.blog

v0.2.0 app #blog #post #pst #config
kansuji

A library for converting between kanji numerals and digits

v0.1.1 #kansuji #漢数字と #めの #の相互変換の #htm #概要 #puripuri2100 #なお #するかは #めるものとす
is-vowel

Heuristically test whether a character is a vowel letter

v0.1.0 110 #letter #vowel #is-vowel
markx

markdown parser

v0.1.1 #markx #mark2html
mdbook-scientific

Enables equations in mdbook, delimited by $..$ and $$..$$

v0.5.0-beta.3 bin+lib #equation #mdbook #scientific-equation #scientific
trevordmiller

Personal CLI

v1.1.4 app #trevordmiller #principles #cli
rusticsearch

A lightweight, Elasticsearch-compatible search server (early WIP)

v0.0.2 app #rusticsearch #alias #icsearch #operate #sense
esl01-renderdag

Render a graph into ASCII or Unicode text

v0.3.0 600 #esl01-renderdag #renderdag #esl01 #scm
yeslogic-unicode-script

Fast lookup of the Unicode Script property

v1.0.0 300 #internationalization #unicode-properties #script #unicode #unicode-text #text
slack_update

app to set Slack status, emoji and photo

v0.1.7 app #slack #photo #expiration #timestamp
cha-rs

Extract specific characters from an input

v0.0.3 bin+lib #cha-rs #cha #16
bpmf_py

A Bopomofo and Pinyin library

v0.1.0 #pinyin #convert #bopomofo #mandarin
bitflip

functions to generate bitflips of binary and UTF-8 strings

v0.1.0 200 #bitflip #bitflips #blip #bitsquatting
rustrings

Strings manipulation for Rust

v1.0.2 #rustrings #rings #format-text
markdown-gen

generating Markdown files

v1.2.1 550 #markdown-generator #markdown #generator #paragraph #heading #bold-italic #quote
meaningsearch

package that helps you find meaningful lines of any given input. Especially useful in CTFs.

v0.1.4 app #ctf #search #meaningsearch #tool
qpprint

console printing/formatting

v0.2.1 160 #terminal #console #terminal-console #format
mdbook-typstpdf

An mdBook backend that generates PDF output using Typst

v0.1.1 280 bin+lib #mdbook #typst #pdf #markdown #documentation
findtext_sheet

Search text in SpreadSheet

v0.1.2 130 bin+lib #xlsx #excel #search #text-search #text #cli
unic-ucd-normal

UNIC — Unicode Character Database — Normalization Properties

v0.9.0 9.1K #unicode-normalization #internationalization #unic #unicode-text #locale-data #unicode-algorithms #text #unicode-characters #text-processing #compose
minigrep_xiaoai

A simple command-line tool for searching for strings in files.

v0.1.1 bin+lib #mini-grep #例子 #minigrep-xiaoai #索字符串 #用于在文件中 #个简单的 #package-manager #cargo #的文本 #用于搜索文件
quilltex

open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa

v0.1.0 #quilltex #standard-package
tagsearch

Filter plaintext files based on @keyword tags

v0.37.0 bin+lib #tags #filter #tagsearch #service #model
fontship

A font development toolkit and collaborative work flow

v0.10.0 bin+lib #font #ufo #fontship #glyph #flow #glyphsapp #setup
porter-stemmer

Flexible and Unicode-friendly Porter stemmer implementation

v0.1.2 #stemmer #stem #normalization #porter #text
stringslice

A collection of methods to slice strings based on character indices rather than bytes

v0.2.0 3.2K no-std #slice #unicode #substring #utf-8 #string
kanyey

cli tool for generating quotes in your terminal from Kanye West

v0.1.3 app #kanyey #why
yinzhe9

喵喵隐者9

v0.1.0 #yinzhe9 #喵喵隐者9
trigram

Trigram-based string similarity for fuzzy matching

v0.4.4 9.0K #fuzzy-matching #string-matching #trigram #string
tnipv-lint

lints for tnipv, the Telcoin Network Improvement Proposal validator

v0.1.0 #telcoin #validation #tnipv #tnip #tnips
outerspace

Methods for prefixing and suffixing the non-whitespace characters in a string

v0.2.1 #outerspace #outerspace-rs #hello
mepple

English dictionary as a library

v0.2.0 #mepple
cur

that will hunt for your regular expression

v0.5.0 no-std #regex #expression #hunt #catch
findtext_doc

Search text in Document

v0.1.2 130 bin+lib #word-search #text-search #documentation #search #docx #word #text #cli
twitter-text

in Rust

v0.2.0 340 #twitter-text #text #twitter #objective-c #testing #javascript #java #ruby #conformance #build
remake

writing maintainable regex and managing symbol soup

v0.1.0 #remake #numbers #run-time
aprilasr

High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys

v0.2.0 160 #nlp #audio #neural-network #wrapper
base16-rs

in Rust offers capabilities for encoding and decoding data in Base16 format. By utilizing the hex library, you can transform data into its hexadecimal representation and also decode…

v0.1.1 bin+lib #base-16 #base16-rs
minigrepns

A mini version of the famous grep application that searches text in files

v0.1.0 bin+lib #minigrepns
basic-text-internals

Basic Text string literal implementation details

v0.19.2 950 #basic-text #plain-text #basic-text-internals #detail
normalize-hebrew-rs

package that normalizes special symbols within Hebrew string used in the Qumran-Digital project

v0.1.0 #hebrew #normalize-hebrew-rs #normalize
mdlc

Markdown Link Checker. Find broken web and local links.

v0.8.1 bin+lib #link #mdlc #checker
rustsay

CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal

v0.2.0 110 app #terminal #cowsay #ascii #cli
dokkoo

Mokk (Macro Output Key Kit) implementation written in Rust

v0.5.0 nightly bin+lib #mokk #liquid #dokkoo #yaml #markdown
carnation

some string operators

v0.1.1 #carnation #trim-whitespace #upper-case
fmty

Composable core::fmt utilities

v0.1.1 no-std #display #text #utilities #string-format
whitespace-conf

Key-value configuration file delimited with whitespaces

v1.0.0 #whitespace-conf #white-space #conf #note #fs
char_reader

Safely read wild streams as chars or lines

v0.1.1 1.3K #unicode #char #reader #line #stream
slicedisplay

Simplistic Display implementation for Vecs and slices

v0.2.2 600 #display #string #slice #text
clis

a simple search/fuzzy finder

v0.1.1 app #search #finder #cli #term
japhonex

Japanese phone number checker for Rust

v0.1.1 #japhonex #regex #optional #hyphen #phone-number #japanese
rustfmt_emitter

Rustfmt emitter library

v1.0.0 #rustfmt #emitter #fmt #formatted #code-formatter
help_crafter

help message generator without hassle

v0.3.1 #help-message #help #parameters #crafter #hussle #command
ftd

ftd: FifthTry Document Format

v0.2.0 bin+lib #markdown #ftd #prose #json
korean

hangul manipulation

v0.3.1 #korean #korean-rs #unicode-block
clafrica

This application allows you to type most of the characters in the African alphabet in any text field

v0.4.1 bin+lib #input-methods #typing #african #dictionary #interface #afrim #ime
spigot

parser for valve's keyvalue file format (gameinfo.txt, vmt, etc.)

v0.1.2 #spigot #value
halfcaps

tRaNslAtE aNy TeXt To ThIs

v0.2.0 app #upper-case #half #upper #text #case
monkey-printer

with an infinite number of monkeys you could write Shakespeare

v0.1.4 app #shakespeare #monkey-printer #monkey
stam-python

STAM is a library for dealing with standoff annotations on text, this is the python binding

v0.10.2 650 #nlp #annotations #linguistics #standoff #text-processing #annotation
markdown-it-autolink

A markdown-it plugin for parsing GFM autolinks

v0.2.0 #markdown-it #markdown #autolink #markdown-it-plugin #autolinks #add #md
pangu

Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)

v0.2.0 #spacing #pangu #objective-c #clojure #go #java #php #ruby #swift #elixir
mdbook-quiz-validate

Input validation for quizzes used in mdbook-quiz

v0.3.10 #validation #markdown #mdbook #toml #mdbook-quiz #config #learning #tracing
mystem

Wrapper around Yandex Mystem for Rust

v0.2.2 #mystem #lex #lexemes #site #requrements
delay_writer

Wraps a writer and delays its output after each newline

v0.2.1 #writer #delay #delay-writer
extract-strings

Extract ascii strings from files

v0.4.0 bin+lib #string #ascii #ascii-text
ufofmt

A fast, flexible UFO source file formatter based on the Norad library

v0.7.1 app #formatter #ufo #normalizer #font #graphics
encoding-index-singlebyte

Index tables for various single-byte character encodings

v1.20141219.5 209K #index #iso-8859-1 #table #encoding #encoder-trap #iso-8859-2 #stable #0-dev
fast_aug

Fast data augmentation for text

v0.1.0 bin+lib #nlp #augmentation #text-augmentation #base-augmenter #text-augment-parameters
adib-say-hello

say hello and say goodbye library

v0.3.0 #say #adib-say-hello #hello
szovegertesimutato-score

Calculate szovegertesimutato score for a given text and language

v0.1.0 #nlp #readability #szovegertesimutato #text-analysis #language
ucd-raw

Uninterpreted access to the unicode UCD

v0.5.0 no-std #ucd-raw #ucd #raw
comment-strip

Remove comments out of text files

v0.1.3 390 bin+lib #remove #delete #command-line #strip #comments
rust-crate-grrs-jesse

search files

v0.1.0 app #search #demo #rust-crate-grrs-jesse #cli
hello_lib

Demonstrate Generics Function

v0.1.6 #demo #function #generics #hello
rmemo

Tools for taking notes fast on the CLI

v0.3.2 bin+lib #markdown #rmemo #md #subcommand #config
butterkups-minigrep

Mini grep utility; very weak application, use grep instead

v0.1.1 bin+lib #mini-grep #grep #butterkups-minigrep #line
text_layout

Text layout algorithms

v0.3.0 no-std #text-layout #text #graphics #layout
mindmap

Search your notes at the speed of thought

v0.1.2 bin+lib #notes #thought #mindmap #server #model #watching #showcase
concatenator

Add two pieces of text together

v0.1.1 #together #concatenator
ellipse

Truncate and ellipse strings in a human-friendly way

v0.2.0 650 #ellipse #truncate #string #human
fifthtry-mdbook

fork of mdbook, only for ft-cli

v0.4.8 bin+lib #mdbook #rust-book #ft-cli #book #gitbook #markdown
segtok

Sentence segmentation and word tokenization tools

v0.1.5 120 #tokenize #split #segmenter #word #tokenizer
wcounter

Give a word and count its appearances

v0.2.4 app #zsh #fzf #wcounter #appearance #counter
branchout

Quick and easy ASCII tree of a directory

v0.1.2 app #directory #branchout #structures
mdbook-open-git-repo

mdbook preprocessor to add an open-on-git-repo link on every page

v0.0.4 100 bin+lib #mdbook #git #markdown #page
jcalendar

Japanese Calendar for Rust

v0.1.2 #calendar-week #calendar #console #week #koyomi #calendar-date
wasmer-wit-parser

wit-bindgen-gen-c

v0.1.1 450 #wasmer #wit-bindgen #wit-bindgen-gen-c
jp-location-relation

Get a list of adjacent municipalities

v0.1.1 bin+lib #relation #jp-location-relation #location #の一覧を #隣接する #html #隣接エリアの #データソース #隣接街名の
invisible_unicode

finding invisible unicode characters

v1.0.0 #unicode #invisible #invisible-unicode #sample #검사
leetcode-picker

Command line app for picking leetcode quiz

v0.1.8 380 bin+lib #leetcode #quiz #picker #content #file #code-snippets #level #title #name #id
allwords

Generate all the words over a given alphabet

v0.1.2 #word #alphabet #brute-force #iterator #fuzzy #brute-force-words
amongify

A very ඞ sus ඞ program

v0.1.0 app #sus #mode #amongify #among-us
libphonenumber-sys

rust ffi bindings to libphonenumber

v0.1.1 sys #phone-number #libphonenumber #libphonenumber-sys #valid #hand #phone-number-util-error
logisheets_parser

the parser of excel formula

v0.7.0 #logi-sheets #formula #parser
bigsi_rs

An in-memory implementation of a BIGSI-like data structure

v0.1.1 #structure #bigsi-rs #bigsi #index
html_to_epub

A command-line tool that converts .html files to .epub files

v0.1.4 140 bin+lib #epub #html #html-to-epub #title #author #cover
mdrss

generating RSS feeds from markdown files

v0.1.0 #rss #rss-feed #markdown
chinese

language nlp tools

v0.0.2 #chinese #nlp
ferrissay

cowsay

v0.1.1 app #cowsay #ferrissay #crabsay
mdbook-bibfile-referencing

An mdBook preprocessor to add bibfile referencing to each page

v0.3.0 app #mdbook #bibliography #referencing #citeproc
minigrepwebdot

Minigrep is a command-line utility tool that helps to search for occurrences of words in a file

v0.1.0 bin+lib #minigrepwebdot #into-iter #config
trunc8

Truncate text to a specific line length, based on a number of parameters

v0.2.0 #trunc8
chunk_norris

splitting large text into smaller batches for LLM input

v0.2.1 #batching #tokenize #llm #nlp #text
cli-colors

A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing

v1.0.0 #text-formatting #ansi-colors #cli-colors #ansi #ansi-color #text #formatting #formatting-text
string_macros

Small proc macro library for handling string literals

v1.0.1 macro #string-macros #literals #string
codetypo

Source Code Spelling Correction

v0.10.34 #spelling #codetypo #correction #spell-check #development #monorepo #pr #development-tools #automation #malloc
nxfetch

A minimal, fast and batteries included fetcher!

v0.3.0 bin+lib #fetcher #nxfetch #uptime #name #user #shell
ctf-brute

Brute-force utilities for Rust

v0.2.1 #ctf #ctf-brute #syntax #pattern #limitation
markdown-composer

composing markdown documents

v0.3.0 #markdown #markdown-composer #code-block
zindex-scanner

A CLI tool to scan and analyze z-index definitions in JavaScript/TypeScript files

v0.1.1 250 app #scanner #typescript #javascript #z-index #define
pangu2

Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)

v0.1.1 #spacing #pangu #pangu2 #python #java #objective-c #clojure #elixir #go #browser
flashtext2

The FlashText algorithm implemented in Rust

v0.2.0 #nlp #string-matching #flashtext2 #flashtext #insensitive
wtf8

WTF-8 encoding. https://simonsapin.github.io/wtf-8/

v0.1.0 12K #surrogate #unicode #wtf8 #io-wtf-8
esc

Escape characters in strings

v0.2.2 app #cli #text #string
fuzzy-string-distance

Fuzzy string distance comparisons

v1.0.0 #edit-distance #levenshtein #levenshtein-distance #string-comparison #fuzzy-search #compare #text-processing
tectonic_engine_bibtex

The bibtex program as a reusable crate

v0.2.2 490 #typesetting #bibtex #tex #xetex
pandoc-ac

pandoc filter for converting acronym codes to LaTeX

v0.3.0 bin+lib #acronym #pandoc #pandoc-filter #latex
xconv

A high-performance batch file encoding conversion tool

v0.1.0 app #xconv #tool #gbk #file
text_magic

string manipulation, including reversing strings and checking if strings are palindromes

v0.1.0 #text #text-magic #magic
forgiving-htmlescape

HTML entity encoding and decoding, with support for leaving malformed entities intact

v0.1.0 #forgiving-htmlescape #html-escape #intact
dialogue-rs

parsing dialogue scripts

v0.1.0 #dialog #dialogue-rs #comments #block #command #start #marker #end #thanks
kspconfigtool

KSP1 ConfigNode parser and block removal tool

v0.1.0 app #confignode #ksp #kerbal #tool
sayit

String replacements using regex

v0.3.0 360 bin+lib #regex #replace #ron #text #format
mdbook-twiki

twiki backend for mdbook

v0.1.1 app #twiki #mdbook-twiki #mdbook #filename
rustdoc-include

importing external Markdown files into *.rs file as doc comments

v0.1.2 app #documentation #rustdoc #import #comments #include
chinese-ner

A CRF based Chinese Named-entity Recognition Library written in Rust

v0.2.4 #ner #nlp #chinese #crf
yeslogic-ucd-parse

parsing data files in the Unicode character database

v0.1.13 190 #character-properties #parser #ucd #unicode #character-property #database
runiq-lib

An efficient way to filter duplicate lines from input, à la uniq

v1.2.2 bin+lib #unique #filtering #logging #runiq
slidedeck

Create an HTML slide deck from Markdown

v0.0.2 app #markdown #slidedeck
angr

analyse ngrams in text files

v0.1.0 app #analysis #angr #text #txt
ripgrep

line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.

v14.1.1 34K app #ripgrep #grep #search-pattern #pattern #regex
grader

Stream-based CLI for binary sorting text files via a given shell command

v0.2.0 app #sorting #stream #grader #cli #text #error #logging
hunspell-sys

Bindings to the hunspell C API

v0.3.1 5.0K sys #hunspell #hunspell-sys #target #api
extract-words

Extracts words from text without allocation

v0.2.0 #extract #allocation #extract-words #punctuation #entries
thousand_birds_deno

deno executable

v1.46.3 bin+lib #deno #executable #lock-files
esre

alt regex library

v0.1.1 #regex #esre #opt #format
strip_markdown

remove markdown syntax from markdown files

v0.2.0 1.3K #markdown #strip-markdown #strip
koelner-phonetik

koelner_phonetik, or Cologne phonetics, is a phonetic algorithm like Soundex, but specialized for German words

v0.1.0 #phonetik-algorithm #koelner-phonetik #phonetik
AsgoreCore

A small Rust library to manipulate Arabic text to fit in games or programs that do not support Arabic

v0.1.2 bin+lib #programes #asgore-core
unicode_types

A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)

v0.2.0 120 #unicode #type #plain-text #convenience #boilerplate
pulldown-html-ext-cli

CLI tool for extended HTML rendering of Markdown with pulldown-cmark

v0.5.0 230 app #html #file #pulldown-html-ext-cli #pulldown-cmark
book_lib

that provides an API for managing PDFs on your mac device in one place

v0.1.3 440 #book #lib #place #pdf
textspan

Text span utility

v0.5.2 #nlp #algorithm #python #text #align-spans #utility #remove-span-overlaps #remove-span-overlaps-idx
scan-lib

A directory searcher library for rust

v0.1.1 #directory #scan #directory-searcher
levenshtein_lite

No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function

v0.1.1 #levenshtein #levenshtein-distance #automata #lite
esperanto-text

Convert Esperanto text between UTF-8, x-system and h-system transliterations

v1.0.0 bin+lib #esperanto #text #transliteration #string
octor

rmd combines all readmes into one

v0.1.2 bin+lib #octor #verbose
is-digit

Detect decimal digit in char or first char of the str and String

v0.1.2 #digits #is-digit #is-dec-digit
symspell

Spelling correction & Fuzzy search

v0.4.3 2.0K #spelling-correction #symspell #dictionary #spell-check #verbosity #spellcheck #strategy
mdbook-hide

A preprocessor for mdbook that adds support for hidden chapters

v0.4.0 400 bin+lib #mdbook #hide #mdbook-hide #chapter
rust_baht_text

Convert number to Thai Baht text

v0.1.0 #text #thai #numbers #baht
bge

Rust interface for BGE Small English Embedding Library

v0.2.0 #text-embedding #transformer #bert #sentence-similarity
cologne_phonetics

generate phonetic cologne codes for utf8 strings

v0.1.0 no-std #phonetic #cologne-phonetics #string #cologne-code
daumdic

Daum Dictionary API wrapper

v0.8.0 #daumdic #search #dictionary #from-secs
cofe

tiny string similarity crate

v0.1.1 #cofe
termcolors

Format text and display colors in the terminal

v0.2.2 bin+lib #termcolors #resolution #newlines
bigstr

A command-line tool to make string BIG

v0.1.1 app #big #command-line-tool #bigstr #text-processing
cabocha

Safe Rust wrapper for cabocha, a Japanese language dependency structure analyzer library

v0.2.0 #japanese #structure-analyzer #cabocha #dependencies #testing
vape

ｆｕｌｌｗｉｄｔｈａｅｓｔｈｅｔｉｃｓ

v0.4.0 app #full-width #aesthetic #vaporwave #ａｅｓｔｈｅ #ｉｃｓ
case-conv

Faster case conversion crate

v0.1.6 nightly #conv #case-conv #case #result #linux
yzb64

Ytrizja base-64 specialization

v0.1.1 #specialization #yzb64
typeline

Efficient, Type-Safe Pipeline Processor

v0.1.0 bin+lib #shell #stream #pipeline #tl
indentsort

Structure-preserving sorting of arbitrary indented text

v0.1.1 #sorting #indent #text
libxdiff

Rust bindings for the libxdiff C library

v0.2.0 no-std #api-bindings #libxdiff #mm-file
mime_4

Strongly Typed Mimes

v0.4.0-a.0 140 #media-type #mime #media-extensions #media-range
unicode-jp

convert Japanese Half-width-kana[半角ｶﾅ] and Wide-alphanumeric[全角英数] into normal ones

v0.4.0 1.3K bin+lib #japanese #kana #zenkaku #hankaku #unicode
hydroperfox-sourcetext

Source text containing line locations

v1.0.0 #text #source #hydroperfox-sourcetext
meme_generator_utils

Meme generator utils

v0.0.7 120 #meme #meme-generator-utils #generator #meme-generator-rs #表情列表 #查看 #表情包生成器 #雕表情包 #用于制作各种
strmatch

Conditionally match strings in Rust using regex without much boilerplate

v0.1.1 #regex #boilerplate #strmatch #debugging
uwubot

discord bot for uwuifying text

v0.3.0 bin+lib #text #uwubot #bot #setup #portal #choice
bibutils-sys

Rust bindings for bibutils, a program for bibliography format interconversion

v0.1.1 sys #ffi #bibutils #bibutils-sys
lorgn_lang

a general purpose scripting language optimized for graphical programming

v0.1.0 #lorgn #language #lorgn-lang #notation
okh-tool

A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats

v0.5.2 bin+lib #okh #validation #command-line-tool #convert #open-know-how
minigrep_joshua

tutorials

v0.1.0 bin+lib #tutorial #mini-grep #minigrep-joshua
str_overlap

Methods for finding the overlap between two string slices

v0.4.3 700 no-std #string #overlap #intersection
ragzilla

providing tools for RAG (Retrieval-Augmented Generation) pipelines

v0.3.2 240 #rag #artificial-intelligence #parser #transcribing #embedding #pipeline
glob-match

An extremely fast glob matcher

v0.2.1 503K #glob-match #matcher #glob
grepox

Minimalist's grep written in Rust

v0.2.19 app #grep #regex #minimal #fast #search #cli
encoding-index-tradchinese

Index tables for traditional Chinese character encodings

v1.20141219.5 209K #index #tradchinese #standard #encoding #iso-8859-1 #encoder-trap #table #stable #0-dev #iso-8859-2
notoize

that tells you what Noto font stack you need

v2.14.0 bin+lib #font-stack #font #notoize
string_py

aims to make the String type as easy to use as the str type in python

v0.3.0 #string #string-py
guarding

Guardians for code, architecture, and layering. Guarding provides an architecture guard DSL based on ArchUnit.

v0.2.6 bin+lib #guarding #model #tree-sitter #architecture-tests #arch-unit #function-name #guardian #archunit
wantora

wantora tool

v0.1.2 bin+lib #wantora #wantora工具 #介绍 #未完成 #初始化antora脚 #个人开发的 #功能 #件变动并实时 #启动antora编译 #实时监控本地
rmbs

Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API

v1.2.0 app #artificial-intelligence #llm #summarize #condense #ai
timfmt

A small utility for formatting code as Tim likes it

v0.2.0 app #fmt #tim #timfmt
product-os-content

Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server

v0.0.4 #product-os #content #product-os-content
runanum

Nouns with correct endings after numbers

v0.1.1 #cases #runanum #chisel #яблок #чисел
axum-toml

Axum extractor for TOML

v0.2.0 #toml #axum #axum-toml
clipcat

A command-line tool for copying the contents of multiple files to the clipboard in one go

v0.1.5 340 app #clipcat #path-to-directory #digits
todo-to-issue

CLI tool that converts forgotten TODO comments into actionable GitHub issues

v0.1.1 app #issue #todo #github #comments #github-issue
ragtime

Easy Retrieval Augmented Generation

v0.2.0 #artificial-intelligence #rag #generation #phi3 #arc #7b-instruct #document #llama-backend #rag-qa-phi3-gte-qwen #model
beediff

LCS algorithm in various applications

v0.1.2 #beediff #applications #case-sensitive
hex-utilities

working with hexadecimal numbers

v0.1.5 #utilities #hex #hex-utilities #numbers #text
h4x_re

Hacky Regex's

v0.2.4 #regex #h4x-re #h4x
numbers_into_words

Command-line utility and library for writing a positive integer as English words

v0.1.2 bin+lib #word #numbers #numbers-into-words #thousands #twenty-three #words
print-positions

providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions

v0.6.1 15K #ansi-escapes #unicode #unicode-text #escaping #ansi-escaping #grapheme #text
mdtoepson

Filter to change from markdown format to Epson ESC codes for my Panasonic KX-2123

v0.1.2 app #mdtoepson #kx-2123
baselinker

BaseLinker.com API client

v0.2.2 #baselinker #field #client #e-commerce
lazy-string-replace

A lazy version of String::replace, so that it can be formatted or recursively replaced without intermediate allocations

v0.1.3 #replace #lazy-evaluation #string #allocation
aki-xtee

copy standard input to each file and to standard output

v0.1.25 1.5K bin+lib #filter #text #xz
ctrl-z

A composable reader to treat 0x1A as an end-of-file marker

v0.1.0 #ctrl-z #eof #substitution #sub
mdbook-chapter-zero

An mdBook preprocessor that allows a 0th (sub-)chapter

v0.1.0 bin+lib #chapter #zero #mdbook-chapter-zero #sub #chapter-zero #pre-processor
less

pager utility for displaying file contents or piped input, with dynamic scrolling and search functionality

v0.1.0 app #pager #viewer #text #terminal #cli
punkt

sentence tokenizer

v1.0.5 100 nightly #tokenize #sentence #punkt #training #token
onepage

static site generator

v0.1.8 bin+lib #static-site-generator #static-site #blog #markdown #site
contractions

expand contractions in English

v0.5.4 1.9K #nlp #pre-processor #contractions #english #language
ipsae-core

markdown parser for DIY lover

v0.1.1 #lover #ipsae-core #ipsae
ucd-util

A small utility library for working with the Unicode character database

v0.2.2 182K #unicode-character-properties #unicode-character-database #unicode-characters #character-property #character-database #unicode #character-properties
logseq

Handle Logseq Markdown files in Rust

v0.3.0 #logseq #markdown #knowledge-base
spider_transformations

Transformation utils to use for Spider Web Crawler

v2.36.66 9.3K #web-crawler #spider-transformations #html-text #chunking #crawler #content
fast2s

A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.

v0.3.1 3.6K #chinese #hanzi #localization #traditional #simplified #convert
mdbook-fix-cjk-spacing

mdbook preprocessor that fixes CJK line breaks

v0.1.1 bin+lib #mdbook #cjk #spacing #break #space
ranpha

Generate a QR code for your Wi-Fi network

v0.1.1 app #ranpha #schema #key #size
ut1_blocklist

UT1 blocklist URL/domain filters

v0.3.2 #blocklist #filter #adult-content #ut1
textcat

detect text categories. It can be used to detect the language of a given text

v0.3.2 bin+lib #ngrams #textcat #categorization #text
cursed_strings

Annoyed that Rust has two string types? Well it doesn't any more

v0.1.1 nightly #cursed-strings #cursed #char-indices #deref
tokengrams

Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models

v0.3.0 #tokengrams #index #query
weirdgrep

Weird grepping tool for huge pages of code

v1.0.5 app #regex-parser #parser #weirdgrep #path #apply #afterwards
spacemod

An easy-to-understand and powerful text search-and-replace tool

v0.1.1 app #tool #text-replace #refactoring #refactoring-tools
base16384

Encode binary file to printable utf16be, and vice versa

v0.1.0 no-std #base16384 #safety #slice-as-chunks
extstd

intended as an extension of the standard library

v0.5.1 nightly bin+lib #extstd
khmercut

A blazingly fast Khmer word segmentation tool written in Rust

v0.1.5 bin+lib #khmercut #run #ស-រុក #ព-រៃនប #នៅ
llmvm-codeassist

An LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.

v0.2.0 200 app #artificial-intelligence #llm #assistant #lsp #code
uwl

A management stream for bytes and characters

v0.6.0 67K #character #stream #lexer #token #characters
korean_regex

Regex extension for Hangeul analysis

v0.3.0 230 #regex #korean #korean-regex
bloodhound

Fuzzy file finder

v0.5.5 #find #fuzzy #file
mdbook-force-relative-links

An mdbook pre-processor to transform all local links to relative ones

v0.1.2 app #themes #rust-book #mdbook #markdown #book
markovish

Markov chain implementation for text generation

v0.2.2 #markov-chain #language #parser #text
lindera-ipadic-builder

A Japanese morphological dictionary builder for IPADIC

v0.32.3 16K #japanese #builder #dictionary #morphological #ipadic
mdbook-multicode

Allows you to give multilanguage code examples, toggled by a spinner

v0.1.0 bin+lib #mdbook #spinner #mdbook-multicode
gazetta-cli

A static site generator framework. Shared CLI code.

v0.3.0 #static-site #blog #gazetta
azusa

String index transformer for Rust utf8 to JavaScript utf16

v1.0.1 #javascript #string #string-index #utf8-to-utf16
ngram

Iterator adaptors for n-grams and k-skip-n-grams

v0.1.13 nightly #ngrams #skip #gram #skipgram #n
mdbook-indexing

mdbook preprocessor for index generation

v0.1.2 app #mdbook #indexing #mdbook-indexing #index #name #entries
awabi

A morphological analyzer using mecab dictionary

v0.3.0 bin+lib #mecab #token #dictionary #mecabrc
anthropic-text-editor

A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool

v0.2.0 240 app #anthropic #claude #text-editors #cli #tool-calls
shear

trimming excess contents from things

v0.1.0 #shear #limited #value
minigrep5

grep implementation in Rust

v0.2.1 bin+lib #minigrep5 #add-one #create
ruSTLa

A reStructuredText → LarST ⊂ LaTeX transpiler

v0.38.0 bin+lib #restructuredtext #latex #transpiler
semchunk-rs

A fast and lightweight Rust library for splitting text into semantically meaningful chunks

v0.1.1 #nlp #chunking #semantic #tokenize #text #token
smoltoken

A fast library for Byte Pair Encoding (BPE) tokenization

v0.2.0 130 #tokenize #bpe #artificial-intelligence #tokenizer
xee-xpath

XPath 3.1 library API

v0.1.4 500 #xpath #xml #xee #api
group-similar

Group similar values based on Jaro-Winkler distance

v0.2.2 bin+lib #distance #similarity #jaro #string
genkit

A common generator kit for static site generator

v0.3.1 #genkit #serialization #split-styles #zine
http

A set of types for representing HTTP requests and responses

v1.3.1 20.0M no-std #http #http-response #http-request #request #response
codegenrs

Moving code-gen out of build.rs

v3.0.2 2.5K #codegen #codegenrs #development
mdbook-snips

Markers for hidden lines in rust blocks within an mdbook

v0.1.3 bin+lib #mdbook #mdbook-snips #snips #snip
mdx

in Rust

v0.0.4 bin+lib #mdx #markdown #mdx-ast #anyway
morph-rs

Dictionary Morphologizer for Russian language

v0.2.0 bin+lib #language #morph #tags
encoding8

various 8-bit encodings

v0.3.2 5.4K #encoding #encoding8 #encoding-8
rahat3062_minigrep

A light-weight & minimal implementation of the grep cli app

v0.1.3 bin+lib #mini-grep #rahat3062-minigrep #rahat3062
lithe

A Slim template engine built using Pest

v0.0.3 #text #cli #lithe #pest
lingua-french-language-model

The French language model for Lingua, an accurate natural language detection library

v1.2.0 11K #nlp #language-recognition #lingua
md2gemtext

for converting Markdown into gemtext

v0.1.0 bin+lib #gemini #markdown #md2gemtext #gemtext
auk_markdown

Markdown support for Auk

v0.1.0 #auk #markdown #auk-markdown #syntax
regex-map

Associative container where the keys are regular expressions

v0.1.0 #regex #regex-map #map
beautify

your terminal

v0.2.0 #color #beautify #terminal #gradients
unindenter

unindent text

v0.1.0 app #text #indent #unindent
prune

struct

v0.1.6 nightly #prune #struct
node-emoji

Convert :emoji: to Unicode using GitHub’s and EmojiDB’s emoji names

v1.0.7 #emoji #unicode #markdown #github
codex

Human-friendly notation for Unicode symbols

v0.1.1 11K #symbols #unicode #codex
mail-internals-ng

[mail-api] _internal_ parts for the mail-api crates

v0.2.4 #mail-api #email #internal #mail-internal
spongebobizer

Command-line utility that outputs its stdin, converted to 'sPonGeBoB cAsE', and a library to support it

v0.4.1 bin+lib #spongebobizer
parser-web

Web API for extracting text from various file formats

v0.1.3 120 bin+lib #web-api #text-extraction #pdf #parser #document
darkdown

A darkdown (our own markup language) parser written in Rust

v0.1.5 bin+lib #darkdown #converter #link #below #created
rust_lemmatizer

A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words

v0.3.0 bin+lib #nlp #lemmatization #rust-lemmatizer #vec
glifnames

Mapping of characters to glyph names according to the Adobe Glyph List Specification

v0.2.0 #font-glyph #name #glyph #font #ufo #glif
mdbook-numthm

An mdbook preprocessor for automatically numbering theorems, lemmas, etc

v0.2.0 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #katex #numbering #label
text-template

Small template engine for use with plain text (e.g. creating text email), not intended for HTML.

v0.1.0 #plain-text #template #text
rreplace

designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.

v0.1.0 #replace #string #substring #multiple #substitution
twas

A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game

v1.0.0 bin+lib #substitution #random #csv #mad-lib #text #reference
kvu

The simplest command line tool to manage key-value pair lines

v0.1.3 bin+lib #key-value #dotenv #config #environment
newslookout

A web scraping platform built for news scanning, using LLMs for text processing, powered by Rust

v0.4.9 1.1K #data-transformation #model-deployment #data-science #analytics #machine-learning
rut

A small UTF-8 parsing library for applications that need to parse individual chars

v0.4.2 #rut #byte #conformance
wcc

my own version of wc for personal use

v1.0.11 app #count #wcc #counter #ls #label #hello
redpatterns

a list of patterns for scanners 📟

v0.2.0 #regex #pomsky #secret
dumbfuzz

dumb library for fuzzy search

v1.0.0 #search #dumbfuzz
notedown_ast

Notedown Abstract Syntax Tree

v0.16.3 nightly #ast #notedown #text #tree #utilities
mdtohtml

markdown to html renderer (with a couple of missing features)

v2.0.0 bin+lib #mdtohtml #information
blazingly_fast_rust_donut

Generates a rotating donut in the terminal using ASCII art

v1.0.0 app #donut #art #blazingly
cumaea

handle prompts for user input

v0.1.1 #cumaea #prompt-text #call
markdown-linkify

Markdown preprocessor for substituting link shorthands with valid links according to configurable regexes and custom substitution implementations

v0.3.1 bin+lib #markdown #linkify #markdown-linkify
rand-hira

CLI tool to generate random hiragana characters

v0.1.0 app #random #rand-hira #hira
sims

Simplistic string search

v0.1.1 #search #sims #arguments
igpay-atinlay

Translate text to Pig Latin

v0.1.0 #latin #igpay-atinlay #igpay #vowel
catdream

Sleeping cat dreams your text

v0.1.0 app #catdream #text
ultron-ssg

A syntax highlighting library ideal for usage in a static site generator

v0.3.0 #syntax-highlighting #monospace #ultron #highlighting
quill-delta-rs

Quill editor Delta format in Rust

v1.1.1 110 #delta #editing #format #delete #change
render_as_tree

visualizing tree data structures via text

v0.2.1 #text #render #tree #parent
rls-vfs

Virtual File System for the RLS

v0.8.0 430 #rls #vfs #rls-vfs
mdbook-image-size

An mdbook preprocessor that supports image size syntax

v0.2.1 bin+lib #image-size #syntax #mdbook #size #height #center #right
alpha-counter

Alphabetic counter

v0.2.1 #counter #alpha-counter #alpha #vec
zalgo-codec

Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.

v0.13.3 no-std bin+lib #obfuscation #zalgo #unicode
mdbook-tools

A collection of tools for mdbook

v0.1.1 app #mdbook #python #directory #unnumbered #md #directories
milkbox

A collection of daily utils

v0.0.14 600 app #milkbox
thesauromatic

command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.

v0.0.11 bin+lib #nlp #thesaurus #synonyms
local_strtools

Collection of string related utilities

v0.1.1 #utilities #arguments #local #panic
kpathsea

Rust interface to the kpathsea TeX file management library

v0.2.3 130 #kpathsea #tex #error
toresy

term rewriting system based on tokenization

v0.5.0 bin+lib #tokenize #toresy #rules #formatting #data #system #tokenization #wikipedia-rewriting
unicode_reader

Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters

v1.0.2 15K #code-point #unicode #unicode-text #grapheme #reader #text
caser

Change text between PascalCase, camelCase, and snake_case

v1.1.1 bin+lib #snake-case #caser
mdast2minimad

converting markdown AST to minimad texts

v0.1.0 #markdown #minimad #termimad #mdast #convert
iconv

bindings for Rust

v0.1.1 2.1K #iconv #encoding #converter
codes-iana-charset

This package contains an implementation of the IANA Character Set registry

v0.1.3 #codes-iana-charset #charset #iana
gpl-memo

Gemachain Program Library Memo

v3.0.1 #memo #gpl-memo #gpl
deno_tauri

deno executable

v1.46.1 #deno #executable #tauri
summary

Extract the sentences which best summarize a document

v0.1.0 #summary #summarizer #summarize #tf-idf #summarization
elden-ring-saver

ansi2

v0.1.0 #save #ring #elden-ring-saver #ansi2 #mode #com
uiua-doc-gen

Documentation generator for Uiua libraries

v0.15.1 160 app #documentation #documentation-generator #uiua #cli
encoding_rs_transcode

Transcode text within writers using encoding_rs

v0.8.3 #charset #unicode #transcode #write
leetcode

solutions in Rust

v0.1.4 #leetcode #leetcode-rs
lorem-ipsum

Quickly generate placeholder text

v0.1.2 app #lorem-ipsum #testing #word-list #generator
code-tour

Enhanced example-based learning, i.e. awesome examples user experience

v0.2.0 macro #example #learning #cli #experience #tour #derive
texoder

A text stream which can encode/decode text in several encoding formats

v0.0.5 #texoder
sarcasm

tExT creation and validation library

v0.1.0 app #encoding-decoding #sarcasm #text #fun #text-encoding #localization
chemstring

A parser that converts strings to their representation using chemical element notations

v0.1.0 #chemstring #chem-string #permutation
todo_r

command line utility that keeps track of your todo comments in code

v0.7.2 bin+lib #todo-r #user3 #style #found #syntax #blumberg #respected #directory #command-line
ruby-string

A string type that tracks Ruby glosses attached to parts of it

v0.1.0 #text #cjk #furigana #bopomofo
munemo-rs

Turn an integer into a more rememberable word, or vice-versa

v0.1.1 #munemo-rs #munemo #codec #integer
rustplexity

bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text

v0.1.0 #perplexity #rustplexity #plexity
mdbook-to-example

Turns an mdbook book into a Rust example

v0.1.0 #mdbook-to-example #mdbook #set-name #package-book
cvicenie_2

Cvicenie 2

v0.1.0 app #cvicenie-2 #cvicenie
text_alignment

Align your text in Rust in the CLI

v0.1.0 #text-alignment #alignment #text
termwrap

Wrap Unicode text with ANSI color codes

v0.1.4 280 #fold #wrap #unicode #string #color
tashkil

A lightweight library for removing Arabic diacritics

v0.1.0 #diacritics #arabic #language #dari #pashto
findtext_textfile

Search text in text file

v0.1.1 130 bin+lib #text-search #markdown #search #file-search #encoding #text-encoding #text
naming_clt

Extract and convert the naming format (case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching, replacing...) on relative files

v1.1.0 app #code #search-pattern #camel-case #naming #clt #pattern #command-line-tool #link
infisearch_common

Internal library for other InfiSearch packages

v0.10.1 #infisearch #static-site #search #indexer
hearthstone

simulator written in Rust

v0.1.0 bin+lib #hearthstone
mdbook-bash-tutorial

A mdbook preprocessor that allows embedding Bash scripts as tutorials

v0.1.6 bin+lib #mdbook-preprocessor #mdbook #tutorial #bash #markdown #mdbook-pre-processor
static_table

creates pretty tables at compile time

v0.6.0 120 macro #pretty-table #macro #static-table #time-table #print
agldt

Tools for handling data conforming to the standards of the Ancient Greek and Latin Dependency Treebank

v0.1.2 #treebank #agldt #pre-processor #serialization #stage #oddities #br
korrektor

work with Uzbek language text processing

v0.3.1 #uzbek #language #processing #text-processing
liwe

IWE core library

v0.0.31 430 #markdown #liwe #para #md #zettelkasten
saneput

Sane input reading library

v0.2.0 #saneput #input #ff #space-tab #15
wcrs

GNU wc in Rust

v0.2.0 app #wcrs #rocket-rocket #output
cindex

CSV indexing library

v0.6.0-beta.1 120 #indexing #csv #cindex #text-processing #query #indexer
ohos-ime-sys

Bindings to the inputmethod API of OpenHarmony

v0.1.4 6.1K #harmony-os #input-methods #open-harmony #ffi
ucf

A universal code formatter

v0.1.5 app #formatter #ucf #formatting #file
ftrace

trace files and paths

v0.2.1 app #strace #file #trace #fs #syscalls #path
minigrep_elijahkx

MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers

v0.1.2 bin+lib #mini-grep #linux #grep #cli
mdbook-docslab

mdBook preprocessor for interactive code with docslab

v0.1.0 app #documentation #mdbook #pre-processor #docslab #path-to-your-book
ansi-width

Calculate the width of a string when printed to the terminal

v0.1.0 24K #ansi-escapes #ansi-term #terminal #width
sqlify

CLI tool for formatting SQL queries

v0.1.1 app #sql #formatting #sqlify
spandex-hyphenation

Knuth-Liang hyphenation for a variety of languages

v0.7.4 #typesetting #hyphenation #language #text
xim-ctext

compound text en/decoder

v0.3.0 no-std bin+lib #xim #ctext #xim-ctext #en-decoder #mode
rmw-utf8

Short text compression algorithm for UTF-8 (optimized for Chinese, developed in the Rust programming language).

v0.0.6 #utf-8 #rmw-utf8 #8的短文本压缩 #基于rust编程语 #开发 #面向utf #为中文压缩优 #utf
uiuifree-normalize

uiuifree text normalize

v0.1.1 #normalize #uiuifree-normalize #uiuifree
perlin

A lazy, zero-allocation and data-agnostic Information Retrieval library

v0.1.0 #information-retrieval #search-engine #text #search
rvim

A text editor in rust

v0.0.8 app #highlighting #rvim #js #sh #java #go #py #json #cs #rb
repa

Peak Performance Pattern Seeker

v0.1.5 250 app #regex #hyperscan #grep #text-processing
litegrep

A basic tool for searching in files for lines of text, based on a query

v0.1.2 bin+lib #litegrep #mini-grep #itegrep
omgwtf8

Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit

v0.1.0 #unicode #surrogate #wtf8 #slice
my_mini_grep

A mini application that aims to replicate the behavior of the grep shell command

v0.1.0 bin+lib #case-sensitive #search #case-insensitive
trie-match

Fast match macro

v0.2.0 500 macro no-std #double-array #match #macro #text #no-alloc
lindera-cli

A morphological analysis command line interface

v0.41.0 600 app #morphological-analysis #cli #tokenize #dictionary #multilingual #format #morphological
simple_bencode

bencode encoder and decoder that uses neither rustc-serialize nor Serde. Instead, it serializes from / deserializes to a tree using a 4-branch enum.

v0.1.4 #bencode #simple-bencode #array #decode-error #string
sentencepiece

Binding for the sentencepiece tokenizer

v0.11.2 9.0K #sentence-piece #tokenize #tokenizer #sentence-piece-processor
moon-phases

Fast command-line application to show the moon phase

v0.3.3 app #emoji #moon-phases #phase #numeric #zodiac
allsorts-subset-browser

Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers

v0.16.0 900 #opentype #true-type #font-shaping #font #parser #shaping #opentype-font
vec-string-to-static-str

providing utilities for converting vectors of Strings into vectors of &'static str

v1.0.0 #static #string #utilities #vec-string #unsafe
matcher_c

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust

v0.5.7 120 #multi #search-pattern #text-search #string-search #text
encoding-utils

Utilities to help with encoding and decoding OS strings and more

v0.1.0 8.9K #encoding-utils #api
lingua-danish-language-model

The Danish language model for Lingua, an accurate natural language detection library

v1.2.0 8.8K #language
svgrep

A grep-like utility for separated-values files written in Rust

v2.1.2 bin+lib #svgrep #grep #file
mdbook-svgbob2

Alternative mdbook preprocessor for svgbob

v0.3.0 bin+lib #mdbook #svg #markdown #svgbob
toml_to_table

pretty print TOML as a table

v0.6.0 #pretty-table #toml #pretty-print #format #table
shapdf

Create Shapes into PDF

v0.1.0 bin+lib #pdf #shape #shapdf #pdf-generation
dr

Command-line data file processing in Rust

v0.7.0 app #dataframe #parquet #csv
write16

A UTF-16 analog of the Write trait

v1.0.0 10.8M no-std #utf-16 #unicode #traits
aoutils

A tiny utilities package to test publishing to crates.io

v0.1.1 #aoutils #io #ensure-newline #learning
ascii-hangman-webapp

customizable Hangman game with ASCII-art rewarding for children (webapp version)

v5.7.2 #ascii-art #web-apps #ascii-hangman #hangman-game #version #children
hex_d_hex

HexDHex is a Rust crate that encodes and decodes byte data to and from its hexadecimal representation. For instance, one may wish, on occasion that is, to translate a utf8 or ASCII string…

v1.0.1 120 #hex-d-hex #hex #byte
utils_rust

A Rust library for various utility functions

v0.1.1-alpha.1 #encode #decode #utils-rust #自用代码
wordninja

port of the Word Ninja English word splitting library

v0.1.0 bin+lib #wordninja #ninja #string #summary #py #usticeinsuredome #nitedstatesinord #ionfortheuniteds #atesofamerica #rtoformamoreperf
notedown-error

Notedown Error Handlers

v1.1.10 #notedown #notedown-error #handler #error #parser
parser-cli

Command-line interface for extracting text from various file formats

v0.1.3 110 bin+lib #pdf #text-extraction #docx #cli-parser #format
rss4mdbook

A CLI generator for mdBook that exports RSS.xml to a path of your choice

v0.2.42 app #rss #mdbook #generator #serialization #cli
humnum

Human numeric sorting program — does what sort -h is supposed to do!

v0.2.0 #stdout #stdio #stdin #coreutils #numeric-sorting #human-numeric-sort
merge-whitespace-utils

Procedural macros for merging whitespace in const contexts

v1.1.0 #white-space #graphql #proc-macro #context #merge-whitespace
jp_utils

Utils for working with Japanese text

v0.1.7 #japanese #parser #charset #language #traits
re_view_text_document

view that shows a single text box

v0.23.0-rc.3 28K #text-document #view #document
kindle2cbz

extracting images from kindle books in MOBI format to CBZ archives

v0.1.0 app #kindle2cbz #convert #format
indenter

A formatter wrapper that indents the text, designed for error display impls

v0.3.3 2.4M no-std #fmt-display #indentation #error-display #formatter #error #impl #display-fmt
xlsxwriter

Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more

v0.6.1 20K #xlsx #excel #spreadsheet #api-bindings #libxlsxwriter
moenster

mønster (n) - pattern. simple glob-style pattern matching for strings

v0.1.0 #moenster #string
fuzzy_mime

A Mime-Type parsing library for rust

v0.1.0 #fuzzy #mime #fuzzy-mime #borrowed-media-type #fail #subtypes
unicode_escape

decoding escape sequences in strings

v0.1.0 #unicode #escaping #sequence #decode #char
stringedits

Edit trait and associated iterators for small edits to strings

v0.2.0 #stringedits #edit #replace #string #spellcheck-toy
tinytoken

tokenizing text into words, numbers, symbols, and more, with customizable parsing options

v0.1.4 130 #tokenize #tinytoken #choice #true #yes #add-symbol #err #parser #tokenizer
cyrla

two-way conversion between latin and cyrillic script

v0.1.0 #cyrillic #latin #serbian #script #converter-builder
rep-cli

Replace text file in bulk

v0.1.0 app #productivity #cli #rep-cli #replace #bulk #file
txt_to_md

Command converting from a txt file to a markdown file

v0.1.1 app #markdown #md #txt #markdown-text #text-file #text
betacode

conversion

v1.2.0 #validation #ascii #betacode #converter #linguistics #ancient-greek #convert
sttx

belt for transforming speech-to-text data

v0.1.0 app #text-to-speech #utility #time-series #whisper-cpp #speech-recognition #stt
regex-intersect

Find out if two regexes have a non-empty intersection

v1.2.0 1.4K #regex #intersect #intersection #non-empty #match
github-slugger

A slugger for GitHub headings

v0.1.0 230 #markdown-it #markdown #slug #heading #foo
mdbook-latex

An mdbook backend for generating LaTeX and PDF documents

v0.1.24 app #latex #mdbook #mdbook-latex #md2tex
regex_quote_fixer

Rewrites grep regular expressions for use in the regex crate

v0.2.1 100 #regex #grep #regex-quote-fixer #regexpression #character-class
mdbook-footnote

mdbook preprocessor for footnotes

v0.1.1 200 app #mdbook #footnotes #mdbook-footnote
jpreprocess-njd

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 300 #text-to-speech #open-j-talk #library
markitdown

designed to facilitate the conversion of various document formats into markdown text

v0.1.9 bin+lib #atom #docx #pdf #markdown #image #excel #openai #deepseek #csv #html
mdbook-preprocessor-boilerplate

Boilerplate code for mdbook preprocessors

v0.1.2 5.4K #mdbook-preprocessor #mdbook #boilerplate #proprocessor
byte-num

converting numbers to bytes, and bytes to numbers in base 10!

v0.1.3 #byte #byte-num #num #10
worcher

full-text search for static websites

v0.1.2 #full-text-search #search #worcher #regex #text-search
api_key

Generate API keys in Rust; supports base32, base62, string, uuid4, uuid5

v0.1.0 #api-key #uuid5 #com
wdg-base64

The Base64 Data Encoding

v0.4.7 #base64 #wdg-base64 #encoding #data #b64-decode
rex-regextract

extracts key value pairs out of text

v0.1.1 app #extract #regex #kv #rex #text
p4d-mdproof

Markdown to PDF converter

v0.1.2 bin+lib #converter #executable #p4d-mdproof #leroycep-mdproof
sourcepawn_lsp

Language Server implementation for the SourcePawn programming language

v0.9.6 bin+lib #arguments #progress #sourcepawn #server
libgrep-rs

searching through text

v0.1.4 #regex #libgrep-rs #text #filename #txt #grep-rs
prettify-markdown

Format Markdown at the speed of Rust

v0.2.0 #markdown #prettify-markdown #prettify #format-markdown #print #file-content
tdrip

command-line tool to easily remove headers and metadata from text

v0.1.0 app #tdrip #txt #hack
fwuffgrep

Basic implementation of a grep command written in rust

v1.0.0 bin+lib #fwuffgrep #source #grep-like #study-project
olagem

Typing speed test in the terminal

v0.2.0 230 bin+lib #tui #olagem #default #box #config
pig_latin

applying Pig Latin to text

v0.1.0 bin+lib #text #pig-latin #latin
crop

A pretty fast text rope

v0.4.2 4.8K #rope #edit #text-editing #buffer #tree
swc_plugin_import

babel-plugin-import rewritten in Rust

v0.1.8 3.6K #import #plugin #swc-plugin
text_manipulation_rs

generating random placeholder text in different languages

v0.1.3 #language #text-manipulation #random-text-generate #text #dictionary
redact-engine

Protect confidentiality with dynamic redaction by replacing sensitive data from string or JSON format

v0.1.2 480 #sensitive-data #redact #redaction #format #text #path #key #use-case
table_to_html

interface to convert a tabled::Table into an HTML table (<table>)

v0.7.0 700 #pretty-table #html #tabled #format #print #table
quake_text

Utils for Quake strings and characters

v0.3.0 170 #quake-world #quake #ascii #character #string #text
whitespace_text_steganography

A steganography strategy that uses whitespace to hide text in other text

v0.2.1 #steganography #text #white-space #steg #hiding
uecho

The unicode of the echo command

v0.1.0 app #unicode #uecho #command #codes
adbook

Creates a book from AsciiDoc files

v0.1.14 bin+lib #book #asciidoc #ssg #asciidoctor
repub

convert markdown documents to epub

v0.4.1 app #ebook #markdown #repub #epub
veloci_levenshtein_automata

Creates Levenshtein Automata in an efficient manner

v0.1.0 #automata #levenshtein-automata #levenshtein #fuzzy
mdbook-svgdx

mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images

v0.6.0 450 bin+lib #svg #mdbook #diagram #svgdx
marko

Programmatically format text with Markdown syntax

v0.3.0 #marko #syntax #markdown #task #hash-map #false
teddy

A SIMD-accelerated multistring searcher

v0.2.0 #teddy #foo #baz #bar #bump
webreg

A CLI tool for testing regexes against web pages

v0.1.0 app #regex #page #webreg #url #pages #insensitive
read_chars

An iterator over characters read from some I/O source

v0.3.0 #io #read-chars #char
markdown_to_html_parser

parses Markdown syntax into HTML

v0.1.0 bin+lib #html-parser #render-markdown #markdown-parser
mdbook-typst-pdf

mdbook typst pdf backend

v0.6.0 app #mdbook #pdf #typst #back-end #book
admerge

Merge multiple sources into one, with advanced options

v0.1.3 #concatenation #merge #utilities #concatenate
strings

String utilities, including an unbalanced Rope

v0.1.1 4.1K #rope #string #iterator #substring #postitions
lindera-ko-dic

A Korean morphological dictionary for ko-dic

v0.41.0 24K #dictionary #ko-dic #korean #morphological
markdown-table

Creating markdown tables with Rust!

v0.2.0 600 #table #markdown-tables #markdown #utilities
genex

Text-expansion library

v0.6.4 #text-templates #text #genex #modifier #grammar #weight #rules #hash-set
mdxbook

Fork of mdBook, with more customizations and flexibility for programmers

v0.4.25 bin+lib #mdbook #markdown #rust-book #gitbook #book
carlo_grep

A fun game where you guess what number the computer has chosen

v0.1.1 bin+lib #carlo-grep #grep #carlo
markdown-it-latex

Allows for the insertion of math in Markdown documents using LaTeX

v0.1.0 #latex #markdown-it-latex #markdown #syntax
ucfirst

Uppercase the first letter of a string

v0.3.0 230 #upper-case #casing #capital #string
irssi-sys

Automatically generated bindings to irssi

v0.1.0 #irssi #irssi-sys #translation
md-inc

Include files in Markdown docs

v0.3.1 bin+lib #documentation #block #command-line #command #text #docs #language #width #decorator #line
code-to-pdf

Generates a syntax-highlighted PDF of your source code

v0.1.8 750 bin+lib #pdf #font #path #margin #define #ignore #image #overflowing #error-tolerant
mdbook-nix-eval

mdbook preprocessor for evaluating nix expressions

v1.0.1 bin+lib #mdbook #nix #nixos #expression
kudubot-bindings

Rust Bindings for the kudubot framework

v0.18.2 #chat #python #kudubot
fuzzysearchrs

Fuzzy search for finding strings in string with levenshtein distance

v0.1.0 #levenshtein #fuzzy-search #levenshtein-distance #search #fuzzysearch
uiuifree-text-data

csv and excel convert

v0.1.10 #convert #uiuifree-text-data #text-data
sesdiff

Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).

v0.3.1 bin+lib #nlp #diff #lemmatization #linguistics #text-processing
text-diff

text diffing and assertion library

v0.4.0 27K bin+lib #diff #difference #assert #change
kasedenv

Read environment variables by lower, upper case or case-insensitive keys

v0.1.0 #lower-case #upper-case #case-insensitive #key #env #environment
string_morph

string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.

v0.1.0 5.0K #snake-case #camel-case #inflect #string
nutrimatic

Tools for reading Nutrimatic (https://nutrimatic.org) index files

v0.1.1 #language #trie #node
gen3-charset

Pokemon Generation 3 Character Set Support (GBA)

v0.1.0 #gba #charset #gen3-charset #intl #jpn #set #fr
render_readme

Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's

v0.13.0 410 #github #markdown #html #readme #convert #language
termbook-cli

termbook is a command-line tool to build mdbooks while executing bash codeblocks and collecting their output to become part of the mdbook

v1.4.6 app #markdown #common-mark #mdbook #terminal
wn-parser

parser for WordNet database files

v0.1.0 110 #parser #wn-parser #key
emojicons-2021

Parse :emoji: notation to unicode representation

v2.0.1 #emoji #emojicons-2021 #emojicons #cat
kth-lines

Command line tool for filtering stdin lines that just work

v0.1.0 app #kth #kth-lines #line #nth #bash
caribon

A repetition detector program and library

v0.8.1 bin+lib #repeat #string-matching #caribon #word #language #statistics
regex-automata

Automata construction and matching using regular expressions

v0.4.9 28.3M no-std #nfa-automata #dfa-automata #regex #dfa
wit-bindgen-gen-markdown

Markdown generator for WIT and the component model, typically used through the wit-bindgen-cli crate

v0.3.0 #wasi #wit-bindgen #wit-bindgen-cli
clippy_lints

A bunch of helpful lints to avoid common pitfalls in Rust

v0.0.212 1.8K nightly #clippy #lint #plugin
cringify

Annoy your friends with the cringified text

v0.2.0 bin+lib #text #cringify #output #hippie #internet
com-croftsoft-lib-string

CroftSoft String Library

v0.1.0 #croft-soft #com-croftsoft-lib-string #lib #metadata
pinot

Fast, high-fidelity OpenType parser

v0.1.5 1.7K #opentype-font #opentype #parser #font #graphics
terraphim-markdown-parser

Terraphim Markdown Parser

v0.1.0 bin+lib #artificial-intelligence #ai-agent #terraphim #personal-assistant #privacy
romulus

a stream editor like sed

v0.3.0 bin+lib #sed #awk #grep #env-var #text #cli
chars_counter

The trait that implements character counting for the &str type

v0.1.1 #counter #chars-counter #char #start
minigrep_santunioni

A lightweight version of grep

v0.1.0 bin+lib #grep #mini-grep #minigrep-santunioni
ogrep

searching in indentation-structured texts

v0.4.0 app #indentation #grep #outline #text-search #search
aki-mcycle

mark up text with cycling color

v0.1.29 1.1K bin+lib #filter #text #ansi #color
chanoma

Character normalization library.

v0.1.2 bin+lib #nlp #japanese #chanoma #文字列正規化 #理用の #language #を指定する #文字から #ファイルの
uniwhat

Display the unicode characters text

v0.2.0 app #unicode-text #unicode #name #space #mark #sparkles #text #diaeresis
split_ext

Extension traits for splitting

v0.1.1 #split #ext #split-ext #splitting
doc-chunks

Clusters of doc comments and dev comments as coherent view

v0.2.1 1.6K #documentation #chunks #cluster
afrim-memory

handle of sequential codes easier for an input method

v0.4.2 #data-structures #input-methods #memory-data-structure #memory #afrim #ime #node #rc
skribo

low-level text layout

v0.1.0 #text-formatting #text-layout #graphics #layout
perspicuity_formula

Calculate Flesch Reading Ease for a given text and language

v0.1.0 #nlp #readability #formula #flesh #text-analysis
yeslogic-fontconfig-sys

Raw bindings to Fontconfig without a vendored C library

v6.0.0 156K sys #fontconfig #bindings #font
neardup

near-duplicate matching

v0.1.0 bin+lib #hash #ngrams #algorithm #matching #dataset #10
framework

detector for different frameworks in one project

v0.2.4 #framework #project #detector #detect #framework-detector #projects #path
transcript

A transcriber for European scripts

v0.1.12 bin+lib #transcript #futhark #unimplemented #rules
haoxue-dict

Chinese dictionary and word segmenter

v0.1.7 #dictionary #haoxue-dict #segmenter
mle

The markup link extractor (mle) extracts links from markup files (Markdown and HTML)

v0.25.2 bin+lib #link #markup #html #render-markdown #link-extractor #markdown-link #documentation #markdown
morse-nostd

A nostd version of the morse crate

v0.1.2 #morse #morse-nostd #encode #io
kanjidic_types

A collection of types encompassing the variety of data about kanji available from Kanjidic

v0.1.4 #kanji #japanese #kanjidic
aaa

CLI tool for working with 3a files

v1.1.1 app #aaa #file #color #parameters #body #comments #header #preview #value #frame
md-dir-builder

Webserver for serving all markdown files in a directory

v0.3.1 app #builder #directory #highlighting #run
asimov-core

ASIMOV Software Development Kit (SDK) for Rust

v24.0.0-dev.22 no-std #artificial-intelligence #asimov #sdk
macro_colors

colorful printing macros

v0.2.0 #color #printing #macro
zbuf

“Zero-copy” string and bytes buffers

v0.1.2 #buffer #byte #zbuf #language #html5ever
genpdf

User-friendly PDF generator written in pure Rust

v0.2.0 6.2K #pdf #text-layout #document #element #table #system #family #page #text #hyphenation
spel

A fast spell checker for everyone

v0.1.4 app #spell-check #spel #path
chinese2digits

The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.

v1.0.0 #nlp #digits #chinese #extract
noodler

A port of the python-ngram project that provides fuzzy search using N-gram

v0.1.0 #ngrams #fuzzy #shingles #search
prettythanks

frontend to dtolnay/prettyplease library

v0.1.0 app #ast #formatting #rust-fmt #pretty #command-line
markov-text

creating a small markov model for text generation

v0.1.1 #markov-chain #markov-text #model #random #markov
uniart

A CLI tool to convert images and gifs to terminal characters

v1.0.0 app #ascii-art #art #unicode #terminal #cli
trim_lines

An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.

v0.2.0 #trim #line #trim-lines
trans-case

Transform case

v0.1.0 #case #transform #text
toml-test

Verify Rust TOML parsers

v1.0.4 4.3K
anon-csv-cli

anonymise CSV files, providing various options to substitute real data with plausible fake data

v1.0.4 app #csv #anonymization #anon
spellabet

Convert characters into spelling alphabet code words

v0.2.0 #text-formatting #humanize #word #spelling-alphabet
syllable

counter for use with reading level calculations

v0.1.0 #syllable #word-count #readability #english #flesch-kincaid
smoldown

Native Rust library for parsing Markdown

v0.1.0 #html-parser #markdown #md
mdbook-checklist

An mdBook preprocessor for generating checklists and indexes

v0.1.1 app #mdbook #mdbook-preprocessor #mdbook-pre-processor #markdown #checklist
transition-table

transition table utilities for keyword parser

v0.0.3 no-std #hobby #utilities #transition
literumilo

A spell checker and morphological analyzer for Esperanto

v0.1.0 bin+lib #esperanto #spell-check #morpheme #analyzer
serbzip

A quasi-lossless Balkanoidal meta-lingual compressor

v0.10.0 app #codec #serbzip #text #dictionary #balkanoid #background #cheap
retest

Command-line regular expression tester

v0.2.3 950 app #regex #tester #retest #regex-validator #pattern
kg-diag

Error/diagnostic management. I/O routines for reading UTF-8 textual data with position tracking.

v0.4.0 nightly #kg-diag #diag #parser
kincaid

A word statistics library in Rust

v0.2.4 #word-count #syllable #readability #english #flesch-kincaid
wz

Count words, fast

v1.0.3 app #word-count #line-count #wc #word-counter #word #words #byte
djot

Djot parser written in pure Rust

v0.0.2 app #djot #markup
icu-data

International Components for Unicode (ICU) data in Rust structures

v0.1.0 #unicode #mapping #ucm
goodname

assist you with cool naming of your methods and software

v0.2.2 #acronym #goodname #trie #match
byte_string

Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format

v1.0.0 55K #debugging #ascii #ascii-text #ascii-string #string #debug #byte-str #format
custom-rust-stemmers

Experimental fork of: A rust implementation of some popular snowball stemming algorithms

v0.1.0 #stemmer #algorithm #custom-rust-stemmers #com
ryaspeller

lib for searching typos in text, files and websites

v0.1.4 bin+lib #spell-check #spelling #yandex #api-bindings #spellcheck #website
yozuk-core-skillset

Set of default Yozuk skills

v0.22.11 150 #yozuk #chat-bot #programmers #development-tools #skill #telegram-bot
readput

Fast and easy stdin input parsing for competitive programming in rust

v0.1.3 180 #io #input-parser #stdin #parser #utility #input #parsing
yagenerator

Application that uses the tinytemplate engine to generate text files. If you have a set of structured data and need to generate a bunch of arbitrary types of files from it, this tool can help you save some time.

v0.1.3 bin+lib #template-engine #template-generator #code #text #generator #engine
equt-md-ext

Extend event iterator

v0.2.7 #iterator #equt-md-ext #front-matter
mdbook-last-changed

mdbook preprocessor to add the last modification date per page

v0.1.4 bin+lib #page #last-changed #mdbook
unicode-range

UnicodeRange is a Rust library for parsing and stringifying Unicode ranges. It provides functionality to convert a string representation of Unicode ranges into a vector of code points and vice versa.

v0.1.0 #unicode #unicode-range #range #font #opentype-fonts
corollary

Cross-compiles Haskell into Rust

v0.3.0 bin+lib #haskell #corollary #convert #system #lazy-evaluation #declaration #hkt #recursion #cross-compiler #parsing-library
agentscript

A programming language for AI agents

v0.1.2 #agentscript #agent #invoke #syntax
twitter_text_parser

Parser for twitter-text in Rust

v0.2.0 350 #text-parser #twitter-text #twitter-text-parser #testing
arg_input

ARGF-style input handling for Rust

v2.0.1 #text #cli #input
stone-mason

simplify using the Amazon Bedrock Rust SDK aws-sdk-bedrockruntime

v0.1.0 #model #anthropic #stone-mason #bedrock #sdk #note #blob #client
csv-sanity

Sanitize and transform large CSVs with millions of records quickly and efficiently

v0.1.0 bin+lib #csv #csv-sanity #regex #email #capitalize #transformer #date #trim #syntax #choice
gdnative-doc

Documentation tool for gdnative

v0.0.6 #documentation #markdown #gd-native #gdscript #godot-rust
gestalt_ratio

Calculate the gestalt pattern matching ratio between two strings

v0.2.1 #string-matching #string-similarity #ratio #string #similarity #gestalt #matching
lunir

A universal intermediate representation oriented towards Lua

v0.2.0 #lunir #indentation #optimisations
xavier

lightweight and versatile XML parsing library designed to streamline the process of handling XML data with ease and efficiency

v0.1.3 2.3K #xml #xavier #name
minigrep_baolhq

Just getting started with Rust, enjoying it so far 😇

v0.1.0 bin+lib #mini-grep #case-insensitive #case-sensitive
indent_tokenizer

Generate tokens based on indentation

v0.4.0 #tokenize #indentation #level #token #tokenizer
rustinsight

The launcher app for the interactive book

v0.10.0 bin+lib #book #rustinsight #launcher #lab #learning-by-doing #com
jput

puts and putc on unicode-width align for Rust

v0.1.2 #unicode #alignment #unicode-width #put #console #width
l

my personal library

v1.2.7 #regex #algorithm #true
gen-epub-book

Generate an ePub book from a simple plaintext descriptor

v2.3.2 bin+lib #ebook #epub #book #generate
doccy

brace based markup language

v0.3.2 13K bin+lib #html #markup-language #text-html #text #element #language #break
fontconfig-rs

Safe, higher-level wrapper around the fontconfig library

v0.1.1 420 #fontconfig #wrapper #font #search
shoebill

A Wadler/Leijen style pretty-printer

v0.1.5 #pretty-print #pretty #wadler #leijen #printing
ewin-com

editor for Window (GUI) users. No need to remember commands

v0.0.2 #ewin #com #operation #settings #macro #file #term #edit #command #bind
rcut

replacement for GNU cut that supports UTF-8

v0.0.52 app #cut #rcut #character #box #5-15
moenarchbook

Creates a book from markdown files

v0.1.1 bin+lib #mdbook #book #markdown #moenarch
vidyut-chandas

A Sanskrit metrical classifier

v0.1.0 #sanskrit #classification #vidyut-chandas #vrtta
encoding-next-index-tradchinese

Index tables for traditional Chinese character encodings

v1.20180106.1 1.1K #encoding-next #encoding #tradchinese #table #iso-8859-1 #encoder-trap
hebrew_unicode_utils

Some functions for processing Hebrew unicode characters

v0.4.3 460 #unicode-characters #hebrew #unicode-text #utf-8
mdtranslation

prepare multi-lingual Markdown documents

v0.1.2 #translation #markdown #common-mark #localization #document
code-splitter

Split code into semantic chunks using tree-sitter

v0.1.5 260 #tokenize #split #nlp #artificial-intelligence #code
is_printable

Determine whether a given text-based value is printable

v0.1.1 220 #utf-8 #ascii #printable #char
markdown-includes

Include other documents, table of content, or rust-doc in Markdown using a simple template system

v0.1.1 #include #markdown #readme #content #system #section
masker

Mask patterns in data

v0.0.4 120 #text-search #text #utility #search #data
mdbook-iced

An mdBook preprocessor to turn iced code blocks into interactive examples

v0.2.0 470 bin+lib #mdbook #iced #book #interactive #gui
swot

community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university or college

v0.1.0 #education #validation #email #name #college
m2h

Convert Markdown to HTML with syntax highlighting

v0.1.0 app #render-markdown #syntax-highlighting #html
forming

lightweight architecture-as-code language (an architecture description language)

v0.1.0 app #forming #架构描述语言 #design #style #page #pattern #architecture-description-language #轻量级架构即 #码语言
diffy-fork-filenames

Fork of https://docs.rs/diffy that allows specifying filenames

v0.4.0 #patch #merge #diff #filenames #diffy
tradukisto

Kinda useful natural language translation library and utility

v0.1.1 app #translation #computer-vision #utility #copilot #audio #localization
squ

command-line utility for converting quotation marks in plaintext files to "smart quotes"

v0.1.0 app #quote #command-line #convert #smart #line
ssexp

A powerful parser for s-expressions

v0.3.1 1.2K #lisp #s-expr #s-exp #parser #macro-characters
aki-unbody

output the first or last n lines, like the Linux head and tail commands

v0.1.19 1.1K bin+lib #filter #text #head-tail #inverse
mdbook-tectonic

An mdbook backend for generating LaTeX and PDF documents

v0.3.0-beta.4 app #mdbook #latex #mdbook-tectonic
binyl

A bitwise UTF-8 string inspection tool

v1.0.0 app #utf-8 #binyl #unicode #command-line #tool
lingua-chinese-language-model

The Chinese language model for Lingua, an accurate natural language detection library

v1.2.0 8.7K #language
docfmt

A document formatter using Handlebars templates

v0.1.1 app #handlebars #formatting #documentation #template #handlebars-template
deck

A command line tool to generate HTML presentations from Markdown documents

v0.3.0 app #markdown #presentation #slide #document
lindera-filter

Character and token filters for Lindera

v0.32.3 12K #morphological-analysis #library #tokenize #filter #morphological #analysis
pdf_composer_definitions

PDF Composer definitions crate

v0.3.0 #markdown #pdf #yaml #composer #generate
pocky

A framework for building your own static site generator

v0.5.2 #web #site #markdown #static-site #static
rust_nickname_generater

that generates user/nick names based on the rust language

v1.0.6 #generator #user-name #nickname #language #discord
mdbook-open-gh-issue

mdbook preprocessor to add an open-on-github link on every page

v0.1.1 bin+lib #mdbook #issue #mdbook-open-gh-issue #page
jellybean

Syntax highlighting with tree-sitter. Sweet colors.

v0.0.2 #syntax-highlighting #tree-sitter #highlight
mdbook-unlink

An mdBook backend that validates local links

v0.1.0 app #mdbook-plugins #link #mdbook #mdbook-backend #unlink #true #chapter
pattern-3

Needle API (née Pattern API 3.0), generalization of std::str::pattern

v0.5.0 nightly no-std #pattern-3 #pattern #search #experimental
mapm-cli

The command-line implementation of mapm

v6.1.0 app #mapm #mapm-cli #filter
linetime

command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.

v1.0.2 120 app #timestamp #optimization #bottleneck #line
winparsingtools

collection of structs and utilities for parsing windows binary formats

v2.1.3 550 #struct #winparsingtools #pdf
emdb_lib

Orthographic token compression

v0.1.3 #compression #emdb-lib #lib #tokenize
tweak

when/then clauses to run

v0.1.1 #tweak #case #run #statement
character_frequency

counting character frequencies in a string concurrently

v0.2.0 #character #frequency #thread #concurrently #character-frequencies #characters
mdbook-morsels

Morsels plugin for Mdbook

v0.7.3 app #morsels #static-site #mdbook #search #wasm #processing
charmap

one-to-(none/one/many) character mapping

v0.2.2 no-std #iterator #nlp #no-std #text
texc-latex

Contains LaTeX templates for TeXCreate

v0.1.6 #tex-create #latex #te-x-create #modularity #khan
encoding_c

C API for encoding_rs

v0.9.8 17K sys #c-api #charset #ffi #unicode
minigreper

Small grep style cli from the book

v0.1.0 bin+lib #minigreper #book
ligotab

Format delimited data with lightweight markup

v0.2.0 bin+lib #csv #restructuredtext #markdown #org #confluence
xhtmlchardet

Character set detection for XML and HTML

v2.2.0 17K #detect #character-set #html #xml #character #detection
parattice

Recursive paraphrase lattice generator

v0.2.2 #nlp #paraphrase #generator #lattice #lattice-kmp
paperoni

A web article downloader

v0.6.1-alpha1 app #downloader #epub #export #article-extractor #pdf #readability #css
gecliht

A disparate collection of text manipulation and formatting algorithms

v0.2.0 #stemmer #soundex #nlp #format #text
eloran

Comics and Ebook web library written in rust, with reading, search, reading status, bookmarks

v0.3.1 360 app #epub #web-ui #comic #ebook #cbz
igneous-md-viewer

The viewer component of igneous-md

v0.2.0 180 bin+lib #css #igneous-md #offline #html #framework
rew

A text processing CLI tool that rewrites FS paths according to a pattern

v0.3.0 bin+lib #regex #path #rename #pattern #tool
argot

Parse documentation from codebases into Markdown for easy doc creation

v0.2.2 app #file #argot #class #markdown #language #action #name #variables
marker

finding issues in CommonMark documents

v0.9.0 app #markdown #common-mark #validation #markdown-link #document
ascii_converter

converting between different ascii representations

v0.3.0 180 #ascii #hex #binary #converter
ukiyoe

rendering images to the terminal

v0.0.4 #terminal #image #ukiyoe #art
emoji-printer

Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)

v0.4.3 200 #emoji #printing #unicode #shortcodes
pomsky-macro

Macro for converting pomsky expressions to regexes

v0.11.0 110 macro #regex #pomsky #macro #diagnostics
llm-tui

A Terminal User Interface (TUI) for interacting with Large Language Models (LLMs) using llm-cli

v0.1.0 app #artificial-intelligence #chat-bot #llm #tui
dequote

Remove nested quotes around text

v0.9.0 no-std #quote #trim #no-std #text
hunter_mygrep

A learning project to search query in files

v0.1.0 bin+lib #grep #hunter-mygrep #hunter
hyphenator

segmenting words into syllables

v0.1.0 #syllable #hyphenator #split
vroom

Vim macros from the shell

v0.1.0 app #shell #vroom #juice #filename #stdin #lemon #mango #orange #apple
rust_io_test

basic program for searching content in files

v0.1.0 bin+lib #testing #rust-io-test #eprintln
minigre_base

text file search tool

v0.1.0 bin+lib #tool #minigre-base #base
md-designer

A CLI tool for creating design docs in Markdown

v0.1.1 bin+lib #md-designer #markdown #list #file #rules #locally #md-design-doc #yaml #cd #git
futf

Handling fragments of UTF-8

v0.1.5 1.0M #utf-8 #futf #offset
minisearch

A mini search that can handle both case-sensitive and case-insensitive matching

v0.1.1 #mini-grep #minisearch #hacker-programs
moscato

Outline scaler for OpenType glyphs

v0.1.2 #true-type #glyph #opentype #loader #scaler #graphics
p101_enc

convert Olivetti P101 program to and from different encodings

v0.9.0 #pipeline #filter #enc #101 #encoding #c101
rustex

auto-generated LaTeX files in Rust

v0.1.0 #latex #report #generate #reports #component
rosie-sys

build or link to librosie to access the Rosie Pattern Language

v1.3.1 sys #regex #rosie #fsa #matching #pattern-matching
mmseg

Chinese word segmentation algorithm MMSEG in Rust

v0.3.0 #nlp #chinese #segmenation
rust-cheatsheet

a quick cheatsheet for rust

v0.1.0 bin+lib #rust-cheatsheet #cheat-sheet #art #rust-book #concepts #org-book
find_unicode

Find Unicode characters, the easy way!

v0.4.0 app #unicode-characters #find #easy #unicode #character
mul

Bengali stemmer

v0.1.0 #information-retrieval #stemming #nlp #bengali
hulk

An ultra simple no hassle static site generator

v0.1.9 app #static-site-generator #hulk #generator #static-website
unidok

A powerful, readable, easy-to-learn markup language

v0.2.0 app #common-mark #markdown #asciidoc #language
my_minigrep321

A command line tool to retrieve all lines from a file containing a given string

v0.1.2 bin+lib #case-insensitive #case-sensitive #my-minigrep321 #mini-grep
sparklet

small flashcards library

v0.1.1 #text #sparklet
deface

Lightweight markup to HTML converter

v0.1.2 app #markdown #markup #deface #converter #markup-language #rules #markdown-rendering #syntax #list #numbers
jposta

A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses

v0.1.0 app #tui #address #japan #postal #terminal
monogrep

custom version of grep

v0.1.0 bin+lib #grep #monogrep
emojito

Find all the Emoji in a string. Supports composed emoji.

v0.3.5 370 #emoji #string-search #search #unicode #string
iwes

IWE LSP server

v0.0.31 420 bin+lib #server #iwes #markdown #lsp #md
single_source

Generate code files from snippets in md tutorial files

v0.1.5 app #single-source #md #source #truth #tutorial #skip #generator
yitizi

Variant character lookup: get variant Chinese characters

v0.1.0 bin+lib #yitizi #sinograph #chinese #chinese-character #nlp
file-search

File indexing and search

v0.1.11 app #file-search #search-index #search #pdf #indexing
character-stream

Helper data structures for reading UTF-8 characters from a stream

v0.13.0 #iterator #unicode #reader #wrapper #stream
aprilasr-sys

Low-level FFI bindings for the april-asr C api (libaprilasr)

v0.1.3 sys #nlp #audio #neural-network #wrapper
b64

Base64 encoding/decoding support. Originally from rustc-serialize.

v0.4.0 2.8K #b64 #encoding #character-set
kanpyo

Japanese Morphological Analyzer

v0.1.1 bin+lib #japanese #nlp #analyzer #morphological #natural-language-processing
auto_correct

provide auto correct suggestions. Currently supporting EN-US.

v0.1.9 #auto-correct #suggestions #auto #word
wcount

CLI word counting tool

v0.1.0 app #word-counter #csv #word #counter #cli
scripter

A screenplay compiler

v0.4.1 app #latex #script #screenplay #compiler
tgo

Heterogeneous data type transition that is safe, lightweight and fast

v0.1.0 #schema #transform #low-code #type #tool
validations

arbitrary types

v0.1.1 #validation #io
mdbook-fishextract

An mdbook preprocessor that handles mermaid graphs offline; requires mmdc

v0.1.0 bin+lib #mdbook #mermaid #graph #fishextract #mmdc
character-set

High performance set.contains(char)

v0.4.0 #character-set #character #range #testing
crustword

Crusty Crosswords

v0.1.0 app #crosswords #crossword-generator #terminal #output #mode #language
epubparse

Parse epub and convert to text-only Book structure

v0.2.2 #ebook #epub #structure #chapter #wasm #ncx
dekor

styling and character repository in Rust

v0.2.2 110 #character #text-styling #utf-8 #terminal #console #utilities #utf-8-characters #development-tools-console
highlight-pulldown

Process pulldown-cmark events to apply syntax highlighting to code blocks

v0.2.2 #syntax-highlighting #markdown #block #highlighter
ngram-search

Ngram-based indexing of strings into a binary file

v0.1.1 bin+lib #ngrams #indexing #text-search #full-text
lindera-dictionary

A morphological analysis library

v0.41.0 32K #morphological-analysis #library #dictionary #tokenize #cc-cedict #analysis #morphological
find-simdoc

Time- and memory-efficient all pairs similarity searches in documents

v0.1.1 340 #similarity-search #all-pairs #lsh #similarity #search
encoded-words

Encoded Words for usage in MIME headers

v0.2.0 450 #header #word #encoded-words #encode #utf-8
old_icelandic_zoega

Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga

v1.1.0 #dictionary #old-icelandic #zoega #medieval-languages #old-norse #zoe-ga #dictionary-entry
lindera-py

Python binding for Lindera

v0.41.0 500 #morphological-analysis #python #library #morphological
mykebab

convert snake_case strings to kebab-case

v0.1.0 #snake-case #mykebab #snake-to-kebab
jpreprocess-jpcommon

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 290 #open-j-talk #text-to-speech #library
grace-cli

CLI tool for processing files and strings

v0.1.1 bin+lib #directory #strings-processing #files-manipulation #string #file #cli
tectonic_xetex_format

Tectonic/XeTeX engine data structures and their expression in TeX "format" files

v0.3.2 #typesetting #xetex #tex
string_manip_rust

Demo of managing projects

v0.1.2 #string #string-manip-rust #project #testing #projects
ucd-trie

A trie for storing Unicode codepoint sets and maps

v0.1.7 6.9M no-std #code-point #trie #unicode-characters #database #character #unicode
beemovie-cli

Bee Movie CLI Application

v0.1.3 app #cli #binary #text #generator
split_exact

splitting strings into arrays of slices

v1.1.0 no-std #split #slice #text #string #no-std #utility
remove-markdown-links

Turns [example](https://example.com) into example. That’s it

v1.0.0 260 bin+lib #markdown #markdown-link #link
asciimath-text-renderer

Render asciimath in terminal

v0.1.0 bin+lib #asciimath #terminal #renderer #literals #sqrt
struckdown

A structured markdown / commonmark library for Rust

v0.1.0 #cmark #common-mark #markdown #restructuredtext
gregex-logic

Logic for the gregex crate

v0.1.1 #regex-automata #nfa-automata #regex #logic #nfa #automata
count-md

configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents

v0.1.0 bin+lib #markdown #count-md #document #documents #text #title
noted2xero_cli

The commandline version of the noted to web converter

v1.11.10 app #csv #noted2xero-cli #noted2xero #rust-note2xero-cli
regex_parser

This project provides a parser for standard regular expressions based on a defined grammar

v0.1.1 bin+lib #regex-parser #parser #respectively
goose-eggs

in writing Goose load tests

v0.6.0 1.8K #load-testing #web #random #eggs
unicode_names

Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…

v0.1.7 no-std #unicode #unicode-names #unicode-text #name #text
lingua-portuguese-language-model

The Portuguese language model for Lingua, an accurate natural language detection library

v1.2.0 9.2K #language-recognition #lingua #language-detection #nlp
stylish-stringlike

API for string-like objects that have styles applied

v0.3.0 #style #stylish-stringlike #string #terminal #tags #truncation-style
nlprule-build

Build tools for a fast, low-resource Natural Language Processing and Error Correction library

v0.6.4 1.9K #nlp #spelling #grammar #text
faster-chars-count

counting length of chars faster than Chars::count()

v0.3.0 no-std #count #faster-chars-count #utf-8
minigrep_necimye

Functions required to find the lines in a file that contain the query. The query and file path should be entered on the command line, preceded by two dashes. Ex: cargo run -- body filename.txt

v0.1.1 bin+lib #linux #mini-grep #txt
text-to-json

Convert text to json in rust

v0.1.3 500 bin+lib #json-text #json #rust #text
ab-radix-trie

A compressed radix trie implementation supporting matching rules

v0.2.1 130 #radix-trie #trie #ab-radix-trie #rules #pattern
ron_to_table

pretty print RON as a table

v0.6.0 #pretty-table #ron #pretty-print #format #table
epub_metadata

Produce pdf and epub books from markdown source structures

v0.1.0 #epub #structures #metadata #markdown
ezemoji

Categorized emojis

v0.2.1 100 #emoji #ezemoji #crab #clone #rain #website #github #io
typeline_ext_csv

csv parsing and serialization for typeline

v0.1.0 #pipeline #shell #stream #tl
darts

A double array trie, A Forward Maximum Matching Searcher

v0.1.0 #double-array-trie #trie #text-search #string-search #search #text #string
pygmentize

wrapper for syntax highlighting

v0.2.0 100 #syntax-highlighting #highlighter-coloring #html #highlighter #syntax-coloring #syntax-highlighter #coloring
heckmv

A basic case-conversion renaming CLI tool

v1.0.1 app #heckmv #directory #case #directories #kebab-case #snake-case #camel-case #break #required #upper-camel-case
linkcheck

extracting and validating links

v0.4.1 4.2K #link-checker #linkcheck #link #check #documentation #links
bqrs

apply boolean query to text

v0.1.3 #text-search #text #query #search #boolean #match
simplearrayhash

v0.1.1 #string-search #hash-table #search #string #key
csvsc

Build processing chains for CSV files

v2.2.1 #csv #csvsc #transformation #documentation
bidi

Unicode Bidirectional Algorithm (UBA)

v0.1.1 #bidi #bidirectional #unicode #text-processing
merge_pdf

Merge PDF files in a directory

v0.1.0 app #merge-pdf #directory #pdf
shelldon

your new Rust-powered buddy with GPT features!

v0.1.0 app #artificial-intelligence #gpt #shell #prompt
yozuk-helper-english

English NLP utilities for Yozuk

v0.22.11 #yozuk #english #nlp
dtex

Better TeX

v0.1.2 app #tex #dtex #comments
rdg

Random data generator for the command line

v0.1.1 app #regex #random #line #string #word-list
br-pdf

PDF Invoice Processing

v0.0.2 #br #inc #pdf #processing
chars_data

Build-dependency for chars, the unicode character information CLI

v0.7.0 #build-dependencies #codegen #unicode #character #points
cw

Count Words, a fast wc clone

v0.7.0 bin+lib #word-count #wc #clone #word #count
indoc

Indented document literals

v2.0.6 7.1M macro no-std #multi-line #literals #string-literal #heredoc #nowdoc #macro #no-alloc #string
chisel-lexers

Chisel backend lexers/scanners

v1.1.0 #lexer #parser #chisel #input
utf8_reader

A UTF-8 reader that reads UTF-8 characters from objects that implement the Read trait

v0.7.0 180 #utf-8 #reader #traits #cursor #write #set-position
mdbook-shiftinclude

mdbook preprocessor for file inclusion with shift

v0.1.0 app #mdbook-preprocessor #mdbook #mdbook-pre-processor #shift
aki-stats

output the statistics of text, like the Linux wc command

v0.1.18 1.0K bin+lib #filter #statistics #text #en
mdbook-asciidoc

mdBook backend for AsciiDoc generation

v0.1.0 app #asciidoc #mdbook #mdbook-asciidoc
markov_strings

A simplistic Markov chain text generator

v0.1.5 #markov-chain #procedural-generation #generator #text #chain #procedural #markov
synterm

making beautiful REPLs and shells with fish-like, as-you-type syntax highlighting

v0.3.1 #highlighting #synterm #command-line-tool #lexer #string #start #exit
seven_seg

Seven-segment digital display for terminal

v0.1.2 #format #combine-text #text #sevseg-four
cozo-ce

A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms

v0.7.13-alpha.3 390 #cozo #token-stream #cozo-ce #algorithm #documentation #artificial-intelligence
minigrep_david20019

Command line utility that searches for a string in files

v0.1.1 bin+lib #mini-grep #minigrep-david20019 #david20019
minify-html-common

Common code and data for minify-html*

v0.0.2 19K #minify-html #minify #html #entities #tags #wasm #js #attributes #minification
combos

Print all permutations of a word list

v0.2.1 app #combos #shell #permutation #command-line-tool
pcre2

High level wrapper library for PCRE2

v0.2.9 31K #regex #pcre2 #jit #perl #pcre
tiny_pretty

Tiny implementation of Wadler-style pretty printer

v0.2.0 6.0K #pretty #tiny #tiny-pretty #text #documentation #nest #print-options #vec
glyphana

Quickly find, inspect & collect unicode glyphs

v0.1.4 nightly app #typography #glyph #unicode-characters #search #character #viewer #glyps
minigrep-cli

implement minimum grep cli program

v0.1.0 bin+lib #minigrep-cli #mini-grep #file-path #功能概述 #tool #个简单的 #用于从文件中 #以执行文本搜 #索指定的 #提供命令行接
rpdf

PDF command-line utils written in Rust

v0.1.3 app #annotations #pdf #command-line-utilities #cli #annotation
validate_npm_package_name

validate npm package name

v0.1.0 #npm-package #validation #name
difference

text diffing and assertion library

v2.0.0 523K bin+lib #diff #text #change #assert
newline-converter

Newline byte converter library

v0.3.0 363K #line-break #newlines #convert #crlf #unix2dos #conversion #newline
soundchange

implementing sound change algorithms in Rust

v0.0.8 nightly #linguistics #soundchange #logging #condition #str-to
intname

Full English name for any integer of any primitive integer type

v0.2.0 #text-formatting #integer #name
v_latexescape

The simd optimized LaTeX escaping code

v0.14.8 #escaping #simd #latex #latexescape
num2en

For converting integer and decimal numbers into English cardinal or ordinal number words

v1.0.0 #ordinal #english #word #cardinal #english-words #numbers
naromat

Convert text to narou novel format

v0.3.1 bin+lib #naromat #converter #text-file #format
basic-text

Basic Text strings and I/O streams

v0.19.2 1.2K #plain-text #text #basic-text #stream
soup

Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML

v0.5.1 2.0K #soup #html #document #tags #error #id #ul
wxf-converter

Transform yaml, json, pkl files to wolfram

v0.3.2 app #wolfram #converter #exchange
unic-common

UNIC — Common Utilities

v0.9.0 946K #unicode-version #unicode #utilities #unic #version
unicode_converter

CLI tool to convert data between various Unicode encodings

v0.1.2 bin+lib #unicode #utf-16 #utf-32 #utf-8 #converter #cesu8 #utf-1
dd

a clone of the unix coreutil dd

v0.4.0 app #dd #exit #file #synopsis #block #ascii #directory #ebcdic
unicode-canvas

creating text-based drawings

v0.1.1 #canvas #widgets #tui #text
sauron-markdown

parsing markdown into sauron node

v0.45.0 #node #sauron #md
igo-rs

Pure Rust port of Igo, a POS (part-of-speech) tagger for Japanese (Japanese morphological analysis)

v0.3.0 bin+lib #nlp #japanese #tagger #dictionary
mdbook-typst-math

An mdbook preprocessor to use typst to render math

v0.1.1 bin+lib #mdbook #typst #mdbook-preprocessor #typst-math
git-busy

A wrapper around "git commit" that generates the commit messages for you

v1.0.0 bin+lib #artificial-intelligence #commit #git #gpt-3 #git-commit #cli #gpt3
beemovie

Bee Movie crate

v1.0.1 #beemovie #generator #text #barry #benson
mdbook-translation

prepare multi-lingual mdBook books

v0.1.1 app #translation #mdbook #localization #markdown #pre-processor #book
scrambler

command line tool to scramble letters

v0.1.1 app #word #letter #scrambler #scramble #csn #words
text_distance

A collection of approximate string matching algorithms

v0.5.0 #string-matching #levenshtein #edit-distance #text #algorithm #string-matching-algorithm #string-distance
arbitrator

Format text based on a set of rules and regexes

v0.1.3 app #typesetting #troff #text
autoruby-cli

CLI to easily generate furigana for various document formats

v0.5.1 app #localization #format #text-processing #japanese #furigana #html #katakana #formats
mdbook-webinclude

Preprocessor for mdBook that includes content from URLs

v0.1.0 app #mdbook-preprocessor #mdbook #mdbook-pre-processor #url #webinclude
twitch2csv

stream the chats of Twitch channels as a CSV

v0.1.1 app #twitch2csv #csv #mistermv #message-text #a67d6dac364a #abe3 #b153d255 #f0dd09c589e4 #b6d07625 #ae08
grammateus

facilitate working with Ancient Greek words

v0.2.2 #ancient-greek #diacritics #word #greek #ancient
zuk

Yozuk command-line interface

v0.22.11 140 app #yozuk #interface #telegram-bot #nlp #development-tools #command-line-tool #programmers
unicode-box-drawing

Unicode box-drawing characters

v0.2.1 #character #hi-doc #unicode-box-drawing #characters
swappy

An anagram generator

v0.3.0 app #anagrams #language #swappy #generator #eyes #mugs #murals #wintergreen
tabwriter

Elastic tabstops

v1.4.1 110K #white-space #alignment #tabs #elastic #table
mojibake

Encode/Decode bytes as emoji base2048

v0.2.1 #emoji #base2048 #mojibake
rigrep

grep from Rust Book

v1.0.1 bin+lib #grep #unix-command #rigrep #rust #search #command-line-tool
ldd_md_parse

A simple Markdown-to-HTML tool

v0.1.0 app #html #parser #ldd-md-parse #md解析html #md解析为html工
ende

encoding/decoding unicode/utf-8/utf-16(ucs-2) code points

v0.1.0 bin+lib #decode #encode #encode-decode #utf-8 #utf-16 #unicode
aklat

create books from markdown files (like Gitbook)

v0.0.20 bin+lib #gitbook #rust-book #markdown #book
anagrambot

find anagrams of words

v1.0.1 #anagrams #word #anagrambot #words
lindera-unidic

A Japanese morphological dictionary for UniDic

v0.41.0 22K #japanese #dictionary #morphological #unidic
assert-text

the testing macro tools

v0.2.10 #assert #assert-text #text
hashlogs

Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout

v1.0.2 app #cryptography #hash #hashlogs #stdout #cryptographic-hashes
minigrep_philip

A simplified version of the well-known grep command

v0.1.0 bin+lib #mini-grep #minigrep-philip #philip
regex-cli

A command line tool for debugging, ad hoc benchmarking and generating regular expressions

v0.2.1 260 app #debugging #dfa #nfa #debug #cli
tuilet

A textual user interface for Toilet, the ANSI-art text generator

v0.3.1 bin+lib #figlet #toilet #generator #ansi #ascii
bookrafter

This repository contains code related to bookrafter rendering

v0.1.0 app #markdown-renderer #book #rendering #books #markdown #markdown-rendering #renderer
llmvm-core-lib

llmvm core application

v1.1.4 #artificial-intelligence #llm #api-bindings #thread #back-end #preset #workspace #template #ai
tectonic_io_base

Basic types for Tectonic's pluggable I/O backend system

v0.4.3 650 #typesetting #tex #tectonic #system #xetex
bos_books_codes

that handles 3-character Bible Books Codes

v0.1.2 #book #codes #bible #usfm #osis #books
psa

PSA (Project Structure Analysis) is an analyzer for project structure

v0.1.1 #psa
bocu1

BOCU-1 compressed unicode encoding

v0.1.0 #unicode #unicode-text #compression #text
hashtag-regex

regex matching hashtags according to the Unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers

v0.1.1 #emoji #hashtag #regex
utf

UTF-8

v0.1.6 140 #utf-8 #utf
wordshk_tools

A combination of parsers and other tools for words.hk (粵典)

v3.16.0-beta.9 180 #dictionary #nlp #parser #cantonese #wordshk #hk #粵典 #cantonese-dictionary
leven-distance

Compute operational differences between two sequences using the Levenshtein algorithm

v1.0.0 #levenshtein #levenshtein-distance #algorithm
webgrep

grep the web: a full-browser-spec search-focused ultra-simple way to read the web without having to leave the terminal

v0.4.3 550 app #web-search #terminal #pdf #recursion #grep
opencc

binding for Rust

v0.3.0 #opencc #chinese #opencc-rs #bindings
indexrs

inefficient multi-language search index

v0.5.0 #search-index #full-text-search #search #index #text-search
minigrep-extremq

Example crate from the rustbook

v0.1.0 bin+lib #mini-grep #txt #minigrep-extremq #day #frog #somebody #bog
mdbook-playscript

Preprocessor for mdBook, which styles stage play scripts

v0.5.0 bin+lib #markdown #play #pulldown-cmark #stage #script
markdown-table-formatter

Markdown table formatter fully compliant with Unicode 15.1.0

v0.3.0 950 #markdown-tables #table-formatter #formatter #table #markdown #east-asian-width
pdf-min

Very minimal crate for writing PDFs

v0.1.12 #pdf #pdf-min #html
ayda

Ask your Documents Anything. A tool for querying your documents with a large language model.

v1.1.1 bin+lib #search #pdf #openai #academic #text-processing-search #text-processing-indexing #development-tools-cli #workspace #cli
pdf-rename

This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.

v0.1.31 app #pdf #rename #pdf-rename #directory #model #script
tnil

Parsing, glossing, and generating utilities for New Ithkuil

v0.1.3 #ithkuil #parser #tnil
ru-html-extractor

A universal web page main content extractor based on line block density distribution

v0.1.0 #extractor #ru-html-extractor #html #archive #cx #com #html-text-extractor #p-p
kradical_static

Ready-to-use EDRDG radical decompositions

v0.2.0 #kanji #kanji-radical #japanese #radical #decomposition
flw

Process text via configurable tasks

v0.0.3 bin+lib #yaml-config #task #schema #text #text-processing #tasks #task-manager #replace
rulet

figlet implementation

v2.0.0 #ascii #figlet #text #character #smushing
mdbookshelf

Create epubs from a list of mdbook repositories

v0.1.2 bin+lib #ebook #epub #mdbook #rust-book #config #repository
ucd-parse

parsing data files in the Unicode character database

v0.1.13 33K #character-properties #character-database #character-property #unicode
lingua-dutch-language-model

The Dutch language model for Lingua, an accurate natural language detection library

v1.2.0 9.3K #language-recognition #lingua #language-model #language-detection #nlp
august

& program for converting HTML to plain text

v2.4.0 bin+lib #html-converter #converter #text-html #html #text
cutters

Rule based sentence segmentation library

v0.1.4 #nlp #cutters #text-processing
mdbook-all-the-markdowns

Render all markdown files in a given folder structure

v0.3.0 bin+lib #structure #mdbook #markdown #markdowns #md #config
md-include

include any file in markdown files

v0.1.0 app #markdown #include #file
umlauts

text transformation of German umlauts

v0.2.0-alpha.3 #umlauts #utf-8 #upper-case #äöü-äö-üß-ß #umlauts-owned
makogrep

mako's minigrep example CLI

v0.1.0 bin+lib #mini-grep #makogrep #cli #项目适合tdd #测试驱动开发
bcdown

Bilibili manga downloader, written in Rust, supporting EPUB, PDF, and ZIP formats

v0.2.2 app #bilibili #downloader #comic #epub #pdf #language #zip格式
bbx

A robust, performant BBCode pull parser

v0.3.1 no-std #parser #bbx #bbcode
readability-rs

Port of arc90's readability project to rust

v0.5.0 140 #converter #readability #html-converter #html #text-html #text
wfst4str

Python library based on rustfst for manipulating strings with wFSTs

v1.0.4 #python #fst #wfst #linguistics #nlp #string
bionic-ebooks

Takes an EPUB file and generates a copy with a bionic-like font applied

v0.1.1 bin+lib #ebook #epub #bionic #applied
genere

randomization of text respecting grammatical gender of sentences

v0.1.2 bin+lib #text #sentence #generator #text-generator
lines

Utilities for iterating readers efficiently line-by-line

v0.0.6 100 #text #streaming #line
tantivy-czech-stemmer

Czech stemmer as Tantivy tokenizer

v0.2.1 #stemmer #tantivy #tokenize #czech
mdlynx

Small, fast utility to find broken file links in Markdown documents

v0.1.0 app #markdown #broken-links #document #documents #parallel
e_book_sync_library

Synchronize e-books with your local e-library

v0.3.6 bin+lib #ebook #sync #utility #config #folder
word_filter

A Word Filter for filtering text

v0.8.1 160 no-std #string #word #filter #censor
color-convert

Converts between RGB, RGBA, HEX, HSL, HSLA, HSV, and CMYK; written in Rust

v0.1.0 #color-convert #convert #color
skyspell_core

skyspell core library

v5.0.0 #spell-check #skyspell #skyspell-core #line #struct #folder
ascii_tree

generates ascii trees

v0.1.1 11K #tree #ascii #ascii-tree
const_format_proc_macros

detail of the const_format crate

v0.2.34 3.3M macro no-std #proc-macro #concat #formatting #macro #no-std #assertions #arguments
is_utf8

functions to determine if a sequence of bytes is valid utf-8

v0.1.4 #utf-8 #avx #is-utf8 #simd
subscript-compiler

A modern LaTeX rendition

v0.21.0 bin+lib #typesetting #compiler #subscript #latex #publish #html #math #publishing
grep-reader

short text for crates.io

v0.1.3 #grep-reader #grep #io
simplecc

Chinese Convert library (partially) compatible with OpenCC's dictionaries

v0.2.2 #opencc #simplecc #dictionary #open-cc
ed_join

Implementation of the Ed-Join algorithm for string similarity join

v1.1.1 bin+lib #string-similarity #string #similarity #text-processing
czv

performing CSV-related operations for data engineering and analysis

v0.0.2 #data-analysis #csv #data-engineering #library #wasm #price #javascript
spellcheck_toy

a basic spellchecking library based on edit distance

v0.3.2 #spell-check #distance #toy
khat

A cat clone, nothing more nothing less

v0.1.4 bin+lib #character #reverse #khat #less #characters
utf8-command

UTF-8 encoded std::process::Command output

v1.0.1 #utf-8 #output #command-output #command #exit-status #arg
wordpieces

Split tokens into word pieces

v0.6.1 110 #tokenize #word-piece #wordpiece #piece
minigrep_kashi754

lightweight implementation of the popular grep command line tool. Built as my first project, it is not meant to be used in production.

v0.1.1 bin+lib #minigrep-kashi754 #mini-grep #case-insensitive
gret

command line tool to search for patterns and show matches in a tree structure

v0.1.2 app #ripgrep #grep #search-pattern #regex #pattern
utf8_slice

Lightweight UTF8 Slice Utilities

v1.0.0 900 #string #utf-8 #slice #unicode
csvre

replacing data in CSV columns with regular expressions

v0.1.0 app #regex #csv #command
ascii-rs

Process image into colored-ascii image

v0.1.2 #image #ascii #ascii-rs #image-engine #stdout
dictcc

Rust API for reading and querying the dict.cc offline translation database

v0.1.1 bin+lib #dictionary #database #dictcc #translation
dhoni

converting Bengali text into their phonetic counterpart

v0.1.0 #phonetic #avro #bengali #bangla
vl-convert-pdf

convert SVG to PDF with embedded text

v1.4.0 950 #svg-pdf #svg #pdf #text
demoji

Remove all emojis from a string

v0.0.3 #emoji #string #demoji
base100

Encode your data into emoji

v0.4.1 app #emoji #base100 #simd #base64 #input #memescale
nano_parser_gen

A parser generator inspired by yacc (types and functions)

v0.2.1 #gen #parser #grammar #block-content #clone #now
harfbuzz-traits

Rust Traits for the HarfBuzz text shaping engine

v0.6.0 32K #opentype #font-shaping #unicode #unicode-text #font #shaping
wikitext-parser

Partial parser for wikitext

v0.3.3 #wikitext #parser #wikitext-parser #representation
gqlog

👾 filter your json logs with graphql 👾

v1.0.3 bin+lib #filter #graphql #logging
lindera-compress

A morphological analysis library

v0.32.3 13K #morphological-analysis #compression #library #tokenize #multilingual #morphological #analysis
wz-utf16

UTF-16 counters for wz

v1.0.2 no-std #wz #wz-utf16 #line
bytescolor

A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes

v0.1.0 #ansi-term #ansi-terminal #byte-color #byte #terminal
spider_scraper

A css scraper using html5ever

v0.1.2 950 #web-scraping #selector #html #element #text #attributes #document #fragment
replace-all

CLI to quickly replace occurrences of a word in a file

v0.1.1 app #file #replace #replace-all
cattocol

Combine two texts into one text as columns

v0.3.1 #column #concat #text #combine-text #format
anystr

An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32

v0.1.1 no-std #ascii-text #wide-string #ascii #ascii-string #any
pdfutil

PDF document manipulation

v0.4.0 app #pdfutil #object #lopdf #page #document #operation #subcommand #pdf
flesh-reading-ease

Calculate Flesch Reading Ease for a given text and language

v0.1.0 #nlp #readability #flesh #text-analysis #language
economic_indicator_finder

A finder for extracting economic indicators from paragraphs

v0.1.1 #economics #text-processing #indicator #finder #economic-indicator #paragraph
indentation

Formatter

v0.1.6 #indentation #formatter
fum

fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules

v0.1.0 app #pattern #fuzzy-search #literals #search #trigram
iasthk

Harvard-Kyoto to IAST conversion

v1.0.1 #iasthk #convert
gesha-core

Core functionality for Gesha project

v0.0.12 550 #gesha #gesha-core #generator
ucd-generate

A program for generating packed representations of the Unicode character database that can be efficiently searched

v0.3.1 app #unicode-characters #fst #unicode #character #table #generate
sudachiclone

sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer

v0.2.1 bin+lib #japanese #analyzer #sudachi #morphological
lexmatch

lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.

v0.3.0 120 app #nlp #lexmatch #text-processing #lexical-search
sprinkles

Randomly colors input text and outputs it to the terminal

v1.0.0 bin+lib #text #pretty-print #format #cli #command-line-utilities
rcut-lib

rcut is a Rust replacement for GNU cut that supports UTF-8

v0.0.52 #rcut #lib #rcut-lib #character #cut #box
chinese_segmenter

Tokenize Chinese sentences using a dictionary-driven largest first matching approach

v1.0.1 #chinese #tokenize #hanzi #segment #localization #sentence
in_rainbows_printer

Prints some In Rainbows-style (the Radiohead album) text to your terminal

v0.1.0 app #printing #rainbows #terminal #running
stringsort

Pathological sorting of string characters

v2.0.1 #character #string #stringsort #characters
case_convert

Converts the first letter of a Rust String to uppercase

v0.1.0 #case-convert #string #case-conversion #convert
any2utf8

Convert any encoding to UTF-8

v0.1.1 app #utf-8 #any2utf8
mdbook-obsidian

mdBook preprocessor to render Obsidian specific syntax

v0.1.0 bin+lib #mdbook-preprocessor #mdbook #obsidian #mdbook-pre-processor #markdown
carlo-latex

A LaTeX emitter for the simple interpreted programming language Carlo

v1.0.0 #latex #carlo #carlo-latex
shallow

long text

v0.2.0 #shallow #character-shallow #mode #text #testing
jpreprocess-window

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 300 #text-to-speech #open-j-talk #library
blockcounter

Counts the blocks in a stream

v0.3.2 #gnuplot #string #file #text
markdown-heading-id

Filter for pulldown-cmark which converts headings with custom ID

v0.1.0 2.5K #markdown #pulldown-cmark #heading-id
whitespace

Encode arbitrary data as whitespace and vice versa

v2.0.0 #white-space #decode #encode #documentation #useless-things
fmtm_ytmimi_markdown_fmt

Fork of @ytmimi's Markdown formatter; powers FMTM

v0.0.3 100 #common-mark #formatter #markdown #markdown-formatter #list
minigrep_dungtl2003

very small project for rust that can find lines you want in a file

v0.1.1 bin+lib #mini-grep #minigrep-dungtl2003 #file
top-english-words

retrieve top words from the English language

v1.1.1 #english-words #word #english #popular #frequent
bibliofile

A TUI epub reader inspired by DOS-era programs

v0.1.0 app #ebook #epub #tui #termion #reader
jg

Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.

v0.1.6 bin+lib #grep #search-pattern #json #selector #pattern #search
hline

a grep-like tool that highlights lines in files

v0.2.1 bin+lib #expression #hline #recording #file #filename #niche #stdin
html_to_markdown

Convert HTML to Markdown

v0.1.0 3.1K #html-to-markdown #html #markdown #zed
unicount-lib

Alphabetic counter supporting unicode

v0.1.2 #unicode #unicount #unicount-lib #vec #ad
veryfi

Module for communicating with the Veryfi OCR API

v1.0.0 #api #veryfi #api-key #document
tokengeex

efficient tokenizer for code based on UnigramLM and TokenMonster

v1.1.0 900 bin+lib #tokenize #nlp #llm #codegeex #tokenizer
charjpoet

Charj Poet is an API for writing the .cj language

v0.1.0 #charjpoet #poet #properties #md
milligrep

Custom simplified implementation of grep

v1.0.3 bin+lib #grep #milligrep #command-line-tool #mini-grep #executable #ripgrep
txttyp

Formatted string typewriter

v0.1.2 #txttyp #text #command-line #typewriter #format #string #style #cargo
mupdf-sys

Rust FFI binding to MuPDF

v0.4.4 2.1K sys #pdf #mupdf #mupdf-sys #mupdf-wrapper #progress
mdbook-compress

Compress an mdBook project into a single PDF file

v0.2.1 app #mdbook #rust-book #pdf #book #compression
mdoc

Modern PDF creation through Markdown and LaTeX

v0.3.0 bin+lib #latex #mdoc #bibliography #document #markdown #documentation #djot #compiler
iconv-compat-win-sys

iconv bindings for Rust

v0.1.1 #iconv #api-bindings #os #text-processing #encoding #external-ffi-bindings #operating-system
allsorts_no_std

Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2

v0.5.2 150 no-std #opentype #true-type-font #no-std #font-shaping #font #parser #shaping #opentype-font
pra

Print Random ASCII

v0.0.2 130 bin+lib #ascii #pra
minigrep_ao

Learning Rust

v1.0.0 bin+lib #mini-grep #minigrep-ao #sensitive
spongedown

Converts markdown to html with svgbob support

v0.5.0-alpha.1 #svg #markdown #bob
zw

encoding and decoding text using zero-width characters

v0.2.0 bin+lib #character #zw #encode #characters
ngrams

Generate n-grams from sequences

v1.0.1 3.0K #ngrams #sequence #documentation #org-wiki-n-gram
textos

Texts, strings, formatting, unicode…

v0.0.3 no-std #unicode #unicode-text #textos #no-alloc #string #text
bbd-lib

Binary Braille Dump

v0.3.2 120 #dump #encoding-decoding #bbd-lib #character
goya

morphological analyzer for Rust and WebAssembly

v0.1.9 #morphological-analysis #dictionary #wasm
org-rust-parser

parser for org mode documents

v0.1.5 #document #parser #documents
wordbreaker

A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string

v0.3.0 no-std #concatenation #dictionary #text
braille_pics

producing text-art pictures using Braille characters

v0.1.1 #character #braille #bit #braille-pic #false #bounded #characters #16 #d12345678d12345678d12345678d12345678d12345678d12345678
convert_encoding

Convert encoding of text files in batch

v0.1.0 app #encoding #convert #convert-encoding
kanjidic_converter

A program to convert from the Kanjidic XML format to a JSON format

v0.1.2 app #converter #kanji #japanese
pest_ascii_tree

Helper crates converting the parsing result of any pest grammar into an ascii tree

v0.1.0 850 #pest #ascii #tree #expr
jellybean-pack-2

Sweet syntax highlighting with tree-sitter

v0.0.2 #syntax-highlighting #tree-sitter #highlight
stamd

Webservice for working with stand-off annotations on text (STAM)

v0.1.0 app #annotations #nlp #linguistics #standoff #text-processing #annotation
cang-jie

A Chinese tokenizer for tantivy

v0.18.0 #tantivy #tokenize #full-text-search #chinese #search #tokenizer
textocx

TeX code to Office MathML

v0.1.0 app #mathml #ms-office #latex #windows #download
indeed

Append lines to a file with no shell bullshit

v0.5.0 app #file #indeed #string #three #twofer #six
yozuk-sdk

Types used in the Yozuk ecosystem

v0.22.11 120 #yozuk #chat-bot #ecosystem #programmers
ankiding

Creating Anki-Flashcards within Markdown!

v0.1.0 app #markdown #ankiding #latex #anki #decks #notes #standalone
mdbook-numeq

An mdbook preprocessor for automatically numbering centered equations

v0.4.0 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #katex
wkhtmltopdf

High-level bindings to wkhtmltopdf

v0.4.0 700 #wkhtmltopdf #html #pdf #wkhtmltox #wkhtmltoimage
lindera-cc-cedict

A Chinese morphological dictionary for CC-CEDICT

v0.41.0 23K #cc-cedict #dictionary #morphological #chinese
latex-to-html

Latex to html converter

v0.1.2 bin+lib #converter #html #html-converter #equation #label
base_u256

base-u256 is to utf-8 as base-64 is to ascii

v0.1.1 #ascii #u256 #base #encode-decode
encoding_c_mem

C API for encoding_rs::mem

v0.2.6 17K sys #charset #c-api #unicode #ffi
mdtable-cli

that makes creating tables in markdown much easier!

v1.1.1 app #md #table #markdown-tables #markdown
varcon-core

Varcon-relevant data structures

v5.0.2 9.3K #spell-check #structures #varcon #code-quality #typo #monorepo #pr
fst-subseq-ascii-caseless

An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst

v0.1.1 #fst #search #ascii #subseq #caseless
tadm

A collection of algorithms and data structures written out while reading The Algorithm Design Manual

v0.1.1 #tadm #book #sorting #snippets #manual
quill_delta_pdf

Convert Quill Delta to PDF

v0.1.4 200 #delta #quill #pdf #quilljs #convert
tb_normalization

Normalization of UTF-8 strings, with diacritic removal (loc dau) for Vietnamese and some other languages

v1.0.0 bin+lib #normalization #utf-8 #locdau #vietnamese
typeline_ext_sqlite

sqlite integration for typeline

v0.1.0 #stream #pipeline #shell #tl
asciir

Print ASCII table/values

v0.1.0 bin+lib #table-values #asciir #value #character #file
const-utf16

Utf8 to utf16 conversion functions for use in const contexts

v0.2.1 #utf-8 #utf-16 #const #context
lindera-sqlite

Lindera tokenizer for SQLite FTS5 extension

v0.41.0 390 #morphological-analysis #sqlite #library
yhy-email-encoding

Low level email encoding RFCs implementations

v0.0.2 #email #yhy-email-encoding #utf-8
llmvm-outsource-lib

outsource backend for llmvm

v1.3.1 330 #artificial-intelligence #openai #hugging-face #llm #api-bindings
lines_lossy

extension to BufRead with a function lines_lossy that works like BufRead::lines but with lossy UTF-8 decoding

v0.1.0 #utf-8 #lossy #bufread #string
lithe-cli

A cli of lithe

v0.0.3 app #text #lithe #cli
unic-idna-mapping

UNIC — IDNA — IDNA Mapping Table

v0.9.0 700 #idna #character-property #unicode #unicode-text #text
pulldown-cmark-fork

A pull parser for CommonMark

v0.5.2 bin+lib #common-mark #markdown #pulldown-cmark #parser #block
latex

An ergonomic library for programmatically generating LaTeX documents and reports

v0.3.1 #latex #pdf-report #report #pdf #section #paragraph #generation #table #figure
aki-txpr-macro

An easier way to use libaki-*

v0.1.5 #fifo #macro #pipe #thread #filter
xgrepx

xgrep is a rust implementation of grep. This is a follow up from the rust book

v0.1.0 bin+lib #xgrepx #book #search #xgrep #txt
uwu_cli

uwuifying the terminal

v1.0.0 app #owo #uwu #cli #terminal #eof #world #txt #file #fantastic
asciicast

file format used by Asciinema

v0.2.2 1.4K #asciicast #asciinema #tty #ascii
unicode-character-database

Unicode character database tables (Unicode Standard Annex #44) generated using ucd-generate

v0.1.0 #unicode #ucd #tr44 #unicode-text #text
fountain-parser-rs

parse Fountain-formatted plain text files

v0.4.0 150 #fountain #parser #fountain-parser-rs
untex

Understand and manipulate TeX files with ease

v0.4.0-beta bin+lib #latex #formatter #parser #lexer #document #tex
wordnet

Read a wordnet dictionary in Rust

v0.1.2 #wordnet #nlp #dictionary
rusty_code_code_for_book

my book_rusty code

v1.1.2 app #book #rusty-code-code-for-book #for
snake_case_converter

convert strings to snake case

v0.1.0 #snake-case #snake-case-converter #converter
cjieba-sys

unsafe ffi to cppjieba

v0.1.1 sys #nlp #chinese #segmentation #cppjieba #rust-jieba
mdbook-chapter-number

An mdBook preprocessor that adds chapter numbers to each page header

v0.1.2 app #mdbook-preprocessor #mdbook #markdown #mdbook-pre-processor #header
rust-cedar

efficiently-updatable double-array trie in Rust (ported from cedar)

v0.1.0 #trie #cedar #text-search #string-search #string #search #darts #text
ced

Dead easy csv editor

v0.2.2 bin+lib #csv #editor #text-processing #cli
rustyword

An anagram finder

v0.1.0 app #word #letter #cli #command-line-utilites
lindera-ipadic-neologd

A Japanese morphological dictionary for IPADIC NEologd

v0.41.0 21K #japanese #dictionary #neologd #ipadic #morphological
math-text-transform

Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.

v0.1.1 bin+lib #typesetting #math #unicode #unicode-text #text
fbihtax

CLI tool to help manage tax payments in FBiH (Bosnia and Herzegovina Federation)

v0.3.2 bin+lib #federation #pdf #fbihtax #forms #testing #breakdown
char_trie

High-performance text segmentation based on a trie, with support for custom dictionaries

v0.1.0 #dictionary #trie-tree #trie #trietree #char-trietree
crypto-invert

Unicode Upside-Down Mapping

v1.0.1 #crypto-invert #encode #mapping #text #testing
volt_parse

The advanced, slightly different take on the parser combinator concept

v0.5.0 nightly #parser #volt-parse #volt #text #concepts
anagram

A collection of anagram utility functions

v0.4.0 #anagrams #word #function #occurences #cool
yeslogic-unicode-blocks

Functions to access and search Unicode blocks

v0.2.0 220 no-std #character #block #unicode #cjk
strip-tags

Strip HTML and PHP tags from strings

v0.1.0 no-std #html #strip #tags #php #sanitize
kytea-tokenizer

Wrapper of tokenization by KyTea

v0.10.0 #japanese #analyzer #morphological #kytea
sauron-md

parsing markdown into sauron node

v0.1.4 #sauron #md #web-apps #html #node #web #rendering #applications
simple-word-count

word count function that tries to produce the same result as the Microsoft Office Word application

v0.1.1 #word-count #word-counter #simple-word-count #word #counter #count
kanpyo-dict

Dictionary Library for Kanpyo

v0.1.1 bin+lib #dictionary #kanpyo #kanpyo-dict
minigrep_mxcln

command line tool to search for a string in a file

v0.1.0 bin+lib #mini-grep #minigrep-mxcln #mxcln
strizer

minimal and fast library for text tokenization

v0.1.0 #tokenize #strizer #string-tokenizer #stream-tokenizer
ttf_word_wrap

Wraps text based on character width

v0.5.0 #word-wrap #font #wrap #word #string
hoedown

bindings for the Hoedown markdown processor

v6.0.0 600 #markdown #hoedown #html #processor #render
tectonic_bridge_harfbuzz

Expose the Harfbuzz C/C++ APIs to Rust/Cargo

v0.2.9 600 sys #harfbuzz #typesetting #tectonic #tex
emojicons

Parse :emoji: notation to unicode representation

v1.0.1 #emoji #emojicons #cat
cautious-octo-funicular

Test: shipping an mdbook with API docs

v0.1.5 #documentation #cautious-octo-funicular #cautious #docs #book
lingua-italian-language-model

The Italian language model for Lingua, an accurate natural language detection library

v1.2.0 9.2K #language-recognition #lingua #language-model #language-detection #nlp
textr

TeX-inspired plug-n-play interface for converting JSON documents into PDFs

v0.3.0 170 #pdf #textr #identifier
lindera-ipadic

A Japanese morphological dictionary for IPADIC

v0.41.0 25K #japanese #morphological #dictionary #ipadic
charwise

This lightweight, dependency-free rust library provides a convenient way to read characters from different resources

v1.0.1 #buffering #stream #character #lexer #peek
md_parser_wasm

A markdown parser written in Rust and compiled to WebAssembly

v0.3.5 bin+lib #parser #wasm #svelte #part
markdown2unicode

Converter from markdown notation to unicode characters

v0.2.1 bin+lib #character #unicode #spans #string #characters
unicode_skeleton

detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39

v0.1.1 #skeleton #confusable #unicode #unicode-text #text
cmark2tex

A small utility to convert markdown files to pdf exploiting tectonic

v0.4.0-beta.1 bin+lib #tex #cmark #common-mark
CorrosionMark

Markdown parser library

v0.1.1 #model #ast-parser #parser #libary #parse-to-ast-model
token-read

reading whitespace delimited files intended for competitive programming

v0.2.0 #parser #programming #token #input #line
minigrep_sopesto

minigrep is an application built by following the guide in the book *The Rust Programming Language*. It aims to recreate the grep application in a minimalist way.

v0.1.1 bin+lib #mini-grep #minigrep-sopesto #sopesto
quoted-string-parser

Quoted string parser for grammar defined in RFC3261

v0.1.0 12K #quoted-string #parser #rfc-3261
mdbook-files

Preprocessor for mdbook which renders files from a directory as an interactive widget

v0.2.0 bin+lib #mdbook #mdbook-files #widgets #serve #book
books_description_parser

A Rust-based parser to extract book details from structured markdown-like text and output them in formats like JSON or Rust structs for further processing

v0.1.0 bin+lib #book #description #parser #grammar
word_iter

Iterator over all words in a string

v0.2.1 #iterator #string #word
quick_io

facilitate input and output within programs, with a set of macros

v2.0.0 #quick #quick-io #character #down #right #write #mv #20 #addstr #10
sitdown

Static site generator

v0.2.1 bin+lib #sitdown #generator #directory
df_cp437

Decoder for CP437 to UTF-8

v1.1.0 #cp437 #utf-8 #df-cp437
csv_coincidence

Tool designed to efficiently search for and identify specific patterns within CSV files

v0.1.1 #lib #csv #coincidence #file #find-partial-matches
kilo

small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄

v0.1.0 #source-map #kilo #magic-string #string-manipulating #parser
unidecode

pure ASCII transliterations of Unicode strings

v0.3.0 254K #transliteration #ascii #unidecode #unicode #unidecoder
lix-score

Calculate LIX score for a given text and language

v0.1.0 #nlp #readability #lix #text-analysis #language
rustextile

Textile markup language parser for Rust

v1.0.2 #html #markup #textile #text #block #table #image
backslash

parsing escape characters

v0.2.0 550 #character #backslash #characters
decline-word

Choose word form based on given number

v0.1.2 #numbers #word #decline
mime-rs

A text processing framework, inspired by Emacs lisp and keyboard macros

v0.3.0 #scripting #text-processing #mime-rs #cpp
txtframe

Creates a frame for text

v0.4.0 #frame #text #format #width #fill #top-line #left-top #right-top #left-btm #right-btm
iterlower

Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior

v1.0.1 #unicode #greek #azeri #turkish
yozuk-model

NLP model generator for Yozuk

v0.22.11 #yozuk #yozuk-model #model
makudaun

Markdown renderer tool made on Rust

v0.0.2 bin+lib #markdown #markdown-converter #makudaun #html #render-markdown #flavor
mdbook-reference-table

mdBook preprocessor to create reference tables

v0.1.0 app #mdbook #table #pre-processor
text-tables

A terminal/text table prettifier with no dependencies

v0.3.1 850 #pretty #terminal #ascii #table #cli
mdbook-extended-markdown-table

Preprocessor for mdBook that generates tables with merged cells from ASCII text

v0.1.0 bin+lib #mdbook-preprocessor #mdbook #markdown-tables #table #mdbook-pre-processor #markdown #diagram #build-utils
stfu

Shut The Ferris Up - profanity filtering for Rust

v0.1.0 #word #bad #filter #censor #profanity #act #words
html2runes

An HTML to Text converter

v1.0.1 950 bin+lib #markdown-converter #plain-text #html #markdown
twemoji-rs

A word-cloud image generation crate

v0.1.2 #emoji #unicode #icons #image
chisel-parsers

Chisel parser front ends

v1.1.0 #parser #chisel #chisel-parsers #end
morc

Dead simple, minimal markdown generator library written in Rust

v0.0.2 #markdown #markdown-generator #library #md #generator #readme
docstring

manipulating and parsing documentation strings

v0.2.4 #documentation #doc-string #move-idl
aki-json-pick

The json pick out command

v0.1.10 600 bin+lib #json #filter #text
text_to_emoji

Convert text to emoji

v0.1.0 #emoji #text #convert #rust
poetry-book

Create a poetry book in latex, starting from plain text

v0.1.3 #poem #latex #book #poetry #verse
blingfire

Wrapper for the BlingFire tokenization library

v1.0.0 1.8K #nlp #tokenize #machine-learning
xsystem

Conversion between the Esperanto x-system and Unicode circumflexes

v0.1.0 #esperanto #xsystem #character #unicode-chars #x-to-unicode
terminal-emoji

safely displaying emoji inside of terminals

v0.4.1 1.5K #terminal #emoji #terminal-emoji
presciidoc

Preprocessing AsciiDoc for other tools

v0.4.1 app #asciidoc #documentation #redhat
unicode-utf8

that converts utf-8 bytes to a unicode scalar value, and vice versa

v0.1.3 #versa #utf-8 #unicode
pdf_encoding

Font related encodings

v0.4.0 300 #pdf #encoding #pdf-encoding #system
openlibrary-rs

A wrapper around openlibrary's Web API

v0.3.1 #book #ebook #openlibrary #api-bindings #author #books #search
norm-email

strip email provider defined behaviour from email addresses

v0.1.0 #emoji #unicode #addresses
re2

Wrapper for the re2 C++ regex library

v0.0.8 #re2 #syntax #matching #boundary #character #digits
rough

A very simple and opinionated static site generator

v0.2.0 app #rough #format #static-site-generator #markdown
quartz_commands

Generates a parser at compile-time for handling commands similar in structure to those of Minecraft

v0.1.0 #cli-parser #command #parser #minecraft #command-line-tool #cli-command
alpino-tokenize

Wrapper around the Alpino tokenizer for Dutch

v0.4.0 app #tokenize #dutch #alpino-tokenizer
cmdcjones_minigrep

A minimal grep clone from the Rust Book

v0.1.0 bin+lib #book #mini-grep #cmdcjones-minigrep
paxcii

Transform images and videos to ascii

v0.5.1 bin+lib #paxcii #ascii #image #video #ascii-art #com-watch #v-jt-xl-ln-aas #command-line
mdbook-trace

A traceable document preprocessor for mdbook

v0.1.1 app #mdbook-preprocessor #mdbook #mdbook-pre-processor
chapter-8-exercises

Exercises from the 8th chapter of the book

v0.1.0 app #book #chapter #chapter-8-exercises
rckive-genpdf

User-friendly PDF generator written in pure Rust

v0.4.0 #pdf #text-layout #element #family #page #text #table #file #system #document
jpreprocess-dictionary

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 310 bin+lib #text-to-speech #open-j-talk #library
suffix

arrays

v1.3.0 7.5K bin+lib #search-index #text-search #search #unicode #text
jiang_mini_grep

minigrep: search a file for a given string

v0.1.1 bin+lib #mini-grep #grep #jiang-mini-grep #查询文件的 #锈书中的
catmark

Console printer for CommonMark

v0.2.2 app #common-mark #catmark #terminal #syntax-highlighting #ansi #markdown
ssml-parser

parsing Speech Synthesis Markup Language

v0.1.4 #language #ssml #parser #parse-ssml
johalun/module

FreeBSD kernel module in Rust

GitHub 0.1.0 3.6K nightly #module #binary-heap #html #machine
pdf_composer_base

PDF Composer base functionality crate

v0.3.0 #markdown #yaml #composer #generate #pdf #margin
xsv

A high performance CSV command line toolkit

v0.13.0 1.1K app #csv #slice #csv-tsv #tsv #command
readable-readability

Really fast readability

v0.4.0 700 #dom #extract #text #html #html-text #text-extract
encoding-next-types

Traits and types for the encoding package

v0.2.0 1.1K #charset #unicode #encoding-next #package
hsk

Return HSK Level for Simplified Chinese Characters

v0.1.1 #chinese #hsk #hanzi #character
mdbook-mathpunc

An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex

v0.2.0 bin+lib #mdbook-preprocessor #mdbook #katex #punctuation #mdbook-pre-processor
asciifolding

ascii folding library

v0.1.0 1.2K #ascii #folding #unicode #lucene
tiniestsegmenter

Compact Japanese segmenter

v0.3.0 #tokenize #japanese #nlp #ngrams
vaporetto_tantivy

Vaporetto Tokenizer for Tantivy

v0.22.3 500 #tantivy #tokenize #japanese #tokenizer
pdf_form

programmatically filling out PDF forms

v0.4.0 #forms #pdf #field
with-str-bytes

Safely manipulate the bytes of a UTF-8 string

v1.0.0 no-std #ascii-text #string #ascii-string #utf-8 #byte #ascii #safe
bytepiece_rs

The Bytepiece Tokenizer Implemented in Rust

v0.2.2 #nlp #tokenize #bytepiece #deep-learning #tokenizer
morsels_lang_ascii

Basic ascii tokenizer for morsels

v0.7.3 #ascii #morsels #search #package
md-to-html

CLI tool to convert Markdown files to HTML

v0.1.1 app #html #md-to-html #文件 #特性 #文件名 #页面
uniaxe

replace Unicode letters with Ascii equivalents

v0.1.1 #ascii #unicode #text-processing #cleaning #equivalent
text_converter

A trait that helps with manipulating text

v0.1.0 #converter #text #text-converter #reverse-text #format
basic_lexer

Basic lexical analyzer for parsing and compiling

v0.2.1 #tokenize #line-comment #tokenizer #compilation #set-line-comment
tfidf-summarizer

Basic tf-idf compute for documents

v2.0.0 #tf-idf #nlp #document #summarizer #text-processing #documents
bgrp

A very simple minigrep in terminal

v0.1.0 bin+lib #terminal #bgrp
overlap

shows overlap text in files

v0.0.2 bin+lib #text #cli #overlap
jellybean-pack-1

Sweet syntax highlighting with tree-sitter

v0.0.2 #syntax-highlighting #highlight #tree-sitter
minigrep-yogie

A demo Rust to grep some word from rust-lang.com

v0.1.1 bin+lib #mini-grep #minigrep-yogie #yogie
maybe_utf8

Byte container optionally encoded as UTF-8

v0.2.3 nightly #utf-8 #container #string
termbook

behind the termbook-cli

v1.4.2 #common-mark #mdbook #markdown #terminal
mdbook-post

A CLI for adding posts to mdbook

v0.1.3 app #mdbook #rust-book #post #book
ewts-c

Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)

v0.1.0 #converter #ewts #tibetan #localization
ergrep

grep strings within a line from a text file

v0.1.1 bin+lib #ergrep #file #mini-grep #string
fnew

A Unicode-aware line-oriented drop-in replacement for coreutils' fold

v1.0.1 app #coreutils #fold #text-processing #command-line-tool
bgrep

grep tailored to handle binary patterns and files

v1.0.0 app #grep #regex #binary #search-pattern #pattern
minigrep-bahadir

A fun project to learn the great language Rust

v0.1.1 bin+lib #mini-grep #minigrep-bahadir #bahadir
password-characters

help with the "enter the 12th, 35th, and 63rd characters from your password" situations

v1.0.1 app #character #situations #password #characters
static_format

Format strings with no runtime overhead

v0.0.3 macro no-std #const-format #no-std #const #format
rusty_word_builder

Syllable and Word generation library written fully in Rust

v0.6.3 #word #syllable #language #linguistics #conlang
unicode_font

Convert unicode characters between fonts

v0.1.1 bin+lib #unicode #font #character #characteristics
lingua-slovak-language-model

The Slovak language model for Lingua, an accurate natural language detection library

v1.2.0 8.7K #language-recognition #lingua #language-detection #nlp #language-identification
mdtranslation-cli

Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents

v0.1.0 app #translation #markdown #common-mark #localization #document
spyglass

Search engine for documents, inspired by bioinformatics

v1.1.0 #spyglass #wildcard #character #bioinformatics #regex #distance
text-sanitizer

convert text to plain ASCII text

v1.6.0 #utf-8 #ascii #unicode #sanitizing #text-processing
bookbinder_latex

Produce latex and pdf books

v0.1.1 #latex #book #bookbinder-latex #books #markdown
pix-brcode

A ready to use compliant PIX specification, featuring fast de/serialization

v0.1.0 #brcode #pix #emv-qrcps #pdf #pix-toolbelt
rustrawi

Rust port of the original PHP Sastrawi

v0.1.2 #tokenize #nlp #stem #sastrawi #stopword
vaporetto_rules

Rule-based filters for Vaporetto

v0.6.5 1.0K no-std #japanese #morphological-analysis #tokenize #morphological
utf8reader

wrapper around Reader that returns a stream of UTF-8 characters

v0.1.0 #character #utf8reader #reader #access #code-point #characters
gulpeaseindex

Calculate Gulpease index for a given text and language

v0.1.0 #nlp #readability #gulpease #text-analysis
caseformat

Power flow case data format

v0.1.0 bin+lib #format #caseformat #directory
grep-clone

A mini grep clone from the Rust-lang official tutorial

v0.2.1 bin+lib #grep-clone #search #clone #tutorial #html
STKLR

STKLR (pronounced 'stickler') is a CLI tool that automatically links functions, enums, structs, traits, etc. in rustdoc docstrings. I couldn't find a tool like this when I needed it, so... here we are.

v0.0.42 bin+lib #documentation #search-pattern #rustdoc #pattern #sed #docs
rsonpath-test-codegen

Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.

v0.5.1 #json-path #simd #query #parser #json
rs_handstrength

Omaha hand strength calculator relative to the board, with equity on the flop

v0.4.3 160 nightly #flop #rs-handstrength #handstrength #pdf
codepage

Mapping between Windows code page numbers and encoding_rs character encodings

v0.1.2 163K #winapi #codepage #unicode #windows
minigrep_flict

Simplest text-in-file search engine from rust book

v0.1.1 bin+lib #mini-grep #book #minigrep-flict #engine
minigrep_iaziz786

grep

v0.1.0 bin+lib #mini-grep #case-insensitive #filename
llmvm-chat

An llmvm frontend that acts as a CLI chat interface

v0.1.1 app #artificial-intelligence #llm #llmvm #chat #demo #ai
japanese-ruby-filter

Japanese ruby notation parser

v0.1.0 #japanese-ruby #ruby #japanese-ruby-filter #text #parser #pulldown-cmark
emoji_converter

Converts text to emojis

v0.1.0 #emoji #converter #unicode #rust #unicode-text #text
font-map-core

Core font-parsing capabilities for font-map

v0.2.9 140 #font #true-type-font #svg #macro #api-bindings #true-type #preview
ascii-alphabetic-char

Traits for ASCII alphabetic characters

v0.1.1 #alphabetic #ascii #ascii-alphabetic-char
corpus-preproc

A preprocessor for text and HTML corpora

v0.1.0 app #pre-processor #corpus #word #cli #character #mark #element #text #break
xmldecl

Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way

v0.2.0 1.3K #web #charset #unicode
mnumonic

A tiny library to convert opaque binary data to and from a human-memorable phrase

v0.2.0 #human-readable #word #convert #encode #words
mathml-latex

Convert between MathML and LaTeX

v0.0.3 #latex #mathml #mathml-latex #convert #commit #monorepo
lindera-assets

A helper crate to fetch assets and build dictionary for lindera

v0.32.3 10K #japanese #dictionary #assets #morphological
align_text

Aligns lines in a block of text within a number of columns

v1.0.0 #pretty-print #text #format
encoding

Character encoding support for Rust

v0.2.33 210K #unicode #charset #ascii #encoder-trap
jieba-macros

jieba-rs proc-macro

v0.7.1 9.0K macro #nlp #chinese #segmenation #proc-macro #jieba
latin1str

Windows-1252 string types

v0.1.3 #latin1str #encoded #nul-terminated #utf-8 #ascii #slice #nul-bytes #encoding
yeslogic-fontconfig

RENAMED: use the fontconfig crate instead

v0.1.1 #fontconfig #font #wrapper #search
tex

The νTeX typesetting engine

v0.1.1 bin+lib #typesetting #latex #engine #format
trexter

Text progression tracking library

v0.1.1 #text-processing #trexter #unit
ascii-read

BufRead-like methods for reading into an AsciiString

v0.1.0 #ascii-text #ascii-string #ascii #reader #string #ascii-buf-read #line
askama-filters

Extra template filters for Askama

v0.1.3 #askama #html #text-html #filter #text
json-peek

Amateur JSON parser library designed for my specific needs

v0.0.2 nightly #json #peek #json-peek #parser
tectonic_engine_xetex

The XeTeX engine as a reusable crate

v0.4.4 650 sys #typesetting #xetex #tex
conveyance

A stop-gap CLI for conveyancing

v0.1.3 app #docx #xml #word
token-counter

wc for tokens: count tokens in files with HF Tokenizers

v0.1.0 app #nlp #tokenize #token-counter #tokenizer #count #stdin #pattern
rustascii

Display Rust in ASCII

v0.1.2 #ascii #donis #rustascii
dom-content-extraction

Content extraction via text density paper

v0.3.10 170 bin+lib #density #dom-text-density #document #html #content-extraction #scraping #paper #documents
gbx

GBX (Grundbuch-Exchange) file format

v1.0.1 #dateiformat #gbx #pdf
typeline_ext_http

http(s) tooling for typeline

v0.1.0 #stream #shell #pipeline #tl
pdftotext

High-level library that binds to Poppler to extract text from a PDF

v0.1.5 #pdf #text #poppler #api-bindings
html-to-pulldown-cmark-events

Parse HTML to pulldown-cmark's events

v0.1.12 #events #pulldown-cmark #html
lindera-dictionary-builder

Shared code for building Lindera dictionary files

v0.32.3 12K #japanese #builder #dictionary #unidic #morphological
naveengrep

command line tool similar to the grep

v0.1.0 bin+lib #grep #naveengrep #name
brack-tokenizer

The tokenizer for the Brack programming language

v0.1.0 #tokenize #language #brack
h_hangul

Korean Characters

v0.1.0 bin+lib #character #hangul #h-hangul #characters
rs_html_parser_tokenizer

Rs Html Parser Tokenizer

v0.0.10 #html-parser #tokenize #tags #input #tokenizer #case-insensitive #instructions
minigrep_desonglll

grep implementation from The Rust Programming Book

v0.1.2 bin+lib #book #mini-grep #minigrep-desonglll #query #txt
pdf_forms

programmatically filling out PDF forms

v0.3.4 #forms #pdf #field #pdf-form
borderrs

Add stylish borders around your text and datastructures

v0.1.1 #ansi-term #unicode #ansi-terminal #ascii #terminal #ansi #data-structures #cli
ascii-to-hex

A small, simple library for converting an ASCII text string into its hexadecimal equivalent

v0.1.1 #ascii #string #ascii-to-hex
slicer

that slices string slices into smaller string slices

v0.1.1 330 #slice #string #parser #as-slicer #skip-over
jellybean-pack-0

Sweet syntax highlighting with tree-sitter

v0.0.2 #syntax-highlighting #highlight #tree-sitter
mdify

A CLI tool that translates md files to html while keeping project structure

v0.4.1 app #structure #mdify #html
tpng

A small tool that prints truecolor png renderings to the terminal using unicode block characters

v0.1.6 bin+lib #tpng #character #true-color #characters
genpdfi

User-friendly PDF generator written in pure Rust

v0.2.1 140 #pdf #text-layout #text #layout
my_project_parser_super_puper

A brief description

v0.1.0 bin+lib #parser #description #super
folia

High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)

v0.0.6 bin+lib #annotations #nlp #linguistics #xml #text-processing #annotation #declaration
grep-searcher

Fast line oriented regex searching as a library

v0.1.14 128K #regex #grep #search-pattern #pattern
ruby-parser

A parser for the Ruby language

v0.0.0-dev1 #parser #array #escaping #language #input #background #mri
grep-table-converter

A cli utility to convert grep result to table (csv, markdown, textile)

v0.0.3 bin+lib #grep #grep-table-converter #csv #markdown #line #filename #textile #file #testing
fontfor

find fonts which can show a specified character and preview them in terminal or browser

v0.4.3 750 app #command-line-utilities #font #character #cli-utils #cli
my_parser_kma_test_group_3_1

A brief description

v0.1.0 bin+lib #description #kma #testing #parser
tabled

An easy-to-use library for pretty-printing tables of Rust structs and enums

v0.18.0 962K no-std #pretty-table #pretty-print #tabled #terminal #format #table
jpreprocess-dictionary-builder

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.10.0 #open-j-talk #text-to-speech #library
findtext_pdf

Search text in PDF

v0.1.2 130 bin+lib #pdf #text-search #search #text #cli
lindera-wasm

A morphological analysis library for WebAssembly

v0.40.2 250 #morphological-analysis #wasm #library
wz-conf

Configuration options for wz

v1.0.1 #wz #rayon #wz-conf #character #format #byte #word #line
unic-ucd-hangul

UNIC — Unicode Character Database — Hangul Syllable Composition & Decomposition

v0.9.0 9.8K #hangul #unicode #decomposition #unicode-text #unic #text #decompose-syllable
ascii_utils

handle ASCII characters

v0.9.3 790K #ascii #character #ascii-utils #ascii-characters
alphabet-encoder

A quick and dirty way to deal with escape characters

v0.1.1 #character #alphabet #alphabet-encoder #characters
unicode_clusters

variable width unicode characters as single items, allowing for array like indexing etc

v0.1.2 #character #grapheme #unicode #unicode-text #cluster #text
shift_or_euc

Detects among the Japanese legacy encodings

v0.1.0 #charset #web #shift-jis
dvi2html

converter

v0.2.0 #html #converter #dvi2html #com-kisonecat-dvi2html
domrs

Document builder and serializer

v0.0.16 #css #svg #svg-css #serialization #html #web-page #builder
escaped-delimiter

Iterator of delimited slices with escape characters

v0.1.0 #escaping #iterator #text #character #character-escaping
