#thai #nlp #library #text #command-line

app khatson

Command-line Thai word segmenter (word breaker), ported from AttaCut

2 unstable releases

0.2.0 Jul 16, 2021
0.1.0 Jul 15, 2021

#9 in #thai

Apache-2.0

670KB
128 lines

khatson

A Rust port of the AttaCut Thai word tokenizer

Status

Work in progress (WIP)

Install

cargo install khatson

Run

khatson < input.txt > output.txt
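Because the command reads text from standard input and writes its result to standard output, it can also be driven from another program. The following is a minimal sketch using only the Rust standard library. The helper name `run_filter` is hypothetical and not part of khatson, the sketch assumes the `khatson` binary is on `PATH` after `cargo install khatson`, and it makes no assumption about the output format.

```rust
use std::io::Write;
use std::process::{Command, Stdio};

/// Hypothetical helper: feed `input` to a filter-style program on stdin
/// and return whatever it writes to stdout.
/// Suitable for small inputs; very large inputs should be written from a
/// separate thread so the two pipes cannot block each other.
fn run_filter(program: &str, input: &str) -> std::io::Result<String> {
    let mut child = Command::new(program)
        .stdin(Stdio::piped())
        .stdout(Stdio::piped())
        .spawn()?;

    {
        // Write the input, then drop the handle so the child sees end-of-file.
        let mut stdin = child.stdin.take().expect("stdin was requested as piped");
        stdin.write_all(input.as_bytes())?;
    }

    let output = child.wait_with_output()?;
    Ok(String::from_utf8_lossy(&output.stdout).into_owned())
}

fn main() {
    // Assumption: `khatson` is installed and on PATH.
    match run_filter("khatson", "สวัสดีครับ\n") {
        Ok(segmented) => print!("{segmented}"),
        Err(err) => eprintln!("could not run khatson: {err}"),
    }
}
```

The same helper works with any program that filters stdin to stdout, which makes it easy to try without khatson installed.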

Dependencies

~8–11MB
~220K SLoC