3 releases

Uses old Rust 2015

0.0.3 Feb 16, 2018
0.0.2 Feb 15, 2018
0.0.1 Feb 14, 2018

#13 in #corpus

CC0 license

18KB
310 lines

opus-parse

This library can parse OPUS's monolingual XML files. Currently it's only been tested on the OpenSubtitles corpus.

See also opus_tools which has an overlapping purpose.

Dependencies

~6–15MB
~188K SLoC