16 releases (8 breaking)
0.9.4 | Jul 27, 2020 |
---|---|
0.9.0 | Jun 23, 2020 |
0.7.0 | Nov 8, 2019 |
0.6.1 | Jun 21, 2019 |
0.2.0 | Sep 28, 2018 |
#449 in Machine learning
63 downloads per month
Used in finalfrontier-utils
180KB
4.5K
SLoC
finalfrontier
Introduction
finalfrontier is a Rust program for training word embeddings. finalfrontier currently has the following features:
- Models:
- skip-gram (Mikolov et al., 2013)
- structured skip-gram (Ling et al., 2015)
- directional skip-gram (Song et al., 2018)
- dependency (Levy and Goldberg, 2014)
- Output formats:
- finalfusion
- fastText
- word2vec binary
- word2vec text
- GloVe text
- Noise contrastive estimation (Gutmann and Hyvärinen, 2012)
- Subword representations (Bojanowski et al., 2016)
- Hogwild SGD (Recht et al., 2011)
- Quantized embeddings through the
finalfusion quantize
command.
The trained embeddings can be stored in the versatile finalfusion
format, which can be read and used with the
finalfusion crate
and the
finalfusion
Python module.
The minimum required Rust version is currently 1.40.
Where to go from here
- Installation
- Quickstart
- Manual pages:
- finalfrontier-skipgram(1) — train word embeddings with the (structured) skip-gram model
- finalfrontier-deps(1) — train word embeddings with dependency contexts
- finalfusion crate
- Python module
Dependencies
~13–24MB
~342K SLoC