10 releases (5 breaking)

0.16.0 Feb 8, 2025
0.15.1 Jan 6, 2025
0.15.0 Dec 28, 2024
0.14.1 Nov 16, 2024
0.11.1 Jul 17, 2024

#677 in Machine learning

Download history 94/week @ 2024-10-23 21/week @ 2024-10-30 100/week @ 2024-11-13 16/week @ 2024-11-20 1/week @ 2024-11-27 3/week @ 2024-12-04 1/week @ 2024-12-11 108/week @ 2024-12-25 107/week @ 2025-01-01 25/week @ 2025-01-08 123/week @ 2025-02-05

125 downloads per month

MIT/Apache

2MB
52K SLoC

rten-generate is a layer on top of RTen which handles the generation loop for auto-regressive transformer models (aka. "transformer decoders" or "generative AI"). This includes managing the KV cache, sampling and post-processing logits etc.


lib.rs:

Utilities to simplify running auto-regressive RTen models such as transformer decoders.

For working examples, see the examples in the rten-examples crate which import rten_generate.

Dependencies

~1.6–3MB
~61K SLoC