| Crates.io | rten-generate |
| lib.rs | rten-generate |
| version | 0.22.1 |
| created_at | 2024-07-05 21:09:36.400636+00 |
| updated_at | 2025-09-18 18:22:28.925717+00 |
| description | Utilities to simplify running auto-regressive models with RTen |
| homepage | https://github.com/robertknight/rten |
| repository | https://github.com/robertknight/rten |
| max_upload_size | |
| id | 1293359 |
| size | 98,389 |
rten-generate is a layer on top of RTen that handles the generation loop for auto-regressive transformer models (also known as "transformer decoders" or "generative AI"). Its responsibilities include managing the KV cache, post-processing logits, and sampling output tokens, among other tasks.
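To make the idea of a "generation loop" concrete, here is a minimal sketch in plain Rust with no dependencies. It does not use rten-generate's API: `ToyModel`, `KvCache`, `apply_temperature`, `argmax` and `generate` are hypothetical names invented for illustration, and the "model" is a stand-in that always favors the token after the most recent one. The sketch only shows the three pieces named above: a cache that grows as tokens are processed, a logits post-processing step, and a sampling step (greedy here).

```rust
/// Hypothetical stand-in for a decoder model (not an rten-generate type).
struct ToyModel {
    vocab_size: usize,
}

/// Hypothetical KV cache. A real cache stores per-layer key/value tensors;
/// this one only records which tokens have already been processed.
struct KvCache {
    tokens: Vec<u32>,
}

impl ToyModel {
    /// Process only the new tokens, extend the cache, and return logits
    /// for the next position. The toy rule favors `(last + 1) % vocab_size`.
    fn forward(&self, new_tokens: &[u32], cache: &mut KvCache) -> Vec<f32> {
        cache.tokens.extend_from_slice(new_tokens);
        let last = *cache.tokens.last().expect("cache must not be empty") as usize;
        let next = (last + 1) % self.vocab_size;
        (0..self.vocab_size)
            .map(|i| if i == next { 5.0 } else { 0.0 })
            .collect()
    }
}

/// Logits post-processing step: scale logits by a temperature.
fn apply_temperature(logits: &mut [f32], temperature: f32) {
    for logit in logits.iter_mut() {
        *logit /= temperature;
    }
}

/// Greedy sampling: pick the index of the largest logit (first one on ties).
fn argmax(logits: &[f32]) -> u32 {
    let mut best = 0;
    for (i, &logit) in logits.iter().enumerate() {
        if logit > logits[best] {
            best = i;
        }
    }
    best as u32
}

/// The generation loop: feed the prompt once, then feed one token per step,
/// stopping at `max_tokens` or when the end-of-sequence token is sampled.
fn generate(model: &ToyModel, prompt: &[u32], max_tokens: usize, eos: u32) -> Vec<u32> {
    let mut output = Vec::new();
    if prompt.is_empty() {
        return output;
    }
    let mut cache = KvCache { tokens: Vec::new() };
    let mut input: Vec<u32> = prompt.to_vec();
    for _ in 0..max_tokens {
        let mut logits = model.forward(&input, &mut cache);
        apply_temperature(&mut logits, 0.7);
        let token = argmax(&logits);
        if token == eos {
            break;
        }
        output.push(token);
        // Thanks to the cache, only the newly sampled token is fed next.
        input = vec![token];
    }
    output
}
```

Calling `generate(&ToyModel { vocab_size: 5 }, &[0], 10, 4)` under these assumptions yields the tokens 1, 2, 3 and then stops when token 4 (the end-of-sequence token in this toy setup) is sampled. A library such as rten-generate exists so that application code does not have to hand-write this loop for real models.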