Crates.io | llama2_rs |
lib.rs | llama2_rs |
version | 0.1.1 |
source | src |
created_at | 2023-07-25 09:51:07.433981 |
updated_at | 2023-07-25 09:56:33.636497 |
description | A rust port of llama2.c |
homepage | |
repository | |
max_upload_size | |
id | 925362 |
size | 34,353 |
This crate is derived from https://github.com/karpathy/llama2.c and adds multi-threaded inference.
Inference with this Rust port runs more than 3 times faster than with the original llama2.c.
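
To illustrate what multi-threaded inference can look like, here is a minimal sketch of a row-parallel matrix-vector product, the operation that dominates the forward pass in llama2.c. It is not taken from this crate's source: the function name `matmul_parallel`, its signature, and the chunking strategy are assumptions chosen for illustration, and it uses only the standard library's scoped threads.

```rust
use std::thread;

/// Computes `out = W * x`, where `w` holds a row-major matrix with
/// `out.len()` rows and `x.len()` columns.
/// Hypothetical helper for illustration; not this crate's API.
fn matmul_parallel(out: &mut [f32], w: &[f32], x: &[f32], n_threads: usize) {
    let rows = out.len();
    let cols = x.len();
    assert_eq!(w.len(), rows * cols, "weight matrix has the wrong size");
    if rows == 0 {
        return;
    }

    // Ceiling division so that every output row falls into exactly one chunk.
    let n_threads = n_threads.max(1);
    let rows_per_chunk = (rows + n_threads - 1) / n_threads;

    // Scoped threads may borrow `w` and `x` because the scope joins
    // all workers before it returns.
    thread::scope(|s| {
        for (chunk_idx, out_chunk) in out.chunks_mut(rows_per_chunk).enumerate() {
            let first_row = chunk_idx * rows_per_chunk;
            s.spawn(move || {
                // Each worker writes only to its own disjoint slice of `out`.
                for (i, o) in out_chunk.iter_mut().enumerate() {
                    let start = (first_row + i) * cols;
                    let row = &w[start..start + cols];
                    *o = row.iter().zip(x).map(|(a, b)| a * b).sum();
                }
            });
        }
    });
}
```

Because the output rows are independent, the workers need no locks; `chunks_mut` hands each thread a disjoint mutable slice. Any speedup depends on the matrix size and the number of cores, since spawning threads for every call has a cost of its own.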