Crates.io | simple_llama |
lib.rs | simple_llama |
version | 0.1.3-b |
source | src |
created_at | 2024-07-27 10:34:13.923638 |
updated_at | 2024-07-28 13:31:33.462346 |
description | A crate for run llama.cpp in Rust. based on llama-cpp-2 |
homepage | |
repository | https://github.com/L-jasmine/simple_llama |
max_upload_size | |
id | 1317171 |
size | 38,617 |
This project, Simple Llama, is a library that encapsulates commonly used large model prompts based on the llama-cpp-2 framework. It aims to simplify the interaction with large-scale models by providing a streamlined interface for managing and invoking model prompts. This library is designed to enhance the efficiency and ease of use for developers working with large models in various applications.
git clone https://github.com/L-jasmine/simple_llama
wget https://huggingface.co/second-state/Llama-3-8B-Instruct-GGUF/resolve/main/Meta-Llama-3-8B-Instruct-Q5_K_M.gguf
This project uses dynamic linking to connect to llama.cpp, so it is necessary to download or compile the llama.cpp dynamic link library in advance.
Before running the project, you need to configure environment variables to specify the location of the Llama library and the search path for dynamic link libraries. Please follow the steps below:
export LLAMA_LIB={LLama_Dynamic_Library_Dir}
# export LD_LIBRARY_PATH={LLama_Dynamic_Library_Dir}
Use the following command to run the example program:
cargo run --example simple -- --model-path Meta-Llama-3-8B-Instruct-Q5_K_M.gguf --model-type llama3 --prompt-path static/prompt.example.toml
We welcome any form of contributions, including bug reports, new feature suggestions, and code submissions.
This project is licensed under the MIT License.