llm_devices

Crates.iollm_devices
lib.rsllm_devices
version0.0.2
sourcesrc
created_at2024-10-04 19:22:47.251522
updated_at2024-10-04 21:41:29.692482
descriptionManaging Devices and Builds for LLMs
homepagehttps://github.com/shelbyJenkins/llm_client
repositoryhttps://github.com/shelbyJenkins/llm_client
max_upload_size
id1396998
size58,310
Shelby Jenkins (ShelbyJenkins)

documentation

README

llm_devices: Managing Devices and Builds for LLMs

This crate is part of the llm_client crate.

The llm_interface crate uses it as a dependency for building llama.cpp.

It's functionality includes:

  • Cloning the specified tag, and building llama.cpp.

  • Checking for device availabilty (CUDA, MacOS) to determine what platform to build for.

  • Fetching available VRAM or system RAM for estimating the correct model to load.

  • Offloading model layers to memory.

  • Logging tools.

See the build documentation for more notes.

Commit count: 50

cargo fmt