llm_devices

Crates.iollm_devices
lib.rsllm_devices
version0.0.3
sourcesrc
created_at2024-10-04 19:22:47.251522
updated_at2025-01-29 01:26:24.671944
descriptionllm_devices: Device management and build system for LLM inference
homepagehttps://github.com/shelbyJenkins/llm_client
repositoryhttps://github.com/shelbyJenkins/llm_client
max_upload_size
id1396998
size69,808
Shelby Jenkins (ShelbyJenkins)

documentation

README

llm_devices: Device management and build system for LLM inference

API Documentation

The llm_devices crate is a workspace member of the llm_client project. It is used as a dependency by the llm_interface crate for building llama.cpp.

Features

  • Automated building of llama.cpp with appropriate platform-specific optimizations
  • Device detection and configuration for CPU, RAM, CUDA (Linux/Windows), and Metal (macOS)
  • Manages memory by detecting available VRAM/RAM, estimating model fit, and distributing layers across devices
  • Logging tools
Commit count: 64

cargo fmt