| Crates.io | hf-mem |
| lib.rs | hf-mem |
| version | 0.0.5 |
| created_at | 2025-03-10 11:00:18.277163+00 |
| updated_at | 2025-07-14 17:13:42.877915+00 |
| description | CLI to estimate inference memory requirements from the Hugging Face Hub |
| homepage | |
| repository | https://github.com/alvarobartt/hf-mem |
| max_upload_size | |
| id | 1586440 |
| size | 76,057 |
CLI to estimate inference memory requirements from the Hugging Face Hub
$ cargo install hf-mem

$ hf-mem --help
CLI to estimate inference memory requirements from the Hugging Face Hub

Usage: hf-mem [OPTIONS] --model-id <MODEL_ID>

Options:
  -m, --model-id <MODEL_ID>  ID of the model on the Hugging Face Hub
  -r, --revision <REVISION>  Revision of the model on the Hugging Face Hub [default: main]
  -t, --token <TOKEN>        Hugging Face Hub token with read access over the provided model ID, optional
  -d, --dtype <DTYPE>        Target dtype for conversion (float32, float16, bfloat16, float8, float4)
  -h, --help                 Print help
  -V, --version              Print version
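
To make the role of the `--dtype` option more concrete, here is a minimal sketch of how a weights-only estimate could be derived from a parameter count and a target dtype. It is not taken from the hf-mem source: the names `Dtype`, `parse` and `estimate_weight_bytes`, the bit widths assumed for `float8` (8 bits) and `float4` (4 bits), and the 7-billion parameter count are all illustrative assumptions, and the sketch ignores activations, KV cache, and any runtime overhead.

```rust
/// Target dtypes as listed in the `--dtype` help text.
/// The enum and its methods are hypothetical, not hf-mem's own types.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Dtype {
    Float32,
    Float16,
    Bfloat16,
    Float8,
    Float4,
}

impl Dtype {
    /// Assumed storage width of one parameter, in bits.
    fn bits(self) -> u64 {
        match self {
            Dtype::Float32 => 32,
            Dtype::Float16 | Dtype::Bfloat16 => 16,
            Dtype::Float8 => 8,
            Dtype::Float4 => 4,
        }
    }

    /// Parse the dtype names shown in the help text.
    fn parse(name: &str) -> Option<Dtype> {
        match name {
            "float32" => Some(Dtype::Float32),
            "float16" => Some(Dtype::Float16),
            "bfloat16" => Some(Dtype::Bfloat16),
            "float8" => Some(Dtype::Float8),
            "float4" => Some(Dtype::Float4),
            _ => None,
        }
    }
}

/// Weights-only estimate: parameters times bits per parameter,
/// rounded up to whole bytes.
fn estimate_weight_bytes(param_count: u64, dtype: Dtype) -> u64 {
    (param_count * dtype.bits() + 7) / 8
}

fn main() {
    // Hypothetical parameter count, chosen only for illustration.
    let param_count: u64 = 7_000_000_000;
    for name in ["float32", "float16", "bfloat16", "float8", "float4"] {
        let dtype = Dtype::parse(name).expect("dtype name from the help text");
        let bytes = estimate_weight_bytes(param_count, dtype);
        let gib = bytes as f64 / (1024.0 * 1024.0 * 1024.0);
        println!("{name:>8}: {gib:.2} GiB");
    }
}
```

Working in bits rather than bytes keeps the arithmetic in integers even for sub-byte dtypes such as `float4`.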
Works from the safetensors files on the Hugging Face Hub that contain the metadata
Supports both sharded, i.e. model-00000-of-00000.safetensors, and not sharded, i.e. model.safetensors, files
gguf

This project is licensed under either of the following licenses, at your option:
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in this project by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.