Crates.io | valentinus |
lib.rs | valentinus |
version | 0.5.3 |
source | src |
created_at | 2024-07-12 12:34:04.957532 |
updated_at | 2024-10-22 13:26:19.233679 |
description | Next generation vector database built with LMDB bindings |
homepage | |
repository | https://github.com/kn0sys/valentinus |
max_upload_size | |
id | 1300631 |
size | 310,325 |
next generation vector db built with lmdb bindings
git clone https://github.com/kn0sys/valentinus && cd valentinus
var | usage | default |
---|---|---|
LMDB_USER |
working directory of the user for database | $USER |
LMDB_MAP_SIZE |
Sets max environment size, i.e. size in memory/disk of all data | 20% of available memory |
ONNX_PARALLEL_THREADS |
parallel execution mode for this session | 1 |
VALENTINUS_CUSTOM_DIM |
embeddings dimensions for custom models | all-mini-lm-6 -> 384 |
VALENTINUS_LMDB_ENV |
environment for the database (i.e. test, prod) | test |
all-MiniLM-L12-v2_onnx
directorymkdir all-MiniLM-L12-v2_onnx
cd all-MiniLM-L12-v2_onnx && wget https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2/resolve/main/config.json
wget https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2/resolve/main/onnx/model.onnx
wget https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2/resolve/main/special_tokens_map.json
wget https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2/resolve/main/tokenizer_config.json
wget https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2/resolve/main/tokenizer.json
wget https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2/resolve/main/vocab.txt
RUST_TEST_THREADS=1 cargo test
see examples