Crates.io | csep |
lib.rs | csep |
version | 0.1.9 |
source | src |
created_at | 2024-04-26 20:19:29.126248 |
updated_at | 2024-07-04 22:15:12.917666 |
description | Cosine Similarity Embeddings Print |
homepage | |
repository | |
max_upload_size | |
id | 1221766 |
size | 125,521 |
Cosine Similarity Embeddings Print
╭─── ╭────┬───────╮
│ ╰──╮ ├─ ╭───╯
╰───────╯ ╰── ╵
Like Grep (Global Regular Expression Print) takes a regular expression and prints all the lines that have a match in it, Csep (Cosine Similarity Embeddings Print) takes an input phrase and prints all the chunks that are similar to it.
The goal of this project is to give users command line access to semantic search in the same way that grep is used for regular expressions. This not only gives you a command line semantic search tool on any unix like system, but also allows you to use it in scripts and pipelines. If you combine it with a command line llm tool like chat-gipity or Ollama you could even potentially perform RAG in a simple unix shell script.
You can then install csep from this source using:
cargo install --path .
Or you can pull whatever the latest published version is from crates.io with
cargo install csep
If you want to use the ollama client option, you will need to install ollama and pull the default all-minilm model, or any model you wish to use with the model switch, since ollama currently doesnt suppor pulling the models for embeddings automatically like it does with llms.
ollama pull all-minilm
Per embedding, fastembed is actually much slower, but due to the overhead of making requests to ollama, for large directories, the embeddings cache builds much faster when using fastembed.