neuralassimilator

Crates.io: neuralassimilator
lib.rs: neuralassimilator
version: 0.1.0
source: src
created_at: 2024-09-08 17:34:22.220142
updated_at: 2024-09-08 17:34:22.220142
description: NeuralAssimilator is a Rust crate for fine-tuning Large Language Models (LLMs) from unstructured text.
homepage: https://github.com/jamesfebin/neuralassimilator
repository: https://github.com/jamesfebin/neuralassimilator
max_upload_size:
id: 1368376
size: 117,555
Febin John James (jamesfebin)

documentation

README

NeuralAssimilator

NeuralAssimilator is a Rust crate for fine-tuning Large Language Models (LLMs) from unstructured text.

Features

  • Generate prompts based on specified use cases
  • Create instruction-response pairs for fine-tuning
  • Output results in JSONL format
  • Run fine-tuning on your LLM provider with the generated dataset

Installation

Add this to your Cargo.toml:

[dependencies]
neuralassimilator = "0.1.0"

Usage

Command-line Interface

NeuralAssimilator can be used via its command-line interface:

neuralassimilator --input ./input_folder --output ./output_folder --chunk-size 10000 --model gpt-4o-mini-2024-07-18 --use-case "Creative writing"

Arguments

  • --input or -i: Input directory path (default: "./input")
  • --output or -o: Output file or directory path (optional)
  • --chunk-size: Size of text chunks to process (default: 10000)
  • --model: LLM model to use (default: "gpt-4o-mini-2024-07-18")
  • --use-case: Specific use case for prompt generation (default: "Creative writing")
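
The arguments above map naturally onto a derive-style parser. The following is a minimal, purely illustrative sketch of how such an argument set might be declared, assuming the clap crate with its derive feature; it is not taken from NeuralAssimilator's source and the struct and field names are hypothetical.

use clap::Parser;

/// Illustrative CLI definition mirroring the documented arguments.
#[derive(Parser, Debug)]
struct Args {
    /// Input directory path
    #[arg(short, long, default_value = "./input")]
    input: String,

    /// Output file or directory path (optional)
    #[arg(short, long)]
    output: Option<String>,

    /// Size of text chunks to process
    #[arg(long, default_value_t = 10_000)]
    chunk_size: usize,

    /// LLM model to use
    #[arg(long, default_value = "gpt-4o-mini-2024-07-18")]
    model: String,

    /// Specific use case for prompt generation
    #[arg(long, default_value = "Creative writing")]
    use_case: String,
}

fn main() {
    let args = Args::parse();
    println!("input: {}, chunk size: {}", args.input, args.chunk_size);
}

With clap's derive conventions, the field names chunk_size and use_case become the --chunk-size and --use-case flags shown in the example invocation, and short produces the -i and -o aliases.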

How it Works

  1. Input Processing: The crate reads input files from the specified directory and chunks them into manageable sizes (see the sketch after this list).
  2. Prompt Tuning: Based on the given use case, it generates appropriate prompts for the LLM.
  3. Instruction Generation: For each chunk-prompt pair, it generates instruction-response pairs using the specified LLM.
  4. Output: The resulting pairs are written to a JSONL file in the specified output location.
  5. Fine-tuning: The generated dataset can then be used to fine-tune the LLM.
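
The chunking and JSONL-output steps can be pictured with the short sketch below. This is a minimal stand-in, not the crate's implementation: it assumes serde_json for serialization, uses the hypothetical paths ./input and dataset.jsonl, and replaces the LLM call of steps 2-3 with a placeholder pair.

use std::fs::{self, File};
use std::io::{BufWriter, Write};

/// Split a document into fixed-size character chunks (step 1).
fn chunk_text(text: &str, chunk_size: usize) -> Vec<String> {
    text.chars()
        .collect::<Vec<_>>()
        .chunks(chunk_size)
        .map(|c| c.iter().collect::<String>())
        .collect()
}

fn main() -> std::io::Result<()> {
    let chunk_size = 10_000;
    let mut writer = BufWriter::new(File::create("dataset.jsonl")?);

    for entry in fs::read_dir("./input")? {
        let path = entry?.path();
        if !path.is_file() {
            continue;
        }
        let text = fs::read_to_string(&path)?;

        for chunk in chunk_text(&text, chunk_size) {
            // Steps 2-3 would ask the LLM for an instruction-response pair here;
            // a placeholder pair stands in for that call.
            let instruction = String::from("<instruction generated by the LLM for this chunk>");
            let response = chunk;

            // Step 4: write one JSON object per line (JSONL).
            let line = serde_json::json!({
                "instruction": instruction,
                "response": response,
            });
            writeln!(writer, "{}", line)?;
        }
    }
    Ok(())
}

Each line of the resulting file is a standalone JSON object such as {"instruction": "...", "response": "..."}, a common shape for instruction-tuning datasets.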

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License.
