tfidf-summarizer

Crates.iotfidf-summarizer
lib.rstfidf-summarizer
version2.0.0
sourcesrc
created_at2023-11-09 00:03:53.000501
updated_at2023-11-09 00:03:53.000501
descriptionBasic tf-idf compute for documents
homepagehttps://github.com/zakariahajji/tfidf_summarizer
repositoryhttps://github.com/zakariahajji/tfidf_summarizer
max_upload_size
id1029546
size9,505
Zakaria Hajji (zakariahajji)

documentation

README

TF-IDF Summarizer

tfidf-summarizer is a Rust crate for computing Term Frequency-Inverse Document Frequency (TF-IDF) vectors for a set of documents. This statistical measure is used to evaluate how important a word is to a document in a collection or corpus.

Features

  • Tokenization of text data.
  • Calculation of term frequency (TF) for words in each document.
  • Calculation of document frequency (DF) for words across all documents.
  • Calculation of inverse document frequency (IDF) across all documents.
  • Computation of TF-IDF scores for words in each document.

Installation

Add this to your Cargo.toml:

[dependencies]
tfidf-summarizer = "0.1.0"
Commit count: 9

cargo fmt