Crates.io | textcat |
lib.rs | textcat |
version | 0.3.2 |
source | src |
created_at | 2021-03-05 19:51:56.220663 |
updated_at | 2022-01-05 17:58:07.241776 |
description | Simple library to detect text categories. It can be used to detect the language of a given text |
homepage | |
repository | https://github.com/crodas/textcat-rs |
max_upload_size | |
id | 364511 |
size | 36,739 |
Library to extract N-Grams from texts. This is a low level library. Lingo is build on top of this library to detect human languages on texts.
This library provides tools to train with sample texts, extracting N-Grams from texts, create sample and train categories. The trained data can be serialized to be used later. The library also provides tools to detect to which pretained category a given text would be closer to.