untanglr

Crates.iountanglr
lib.rsuntanglr
version1.1.0
sourcesrc
created_at2021-07-22 20:22:53.611173
updated_at2022-09-02 21:28:50.292209
descriptionProbabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
homepage
repositoryhttps://github.com/abutnaru/untanglr
max_upload_size
id426018
size4,415,695
Andrei Butnaru (abutnaru)

documentation

README

Untanglr

Untanglr

Untanglr takes in a some mangled words and makes sense out of them so you dont have to. It goes through the input and splits it probabilistically into words. The crate includes both a bin.rs and a lib.rs to facilitate both usage as a command line utility, and as a library that you can use in your code.

Usage

Pass the tangled words as a cli argument:

$ untanglr thequickbrownfoxjumpedoverthelazydog
the quick brown fox jumped over the lazy dog

Or use it in your projects:

extern crate untanglr;

fn main() {
	let lm = untanglr::LanguageModel::new();
	println!("{:?}", lm.untangle("helloworld"));
}

Installation

If you find that untanglr might be useful on your machine you can install it. Just make sure cargo is installed and run:

$ cargo install untanglr

Note: Don't be discouraged if this project hasn't been updated in a while. I will address potential issues but the crate does not need regular updates.

Credits

I have developed this project around Derek Anderson's wordninja python implementation for some exercising in rust while producing something useful.

Commit count: 34

cargo fmt