budoux

Crates.iobudoux
lib.rsbudoux
version0.1.1
sourcesrc
created_at2022-01-11 13:56:00.566266
updated_at2022-05-15 12:46:21.769207
descriptionRust port of BudouX (machine learning powered line break organizer tool).
homepage
repositoryhttps://github.com/sg0hsmt/budoux-rs
max_upload_size
id512235
size117,264
(sg0hsmt)

documentation

README

BudouX-rs

Crates.io API reference Test License

BudouX-rs is a rust port of BudouX (machine learning powered line break organizer tool).

Note: This project contains the deliverables of the BudouX project.

Note: BudouX-rs supported plain text only, not supports html inputs.

Demo

https://sg0hsmt.github.io/budoux-rs/

Documentation

https://docs.rs/crate/budoux/

Usage

Split sentences with internal model.

let model = budoux::models::default_japanese_model();
let words = budoux::parse(model, "これはテストです。");

assert_eq!(words, vec!["これは", "テストです。"])

Load model from json file and split sentences using the loaded model.

let file = File::open(path_to_json).unwrap();
let reader = BufReader::new(file);
let model: budoux::Model = serde_json::from_reader(reader).unwrap();
let words = budoux::parse(&model, "これはテストです。");

assert_eq!(words, vec!["これは", "テストです。"])

Test

cargo test

You can use GitHub Actions locally by act.

act -j test

Generate model from original BudouX

go generate ./...

Note: Generate model is require Go 1.13 or later.

Commit count: 30

cargo fmt