khmercut

Crates.iokhmercut
lib.rskhmercut
version0.1.5
sourcesrc
created_at2023-10-19 05:45:23.20815
updated_at2023-10-19 05:45:23.20815
descriptionA blazingly fast Khmer word segmentation tool written in Rust.
homepagehttps://github.com/seanghay/khmercut-rs
repositoryhttps://github.com/seanghay/khmercut-rs
max_upload_size
id1007507
size17,720,165
Seanghay Yath (seanghay)

documentation

README

khmercut.rs

A Blazingly Fast Khmer Word Segmentation Tool written in Rust.

let input_str = "ឃាត់ខ្លួនជនសង្ស័យ០៤នាក់ Hello, world ករណីលួចខ្សែភ្លើង នៅស្រុកព្រៃនប់។".to_string();

for token in khmercut::tokenize(&input_str) {
    print!("{}|", token);
}

// => ឃាត់ខ្លួន|ជនសង្ស័យ|០៤|នាក់| |Hello,| |world| |ករណី|លួច|ខ្សែភ្លើង| |នៅ|ស្រុក|ព្រៃនប់|។|

Run

cargo run
Commit count: 16

cargo fmt