Crates.io | detone |
lib.rs | detone |
version | 1.0.1 |
source | src |
created_at | 2019-05-21 08:52:58.642245 |
updated_at | 2024-07-04 11:16:20.687381 |
description | Decompose Vietnamese tone marks |
homepage | https://docs.rs/detone/ |
repository | https://github.com/hsivonen/detone |
max_upload_size | |
id | 135782 |
size | 31,527 |
An iterator adapter that takes an iterator over char
yielding a sequence of
char
s in Normalization Form C (this precondition is not checked!) and
yields char
s either such that tone marks that wouldn't otherwise fit into
windows-1258 are decomposed or such that text is decomposed into orthographic
units.
Use cases include preprocessing before encoding Vietnamese text into windows-1258 or converting precomposed Vietnamese text into a form that looks like it was written with the (non-IME) Vietnamese keyboard layout (e.g. for machine learning training or benchmarking purposes).
Please see the file named COPYRIGHT.
Generated API documentation is available online.
1.60 to use, 1.67 to run tests. Pin version 1.0.0 of this crate if you need an even lower MSRV; there are no non-test changes.