Crates.io | unicode-language |
lib.rs | unicode-language |
version | 2.0.3 |
source | src |
created_at | 2023-08-14 12:21:09.475113 |
updated_at | 2024-05-31 07:55:51.300284 |
description | A library to detect language coverage given a list of codepoints. |
homepage | |
repository | https://github.com/the-type-founders/unicode-language-rs/ |
max_upload_size | |
id | 944107 |
size | 283,642 |
This library detects language support given a list of Unicode codepoints. This is primarily useful for finding out what languages a font supports.
// Input codepoints as a vector of Unicode ranges
let codepoints = vec![[65, 121]];
// Detect languages with a threshold of 0.5
let results = detect(codepoints, 0.5);
// results[0].tag = "en"
// results[1].tag = "nl"
// ...
The result is a vector of Match
structs, with the following signature:
struct Match {
// BCP 47 language tag.
tag: &'static str,
// English name.
name: &'static str,
// Name in native script.
native: &'static str,
// Number of codepoints matched.
count: u32,
// Score (number of codepoints matched divided by the total).
score: f64,
}
The language data is derived from Adobe Font's speakeasy library. The data is extracted at build time and statically compiled as part of the library.
This library is licensed under the Apache-2.0 license. Copyright 2023, The Type Founders.