Crates.io | detect-lang |
lib.rs | detect-lang |
version | 0.1.5 |
source | src |
created_at | 2020-02-08 16:52:26.617009 |
updated_at | 2020-02-22 21:20:24.061865 |
description | Language detection given paths and extensions |
homepage | |
repository | https://github.com/vallentin/detect-lang |
max_upload_size | |
id | 206394 |
size | 19,665 |
This crate is a utility for identifying names of programming languages (and related files) from paths and file extensions.
This is not a crate for detecting natural languages.
Add this to your Cargo.toml
:
[dependencies]
detect-lang = "0.1"
Release notes are available in the repo at CHANGELOG.md.
Languages can be identified from paths using from_path
or directly from extensions using from_extension
.
use detect_lang::from_path;
assert_eq!(from_path("foo.rs").unwrap().name(), "Rust");
assert_eq!(from_path("foo.md").unwrap().name(), "Markdown");
use detect_lang::from_extension;
assert_eq!(from_extension("rs").unwrap().name(), "Rust");
assert_eq!(from_extension("md").unwrap().name(), "Markdown");
// The case is ignored
assert_eq!(from_path("foo.jSoN").unwrap().name(), "JSON");
assert_eq!(from_extension("jSoN").unwrap().name(), "JSON");
In short, the language id
is a lowercase version of name
.
However, it also replaces symbols making it usable as a URL slug.
For instance foo.hpp
is identified as language name C++
and
language ID cpp
.
use detect_lang::from_path;
assert_eq!(from_path("foo.rs").unwrap().id(), "rust");
assert_eq!(from_path("foo.cpp").unwrap().id(), "cpp");
assert_eq!(from_path("foo.hpp").unwrap().id(), "cpp");
use detect_lang::from_extension;
assert_eq!(from_extension("rs").unwrap().id(), "rust");
assert_eq!(from_extension("cpp").unwrap().id(), "cpp");
assert_eq!(from_extension("hpp").unwrap().id(), "cpp");
// The case is ignored
assert_eq!(from_path("foo.jSoN").unwrap().id(), "json");
assert_eq!(from_extension("jSoN").unwrap().id(), "json");
If the extension is guaranteed to always be lowercase,
then consider using from_lowercase_extension
to avoid
allocation and conversion to lowercase.
use detect_lang::{from_extension, from_lowercase_extension, Language};
assert_eq!(from_lowercase_extension("json"), Some(Language("JSON", "json")));
assert_eq!(from_lowercase_extension("jSoN"), None);
assert_eq!(from_extension("json"), Some(Language("JSON", "json")));
assert_eq!(from_extension("jSoN"), Some(Language("JSON", "json")));
use std::path::Path;
use detect_lang::{from_path, Language};
let path = Path::new("foo.rs");
match from_path(path) {
// Language(name, id)
Some(Language(_, "rust")) => println!("This is Rust"),
Some(Language(..)) => println!("Well it's not Rust"),
None => println!("Ehh, what?"),
}