detect-lang

Crates.iodetect-lang
lib.rsdetect-lang
version0.1.5
sourcesrc
created_at2020-02-08 16:52:26.617009
updated_at2020-02-22 21:20:24.061865
descriptionLanguage detection given paths and extensions
homepage
repositoryhttps://github.com/vallentin/detect-lang
max_upload_size
id206394
size19,665
Christian Vallentin (vallentin)

documentation

https://docs.rs/detect-lang

README

detect-lang

Build Status Latest Version Docs License

This crate is a utility for identifying names of programming languages (and related files) from paths and file extensions.

This is not a crate for detecting natural languages.

Usage

Add this to your Cargo.toml:

[dependencies]
detect-lang = "0.1"

Releases

Release notes are available in the repo at CHANGELOG.md.

Paths and Extensions

Languages can be identified from paths using from_path or directly from extensions using from_extension.

use detect_lang::from_path;
assert_eq!(from_path("foo.rs").unwrap().name(), "Rust");
assert_eq!(from_path("foo.md").unwrap().name(), "Markdown");

use detect_lang::from_extension;
assert_eq!(from_extension("rs").unwrap().name(), "Rust");
assert_eq!(from_extension("md").unwrap().name(), "Markdown");

// The case is ignored
assert_eq!(from_path("foo.jSoN").unwrap().name(), "JSON");
assert_eq!(from_extension("jSoN").unwrap().name(), "JSON");

Language ID

In short, the language id is a lowercase version of name. However, it also replaces symbols making it usable as a URL slug.

For instance foo.hpp is identified as language name C++ and language ID cpp.

use detect_lang::from_path;
assert_eq!(from_path("foo.rs").unwrap().id(), "rust");
assert_eq!(from_path("foo.cpp").unwrap().id(), "cpp");
assert_eq!(from_path("foo.hpp").unwrap().id(), "cpp");

use detect_lang::from_extension;
assert_eq!(from_extension("rs").unwrap().id(), "rust");
assert_eq!(from_extension("cpp").unwrap().id(), "cpp");
assert_eq!(from_extension("hpp").unwrap().id(), "cpp");

// The case is ignored
assert_eq!(from_path("foo.jSoN").unwrap().id(), "json");
assert_eq!(from_extension("jSoN").unwrap().id(), "json");

Always Lowercase

If the extension is guaranteed to always be lowercase, then consider using from_lowercase_extension to avoid allocation and conversion to lowercase.

use detect_lang::{from_extension, from_lowercase_extension, Language};

assert_eq!(from_lowercase_extension("json"), Some(Language("JSON", "json")));
assert_eq!(from_lowercase_extension("jSoN"), None);

assert_eq!(from_extension("json"), Some(Language("JSON", "json")));
assert_eq!(from_extension("jSoN"), Some(Language("JSON", "json")));

Match Example

use std::path::Path;
use detect_lang::{from_path, Language};

let path = Path::new("foo.rs");
match from_path(path) {
    //   Language(name, id)
    Some(Language(_, "rust")) => println!("This is Rust"),
    Some(Language(..))        => println!("Well it's not Rust"),
    None                      => println!("Ehh, what?"),
}
Commit count: 18

cargo fmt