Crates.io | chardet |
lib.rs | chardet |
version | 0.2.4 |
source | src |
created_at | 2017-08-24 15:06:09.030651 |
updated_at | 2017-09-15 14:24:42.962131 |
description | rust version of chardet |
homepage | |
repository | https://github.com/thuleqaid/rust-chardet |
max_upload_size | |
id | 28834 |
size | 6,401,483 |
Rust version of chardet.
Put this in your Cargo.toml
:
[dependencies]
chardet = "0.2"
Then put this in your crate root:
extern crate chardet;
Using with encoding:
extern crate chardet;
extern crate encoding;
use chardet;
use std::fs::OpenOptions;
use std::io::prelude::*;
use encoding::DecoderTrap;
use encoding::label::encoding_from_whatwg_label;
// open text file
let mut fh = OpenOptions::new().read(true).open(filepath).expect(
"Could not open file",
);
let mut reader: Vec<u8> = Vec::new();
// read file
fh.read_to_end(&mut reader).expect("Could not read file");
// detect charset of the file
let result = detect(&reader);
// result.0 Encode
// result.1 Confidence
// result.2 Language
// decode file into utf-8
let coder = encoding_from_whatwg_label(charset2encoding(&result.0));
if coder.is_some() {
let utf8reader = coder.unwrap().decode(&reader, DecoderTrap::Ignore).expect("Error");
}