Crates.io | chisel-decoders |
lib.rs | chisel-decoders |
version | 1.1.0 |
source | src |
created_at | 2023-03-17 16:42:19.329602 |
updated_at | 2023-10-30 22:34:09.270244 |
description | Chisel UTF-8 and ASCII byte stream decoder implementations |
homepage | |
repository | https://github.com/jonnycoombes/chisel-core/tree/trunk/chisel-decoders |
max_upload_size | |
id | 812842 |
size | 17,503,941 |
This crate contains very simple, lean implementations of decoders that consume u8 bytes from a given Read implementation and decode them into the Rust internal char type using either UTF-8 or ASCII.
The decoder implementations are pretty fast and loose: under the covers they utilise some bit-twiddlin' in conjunction with the unsafe transmute function to do the conversions.
No string allocations are used during conversion.
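To give a feel for the kind of bit-twiddling involved, here is a minimal std-only sketch of decoding one UTF-8 sequence at a time from a Read. It is not the crate's actual code: the helper names (next_byte, invalid, decode_next) are made up for illustration, and it uses the safe char::from_u32 where the crate is described as using an unsafe transmute. It also does not reject overlong encodings.

```rust
use std::io::{self, Read};

/// Hypothetical helper (not part of chisel-decoders): reads a single byte,
/// returning Ok(None) on a clean end of input.
fn next_byte<R: Read>(reader: &mut R) -> io::Result<Option<u8>> {
    let mut buf = [0u8; 1];
    loop {
        match reader.read(&mut buf) {
            Ok(0) => return Ok(None),
            Ok(_) => return Ok(Some(buf[0])),
            Err(e) if e.kind() == io::ErrorKind::Interrupted => continue,
            Err(e) => return Err(e),
        }
    }
}

/// Hypothetical helper: builds an InvalidData error with the given message.
fn invalid(msg: &str) -> io::Error {
    io::Error::new(io::ErrorKind::InvalidData, msg.to_string())
}

/// Decodes the next UTF-8 sequence from `reader` into a `char`.
/// Returns Ok(None) once the input is exhausted. No String is allocated
/// on the success path; bytes are folded straight into a u32 code point.
fn decode_next<R: Read>(reader: &mut R) -> io::Result<Option<char>> {
    let first = match next_byte(reader)? {
        Some(b) => b,
        None => return Ok(None),
    };
    // The leading byte's high bits give the number of continuation bytes
    // and how many payload bits the leading byte itself carries.
    let (extra, mut code) = match first {
        0x00..=0x7F => (0, first as u32),
        0xC0..=0xDF => (1, (first & 0x1F) as u32),
        0xE0..=0xEF => (2, (first & 0x0F) as u32),
        0xF0..=0xF7 => (3, (first & 0x07) as u32),
        _ => return Err(invalid("invalid leading byte")),
    };
    for _ in 0..extra {
        let b = next_byte(reader)?.ok_or_else(|| invalid("truncated sequence"))?;
        // Continuation bytes must look like 10xxxxxx.
        if b & 0xC0 != 0x80 {
            return Err(invalid("invalid continuation byte"));
        }
        // Shift in the six payload bits of this continuation byte.
        code = (code << 6) | (b & 0x3F) as u32;
    }
    // Safe stand-in for the transmute: rejects surrogates and
    // values above U+10FFFF.
    char::from_u32(code)
        .map(Some)
        .ok_or_else(|| invalid("not a Unicode scalar value"))
}
```

Calling decode_next repeatedly on a byte slice or a BufReader until it returns Ok(None) yields the decoded characters one by one.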
Usage is very simple, provided you have something that implements Read in order to source some bytes:
Just wrap your array in a mut reader, and then plug it into a new instance of Utf8Decoder:
use std::io::BufReader;
use chisel_decoders::utf8::Utf8Decoder;
let buffer: &[u8] = &[0x10, 0x12, 0x23, 0x12];
let mut reader = BufReader::new(buffer);
let _decoder = Utf8Decoder::new(&mut reader);
If you're fairly certain that you're dealing with ASCII only, then just pick the AsciiDecoder instead:
use std::io::BufReader;
use chisel_decoders::ascii::AsciiDecoder;
let buffer: &[u8] = &[0x10, 0x12, 0x23, 0x12];
let mut reader = BufReader::new(buffer);
let _decoder = AsciiDecoder::new(&mut reader);
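For context on what "ASCII only" means at the byte level, the following std-only sketch maps each byte below 0x80 directly to a char and reports anything else as an error. The ascii_chars function is a hypothetical illustration, and how AsciiDecoder itself treats non-ASCII bytes is not stated here, so don't read this as the crate's behaviour.

```rust
use std::io::{self, Read};

/// Hypothetical helper (not part of chisel-decoders): lazily maps each
/// byte from `reader` to a `char`, yielding an InvalidData error for any
/// byte outside the 7-bit ASCII range.
fn ascii_chars<R: Read>(reader: R) -> impl Iterator<Item = io::Result<char>> {
    reader.bytes().map(|byte| -> io::Result<char> {
        let b = byte?;
        if b.is_ascii() {
            // Every ASCII byte is also a valid Unicode scalar value,
            // so a plain cast is enough.
            Ok(b as char)
        } else {
            Err(io::Error::new(
                io::ErrorKind::InvalidData,
                format!("non-ASCII byte 0x{:02X}", b),
            ))
        }
    })
}
```

Read::bytes issues one read call per byte, so in practice the source would be wrapped in a BufReader first, as in the examples above.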
Just crack open your file, wrap it in a Read instance, and then plug it into a new instance of Utf8Decoder:
use std::fs::File;
use std::io::BufReader;
use std::path::PathBuf;
use chisel_decoders::utf8::Utf8Decoder;
let path = PathBuf::from("./Cargo.toml");
let f = File::open(path);
let mut reader = BufReader::new(f.unwrap());
let _decoder = Utf8Decoder::new(&mut reader);
Once you've created an instance of a specific decoder, you simply iterate over the chars in order to pull out the decoded characters (a decoder implements Iterator<Item=char>):
use std::fs::File;
use std::io::BufReader;
use std::path::PathBuf;
use chisel_decoders::utf8::Utf8Decoder;
let path = PathBuf::from("./Cargo.toml");
let f = File::open(path);
let mut reader = BufReader::new(f.unwrap());
let decoder = Utf8Decoder::new(&mut reader);
// Each item yielded by the decoder is a decoded char
for c in decoder {
    println!("char: {}", c);
}
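Since a decoder is an Iterator<Item=char>, code that consumes the characters can be written against that trait bound rather than a concrete decoder type. A small sketch, with count_lines_and_chars as a hypothetical helper that is not part of the crate:

```rust
/// Hypothetical helper (not part of chisel-decoders): walks any iterator
/// of chars once and returns (number of '\n' characters, total chars).
fn count_lines_and_chars<I: Iterator<Item = char>>(chars: I) -> (usize, usize) {
    let mut lines = 0;
    let mut total = 0;
    for c in chars {
        total += 1;
        if c == '\n' {
            lines += 1;
        }
    }
    (lines, total)
}
```

The helper can be exercised with "some text".chars(), and, assuming the decoder behaves as an ordinary iterator, it should accept the decoder from the example above in the same way.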
As you would expect, just run cargo build in order to build the crate.
If you have any suggestions, requests or even just comments relating to this crate, then please just add an issue and I'll try and take a look when I get a chance. Please feel free to fork this repo if you want to utilise/modify this code in any of your own work.