matchgen

Crates.iomatchgen
lib.rsmatchgen
version0.3.0
sourcesrc
created_at2023-02-27 21:54:13.36964
updated_at2024-06-18 07:02:38.24958
descriptionGenerate functions to quickly map byte string prefixes to values
homepagehttps://github.com/danielparks/matchgen
repositoryhttps://github.com/danielparks/matchgen
max_upload_size
id796387
size1,472,616
Daniel Parks (danielparks)

documentation

README

Generate functions to quickly map byte string prefixes to values

docs.rs Crates.io Rust version 1.60+

TreeMatcher can be used from a build script to generate a matcher function that maps byte sequences to arbitrary values. It returns the mapped value (or None) and the remainder of the input.

For example, suppose you generate a matcher for all HTML entities called entity_matcher():

assert!(entity_matcher(b"×XYZ") == (Some("×"), b"XYZ".as_slice()));
  • The prefixes it checks do not all have to be the same length.
  • If more than one prefix matches, it will return the longest one.
  • If nothing matches, it will return (None, &input).

Since the matchers only check the start of the input, you will want to use iter().position() or the memchr crate to find the start of a potential match.

It can also be configured to accept an iterator over bytes as input instead of a slice.

Simple example

To create a matcher to handle the four basic HTML entities, use a build script like the following:

use matchgen::TreeMatcher;
use std::env;
use std::error::Error;
use std::fs::File;
use std::io::{BufWriter, Write};
use std::path::Path;

fn main() -> Result<(), Box<dyn Error>> {
    let out_path = Path::new(&env::var("OUT_DIR")?).join("matcher.rs");
    let mut out = BufWriter::new(File::create(out_path)?);

    TreeMatcher::new("pub fn entity_decode", "u8")
        .doc("Decode basic HTML entities.")
        .add(b"&amp;", "b'&'")
        .add(b"&lt;", "b'<'")
        .add(b"&gt;", "b'>'")
        .add(b"&quot;", "b'\"'")
        .render(&mut out)?;

    Ok(())
}

To use the matcher:

include!(concat!(env!("OUT_DIR"), "/matcher.rs"));

fn main() {
    assert_eq!(
      entity_decode(b"&amp; on &amp; on"),
      (Some(b'&'), b" on &amp; on".as_slice()),
    );
}

Development status

This is potentially stable. I’m letting it bake a while to see if I come up with new features or better ways to accomplish the same thing before I release version 1.0.

I am open to suggestions.

License

This project dual-licensed under the Apache 2 and MIT licenses. You may choose to use either.

Contributions

Unless you explicitly state otherwise, any contribution you submit as defined in the Apache 2.0 license shall be dual licensed as above, without any additional terms or conditions.

Commit count: 144

cargo fmt