tesseract-static

Crates.iotesseract-static
lib.rstesseract-static
version0.2.1
created_at2023-03-16 18:49:26.389362+00
updated_at2025-01-25 21:09:01.971679+00
descriptionSTATICALLY LINKED tesseract + leptonica bindings for easy inclusion of tesseract-ocr in binary applications
homepage
repository
max_upload_size
id811990
size130,072
Felix Schütt (fschutt)

documentation

README

tesseract-static

Rust crate to automatically link and build tesseract and leptonica into a Rust program.

The crate includes the leptonica.dll / tesseract.dll at compile time, so these binaries will be automatically shipped with a build.

To rebuild tesseract / leptonica from source, cd rebuild-tesseract && cargo run --release then copy the output from /target/release/leptonica.dll and /target/release/tesseract.dll.

Usage

Run cargo run --release --example simple for an example:

# Cargo.toml
[dependencies]
tesseract-static = { version = "0.2.0", features = ["parse"] }
use tesseract_static::tesseract::Tesseract;

const TRAINING_DATA: &[u8] = include_bytes!("./eng.traineddata");

fn main() {

    // setup training data
    let parent = std::env::temp_dir();
    std::fs::write(&parent.join("eng.traineddata"), &TRAINING_DATA[..]).unwrap();

    let hocr_xml = Tesseract::new(Some(&parent.display().to_string()), Some("eng"))
    .unwrap()
    .set_image_from_mem(include_bytes!("../../testocr.pnm"))
    .unwrap()
    .get_hocr_text(1)
    .unwrap();

    let hocr = tesseract_static::parse::ParsedHocr::new(&hocr_xml).unwrap();

    println!("{hocr:#?}"); // parsed hOCR text from tesseract, includes rect bounds
}

License

MIT

Commit count: 0

cargo fmt