processing_chain

Crates.ioprocessing_chain
lib.rsprocessing_chain
version0.2.2
sourcesrc
created_at2022-09-04 07:29:46.019107
updated_at2022-11-27 13:23:52.282486
descriptionRust library to set up processing chains of large amounts of data.
homepage
repositoryhttps://github.com/giorgiosavastano/process
max_upload_size
id658157
size41,829
Giorgio Savastano (giorgiosavastano)

documentation

README

processing-chain

processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.

Please read the API documentation on docs.rs or take a look at the examples.

processing-chain is based on the concept of Item which is an abstraction that is used to spawn all the processes in parallel. All the user needs to do is define:

  • The Items to be processed
  • The function that processes a single Item

processing-chain will take care of spawning the process across all Items via parallelization. The user can also provide some extra processing configuration information (e.g., overwrite).

Highlights

  • Set-up generic data processing chains

Define the Items

Using a JSON file

[
    {
        "name": "item_1",
        "input_item_paths": ["test_1.npy", "test_2.npy", "test_2.npy"],
        "output_item_paths": ["output_1.nc"]
    },
    {
        "name": "item_2",
        "input_item_paths": ["test_1.npy", "test_2.npy"],
        "output_item_paths": ["output_2.nc"]
    },
    {
        "name": "item_3",
        "input_item_paths": ["test_6.npy", "test_7.npy", "test_8.npy"],
        "output_item_paths": ["output_3.nc"]
    }
]

Write the _process_item function

In rust:

fn _process_item(item: &Item) -> Result<bool> {
    // define how to process a single item
    println!(
        "Processing {} {:?} -> {:?}",
        item.name, item.input_item_path, item.output_item_path
    );
    // ...

    Ok(true)
}

If your function is written in Python and you don't feel like converting it to Rust (yet), you could use the inline-python crate.

use inline_python::python;

fn _process_item(item: &Item) -> Result<bool> {
    // define how to process a single item
    python! {
        print("Processing {} {} -> {}".format('item.name, item.input_item_path, item.output_item_path))
	};
    // ...

    Ok(true)
}

Some examples can be found here.

Commit count: 44

cargo fmt