jwalk-meta

Crates.iojwalk-meta
lib.rsjwalk-meta
version0.9.4
sourcesrc
created_at2023-04-27 04:05:52.469762
updated_at2024-04-01 16:47:50.825832
descriptionFilesystem walk performed in parallel with streamed and sorted results.
homepagehttps://github.com/brmmm3/jwalk-meta
repositoryhttps://github.com/brmmm3/jwalk-meta
max_upload_size
id850025
size177,280
Marty B. (brmmm3)

documentation

https://docs.rs/jwalk-meta/

README

jwalk-meta

Filesystem walk.

  • Performed in parallel using rayon
  • Entries streamed in sorted order
  • Custom sort/filter/skip/state

This is a fork of https://github.com/Byron/jwalk. This project adds optional collecting metadata to improve performance if metadata is needed later.

Build Status Latest version

Usage

Add this to your Cargo.toml:

[dependencies]
jwalk-meta = "0.9"

Lean More: docs.rs/jwalk-meta

Example

Recursively iterate over the "foo" directory sorting by name:

use jwalk_meta::{WalkDir};

for entry in WalkDir::new("foo").sort(true) {
  println!("{}", entry?.path().display());
}

Inspiration

This crate is inspired by both walkdir and ignore. It attempts to combine the parallelism of ignore with walkdir's streaming iterator API. Some code and comments are copied directly from walkdir.

Why use this crate?

This crate is particularly good when you want streamed sorted results. In my tests it's about 4x walkdir speed for sorted results with metadata. Also this crate's process_read_dir callback allows you to arbitrarily sort/filter/skip/state entries before they are yielded.

Why not use this crate?

Directory traversal is already pretty fast. If you don't need this crate's speed then walkdir provides a smaller and more tested single threaded implementation.

This crates parallelism happens at the directory level. It will help when walking deep file systems with many directories. It wont help when reading a single directory with many files.

Benchmarks

Benchmarks comparing this crate with walkdir and ignore.

Commit count: 112

cargo fmt