| Crates.io | spatialbench-cli |
| lib.rs | spatialbench-cli |
| version | 0.1.0 |
| created_at | 2025-12-02 07:38:19.242337+00 |
| updated_at | 2025-12-02 07:38:19.242337+00 |
| description | Blazing fast pure Rust SpatialBench data generator command line tool. |
| homepage | https://github.com/apache/sedona-spatialbench/ |
| repository | https://github.com/apache/sedona-spatialbench/ |
| max_upload_size | |
| id | 1961246 |
| size | 291,995 |
See the main README.md for full documentation.
Install this tool with Python:
pip install spatialbench-cli
Install Rust and this tool:
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
cargo install spatialbench-cli
We tried to make the spatialbench-cli experience as close to dbgen as possible for no other
reason than maybe make it easier for you to have a drop-in replacement.
$ spatialbench-cli -h
TPC-H Data Generator
Usage: spatialbench-cli [OPTIONS]
Options:
-s, --scale-factor <SCALE_FACTOR>
Scale factor to address (default: 1) [default: 1]
-o, --output-dir <OUTPUT_DIR>
Output directory for generated files (default: current directory) [default: .]
-T, --tables <TABLES>
Which tables to generate (default: all) [possible values: vehicle, driver, customer, trip, building, zone]
-p, --parts <PARTS>
Number of parts to generate (manual parallel generation) [default: 1]
--part <PART>
Which part to generate (1-based, only relevant if parts > 1) [default: 1]
-f, --format <FORMAT>
Output format: parquet, tbl, csv (default: parquet) [default: parquet] [possible values: parquet, tbl, csv]
-n, --num-threads <NUM_THREADS>
The number of threads for parallel generation, defaults to the number of CPUs [default: 8]
-c, --parquet-compression <PARQUET_COMPRESSION>
Parquet block compression format. Default is SNAPPY [default: SNAPPY]
-v, --verbose
Verbose output (default: false)
--stdout
Write the output to stdout instead of a file
-h, --help
Print help (see more with '--help')
For example generating a dataset with a scale factor of 1 (1GB) can be done like this:
$ spatialbench-cli -s 1 --output-dir=/tmp/spatialbench