| Crates.io | datapool |
| lib.rs | datapool |
| version | 0.1.4 |
| created_at | 2022-09-01 09:56:25.407186+00 |
| updated_at | 2022-09-06 21:40:39.452405+00 |
| description | Tool to generate a pool of testdata |
| homepage | |
| repository | https://github.com/sqsh-project/datapool |
| max_upload_size | |
| id | 656566 |
| size | 17,617 |
A tool to generate synthetic floating point datasets. Currently the only datasets with a gaussian/normal distribution are supported. Further distributions can be added if the need may rise.
Use stable cargo release to install the toolset:
cargo install datapool
Use latest version from github:
git clone https://github.com/sqsh-project/datapool.git && \
cd datapool && \
cargo install --path .
The easiest way to generate a dataset is to specify mean, standard deviation and the number of numbers to be generated via num:
datapool --mean 10 --std 2 --size 1000 # mandatory arguments
The default generates single-precision floating-point data. This can be changed using the --datatype argument:
datapool --mean 10 --std 2 --size 1000 --datatype double
Should it be necessary to define a different endianess than the machine native the --endianess argument can be used
datapool --mean 10 --std 2 --size 1000 --endianess little
By providing a --seed value the generated data can be reproduced on different environments:
datapool --mean 10 --std 2 --size 1000 --seed 42
The data can be output to a file by piping the result to a file:
datapool --mean 10 --std 2 --size 1000 > /tmp/data.raw # save to file
For more information the help menu can be used: datapool --help