datapool

Crates.iodatapool
lib.rsdatapool
version0.1.4
sourcesrc
created_at2022-09-01 09:56:25.407186
updated_at2022-09-06 21:40:39.452405
descriptionTool to generate a pool of testdata
homepage
repositoryhttps://github.com/sqsh-project/datapool
max_upload_size
id656566
size17,617
Cayoglu (ucyo)

documentation

https://docs.rs/datapool

README

datapool

A tool to generate synthetic floating point datasets. Currently the only datasets with a gaussian/normal distribution are supported. Further distributions can be added if the need may rise.

Install

Use stable cargo release to install the toolset:

cargo install datapool

Use latest version from github:

git clone https://github.com/sqsh-project/datapool.git && \
cd datapool && \
cargo install --path .

Usage

The easiest way to generate a dataset is to specify mean, standard deviation and the number of numbers to be generated via num:

datapool --mean 10 --std 2 --size 1000 # mandatory arguments

The default generates single-precision floating-point data. This can be changed using the --datatype argument:

datapool --mean 10 --std 2 --size 1000 --datatype double

Should it be necessary to define a different endianess than the machine native the --endianess argument can be used

datapool --mean 10 --std 2 --size 1000 --endianess little

By providing a --seed value the generated data can be reproduced on different environments:

datapool --mean 10 --std 2 --size 1000 --seed 42

The data can be output to a file by piping the result to a file:

datapool --mean 10 --std 2 --size 1000 > /tmp/data.raw  # save to file

For more information the help menu can be used: datapool --help

Commit count: 8

cargo fmt