Crates.io | sirun |
lib.rs | sirun |
version | 0.1.9 |
source | src |
created_at | 2021-05-21 20:15:35.784582 |
updated_at | 2022-02-10 22:18:35.307471 |
description | A benchmark test runner |
homepage | |
repository | |
max_upload_size | |
id | 400476 |
size | 221,747 |
sirun
(pronounced like "siren") is a tool for taking basic perfomance
measurements of a process covering its entire lifetime. It gets memory and
timing information from the kernel and also allows
Statsd messages to be sent to
udp://localhost:$SIRUN_STATSD_PORT
(the port is assigned randomly by sirun,
but you can also set it yourself), and those will be included in the outputted
metrics.
It's intended that this tool be used for shorter-running benchmarks, and not for long-lived processes that don't die without external interaction. You could certainly use it for long-lived processes, but that's not where it shines.
cargo install sirun
Release bundles are provided for each supported plaform. Extract the binary somewhere and use it.
Make sure you have rustup
installed, and use that to
ensure you have the latest stable Rust toolchain enabled.
cargo install .
With SSH
cargo install --git ssh://git@github.com:22/DataDog/sirun.git --branch main
or with HTTPS
cargo install --git https://github.com/DataDog/sirun.git --branch main
See also the documentation.
Create a JSON or YAML file with the following properties:
name
: This will be included in the results JSON.run
: The command to run and test. You can format this like a shell
command with arguments, but note that it will not use a shell as an
intermediary process. Note that subprocesses will not be measured via the
kernel, but they can still use Statsd. To send metrics to Statsd from inside
this process, send them to udp://localhost:$SIRUN_STATSD_PORT
.setup
: A command to run before the test. Use this to ensure the
availability of services, or retrieve some last-minute dependencies. This can
be formatted the same way as run
. It will be run repeatedly at 1 second
intervals until it exits with status code 0.teardown
: A command to run after the test. This is run in the same
manner as setup
, except after the test has run instead of before.timeout
: If provided, this is the maximum time, in seconds, a run
test
can run for. If it times out, sirun
will exit with no results, aborting the
test.env
: A set of environment variables to make available to the run
and
setup
programs. This should be an object whose keys are the environment
variable names and whose values are the environment variable values.iterations
: The number of times to run the the run
test. The results
for each iteration will be in an iterations
array in the resultant JSON. The
default is 1.cachegrind
: If set to true
, will run the test (after having already
run it normally) using cachegrind to add an instruction count to the results
JSON. Will only happen once, and be inserted into the top level JSON,
regardless of iterations
. This requires that valgrind
is installed on your
system.variants
: An array or object whose values are config objects, whose
properties may be any of the properties above. It's not recommended to include
name
in a variant. The variant name (if variants
is an object) or index
(if variants
is an array) will be included in resultant JSON.GIT_COMMIT_HASH
: If set, will include a version
in the
results.SIRUN_NAME
: If set, will include a name
in the results. This overrides
any name
property set in config JSON/YAML.SIRUN_NO_STDIO
: If set, supresses output from the tested program.SIRUN_VARIANT
: Selects which variant of the test to run. If the
variants
property exists in the config JSON/YAML, and this variable is not
set, then all variants will be run, one-by-one, each having its own line of
output JSON.SIRUN_STATSD_PORT
: The UDP port on localhost to use for Statsd
communication between tested processes and sirun. By default a random port
will be assigned. You should read this variable from tested programs to
determine which port to send data to.Here's an example JSON file. As an example of a setup
script, it's checking for
connectivity to Google. The run
script doesn't do much, but it does send a
single metric (with name udp.data
and value 50) via Statsd. It times out after
4 seconds, and we're not likely to reach that point.
{
"setup": "curl -I http://www.google.com -o /dev/null",
"run": "bash -c \"echo udp.data:50\\|g > /dev/udp/127.0.0.1/$SIRUN_STATSD_PORT\"",
"timeout": 4
}
You can then pass this JSON file to sirun
on the command line. Remember that
you can use environment variables to set the git commit hash and test name in
the output.
SIRUN_NAME=test_some_stuff GIT_COMMIT_HASH=123abc sirun ./my_benchmark.json
This will output something like the following.
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
{"version":"123abc","name":"test_some_stuff",iterations:[{"user.time":6389.0,"system.time":8737.0,"udp.data":50.0,"max.res.size":2240512.0}]}
If you provide the --summarize
option, sirun
will switch to summary mode. In
summary mode, it will read from stdin
, expecting line-by-line of output from
previous sirun runs. It will then aggregate them by test name and variant, and
provide summary statistics over iterations. The output is pretty-printed JSON.
E.g.
$ sirun foo-test.json >> results.ndjson
$ sirun bar-test.json >> results.ndjson
$ sirun baz-test.json >> results.ndjson
$ cat results.ndjson | sirun --summarize > summary.json
Licensed under either of
at your option.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.