Crates.io | quickdiv |
lib.rs | quickdiv |
version | 0.1.1 |
source | src |
created_at | 2023-09-12 15:46:41.074266 |
updated_at | 2023-10-11 20:20:47.240163 |
description | Faster repeated division and modulo operations by the same divisor |
homepage | |
repository | https://github.com/dtrifuno/quickdiv |
max_upload_size | |
id | 970779 |
size | 46,119 |
QuickDiv is a Rust crate that allows you to speed up repeated division and modulo operations by the same divisor, based on the libdivide C/C++ library.
On most hardware today integer division operations take longer to execute compared to operations like multiplication and addition. Because of this, compilers generally optimize division by a constant, by replacing it with a cheaper sequence of shifts, multiplications and additions. This crate lets you apply a similar algorithm to optimize division by values that are only known at runtime.
Performance gains will vary between platforms, CPUs, and integer widths, but you can expect dividing an integer by a precomputed divisor to be somewhere between 2 to 10 times faster compared to the built-in hardware division method. Note that preparing the divisor is more expensive than a single unoptimized division: it will take at least 2 divisions by the same divisor to break even.
This crate supports primitive integer types of all widths, in both signed and
unsigned variants. It requires Rust version 1.54 or greater. It is #![no_std]
and #![forbid(unsafe_code)]
.
use quickdiv::DivisorU64;
fn is_quadratic_residue(q: u64, modulus: u64) -> bool {
// Initializing a divisor is more expensive than a single
// unoptimized division, to gain a benefit you must divide
// multiple times by the same divisor.
let modulus = DivisorU64::new(modulus);
// The original value can be recovered by using ::get().
for x in (0..modulus.get()) {
// A divisor can be used as the second operand with
// the / and % operators.
if (x * x) % modulus == q {
return true;
}
}
false
}
assert!(is_quadratic_residue(152, 169));
assert!(!is_quadratic_residue(51, 111));
The following benchmarks should give a rough sense of the kind of speed-up you
can expect. The numbers represent throughput in millions of elements per second
(larger is better) on various tasks.
For Quotient Sum
we divide a collection of random integers by a fixed
divisor, in LCG
we compute the remainder given a fixed modulus
and, finally, in FizzBuzz
we check the divisibility of random integers by a
fixed divisor.
Task | CPU | Compiler | QuickDiv |
---|---|---|---|
Quotient Sum | 248.3 | 838 | 823.5 |
LCG | 168.8 | 252.7 | 255 |
FizzBuzz | 41.13 | 1350 | 556 |
Note that while QuickDiv computes the remainder and checks if its
zero, the compiler uses a different method to directly check divisibility,
leading to faster performance on the FizzBuzz
task.
u64
only. Performance can vary with width and
signedness.If you would like to run these benchmarks yourself, check out the benchmarks
crate in the GitHub
repository.
Licensed under any of:
by your choice.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be multi-licensed as above, without any additional terms or conditions.