| Crates.io | rapidhash |
| lib.rs | rapidhash |
| version | 4.1.0 |
| created_at | 2024-09-17 02:53:47.364133+00 |
| updated_at | 2025-09-12 11:29:51.471274+00 |
| description | An extremely fast, high quality, platform-independent hashing algorithm. |
| homepage | |
| repository | https://github.com/hoxxep/rapidhash |
| max_upload_size | |
| id | 1377087 |
| size | 234,715 |
A rust implementation of rapidhash, the official successor to wyhash.
unsafe feature is disabled (default). This implementation has also been fuzz-tested with cargo fuzz.const.std::hash::Hasher compatible hasher for HashMap and HashSet.Sponsored by Upon, inheritance vaults for your digital life. Ensure your family can access your devices, accounts, and assets when the unexpected happens.
Following rust's std::hash traits, the underlying hash function may change between minor versions, and is only suitable for in-memory hashing. These types are optimised for speed and minimal DoS resistance, available in the rapidhash::fast and rapidhash::quality flavours.
RapidHasher: A std::hash::Hasher compatible hasher that uses the rapidhash algorithm.RandomState: A std::hash::BuildHasher for initialising the hasher with a random seed and secrets.SeedableState: A std::hash::BuildHasher for initialising the hasher with the custom seed and secrets.RapidHashMap and RapidHashSet: Helper types for using fast::RandomState with HashMap and HashSet.use rapidhash::RapidHashMap;
// A HashMap using RapidHasher for fast in-memory hashing.
let mut map = RapidHashMap::default();
map.insert("key", "value");
use std::hash::BuildHasher;
use rapidhash::quality::SeedableState;
// Using the RapidHasher directly for in-memory hashing.
let hasher = SeedableState::fixed();
assert_eq!(hasher.hash_one(b"hello world"), 3348275917668072623);
Full compatibility with C++ rapidhash algorithms, methods are provided for all rapidhash V1, V2, and V3 (with micro/nano) variants. These are stable functions whose output will not change between crate versions.
use std::hash::{BuildHasher, Hasher};
use rapidhash::v3::{rapidhash_v3_seeded, rapidhash_v3_file_seeded, RapidSecrets};
/// Set your global hashing secrets.
/// - For HashDoS resistance, choose a randomised secret.
/// - For C++ compatibility, use the `seed_cpp` method or `DEFAULT_RAPID_SECRETS`.
const RAPID_SECRETS: RapidSecrets = RapidSecrets::seed(0x123456);
/// A helper function for your chosen rapidhash version and secrets.
#[inline]
pub fn rapidhash(data: &[u8]) -> u64 {
rapidhash_v3_seeded(data, &RAPID_SECRETS)
}
/// Hash streaming data with the rapidhash V3 algorithm.
pub fn rapidhash_stream<R: std::io::Read>(reader: R) -> std::io::Result<u64> {
rapidhash_v3_file_seeded(reader, &RAPID_SECRETS)
}
assert_eq!(rapidhash(b"hello world"), 11653223729569656151);
assert_eq!(rapidhash_stream(std::io::Cursor::new(b"hello world")).unwrap(), 11653223729569656151);
Please see the portable-hash crate for why using the standard library hashing traits is not recommended for portable hashing. Rapidhash is planning to implement the PortableHash and PortableHasher traits in a future release.
Rapidhash can also be installed as a CLI tool to hash files or stdin. This is not a cryptographic hash, but should be much faster than cryptographic hashes. This is fully compatible with the C++ rapidhash V1, V2, and V3 algorithms.
Output is the decimal string of the u64 hash value.
# install
cargo install rapidhash
# hash a file (output: 8543579700415218186)
rapidhash --v3 example.txt
# hash stdin (output: 8543579700415218186)
echo "example" | rapidhash --v3
default: stdstd: Enables the RapidHashMap and RapidHashSet helper types.rand: Enables using the rand library to more securely initialise RandomState. Includes the rand crate dependency.rng: Enables RapidRng, a fast, non-cryptographic PRNG based on rapidrng. Includes the rand_core crate dependency.unsafe: Uses unsafe pointer arithmetic to skip some unnecessary bounds checks for a small 3-4% performance improvement.nightly: Enable nightly-only features for even faster hashing, such as overriding Hasher::write_str and likely hints.In our benchmarking, rapidhash is one of the fastest general-purpose non-cryptographic hash functions. It places second to gxhash on some benchmarks, but gxhash is not portable, requires AES instructions to compile, and its main advantage is hashing string types. Between rapidhash, gxhash, and the default siphasher, we see little reason to use other hash functions on modern platforms without specifically benchmarking them for your workload and platform.
Rapidhash uses raw throughput benchmarks (the charts) to measure performance over various input sizes, and the foldhash benchmark suite (the txt tables) to measure workloads that are closer to real-world usage. The foldhash suite benchmarks hashers by measuring raw hash throughput, hashmap lookup miss, hashmap lookup hit, and hashmap insertion performance on a wide variety of commonly hashed types.
The benchmarks have been compiled with and without -C target-cpu=native on a variety of platforms to demonstrate rapidhash's strong all-round performance. The full results are available in the docs folder and are summarised below.
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.11 ┆ 3.53 ┆ 2.84 ┆ 4.62 ┆ 2.88 ┆ 5.05 ┆ 6.97 │
│ geometric_mean ┆ 4.29 ┆ 4.82 ┆ 4.83 ┆ 5.24 ┆ 5.50 ┆ 5.94 ┆ 22.17 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ gxhash ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.23 ┆ 3.94 ┆ 3.30 ┆ 5.08 ┆ 4.69 ┆ 3.16 ┆ 5.64 ┆ 7.97 │
│ geometric_mean ┆ 4.25 ┆ 4.79 ┆ 4.79 ┆ 5.19 ┆ 4.93 ┆ 5.48 ┆ 5.91 ┆ 21.99 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.27 ┆ 3.88 ┆ 3.08 ┆ 4.66 ┆ 2.11 ┆ 5.05 ┆ 6.97 │
│ geometric_mean ┆ 7.82 ┆ 9.03 ┆ 8.53 ┆ 9.66 ┆ 8.02 ┆ 10.98 ┆ 29.31 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ gxhash ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.59 ┆ 4.20 ┆ 3.38 ┆ 5.28 ┆ 4.09 ┆ 2.50 ┆ 5.98 ┆ 7.97 │
│ geometric_mean ┆ 7.84 ┆ 8.97 ┆ 8.56 ┆ 9.68 ┆ 8.59 ┆ 8.15 ┆ 11.16 ┆ 32.59 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.05 ┆ 3.75 ┆ 2.81 ┆ 4.42 ┆ 3.09 ┆ 4.91 ┆ 6.97 │
│ geometric_mean ┆ 4.67 ┆ 5.38 ┆ 5.27 ┆ 5.99 ┆ 6.13 ┆ 6.50 ┆ 23.66 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ gxhash ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.56 ┆ 4.36 ┆ 3.45 ┆ 5.38 ┆ 4.31 ┆ 3.36 ┆ 4.61 ┆ 7.97 │
│ geometric_mean ┆ 4.68 ┆ 5.34 ┆ 5.24 ┆ 5.91 ┆ 5.01 ┆ 5.98 ┆ 5.63 ┆ 25.75 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 1.86 ┆ 3.83 ┆ 2.86 ┆ 4.50 ┆ 2.95 ┆ 5.03 ┆ 6.97 │
│ geometric_mean ┆ 4.52 ┆ 5.18 ┆ 4.95 ┆ 5.55 ┆ 5.67 ┆ 6.33 ┆ 20.24 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴───────┴─────────┘
┌────────────────┬─────────────┬─────────────┬────────────┬────────────┬────────┬────────┬───────┬─────────┐
│ metric ┆ rapidhash-f ┆ rapidhash-q ┆ foldhash-f ┆ foldhash-q ┆ gxhash ┆ fxhash ┆ ahash ┆ siphash │
╞════════════════╪═════════════╪═════════════╪════════════╪════════════╪════════╪════════╪═══════╪═════════╡
│ avg_rank ┆ 2.38 ┆ 4.69 ┆ 3.52 ┆ 5.30 ┆ 4.08 ┆ 3.39 ┆ 4.69 ┆ 7.97 │
│ geometric_mean ┆ 4.46 ┆ 5.09 ┆ 4.88 ┆ 5.42 ┆ 4.73 ┆ 5.58 ┆ 5.26 ┆ 21.34 │
└────────────────┴─────────────┴─────────────┴────────────┴────────────┴────────┴────────┴───────┴─────────┘
target-cpu=native or specific feature flags to compile. Gxhash is a great choice for applications that can guarantee the availability of AES instructions and mostly hash strings, but rapidhash may be preferred for hashing tuples and structs, or by libraries that aim to support a wide range of platforms.target-cpu=native on various x86 and ARM chips.lto = "fat" and codegen-units = 1 in your Cargo.toml release and bench profiles to ensure consistent inlining, application performance, and benchmarking results. For example:
[profile.release]
opt-level = 3
lto = "fat"
codegen-units = 1
Rapidhash is a keyed hash function and the rust implementation deviates from its C++ counterpart by also randomising the secrets array. The algorithm primarily relies on the same 128-bit folded multiply mixing step used by foldhash and ahash's fallback algorithm. It aims to be immune to length extension and re-ordering attacks.
We believe rapidhash is a minimally DoS resistant hash function, such that a non-interactive attacker cannot trivially create collisions if they do not know the seed or secrets. The adverb "minimally" is used to describe that rapidhash is not a cryptographic hash, it is possible to construct collisions if the seed or secrets are known, and it may be possible for an interactive attacker to learn the seed by observing hash outputs or application response times over a large number of inputs.
Provided rapidhash has been instantiated through RandomState or RapidSecrets using a randomised secret seed, we believe rapidhash is minimally resistant to hash DoS attacks.
C++ compatibility is presented in rapidhash::v1, rapidhash::v2, and rapidhash::v3 modules. The output for these is guaranteed to be stable between major crate versions.
Rapidhash V3 is the recommended, fastest, and most recent version of the hash. Streaming is only possible with the rapidhash V3 algorithm. Others are provided for backwards compatibility.
Rust hashing traits (RapidHasher, RandomState, etc.) are implemented in rapidhash::fast, rapidhash::quality, and rapidhash::inner modules. These are not guaranteed to give a consistent hash output between platforms, compiler versions, or crate versions as the rust Hasher trait is not suitable for portable hashing.
rapidhash::fast for optimal hashing speed with a slightly lower hash quality. Best for most datastructures such as HashMap and HashSet usage.rapidhash::quality where statistical hash quality is the priority, such as HyperLogLog or MinHash algorithms.rapidhash::inner to set advanced parameters to configure the hash function specifically to your use case.The minimum supported Rust version (MSRV) is 1.71.0.
The rapidhash crate follows the following versioning scheme:
rapidhash_v* method output.RapidHasher output.Portable hash outputs (eg. rapidhash_v3) are guaranteed to be stable. In-memory hash outputs (eg. RapidHasher) may change between minor versions to allow us to freely improve performance.
This project is licensed under both the MIT and Apache-2.0 licenses. You are free to choose either license.
With thanks to Nicolas De Carli for the original rapidhash C++ implementation, which is licensed under the MIT License.
With thanks to Orson Peters for his work on foldhash, which inspired much of the integer hashing optimisations in this crate. Some of the RapidHasher string hashing optimisations have also made their way back into foldhash as a thanks.
With thanks to Justin Bradford for letting us use the rapidhash crate name 🍻