Crates.io | slimmer_box |
lib.rs | slimmer_box |
version | 0.6.5 |
source | src |
created_at | 2023-02-18 22:39:45.160649 |
updated_at | 2023-11-13 21:16:21.857244 |
description | A packed alternative to Box |
homepage | |
repository | https://github.com/qqwy/rust-slimmer_box |
max_upload_size | |
id | 788511 |
size | 74,003 |
A SlimmerBox<T> is a packed alternative to Box<T> whose 'fat' pointer is 'slimmer'
A normal Box<[T]>
is an owned 'fat pointer' that contains both the 'raw' pointer to memory
as well as the size (as an usize) of the managed slice.
On 64-bit targets (where sizeof(usize) == sizeof(u64)), this makes a Box<[T]>
take up 16 bytes (128 bits, 2 words).
That's a shame: It means that if you build an enum that contains a Box<[T]>
,
then it will at least require 24 bytes (196 bits, 3 words) of stack memory.
But it is rather common to work with slices that will never be that large.
For example, what if we store the size in a u32 instead?
Will your slices really contain more than 2ˆ32 (4_294_967_296) elements?
a [u8; 2^32]
takes 4GiB of space.
And since the length is counted in elements, a [u64; 2^32]
takes 32GiB.
So lets slim this 'fat' pointer down! By storing the length inside a u32 rather than a u64, a SlimmerBox<[T], u32> only takes up 12 bytes (96 bits, 1.5 words) rather than 16 bytes.
This allows it to be used inside another structure, such as in one or more variants of an enum. The resulting structure will then still only take up 16 bytes.
In situations where you are trying to optimize for memory usage, cache locality, etc, this might make a difference:
The following 'small str optimization' enum still only takes up two words,
just like a normal &str
would:
use slimmer_box::SlimmerBox;
pub enum CompactStr {
Small{buffer: [u8; 14], len: u8}, // <- Or, using the `modular_bitfield` crate, this could even be { buffer: [u8; 15], len: u4} !
Large{ptr: SlimmerBox<str>},
}
impl From<&str> for CompactStr {
fn from(val: &str) -> CompactStr {
if val.len() < 14 {
let len = val.len() as u8;
let mut buffer = [0u8; 14];
buffer[0..val.len()].copy_from_slice(val.as_bytes());
CompactStr::Small{ buffer, len }
} else {
CompactStr::Large{ ptr: SlimmerBox::new(val) }
}
}
}
let compact_str: CompactStr = "hello world".into();
assert_eq!(core::mem::size_of_val(&compact_str), 16);
// An Option<CompactStr> also only takes up two words:
assert_eq!(core::mem::size_of_val(&Some(compact_str)), 16);
(A full version of this example including Debug, Display and Deref traits can be found in this test)
The following immutable AST still only takes up two words. Even Option<AST>
is only two words:
pub enum AST {
Bool(bool),
Int(i64),
Float(f64),
Str(SlimmerBox<str>),
Bytes(SlimmerBox<[u8]>),
List(SlimmerBox<[AST]>),
// 2^32 - 7 other variants could be added and the size would still stay the same :-)
}
assert_eq!(core::mem::size_of::<AST>(), 16);
assert_eq!(core::mem::size_of::<Option<AST>>(), 16);
With some care, you could even combine the above two examples together, and still end up with an AST type that takes up just two words!
SlimmerBox<T, u32> is the most common version, and therefore u32 is the default SlimmerMetadata to use. But it is possible to use another variant, if you are sure that your data will be even shorter.
()
is used for sized types. In this case a SlimmerBox will only contain the normal pointer and be exactly 1 word size, just like a normal Box containing a sized type.SlimmerMetadata | max DST length¹ | resulting size (32bit) | resulting size (64bit) | Notes |
---|---|---|---|---|
() | - | 4 bytes | 8 bytes | Used for normal sized types. Identical in size to a normal Box |
u8 | 255 | 5 bytes | 9 bytes | |
u16 | 65535 | 6 bytes | 10 bytes | Identical to Box |
u32 | 4294967295 | 8 bytes (2 words) | 12 bytes | Identical to Box |
u64 | 18446744073709551615 | -² | 16 bytes (2 words) | Identical to Box |
str
and in the number of elements for slices.Just like a normal Box, sizeof(Option<SlimmerBox<T>>) == sizeof(SlimmerBox<T>)
.
rkyv's Archive, Serialize and Deserialize have been implemented for SlimmerBox.
The serialized version of a SlimmerBoxrkyv::ArchivedBox<[T]>
.
This is a match made in heaven, since rkyv's relative pointers use only 32 bits for the pointer part as well as the length part.
As such, sizeof(rkyv::Archived<SlimmerBox<T>>) == 8
bytes (!).
(This is assuming rkyv's feature size_32
is used which is the default.
Changing it to size_64
is rarely useful for the same reason as the rant about lengths above.)
You can not use a SlimmerBox to store a trait object.
This is because the metadata of a dyn
pointer is another full-sized pointer. We cannot make that smaller!
no_std
supportSlimmerBox works perfectly fine in no_std
environments, as long as the alloc
crate is available.
(The only thing that is missing in no_std environments are implementations for SlimmerPointee of std::ffi::OsStr
and std::ffi::CStr
, neither of which exists when std
is disabled.)
(Below examples assume a 64-bit system)
Smaller than a normal Box for dynamically-sized types like slices or strings:
use slimmer_box::SlimmerBox;
let array: [u64; 4] = [1, 2, 3, 4];
let boxed_slice: Box<[u64]> = Box::from(&array[..]);
assert_eq!(core::mem::size_of_val(&boxed_slice), 16);
let slimmer_boxed_slice: SlimmerBox<[u64]> = SlimmerBox::new(&array[..]);
assert_eq!(core::mem::size_of_val(&slimmer_boxed_slice), 12);
Just like normal Box for normal, Sized types:
use slimmer_box::SlimmerBox;
let int = 42;
let boxed_int = Box::new(&int);
assert_eq!(core::mem::size_of_val(&boxed_int), 8);
let slimmer_boxed_int: SlimmerBox<u64, ()> = SlimmerBox::new(&int);
assert_eq!(core::mem::size_of_val(&slimmer_boxed_int), 8);
You can configure how much space you want to use for the length of a dynamically-sized slice or str:
use slimmer_box::SlimmerBox;
let array: [u64; 4] = [1, 2, 3, 4];
// Holds at most 255 elements:
let tiny: SlimmerBox<[u64], u8> = SlimmerBox::new(&array);
assert_eq!(core::mem::size_of_val(&tiny), 9);
// Holds at most 65535 elements or a str of 64kb:
let small: SlimmerBox<[u64], u16> = SlimmerBox::new(&array);
assert_eq!(core::mem::size_of_val(&small), 10);
// Holds at most 4294967295 elements or a str of 4GB:
let medium: SlimmerBox<[u64], u32> = SlimmerBox::new(&array);
assert_eq!(core::mem::size_of_val(&medium), 12);
// Holds at most 18446744073709551615 elements, or a str of 16EiB:
let large: SlimmerBox<[u64], u64> = SlimmerBox::new(&array); // <- Indistinguishable from a normal Box
assert_eq!(core::mem::size_of_val(&large), 16);
You can turn a Box into a SlimmerBox and vice-versa:
use slimmer_box::SlimmerBox;
let message = "hello, world!";
let boxed = Box::new(message);
let slimmer_box = SlimmerBox::from_box(boxed);
let again_boxed = SlimmerBox::into_box(slimmer_box);
"std"
. Enabled by default. Disable the default features to use the crate in no_std environments. slimmer_box
does require the alloc
crate to be available."rkyv"
. Enable support for the rkyv zero-copy serialisation/deserialisation library, which is a very good match for this crate!"serde"
. Enable support for the serde serialisation/deserialisation library.The minimum supported Rust version of slimmer_box
is 1.58.1.