Crates.io | rustybuzz |
lib.rs | rustybuzz |
version | 0.20.1 |
source | src |
created_at | 2020-07-04 11:56:31.690942 |
updated_at | 2024-11-12 11:53:33.625252 |
description | A complete harfbuzz shaping algorithm port to Rust. |
homepage | |
repository | https://github.com/harfbuzz/rustybuzz |
max_upload_size | |
id | 261263 |
size | 1,384,929 |
rustybuzz
is a complete harfbuzz's
shaping algorithm port to Rust.
Matches harfbuzz
v10.0.1, commit c7ef6a2e (one commit after v10.0.1)
Because you can add rustybuzz = "*"
to your project and it just works.
No need for a C++ compiler. No need to configure anything. No need to link to system libraries.
rustybuzz passes nearly all of harfbuzz shaping tests (2221 out of 2252 to be more precise). So it's mostly identical, but there are still some tiny edge-cases which are not implemented yet or cannot be implemented at all.
graphite2
library support.mort
table is not supported, since it's deprecated by Apple.avar2
as well as other parts of the boring-expansion-spec are not supported yet.At the moment, performance isn't that great. We're 1.5-2x slower than harfbuzz.
See benches/README.md for details.
rustybuzz is not a faithful port.
harfbuzz can roughly be split into 6 parts: shaping, subsetting, TrueType parsing, Unicode routines, custom containers and utilities (harfbuzz doesn't use C++ std) and glue for system/3rd party libraries. In the mean time, rustybuzz contains only shaping. All of the TrueType parsing was moved to the ttf-parser. Subsetting was removed. Unicode code was mostly moved to external crates. We don't need custom containers because Rust's std is good enough. And we do not use any non Rust libraries, so no glue code either.
In the end, we still have around 23 KLOC. While harfbuzz is around 80 KLOC.
As mentioned above, rustybuzz has around 23 KLOC. But this is not strictly true, because there are a lot of auto-generated data tables.
You can find the "real" code size using:
tokei --exclude hb/unicode_norm.rs --exclude hb/ot_shaper_vowel_constraints.rs \
--exclude '*_machine.rs' --exclude '*_table.rs' src
Which gives us around 17 KLOC, which is still a lot.
Since the port is finished, there is not much to do other than syncing it with a new harfbuzz releases. However, there is still lots of potential areas of improvement:
rustybuzz
passes the whole harfbuzz
test suite, this does not mean that
output will always be 100% identical to harfbuzz
. Given the complexity of the code base, there are bound to be
other bugs that just have not been discovered yet. One potential way of addressing this issue could be to create a
fuzzer that takes random fonts, and shapes them with a random set of Unicode codepoints as well as
input settings. In case of a discovered discrepancy, this test case could then be investigated and once the
bug has been identified, added to our custom test suite. On the one hand we could use the Google Fonts font
collection for this so that the fonts can be added to the repository,
but we could also just use MacOS/Windows system fonts and only test them in CI, similarly
to how it's currently done for AAT in harfbuzz
.harfbuzz
contains tons of optimization structures
(accelerators and caches) which have not been included in rustybuzz
. As a result of this,
performance is much worse in many cases, as mentioned above (although in the grand scheme of things
rustybuzz
is still very performant), but the upside of excluding all those optimizations is that
the code base is much simpler and straightforward. This makes it a lot easier to backport new changes
(which already is a very difficult task). Now that we are back in sync with harfbuzz
, we can consider
attempting to port some of the major optimization improvements from harfbuzz
, but we should do so carefully
to not make it even harder to keep the code bases in sync.rustybuzz
tries its best to make the code base look like a 1:1 C++ to Rust translation,
which is the case for most parts of the code, but there also are many variations (most notable in AAT) that arise
from the fact that a lot of the C++ concepts are not straightforward to port. Nevertheless, there probably still
are parts of the code that probably could be made more similar, given that someone puts time into looking into that.All of this is a lot of work, so contributions are more than welcome.
The library is completely safe.
We do have one unsafe
to cast between two POD structures, which is perfectly safe.
But except that, there are no unsafe
in this library and in most of its dependencies
(excluding bytemuck
).
rustybuzz
is licensed under the MIT.
harfbuzz
is licensed under the Old MIT