harfruzz
is a fork of rustybuzz
to explore porting from ttf-parser
to
read-fonts
to avoid shipping (and maintaining)
multiple implementations of core font parsing for skrifa
consumers.
Further context in googlefonts/fontations#956.
rustybuzz
is a complete harfbuzz's
shaping algorithm port to Rust.
Matches harfbuzz
v9.0.0.
https://github.com/googlefonts/oxidize outlines Google Fonts motivations to try to migrate font production and consumption to Rust.
The following conformance issues need to be fixed:
- harfruzz does not yet fully pass the harfbuzz shaping or fuzzing tests
- Malformed fonts will cause an error
- HarfBuzz uses fallback/dummy shaper in this case
- No Arabic fallback shaper
- This requires the ability to build lookups on the fly. In HarfBuzz (C++) this requires serialization code that is associated with subsetting.
avar2
as well as other parts of the boring-expansion-spec are not supported yet.
- No font size property. Shaping is always using UnitsPerEm. You should scale the result manually.
- Most of the TrueType and Unicode handling code was moved into separate crates.
- harfruzz doesn't interact with any system libraries and must produce exactly the same results on all OS'es and targets.
mort
table is not supported, since it's deprecated by Apple.- No
graphite
library support.
At the moment, performance isn't that great. We're 1.5-2x slower than harfbuzz.
See benches/README.md for details.
harfruzz is not a faithful port.
harfbuzz (C++ edition) can roughly be split into 6 parts:
- shaping, handled by harfruzz
- subsetting, (
hb-subset
) moves to a standalone crate, klippa - TrueType parsing, handled by
read-fonts
- Unicode routines, migrated to external crates
- custom containers and utilities (harfbuzz doesn't use C++ std), reimplemented in
fontations
where appropriate (e.g. int set) - glue for system/3rd party libraries, just gone
You can find the "real" code size (eliminating generated code) using:
tokei --exclude hb/unicode_norm.rs --exclude hb/ot_shaper_vowel_constraints.rs \
--exclude '*_machine.rs' --exclude '*_table.rs' src
Since the port is finished, there is not much to do other than syncing it with a new harfbuzz releases. However, there is still lots of potential areas of improvement:
- Wider test coverage: Currently, we only test the result of the positioned glyphs in the shaping output. We should add tests so that we can test other parts of the API as well, such as glyph extents and glyph flags.
- Fuzzing against harfbuzz: While
rustybuzz
passes the wholeharfbuzz
test suite, this does not mean that output will always be 100% identical toharfbuzz
. Given the complexity of the code base, there are bound to be other bugs that just have not been discovered yet. One potential way of addressing this issue could be to create a fuzzer that takes random fonts, and shapes them with a random set of Unicode codepoints as well as input settings. In case of a discovered discrepancy, this test case could then be investigated and once the bug has been identified, added to our custom test suite. On the one hand we could use the Google Fonts font collection for this so that the fonts can be added to the repository, but we could also just use MacOS/Windows system fonts and only test them in CI, similarly to how it's currently done for AAT inharfbuzz
. - Performance:
harfbuzz
contains tons of optimization structures (accelerators and caches) which have not been included inrustybuzz
. As a result of this, performance is much worse in many cases, as mentioned above (although in the grand scheme of thingsrustybuzz
is still very performant), but the upside of excluding all those optimizations is that the code base is much simpler and straightforward. This makes it a lot easier to backport new changes (which already is a very difficult task). Now that we are back in sync withharfbuzz
, we can consider attempting to port some of the major optimization improvements fromharfbuzz
, but we should do so carefully to not make it even harder to keep the code bases in sync. - Code alignment:
rustybuzz
tries its best to make the code base look like a 1:1 C++ to Rust translation, which is the case for most parts of the code, but there also are many variations (most notable in AAT) that arise from the fact that a lot of the C++ concepts are not straightforward to port. Nevertheless, there probably still are parts of the code that probably could be made more similar, given that someone puts time into looking into that.
All of this is a lot of work, so contributions are more than welcome.
Unsafe code is forbidden by a #![forbid(unsafe_code)]
attribute in the root
of the library.
- harfbuzz_rs - bindings to the actual harfbuzz library. As of v2 doesn't expose subsetting and glyph outlining, which harfbuzz supports.
- allsorts - shaper and subsetter. As of v0.6 doesn't support variable fonts and Apple Advanced Typography. Relies on some unsafe code.
- swash - Supports variable fonts, text layout and rendering. No subsetting. Relies on some unsafe code. As of v0.1.4 has zero tests.
harfruzz
is licensed under the MIT license.
harfbuzz
is licensed under the Old MIT