42 releases (23 stable)
new 3.1.0 | Apr 7, 2025 |
---|---|
2.6.0 | Nov 5, 2024 |
2.5.0 | Jul 8, 2024 |
2.2.0 | Dec 12, 2023 |
0.6.1 | Mar 27, 2017 |
#41 in Encoding
593,260 downloads per month
Used in 195 crates
(58 directly)
245KB
3.5K
SLoC
bitstream-io
A Rust library for reading or writing binary values to or from streams which may not be aligned at a whole byte.
This library is intended to be flexible enough to wrap
around any stream which implements the Read
or Write
traits.
It also supports a wide array of integer data types as
containers for those binary values.
Minimum Compiler Version
Beginning with version 2.4, the minimum compiler version has been
updated to Rust 1.79 in order to support compile-time assertion
in const
blocks, which can be used to check for a class
of errors at compile-time rather than runtime.
lib.rs
:
Traits and helpers for bitstream handling functionality
Bitstream readers are for reading signed and unsigned integer values from a stream whose sizes may not be whole bytes. Bitstream writers are for writing signed and unsigned integer values to a stream, also potentially un-aligned at a whole byte.
Both big-endian and little-endian streams are supported.
The only requirement for wrapped reader streams is that they must
implement the io::Read
trait, and the only requirement
for writer streams is that they must implement the io::Write
trait.
In addition, reader streams do not consume any more bytes from the underlying reader than necessary, buffering only a single partial byte as needed. Writer streams also write out all whole bytes as they are accumulated.
Readers and writers are also designed to work with integer types of any possible size. Many of Rust's built-in integer types are supported by default.
Minimum Compiler Version
Beginning with version 2.4, the minimum compiler version has been updated to Rust 1.79.
The issue is that reading an excessive number of bits to a type which is too small to hold them, or writing an excessive number of bits from too small of a type, are always errors:
use std::io::{Read, Cursor};
use bitstream_io::{BigEndian, BitReader, BitRead};
let data = [0; 10];
let mut r = BitReader::endian(Cursor::new(&data), BigEndian);
let x: Result<u32, _> = r.read_var(64); // reading 64 bits to u32 always fails at runtime
assert!(x.is_err());
but those errors will not be caught until the program runs, which is less than ideal for the common case in which the number of bits is already known at compile-time.
But starting with Rust 1.79, we can now have read and write methods which take a constant number of bits and can validate the number of bits are small enough for the type being read/written at compile-time:
use std::io::{Read, Cursor};
use bitstream_io::{BigEndian, BitReader, BitRead};
let data = [0; 10];
let mut r = BitReader::endian(Cursor::new(&data), BigEndian);
let x: Result<u32, _> = r.read::<64, _>(); // doesn't compile at all
Since catching potential bugs at compile-time is preferable to encountering errors at runtime, this will hopefully be an improvement in the long run.
Changes From 2.X.X
Version 3.0.0 has made many breaking changes to the BitRead
and
BitWrite
traits.
The BitRead::read
method takes a constant number of bits,
and the BitRead::read_var
method takes a variable number of bits
(reversing the older BitRead2::read_in
and BitRead2::read
calling methods to emphasize using the constant-based one,
which can do more validation at compile-time).
A new BitRead2
trait uses the older calling convention
for compatibility with existing code and is available
for anything implementing BitRead
.
In addition, the main reading methods return primitive types which
implement a new Integer
trait,
which delegates to BitRead::read_unsigned
or BitRead::read_signed
depending on whether the output
is an unsigned or signed type.
BitWrite::write
and BitWrite::write_var
work
similarly to the reader's read
methods, taking anything
that implements Integer
and writing an unsigned or
signed value to BitWrite::write_unsigned
or
BitWrite::write_signed
as appropriate.
And as with reading, a BitWrite2
trait is offered
for compatibility.
In addition, the Huffman code handling has been rewritten to use a small amount of macro magic to write code to read and write symbols at compile-time. This is significantly faster than the older version and can no longer fail to compile at runtime.
Lastly, there's a new BitCount
struct which wraps a humble
u32
but encodes the maximum possible number of bits
at the type level.
This is intended for file formats which encode the number
of bits to be read in the format itself.
For example, FLAC's predictor coefficient precision
is a 4 bit value which indicates how large each predictor
coefficient is in bits
(each coefficient might be an i32
type).
By keeping track of the maximum value at compile time
(4 bits' worth, in this case), we can eliminate
any need to check that coefficients aren't too large
for an i32
at runtime.
This is accomplished by using BitRead::read_count
to
read a BitCount
and then reading final values with
that number of bits using BitRead::read_counted
.
Migrating From Pre 1.0.0
There are now BitRead
and BitWrite
traits for bitstream
reading and writing (analogous to the standard library's
Read
and Write
traits) which you will also need to import.
The upside to this approach is that library consumers
can now make functions and methods generic over any sort
of bit reader or bit writer, regardless of the underlying
stream byte source or endianness.
Dependencies
~225–365KB