3 unstable releases

0.2.0 Jun 19, 2022
0.1.1 Jun 15, 2022
0.1.0 Jun 14, 2022

#1596 in Data structures

BSD-2-Clause

47KB
775 lines

Fenny

Fenny is a library for Fenwick Trees, also known as Binary Indexed Trees (BIT). They are a data structure that can efficiently (as in O(log n)) support three operations.

a[i] += v;
a[..=i].iter().sum();

The third operation is a binary search on prefix sums.

Here's a simple example usage:

let mut tree = [0; 13];   // An array of zeros is a fenwick tree of zeros.
fenny::update(&mut tree, 2 /* 0-based index */, 7);
fenny::update(&mut tree, 5, 20);
fenny::psum(&tree, 2);  // Returns 7.
fenny::psum(&tree, 5);  // Returns 27.

let root_p1 = fenny::get_root_p1(tree.len());
// Returns Some(5), as a[..=5].iter().sum() == 7 + 20 > 26.
fenny::first_larger(&tree, root_p1, 26);
fenny::first_larger(&tree, root_p1, 27); // Returns None.

Slope-offset

What's better than a fenwick tree? Two of course! If we add another tree into the mix, we can support an additional operation, the range update. Also in the same juicy O(log n) time. The size of the range doesn't matter, it can update the whole array in the same time as updating a single element.

let mut slope = [0i64; 13];
let mut offset = [0i64; 13];
// Conceptually a[2..=5] += 3;
fenny::update_so(&mut slope, &mut offset, 2..=5, 3);
fenny::psum_so(&slope, &offset, 2); // Returns 3.
fenny::psum_so(&slope, &offset, 3); // Returns 6.
let root_p1 = fenny::get_root_p1(slope.len());
fenny::first_larger_so(&slope, &offset, root_p1, 3); // 3.

Higher dimensions

This exists in 2d and 3d. psum_2d(&tree, p) computes a sum over all points q satisfying (q.y <= p.y && q.x <= p.x)

use fenny::*;
let dim = Dim2{y: 5, x: 8};
let mut tree = vec![0; dim.y * dim.x];
update_2d(&mut tree, dim , Point2{y: 3, x: 6}, 7);
// Returns 0
psum_2d(&tree, dim, Point2{y: 2, x: 6});
// Returns 0
psum_2d(&tree, dim, Point2{y: 3, x: 5});
// Returns 7
psum_2d(&tree, dim, Point2{y: 3, x: 6});
// Returns 7
psum_2d(&tree, dim, Point2{y: 4, x: 7});

These operations run in log(dim.x) * log (dim.y) * log (dim.z)

Higher dimension, slope offset, lexicographic

First a little background. This is something I came up with to help with arithmetic coding. We have a probability density function over a discretized space. Now to turn this into an efficient encoding, we need to map our space to [0, 1), where every cell corresponds to a segment with a length proportional to its probability. So lets imagine we lay all the cells out in a line, and then we compute the prefix sums of the probabilities. That will get us segments wohoo!

Ok, but one issue. We would like to efficiently manipulate the probabilities of the cells, as they exist in 3d space. In particular we would like to pick a box with corners (x0, y0, z0) and (x1, y1, z1), and increase the weight of all cells in that box in one swoop. We could do range updates in 1d with a slope-offset construction, anything similar for 3d? Yes there is! Turns out that by having one slope per dimension, we can achieve our goal! This is similar to an algorithm by Mishra, but we can get away a little cheaper by only suppporting lexicographic queries rather than arbitrary boxes. During the decoding step it's necessary to go from a number in [0, 1) and find the corresponding cell. This can be done using first_larger function family.

With that background out of the way, let's look at the operations.

use fenny::*;
let dim = Dim3{z: 12, y: 6, x: 8};
let mut slope_z = vec![0i64; dim.z];
let mut slope_y = vec![0i64; dim.z * dim.y];
let mut slope_x = vec![0i64; dim.z * dim.y * dim.x];
let mut offset  = vec![0i64; dim.z * dim.y * dim.x];
let p0 = Point3{z: 3, y: 2, x: 1};
let p1 = Point3{z: 7, y: 5, x: 5};
let p2 = Point3{z: 4, y: 0, x: 0};
// Updates all points q with p0.x<=q.x<=p1.x && p0.y<=q.x<=p1.y && p0.z<=q.x<=p1.z
update_so_3d_lex(&mut slope_z, &mut slope_y, &mut slope_x, &mut offset, dim, p0, p1, 11);
// Our points out are laid out in lexicographic order according to z, y, x.
// Since 4, 0, 0 comes after (3, 2, 1), (3, 2, 2), ... (3, 5, 5) our result is 5 * 4 * 11;
psum_so_3d_lex(&slope_z, &slope_y, &slope_x, &offset, dim, p2);
// (4, 0, 0) has a psum of 5*4*11, but so does (3, 5, 5), so that's the result.
first_larger_so_3d_lex(&slope_z, &slope_y, &slope_x, &offset, dim, 5 * 4 * 11 - 1);

These operations run in log(dim.x) * log (dim.y) * log (dim.z)

Acknowledgements

I found a topcoder article written by boba5551 very helpful in understanding the Fenwick trees. The diagrams were great.

The wikipedia article had a good overview as well as useful information on how to use to do range updates using two combined trees.

The fenwick rust library is well written and was an inspiration.

No runtime deps