61 releases

new 0.21.4 Jan 14, 2025
0.21.0 Dec 13, 2024
0.20.1 Nov 29, 2024
0.18.21 May 23, 2024
0.10.3 Nov 28, 2022

#14 in Hardware support

Download history 18540/week @ 2024-09-26 18861/week @ 2024-10-03 19825/week @ 2024-10-10 18030/week @ 2024-10-17 18197/week @ 2024-10-24 20158/week @ 2024-10-31 18239/week @ 2024-11-07 15722/week @ 2024-11-14 20063/week @ 2024-11-21 21908/week @ 2024-11-28 20316/week @ 2024-12-05 21606/week @ 2024-12-12 16800/week @ 2024-12-19 12832/week @ 2024-12-26 21806/week @ 2025-01-02 21041/week @ 2025-01-09

75,900 downloads per month
Used in 210 crates (28 directly)

MIT license

1.5MB
30K SLoC

pulp is a safe abstraction over SIMD instructions, that allows you to write a function once and dispatch to equivalent vectorized versions based on the features detected at runtime.

Documentation Crate

Autovectorization example

use pulp::Arch;
fn main(){
    let mut v = (0..1000).map(|i| i as f64).collect::<Vec<_>>();
    let arch = Arch::new();

    arch.dispatch(|| {
        for x in &mut v {
            *x *= 2.0;
        }
    });

    for (i, x) in v.into_iter().enumerate() {
        assert_eq!(x, 2.0 * i as f64);
    }
}

Manual vectorization example

use pulp::{Arch, Simd, WithSimd};

struct TimesThree<'a>(&'a mut [f64]);
impl<'a> WithSimd for TimesThree<'a> {
    // No output, the input is modified in place to save time allocating a new vector
    type Output = ();

    #[inline(always)]
    fn with_simd<S: Simd>(self, simd: S) -> Self::Output {
        let v = self.0;
        // the tail is the remainder left after allocating v into simd vectors
        // len(tail) = len(v) % simd_vector_length
        let (head, tail) = S::as_mut_simd_f64s(v);

        // fill the simd vectors with 3.0
        let three = simd.splat_f64s(3.0);

        for x in head {
            *x = simd.mul_f64s(three, *x);
        }
        // the tail is not loaded into simd vectors hence non-simd operations are used
        for x in tail {
            *x = *x * 3.0;
        }
    }
}
fn main() {
    let mut v = (0..1000).map(|i| i as f64).collect::<Vec<_>>();
    let arch = Arch::new();
    arch.dispatch(TimesThree(&mut v)); // dynamically dispatch the function to the correct simd implementation
    for (i, x) in v.into_iter().enumerate() {
        assert_eq!(x, 3.0 * i as f64);
    }
}

Less boilerplate using pulp::with_simd

Only available with the macro feature.

Requires the first non-lifetime generic parameter, as well as the function's first input parameter to be the SIMD type.

use pulp::Simd;

// the macro creates a `sum` function
#[pulp::with_simd(sum = pulp::Arch::new())]
#[inline(always)]
fn sum_with_simd<'a, S: Simd>(simd: S, v: &'a mut [f64]) {
    let (head, tail) = S::as_mut_simd_f64s(v);

    // fill the simd vectors with 3.0
    let three = simd.splat_f64s(3.0);

    for x in head {
        *x = simd.mul_f64s(three, *x);
    }

    for x in tail {
        *x = *x * 3.0;
    }
}
fn main() {
    let mut v = (0..1000).map(|i| i as f64).collect::<Vec<_>>();
    sum(&mut v);

    for (i, x) in v.into_iter().enumerate() {
        assert_eq!(x, 3.0 * i as f64);
    }
}

Dependencies

~1MB
~17K SLoC