#audio-processing #traits #abstraction #buffer #processor #higher-level #traits-structs

audio-processor-traits

Traits for audio processor types and audio buffer types. Heavily subject to change.

16 releases (8 stable)

4.3.0 Aug 26, 2024
4.2.0 Jan 17, 2024
4.1.0 May 22, 2023
3.2.0 Mar 22, 2023
0.3.1 Jul 23, 2021

#30 in Audio

Download history 67/week @ 2024-09-24 65/week @ 2024-10-01 39/week @ 2024-10-08 41/week @ 2024-10-15 25/week @ 2024-10-22 44/week @ 2024-10-29 42/week @ 2024-11-05 9/week @ 2024-11-12 41/week @ 2024-11-19 56/week @ 2024-11-26 45/week @ 2024-12-03 76/week @ 2024-12-10 32/week @ 2024-12-17 13/week @ 2024-12-31 13/week @ 2025-01-07

68 downloads per month
Used in 18 crates (16 directly)

MIT license

96KB
1.5K SLoC

Audio Processor Traits

crates.io docs.rs


Traits for audio processor types and audio buffer types. Heavily subject to change.

Primer

This is very much exploration based, and I'm still finding out the best API to express in Rust types.

I think I've found a good abstraction to handle AudioBuffer and basic FX AudioProcessor, but there's quite a lot more to uncover.

Motivation

It'd be great to have generic traits for structs that process audio.

These may be 1 or more abstractions on top of audio processing which enable higher-level systems to manipulate audio processing nodes.

At its core, audio processing may be defined as a function:

fn process_audio(_input: &mut [f32]) { /* ... */ }

input would be your input/output buffer, and you could perform mutable operations on it. However, this does not encode several audio processing concerns, such as:

  • The buffer won't be always f32, it might be f64
  • The buffer channel count isn't expressed
  • The buffer layout for multichannel data isn't expressed (e.g. Interleaved data or otherwise)
  • Sample rate isn't expressed
  • Dry/wet configurations aren't expressed
  • MIDI & other concerns aren't expressed

We'd like some abstraction that covers some of these issues. Without thinking about external system problems (such as MIDI, state & dry/wet), a basic audio processor trait can solve buffer/sample conversion issues.

AudioBuffer

An AudioBuffer trait is provided. It provides an abstraction to get the size of the buffer and modify it.

The AudioBuffer trait may wrap samples in different layouts or with different ownership, however, it's recommended to process samples using the AudioBuffer::frame, AudioBuffer::frame_mut, AudioBuffer::slice and AudioBuffer::slice_mut.

The reason for this is that using slice iterators is much more efficient than iterating over a range of numbers and calling AudioBuffer::get.

It's very unfortunate, but there's not a uniform optimised way to iterate between buffers that have different layouts provided in this crate yet.

Something I think should work is to have some kind of channel iterator which will wrap the slice iterators. The channel count should be low so losing some optimisation when reading a frame shouldn't be an issue.

Note that 10-20x slowdown is based on a very trivial "gain" work-load. In practice this might be an issue.

On my computer, there's around 600ns overhead per 512 samples to use the get/set functions.

With the _unchecked versions which skip bounds checking, the overhead is around 300ns and with frames/slice there's no overhead.

In comparison, my current implementation of interleaved to VST buffer conversion takes around 1.15us to convert from CPAL into VST and 1.1us to convert out of VST back to CPAL.

That'd be roughly 2us spent on conversions per 512 sample block, vs a 300-600ns slowdown from iteration.

This has made me think is that it might be better that AudioProcessors only expose a process_sample(&mut self, sample: f32) function so that they are really not connected to the AudioBuffer at all.

AudioProcessor

The AudioProcessor trait is only two methods:

pub trait AudioProcessor {
    type SampleType;
    fn prepare(&mut self, _settings: AudioProcessorSettings) {}
    fn process<BufferType: AudioBuffer<SampleType = Self::SampleType>>(
        &mut self,
        data: &mut BufferType,
    );
}

It provides a prepare callback, where channel & sample rate configuration will be provided and a process callback where a generic AudioBuffer is provided.

Design notes

SampleType associated type

The SampleType is provided as an associated type to both the AudioBuffer and the AudioProcessor traits. This enables implementors to use generic SampleType types in their processors.

For example, this is the SilenceAudioProcessor implementation in this crate, which should work for any num::Float type and any AudioBuffer implementation:

pub struct SilenceAudioProcessor<SampleType>(PhantomData<SampleType>);

impl<SampleType: num::Float + Send> AudioProcessor for SilenceAudioProcessor<SampleType> {
    type SampleType = SampleType;

    fn process<BufferType: AudioBuffer<SampleType = Self::SampleType>>(
        &mut self,
        output: &mut BufferType,
    ) {
        for sample_index in 0..output.num_samples() {
            for channel_index in 0..output.num_channels() {
                output.set(
                    channel_index,
                    sample_index,
                    <BufferType as AudioBuffer>::SampleType::zero(),
                );
            }
        }
    }
}

Pending work

  • MIDI trait
  • Richer API for applications
  • State management guidelines, using a background ref-counting garbage-collector & immutable 'state handle' references (while still allowing the internal state of a processor to be mutable)
  • Automatic implementation of the VST API for all trait implementors
  • Automatic implementation of the LV2 API for all trait implementors
  • Automatic implementation of a "stand-alone" cpal based App for all trait implementors (see audio-processor-standalone in this repository)
  • An audio-graph implementation
  • GUI support
  • Testing tools

Buffer performance

On a trivial gain benchmark, performance using the AudioBuffer::get APIs is between 10-20x worse on a VecAudioBuffer than a Vec.

Other things to measure:

  • Measure buffer conversion time on a small window size
    • Compare this with overhead of the AudioBuffer abstraction
  • Measure overhead in contrast to the grand scheme of a non-trivial processor (not gain)

Other thoughts

  • Which is the more efficient layout?
  • Should audio-processors expose a process function that works sample by sample? Perhaps this is easier to optimise

License

MIT

Dependencies

~0.8–3.5MB
~62K SLoC