11 unstable releases (3 breaking)

0.5.0 Aug 10, 2024
0.4.2 May 28, 2024
0.3.3 May 1, 2024
0.3.2 Mar 13, 2024
0.2.0 Dec 8, 2022

#644 in Parser implementations

Download history 92/week @ 2024-07-08 5/week @ 2024-07-22 19/week @ 2024-07-29 140/week @ 2024-08-05 72/week @ 2024-08-12 44/week @ 2024-08-19 146/week @ 2024-08-26 110/week @ 2024-09-02 124/week @ 2024-09-09 96/week @ 2024-09-16 193/week @ 2024-09-23 101/week @ 2024-09-30 114/week @ 2024-10-07 108/week @ 2024-10-14

526 downloads per month
Used in 4 crates

MIT license

405KB
10K SLoC

anndata-rs: A Rust/Python package for reading data in the h5ad format

Motivation

The goal of this library is to complement the anndata package by providing an out-of-core AnnData implementation.

Unlike the backed mode in the anndata package, anndata-rs's AnnData object is fully backed and always stays in sync with the data stored in the hard drive.

Here are the key features of this implementation:

  • AnnData is fully backed by the underlying hdf5 file. Any operations on the AnnData object will be reflected on the hdf5 file.
  • All elements are lazily loaded. No matter how large is the file, opening it consume almost zero memory. Matrix data can be accessed and processed by chunks, which keeps the memory usage to the minimum.
  • In-memory cache can be turned on to speed up the repetitive access of elements.
  • An AnnDataSet object to lazily concatenate multiple AnnData objects.

Limitations:

  • Only a subset of the h5ad specifications are implemented. For example, the .layer and .raw is not supported. To request a missing feature, please open a new issue.
  • No views. Subsetting the AnnData will modify the data inplace or make a copy.

Installation

We do not provide installation instructions here. Right now this package is bundled with the SnapATAC2 package. Please install SnapATAC2 to get these features.

Tutorials

Click here to read tutorials.

Dependencies

~28–39MB
~671K SLoC