45 breaking releases
Version | Released |
---|---|
0.46.0 | Nov 7, 2024 |
0.44.0 | Sep 4, 2024 |
0.43.0 | Jul 14, 2024 |
0.38.0 | Mar 28, 2024 |
0.3.0 | Jul 30, 2021 |
noodles
noodles attempts to provide specification-compliant (when applicable) implementations of libraries for handling various bioinformatics file formats. It currently supports BAM 1.6, BCF 2.2, BED, BGZF, CRAM 3.0/3.1, CSI, FASTA, FASTQ, GFF3, GTF 2.2, htsget 1.3, refget 2.0, SAM 1.6, tabix, and VCF 4.3/4.4.
Usage
noodles is published on crates.io. Early versions can be used in projects, but keep in mind that the API is still considered experimental.
noodles is split into multiple crates by file format. For convenience, a top-level meta crate named noodles can be added to your project's dependency list, with each format enabled as a feature. For example, to work with the BAM format, add the noodles crate and enable the bam feature.
cargo add noodles --features bam
Each enabled feature can then be imported by its re-exported name, e.g.,
use noodles::bam;
Feature flags
Individual crates may have optional features that can be enabled using feature flags.
async
: Enables asynchronous I/O with Tokio. (BAM, BCF, BGZF, CRAM, CSI, FASTA, FASTQ, GFF, SAM, tabix, and VCF)
libdeflate
: Use libdeflate to encode and decode DEFLATE streams. (BGZF and CRAM)
Examples
Each crate may have its own examples directory, and all examples are runnable
as an application. After cloning the repository, run cargo run --release --example
for a list of available examples. Use the example name as the option
argument and append program arguments to the command, e.g.,
cargo run --release --example bam_write > sample.bam
cargo run --release --example bam_read_header sample.bam
lib.rs:
noodles-tabix handles the reading and writing of the tabix format.
A tabix (TBI) index is a file typically used to allow random access to an accompanying file that is
- bgzipped,
- tab-delimited,
- grouped by reference sequence name, and
- coordinate sorted by start position.
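The last two requirements can be checked before building an index. The following is a minimal sketch using only the standard library; it is not part of noodles. It assumes the reference sequence name is in the first column and the start position in the second (as in VCF), and the function name `is_grouped_and_sorted` is hypothetical.

```rust
use std::collections::HashSet;

/// Returns true if the tab-delimited `lines` are grouped by reference
/// sequence name (column 1) and sorted by start position (column 2)
/// within each group. Empty lines and lines starting with '#' are skipped.
///
/// Hypothetical helper for illustration; the column layout is an assumption.
fn is_grouped_and_sorted<'a, I>(lines: I) -> bool
where
    I: IntoIterator<Item = &'a str>,
{
    // Names of groups that have already been started.
    let mut seen: HashSet<&'a str> = HashSet::new();
    // Name and start position of the previous record, if any.
    let mut current: Option<(&'a str, u64)> = None;

    for line in lines {
        if line.is_empty() || line.starts_with('#') {
            continue;
        }

        let mut fields = line.split('\t');
        let (Some(name), Some(start)) = (fields.next(), fields.next()) else {
            return false; // fewer than two columns
        };
        let Ok(start) = start.parse::<u64>() else {
            return false; // start position is not a number
        };

        match current {
            // Same group as the previous record: positions must not decrease.
            Some((prev_name, prev_start)) if prev_name == name => {
                if start < prev_start {
                    return false;
                }
            }
            // A new group begins: its name must not have appeared before.
            _ => {
                if !seen.insert(name) {
                    return false;
                }
            }
        }

        current = Some((name, start));
    }

    true
}
```

A file whose records for one reference sequence are split into two separate runs fails this check even if each run is sorted on its own.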
It can be used to find relevant records for a given genomic region.
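Whether a record is relevant to a region comes down to an interval overlap test. As a sketch, assuming 1-based, closed [start, end] coordinates for both the record and the query region on the same reference sequence; the names `Interval` and `overlaps` are hypothetical and not part of the noodles API.

```rust
/// A 1-based, closed interval on a single reference sequence.
/// Hypothetical type for illustration only.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct Interval {
    start: u64,
    end: u64,
}

/// Returns true if the two intervals share at least one position.
fn overlaps(a: Interval, b: Interval) -> bool {
    a.start <= b.end && b.start <= a.end
}
```

An index narrows the search to the parts of the file that may contain overlapping records; a test like this one is then typically still applied to each candidate record.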
Examples
Read a tabix file
use noodles_tabix as tabix;
let index = tabix::read("sample.vcf.gz.tbi")?;