56 stable releases (4 major)

new 4.5.0 Feb 18, 2025
4.3.0 Nov 27, 2024
3.14.1 Aug 5, 2024
3.12.0 Jul 30, 2024
0.6.0 Oct 2, 2023

#5 in Machine learning

4,342 downloads per month
Used in 12 crates (11 directly)

Apache-2.0

400KB
2.5K SLoC

FastEmbed-rs 🦀

Rust implementation of @qdrant/fastembed

🍕 Features

🔍 Not looking for Rust?

🤖 Models

Text Embedding

Sparse Text Embedding

Image Embedding

Reranking

🚀 Installation

Run the following command in your project directory:

cargo add fastembed

Or add the following line to your Cargo.toml:

[dependencies]
fastembed = "4"

📖 Usage

Text Embeddings

use fastembed::{TextEmbedding, InitOptions, EmbeddingModel};

// With default InitOptions
let model = TextEmbedding::try_new(Default::default())?;

// With custom InitOptions
let model = TextEmbedding::try_new(
    InitOptions::new(EmbeddingModel::AllMiniLML6V2).with_show_download_progress(true),
)?;

let documents = vec![
    "passage: Hello, World!",
    "query: Hello, World!",
    "passage: This is an example passage.",
    // The "passage: " / "query: " prefix is optional, but recommended
    "fastembed-rs is licensed under Apache 2.0",
];

// Generate embeddings with the default batch size, 256
let embeddings = model.embed(documents, None)?;

println!("Embeddings length: {}", embeddings.len()); // -> Embeddings length: 4
println!("Embedding dimension: {}", embeddings[0].len()); // -> Embedding dimension: 384
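
Once the embeddings are available, a common next step is to compare them. The following is a minimal sketch, assuming each embedding can be treated as a slice of `f32` values. The helpers `cosine_similarity` and `most_similar` are hypothetical names introduced for illustration and are not part of fastembed; the toy vectors stand in for real 384-dimensional output.

```rust
/// Cosine similarity between two embedding vectors.
/// Returns `None` if the vectors are empty, differ in length, or have zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> Option<f32> {
    if a.len() != b.len() || a.is_empty() {
        return None;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return None;
    }
    Some(dot / (norm_a * norm_b))
}

/// Index of the candidate most similar to `query`, if any candidate is comparable.
fn most_similar(query: &[f32], candidates: &[Vec<f32>]) -> Option<usize> {
    candidates
        .iter()
        .enumerate()
        .filter_map(|(i, c)| cosine_similarity(query, c).map(|s| (i, s)))
        .max_by(|a, b| a.1.total_cmp(&b.1))
        .map(|(i, _)| i)
}

fn main() {
    // Toy 3-dimensional vectors (assumption: stand-ins for real embeddings).
    let query = vec![1.0_f32, 0.0, 0.0];
    let passages = vec![
        vec![0.0_f32, 1.0, 0.0],
        vec![0.9, 0.1, 0.0],
        vec![-1.0, 0.0, 0.0],
    ];
    let best = most_similar(&query, &passages);
    println!("Most similar passage index: {:?}", best);
}
```

With real data, the query embedding and the passage embeddings would both come from the same model, since vectors from different models are not comparable.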

Image Embeddings

use fastembed::{ImageEmbedding, ImageInitOptions, ImageEmbeddingModel};

// With default ImageInitOptions
let model = ImageEmbedding::try_new(Default::default())?;

// With custom InitOptions
let model = ImageEmbedding::try_new(
    ImageInitOptions::new(ImageEmbeddingModel::ClipVitB32).with_show_download_progress(true),
)?;

let images = vec!["assets/image_0.png", "assets/image_1.png"];

// Generate embeddings with the default batch size, 256
let embeddings = model.embed(images, None)?;

println!("Embeddings length: {}", embeddings.len()); // -> Embeddings length: 2
println!("Embedding dimension: {}", embeddings[0].len()); // -> Embedding dimension: 512

Candidates Reranking

use fastembed::{TextRerank, RerankInitOptions, RerankerModel};

let model = TextRerank::try_new(
    RerankInitOptions::new(RerankerModel::BGERerankerBase).with_show_download_progress(true),
)?;

let documents = vec![
    "hi",
    "The giant panda (Ailuropoda melanoleuca), sometimes called a panda bear, is a bear species endemic to China.",
    "panda is animal",
    "i dont know",
    "kind of mammal",
];

// Rerank with the default batch size, 256, and return document contents
let results = model.rerank("what is panda?", documents, true, None)?;
println!("Rerank result: {:?}", results);
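
When document contents are not returned, each score has to be mapped back to the input by position. The sketch below illustrates that step under stated assumptions: `Scored` is a hypothetical stand-in for a reranker result that carries only an input index and a score, `top_k` is a hypothetical helper, and the scores are made up for the example. None of these are fastembed types or outputs.

```rust
/// Hypothetical stand-in for one reranker result:
/// the document's position in the input and its relevance score.
#[derive(Debug, Clone, PartialEq)]
struct Scored {
    index: usize,
    score: f32,
}

/// Return the `k` highest-scoring documents, best first.
/// Results whose index falls outside `docs` are skipped.
fn top_k<'a>(docs: &[&'a str], mut results: Vec<Scored>, k: usize) -> Vec<(&'a str, f32)> {
    // Sort by score, descending.
    results.sort_by(|a, b| b.score.total_cmp(&a.score));
    results
        .into_iter()
        .filter_map(|r| docs.get(r.index).map(|d| (*d, r.score)))
        .take(k)
        .collect()
}

fn main() {
    let docs = ["hi", "panda is animal", "i dont know", "kind of mammal"];
    // Made-up scores, for illustration only.
    let results = vec![
        Scored { index: 0, score: -5.2 },
        Scored { index: 1, score: 4.1 },
        Scored { index: 2, score: -6.0 },
        Scored { index: 3, score: 0.3 },
    ];
    for (doc, score) in top_k(&docs, results, 2) {
        println!("{score:.2}  {doc}");
    }
}
```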

Alternatively, local model files can be used for inference via the try_new_from_user_defined(...) methods of the respective structs.

✊ Support

To support the library, please consider donating to our primary upstream dependency, ort - The Rust wrapper for the ONNX runtime.

📄 LICENSE

Apache 2.0 © 2024

Dependencies

~17–34MB
~608K SLoC