
async-llm

async-llm is a Rust library for working with OpenAI-compatible providers, including OpenAI, Gemini, OpenRouter, and Ollama.

Note: This repository is currently a work-in-progress and is under active development. As such, breaking changes may occur frequently. Please proceed with caution if using this code in production or for critical projects. We recommend checking the commit history and pull requests for the latest updates. Contributions, feedback, and issue reports are welcome! 🚧

Why Choose async-llm?

Relying solely on OpenAI isn't ideal for every application. You can find numerous forum discussions about issues with OpenAI billing and availability. Using OpenAI-compatible providers gives you more options and flexibility. However, working with these APIs can be tricky due to differences in their specifications.

While some crates focus only on OpenAI or attempt a one-size-fits-all approach, async-llm takes a balanced path:

  • Follows the OpenAI Standard: Closely aligns with the original OpenAI API design.
  • Flexible for Compatibility: Adapts to compatible APIs by making all possible fields optional.
  • Simplifies Integration: Provides traits to handle provider-specific tweaks without complexity.
  • Built for Performance: Fully asynchronous, leveraging Rust's async ecosystem.

With async-llm, you can seamlessly work with multiple OpenAI-compatible APIs while maintaining a clean and consistent codebase.
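To make the "all possible fields optional" point concrete, here is a minimal sketch, assuming nothing beyond serde, of how an OpenAI-style request can keep provider-specific knobs optional so unset fields never reach a provider that might reject them. The type and field names below are illustrative, not async-llm's actual definitions.

// Illustrative sketch only: these are NOT async-llm's actual types.
// The point is the pattern: provider-specific knobs live in Option
// fields and are skipped entirely when unset, so a provider that does
// not understand a field never receives the key at all.
use serde::{Deserialize, Serialize};

#[derive(Debug, Serialize, Deserialize)]
struct SketchMessage {
    role: String,
    content: String,
}

#[derive(Debug, Serialize, Deserialize)]
struct SketchChatRequest {
    model: String,
    messages: Vec<SketchMessage>,
    #[serde(skip_serializing_if = "Option::is_none")]
    temperature: Option<f32>,
    #[serde(skip_serializing_if = "Option::is_none")]
    max_tokens: Option<u32>,
    #[serde(skip_serializing_if = "Option::is_none")]
    stream: Option<bool>,
}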

Examples

  • openai: OpenAI example
  • openrouter: OpenRouter example
  • ollama: Ollama example
  • gemini: Gemini example

Usage
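To try the snippets below in your own project, add the crate from crates.io first (standard cargo usage, nothing specific to async-llm):

  cargo add async-llm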

// Assumed imports for these snippets: the examples treat ChatRequest,
// ChatMessage, and Error as re-exports from the crate root.
use async_llm::{ChatMessage, ChatRequest, Error};

async fn example_basic() -> Result<(), Error> {
    // Build an OpenAI-style chat request against gpt-4o-mini.
    let request = ChatRequest::new(
        "gpt-4o-mini",
        vec![
            ChatMessage::system("You are a helpful assistant"),
            ChatMessage::user("Who are you?"),
        ],
    );
    tracing::info!("request: \n{}", request.to_string_pretty()?);

    // Send the request and log the complete (non-streaming) response.
    let response = request.send().await?;
    tracing::info!("response: \n{}", response.to_string_pretty()?);

    Ok(())
}

async fn example_basic_stream() -> Result<(), Error> {
    // Same request as above, but opted into streaming responses.
    let request = ChatRequest::new(
        "gpt-4o-mini",
        vec![
            ChatMessage::system("You are a helpful assistant"),
            ChatMessage::user("Who are you?"),
        ],
    )
    .with_stream();
    tracing::info!("request: \n{}", request.to_string_pretty()?);

    // Consume the response chunk by chunk as the provider produces it.
    // Calling .next() on the stream assumes futures::StreamExt (or an
    // equivalent re-export) is in scope.
    let mut response = request.send_stream().await?;
    while let Some(result) = response.next().await {
        match result {
            Ok(response) => {
                tracing::info!("response: \n{}", response.to_string_pretty()?);
            }
            Err(e) => {
                tracing::error!("error = \n {e}");
            }
        }
    }

    Ok(())
}
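
To run snippets like these end to end, you also need an async runtime and a tracing subscriber for the log output. A minimal harness might look like the sketch below; tokio and tracing-subscriber are assumptions here, not requirements imposed by the crate, and provider credentials (e.g. an API key environment variable) are presumed to be configured separately.

// Minimal sketch of an entry point for the examples above.
// Assumes the tokio and tracing-subscriber crates.
#[tokio::main]
async fn main() -> Result<(), Error> {
    // Print the tracing::info!/error! output used in the examples.
    tracing_subscriber::fmt::init();

    example_basic().await?;
    example_basic_stream().await?;
    Ok(())
}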

Known Issues

  • Anthropic: requires a custom provider implementation, since Anthropic does not expose a /chat/completions endpoint directly. Support for it is currently a work in progress.

We are actively working to address these issues. If you encounter any problems or have suggestions, please feel free to open an issue or contribute a fix! 🛠️
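For context on what a custom provider implementation involves, the sketch below is purely hypothetical (none of these names come from async-llm's actual Provider trait): the idea is an adapter that accepts an OpenAI-style chat request and translates it to and from a vendor-specific endpoint such as Anthropic's /v1/messages.

// Hypothetical adapter shape, for illustration only.
// Assumes the async-trait and serde_json crates; nothing here is
// async-llm's real API.
use async_trait::async_trait;

#[async_trait]
trait HypotheticalChatProvider {
    /// Base URL of the vendor API, e.g. "https://api.anthropic.com".
    fn base_url(&self) -> &str;

    /// Translate an OpenAI-style request into the vendor's wire format,
    /// call the vendor endpoint, and map the reply back into an
    /// OpenAI-style response.
    async fn chat(
        &self,
        openai_style_request: serde_json::Value,
    ) -> Result<serde_json::Value, Box<dyn std::error::Error + Send + Sync>>;
}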

TODO

  • Add tests
  • Gemini integration
  • Ollama integration
  • Anthropic integration
  • Better error handling
  • Examples for custom Provider and HTTPClient
  • OpenAI Embedding API
  • Better HTTP Client with backon
  • Better documentation

Development

Install just

  cargo install just

Install additional tools

  just install

Start development

  just dev

Start development with a specific example

  just dev ollama

Start development with a specific example and RUST_LOG=trace

  just trace ollama

Run tests

  just test

Run selected tests with debug tracing

  just test-one chat

Generate test data

  just generate
