71 releases

new 0.1.87 Jan 15, 2025
0.1.85 Dec 5, 2024
0.1.84 Nov 27, 2024
0.1.64 Jul 26, 2024
0.1.45 Mar 27, 2024

#66 in Machine learning

Download history 98/week @ 2024-09-26 297/week @ 2024-10-03 437/week @ 2024-10-10 229/week @ 2024-10-17 218/week @ 2024-10-24 307/week @ 2024-10-31 171/week @ 2024-11-07 650/week @ 2024-11-14 239/week @ 2024-11-21 292/week @ 2024-11-28 363/week @ 2024-12-05 193/week @ 2024-12-12 100/week @ 2024-12-19 97/week @ 2024-12-26 311/week @ 2025-01-02 211/week @ 2025-01-09

743 downloads per month
Used in 5 crates

MIT/Apache

5.5MB
108K SLoC

C++ 50K SLoC // 0.1% comments C 34K SLoC // 0.1% comments CUDA 8K SLoC // 0.0% comments Metal Shading Language 4.5K SLoC // 0.1% comments Objective-C 3.5K SLoC // 0.0% comments Python 3.5K SLoC // 0.1% comments GLSL 3K SLoC // 0.0% comments Rust 2.5K SLoC // 0.0% comments

llama-cpp-rs-2

A wrapper around the llama-cpp library for rust.

Info

This is part of the project powering all the LLMs at utilityai, it is tightly coupled llama.cpp and mimics its API as closly as possible while being safe in order to stay up to date.

Dependencies

This uses bindgen to build the bindings to llama.cpp. This means that you need to have clang installed on your system.

If this is a problem for you, open an issue, and we can look into including the bindings.

See bindgen for more information.

Disclaimer

This crate is not safe. There is absolutly ways to misuse the llama.cpp API provided to create UB, please create an issue if you spot one. Do not use this code for tasks where UB is not acceptable.

This is not a simple library to use. In an ideal world a nice abstraction would be written on top of this crate to provide an ergonomic API - the benefits of this crate over raw bindings is safety (and not much of it as that) and not much else.

We compensate for this shortcoming (we hope) by providing lots of examples and good documentation. Testing is a work in progress.

Contributing

Contributions are welcome. Please open an issue before starting work on a non-trivial PR.

Dependencies

~0.6–3.5MB
~67K SLoC