5 stable releases

1.1.3 Oct 22, 2024
1.1.2 Jun 24, 2024
1.1.1 Jun 23, 2024
1.0.1 Jan 23, 2024
1.0.0 Aug 9, 2023

#97 in Text processing

Download history 3/week @ 2024-07-20 17/week @ 2024-07-27 13/week @ 2024-09-14 18/week @ 2024-09-21 4/week @ 2024-09-28 1/week @ 2024-10-12 128/week @ 2024-10-19 16/week @ 2024-10-26

146 downloads per month

MPL-2.0 license

100KB
2K SLoC

llmvm-core

Crates.io GitHub

The core application for llmvm which is responsible for managing the following:

  • Sending generation requests to backends
  • Managing message threads
  • Model presets
  • Prompt templates
  • Projects/workspaces

Installation

Install this application using cargo.

cargo install llmvm-core

Usage

The core can either be invoked directly, or via a frontend that utilizes core.

To invoke directly, execute llmvm-core -h for details.

Rust frontends should use build_core_service_from_config from the llmvm-protocol crate to create a stdio or HTTP client, to communicate with the core.

llmvm-core http-server can be invoked to create a HTTP server for remote clients.

Configuration

Run the core executable to generate a configuration file at:

  • Linux: ~/.config/llmvm/core.toml.
  • macOS: ~/Library/Application Support/com.djandries.llmvm/core.toml
  • Windows: AppData\Roaming\djandries\llmvm\config\core.toml
Key Required? Description
thread_ttl_secs No The max time-to-live for threads in seconds. Threads with a last modified time older than the TTL will be removed.
stdio_client No Configuration for all backend stdio clients. See llmvm-protocol for details.
http_backends.<backend name> No HTTP client configurations for remote backends. See llmvm-protocol for details.

Projects / workspaces

Projects can be created in given directory/workspace by running llmvm-core init-project. An .llmvm directory will be created in the current directory.

Inside the new directory, some subdirectories will be created for presets, prompts, threads, logs, config and weights.

Using projects can be useful for isolating the above resources for a given workspace. All llmvm commands called within a workspace with a project will read/write to/from the .llmvm subdirectories. The global directory for each resource type will be used as a fallback, if a specific resource does not exist.

Prompt templates

Handlebars templates can be saved and used for prompt generation.

Prompt templates are saved in the prompts directory of the current project directory, or global user data directory. The filename must end in .hbs.

The global prompt directory is located at:

  • Linux: ~/.local/share/llmvm/prompts.
  • macOS: ~/Library/Application Support/com.djandries.llmvm/prompts
  • Windows: AppData\Roaming\djandries\llmvm\data\prompts

System role prompts

The system_role block helper may be used to define a system role prompt. Here is an example:

{{#system_role}}
Do a good job.
{{/system_role}}

Anything outside of the block helper will be sent using the "user" role.

Built-in templates / template examples

The core comes with some built-in templates. See the prompts directory in this repo to see all built-in templates.

Threads

System, user and assistant messages can be stored in threads. If the frontend sets save_thread to true in the generation request, the prompt and model response will be saved to a new or existing thread. The generated thread ID will be provided in the response, via the thread_id key. Existing threads may be used in generation requests, by specifying existing_thread_id in the request.

Threads are stored as JSON arrays in the threads directory of the current project directory, or global user data directory.

The global threads directory is located at:

  • Linux: ~/.local/share/llmvm/threads.
  • macOS: ~/Library/Application Support/com.djandries.llmvm/threads
  • Windows: AppData\Roaming\djandries\llmvm\data\threads

Old threads will be automatically deleted, if they were not updated recently. If the last modified age exceeds the thread TTL (14 days by default), the thread will be deleted.

Presets

Presets contain generation parameters for core generation requests. The presets may contain prompt template ids, model ids, model parameters, prompt parameters and more. Here are the keys for a preset:

Key Description
model Model ID to use for generation.
prompt_template_id ID for a saved prompt template.
custom_prompt_template Text for a custom prompt template.
max_tokens Maximum amount of tokens to generate.
model_parameters Table of parameters for the model itself.
prompt_parameters Table of parameters for completing the prompt template.

Preset ids may be specified in generation requests, and preset settings may be overridden explicitly within the request.

Model presets are saved in the presets directory of the current project directory, or global user data directory. The filename must end in .toml.

The global preset directory is located at:

  • Linux: ~/.local/share/llmvm/presets.
  • macOS: ~/Library/Application Support/com.djandries.llmvm/presets
  • Windows: AppData\Roaming\djandries\llmvm\data\presets

Built-in presets / preset examples

The core comes with some built-in presets. See the presets directory in this repo to see all built-in presets.

Logging

The crate uses the tracing library to handle logging. By default, all logging is written to stderr. If the --log-to-file switch is provided, logs will be saved to the logs directory of the current project directory, or global user data directory.

The global logs directory is located at:

  • Linux: ~/.local/share/llmvm/logs.
  • macOS: ~/Library/Application Support/com.djandries.llmvm/logs
  • Windows: AppData\Roaming\djandries\llmvm\data\logs

License

Mozilla Public License, version 2.0

Dependencies

~26–38MB
~677K SLoC