9 releases (5 breaking)
0.6.2 | Nov 28, 2024 |
---|---|
0.6.0 | Sep 22, 2024 |
0.3.0 | Jul 27, 2024 |
#391 in Rust patterns
55KB
890 lines
static-keys
Static key is a mechanism used by Linux kernel to speed up checks of seldomly changed features. We brought it to Rust userland applications for Linux, macOS and Windows! (Currently nightly Rust required. For reasons, see FAQ).
Currently CI-tested platforms:
-
Linux
x86_64-unknown-linux-gnu
x86_64-unknown-linux-musl
i686-unknown-linux-gnu
aarch64-unknown-linux-gnu
riscv64gc-unknown-linux-gnu
loongarch64-unknown-linux-gnu
-
macOS
aarch64-apple-darwin
-
Windows
x86_64-pc-windows-msvc
i686-pc-windows-msvc
-
Bare metal (No CI)
- Should work with above-mentioned architectures. For more detail, see FAQ.
Note that when using cross-rs to build loongarch64-unknown-linux-gnu
target, you should use latest cross-rs avaiable on GitHub. See Evian-Zhang/static-keys#4 for more details.
For more comprehensive explanations and FAQs, you can refer to GitHub Pages (中文版文档).
Motivation
It's a common practice for modern applications to be configurable, either by CLI options or config files. Those values controlled by configuration flags are usually not changed after application initialization, and are frequently accessed during the whole application lifetime.
let flag = CommandlineArgs::parse();
loop {
if flag {
do_something();
}
do_common_routines();
}
Although flag
will not be modified after application initialization, the if
-check still happens in each round, and in x86-64, it may be compiled to the following test
-jnz
instructions.
test eax, eax ; Check whether eax register is 0
jnz do_something ; If not zero, jump to do_something
do_common_routines:
; Do common routines
ret
do_something:
; Do something
jmp do_common_routines ; Jump to do_common_routines
Although the if
-check is just test
-jnz
instructions, it can still be speedup. What about making the check just a jmp
(skip over the do_something
branch) or nop
(always do_something
)? This is what static keys do. To put it simply, we modify the instruction at runtime. After getting the flag
passed from commandline, we dynamically modify the if flag {}
check to be a jmp
or nop
according to the flag
value.
For example, if user-specified flag
is false
, the assembled instructions will be dynamically modified to the following nop
instruction.
nop DWORD PTR [rax+rax*1+0x0]
do_common_routines:
; Do common routines
ret
do_something:
; Do something
jmp do_common_routines
If flag
is true
, then we will dynamically modify the instruction to an unconditional jump instruction:
jmp do_something
do_common_routines:
; Do common routines
ret
do_something:
; Do something
jmp do_common_routines
There is no more test
and conditional jumps, just a nop
(which means this instruction does nothing) or jmp
.
Although replacing a test
-jnz
pair to nop
may be minor improvement, however, as documented in linux kernel, if the configuration check involves global variables, this replacement can decrease memory cache pressure. And in server applications, such configuration may be shared between multiple threads in Arc
s, which has much more overhead than just nop
or jmp
.
Usage
To use this crate, currently nightly Rust is required. And in the crate root top, you should declare usage of unstable features asm_goto
.
#![feature(asm_goto)]
First, add this crate to your Cargo.toml
:
[dependencies]
static-keys = "0.6"
At the beginning of main
function, you should invoke static_keys::global_init
to initialize.
fn main() {
static_keys::global_init();
// Do other things...
}
Then you should define a static key to hold the value affected by user-controlled flag, and enable or disable it according to the user passed flag.
// FLAG_STATIC_KEY is defined with initial value `false`
define_static_key_false!(FLAG_STATIC_KEY);
fn application_initialize() {
let flag = CommandlineArgs::parse();
if flag {
unsafe {
FLAG_STATIC_KEY.enable();
}
}
}
Note that you can enable or disable the static key any number of times at any time. And more importantly, it is very dangerous if you modify a static key in a multi-threads environment. Always spawn threads after you complete the modification of such static keys. And to make it more clear, it is absolutely safe to use this static key in multi-threads environment as below. The modification of static keys may be less efficient, while since the static keys are used to seldomly changed features, the modifications rarely take place, so the inefficiency does not matter. See FAQ for more explanation.
After the definition, you can use this static key at if
-check as usual (you can see here and here to know more about the likely
-unlikely
API semantics). A static key can be used at multiple if
-checks. If the static key is modified, all locations using this static key will be modified to jmp
or nop
accordingly.
fn run() {
loop {
if static_branch_unlikely!(FLAG_STATIC_KEY) {
do_something();
}
do_common_routines();
}
}
References
- The Linux kernel official documentation : static-keys
- Linux
static_key
internals - Rust for Linux also has an implementation for static keys, please refer to Rust-for-Linux/linux#1084 for more information. My implementaion's
break
-based inline assembly layout is inspired by this great work.
Dependencies
~0–33MB
~469K SLoC