5 releases
Version | Released |
---|---|
0.4.2 | Mar 11, 2020 |
0.4.1 | Mar 11, 2020 |
0.4.0 | Mar 9, 2020 |
0.3.1 | Feb 25, 2020 |
0.3.0 | Feb 25, 2020 |
PerfTacho
PerfTacho is a small application for computing performance statistics of an executable.
The executable can be any binary file or a shell script. PerfTacho measures the total execution time and can also capture execution times that the executable writes to its output. PerfTacho can run the executable multiple times and compute statistics from the recorded performance data.
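The core idea, running a command several times and recording each wall-clock duration, can be sketched in a few lines of Python. This is only an illustration and not PerfTacho's implementation (PerfTacho itself is built with Rust and Cargo); the function name `time_command` is hypothetical.

```python
import subprocess
import sys
import time


def time_command(argv, repeat=1):
    """Run argv `repeat` times and return each wall-clock duration in ms."""
    durations_ms = []
    for _ in range(repeat):
        start = time.perf_counter()
        # Output is captured so it does not clutter the terminal. The exit
        # code is ignored here, so failing runs are timed as well.
        subprocess.run(argv, capture_output=True, check=False)
        durations_ms.append((time.perf_counter() - start) * 1000.0)
    return durations_ms


if __name__ == "__main__":
    # Time a trivial command: the current Python interpreter doing nothing.
    print(time_command([sys.executable, "-c", "pass"], repeat=3))
```

The measured values include process start-up time, which is why repeated runs of the same command rarely produce identical durations.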
Usage
perftacho [-tachoOptions] <command> [params]
Available options:
-tachoRepeat=<n> repeats the execution n times and calculates statistics
-tachoShowDetails together with -tachoRepeat: show each individual duration
-tachoShowOutput displays the output of the executed command
-tachoASCII together with -tachoShowOutput: filter out non-ASCII characters in output
-tachoTag=<tag> adds an informational tag to the output
-tachoRegEx[=<re>] capture durations from output (see documentation below)
Example
perftacho -tachoTag=MyTest -tachoRepeat=5 -tachoShowDetails curl https://www.google.com
perftacho -tachoShowOutput ls -l
Calculated Statistics
When PerfTacho is used in repeated execution mode (-tachoRepeat=n), the following statistics are calculated:
- avg: the arithmetic average of the execution time
- 95% conf. interval: the interval [avg-d, avg+d] that contains the true avg
with 95% confidence
- min: minimum execution time
- max: maximum execution time
- std_dev: the standard deviation of the execution time
- n_recommended: the recommended number of repetitions to get a confidence interval
within +/- 5% of the avg value, i.e. [avg-5%, avg+5%]
Some mathematical background: The goal is to estimate the "true" average execution time. What is the true average execution time, and why "true"? Try the following with a sample executable: run PerfTacho with, let's say, 5 repetitions (-tachoRepeat=5). It calculates the average execution time for this sample of 5 executions. Now do it again and again. You will see that the calculated average is similar for each experiment but not exactly the same. The statistics calculated for a given sample of executions are only estimates of the "true" values.
The variation of the calculated average depends on the variation of the individual measurements, which is expressed through the standard deviation std_dev. A small standard deviation means you need only a few measurements to calculate a reasonable approximation of the average execution time. Conversely, a high value means you need more measurements.
The 95% confidence interval is a range for which you can be 95% confident that it contains the "true" average execution time. If the measured data varies strongly, the range will be quite broad. To narrow it, increase the number of measurements (-tachoRepeat). n_recommended estimates the number of measurements you will probably need to get a confidence interval of +/- 5% around the average value.
See https://en.wikipedia.org/wiki/Confidence_interval for more information.
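As a rough illustration of these statistics, the following Python sketch computes them for one series of durations. It rests on several assumptions that this document does not confirm: a normal approximation with the factor 1.96, the sample standard deviation (dividing by n - 1), and n_recommended rounded to the nearest integer with a minimum of 1. These choices reproduce the figures in the sample output shown later in this document, but whether PerfTacho computes them exactly this way is an assumption. The name `summarize` is hypothetical.

```python
import math
import statistics

Z_95 = 1.96  # two-sided 95% quantile of the standard normal distribution


def summarize(durations_ms, rel_margin=0.05):
    """Return avg, std_dev, CI half-width d and an estimate of n_recommended."""
    n = len(durations_ms)
    avg = statistics.mean(durations_ms)
    # Sample standard deviation (divides by n - 1); needs at least 2 values.
    std_dev = statistics.stdev(durations_ms) if n > 1 else 0.0
    # Half-width d of the interval [avg - d, avg + d].
    d = Z_95 * std_dev / math.sqrt(n)
    # Solve Z_95 * std_dev / sqrt(n) <= rel_margin * avg for n.
    if avg > 0:
        n_needed = (Z_95 * std_dev / (rel_margin * avg)) ** 2
    else:
        n_needed = 0.0
    # Assumed rounding rule: nearest integer, but never fewer than 1 run.
    n_recommended = max(1, round(n_needed))
    return {"avg": avg, "std_dev": std_dev, "d": d,
            "min": min(durations_ms), "max": max(durations_ms),
            "n_recommended": n_recommended}


if __name__ == "__main__":
    s = summarize([102, 116, 114, 105, 112])
    print(f"avg {s['avg']:.2f} ms, d {s['d']:.2f}, "
          f"std_dev {s['std_dev']:.2f}, n_recommended {s['n_recommended']}")
    # prints: avg 109.80 ms, d 5.27, std_dev 6.02, n_recommended 5
```

For small samples a Student's t quantile would give a wider and more cautious interval than 1.96; the normal approximation is used here only because it matches the sample figures.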
Typical use cases
Evaluating performance of an executable and its algorithms
A software developer wants to evaluate the performance of an executable that comprises several algorithms. Typically, the developer wants to measure the total execution time and the individual execution time of each algorithm to track where the elapsed time is spent. For this purpose, the developer only needs to surround the execution of each algorithm with execution time logging and write the measured execution time to stdout.
Example (Java):
long startTime = System.currentTimeMillis();
runMyAlgorithm();
long duration = System.currentTimeMillis() - startTime;
System.out.printf("Duration MyAlgorithm [%d ms]%n", duration);
The executable should produce the following output:
Duration MyAlgorithm [321 ms]
If multiple algorithms are to be measured, add one such output for each of them, e.g.:
Algo1 [321 ms] ... some other output ... Algo2 [456 ms]
PerfTacho parses the output of the executable and captures the performance data with a regular expression. With the current default regular expression
"\[(\-?\d+[\.,]?\d*)\s?(s|ms|ns)\]"
it matches the following output examples:
[123.0 ms] [123.0ms] [123 s] [123,45 s] [12345ns]
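To see how the default regular expression above picks durations out of program output, here is a small Python sketch. The pattern is copied from the text; the conversion to milliseconds and the handling of a decimal comma are assumptions about reasonable behaviour, not a description of PerfTacho's internals. The names `capture_durations_ms` and `TO_MS` are hypothetical.

```python
import re

# Default pattern quoted in the text above.
DEFAULT_PATTERN = re.compile(r"\[(\-?\d+[\.,]?\d*)\s?(s|ms|ns)\]")

# Factor for converting each supported unit to milliseconds.
TO_MS = {"s": 1000.0, "ms": 1.0, "ns": 1e-6}


def capture_durations_ms(output):
    """Return all durations found in `output`, converted to milliseconds."""
    durations = []
    for number, unit in DEFAULT_PATTERN.findall(output):
        # Assumption: a decimal comma is treated like a decimal point.
        value = float(number.replace(",", "."))
        durations.append(value * TO_MS[unit])
    return durations


if __name__ == "__main__":
    line = "Algo1 [321 ms] ... some other output ... Algo2 [456 ms]"
    print(capture_durations_ms(line))  # prints: [321.0, 456.0]
```

Text outside the square brackets is ignored, so labels such as Algo1 serve only the human reader; the captured values are identified by their order of appearance.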
The following command runs the executable in multiple passes and computes statistics from the recorded performance data:
perftacho -tachoRepeat=5 -tachoShowDetails -tachoRegEx MyProgram
The output might look like this:
Tacho : duration in ms
1012.00 102.00 100.02 119.00
1014.00 116.00 100.01 107.00
1014.00 114.00 100.02 115.00
1013.00 105.00 100.01 103.00
1014.00 112.00 100.02 105.00
Tacho : avg: 1013.40ms / 95% conf. interval 0.78 / min: 1012ms / max: 1014ms / stddev 0.89 ms / n_recommended 1
Tacho : avg: 109.80ms / 95% conf. interval 5.27 / min: 102ms / max: 116ms / stddev 6.02 ms / n_recommended 5
Tacho : avg: 100.01ms / 95% conf. interval 0.00 / min: 100.009ms / max: 100.018ms / stddev 0.00 ms / n_recommended 1
Tacho : avg: 109.80ms / 95% conf. interval 6.02 / min: 103ms / max: 119ms / stddev 6.87 ms / n_recommended 6
In this example the executable "MyProgram" was run 5 times (-tachoRepeat=5 option). PerfTacho captured 3 performance data items from the output of each run. The table above is generated from this data and is displayed because of the -tachoShowDetails option.
The first column shows the total execution time, the last 3 columns show the captured execution time data from the output. All data is transformed to milliseconds. For each column a row with statistics is computed and displayed below the table.
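The table layout described above (one row per run, one column per measured quantity) can be turned into per-column summaries by transposing the rows. The Python sketch below uses the five sample rows from the output above; the function name `column_summaries` and the labels are hypothetical and serve only as illustration.

```python
import statistics

# One row per run: total execution time first, then the captured durations.
rows = [
    [1012.00, 102.00, 100.02, 119.00],
    [1014.00, 116.00, 100.01, 107.00],
    [1014.00, 114.00, 100.02, 115.00],
    [1013.00, 105.00, 100.01, 103.00],
    [1014.00, 112.00, 100.02, 105.00],
]

# Transpose: each tuple now holds all runs of one measured quantity.
columns = list(zip(*rows))


def column_summaries(cols):
    """Return (label, avg, min, max) for each column."""
    result = []
    for i, col in enumerate(cols):
        label = "total" if i == 0 else f"captured #{i}"
        result.append((label, statistics.mean(col), min(col), max(col)))
    return result


if __name__ == "__main__":
    for label, avg, lo, hi in column_summaries(columns):
        print(f"{label}: avg {avg:.2f} ms / min {lo:.2f} ms / max {hi:.2f} ms")
```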
Installation
PerfTacho is a self-contained binary file without any dependencies. You have the following options:
Build and install the binary from GitHub sources
You need to have Rust and Cargo installed. Please consult https://doc.rust-lang.org/cargo/getting-started/installation.html
Get the sources:
git clone https://github.com/qrider71/tacho.git
cd tacho
cargo build --release
cd target/release
Copy the perftacho binary to your binary folder (which should be in your PATH),
e.g. on Linux:
sudo cp perftacho /usr/local/bin/
Get and build the latest release from crates
You need to have Rust and Cargo installed. Please consult https://doc.rust-lang.org/cargo/getting-started/installation.html
cargo install perftacho
Cargo installs the compiled binary into its bin folder (by default ~/.cargo/bin), which should be in your PATH.
Mac OSX
You can install from sources as described above or download the binary from GitHub: https://github.com/qrider71/tacho/releases
You should pick the file perftacho-osx-x.y.z and copy it to your bin folder (which should be in your PATH)
Alternatively, you can install with Homebrew (https://brew.sh/):
brew tap qrider71/perftacho
brew install perftacho
Linux
You can install from sources as described above or download the binary from GitHub: https://github.com/qrider71/tacho/releases
You should pick the file perftacho-linux-x.y.z and copy it to your bin folder (which should be in your PATH)
Windows
You can install from sources as described above or download the binary from GitHub: https://github.com/qrider71/tacho/releases
You should pick the file perftacho-windows-x.y.z.exe and copy it to your bin folder (which should be in your PATH)
Roadmap
I plan the following roadmap. Please contact me if you have any comments or ideas for useful features.
V.1.0
Version 1.0 provides all command line tool features for measuring the performance of a single executable.
- Simple measurement of the runtime duration (implemented)
- Statistical analysis of multiple runs (implemented)
- Grabbing performance data from the output of the executable with regular expressions (implemented)
V.2.0
Version 2.0 provides features for comparing the performance of several executables. It can be used for benchmarking different implementations. The execution plans can be specified in a configuration file.
- Configuration file (YAML) based performance testing plans for several executables
- Exporting the results in different formats
V.3.0
Version 3.0 provides multi-threading options for running the performance tests. The focus is on measuring performance under multi-threaded access and stress test conditions.
- multi-threading options for running the performance tests