#parquet #viewing #egui #built #metadata #query #format

app parqbench

A utility for viewing parquet files built on egui

4 releases

0.2.0 Sep 25, 2022
0.1.2 Sep 24, 2022
0.1.1 Sep 24, 2022
0.1.0 Sep 23, 2022

#15 in #viewing

GPL-3.0-or-later

33KB
701 lines

ParqBench

A simple, cross-platform, utility for viewing parquet files, build on egui and arrow.

TODO List

  • CI for builds/releases
  • load partitioned dataset
  • tab layout/tree
  • add controls and metadata to side panels
  • parse pandas format metadata
  • open with hooks (cmd line args)
  • notification for errors
  • support all filetypes supported by datafusion
  • Update datafusion and egui/eframe
  • Auto resize columns
  • revisit &str vs String usages
  • Ui for basic query operations
  • source configuration in query pane
  • rich metadata with parquet crate

Installation

Generic, portable binaries for Windows and Linux are available on the releases.

ParqBench is tested for Linux (Manjaro 22.0, kernel 5.15.60) and Windows 10 (21H2). Releases are built with:

cargo build --release --target x86_64-pc-windows-gnu cargo build --release --target x86_64-unknown-linux-gnu

The builds are self-contained, portable, executables. The resulting binaries are placed in target/<target>.

Dependencies

~51–87MB
~1.5M SLoC