
app csv2pq

CSV to Apache Parquet converter

2 releases

Uses new Rust 2024

0.1.1 Mar 27, 2025
0.1.0 Mar 4, 2025


MIT/Apache

15KB
318 lines

csv2pq — CSV to Apache Parquet converter


Installation

Install Rust with Cargo, then run:

cargo install csv2pq

Usage examples

csv2pq somedata.csv.gz

produces somedata.parquet.

csv2pq --rm somedata.csv

produces somedata.parquet and removes the original csv file.
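Both examples above map the input name to somedata.parquet. The sketch below shows one way such a mapping could be written with the standard library. The function name parquet_path is hypothetical, and the rule it applies (drop an optional .gz suffix, then an optional .csv suffix, then append .parquet) is an assumption inferred from the two examples, not a description of csv2pq's actual source.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical helper (not taken from csv2pq): derive the Parquet output
/// path from a CSV input path, keeping the file in the same directory.
fn parquet_path(input: &Path) -> PathBuf {
    // Take the final path component as text; empty if there is none.
    let file_name = input
        .file_name()
        .map(|n| n.to_string_lossy().into_owned())
        .unwrap_or_default();

    // Assumed rule: strip ".gz" if present, then ".csv" if present.
    let without_gz = file_name.strip_suffix(".gz").unwrap_or(file_name.as_str());
    let stem = without_gz.strip_suffix(".csv").unwrap_or(without_gz);

    // Replace only the file name, so any directory prefix is preserved.
    input.with_file_name(format!("{stem}.parquet"))
}
```

Under these assumptions, calling parquet_path on somedata.csv.gz or on somedata.csv yields somedata.parquet in both cases, which matches the behaviour described above.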

csv2pq --f64='*' --f32=col1,col2 --i32='*' --i64=col10 --i64=col11

sets the default float and integer column types to Float64 and Int32, then overrides col1 and col2 to Float32 and col10 and col11 to Int64.
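The interplay between '*' defaults and per-column overrides can be illustrated with a small sketch. Everything here is hypothetical: the names ColType, Kind, parse_rule and resolve are invented for illustration, and the precedence it implements (an explicitly named column wins, otherwise the '*' rule for the column's inferred kind applies) is an assumption based on the example above, not csv2pq's actual implementation.

```rust
/// Numeric kind a column is assumed to have been inferred as.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Kind {
    Float,
    Int,
}

/// Target types named by the flags in the example above.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ColType {
    Float32,
    Float64,
    Int32,
    Int64,
}

impl ColType {
    fn kind(self) -> Kind {
        match self {
            ColType::Float32 | ColType::Float64 => Kind::Float,
            ColType::Int32 | ColType::Int64 => Kind::Int,
        }
    }
}

/// Hypothetical: turn one flag value such as "col1,col2" or "*" into a rule.
fn parse_rule(ty: ColType, value: &str) -> (ColType, Vec<String>) {
    (ty, value.split(',').map(|s| s.trim().to_string()).collect())
}

/// Assumed precedence: a rule that names the column explicitly wins;
/// otherwise the "*" rule of the matching kind applies; otherwise None.
fn resolve(column: &str, inferred: Kind, rules: &[(ColType, Vec<String>)]) -> Option<ColType> {
    if let Some((ty, _)) = rules
        .iter()
        .find(|(_, cols)| cols.iter().any(|c| c == column))
    {
        return Some(*ty);
    }
    rules
        .iter()
        .find(|(ty, cols)| ty.kind() == inferred && cols.iter().any(|c| c == "*"))
        .map(|(ty, _)| *ty)
}
```

With the five flags from the example turned into rules, this sketch resolves col1 to Float32 and col10 to Int64, while any other float column falls back to Float64 and any other integer column to Int32. A None result stands for "no flag applies", where the real tool would use whatever default it has.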

Parquet and Arrow underlying implementation

This project is just a CLI for the Apache Arrow implementation in Rust.

Other converters

If this utility doesn't fit your needs, try arrow-tools.

Dependencies

~32MB
~712K SLoC