app discogs2csv

Converts a Discogs releases dump into a CSV

1 unstable release

0.1.0 Apr 6, 2023

MIT license

10KB
166 lines

Discogs2csv

A little tool that converts a Discogs release XML dump into a CSV.

Installation

cargo install discogs2csv

Usage

First download a release dump from the Discogs website:

curl -O 'https://discogs-data-dumps.s3-us-west-2.amazonaws.com/data/2023/discogs_20230301_releases.xml.gz'

Then simply feed it to the discogs2csv command:

gunzip --stdout discogs_20230301_releases.xml.gz | discogs2csv > tracks.csv
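
The columns of the resulting CSV are not listed here, so a quick way to see them is to read the header and the first few rows. Below is a minimal Python sketch, assuming tracks.csv starts with a header row. The `preview_csv` helper is hypothetical and not part of discogs2csv.

```python
import csv
from itertools import islice


def preview_csv(path, limit=5):
    """Return the column names and the first `limit` rows of a CSV file."""
    # newline="" lets the csv module handle quoted fields that contain line breaks.
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        # islice stops early, so a multi-gigabyte file is never read in full.
        rows = list(islice(reader, limit))
        return reader.fieldnames, rows
```

Calling `preview_csv("tracks.csv")` returns the header as a list of column names and up to five rows as dictionaries, with every value still a string.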

Optionally, you can convert this CSV into typed newline-delimited JSON (one JSON object per line):

cargo install csv2ndjson-lite
cat tracks.csv | csv2ndjson-lite --arrays genre --numbers id released-timestamp duration-float > tracks.ndjson
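
To illustrate what "typed" means in that command: every CSV field is a string, and the flags name the columns that should be emitted as arrays or as numbers instead. The Python sketch below mimics the idea only; it is not how csv2ndjson-lite is implemented. The `typed_row` and `to_ndjson_line` helpers are hypothetical, and the `;` separator for array columns is an assumption, so check how multi-valued fields are actually encoded in your CSV.

```python
import json


def typed_row(row, arrays=(), numbers=(), separator=";"):
    """Convert a dict of CSV strings into a dict with typed values."""
    typed = {}
    for key, value in row.items():
        if key in arrays:
            # Assumed encoding: several values joined by `separator` in one field.
            typed[key] = value.split(separator) if value else []
        elif key in numbers:
            if value == "":
                # An empty field becomes null in the JSON output.
                typed[key] = None
            else:
                # Prefer an integer, fall back to a float.
                try:
                    typed[key] = int(value)
                except ValueError:
                    typed[key] = float(value)
        else:
            typed[key] = value
    return typed


def to_ndjson_line(row, **kwargs):
    """Serialize one typed row as a single JSON line."""
    return json.dumps(typed_row(row, **kwargs), ensure_ascii=False)
```

Feeding each row from `csv.DictReader` through `to_ndjson_line(row, arrays=["genre"], numbers=["id", "duration-float"])` and writing one result per line would produce output similar in shape to tracks.ndjson.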

Dependencies

~3.5MB
~53K SLoC