#url #wayback #archive #machine #cli #cli-tool #internet

bin+lib wayback-archiver

CLI tool for archiving URLs to the Internet Archive's "Wayback Machine"

2 releases

0.1.1 Sep 18, 2021
0.1.0 Sep 18, 2021

MIT license

16KB
277 lines

wayback-archiver

CLI archival tool for the Wayback Machine

Installation

$ cargo install wayback-archiver

Examples

$ wayback-archiver google.com

$ wayback-archiver --urls-file urls.txt --out archive.json

$ printf "google.com\nwikipedia.org\ngithub.com\n" | wayback-archiver --out=archive.json --merge
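
The --urls-file option reads URLs from a file. Below is a minimal sketch of preparing such a file. It assumes the file holds one URL per line, which the newline-separated stdin example suggests but which is not documented here. The file names urls.txt and archive.json are only illustrative, and combining --urls-file with --merge is assumed to work the same way as the stdin form.

```shell
# Write one URL per line (assumed input format for --urls-file).
printf '%s\n' google.com wikipedia.org github.com > urls.txt

# Run the archiver only if it is installed; this step needs network access.
if command -v wayback-archiver >/dev/null 2>&1; then
    # --merge keeps any results already stored in archive.json.
    wayback-archiver --urls-file urls.txt --out archive.json --merge
fi
```

Using printf with a '%s\n' format avoids relying on how a particular shell's echo treats backslash escapes.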

Usage

USAGE:
    wayback-archiver [FLAGS] [OPTIONS] [URLS]...

ARGS:
    <URLS>...    URLs to archive using the Wayback Machine. URLs can also be provided using
                 stdin, or with --urls-file

FLAGS:
    -h, --help       Print help information
    -m, --merge      If set, the results are merged with the (existing) contents of the --out file
    -V, --version    Print version information

OPTIONS:
    -o, --out <OUT>                If set, archived URLs are saved to the path specified by this
                                   flag. Otherwise, URLs are printed at the end of the command run
    -u, --urls-file <URLS_FILE>    A file containing urls to archive

Attribution

This tool only functions because of the Internet Archive. Please consider donating to their cause.

Dependencies

~11–25MB
~386K SLoC