16 releases (breaking)
Uses old Rust 2015
0.13.1 | Dec 1, 2018 |
---|---|
0.13.0 | Oct 15, 2018 |
0.12.1 | Jun 18, 2017 |
0.11.0 | Mar 12, 2017 |
0.4.0 | Jun 7, 2016 |
#1436 in Database interfaces
46 downloads per month
18KB
424 lines
Maman
Maman is a Rust Web Crawler saving pages on Redis.
Pages are send to list <MAMAN_ENV>:queue:maman
using
Sidekiq job format
{
"class": "Maman",
"jid": "b4a577edbccf1d805744efa9",
"retry": true,
"created_at": 1461789979, "enqueued_at": 1461789979,
"args": {
"document":"<html><body><a href='#' /><a href='/new' /></html>",
"urls": ["https://example.net/new"],
"headers": {"content-type": "text/html"},
"url": "https://example.net/"
}
}
Dependencies
Installation
With cargo
cargo install maman
With make
PREFIX=~/.local make install
Usage
maman URL [LIMIT] [MIME_TYPES]
LIMIT
must be an integer or 0
is the default, meaning no limit.
Environment variables
Defaults
- MAMAN_ENV=development
- REDIS_URL="redis://127.0.0.1/"
Others
- RUST_LOG=maman=info
LICENSE
The MIT License
Copyright (c) 2016-2018 Laurent Arnoud laurent@spkdev.net
Dependencies
~25–35MB
~615K SLoC