maman

Crates.iomaman
lib.rsmaman
version0.13.1
sourcesrc
created_at2016-05-03 20:13:45.227109
updated_at2018-12-01 16:05:45.084095
descriptionRust Web Crawler
homepagehttps://github.com/spk/maman
repositoryhttps://github.com/spk/maman.git
max_upload_size
id4964
size28,132
Laurent Arnoud (spk)

documentation

README

Maman

Maman is a Rust Web Crawler saving pages on Redis.

Pages are send to list <MAMAN_ENV>:queue:maman using Sidekiq job format

{
"class": "Maman",
"jid": "b4a577edbccf1d805744efa9",
"retry": true,
"created_at": 1461789979, "enqueued_at": 1461789979,
"args": {
    "document":"<html><body><a href='#' /><a href='/new' /></html>",
    "urls": ["https://example.net/new"],
    "headers": {"content-type": "text/html"},
    "url": "https://example.net/"
    }
}

Dependencies

Installation

With cargo

cargo install maman

With make

PREFIX=~/.local make install

Usage

maman URL [LIMIT] [MIME_TYPES]

LIMIT must be an integer or 0 is the default, meaning no limit.

Environment variables

Defaults

  • MAMAN_ENV=development
  • REDIS_URL="redis://127.0.0.1/"

Others

  • RUST_LOG=maman=info

LICENSE

The MIT License

Copyright (c) 2016-2018 Laurent Arnoud laurent@spkdev.net


Build Version Documentation License Project status Dependency status

Commit count: 147

cargo fmt