urlharvest

Crates.io: urlharvest
lib.rs: urlharvest
version: 0.5.5
source: src
created_at: 2021-09-16 11:27:43.292715
updated_at: 2021-11-03 10:32:13.832299
description: URL harvester for IRC irssi logs, no bot
homepage:
repository: https://github.com/sjm42/urlharvest-rs
max_upload_size:
id: 452192
size: 121,994
Sami J. Mäkinen (sjm42)


README

URL harvester for IRC, no bot

  • Harvest URLs from irssi IRC client logs and insert them into an SQLite db.
  • Fetch metadata, e.g. the page title, and update the db.
  • Generate HTML pages.
  • Implement a search page.
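As an illustration of the metadata step in the list above, here is a minimal sketch of pulling a page title out of HTML that has already been fetched. The function name `extract_title` is hypothetical and this is not the crate's actual code; it uses plain string search from the standard library rather than an HTML parser, so it will miss unusual markup.

```rust
/// Hypothetical helper: return the text between the first `<title ...>` and
/// `</title>`, matched case-insensitively, with whitespace runs collapsed.
fn extract_title(html: &str) -> Option<String> {
    // ASCII lowercasing keeps byte offsets identical to the original string.
    let lower = html.to_ascii_lowercase();
    let open = lower.find("<title")?;
    let start = open + lower[open..].find('>')? + 1;
    let end = start + lower[start..].find("</title")?;
    let title = html[start..end]
        .split_whitespace()
        .collect::<Vec<_>>()
        .join(" ");
    if title.is_empty() {
        None
    } else {
        Some(title)
    }
}

fn main() {
    let html = "<html><head><title>Example  Domain</title></head></html>";
    match extract_title(html) {
        Some(title) => println!("title: {title}"),
        None => println!("no title found"),
    }
}
```

A real harvester would also need to fetch the page, decode its character set, and handle HTML entities, none of which is shown here.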

Please note: this harvester tails your IRC client (irssi) logs on disk and neither needs nor includes an IRC bot of any kind. Consider running irssi on a cloud VM to stay "always connected" :-)
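To make the idea of tailing a log file concrete, here is a minimal sketch that remembers a byte offset and returns only the complete lines appended since then. The function name `read_new_lines` and the demo file name are hypothetical, and this is not the crate's actual code; it assumes the log is valid UTF-8 and uses only the standard library.

```rust
use std::fs::File;
use std::io::{self, BufRead, BufReader, Seek, SeekFrom};
use std::path::Path;

/// Hypothetical helper: read the complete lines appended to `path` since byte
/// `offset`, returning them together with the offset to resume from.
fn read_new_lines(path: &Path, offset: u64) -> io::Result<(Vec<String>, u64)> {
    let mut file = File::open(path)?;
    // If the file shrank (for example after log rotation), start from the top.
    let len = file.metadata()?.len();
    let mut pos = if offset > len { 0 } else { offset };
    file.seek(SeekFrom::Start(pos))?;

    let mut reader = BufReader::new(file);
    let mut lines = Vec::new();
    let mut buf = String::new();
    loop {
        buf.clear();
        let n = reader.read_line(&mut buf)?;
        // Stop at end of file, or at a partial line that is still being written.
        if n == 0 || !buf.ends_with('\n') {
            break;
        }
        pos += n as u64;
        lines.push(buf.trim_end().to_string());
    }
    Ok((lines, pos))
}

fn main() -> io::Result<()> {
    // Demo file in the temp directory; a real log path would go here instead.
    let path = std::env::temp_dir().join("urlharvest_tail_demo.log");
    std::fs::write(&path, "first line\n")?;
    let (lines, offset) = read_new_lines(&path, 0)?;
    println!("{lines:?} (resume at byte {offset})");
    std::fs::remove_file(&path)?;
    Ok(())
}
```

Calling the function in a loop with a short sleep, and feeding the returned offset back in, gives a simple polling tail.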

Any other kind of chat log source would be trivial to implement. Basically, the chat text is just scanned with a regex, and any detected URLs are saved and indexed.
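The scanning step can be sketched roughly as follows. To stay dependency-free, this example swaps the regex for plain prefix matching on whitespace-separated tokens, so it is a simplification and not the crate's actual pattern; the function name `find_urls` and the sample log line are hypothetical.

```rust
/// Hypothetical helper: return every whitespace-delimited token in `line`
/// that starts with `http://` or `https://` (prefix check, not a regex).
fn find_urls(line: &str) -> Vec<&str> {
    line.split_whitespace()
        // Strip punctuation that commonly wraps a URL in chat text.
        .map(|tok| {
            tok.trim_matches(|c: char| matches!(c, '<' | '>' | '(' | ')' | '"' | '\'' | ','))
        })
        .filter(|tok| {
            let lower = tok.to_ascii_lowercase();
            // Require at least one character after the scheme.
            (lower.starts_with("http://") && tok.len() > "http://".len())
                || (lower.starts_with("https://") && tok.len() > "https://".len())
        })
        .collect()
}

fn main() {
    // Made-up chat line for illustration only.
    let line = "12:34 <nick> see https://example.org/page, and (http://example.com)";
    for url in find_urls(line) {
        println!("{url}");
    }
}
```

Because the function only takes a line of text, any other chat log source would just need to supply its lines in the same way.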

Commit count: 731
