| Crates.io | urlharvest |
| lib.rs | urlharvest |
| version | 0.5.5 |
| created_at | 2021-09-16 11:27:43.292715+00 |
| updated_at | 2021-11-03 10:32:13.832299+00 |
| description | URL harvester for IRC irssi logs, no bot |
| homepage | |
| repository | https://github.com/sjm42/urlharvest-rs |
| max_upload_size | |
| id | 452192 |
| size | 121,994 |
Please note: this harvester is tailing your IRC client (irssi) logs on disk and does not need or include an ircbot of any kind. Consider running your irssi on a cloud vm to stay "always connected" :-)
Any other kind of chat log source would be trivial to implement. Basically, the chat text is just scanned with regex match and detected URLs are saved & indexed.