Crates.io | boilerpipe |
lib.rs | boilerpipe |
version | 0.6.0 |
source | src |
created_at | 2020-11-12 12:16:47.493405 |
updated_at | 2021-08-10 12:29:44.087317 |
description | Library for text extraction from HTML documents |
homepage | |
repository | https://github.com/0nkery/boilerpipe-rs |
max_upload_size | |
id | 311537 |
size | 2,049,756 |
This is the Rust port of the Golang port of excellent Java library boilerpipe
which cleans up the boilerplate and extracts text content from HTML documents.
This library implements Article Extractor only and text content only (no images, links etc).