| Crates.io | spider_transformations |
| lib.rs | spider_transformations |
| version | 2.37.109 |
| created_at | 2024-09-21 11:37:35.50831+00 |
| updated_at | 2025-07-08 18:54:26.262879+00 |
| description | Transformation utils to use for spider |
| homepage | https://github.com/spider-rs/spider_transformations |
| repository | https://github.com/spider-rs/spider_transformations |
| max_upload_size | |
| id | 1382144 |
| size | 297,066 |
A high-performance transformation library for Rust, used by Spider Cloud for AI-powered content cleaning across multiple locales.
This project depends on the spider crate.
[dependencies]
spider_transformations = "2"
use spider_transformations::transformation::content;
fn main() {
// page comes from the spider object when streaming.
let mut conf = content::TransformConfig::default();
conf.return_format = content::ReturnFormat::Markdown;
let content = content::transform_content(&page, &conf, &None, &None);
}
There are several chunking utils in the transformation mod.
This project has rewrites and forks of html2md, and html2text for performance and bug fixes.
MIT