html_sanitizer

Crates.iohtml_sanitizer
lib.rshtml_sanitizer
version0.1.1
sourcesrc
created_at2018-08-27 15:25:30.468659
updated_at2018-08-27 15:30:16.971357
descriptionA customizable HTML sanitizer
homepage
repositoryhttps://github.com/trangar/html_sanitizer
max_upload_size
id81665
size11,901
Trangar (VictorKoenders)

documentation

README

This crate is a simple HTML sanitizer, build on top of html5ever

With this crate, you can determine for every HTML tag what you want to sanitize. This is done by the Tag struct that gets passed for every HTML tag.

use std::fs::File;
use html_sanitizer::TagParser;

fn main() {
    let mut file = File::open("your_html_document.html").unwrap();
    let mut tag_parser = TagParser::new(&mut file);
    let result = tag_parser.walk(|tag| {
        if tag.name == "html" || tag.name == "body" {
            // ignore <html> and <body> tags, but still parse their children
            tag.ignore_self();
        } else if tag.name == "head" || tag.name == "script" || tag.name == "style" {
            // Ignore <head>, <script> and <style> tags, and all their children
            tag.ignore_self_and_contents();
        } else if tag.name == "a" {
            // Allow specific attributes
            tag.allow_attribute(String::from("href"));
        } else if tag.name == "img" {
            // Completely rewrite tags and their children
            tag.rewrite_as(String::from("<b>Images not allowed</b>"));
        } else {
            // Allow specific attributes
            tag.allow_attribute(String::from("style"));
        }
    });
    // result contains a string of your sanitized HTML
}
Commit count: 20

cargo fmt