| Crates.io | oak-html |
| lib.rs | oak-html |
| version | 0.0.1 |
| created_at | 2025-10-21 00:16:12.880546+00 |
| updated_at | 2026-01-23 04:29:18.950615+00 |
| description | HTML markup language parser with support for web content and document structure processing. |
| homepage | https://github.com/ygg-lang/oaks |
| repository | https://github.com/ygg-lang/oaks |
| max_upload_size | |
| id | 1892945 |
| size | 107,750 |
High-performance incremental HTML parser for the oak ecosystem with flexible configuration, optimized for web development and document processing.
Oak-html is a robust parser for HTML, designed to handle complete HTML syntax including modern features. Built on the solid foundation of oak-core, it provides both high-level convenience and detailed AST generation for web development and document processing.
Basic example:
use oak_html::HtmlParser;
fn main() -> Result<(), Box<dyn std::error::Error>> {
let parser = HtmlParser::new();
let html_content = r#"
<!DOCTYPE html>
<html>
<head>
<title>My Page</title>
</head>
<body>
<h1>Hello, HTML!</h1>
<p>This is a paragraph.</p>
</body>
</html>
"#;
let document = parser.parse_document(html_content)?;
println!("Parsed HTML document successfully.");
Ok(())
}
use oak_html::{HtmlParser, ast::Document};
let parser = HtmlParser::new();
let html_content = r#"
<!DOCTYPE html>
<html>
<head><title>Test</title></head>
<body><h1>Hello</h1></body>
</html>
"#;
let document = parser.parse_document(html_content)?;
println!("Document has {} elements", document.elements.len());
use oak_html::{HtmlParser, ast::Element};
let parser = HtmlParser::new();
let html_content = r#"
<div class="container" id="main">
<p>Content</p>
</div>
"#;
let element = parser.parse_element(html_content)?;
println!("Element tag: {}", element.tag_name);
use oak_html::{HtmlParser, lexer::Token};
let parser = HtmlParser::new();
let tokens = parser.tokenize("<div>content</div>")?;
for token in tokens {
println!("{:?}", token.kind);
}
use oak_html::HtmlParser;
let parser = HtmlParser::new();
let invalid_html = r#"
<html>
<head><title>Test</title>
<body><h1>Hello</h1>
<!-- Missing closing tags -->
"#;
match parser.parse_document(invalid_html) {
Ok(document) => println!("Parsed HTML document successfully."),
Err(e) => {
println!("Parse error at line {} column {}: {}",
e.line(), e.column(), e.message());
if let Some(context) = e.context() {
println!("Error context: {}", context);
}
}
}
The parser generates a comprehensive AST with the following main structures:
Oak of html integrates seamlessly with:
Check out the examples directory for comprehensive examples:
Contributions are welcome!
Please feel free to submit pull requests at the project repository or open issues.