| Crates.io | html-meta-scraper |
| lib.rs | html-meta-scraper |
| version | 0.2.0 |
| created_at | 2025-04-28 04:10:51.085585+00 |
| updated_at | 2025-08-01 16:42:28.811923+00 |
| description | Scrape and extract metadata like title, description, images, and favicon from HTML documents. |
| homepage | |
| repository | https://github.com/46ki75/html-meta-scraper |
| max_upload_size | |
| id | 1651781 |
| size | 39,159 |
Scrape and extract metadata like title, description, images, and favicon from HTML documents.
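- Extracts `<title>` and OGP metadata (`og:title`, `og:description`, `og:image`)
- Extracts Twitter Card metadata (`twitter:title`, `twitter:description`, `twitter:image`)
- Extracts the favicon (`<link rel="icon" href="...">`)
- Falls back across sources (`og:title` → `twitter:title` → `<title>`)

Add this to your Cargo.toml: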
[dependencies]
html-meta-scraper = "0.2.0"
use html_meta_scraper::MetaScraper;
let html = r#"
<html>
<head>
<meta property="og:title" content="Example Title" />
<meta name="twitter:description" content="Example Description" />
<link rel="icon" href="/favicon.ico" />
</head>
</html>
"#;
let scraper = MetaScraper::new(html);
assert_eq!(scraper.title(), Some("Example Title".to_string()));
assert_eq!(scraper.description(), Some("Example Description".to_string()));
assert_eq!(scraper.favicon(), Some("/favicon.ico".to_string()));
| Method | Description |
|---|---|
| `title()` | Retrieves the page title (`og:title` → `twitter:title` → `<title>`) |
| `description()` | Retrieves the page description (`og:description` → `twitter:description` → `description`) |
| `image()` | Retrieves the page image URL (`og:image` → `twitter:image`) |
| `favicon()` | Retrieves the favicon URL (`<link rel="icon">`) |
| `lang()` | Retrieves the language (`<html lang="en">`) |
| `extract_*` methods | Low-level methods to extract specific metadata |
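
The fallback order used for the title (`og:title` → `twitter:title` → `<title>`) can be pictured as a chain of `Option::or` calls. The sketch below is only an illustration of that ordering, not the crate's actual implementation: the `TitleCandidates` type and its `resolve` method are hypothetical names made up for this example, and it assumes the three values have already been extracted from the document.

```rust
/// Hypothetical holder for values already pulled out of a document.
/// Illustration only; this is not a type from html-meta-scraper.
struct TitleCandidates<'a> {
    og_title: Option<&'a str>,
    twitter_title: Option<&'a str>,
    html_title: Option<&'a str>,
}

impl TitleCandidates<'_> {
    /// Picks the first available value in the documented order:
    /// og:title, then twitter:title, then <title>.
    fn resolve(&self) -> Option<String> {
        self.og_title
            .or(self.twitter_title)
            .or(self.html_title)
            .map(String::from)
    }
}
```

With `og_title` unset and both other fields present, `resolve()` yields the `twitter:title` value. If all three are `None`, it yields `None`, which matches the `Option<String>` return shape shown in the usage example above.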