Crates.io | no_browser |
lib.rs | no_browser |
version | 0.3.0 |
source | src |
created_at | 2023-05-08 14:18:58.459572 |
updated_at | 2023-06-19 19:06:18.929285 |
description | a light-weight, head-less 'web browser' based on reqwest |
homepage | |
repository | https://github.com/lbarnkow/no_browser |
max_upload_size | |
id | 859845 |
size | 70,381 |
The no_browser
crate strives to provide a high-level API for programmatically interacting with web pages through
a light-weight, head-less "web browser". This is not a real web browser, like Firefox or Chrome.
no_browser
builds on top of reqwest handling http requests, http redirects and
cookie management, as well as, scraper parsing html code. This crate uses
CSS selectors to freely access any element of a
given web page. It also provides its own abstraction to fill out and submit web forms.
However, no_browser
has no support for client-side JavaScript or any concept of actually rendering web pages. So if
you want to test modern JavaScript-driven web apps, opt for a crate driving a real web browser like
fantoccini or thirtyfour, instead.
let browser = Browser::builder().finish()?;
// Lets go to the Wikipedia main page
let mut page = browser.navigate_to("https://en.wikipedia.org/", None)?;
// the title tag should be "Wikipedia, the free encyclopedia"
assert_eq!(
page.select_first("head > title")?.inner_html(),
"Wikipedia, the free encyclopedia"
);
// the main page should welcome us
assert!(page
.select_first("h1 > span.mw-headline")?
.inner_html()
.starts_with("Welcome to"));
// fill out the search form ...
let search_form = page.form_by_id_mut("searchform")?;
search_form
.input_mut(InputType::Search, "search")?
.set_value(Some("rust programming language".to_owned()));
// ... and submit
let page = browser.submit_form(search_form, None)?;
// the new title tag should be "Rust (programming language) - Wikipedia"
assert_eq!(
page.select_first("head > title")?.inner_html(),
"Rust (programming language) - Wikipedia"
);
// The main title on the html page should be quite similar
assert_eq!(
page.select_first("span.mw-page-title-main")?.inner_html(),
"Rust (programming language)"
);
// The main title should have a subtitle showing we've been redirected
assert!(page
.select_first("div#contentSub span.mw-redirectedfrom")?
.inner_html()
.starts_with("(Redirected from"));
// The table of contents should have more than 10 entries
assert!(page
.select("div#vector-toc ul#mw-panel-toc-list li.vector-toc-list-item")?
.len() > 10);
Licensed under either of
SPDX-License-Identifier: MIT OR Apache-2.0
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.