Crates.io | spider_chrome |
lib.rs | spider_chrome |
version | 2.13.7 |
source | src |
created_at | 2024-10-02 17:48:25.146602 |
updated_at | 2024-11-09 18:28:42.642804 |
description | Library for interacting with a chrome instance with the chrome devtools protocol for spider |
homepage | https://github.com/spider-rs/spider/spider_chrome |
repository | https://github.com/spider-rs/spider |
max_upload_size | |
id | 1394460 |
size | 427,346 |
chromiumoxide provides a high-level and async API to control Chrome or Chromium over the DevTools Protocol. It comes with support for all types of the Chrome DevTools Protocol and can launch a headless or full (non-headless) Chrome or Chromium instance or connect to an already running instance.
use futures::StreamExt;
use chromiumoxide::browser::{Browser, BrowserConfig};
#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
// create a `Browser` that spawns a `chromium` process running with UI (`with_head()`, headless is default)
// and the handler that drives the websocket etc.
let (mut browser, mut handler) =
Browser::launch(BrowserConfig::builder().with_head().build()?).await?;
// spawn a new task that continuously polls the handler
let handle = tokio::task::spawn(async move {
while let Some(h) = handler.next().await {
if h.is_err() {
break;
}
}
});
// create a new browser page and navigate to the url
let page = browser.new_page("https://en.wikipedia.org").await?;
// find the search bar type into the search field and hit `Enter`,
// this triggers a new navigation to the search result page
page.find_element("input#searchInput")
.await?
.click()
.await?
.type_str("Rust programming language")
.await?
.press_key("Enter")
.await?;
let html = page.wait_for_navigation().await?.content().await?;
browser.close().await?;
let _ = handle.await;
Ok(())
}
The current API still lacks some functionality, but the Page::execute
function allows sending all chromiumoxide_types::Command
types (see Generated Code). Most Element
and Page
functions are basically just simplified command constructions and combinations, like Page::pdf
:
pub async fn pdf(&self, params: PrintToPdfParams) -> Result<Vec<u8>> {
let res = self.execute(params).await?;
Ok(base64::decode(&res.data)?)
}
If you need something else, the Page::execute
function allows for writing your own command wrappers. PRs are very welcome if you think a meaningful command is missing a designated function.
chromiumoxide
comes with support for the async-std
and tokio
runtime.
By default chromiumoxide
is configured with async-std
.
Use chromiumoxide
with the async-std
runtime:
chromiumoxide = { git = "https://github.com/mattsse/chromiumoxide", branch = "main"}
To use the tokio
runtime instead add features = ["tokio-runtime"]
and set default-features = false
to disable the default runtime (async-std
):
chromiumoxide = { git = "https://github.com/mattsse/chromiumoxide", features = ["tokio-runtime"], default-features = false, branch = "main"}
This configuration is made possible primarily by the websocket crate of choice: async-tungstenite
.
The chromiumoxide_pdl
crate contains a PDL parser, which is a rust rewrite of a python script in the chromium source tree and a Generator
that turns the parsed PDL files into rust code. The chromiumoxide_cdp
crate only purpose is to invoke the generator during its build process and include the generated output before compiling the crate itself. This separation is done merely because the generated output is ~60K lines of rust code (not including all the proc macro expansions). So expect the compiling to take some time.
The generator can be configured and used independently, see chromiumoxide_cdp/build.rs.
Every chrome pdl domain is put in its own rust module, the types for the page domain of the browser_protocol are in chromiumoxide_cdp::cdp::browser_protocol::page
, the runtime domain of the js_protocol in chromiumoxide_cdp::cdp::js_protocol::runtime
and so on.
vanilla.aslushnikov.com is a great resource to browse all the types defined in the pdl files. This site displays Command
types as defined in the pdl files as Method
. chromiumoxid
sticks to the Command
nomenclature. So for everything that is defined as a command type in the pdl (=marked as Method
on vanilla.aslushnikov.com) chromiumoxide
contains a type for command and a designated type for the return type. For every command there is a <name of command>Params
type with builder support (<name of command>Params::builder()
) and its corresponding return type: <name of command>Returns
. All commands share an implementation of the chromiumoxide_types::Command
trait.
All Events are bundled in single enum (CdpEvent
)
By default chromiumoxide
will try to find an installed version of chromium on the computer it runs on.
It is possible to download and install one automatically for some platforms using the fetcher
.
Ther features are currently a bit messy due to a Cargo bug and will be changed once it is resolved. Based on your runtime and TLS configuration you should enable one of the following:
_fetcher-rusttls-tokio
_fetcher-native-tokio
use std::path::Path;
use futures::StreamExt;
use chromiumoxide::browser::{BrowserConfig};
use chromiumoxide::fetcher::{BrowserFetcher, BrowserFetcherOptions};
#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
let download_path = Path::new("./download");
tokio::fs::create_dir_all(&download_path).await?;
let fetcher = BrowserFetcher::new(
BrowserFetcherOptions::builder()
.with_path(&download_path)
.build()?,
);
let info = fetcher.fetch().await?;
let config = BrowserConfig::builder()
.chrome_executable(info.executable_path)
.build()?,
}
export CDP_NO_EXPERIMENTAL=true
). This is because the use of some experimental pdl types in the *.pdl
files themselves are not marked as experimental.Q: A new chromium instance is being launched but then times out.
A: Check that your chromium language settings are set to English. chromiumoxide
tries to parse the debugging port from the chromium process output and that is limited to english.
Licensed under either of these:
Apache License, Version 2.0, (LICENSE-APACHE or https://www.apache.org/licenses/LICENSE-2.0)
MIT license (LICENSE-MIT or https://opensource.org/licenses/MIT)
KeyDefinition
and typing support among others is taken from.