Crates.io | botagent |
lib.rs | botagent |
version | 0.1.0 |
source | src |
created_at | 2024-08-27 06:40:09.138038 |
updated_at | 2024-08-27 06:40:09.138038 |
description | A bot user agent detection library using regex patterns. |
homepage | https://github.com/nubilfi/botagent |
repository | https://github.com/nubilfi/botagent |
max_upload_size | |
id | 1352945 |
size | 29,003 |
botagent is a Rust library for detecting bot user agents using regular expressions. It reads patterns from a JSON file, compiles them into a regex, and checks user agents against these patterns.
pcre2
crate for high-performance regex matching.To use this library in your project, add the following to your Cargo.toml
:
[dependencies]
botagent = "0.1"
Or by run cargo add botagent
command.
To check if a given user agent string matches any known bot patterns:
use botagent::is_bot;
fn main() {
let is_bot = is_bot("Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)", "patterns.json").unwrap();
println!("Is bot: {}", is_bot);
}
If you want to know which bot pattern matched the user agent:
use botagent::is_bot_match;
fn main() {
let matched_pattern = is_bot_match("Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)", "patterns.json").unwrap();
if let Some(pattern) = matched_pattern {
println!("Matched bot pattern: {}", pattern);
} else {
println!("No bot pattern matched.");
}
}
The bot patterns are stored in a JSON file, with each pattern being a regular expression string. Here is an example patterns.json
:
[
"(?<! (?:channel/|google/))google(?!(app|/google| pixel))",
"(?<! cu)bots?(?:\\b|_)"
]
Each string in the array is a pattern that will be compiled into a single regular expression to match against user agent strings.
To run the tests, you can use the following command:
cargo test
This will also run the documentation tests to ensure that all code examples in the documentation are correct.
This library follows the original logic and pattern list from isbot and it uses a regular expression that matches bots and only bots. I'll try to ensure it aligns closely with the original logic, but tailored to work seamlessly in a Rust environment, feel free to reach out with any feedback or suggestions!
This library was inspired by and largely based on the work from isbot. The original TypeScript code was rewritten in Rust.