Crates.io | mdurl |
lib.rs | mdurl |
version | 0.3.1 |
source | src |
created_at | 2022-09-10 19:40:13.015401 |
updated_at | 2022-09-17 01:07:26.934303 |
description | URL parser and formatter that gracefully handles invalid input. |
homepage | https://github.com/rlidwka/mdurl.rs |
repository | https://github.com/rlidwka/mdurl.rs |
max_upload_size | |
id | 662678 |
size | 91,375 |
URL parser and formatter that gracefully handles invalid input.
It is a rust port of mdurl.js library, created specifically for url rendering in markdown-it parser.
This function takes URL, decodes it, and fits it into N characters, replacing the rest with "…" symbol (that's called "url elision").
This is similar to what Chromium would show you in status bar when you hover your mouse over a link.
use mdurl::format_url_for_humans as format;
let url = "https://www.reddit.com/r/programming/comments/vxttiq/\
comment/ifyqsqt/?utm_source=reddit&utm_medium=web2x&context=3";
assert_eq!(format(url, 20), "reddit.com/…/ifyqsq…");
assert_eq!(format(url, 30), "www.reddit.com/r/…/ifyqsqt/?u…");
assert_eq!(format(url, 50), "www.reddit.com/r/programming/comments/…/ifyqsqt/?…");
Check out this demo to play around with different URLs and lengths.
humanize-url crate tries to achieve similar goals, let me know if there are others.
In order to achieve the task above, a new url parser had to be created, so here it is:
let url = "https://www.reddit.com/r/programming/comments/vxttiq/\
comment/ifyqsqt/?utm_source=reddit&utm_medium=web2x&context=3";
let u = mdurl::parse_url(url);
assert_eq!(u.hostname, Some("www.reddit.com".into()));
assert_eq!(u.pathname, Some("/r/programming/comments/vxttiq/comment/ifyqsqt/".into()));
assert_eq!(u.search, Some("?utm_source=reddit&utm_medium=web2x&context=3".into()));
This function uses a non-standard parsing algorithm derived from node.js legacy URL parser.
You should probably be using rust-url crate instead. Unfortunately, it isn't suitable for the task of pretty-printing urls because you can't customize parts of Url returned by that library (for example, rust-url will always encode non-ascii hostname with punycode, this implementation will not).