llamap

Crates.iollamap
lib.rsllamap
version0.1.0
created_at2025-11-04 15:38:37.147879+00
updated_at2025-11-04 15:38:37.147879+00
descriptionCLI tool to build llms.txt from sitemap.xml
homepage
repositoryhttps://github.com/mlavrinenko/llamap
max_upload_size
id1916414
size165,476
Maxim Lavrinenko (mlavrinenko)

documentation

README

llamap

A CLI tool to build llms.txt from sitemap.xml.

Workflow and Examples

  1. Scrape sitemap.xml URL and save their HTML to a local SQLite database.
# Scrape or re-scrape a website using sitemap and save pages to a database
llamap scrape https://www.sitemaps.org/sitemap.xml sitemaps.org.sqlite
  1. Parse text content and title from web pages using multiple extraction methods.
# Parse or re-parse HTMLs of all stored database pages using dom_smoothie (default)
llamap parse sitemaps.org.sqlite --text-by dom_smoothie
# Re-parse a specific page using fast_html2md
llamap parse sitemaps.org.sqlite --target https://www.sitemaps.org/faq.html --text-by fast_html2md
  1. Summarize scraped content using different LLM providers and customizable prompt.
# Summarize unsummarized pages using an LLM model
llamap summarize sitemaps.org.sqlite ollama://8b@qwen3
# Summarize all pages (including those already summarized)
llamap summarize sitemaps.org.sqlite ollama://8b@qwen3 --target all
# Summarize a specific page with a custom prompt template
llamap summarize sitemaps.org.sqlite ollama://8b@qwen3 --target=https://www.sitemaps.org/faq.html --prompt-file /path/to/prompt.txt
  1. Compose the final llms.txt file from database summaries.
# Compose the final llms.txt file
llamap compose sitemaps.org.sqlite sitemaps.org.llms.txt
#
llamap scrape -vvv https://www.sitemaps.org/sitemap.xml sitemaps.org.sqlite

Also, at each step you can configure verbosity using multiple -v (0=error, 1=warn, 2=info, 3=debug, 4=trace).

References

Ideas / TODOs

  • Implement migrations system when needed
  • Add llm_readability parser support
Commit count: 0

cargo fmt