tokenmonster

Crates.iotokenmonster
lib.rstokenmonster
version0.1.0
created_at2025-09-24 20:28:49.579313+00
updated_at2025-09-24 20:28:49.579313+00
descriptionGreedy tiktoken-like tokenizer with embedded vocabulary (cl100k-base approximator)
homepage
repositoryhttps://github.com/martinboros/native-vector-store
max_upload_size
id1853790
size1,274,228
Martin Boros (mboros1)

documentation

README

tokenmonster

Greedy tiktoken-like tokenizer with an embedded vocabulary, intended for fast, allocation-light tokenization.

Features

  • Greedy tokenization compatible with common LLM vocabularies
  • Zero-copy where possible; minimal allocations
  • Optional tiny test vocabulary via the tiny_vocab feature

License: MIT

Commit count: 0

cargo fmt