Crates.io | booky |
lib.rs | booky |
version | 0.8.0 |
created_at | 2025-04-27 13:07:20.742803+00 |
updated_at | 2025-09-14 17:02:35.843929+00 |
description | A tool to analyze English text |
repository | https://github.com/DougLau/booky/ |
id | 1651099 |
size | 979,908 |
A tool to analyze English text
Installation:
cargo install booky
The `word` sub-command lists words from the built-in lexicon.

- The `-f` option lists all known word forms
- The `-c` option filters words by class. Provide a comma-separated list to specify classes:

| Option | Word Class |
|---|---|
| `A` | Adjective |
| `Av` | Adverb |
| `C` | Conjunction |
| `D` | Determiner |
| `I` | Interjection |
| `N` | Noun |
| `P` | Preposition |
| `Pn` | Pronoun |
| `V` | Verb |
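
To illustrate how a comma-separated class list maps onto the option codes above, here is a minimal Rust sketch. It is not taken from booky's source: the `WordClass` enum and the `parse_classes` function are hypothetical names, and case-sensitive matching with whitespace trimming is an assumption.

```rust
use std::str::FromStr;

/// Word classes from the option table (hypothetical type, not booky's API).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum WordClass {
    Adjective,
    Adverb,
    Conjunction,
    Determiner,
    Interjection,
    Noun,
    Preposition,
    Pronoun,
    Verb,
}

impl FromStr for WordClass {
    type Err = String;

    /// Map one option code to its class (assumed to be case-sensitive).
    fn from_str(s: &str) -> Result<Self, Self::Err> {
        match s {
            "A" => Ok(WordClass::Adjective),
            "Av" => Ok(WordClass::Adverb),
            "C" => Ok(WordClass::Conjunction),
            "D" => Ok(WordClass::Determiner),
            "I" => Ok(WordClass::Interjection),
            "N" => Ok(WordClass::Noun),
            "P" => Ok(WordClass::Preposition),
            "Pn" => Ok(WordClass::Pronoun),
            "V" => Ok(WordClass::Verb),
            other => Err(format!("unknown word class: {}", other)),
        }
    }
}

/// Parse a comma-separated list such as "N,V,Av".
/// The first unknown code aborts parsing with an error.
fn parse_classes(list: &str) -> Result<Vec<WordClass>, String> {
    list.split(',')
        .map(|code| code.trim().parse::<WordClass>())
        .collect()
}
```

Under these assumptions, `parse_classes("N,V,Av")` yields the noun, verb and adverb classes in that order, while a list containing an unrecognized code such as `"N,X"` returns an error.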
The `read` sub-command reads UTF-8 text from `stdin`, which can be redirected
from a file. With no additional options, a summary of token kinds is listed:
> booky read < Dr_Jeckyll_And_Mr_Hyde.txt
3915 l Lexicon
1 f Foreign
4 o Ordinal
0 r Roman
2 n Number
12 a Acronym
37 p Proper
16 s Symbol
7 u Unknown
Comma-separated options can be added to list all tokens of a kind.
| Option | Kind | Description |
|---|---|---|
| `l` | Lexicon | Found in built-in lexicon |
| `f` | Foreign | Foreign words (non-English) |
| `o` | Ordinal | Ordinal numbers (1st, 2nd, etc.) |
| `r` | Roman | Roman numerals (IV, LXI, etc.) |
| `n` | Number | Other words containing numbers |
| `a` | Acronym | Acronyms / initialisms (ALL-CAPS) |
| `p` | Proper | Proper names / nouns |
| `s` | Symbol | Symbols / letters |
| `u` | Unknown | Unknown (no other kind) |
| `A` | All | All kinds |
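
The kind descriptions above suggest a cascade of checks applied to each token. The following Rust sketch shows one way such a classifier could look. It is an illustration only, not booky's actual implementation: `Kind`, `classify`, `is_ordinal` and `is_roman` are hypothetical names, the lexicon is a caller-supplied set, the order of the checks is an assumption, and the Foreign kind is left out because it would need a word list of its own.

```rust
use std::collections::HashSet;

/// Token kinds from the table (Foreign omitted in this sketch).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Kind {
    Lexicon,
    Ordinal,
    Roman,
    Number,
    Acronym,
    Proper,
    Symbol,
    Unknown,
}

/// True for digits followed by "st", "nd", "rd" or "th" (e.g. "1st").
/// The suffix is not checked against the digits, so "1th" also passes.
fn is_ordinal(tok: &str) -> bool {
    let lower = tok.to_ascii_lowercase();
    ["st", "nd", "rd", "th"].iter().any(|suffix| {
        lower.strip_suffix(*suffix).map_or(false, |digits| {
            !digits.is_empty() && digits.chars().all(|c| c.is_ascii_digit())
        })
    })
}

/// True when the token has at least two characters, all of them upper-case
/// Roman numeral letters. Numeral validity (e.g. "IIII") is not checked.
fn is_roman(tok: &str) -> bool {
    tok.len() > 1 && tok.chars().all(|c| "IVXLCDM".contains(c))
}

/// Classify one token; the first matching check wins.
fn classify(tok: &str, lexicon: &HashSet<&str>) -> Kind {
    if tok.is_empty() {
        Kind::Unknown
    } else if lexicon.contains(tok.to_lowercase().as_str()) {
        Kind::Lexicon
    } else if is_ordinal(tok) {
        Kind::Ordinal
    } else if is_roman(tok) {
        Kind::Roman
    } else if tok.chars().any(|c| c.is_ascii_digit()) {
        Kind::Number
    } else if tok.chars().count() == 1 || !tok.chars().any(char::is_alphabetic) {
        Kind::Symbol
    } else if tok.chars().all(char::is_uppercase) {
        Kind::Acronym
    } else if tok.chars().next().map_or(false, char::is_uppercase) {
        Kind::Proper
    } else {
        Kind::Unknown
    }
}
```

Looking up the lower-cased token in the lexicon first keeps capitalized sentence-initial words such as "The" from being counted as proper nouns; a real tool would likely need finer rules than this.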
The `hl` sub-command adds highlighting to a text.