aozora-core

Crates.ioaozora-core
lib.rsaozora-core
version0.7.1
created_at2026-01-03 16:44:57.901113+00
updated_at2026-01-04 16:12:08.637238+00
descriptionCore library for Aozora Bunko format processing
homepage
repositoryhttps://github.com/takahashim/aozora2
max_upload_size
id2020443
size490,263
Masayoshi Takahashi (takahashim)

documentation

README

aozora-core

Core library for processing Aozora Bunko format text.

日本語

Features

  • Tokenizer (lexical analysis)
  • Parser (syntax analysis)
  • Gaiji (JIS external characters) conversion
  • Accent notation conversion
  • Encoding detection and conversion (UTF-8 / Shift_JIS)
  • ZIP file processing

Usage

use aozora_core::tokenizer::tokenize;
use aozora_core::parser::parse;

// Tokenize
let tokens = tokenize("漢字《かんじ》");
assert_eq!(tokens.len(), 2);

// Parse
let nodes = parse(&tokens);

License

MIT

Commit count: 0

cargo fmt