lindera-py

Python binding for Lindera.

Repository

https://github.com/lindera-morphology/lindera-py
Author

Minoru Osuka (mosuka)

Documentation

https://docs.rs/lindera-py

README

lindera-py

Python binding for Lindera, a Japanese morphological analysis engine.

Install project dependencies

Install Python

# Install Python
% pyenv install 3.12.3

Setup repository and activate virtual environment

# Clone lindera-py project repository
% git clone git@github.com:lindera/lindera-py.git
% cd lindera-py

# Set Python version for this project
% pyenv local 3.12.3

# Make Python virtual environment
% python -m venv .venv

# Activate Python virtual environment
% source .venv/bin/activate

# Initialize lindera-py project
(.venv) % make init
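If the Makefile is unavailable, the initialization step can be approximated by hand. The sketch below assumes that make init at least installs the maturin build tool; the exact targets are defined in the project's Makefile.

# Install the maturin build tool (assumption: make init does at least this)
(.venv) % pip install maturin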

Install lindera-py as a library in the virtual environment

This command takes a long time because it builds the library together with all of the bundled dictionaries.

(.venv) % make maturin-develop
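To invoke maturin directly instead of going through make, a rough equivalent is shown below. This assumes the make maturin-develop target simply wraps maturin; check the Makefile for the exact flags it passes.

# Build lindera-py and install it into the active virtual environment
# (assumption: this approximates what make maturin-develop runs)
(.venv) % maturin develop --release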

Example code

from lindera_py import Segmenter, Tokenizer, load_dictionary


def main():
    # load the dictionary
    dictionary = load_dictionary("ipadic")

    # create a segmenter
    segmenter = Segmenter("normal", dictionary)

    # create a tokenizer
    tokenizer = Tokenizer(segmenter)

    text = "関西国際空港限定トートバッグを東京スカイツリーの最寄り駅であるとうきょうスカイツリー駅で買う"
    print(f"text: {text}\n")

    # tokenize the text
    tokens = tokenizer.tokenize(text)

    for token in tokens:
        print(token.text)


if __name__ == "__main__":
    main()
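
Building on the example above, the tokens can also be joined back into wakati-gaki (space-separated) output. This sketch uses only the token.text attribute demonstrated in the example; richer per-token attributes should be checked against the lindera-py documentation.

def print_wakati(tokenizer, text):
    # Join each token's surface form with spaces (wakati-gaki style)
    tokens = tokenizer.tokenize(text)
    print(" ".join(token.text for token in tokens))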