lindera-cc-cedict

Crates.iolindera-cc-cedict
lib.rslindera-cc-cedict
version1.2.0
created_at2022-03-07 16:49:04.552746+00
updated_at2025-09-13 10:51:23.479216+00
descriptionA Japanese morphological dictionary for CC-CEDICT.
homepagehttps://github.com/lindera/lindera
repositoryhttps://github.com/lindera/lindera
max_upload_size
id545112
size60,505
Minoru OSUKA (mosuka)

documentation

https://docs.rs/lindera-cc-cedict

README

Lindera CC-CE-DICT

License: MIT Crates.io

Dictionary version

This repository contains CC-CEDICT-MeCab.

Dictionary format

Refer to the manual for details on the unidic-mecab dictionary format and part-of-speech tags.

Index Name (Chinese) Name (English) Notes
0 表面形式 Surface
1 左语境ID Left context ID
2 右语境ID Right context ID
3 成本 Cost
4 词类 Part-of-speech
5 词类1 Part-of-speech subcategory 1
6 词类2 Part-of-speech subcategory 2
7 词类3 Part-of-speech subcategory 3
8 併音 Pinyin
9 繁体字 Traditional
10 簡体字 Simplified
11 定义 Definition

User dictionary format (CSV)

Simple version

Index Name (Japanese) Name (English) Notes
0 表面形式 Surface
1 词类 Part-of-speech
2 併音 Pinyin

Detailed version

Index Name (Japanese) Name (English) Notes
0 表面形式 Surface
1 左语境ID Left context ID
2 右语境ID Right context ID
3 成本 Cost
4 词类 Part-of-speech
5 词类1 Part-of-speech subcategory 1
6 词类2 Part-of-speech subcategory 2
7 词类3 Part-of-speech subcategory 3
8 併音 Pinyin
9 繁体字 Traditional
10 簡体字 Simplified
11 定义 Definition
12 - - After 12, it can be freely expanded.

API reference

The API reference is available. Please see following URL:

Commit count: 621

cargo fmt