13 breaking releases

new 0.38.0 Nov 13, 2024
0.36.0 Nov 2, 2024
0.32.2 Jun 29, 2024
0.29.0 Mar 18, 2024
0.27.0 Jul 10, 2023

#1750 in Text processing


6,830 downloads per month
Used in 5 crates (via lindera)

MIT license

130KB
2.5K SLoC

Lindera IPADIC NEologd


Dictionary version

This repository contains mecab-ipadic-neologd.

Dictionary format

Refer to the manual for details on the IPADIC dictionary format and part-of-speech tags.

| Index | Name (Japanese) | Name (English) | Notes |
| --- | --- | --- | --- |
| 0 | 表層形 | Surface | |
| 1 | 左文脈ID | Left context ID | |
| 2 | 右文脈ID | Right context ID | |
| 3 | コスト | Cost | |
| 4 | 品詞 | Part of speech (POS) | |
| 5 | 品詞細分類1 | POS subcategory 1 | |
| 6 | 品詞細分類2 | POS subcategory 2 | |
| 7 | 品詞細分類3 | POS subcategory 3 | |
| 8 | 活用型 | Conjugation type | |
| 9 | 活用形 | Conjugation form | |
| 10 | 原形 | Base form | |
| 11 | 読み | Reading | |
| 12 | 発音 | Pronunciation | |
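
To make the column layout concrete, here is a minimal sketch that splits one dictionary row into the 13 fields listed above. It uses only the Rust standard library and is not part of the Lindera API: `DictEntry` and `parse_entry` are hypothetical names, the sample row's context IDs and cost are illustrative values, and the naive comma split assumes no field contains a quoted comma.

```rust
/// One dictionary row, following the 13-column layout (hypothetical type).
#[derive(Debug, PartialEq)]
struct DictEntry {
    surface: String,
    left_context_id: u16,
    right_context_id: u16,
    cost: i32,
    /// POS plus its three subcategories (columns 4 to 7).
    pos: [String; 4],
    conjugation_type: String,
    conjugation_form: String,
    base_form: String,
    reading: String,
    pronunciation: String,
}

/// Parses a single CSV line. Assumes plain comma separation without quoting.
fn parse_entry(line: &str) -> Result<DictEntry, String> {
    let cols: Vec<&str> = line.trim_end().split(',').collect();
    if cols.len() < 13 {
        return Err(format!("expected 13 columns, found {}", cols.len()));
    }
    let num_err = |name: &str| format!("column '{}' is not a valid integer", name);
    Ok(DictEntry {
        surface: cols[0].to_string(),
        left_context_id: cols[1].parse::<u16>().map_err(|_| num_err("left context ID"))?,
        right_context_id: cols[2].parse::<u16>().map_err(|_| num_err("right context ID"))?,
        cost: cols[3].parse::<i32>().map_err(|_| num_err("cost"))?,
        pos: [
            cols[4].to_string(),
            cols[5].to_string(),
            cols[6].to_string(),
            cols[7].to_string(),
        ],
        conjugation_type: cols[8].to_string(),
        conjugation_form: cols[9].to_string(),
        base_form: cols[10].to_string(),
        reading: cols[11].to_string(),
        pronunciation: cols[12].to_string(),
    })
}

fn main() {
    // Illustrative row; the numeric values are made up for this example.
    let row = "東京,1293,1293,3003,名詞,固有名詞,地域,一般,*,*,東京,トウキョウ,トーキョー";
    let entry = parse_entry(row).expect("row should parse");
    println!("{:#?}", entry);
}
```

A `*` in a column, as in the conjugation fields here, marks a value that does not apply to the entry.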

User dictionary format (CSV)

Simple version

| Index | Name (Japanese) | Name (English) | Notes |
| --- | --- | --- | --- |
| 0 | 表層形 | Surface | |
| 1 | 品詞 | Part of speech (POS) | |
| 2 | 読み | Reading | |
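
As a rough illustration of the simple format, the sketch below reads and writes a three-column row. `SimpleUserEntry`, `parse_simple`, and `to_csv_line` are hypothetical helpers written for this example, not Lindera functions, and the sample entry (including the `カスタム名詞` POS label) is only an illustrative value.

```rust
/// A user dictionary row in the simple (three-column) format (hypothetical type).
#[derive(Debug, PartialEq)]
struct SimpleUserEntry {
    surface: String,
    pos: String,
    reading: String,
}

/// Returns `None` unless the line has exactly three comma-separated columns.
fn parse_simple(line: &str) -> Option<SimpleUserEntry> {
    let cols: Vec<&str> = line.trim().split(',').collect();
    match cols.as_slice() {
        [surface, pos, reading] => Some(SimpleUserEntry {
            surface: surface.to_string(),
            pos: pos.to_string(),
            reading: reading.to_string(),
        }),
        _ => None,
    }
}

/// Serializes an entry back into one CSV line.
fn to_csv_line(entry: &SimpleUserEntry) -> String {
    format!("{},{},{}", entry.surface, entry.pos, entry.reading)
}

fn main() {
    // Illustrative entry: surface, POS, reading.
    let line = "東京スカイツリー,カスタム名詞,トウキョウスカイツリー";
    let entry = parse_simple(line).expect("three columns expected");
    println!("{:?}", entry);
    println!("{}", to_csv_line(&entry));
}
```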

Detailed version

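| Index | Name (Japanese) | Name (English) | Notes |
| --- | --- | --- | --- |
| 0 | 表層形 | Surface | |
| 1 | 左文脈ID | Left context ID | |
| 2 | 右文脈ID | Right context ID | |
| 3 | コスト | Cost | |
| 4 | 品詞 | Part of speech (POS) | |
| 5 | 品詞細分類1 | POS subcategory 1 | |
| 6 | 品詞細分類2 | POS subcategory 2 | |
| 7 | 品詞細分類3 | POS subcategory 3 | |
| 8 | 活用型 | Conjugation type | |
| 9 | 活用形 | Conjugation form | |
| 10 | 原形 | Base form | |
| 11 | 読み | Reading | |
| 12 | 発音 | Pronunciation | |
| 13 | - | - | Columns from index 13 onward can be added freely. |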
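
One way to handle both user dictionary formats in the same file is to branch on the column count. The sketch below assumes that a row with exactly 3 columns is simple and a row with 13 or more is detailed, with anything past index 12 kept as free-form extras. That rule is an assumption made for this example; `UserRow` and `classify` are hypothetical names, not Lindera API, and the sample rows use illustrative values.

```rust
/// A user dictionary row, classified by column count (hypothetical type).
#[derive(Debug, PartialEq)]
enum UserRow<'a> {
    /// Surface, POS, reading.
    Simple { surface: &'a str, pos: &'a str, reading: &'a str },
    /// The 13 standard columns, plus any free-form columns from index 13 on.
    Detailed { fields: Vec<&'a str>, extra: Vec<&'a str> },
}

/// Classifies one CSV line. Assumes plain comma separation without quoting.
fn classify(line: &str) -> Result<UserRow<'_>, String> {
    let cols: Vec<&str> = line.trim().split(',').collect();
    match cols.len() {
        3 => Ok(UserRow::Simple {
            surface: cols[0],
            pos: cols[1],
            reading: cols[2],
        }),
        n if n >= 13 => {
            let (fields, extra) = cols.split_at(13);
            Ok(UserRow::Detailed {
                fields: fields.to_vec(),
                extra: extra.to_vec(),
            })
        }
        n => Err(format!("expected 3 or at least 13 columns, found {}", n)),
    }
}

fn main() {
    // Illustrative rows; IDs, cost, and the trailing note are made up.
    let rows = [
        "東京スカイツリー,カスタム名詞,トウキョウスカイツリー",
        "東京スカイツリー,1288,1288,4000,名詞,固有名詞,一般,*,*,*,東京スカイツリー,トウキョウスカイツリー,トウキョウスカイツリー,note",
    ];
    for row in rows {
        println!("{:?}", classify(row));
    }
}
```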

API reference

The API reference is available. Please see the following URL:

Dependencies

~20–30MB
~611K SLoC