[debian-devel:18459] Bug#697421: ITP: unidic-mecab -- free Japanese Dictionaries for mecab
Package: wnpp
Severity: wishlist
Owner: Hideki Yamane <henrich@debian.org>
X-Debbugs-CC: debian-devel@lists.debian.org, debian-devel@lists.debian.or.jp
Package name: unidic-mecab
Version: 2.1.1
Upstream Author: The UniDic Consortium
URL: http://sourceforge.jp/projects/unidic/
License: BSD-3-clause
LGPL-2.1
GPL-2
Description: free Japanese dictionaries for MeCab
unidic-mecab is a dictionary for MeCab, a Japanese morphological
analyzer.
.
* All entries are based on the definition of "SUW (short-unit word)"
specified by NINJAL (The National Institute for Japanese Language and
Linguistics), which provides word segmentation into units of uniform size
suited to linguistic research.
* It has a three-layered structure:
- lemma
- form
- spelling
This allows a clear distinction between two types of word variant:
spelling variants and form variants.
* It is useful for speech processing research, since accent and
sound-change information can be added to it.
Note: please fix the description above if it reads oddly.
If anyone is familiar with the details, please revise the description as appropriate.
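
For readers unfamiliar with the lemma / form / spelling layering mentioned in the description, here is a minimal Python sketch of how the two kinds of variant could be told apart. It is only an illustration: the class names, the classify_variant helper, and the sample entry are assumptions made up for this example and are not taken from the UniDic data files or from any MeCab API.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Form:
    """Middle layer: one pronunciation-level form of a lemma (hypothetical model)."""
    reading: str
    spellings: List[str] = field(default_factory=list)  # bottom layer


@dataclass
class Lemma:
    """Top layer: the dictionary headword (hypothetical model)."""
    headword: str
    forms: List[Form] = field(default_factory=list)


def classify_variant(lemma: Lemma, a: str, b: str) -> Optional[str]:
    """Classify two distinct surface strings that belong to the same lemma.

    Returns "spelling variant" if both sit under the same form,
    "form variant" if they sit under different forms, and None if
    either string is unknown or the two strings are identical.
    """
    def form_of(surface: str) -> Optional[Form]:
        for form in lemma.forms:
            if surface in form.spellings:
                return form
        return None

    form_a, form_b = form_of(a), form_of(b)
    if a == b or form_a is None or form_b is None:
        return None
    return "spelling variant" if form_a is form_b else "form variant"


# Illustrative entry only; not copied from the dictionary.
EXAMPLE = Lemma(
    headword="矢張り",
    forms=[
        Form(reading="ヤハリ", spellings=["やはり", "矢張り"]),
        Form(reading="ヤッパリ", spellings=["やっぱり"]),
    ],
)

print(classify_variant(EXAMPLE, "やはり", "矢張り"))    # spelling variant
print(classify_variant(EXAMPLE, "やはり", "やっぱり"))  # form variant
```

In this sketch, two strings under the same form differ only in how they are written, while strings under different forms of one lemma differ in pronunciation as well.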
--
Regards,
Hideki Yamane henrich @ debian.or.jp/org
http://wiki.debian.org/HidekiYamane