home edit page issue tracker

Morphology

As described in Introduction, the parts-of-speech of Japanese are defined as a mapping from UniDic POS tags, because the UniDic guideline is fully established and widely used in Japanese NLP.

The following table defines a mapping from UniDic SUW POS tags into Universal Dependencies POS tags (this table is not finalized yet; any suggestions are welcome).

UD POSUniDic SUW POS
ADJ形容詞(adjective), 連体詞(adnomial), 形状詞(adjectival noun)
ADV副詞(adverb)
INTJ感動詞(interjection)
NOUN名詞-普通名詞(common noun), 接頭辞(prefix), 接尾辞(suffix)
PROPN名詞-固有名詞(proper noun)
VERB動詞(verb)
ADP助詞-格助詞(case particle), 助詞-係助詞(binding particle)
AUX助動詞(auxiliary verb)
CONJ接続詞(conjunction), 助詞-格助詞(case particle)
DET連体詞(adnomial)
NUM名詞-数詞(numeral noun)
PART助詞-副助詞(adverbial particle), 助詞-終助詞(phrase final particle)
PRON代名詞(pronoun)
SCONJ助詞-接続助詞(conjunctive particle), 助詞-準体助詞(nominal particle)
PUNCT補助記号(supplementary symbol)
SYM記号(symbol), 補助記号(supplementary symbol)
X空白(white space)

Several UniDic POS tags are mapped into different UD POS tags depending on additional information like lemmas, and/or syntactic context.

Currently, we do not use features in Japanese.