home kk/pos edit page issue tracker

NOUN: noun

Nouns inflect for case, number and possession. Nouns receive nominal morphology. Other parts of speech may be derived into nouns, such as adjectives.

Proper nouns are not annotated as NOUN but rather PROPN.

Examples


Treebank Statistics (UD_Kazakh)

There are 694 NOUN lemmas (42%), 1075 NOUN types (45%) and 1564 NOUN tokens (31%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: ел, жыл, ғасыр, ж., бала, орыс, адам, мемлекет, жер, тіл

The 10 most frequent NOUN types: _, ж., орыс, ел, әулеті, ғасырдың, елде, парсы, тілдерін, ғасырда

The 10 most frequent ambiguous lemmas: бас (NOUN 14, VERB 8), қала (NOUN 14, VERB 1), мал (NOUN 9, VERB 1), ана (NOUN 5, DET 1), ет (NOUN 4, VERB 3), іш (NOUN 4, VERB 2), қос (NOUN 4, VERB 3, ADJ 1), азаматтық (NOUN 3, ADJ 1), бай (NOUN 3, ADJ 2), млн. (NOUN 3, NUM 2)

The 10 most frequent ambiguous types: _ (VERB 134, PART 74, NOUN 63, ADJ 60, PRON 14, CONJ 13, AUX 9, ADP 6, PROPN 4, ADV 4, NUM 3, PUNCT 1), жылы (NOUN 5, ADJ 1), млн. (NOUN 3, NUM 2), Батыс (NOUN 2, ADJ 2), КСРО (NOUN 2, PROPN 2), Темір (NOUN 1, PROPN 1), бай (ADJ 2, NOUN 1), етті (VERB 2, NOUN 1), оқу (VERB 2, NOUN 1), салжұқтар (PROPN 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.548991 (the average of all parts of speech is 1.458688).

The 1st highest number of forms (17) was observed with the lemma “бала”: _, Балаларды, Балалардың, бала, балалар, балалардан, балалармен, балаларына, балаларынан, балама, баламды, баласы, баласын, баласына, балаға, балаң, балаңа.

The 2nd highest number of forms (13) was observed with the lemma “ел”: _, ел, елде, елдегі, елдер, елдерден, елдерді, елдері, елдерімен, елді, елдің, елі, елінің.

The 3rd highest number of forms (10) was observed with the lemma “бас”: _, бас, бастарын, басы, басыма, басын, басына, басынан, басында, басың.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 19 different relations: kk-dep/nmod (423; 27% instances), kk-dep/nsubj (320; 20% instances), kk-dep/nmod:poss (293; 19% instances), kk-dep/dobj (218; 14% instances), kk-dep/conj (112; 7% instances), kk-dep/root (48; 3% instances), kk-dep/compound (44; 3% instances), kk-dep/appos (23; 1% instances), kk-dep/remnant (16; 1% instances), kk-dep/amod (13; 1% instances), kk-dep/iobj (12; 1% instances), kk-dep/name (9; 1% instances), kk-dep/parataxis (9; 1% instances), kk-dep/advcl (8; 1% instances), kk-dep/nummod (6; 0% instances), kk-dep/ccomp (5; 0% instances), kk-dep/acl (2; 0% instances), kk-dep/advmod (2; 0% instances), kk-dep/vocative (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (835; 53% instances), NOUN (556; 36% instances), ADJ (70; 4% instances), ROOT (48; 3% instances), PROPN (28; 2% instances), NUM (10; 1% instances), PRON (9; 1% instances), ADV (5; 0% instances), AUX (1; 0% instances), CONJ (1; 0% instances), PUNCT (1; 0% instances)

611 (39%) NOUN nodes are leaves.

588 (38%) NOUN nodes have one child.

208 (13%) NOUN nodes have two children.

157 (10%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 19.

Children of NOUN nodes are attached using 25 different relations: kk-dep/nmod:poss (392; 23% instances), kk-dep/amod (295; 18% instances), kk-dep/punct (206; 12% instances), kk-dep/conj (116; 7% instances), kk-dep/det (92; 5% instances), kk-dep/acl (85; 5% instances), kk-dep/cop (70; 4% instances), kk-dep/nmod (59; 4% instances), kk-dep/cc (57; 3% instances), kk-dep/nsubj (54; 3% instances), kk-dep/case (52; 3% instances), kk-dep/compound (49; 3% instances), kk-dep/nummod (44; 3% instances), kk-dep/appos (30; 2% instances), kk-dep/advmod (22; 1% instances), kk-dep/remnant (16; 1% instances), kk-dep/advcl (10; 1% instances), kk-dep/parataxis (9; 1% instances), kk-dep/name (5; 0% instances), kk-dep/discourse (3; 0% instances), kk-dep/aux (2; 0% instances), kk-dep/iobj (2; 0% instances), kk-dep/ccomp (1; 0% instances), kk-dep/csubj (1; 0% instances), kk-dep/dobj (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (556; 33% instances), PUNCT (206; 12% instances), ADJ (203; 12% instances), VERB (162; 10% instances), NUM (145; 9% instances), PROPN (140; 8% instances), DET (91; 5% instances), CONJ (57; 3% instances), ADP (52; 3% instances), PRON (28; 2% instances), PART (16; 1% instances), ADV (14; 1% instances), AUX (2; 0% instances), INTJ (1; 0% instances)


NOUN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]