
DET: determiner

Definition

Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context. That is, a determiner may indicate whether the noun is referring to a definite or indefinite element of a class, to a closer or more distant element, to an element belonging to a specified person or thing, to a particular number or quantity, etc.

Note that traditional Russian grammar does not define determiners as a separate word class, and Russian has no articles. Most determiners are traditionally classified as pronouns, so a UD-conformant annotation of Russian must distinguish between substantive pronouns (UD tag PRON) and attributive pronouns (UD tag DET).
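As a rough illustration of the PRON/DET split, the sketch below reads a small CoNLL-U fragment and groups word forms by their UPOS tag. The sentence and its annotation are hand-written for this example and are not taken from either treebank; `forms_by_upos` is a hypothetical helper name.

```python
from collections import defaultdict

# Hand-written CoNLL-U fragment (illustrative only, not treebank data).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# text = Этот дом принадлежит ему.",
    "1\tЭтот\tэтот\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tдом\tдом\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tпринадлежит\tпринадлежать\tVERB\t_\t_\t0\troot\t_\t_",
    "4\tему\tон\tPRON\t_\t_\t3\tiobj\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])


def forms_by_upos(conllu_text):
    """Group word forms by UPOS tag (hypothetical helper)."""
    groups = defaultdict(list)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and malformed lines
        groups[cols[3]].append(cols[1])
    return dict(groups)


groups = forms_by_upos(SAMPLE)
print(groups["DET"])   # attributive use: "Этот" modifies the noun "дом"
print(groups["PRON"])  # substantive use: "ему" stands in for a noun phrase
```

In this invented sentence, "Этот" modifies a noun and is tagged DET, while "ему" fills an argument slot on its own and is tagged PRON.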

Examples


Treebank Statistics (UD_Russian)

There is 1 DET lemma (7%), 96 DET types (0%) and 1673 DET tokens (2%). Out of 15 observed tags, the rank of DET is: 5 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: это, который, этого, того, том, которые, то, все, этом, тем

The 10 most frequent ambiguous lemmas: _ (NOUN 26660, PUNCT 18807, ADJ 12528, ADP 10735, VERB 9436, PROPN 7604, CONJ 3168, ADV 2142, NUM 1900, PRON 1763, X 1700, DET 1673, SCONJ 624, PART 491, SYM 158)

The 10 most frequent ambiguous types: это (DET 102, PRON 1), который (DET 102, PRON 1), этого (DET 78, PRON 1), том (DET 77, NOUN 2), то (DET 63, X 14, ADV 11, CONJ 7, SCONJ 2, ADP 2), все (DET 51, X 2, ADV 1), тем (DET 42, ADV 1), несколько (DET 35, ADV 11), всего (DET 25, ADV 5, X 3), всё (DET 13, ADV 2, X 2, NOUN 1)

Morphology

The form / lemma ratio of DET is 96.000000 (the average of all parts of speech is 2046.733333).
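The ratio appears to be the number of distinct DET types divided by the number of distinct DET lemmas. The short check below uses the counts reported on this page to confirm that reading; `form_lemma_ratio` is a hypothetical helper, not part of any UD tool.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Distinct word forms (types) per distinct lemma."""
    return n_types / n_lemmas


# Counts as reported on this page for the two treebanks.
ud_russian = form_lemma_ratio(96, 1)    # UD_Russian: 96 types, 1 lemma
syntagrus = form_lemma_ratio(234, 24)   # UD_Russian-SynTagRus: 234 types, 24 lemmas

print(f"{ud_russian:.6f}")  # 96.000000
print(f"{syntagrus:.6f}")   # 9.750000
```

Because the only DET lemma observed in UD_Russian is the placeholder "_", the ratio here simply equals the number of DET types and probably says little about the actual inflectional richness of Russian determiners.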

The 1st highest number of forms (96) was observed with the lemma “_”: ., All, It, a, alle, der, ein, la, the, Такое, более, в, весь, все, всего, всей, всем, всеми, всему, всех, всея, всю, вся, всё, всём, какая, какие, каких, какого, какое, какой, каком, какому, которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых, менее, много, некоторое, некоторыми, некоторых, немало, немного, несколькими, нескольких, несколько, сей, сих, сколько, столько, т., та, такие, такими, такого, те, тем, теми, тех, то, того, тое, той, том, тому, тот, ту, чего, чей, чем, чему, что, чьи, чьим, чьё, эта, эти, этим, этими, этих, это, этого, этой, этом, этому, этот, эту.

DET occurs with 5 features: ru-feat/Case (1667; 100% instances), ru-feat/Number (1667; 100% instances), ru-feat/Animacy (1666; 100% instances), ru-feat/Gender (1243; 74% instances), ru-feat/Person (5; 0% instances)

DET occurs with 15 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=3

DET occurs with 49 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing (171 tokens). Examples: это, то, что, все, которое, всё, т., the, Такое
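A feature combination such as the one above is a `|`-separated list of `Name=Value` pairs, which is how the CoNLL-U FEATS column is written. A minimal sketch of splitting it into its parts, with `parse_feats` as a hypothetical helper name:

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}  # "_" marks an empty feature set
    return dict(pair.split("=", 1) for pair in feats.split("|"))


feats = parse_feats("Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing")
print(feats["Case"])    # Nom
print(len(feats))       # 4
print(parse_feats("_"))  # {}
```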

Relations

DET nodes are attached to their parents using 20 different relations: ru-dep/det (736; 44% instances), ru-dep/nmod (406; 24% instances), ru-dep/nsubj (273; 16% instances), ru-dep/dobj (76; 5% instances), ru-dep/iobj (48; 3% instances), ru-dep/advmod (33; 2% instances), ru-dep/expl (28; 2% instances), ru-dep/amod (13; 1% instances), ru-dep/mark (11; 1% instances), ru-dep/nsubjpass (10; 1% instances), ru-dep/case (9; 1% instances), ru-dep/mwe (8; 0% instances), ru-dep/discourse (7; 0% instances), ru-dep/conj (6; 0% instances), ru-dep/root (3; 0% instances), ru-dep/remnant (2; 0% instances), ru-dep/dep (1; 0% instances), ru-dep/list (1; 0% instances), ru-dep/nummod (1; 0% instances), ru-dep/punct (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (858; 51% instances), VERB (632; 38% instances), ADJ (67; 4% instances), PROPN (38; 2% instances), ADV (28; 2% instances), ADP (16; 1% instances), NUM (12; 1% instances), PRON (10; 1% instances), DET (7; 0% instances), ROOT (3; 0% instances), SYM (2; 0% instances)

1160 (69%) DET nodes are leaves.

390 (23%) DET nodes have one child.

89 (5%) DET nodes have two children.

34 (2%) DET nodes have three or more children.

The highest child degree of a DET node is 11.
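The four child-degree counts above add up to the 1673 DET tokens, and the percentages follow from them by rounding to the nearest whole number. A small check, assuming simple rounding is how the page derives its percentages:

```python
# Child-degree counts for DET nodes, as reported above.
counts = {
    "leaf": 1160,
    "one child": 390,
    "two children": 89,
    "three or more children": 34,
}

total = sum(counts.values())
shares = {label: round(100 * n / total) for label, n in counts.items()}

print(total)  # 1673
for label, pct in shares.items():
    print(f"{label}: {pct}%")
```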

Children of DET nodes are attached using 20 different relations: ru-dep/case (328; 47% instances), ru-dep/discourse (82; 12% instances), ru-dep/punct (65; 9% instances), ru-dep/mwe (57; 8% instances), ru-dep/acl:relcl (47; 7% instances), ru-dep/goeswith (33; 5% instances), ru-dep/ccomp (18; 3% instances), ru-dep/advmod (11; 2% instances), ru-dep/amod (9; 1% instances), ru-dep/cc (7; 1% instances), ru-dep/conj (7; 1% instances), ru-dep/advcl (5; 1% instances), ru-dep/det (5; 1% instances), ru-dep/neg (5; 1% instances), ru-dep/nmod (5; 1% instances), ru-dep/nsubj (3; 0% instances), ru-dep/cc:preconj (2; 0% instances), ru-dep/remnant (2; 0% instances), ru-dep/acl (1; 0% instances), ru-dep/parataxis (1; 0% instances)

Children of DET nodes belong to 11 different parts of speech: ADP (334; 48% instances), X (98; 14% instances), PUNCT (81; 12% instances), VERB (72; 10% instances), ADV (28; 4% instances), NOUN (24; 3% instances), PART (24; 3% instances), ADJ (16; 2% instances), CONJ (8; 1% instances), DET (7; 1% instances), PROPN (1; 0% instances)


Treebank Statistics (UD_Russian-SynTagRus)

There are 24 DET lemmas (0%), 234 DET types (0%) and 21227 DET tokens (2%). Out of 15 observed tags, the rank of DET is: 10 in number of lemmas, 6 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: ЭТОТ, СВОЙ, ВЕСЬ, ТАКОЙ, НАШ, ТОТ, ЕГО, ИХ, ЕЕ, МОЙ

The 10 most frequent DET types: его, все, их, эти, этот, этой, ее, этого, этом, всех

The 10 most frequent ambiguous lemmas: ЭТОТ (DET 4950, ADJ 249, NOUN 6), СВОЙ (DET 2955, ADJ 112), ВЕСЬ (DET 2596, ADJ 394), ТАКОЙ (DET 1783, ADJ 705), НАШ (DET 1463, ADJ 81), ТОТ (DET 1462, ADJ 692, NOUN 529), ЕГО (DET 1396, ADJ 81), ИХ (DET 853, ADJ 35), ЕЕ (DET 562, ADJ 35), МОЙ (DET 531, ADJ 47)

The 10 most frequent ambiguous types: его (PRON 1463, DET 1314, ADJ 81), все (DET 904, NOUN 850, PART 333, ADJ 213), их (PRON 1171, DET 798, ADJ 35), эти (DET 596, ADJ 61), этот (DET 582, ADJ 36), этой (DET 689, ADJ 13), ее (PRON 774, DET 521, ADJ 35), этого (NOUN 511, DET 464, ADJ 15), этом (NOUN 761, DET 461, ADJ 11), всех (DET 446, NOUN 143, ADJ 56)

Morphology

The form / lemma ratio of DET is 9.750000 (the average of all parts of speech is 2.787274).

The 1st highest number of forms (15) was observed with the lemma “СВОЙ”: свое, своего, своей, своем, своему, своею, свои, своим, своими, своих, свой, свою, своя, своё, своём.

The 2nd highest number of forms (14) was observed with the lemma “МОЙ”: мое, моего, моей, моем, моему, моею, мои, моим, моими, моих, мой, мою, моя, моём.

The 3rd highest number of forms (13) was observed with the lemma “ВЕСЬ”: весь, все, всего, всей, всем, всеми, всему, всех, всея, всю, вся, всё, всём.

DET occurs with 3 features: ru-feat/Case (18416; 87% instances), ru-feat/Number (18416; 87% instances), ru-feat/Gender (12449; 59% instances)

DET occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

DET occurs with 25 feature combinations. The most frequent feature combination is _ (2811 tokens). Examples: его, их, ее, её

Relations

DET nodes are attached to their parents using 1 relation: ru-dep/det (21227; 100% instances)

Parents of DET nodes belong to 4 different parts of speech: NOUN (21146; 100% instances), PRON (76; 0% instances), SYM (4; 0% instances), PART (1; 0% instances)

20034 (94%) DET nodes are leaves.

1112 (5%) DET nodes have one child.

79 (0%) DET nodes have two children.

2 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.
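The child degree of a node is the number of tokens that name it as their head. The sketch below counts this from a list of HEAD values; the five-token tree is invented for illustration and `child_degrees` is a hypothetical helper.

```python
from collections import Counter


def child_degrees(heads):
    """Map each 1-based token id to its number of children.

    heads[i] is the head of token i + 1; 0 marks the root.
    """
    counts = Counter(h for h in heads if h != 0)
    return {tok: counts.get(tok, 0) for tok in range(1, len(heads) + 1)}


# Invented tree: token 3 is the root; tokens 2, 4 and 5 attach to it,
# and token 1 attaches to token 2.
heads = [2, 3, 0, 3, 3]
degrees = child_degrees(heads)

print(degrees)                # {1: 0, 2: 1, 3: 3, 4: 0, 5: 0}
print(max(degrees.values()))  # 3
```

Restricting the result to tokens tagged DET and bucketing the values (0, 1, 2, 3 or more) would give a distribution in the same shape as the one reported above.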

Children of DET nodes are attached using 4 different relations: ru-dep/advmod (897; 70% instances), ru-dep/punct (292; 23% instances), ru-dep/neg (86; 7% instances), ru-dep/aux (1; 0% instances)

Children of DET nodes belong to 2 different parts of speech: PART (984; 77% instances), PUNCT (292; 23% instances)


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]