home ru/feat edit page issue tracker

Gender: gender

Gender is a lexical feature of nouns and inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. There are three values of gender: masculine, feminine, and neuter.

See also the related feature of Animacy.

Masc: masculine gender

Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.

Examples

Fem: feminine gender

Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.

Examples

Neut: neuter gender

This third gender is for nouns that are neither masculine nor feminine (grammatically). Nouns whose nominative suffix is -о  or -е  (including a large group of deverbative nouns denoting actions) are usually neuter.

Examples


Treebank Statistics (UD_Russian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

44287 tokens (45%) have a non-empty value of Gender. 21516 types (72%) occur at least once with a non-empty value of Gender. 1 lemmas (0) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: ru-pos/NOUN (19921; 20% instances), ru-pos/ADJ (9645; 10% instances), ru-pos/PROPN (7367; 7% instances), ru-pos/VERB (4485; 5% instances), ru-pos/DET (1243; 1% instances), ru-pos/PRON (1018; 1% instances), ru-pos/NUM (595; 1% instances), ru-pos/X (10; 0% instances), ru-pos/ADV (2; 0% instances), ru-pos/SCONJ (1; 0% instances).

NOUN

19921 ru-pos/NOUN tokens (75% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (19909; 100%), Animacy=Inan (17838; 90%).

NOUN tokens may have the following values of Gender:

ADJ

9645 ru-pos/ADJ tokens (77% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9640; 100%), Animacy=Inan (8765; 91%).

ADJ tokens may have the following values of Gender:

PROPN

7367 ru-pos/PROPN tokens (97% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7363; 100%), Animacy=Inan (4032; 55%).

PROPN tokens may have the following values of Gender:

VERB

4485 ru-pos/VERB tokens (48% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (4484; 100%), Person=EMPTY (4469; 100%), Tense=Past (4225; 94%), Voice=EMPTY (3035; 68%), Case=EMPTY (3033; 68%), Animacy=EMPTY (3032; 68%), Mood=Ind (3031; 68%), Aspect=Perf (2654; 59%).

VERB tokens may have the following values of Gender:

DET

1243 ru-pos/DET tokens (74% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1242; 100%), Animacy=Inan (1140; 92%).

DET tokens may have the following values of Gender:

PRON

1018 ru-pos/PRON tokens (58% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1018; 100%), Reflex=EMPTY (1016; 100%), Person=3 (930; 91%).

PRON tokens may have the following values of Gender:

NUM

595 ru-pos/NUM tokens (31% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Animacy=Inan (483; 81%), Number=Sing (303; 51%).

NUM tokens may have the following values of Gender:

X

10 ru-pos/X tokens (1% of all X tokens) have a non-empty value of Gender.

X tokens may have the following values of Gender:

ADV

2 ru-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

SCONJ

1 ru-pos/SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (7406; 98%), PROPN –[name]–> PROPN (981; 99%), NOUN –[appos]–> PROPN (781; 60%), NOUN –[acl]–> VERB (527; 86%), VERB –[nsubj]–> PROPN (457; 70%), NOUN –[det]–> DET (440; 97%), PROPN –[conj]–> PROPN (411; 72%), VERB –[auxpass]–> VERB (403; 95%), VERB –[nsubjpass]–> NOUN (385; 93%), PROPN –[nmod]–> NOUN (383; 81%).


Treebank Statistics (UD_Russian-SynTagRus)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

426230 tokens (41%) have a non-empty value of Gender. 86665 types (79%) occur at least once with a non-empty value of Gender. 32791 lemmas (82%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: ru-pos/NOUN (297781; 29% instances), ru-pos/ADJ (75474; 7% instances), ru-pos/VERB (35570; 3% instances), ru-pos/DET (12449; 1% instances), ru-pos/AUX (4022; 0% instances), ru-pos/NUM (934; 0% instances).

NOUN

297781 ru-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (245803; 83%), Number=Sing (220773; 74%).

NOUN tokens may have the following values of Gender:

Paradigm ТОMascFemNeut
Case=Accто
Case=Datтому
Case=Genтоготого
Case=Insтемтем
Case=Locтом
Case=Nomто

Gender seems to be lexical feature of NOUN. 99% lemmas (21689) occur only with one value of Gender.

ADJ

75474 ru-pos/ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (75474; 100%), Degree=Pos (75212; 100%).

ADJ tokens may have the following values of Gender:

Paradigm КОТОРЫЙMascFemNeut
Animacy=Anim|Case=Accкоторого
Animacy=Inan|Case=Accкоторыйкоторые
Case=Accкоторуюкоторое
Case=Datкоторомукоторойкоторому
Case=Genкоторогокоторойкоторого
Case=Insкоторымкоторойкоторым
Case=Locкоторомкоторойкотором
Case=Nomкоторыйкотораякоторое

VERB

35570 ru-pos/VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (35570; 100%), Person=EMPTY (35570; 100%), Tense=Past (32886; 92%), Case=EMPTY (28700; 81%), Voice=Act (27943; 79%), VerbForm=Fin (25093; 71%), Mood=Ind (25093; 71%), Aspect=Perf (22400; 63%).

VERB tokens may have the following values of Gender:

Paradigm МОЧЬMascFemNeut
Aspect=Imp|Case=Acc|Tense=Pres|VerbForm=Partмогущую
Aspect=Imp|Case=Nom|Tense=Pres|VerbForm=Partмогущее
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Finмогмогламогло
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Finсмогсмогласмогло

DET

12449 ru-pos/DET tokens (59% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (12449; 100%).

DET tokens may have the following values of Gender:

Paradigm ЭТОТMascFemNeut
Case=Accэтот, этого, этоэтуэто
Case=Datэтомуэтойэтому
Case=Genэтогоэтойэтого
Case=Insэтимэтойэтим
Case=Locэтомэтойэтом
Case=Nomэтотэтаэто

AUX

4022 ru-pos/AUX tokens (51% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Tense=Past (4022; 100%), Person=EMPTY (4022; 100%), Number=Sing (4022; 100%), Aspect=Imp (4022; 100%), Voice=Act (4022; 100%), Mood=Ind (4020; 100%), VerbForm=Fin (4020; 100%).

AUX tokens may have the following values of Gender:

Paradigm БЫТЬMascFemNeut
Case=Loc|VerbForm=Partбывшем
Case=Nom|VerbForm=Partбывший
Mood=Ind|VerbForm=Finбылбылабыло

NUM

934 ru-pos/NUM tokens (7% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Case=Acc (504; 54%).

NUM tokens may have the following values of Gender:

Paradigm ДВАMascFemNeut
Animacy=Anim|Case=Accдвухдвух
Animacy=Inan|Case=Accдвадве
Case=Accдва
Case=Nomдвадведва

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (53997; 66%), NOUN –[det]–> DET (12412; 59%), NOUN –[conj]–> NOUN (10983; 51%), NOUN –[amod]–> VERB (5874; 57%), NOUN –[appos]–> NOUN (5604; 78%), NOUN –[name]–> NOUN (4996; 99%), ADJ –[nsubj]–> NOUN (3676; 66%), VERB –[conj]–> VERB (3223; 54%), ADJ –[conj]–> ADJ (2652; 95%), VERB –[auxpass]–> AUX (1312; 76%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]