home sl/feat edit page issue tracker

Gender: gender

Gender is a lexical feature of nouns and proper nouns, and an inflectional feature of other parts of speech (adjectives, verbs, auxiliary, pronouns, determiners and numerals) that mark agreement with nouns.

Masc: masculine gender

Examples

Fem: feminine gender

Examples

Neut: neuter gender

Examples

Conversion from JOS

All tokens with feature Gender=masculine are converted to Gender=Masc, all tokens with feature Gender=feminine are converted to Gender=Fem and all tokens with feature Gender=neuter are converted to Gender=Neut.


Treebank Statistics (UD_Slovenian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

64731 tokens (46%) have a non-empty value of Gender. 29150 types (92%) occur at least once with a non-empty value of Gender. 14613 lemmas (87%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: sl-pos/NOUN (30139; 21% instances), sl-pos/ADJ (15027; 11% instances), sl-pos/VERB (7644; 5% instances), sl-pos/PROPN (4682; 3% instances), sl-pos/PRON (3877; 3% instances), sl-pos/DET (2855; 2% instances), sl-pos/NUM (486; 0% instances), sl-pos/AUX (21; 0% instances).

NOUN

30139 sl-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21345; 71%).

NOUN tokens may have the following values of Gender:

Paradigm potMascFem
Case=Acc|Number=Singpot
Case=Acc|Number=Plurpoti
Case=Dat|Number=Singpoti
Case=Gen|Number=Singpotapoti
Case=Gen|Number=Plurpoti
Case=Ins|Number=Singpotjo
Case=Ins|Number=Plurpotmi
Case=Loc|Number=Singpoti
Case=Loc|Number=Plurpoteh
Case=Nom|Number=Singpot
Case=Nom|Number=Plurpoti

Gender seems to be lexical feature of NOUN. 100% lemmas (6404) occur only with one value of Gender.

ADJ

15027 sl-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (13768; 92%), VerbForm=EMPTY (13098; 87%), Definite=EMPTY (12959; 86%), Number=Sing (10131; 67%).

ADJ tokens may have the following values of Gender:

Paradigm drugMascFemNeut
Case=Acc|Definite=Def|Number=Singdrugi
Case=Acc|Definite=Ind|Number=Singdrug
Case=Acc|Number=Singdrugegadrugodrugo
Case=Acc|Number=Plurdrugedrugedruga
Case=Dat|Number=Singdrugemudrugi
Case=Dat|Number=Plurdrugim
Case=Gen|Number=Singdrugegadrugedrugega
Case=Gen|Number=Plurdrugihdrugih
Case=Ins|Number=Singdrugimdrugodrugim
Case=Ins|Number=Plurdrugimidrugimi
Case=Loc|Number=Singdrugemdrugidrugem
Case=Loc|Number=Dualdrugih
Case=Loc|Number=Plurdrugihdrugihdrugih
Case=Nom|Definite=Def|Number=Singdrugi
Case=Nom|Definite=Ind|Number=Singdrug
Case=Nom|Number=Singdrugadrugo
Case=Nom|Number=Plurdrugidrugedruga

VERB

7644 sl-pos/VERB tokens (44% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (7644; 100%), Person=EMPTY (7644; 100%), Mood=EMPTY (7644; 100%), Tense=EMPTY (7644; 100%), Negative=EMPTY (7644; 100%), Number=Sing (5091; 67%), Aspect=Perf (4188; 55%).

VERB tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbilbilabilo, blo
Number=Dualbila, blabili
Number=Plurbilibilebila

PROPN

4682 sl-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (4408; 94%), Case=Nom (2427; 52%).

PROPN tokens may have the following values of Gender:

Paradigm EUMascFem
Case=AccEU
Case=GenEU
Case=LocEU
Case=NomEUEU

Gender seems to be lexical feature of PROPN. 99% lemmas (2571) occur only with one value of Gender.

PRON

3877 sl-pos/PRON tokens (56% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3854; 99%), Number=Sing (3000; 77%), Variant=EMPTY (2535; 65%), Person=EMPTY (2189; 56%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Acc|Number=Singnjeganjo
Case=Acc|Number=Sing|Variant=Shortgajoga
Case=Acc|Number=Dualnjiju
Case=Acc|Number=Dual|Variant=Shortjujuju
Case=Acc|Number=Plurnjih, nje
Case=Acc|Number=Plur|Variant=Shortjihjihjih
Case=Dat|Number=Singnjemunjej
Case=Dat|Number=Sing|Variant=Shortmujimu
Case=Dat|Number=Dualnjima
Case=Dat|Number=Dual|Variant=Shortjimajima
Case=Dat|Number=Plurnjimnjim
Case=Dat|Number=Plur|Variant=Shortjimjimjim
Case=Gen|Number=Singnjeganjenjega
Case=Gen|Number=Sing|Variant=Shortgajega
Case=Gen|Number=Dualnjiju
Case=Gen|Number=Dual|Variant=Shortju
Case=Gen|Number=Plurnjihnjihnjih
Case=Gen|Number=Plur|Variant=Shortjihjihjih
Case=Ins|Number=Singnjimnjonjim
Case=Ins|Number=Dualnjimanjima
Case=Ins|Number=Plurnjiminjiminjimi
Case=Loc|Number=Singnjemnjejnjem
Case=Loc|Number=Dualnjiju
Case=Loc|Number=Plurnjihnjihnjih
Case=Nom|Number=Singonona
Case=Nom|Number=Dualonadva
Case=Nom|Number=Pluroni

DET

2855 sl-pos/DET tokens (86% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Degree=EMPTY (2855; 100%), Gender[psor]=EMPTY (2476; 87%), Reflex=EMPTY (2399; 84%), Number[psor]=EMPTY (2051; 72%), Person=EMPTY (2051; 72%), Number=Sing (1913; 67%), Poss=EMPTY (1595; 56%).

DET tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singta, tegatoto
Case=Acc|Number=Dualti
Case=Acc|Number=Plurteteta
Case=Dat|Number=Singtemutejtemu
Case=Dat|Number=Plurtem
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Dualteh
Case=Gen|Number=Plurtehtehteh
Case=Ins|Number=Singtemtotem
Case=Ins|Number=Plurtemitemi
Case=Loc|Number=Singtemtejtem
Case=Loc|Number=Plurtehtehteh
Case=Nom|Number=Singtatato
Case=Nom|Number=Dualta
Case=Nom|Number=Plurtiteta

NUM

486 sl-pos/NUM tokens (25% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (486; 100%), NumType=Card (481; 99%).

NUM tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Acc|Number=Singen, enegaenoeno
Case=Dat|Number=Singenemueni
Case=Gen|Number=Singenegaeneenega
Case=Ins|Number=Singenimenoenim
Case=Loc|Number=Singenemenienem
Case=Loc|Number=Plurenih
Case=Nom|Number=Singenenaeno
Case=Nom|Number=Plureni

AUX

21 sl-pos/AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Tense=EMPTY (21; 100%), VerbForm=Part (21; 100%), Person=EMPTY (21; 100%), Mood=EMPTY (21; 100%), Negative=EMPTY (21; 100%), Number=Sing (14; 67%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFem
Number=Singbilbila
Number=Dualbila
Number=Plurbili

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (11306; 99%), NOUN –[det]–> DET (2835; 86%), ADJ –[nsubj]–> NOUN (880; 98%), NOUN –[nmod]–> PROPN (839; 55%), PROPN –[name]–> PROPN (683; 100%), ADJ –[conj]–> ADJ (632; 93%), VERB –[nsubj]–> PROPN (578; 73%), VERB –[conj]–> VERB (550; 68%), PROPN –[amod]–> ADJ (246; 100%), PROPN –[conj]–> PROPN (224; 71%).


Treebank Statistics (UD_Slovenian-SST)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

9593 tokens (33%) have a non-empty value of Gender. 4435 types (73%) occur at least once with a non-empty value of Gender. 2968 lemmas (75%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: sl-pos/NOUN (3627; 12% instances), sl-pos/ADJ (1665; 6% instances), sl-pos/PRON (1635; 6% instances), sl-pos/VERB (1292; 4% instances), sl-pos/DET (660; 2% instances), sl-pos/PROPN (444; 2% instances), sl-pos/NUM (270; 1% instances).

NOUN

3627 sl-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=EMPTY (3255; 90%), Number=Sing (2737; 75%).

NOUN tokens may have the following values of Gender:

Paradigm očiMascFem
Case=Gen|Number=Pluroči
Case=Nom|Number=Singoči

Gender seems to be lexical feature of NOUN. 100% lemmas (1525) occur only with one value of Gender.

ADJ

1665 sl-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (1479; 89%), Degree=Pos (1443; 87%), Definite=EMPTY (1352; 81%), Number=Sing (1266; 76%), Case=Nom (879; 53%).

ADJ tokens may have the following values of Gender:

Paradigm drugMascFemNeut
Case=Acc|Definite=Def|Number=Singdrugi
Case=Acc|Number=Singdrugodrugo
Case=Acc|Number=Plurdrugedruge
Case=Dat|Number=Singdrugemu
Case=Gen|Number=Singdrugegadrugedrugega
Case=Gen|Number=Plurdrugihdrugih
Case=Ins|Number=Singdrugodrugim
Case=Ins|Number=Plurdrugimi
Case=Loc|Number=Singdrugidrugem
Case=Loc|Number=Dualdrugih
Case=Nom|Definite=Def|Number=Singdrugi
Case=Nom|Definite=Ind|Number=Singdrug
Case=Nom|Number=Singdrugadrugo
Case=Nom|Number=Plurdrugidruge

PRON

1635 sl-pos/PRON tokens (63% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Variant=EMPTY (1425; 87%), Number=Sing (1356; 83%), Person=EMPTY (1218; 74%), Case=Nom (877; 54%).

PRON tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singtatoto
Case=Acc|Number=Plurtete
Case=Dat|Number=Singtemutemu
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Plurtehteh
Case=Ins|Number=Singtem
Case=Loc|Number=Singtem
Case=Nom|Number=Singtatato
Case=Nom|Number=Plurtiteta

VERB

1292 sl-pos/VERB tokens (28% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (1292; 100%), Mood=EMPTY (1292; 100%), Negative=EMPTY (1292; 100%), VerbForm=Part (1292; 100%), Tense=EMPTY (1292; 100%), Number=Sing (886; 69%).

VERB tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Aspect=Imp|Number=Singbilbilo
Number=Singbilbilabilo
Number=Dualbila
Number=Plurbilibilebila

DET

660 sl-pos/DET tokens (93% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (547; 83%), Poss=EMPTY (547; 83%), Number[psor]=EMPTY (547; 83%), Number=Sing (474; 72%), PronType=Dem (331; 50%).

DET tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singta, tegatoto
Case=Acc|Number=Plurteteta
Case=Dat|Number=Singtej
Case=Dat|Number=Plurtemtem
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Plurtehtehteh
Case=Ins|Number=Singtemtotem
Case=Ins|Number=Plurtemitemi
Case=Loc|Number=Singtemtejtem
Case=Loc|Number=Plurtehteh
Case=Nom|Number=Singtatato
Case=Nom|Number=Dualti
Case=Nom|Number=Plurtiteta

PROPN

444 sl-pos/PROPN tokens (59% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (403; 91%), Case=Nom (233; 52%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (306) occur only with one value of Gender.

NUM

270 sl-pos/NUM tokens (54% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (270; 100%), NumType=Card (269; 100%), Number=Sing (153; 57%).

NUM tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Acc|Number=Singen, enegaenoeno
Case=Acc|Number=Plurene
Case=Dat|Number=Singenemu
Case=Gen|Number=Singenegaene
Case=Gen|Number=Plurenih
Case=Ins|Number=Singenimenoenim
Case=Loc|Number=Singeni
Case=Nom|Number=Singenenaeno
Case=Nom|Number=Plureniena

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (941; 99%), NOUN –[det]–> DET (580; 90%), NOUN –[nummod]–> NUM (139; 53%), NOUN –[conj]–> NOUN (106; 60%), ADJ –[nsubj]–> NOUN (76; 96%), PROPN –[name]–> PROPN (75; 100%), ADJ –[nsubj]–> PRON (53; 80%), ADJ –[conj]–> ADJ (50; 93%), NOUN –[appos]–> NOUN (30; 64%), ADJ –[det]–> DET (28; 93%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]