Animacy
: animacy
In contrast to some other languages, the Slovenian tagset does not consider Animacy
to be a lexical feature, as certain types of inanimate nouns, such as institutions, personified objects, brand names etc., often take on both semantic and grammatical features of animate nouns.
Animacy is thus only marked as an inflectional feature of masculine nouns and proper nouns to distinguish between animate and inanimate word forms in accusative singular, e.g. Odstrigla si je koder. “She cut off a curl.” (inanimate) vs. Videla je kodra. “She saw a poodle.” (animate).
Anim
: animate
Animate
value is attributed to masculine nouns in accusative singular usually ending in -a:
- človeka “man”, delfina “dolphin”, Davida “David”, ministra “minister”
Note that grammatical animatness can also apply to semantically inanimate nouns, such as car names, personified objects, brand names, card names etc.
- Renaulta “Renault”, pomivalca “washing machine”, bordojca “Bordeaux wine”, asa “ace” etc.
Inan
: inanimate
Inanimate
value is attributed to all other masculine nouns in accusative singular:
- avto “car”, fikus “ficus”, čevelj “shoe”
Conversion from JOS
All nouns with Animate=yes are converted to Animacy=Anim
and all nouns with Animate=no are converted to Animacy=Inan
.
Treebank Statistics (UD_Slovenian)
This feature is universal.
It occurs with 2 different values: Anim
, Inan
.
2220 tokens (2%) have a non-empty value of Animacy
.
1134 types (4%) occur at least once with a non-empty value of Animacy
.
1080 lemmas (6%) occur at least once with a non-empty value of Animacy
.
The feature is used with 2 part-of-speech tags: sl-pos/NOUN (1958; 1% instances), sl-pos/PROPN (262; 0% instances).
NOUN
1958 sl-pos/NOUN tokens (6% of all NOUN
tokens) have a non-empty value of Animacy
.
The most frequent other feature values with which NOUN
and Animacy
co-occurred: Number=Sing (1958; 100%), Gender=Masc (1958; 100%), Case=Acc (1958; 100%).
NOUN
tokens may have the following values of Animacy
:
Anim
(209; 11% of non-emptyAnimacy
): otroka, predsednika, človeka, duha, moža, prijatelja, boga, bolnika, nasprotnika, sinaInan
(1749; 89% of non-emptyAnimacy
): dan, čas, način, primer, denar, del, sistem, svet, teden, konecEMPTY
(28181): leta, let, strani, dela, leto, letih, ljudi, življenje, delu, tolarjev
Paradigm duh | Anim | Inan |
---|---|---|
duha | duh |
Animacy
seems to be lexical feature of NOUN
. 100% lemmas (849) occur only with one value of Animacy
.
PROPN
262 sl-pos/PROPN tokens (6% of all PROPN
tokens) have a non-empty value of Animacy
.
The most frequent other feature values with which PROPN
and Animacy
co-occurred: Case=Acc (262; 100%), Gender=Masc (262; 100%), Number=Sing (262; 100%).
PROPN
tokens may have the following values of Animacy
:
Anim
(171; 65% of non-emptyAnimacy
): Andreja, Billyja, Henrika, Boja, Damijana, Filipa, Francija, Hočevarja, Janeza, JohnaInan
(91; 35% of non-emptyAnimacy
): Dunaj, Irak, Nato, Windows, Bruselj, JBX, Jeruzalem, Virtual, ATI, AfganistanEMPTY
(4420): Slovenije, Sloveniji, Slovenija, EU, Ljubljani, ZDA, Slovenijo, Evropi, Mariboru, LJUBLJANA
Animacy
seems to be lexical feature of PROPN
. 100% lemmas (229) occur only with one value of Animacy
.
Relations with Agreement in Animacy
The 10 most frequent relations where parent and child node agree in Animacy
:
PROPN –[name]–> PROPN (50; 100%),
PROPN –[conj]–> PROPN (23; 72%).
Treebank Statistics (UD_Slovenian-SST)
This feature is universal.
It occurs with 2 different values: Anim
, Inan
.
397 tokens (1%) have a non-empty value of Animacy
.
235 types (4%) occur at least once with a non-empty value of Animacy
.
235 lemmas (6%) occur at least once with a non-empty value of Animacy
.
The feature is used with 2 part-of-speech tags: sl-pos/NOUN (372; 1% instances), sl-pos/PROPN (25; 0% instances).
NOUN
372 sl-pos/NOUN tokens (10% of all NOUN
tokens) have a non-empty value of Animacy
.
The most frequent other feature values with which NOUN
and Animacy
co-occurred: Number=Sing (372; 100%), Case=Acc (372; 100%), Gender=Masc (372; 100%).
NOUN
tokens may have the following values of Animacy
:
Anim
(37; 10% of non-emptyAnimacy
): cimra, gospoda, otroka, sina, atija, babeka, dedca, duha, ekonomista, eksponentaInan
(335; 90% of non-emptyAnimacy
): dan, način, denar, izraz, petek, teden, primer, čas, konec, mesecEMPTY
(3255): bistvu, redu, strani, jutro, leto, stvari, evrov, koncu, gospod, hvala
Animacy
seems to be lexical feature of NOUN
. 100% lemmas (214) occur only with one value of Animacy
.
PROPN
25 sl-pos/PROPN tokens (3% of all PROPN
tokens) have a non-empty value of Animacy
.
The most frequent other feature values with which PROPN
and Animacy
co-occurred: Number=Sing (25; 100%), Case=Acc (25; 100%), Gender=Masc (25; 100%).
PROPN
tokens may have the following values of Animacy
:
Anim
(8; 32% of non-emptyAnimacy
): arturja, boruca, giordanota, miklavža, petra, planinška, poljanška, sinclairjaInan
(17; 68% of non-emptyAnimacy
): paranoid, rodik, triglav, erasmus, etnoblog, frutiq, ikš, lech, maribor, piranEMPTY
(733): [name:personal], [name:surname], slovenija, sloveniji, [name:address], jones, slovenije, tom, [name:organisation], david
Animacy
seems to be lexical feature of PROPN
. 100% lemmas (21) occur only with one value of Animacy
.
Relations with Agreement in Animacy
The 10 most frequent relations where parent and child node agree in Animacy
:
NOUN –[appos]–> NOUN (3; 60%),
NOUN –[reparandum]–> NOUN (3; 60%),
NOUN –[parataxis]–> NOUN (2; 67%),
PROPN –[conj]–> PROPN (1; 100%),
PROPN –[name]–> PROPN (1; 100%).
Animacy in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]