Number
: number
Number is an inflectional feature of nouns, adjectives, verbs. In the tagset it is encoded as: singular (s), plural (p), count (c), pluralia tantum (l). Singularia tantum is not encoded.
Sing: singular number
A singular noun denotes one person, animal or thing.
Examples: [bg] молив / moliv (pencil)
Plur: plural number
A plural noun denotes several persons, animals or things.
Examples: [bg] моливи / molivi (pencils)
Count: count plural form
A form that is used as plural for masculine non-person nouns after numerals. This is a remnant of the dual form.
Examples: [bg] 2 молива / (2) moliva (2 pencils-count)
Ptan: plurale tantum
Some nouns appear only in the plural form even though they denote one thing (semantic singular); some tagsets mark this distinction.
Examples: [bg] финанси, дънки / finansi, danki (finances, jeans)
Coll: collective / mass / singulare tantum
Collective or mass or singulare tantum is a special case of singular. It applies to words that use grammatical singular to describe sets of objects, i.e. semantic plural.
Examples: [bg] човечество / chovechestvo (mankind)
Treebank Statistics (UD_Bulgarian)
This feature is universal but the values Count
are language-specific.
It occurs with 4 different values: Count
, Plur
, Ptan
, Sing
.
86664 tokens (55%) have a non-empty value of Number
.
27544 types (104%) occur at least once with a non-empty value of Number
.
13974 lemmas (94%) occur at least once with a non-empty value of Number
.
The feature is used with 9 part-of-speech tags: bg-pos/NOUN (33611; 22% instances), bg-pos/VERB (19376; 12% instances), bg-pos/ADJ (13351; 9% instances), bg-pos/PROPN (8167; 5% instances), bg-pos/PRON (5324; 3% instances), bg-pos/DET (2393; 2% instances), bg-pos/NUM (2038; 1% instances), bg-pos/AUX (2008; 1% instances), bg-pos/ADV (396; 0% instances).
NOUN
33611 bg-pos/NOUN tokens (98% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Definite=Ind (20653; 61%).
NOUN
tokens may have the following values of Number
:
Count
(883; 3% of non-emptyNumber
): %, лв., млн., $, месеца, дни, лева, млрд., долара, пътиPlur
(8631; 26% of non-emptyNumber
): г., години, пари, страни, проблеми, представители, сили, промени, фирми, паритеPtan
(321; 1% of non-emptyNumber
): хората, хора, души, преговори, преговорите, финансите, боеприпаси, книжа, книжата, белезнициSing
(23776; 71% of non-emptyNumber
): г., време, година, част, страната, събрание, път, страна, края, съветEMPTY
(538): президентът, глава, стратегии, властта, собственост, партия, президент, интерес, въпрос, гласувания
Paradigm лев | Sing | Plur | Count |
---|---|---|---|
Definite=Def | лева, Левът | ||
Definite=Ind | лев | левове | |
лв., лева |
VERB
19376 bg-pos/VERB tokens (99% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Voice=Act (18025; 93%), VerbForm=Fin (16611; 86%), Definite=EMPTY (16611; 86%), Mood=Ind (16458; 85%), Person=3 (13729; 71%), Tense=Pres (11733; 61%), Aspect=Imp (11178; 58%).
VERB
tokens may have the following values of Number
:
Plur
(5716; 30% of non-emptyNumber
): са, могат, имат, бяха, сме, съобщиха, можем, били, имаме, работятSing
(13660; 70% of non-emptyNumber
): е, има, няма, може, трябва, беше, каза, съобщи, бъде, заявиEMPTY
(176): е, каза, няма, откри, поздрави, посочи, беше, заяви, са, благодари
Paradigm съм | Sing | Plur |
---|---|---|
Definite=Ind|Gender=Masc|Mood=Ind|VerbForm=Part|Voice=Act | бил | |
Definite=Ind|Gender=Fem|Mood=Ind|VerbForm=Part|Voice=Act | била | |
Definite=Ind|Gender=Neut|Mood=Ind|VerbForm=Part|Voice=Act | било | |
Definite=Ind|Mood=Ind|VerbForm=Part|Voice=Act | били | |
Mood=Cnd|Person=3|Tense=Past|VerbForm=Fin | би | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | съм | сме |
Mood=Ind|Person=1|VerbForm=Fin|Voice=Act | бях | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act | си | сте |
Mood=Ind|Person=2|VerbForm=Fin|Voice=Act | бяхте | |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | е | са |
Mood=Ind|Person=3|VerbForm=Fin|Voice=Act | беше, бе | бяха |
ADJ
13351 bg-pos/ADJ tokens (98% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: VerbForm=EMPTY (11879; 89%), Aspect=EMPTY (11879; 89%), Voice=EMPTY (11879; 89%), Degree=Pos (11076; 83%), Definite=Ind (7399; 55%).
ADJ
tokens may have the following values of Number
:
Plur
(3902; 29% of non-emptyNumber
): други, другите, последните, нови, новите, първите, различни, българските, големи, въоръженитеSing
(9449; 71% of non-emptyNumber
): народното, българската, нова, друг, 2001, цялата, 2000, европейската, голяма, новияEMPTY
(238): US, т.-нар., българския, интелектуалната, жп, политически, Европейската, държавен, държавният, народна
Paradigm нов | Sing | Plur |
---|---|---|
Case=Voc|Degree=Pos|Gender=Masc | Нови | |
Definite=Def|Degree=Pos | новите | |
Definite=Def|Degree=Pos|Gender=Masc | новия, новият | |
Definite=Def|Degree=Pos|Gender=Fem | новата | |
Definite=Def|Degree=Pos|Gender=Neut | новото | |
Definite=Def|Degree=Sup|Gender=Masc | най-новият | |
Definite=Def|Degree=Sup|Gender=Fem | най-новата | |
Definite=Def|Degree=Sup|Gender=Neut | Най-новото | |
Definite=Ind|Degree=Pos | нови | |
Definite=Ind|Degree=Pos|Gender=Masc | нов | |
Definite=Ind|Degree=Pos|Gender=Fem | нова | |
Definite=Ind|Degree=Pos|Gender=Neut | ново | |
Definite=Ind|Degree=Sup | най-нови |
PROPN
8167 bg-pos/PROPN tokens (97% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Definite=Ind (7889; 97%), Gender=Masc (5113; 63%).
PROPN
tokens may have the following values of Number
:
Plur
(141; 2% of non-emptyNumber
): САЩ, Балканите, БДЖ, ОДС, DM, Балкани, Гласове, Полимери, РМД-та, АлпиPtan
(8; 0% of non-emptyNumber
): Кремиковци, ОАЕ, Брадвари, ДрагалевциSing
(8018; 98% of non-emptyNumber
): България, София, Иван, ЕС, Европа, СДС, Костов, Петър, Георги, ТурцияEMPTY
(261): Стоянов, Петър, България, Македония, де, Р-300, ван, -, 2000, Велико
Paradigm сдс | Sing | Plur |
---|---|---|
Definite=Def|Gender=Masc | СДС | |
Definite=Def|Gender=Neut | СДС-та | |
Definite=Ind|Gender=Masc | СДС |
Number
seems to be lexical feature of PROPN
. 100% lemmas (2857) occur only with one value of Number
.
PRON
5324 bg-pos/PRON tokens (53% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Poss=EMPTY (5324; 100%), Reflex=EMPTY (5324; 100%), PronType=Prs (3376; 63%), Case=Nom (3275; 62%).
PRON
tokens may have the following values of Number
:
Plur
(1412; 27% of non-emptyNumber
): които, те, ги, тях, нас, ни, ние, им, всички, виSing
(3912; 73% of non-emptyNumber
): това, той, го, който, тя, която, му, което, него, азEMPTY
(4770): се, си, му, ни, й, им, ми, себе, ви, ти
Paradigm аз | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc|Person=3 | го, него | |
Case=Acc|Gender=Fem|Person=3 | я, нея | |
Case=Acc|Gender=Neut|Person=3 | го, него | |
Case=Acc|Person=1 | ме, мен, мене | нас, ни |
Case=Acc|Person=2 | те, тебе, ви, вас, теб | вас, ви |
Case=Acc|Person=3 | ги, тях | |
Case=Dat|Gender=Masc|Person=3 | му, нему | |
Case=Dat|Gender=Fem|Person=3 | й | |
Case=Dat|Gender=Neut|Person=3 | му | |
Case=Dat|Person=1 | ми, мен, мене | ни |
Case=Dat|Person=2 | ти, ви | ви |
Case=Dat|Person=3 | им, тям | |
Case=Nom|Gender=Masc|Person=3 | той | |
Case=Nom|Gender=Fem|Person=3 | тя | |
Case=Nom|Gender=Neut|Person=3 | то | |
Case=Nom|Person=1 | аз | ние, ний |
Case=Nom|Person=2 | ти, вие | вие |
Case=Nom|Person=3 | те | |
Gender=Fem|Person=3 | й | |
Person=1 | ми | ни |
Person=2 | ти, ви | ви |
Person=3 | им |
DET
2393 bg-pos/DET tokens (98% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Person=EMPTY (2015; 84%), Poss=EMPTY (1893; 79%), Definite=EMPTY (1615; 67%).
DET
tokens may have the following values of Number
:
Plur
(699; 29% of non-emptyNumber
): тези, всички, нашите, какви, някои, своите, такива, техните, наши, тияSing
(1694; 71% of non-emptyNumber
): тази, този, това, един, какво, една, всеки, всяка, едно, свояEMPTY
(40): всички, тази, този, всеки, нашата, нашия, нейната, Всяко, Моят, Тези
Paradigm този | Sing | Plur |
---|---|---|
Gender=Masc | този, тоя, оня, онзи | |
Gender=Fem | тази, тая, онази, тeзи | |
Gender=Neut | това, онова, туй | |
тези, тия, онези, ония |
NUM
2038 bg-pos/NUM tokens (97% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=Card (2038; 100%), Definite=Ind (1908; 94%), Gender=EMPTY (1544; 76%).
NUM
tokens may have the following values of Number
:
Plur
(1810; 89% of non-emptyNumber
): две, два, 2, 3, 10, 20, три, двамата, 000, дветеSing
(228; 11% of non-emptyNumber
): един, една, 1, едно, 0, Единият, едното, 0,1, 0.00, една-единственаEMPTY
(66): три, две, половин, двамата, двете, два, един, тримата, 1, 3
Number
seems to be lexical feature of NUM
. 100% lemmas (407) occur only with one value of Number
.
AUX
2008 bg-pos/AUX tokens (99% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Voice=Act (1906; 95%), VerbForm=Fin (1889; 94%), Mood=Ind (1787; 89%), Aspect=Imp (1760; 88%), Person=3 (1667; 83%), Tense=Pres (1384; 69%).
AUX
tokens may have the following values of Number
:
Plur
(645; 32% of non-emptyNumber
): са, бяха, бъдат, сме, били, сте, биха, бихте, биват, бихмеSing
(1363; 68% of non-emptyNumber
): е, бе, бъде, беше, съм, би, бил, бих, си, билаEMPTY
(23): е, са, би, бъдат, съм, беше, бъде, сме
Paradigm съм | Sing | Plur |
---|---|---|
Definite=Ind|Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | бил | |
Definite=Ind|Gender=Fem|Tense=Past|VerbForm=Part|Voice=Act | била | |
Definite=Ind|Gender=Neut|Tense=Past|VerbForm=Part|Voice=Act | било | |
Definite=Ind|Tense=Past|VerbForm=Part|Voice=Act | били | |
Mood=Cnd|Person=1|VerbForm=Fin | бих | бихме |
Mood=Cnd|Person=2|VerbForm=Fin | Би | бихте |
Mood=Cnd|Person=3|VerbForm=Fin | би | биха |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin|Voice=Act | бях | бяхме |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | съм | сме |
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin|Voice=Act | беше | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act | си | сте |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Act | бе, беше | бяха |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | е | са |
ADV
396 bg-pos/ADV tokens (6% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: PronType=EMPTY (396; 100%), Degree=EMPTY (280; 71%).
ADV
tokens may have the following values of Number
:
Plur
(396; 100% of non-emptyNumber
): много, повече, повечето, малко, по-малко, най-много, най-малко, Многая, Най-малкото, малкотоEMPTY
(6162): още, вчера, само, вече, когато, защото, обаче, сега, как, така
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (11111; 98%),
NOUN –[nmod]–> NOUN (5883; 61%),
VERB –[nsubj]–> NOUN (4719; 94%),
VERB –[dobj]–> NOUN (2953; 59%),
VERB –[nmod]–> NOUN (2700; 58%),
NOUN –[nmod]–> PROPN (2629; 82%),
VERB –[nsubj]–> PRON (2037; 98%),
NOUN –[det]–> DET (1934; 99%),
VERB –[ccomp]–> VERB (1865; 75%),
NOUN –[conj]–> NOUN (1688; 78%).
Number in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]