home bg/pos edit page issue tracker

ADJ: adjective

Definition

Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates, as in

Example: [bg] Колата е зелена / Kolata e zelena (The car is green.)

The ADJ tag is intended for ordinary adjectives only. See DET for determiners and NUM for numerals.

In Bulgarian the words that map to the ADJ tag from the BulTreeBank tagset are:

Example: [bg] добър / dobar (good) 7-годишен / 7-godishen (seven-years-old)

Example: [bg] Иванова книга / Ivanova kniga (Ivan’s book)

Example: [bg] втори / vtori (second)

Example: [bg] идващ / idvasht (coming)

Example: [bg] намерен / nameren (found)

Example: [bg] направил / napravil (made)

Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.


Treebank Statistics (UD_Bulgarian)

There are 3121 ADJ lemmas (20%), 6326 ADJ types (23%) and 13589 ADJ tokens (9%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: нов, друг, български, голям, народен, пръв, държавен, европейски, цял, втори

The 10 most frequent ADJ types: други, народното, българската, нова, другите, нови, европейската, последните, 2001, друг

The 10 most frequent ambiguous lemmas: нов (ADJ 301, PROPN 3), български (ADJ 229, ADV 2), голям (ADJ 229, PROPN 1), европейски (ADJ 124, ADV 1), политически (ADJ 107, ADV 3), икономически (ADJ 54, ADV 2), следвам (ADJ 52, VERB 14), мина-(се) (ADJ 50, VERB 37), стар (ADJ 49, PROPN 4), син (ADJ 44, NOUN 20)

The 10 most frequent ambiguous types: 2001 (ADJ 42, NUM 1), 2000 (ADJ 40, NUM 12, PROPN 4), български (ADJ 30, ADV 2), политически (ADJ 33, ADV 3), 1 (NUM 53, ADJ 29, PROPN 1), II (ADJ 18, PROPN 1), останалите (ADJ 10, VERB 1), европейски (ADJ 13, ADV 1), Южна (ADJ 14, PROPN 1), свързани (ADJ 13, VERB 3)

Morphology

The form / lemma ratio of ADJ is 2.026914 (the average of all parts of speech is 1.728233).

The 1st highest number of forms (25) was observed with the lemma “голям”: големи, големите, големия, големият, голям, голяма, голямата, голямо, най-големи, най-големите, най-големия, най-големият, най-голям, най-голяма, най-голямата, най-голямо, най-голямото, по-големи, по-големите, по-големия, по-голям, по-голяма, по-голямата, по-голямо, по-голямото.

The 2nd highest number of forms (21) was observed with the lemma “добър”: Най-добра, добра, добрата, добри, добрите, добрият, добро, доброто, добър, най-добрата, най-добри, най-добрите, най-добрия, най-добрият, най-доброто, най-добър, по-добра, по-добри, по-добрият, по-добро, по-добър.

The 3rd highest number of forms (17) was observed with the lemma “висок”: висок, висока, високата, високи, високите, високия, високо, високото, най-висок, най-високата, най-високите, най-високо, по-висок, по-висока, по-високи, по-високите, по-високо.

ADJ occurs with 10 features: bg-feat/Number (13351; 98% instances), bg-feat/Definite (13292; 98% instances), bg-feat/Degree (11557; 85% instances), bg-feat/Gender (9449; 70% instances), bg-feat/Aspect (1472; 11% instances), bg-feat/VerbForm (1472; 11% instances), bg-feat/Voice (1472; 11% instances), bg-feat/NumType (895; 7% instances), bg-feat/Tense (519; 4% instances), bg-feat/Case (24; 0% instances)

ADJ occurs with 19 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Voc, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Ord, Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Part, Voice=Act, Voice=Pass

ADJ occurs with 128 feature combinations. The most frequent feature combination is Definite=Ind|Degree=Pos|Number=Plur (1788 tokens). Examples: други, нови, различни, големи, български, добри, народни, подобни, финансови, военни

Relations

ADJ nodes are attached to their parents using 14 different relations: bg-dep/amod (11869; 87% instances), bg-dep/conj (443; 3% instances), bg-dep/root (381; 3% instances), bg-dep/dobj (329; 2% instances), bg-dep/nmod (191; 1% instances), bg-dep/nsubj (149; 1% instances), bg-dep/ccomp (77; 1% instances), bg-dep/iobj (54; 0% instances), bg-dep/advcl (34; 0% instances), bg-dep/acl (32; 0% instances), bg-dep/csubj (14; 0% instances), bg-dep/nsubjpass (10; 0% instances), bg-dep/xcomp (4; 0% instances), bg-dep/csubjpass (2; 0% instances)

Parents of ADJ nodes belong to 10 different parts of speech: NOUN (11703; 86% instances), VERB (744; 5% instances), ROOT (381; 3% instances), ADJ (371; 3% instances), PROPN (326; 2% instances), NUM (24; 0% instances), DET (19; 0% instances), ADV (14; 0% instances), PRON (5; 0% instances), PART (2; 0% instances)

11202 (82%) ADJ nodes are leaves.

1069 (8%) ADJ nodes have one child.

508 (4%) ADJ nodes have two children.

810 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 11.

Children of ADJ nodes are attached using 20 different relations: bg-dep/punct (1228; 22% instances), bg-dep/nmod (844; 15% instances), bg-dep/cop (577; 10% instances), bg-dep/advmod (568; 10% instances), bg-dep/nsubj (479; 9% instances), bg-dep/conj (458; 8% instances), bg-dep/det (380; 7% instances), bg-dep/cc (370; 7% instances), bg-dep/case (274; 5% instances), bg-dep/mark (83; 1% instances), bg-dep/neg (65; 1% instances), bg-dep/expl (58; 1% instances), bg-dep/advcl (52; 1% instances), bg-dep/aux (43; 1% instances), bg-dep/discourse (28; 1% instances), bg-dep/acl (22; 0% instances), bg-dep/csubj (18; 0% instances), bg-dep/iobj (7; 0% instances), bg-dep/dobj (5; 0% instances), bg-dep/amod (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PUNCT (1228; 22% instances), NOUN (1057; 19% instances), VERB (720; 13% instances), PRON (628; 11% instances), ADV (588; 11% instances), ADJ (371; 7% instances), CONJ (367; 7% instances), ADP (271; 5% instances), PROPN (108; 2% instances), SCONJ (77; 1% instances), INTJ (71; 1% instances), PART (52; 1% instances), AUX (13; 0% instances), DET (6; 0% instances), NUM (3; 0% instances)


ADJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]