home cs/pos edit page issue tracker

ADJ: adjective

Definition

Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates, as in

To auto je zelené.  “The car is green.”

The ADJ tag is intended for ordinary adjectives only. See DET for determiners and NUM for cardinal numerals.

In accord with the UD approach, adjectival ordinal numerals (první, sedmý, stopadesátý)  are tagged as adjectives, although the traditional grammar classifies them as numerals. They behave like adjectives both morphologically and syntactically, with the exception that they cannot be compared and negated.

Most Czech adjectives inflect for Gender (velký – velká – velké)  “big”, Number (velký – velcí),  Case (velký – velkého – velkému – velkém – velkým),  Degree (velký – větší – největší),  and Negation (velký – nevelký). 

Examples

Border cases

Passive participles lie on the border between verbs and adjectives. Core participial forms (ending in consonant or short vowel) are tagged VERB. Long forms are participial adjectives and they are tagged ADJ. For example:

Their meaning is almost identical but the usage slightly varies. Both groups can be used in nominal predication with copula. Only true participles (verbs) can be used to form the passive voice (but it may be sometimes difficult to distinguish from copula constructions, see AUX). On the other hand, the participial adjectives inflect for case and thus can modify nouns.

There is an analogy with some adjectives that preserved so called nominal (short) forms. And these adjectives are not derived from verbs. Example:

Here both groups are ADJ. The nominal forms are used in predication, the standard forms both in predication and to modify nouns.

References


Treebank Statistics (UD_Czech)

There are 14158 ADJ lemmas (24%), 36819 ADJ types (28%) and 180811 ADJ tokens (12%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: český, velký, nový, další, první, jiný, druhý, vysoký, dobrý, celý

The 10 most frequent ADJ types: první, další, české, nové, druhé, poslední, státní, dalších, možné, vlastní

The 10 most frequent ambiguous lemmas: velký (ADJ 2468, ADV 1), obchodní (ADJ 588, ADV 1), starý (ADJ 567, NOUN 5), známý (ADJ 560, NOUN 21), domácí (ADJ 515, NOUN 5), mladý (ADJ 443, NOUN 3), třeba (ADJ 409, ADV 404), blízký (ADJ 314, NOUN 2), vedoucí (ADJ 156, NOUN 145), spolkový (ADJ 117, NOUN 1)

The 10 most frequent ambiguous types: vlastní (ADJ 464, VERB 76), třeba (ADJ 408, ADV 372), hlavní (ADJ 298, NOUN 3), tzv (ADJ 359, ADV 1), domácí (ADJ 230, NOUN 2), dobré (ADJ 211, NOUN 1), vysoké (ADJ 190, NOUN 1), a (CONJ 31068, ADJ 183, NOUN 49, ADP 7), lepší (ADJ 169, VERB 2), o (ADP 9669, ADJ 110, PUNCT 99, NOUN 4)

Morphology

The form / lemma ratio of ADJ is 2.600579 (the average of all parts of speech is 2.195950).

The 1st highest number of forms (32) was observed with the lemma “známý”: nejznámější, nejznámějších, nejznámějším, neznáma, neznámo, neznámou, neznámá, neznámé, neznámého, neznámém, neznámí, neznámý, neznámých, neznámým, neznámými, znám, známa, známi, známo, známou, známy, známá, známé, známého, známém, známému, známí, známý, známých, známým, známými, známější.

The 2nd highest number of forms (31) was observed with the lemma “dobrý”: Dobrú, dobrou, dobrá, dobré, dobrého, dobrém, dobrému, dobrý, dobrých, dobrým, dobrými, dobří, lepší, lepších, lepšího, lepším, lepšími, lepšímu, nedobrou, nedobrá, nedobré, nedobrého, nedobrý, nedobrých, nejlepší, nejlepších, nejlepšího, nejlepším, nejlepšími, nejlepšímu, nelepší.

The 3rd highest number of forms (31) was observed with the lemma “velký”: největší, největších, největšího, největším, největšími, největšímu, nevelkou, nevelká, nevelké, nevelkého, nevelký, nevelkých, nevelkým, nevelkými, velcí, velkou, velká, velké, velkého, velkém, velkému, velký, velkých, velkým, velkými, větší, větších, většího, větším, většími, většímu.

ADJ occurs with 20 features: Number (176213; 97% instances), Gender (176190; 97% instances), Case (174220; 96% instances), Negative (173109; 96% instances), Degree (166322; 92% instances), Animacy (73924; 41% instances), NumType (4990; 3% instances), NameType (4756; 3% instances), Aspect (4498; 2% instances), Tense (4498; 2% instances), VerbForm (4498; 2% instances), Voice (4498; 2% instances), Gender[psor] (2707; 1% instances), Poss (2707; 1% instances), Foreign (2669; 1% instances), Variant (1889; 1% instances), Abbr (1714; 1% instances), Hyph (398; 0% instances), Style (62; 0% instances), NumValue (30; 0% instances)

ADJ occurs with 61 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Hyph=Yes, NameType=Com, NameType=Com,Geo, NameType=Com,Giv, NameType=Com,Oth, NameType=Com,Pro, NameType=Com,Pro,Sur, NameType=Com,Sur, NameType=Geo, NameType=Geo,Giv, NameType=Geo,Oth, NameType=Geo,Pro, NameType=Geo,Sur, NameType=Giv, NameType=Giv,Sur, NameType=Nat, NameType=Oth, NameType=Oth,Sur, NameType=Pro, NameType=Sur, Negative=Neg, Negative=Pos, NumType=Gen, NumType=Ord, NumType=Sets, NumValue=1, Number=Dual, Number=Plur, Number=Plur,Sing, Number=Sing, Poss=Yes, Style=Arch, Style=Coll, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Part, Voice=Act

ADJ occurs with 761 feature combinations. The most frequent feature combination is Case=Gen|Degree=Pos|Gender=Fem|Negative=Pos|Number=Sing (13492 tokens). Examples: české, evropské, nové, národní, politické, slovenské, státní, světové, celé, velké

Relations

ADJ nodes are attached to their parents using 26 different relations: amod (157524; 87% instances), conj (7760; 4% instances), root (4892; 3% instances), foreign (1691; 1% instances), dep (1489; 1% instances), xcomp (1193; 1% instances), acl (967; 1% instances), nsubj (942; 1% instances), ccomp (940; 1% instances), dobj (760; 0% instances), advmod (725; 0% instances), advcl (692; 0% instances), iobj (472; 0% instances), appos (332; 0% instances), csubj (154; 0% instances), parataxis (77; 0% instances), name (56; 0% instances), cc (50; 0% instances), nsubjpass (43; 0% instances), csubjpass (27; 0% instances), nmod (15; 0% instances), advmod:emph (4; 0% instances), mwe (3; 0% instances), case (1; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)

Parents of ADJ nodes belong to 16 different parts of speech: NOUN (153531; 85% instances), ADJ (6909; 4% instances), VERB (6642; 4% instances), PROPN (6470; 4% instances), ROOT (4892; 3% instances), PRON (1199; 1% instances), NUM (785; 0% instances), ADV (214; 0% instances), DET (87; 0% instances), PART (40; 0% instances), SYM (14; 0% instances), CONJ (11; 0% instances), ADP (9; 0% instances), INTJ (3; 0% instances), SCONJ (3; 0% instances), PUNCT (2; 0% instances)

147581 (82%) ADJ nodes are leaves.

14369 (8%) ADJ nodes have one child.

7406 (4%) ADJ nodes have two children.

11455 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 21.

Children of ADJ nodes are attached using 31 different relations: punct (17741; 21% instances), advmod (11758; 14% instances), cop (9163; 11% instances), conj (7666; 9% instances), nmod (7482; 9% instances), cc (6266; 8% instances), nsubj (5590; 7% instances), dobj (3891; 5% instances), mark (2249; 3% instances), csubj (2052; 2% instances), case (1921; 2% instances), advcl (1461; 2% instances), advmod:emph (1138; 1% instances), dep (633; 1% instances), xcomp (623; 1% instances), expl (538; 1% instances), aux (527; 1% instances), appos (405; 0% instances), foreign (308; 0% instances), nummod (261; 0% instances), amod (253; 0% instances), acl (151; 0% instances), parataxis (123; 0% instances), ccomp (106; 0% instances), det (81; 0% instances), neg (66; 0% instances), name (42; 0% instances), discourse (22; 0% instances), auxpass:reflex (5; 0% instances), det:nummod (2; 0% instances), vocative (2; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: PUNCT (17744; 22% instances), NOUN (15022; 18% instances), VERB (14141; 17% instances), ADV (11949; 14% instances), ADJ (6909; 8% instances), CONJ (6266; 8% instances), PRON (2807; 3% instances), SCONJ (2169; 3% instances), ADP (1921; 2% instances), PROPN (1701; 2% instances), NUM (775; 1% instances), AUX (527; 1% instances), PART (480; 1% instances), DET (106; 0% instances), INTJ (5; 0% instances), SYM (4; 0% instances)


Treebank Statistics (UD_Czech-CAC)

There are 8135 ADJ lemmas (28%), 19830 ADJ types (31%) and 70528 ADJ tokens (14%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent ADJ lemmas: velký, nový, další, jiný, pracovní, celý, každý, různý, základní, socialistický

The 10 most frequent ADJ types: další, pracovní, první, jednotlivých, základní, nové, možno, socialistické, různých, každý

The 10 most frequent ambiguous lemmas: dobrý (ADJ 418, NOUN 1), starý (ADJ 263, NOUN 4), pracující (ADJ 190, NOUN 22), známý (ADJ 159, NOUN 8), vedoucí (NOUN 187, ADJ 90), zkušený (ADJ 18, ADV 1), dospělý (NOUN 22, ADJ 17), taneční (ADJ 17, NOUN 4), potřeba (NOUN 303, ADJ 14), milý (ADJ 12, NOUN 1)

The 10 most frequent ambiguous types: vlastní (ADJ 172, VERB 1), dobré (ADJ 88, NOUN 1), pracujících (ADJ 86, NOUN 15), pracující (ADJ 64, NOUN 3), vedoucí (NOUN 86, ADJ 51), lepší (ADJ 43, VERB 1), staré (ADJ 32, NOUN 1), výchovné (ADJ 37, NOUN 1), stará (ADJ 13, VERB 2), starého (ADJ 14, NOUN 1)

Morphology

The form / lemma ratio of ADJ is 2.437615 (the average of all parts of speech is 2.206260).

The 1st highest number of forms (29) was observed with the lemma “malý”: malou, malá, malé, malého, malém, malému, malí, malý, malých, malým, malými, menší, menších, menšího, menším, menšími, menšímu, nejmenší, nejmenších, nejmenšího, nejmenším, nejmenšími, nemalou, nemalá, nemalé, nemalému, nemalý, nemalým, nemenší.

The 2nd highest number of forms (27) was observed with the lemma “velký”: největší, největších, největšího, největším, největšími, nevelké, nevelkém, nevelký, nevelkým, velcí, velkou, velká, velké, velkého, velkém, velkému, velký, velkých, velkým, velkýma, velkými, větší, větších, většího, větším, většími, většímu.

The 3rd highest number of forms (27) was observed with the lemma “známý”: Nejznámější, Nejznámějších, Nejznámějším, neznámou, neznámy, neznámá, neznámé, neznámého, neznámém, neznámému, neznámý, neznámých, neznámým, znám, známa, známo, známou, známy, známá, známé, známého, známém, známý, známých, známým, známými, známější.

ADJ occurs with 20 features: Number (70235; 100% instances), Gender (70223; 100% instances), Case (69438; 98% instances), Negative (69016; 98% instances), Degree (65943; 93% instances), Animacy (28063; 40% instances), Aspect (2143; 3% instances), Tense (2143; 3% instances), VerbForm (2143; 3% instances), Voice (2143; 3% instances), NumType (863; 1% instances), Variant (798; 1% instances), NameType (768; 1% instances), Gender[psor] (649; 1% instances), Poss (649; 1% instances), Hyph (132; 0% instances), Foreign (116; 0% instances), Style (18; 0% instances), NumValue (10; 0% instances), Abbr (9; 0% instances)

ADJ occurs with 50 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Hyph=Yes, NameType=Com, NameType=Com,Pro, NameType=Geo, NameType=Geo,Giv, NameType=Geo,Sur, NameType=Giv, NameType=Oth, NameType=Pro, NameType=Sur, Negative=Neg, Negative=Pos, NumType=Gen, NumType=Ord, NumType=Sets, NumValue=1, Number=Dual, Number=Plur, Number=Plur,Sing, Number=Sing, Poss=Yes, Style=Coll, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Part, Voice=Act

ADJ occurs with 529 feature combinations. The most frequent feature combination is Case=Gen|Degree=Pos|Gender=Fem|Negative=Pos|Number=Sing (5958 tokens). Examples: socialistické, pracovní, nové, společenské, lidské, národní, české, celé, vědecké, druhé

Relations

ADJ nodes are attached to their parents using 24 different relations: amod (61377; 87% instances), conj (4345; 6% instances), root (1912; 3% instances), dep (504; 1% instances), acl (380; 1% instances), xcomp (343; 0% instances), advcl (295; 0% instances), dobj (273; 0% instances), nsubj (251; 0% instances), advmod (213; 0% instances), appos (175; 0% instances), ccomp (160; 0% instances), iobj (88; 0% instances), csubj (71; 0% instances), foreign (47; 0% instances), parataxis (44; 0% instances), nsubjpass (14; 0% instances), csubjpass (13; 0% instances), cop (9; 0% instances), cc (5; 0% instances), nmod (4; 0% instances), name (3; 0% instances), advmod:emph (1; 0% instances), mwe (1; 0% instances)

Parents of ADJ nodes belong to 15 different parts of speech: NOUN (61026; 87% instances), ADJ (4036; 6% instances), VERB (2152; 3% instances), ROOT (1912; 3% instances), PROPN (689; 1% instances), PRON (300; 0% instances), SYM (169; 0% instances), NUM (124; 0% instances), ADV (63; 0% instances), DET (36; 0% instances), CONJ (12; 0% instances), PART (4; 0% instances), ADP (2; 0% instances), SCONJ (2; 0% instances), INTJ (1; 0% instances)

56773 (80%) ADJ nodes are leaves.

5935 (8%) ADJ nodes have one child.

3387 (5%) ADJ nodes have two children.

4433 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 19.

Children of ADJ nodes are attached using 31 different relations: punct (5563; 17% instances), advmod (4894; 15% instances), conj (4399; 13% instances), cc (3439; 10% instances), cop (3404; 10% instances), nmod (3211; 10% instances), nsubj (1973; 6% instances), dobj (1650; 5% instances), csubj (886; 3% instances), mark (748; 2% instances), case (567; 2% instances), advmod:emph (553; 2% instances), advcl (522; 2% instances), expl (253; 1% instances), dep (246; 1% instances), xcomp (211; 1% instances), aux (183; 1% instances), appos (177; 1% instances), amod (83; 0% instances), parataxis (72; 0% instances), acl (58; 0% instances), det (47; 0% instances), foreign (23; 0% instances), nummod (23; 0% instances), ccomp (20; 0% instances), neg (20; 0% instances), discourse (12; 0% instances), auxpass:reflex (2; 0% instances), name (2; 0% instances), vocative (2; 0% instances), det:nummod (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: NOUN (6476; 19% instances), PUNCT (5562; 17% instances), VERB (5304; 16% instances), ADV (5023; 15% instances), ADJ (4036; 12% instances), CONJ (3379; 10% instances), PRON (1140; 3% instances), SCONJ (771; 2% instances), ADP (563; 2% instances), PART (231; 1% instances), PROPN (224; 1% instances), AUX (185; 1% instances), NUM (168; 1% instances), SYM (126; 0% instances), DET (55; 0% instances), INTJ (1; 0% instances)


Treebank Statistics (UD_Czech-CLTT)

There are 643 ADJ lemmas (24%), 1413 ADJ types (30%) and 6539 ADJ tokens (19%). Out of 15 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent ADJ lemmas: účetní, konsolidovaný, právní, jiný, uvedený, finanční, zvláštní, hmotný, dlouhodobý, povinný

The 10 most frequent ADJ types: účetní, účetních, účetního, konsolidované, konsolidující, finanční, účetním, povinny, výroční, právní

The 10 most frequent ambiguous lemmas: účetní (ADJ 1467, NOUN 22), provozní (ADJ 17, NOUN 3), do (ADP 191, ADJ 6), od (ADP 73, ADJ 4), pod (ADP 31, ADJ 1)

The 10 most frequent ambiguous types: účetní (ADJ 873, NOUN 21), vlastní (ADJ 18, VERB 1), delší (ADJ 15, VERB 4), provozní (ADJ 13, NOUN 2), do (ADP 186, ADJ 6), něm (ADJ 4, ADV 3), od (ADP 55, ADJ 4), provozních (ADJ 2, NOUN 1), pod (ADP 31, ADJ 1), ustanovení (NOUN 63, ADJ 1)

Morphology

The form / lemma ratio of ADJ is 2.197512 (the average of all parts of speech is 1.764161).

The 1st highest number of forms (13) was observed with the lemma “hmotný”: hmotné, hmotného, hmotném, hmotný, hmotným, nehmotné, nehmotného, nehmotném, nehmotnému, nehmotný, nehmotných, nehmotným, nehmotnými.

The 2nd highest number of forms (12) was observed with the lemma “uvedený”: neuvedená, neuvedené, uvedenou, uvedená, uvedené, uvedeného, uvedeném, uvedenému, uvedený, uvedených, uvedeným, uvedenými.

The 3rd highest number of forms (10) was observed with the lemma “jiný”: jinou, jiná, jiné, jiného, jiném, jinému, jiný, jiných, jiným, jinými.

ADJ occurs with 17 features: Gender (6524; 100% instances), Number (6524; 100% instances), Negative (6495; 99% instances), Case (6421; 98% instances), Degree (6093; 93% instances), Animacy (2622; 40% instances), Aspect (288; 4% instances), Tense (288; 4% instances), VerbForm (288; 4% instances), Voice (288; 4% instances), Variant (103; 2% instances), NumType (43; 1% instances), Hyph (11; 0% instances), Abbr (4; 0% instances), Style (4; 0% instances), Gender[psor] (1; 0% instances), Poss (1; 0% instances)

ADJ occurs with 32 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Masc, Hyph=Yes, Negative=Neg, Negative=Pos, NumType=Ord, Number=Plur, Number=Plur,Sing, Number=Sing, Poss=Yes, Style=Coll, Tense=Pres, Variant=Short, VerbForm=Part, Voice=Act

ADJ occurs with 162 feature combinations. The most frequent feature combination is Case=Gen|Degree=Pos|Gender=Fem|Negative=Pos|Number=Sing (611 tokens). Examples: účetní, konsolidované, reálné, finanční, pořizovací, účtové, české, Evropské, stanovené, výroční

Relations

ADJ nodes are attached to their parents using 17 different relations: amod (6080; 93% instances), conj (233; 4% instances), root (82; 1% instances), acl (41; 1% instances), advcl (21; 0% instances), xcomp (21; 0% instances), advmod (13; 0% instances), dobj (12; 0% instances), case (9; 0% instances), dep (9; 0% instances), ccomp (5; 0% instances), appos (4; 0% instances), csubjpass (2; 0% instances), nmod (2; 0% instances), nsubj (2; 0% instances), parataxis (2; 0% instances), iobj (1; 0% instances)

Parents of ADJ nodes belong to 8 different parts of speech: NOUN (6150; 94% instances), ADJ (225; 3% instances), ROOT (82; 1% instances), VERB (75; 1% instances), ADV (3; 0% instances), X (2; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)

5438 (83%) ADJ nodes are leaves.

625 (10%) ADJ nodes have one child.

233 (4%) ADJ nodes have two children.

243 (4%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 13.

Children of ADJ nodes are attached using 22 different relations: nmod (486; 21% instances), punct (314; 14% instances), dobj (250; 11% instances), conj (229; 10% instances), cc (196; 9% instances), cop (195; 9% instances), advmod (159; 7% instances), nsubj (154; 7% instances), xcomp (78; 3% instances), advcl (60; 3% instances), mark (43; 2% instances), expl (28; 1% instances), case (21; 1% instances), csubj (13; 1% instances), dep (11; 0% instances), advmod:emph (7; 0% instances), amod (6; 0% instances), appos (6; 0% instances), aux (4; 0% instances), parataxis (3; 0% instances), det (1; 0% instances), neg (1; 0% instances)

Children of ADJ nodes belong to 14 different parts of speech: NOUN (889; 39% instances), PUNCT (314; 14% instances), VERB (311; 14% instances), ADJ (225; 10% instances), CONJ (191; 8% instances), ADV (98; 4% instances), X (86; 4% instances), PRON (71; 3% instances), SCONJ (47; 2% instances), ADP (21; 1% instances), NUM (6; 0% instances), AUX (4; 0% instances), DET (1; 0% instances), PART (1; 0% instances)


ADJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]