
ADP: adposition

Definition

Czech has prepositions only; it has no postpositions or circumpositions. A preposition precedes its complement noun phrase (headed by a noun or pronoun) and forms a single structure with it, expressing the complement's grammatical and semantic relation to another unit within the clause.

Some prepositions take the form of fixed multiword expressions, e.g. na rozdíl od “in contrast to”, v souvislosti s “in connection with”. The component words are still tagged according to their basic use (na is ADP, rozdíl is NOUN, etc.); their status as a multiword expression is accounted for in the syntactic annotation.
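The split between word-level tags and syntactic grouping can be sketched in a few lines of Python. This is only an illustration: the `Token` tuple, the complement noun konkurence, and the choice of the first word as head of the expression are assumptions (the head choice follows the usual convention for the `mwe` relation that appears in the statistics on this page), not an excerpt from any treebank.

```python
from collections import namedtuple

# Hypothetical minimal token record: index, word form, UPOS tag, head index, relation.
Token = namedtuple("Token", "id form upos head deprel")

# Assumed analysis of "na rozdíl od konkurence" ("in contrast to the competition").
# Each word keeps its basic tag; the mwe links carry the multiword status.
tokens = [
    Token(1, "na", "ADP", 4, "case"),
    Token(2, "rozdíl", "NOUN", 1, "mwe"),
    Token(3, "od", "ADP", 1, "mwe"),
    Token(4, "konkurence", "NOUN", 0, "root"),
]

def multiword_prepositions(sentence):
    """Return each ADP head together with its mwe children, in surface order."""
    by_id = {tok.id: tok for tok in sentence}
    groups = {}
    for tok in sentence:
        if tok.deprel == "mwe":
            groups.setdefault(tok.head, []).append(tok)
    result = []
    for head_id, children in sorted(groups.items()):
        head = by_id[head_id]
        if head.upos == "ADP":
            parts = sorted([head] + children, key=lambda t: t.id)
            result.append([(t.form, t.upos) for t in parts])
    return result

print(multiword_prepositions(tokens))
```

The tags stay ADP, NOUN, ADP, and only the dependency links identify the three words as one preposition.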

Examples


Treebank Statistics (UD_Czech)

There are 114 ADP lemmas (0%), 132 ADP types (0%) and 145943 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 7 in number of lemmas, 9 in number of types and 5 in number of tokens.
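Counts of this kind could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated the page: the function name `pos_counts` and the two-sentence sample are hypothetical, and treating types as case-sensitive word forms is an assumption.

```python
def pos_counts(conllu_text, upos="ADP"):
    """Count distinct lemmas, distinct word forms (types) and tokens for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: forms are compared as written
            tokens += 1
    return len(lemmas), len(types), tokens

# Hypothetical sample: "v domě" and "ve škole" (both forms share the lemma "v").
SAMPLE = "\n".join([
    "1\tv\tv\tADP\t_\tAdpType=Prep|Case=Loc\t2\tcase\t_\t_",
    "2\tdomě\tdům\tNOUN\t_\tCase=Loc\t0\troot\t_\t_",
    "",
    "1\tve\tv\tADP\t_\tAdpType=Voc|Case=Loc\t2\tcase\t_\t_",
    "2\tškole\tškola\tNOUN\t_\tCase=Loc\t0\troot\t_\t_",
])

print(pos_counts(SAMPLE))  # (lemmas, types, tokens) for ADP
```

In the sample, the two forms v and ve count as two types but one lemma, which is the same distinction that separates the lemma and type lists on this page.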

The 10 most frequent ADP lemmas: v, na, z, s, o, do, k, pro, za, po

The 10 most frequent ADP types: v, na, o, z, s, do, ve, k, pro, za

The 10 most frequent ambiguous lemmas: v (ADP 36265, NOUN 9, ADJ 1), z (ADP 11656, NOUN 19), s (ADP 11169, NOUN 90, PART 21, X 3, ADJ 1), o (ADP 10328, PUNCT 100, NOUN 10, ADJ 3, INTJ 2), do (ADP 7414, PROPN 11, VERB 3, NOUN 2), k (ADP 7084, NOUN 15), po (ADP 3847, NOUN 5), podle (ADP 3564, ADV 1), u (ADP 2378, NOUN 4), bez (ADP 1147, NOUN 1)
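A lemma is ambiguous here when it occurs with more than one tag. One way to compute such a list is sketched below; the function name, the toy input, and the ranking by ADP count are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(pairs, upos="ADP"):
    """pairs: iterable of (lemma, tag). Return lemmas seen with `upos` and at least one other tag."""
    counts = defaultdict(Counter)
    for lemma, tag in pairs:
        counts[lemma][tag] += 1
    ambiguous = {l: c for l, c in counts.items() if upos in c and len(c) > 1}
    # Assumed ranking: by how often the lemma is tagged `upos`, most frequent first.
    ranked = sorted(ambiguous, key=lambda l: -ambiguous[l][upos])
    return [(l, ambiguous[l].most_common()) for l in ranked]

# Hypothetical toy data, not treebank counts.
pairs = (
    [("v", "ADP")] * 3 + [("v", "NOUN")]
    + [("místo", "NOUN")] * 2 + [("místo", "ADP")]
    + [("na", "ADP")]
)

for lemma, tags in ambiguous_lemmas(pairs):
    print(lemma, tags)
```

The lemma na is left out of the result because it only ever appears as ADP in the toy data; the same reasoning would explain why a frequent preposition can be absent from an ambiguity list.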

The 10 most frequent ambiguous types: v (ADP 26490, NOUN 4, ADJ 3), o (ADP 9669, ADJ 110, PUNCT 99, NOUN 4), z (ADP 8838, NOUN 18), s (ADP 8728, NOUN 381, PART 21, ADJ 9), do (ADP 6970, PROPN 11, NOUN 2, VERB 2), k (ADP 5508, NOUN 10, ADJ 1), po (ADP 3165, NOUN 6), podle (ADP 2143, ADV 1), při (ADP 2004, NOUN 1), u (ADP 2039, NOUN 3)

Morphology

The form / lemma ratio of ADP is 1.157895 (the average of all parts of speech is 2.195950).
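The ratio appears to be the number of ADP types divided by the number of ADP lemmas. The short check below uses the counts reported in each treebank section of this page; the helper name `form_lemma_ratio` is hypothetical.

```python
# (types, lemmas) for ADP as reported in each treebank section of this page.
REPORTED = {
    "UD_Czech": (132, 114),
    "UD_Czech-CAC": (79, 71),
    "UD_Czech-CLTT": (39, 32),
}

def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma, rounded to six decimals."""
    return round(n_types / n_lemmas, 6)

for name, (n_types, n_lemmas) in REPORTED.items():
    print(name, form_lemma_ratio(n_types, n_lemmas))
```

A ratio close to 1 reflects that prepositions are uninflected; the few extra forms come from vocalized variants such as k, ke, ku.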

The 1st highest number of forms (3) was observed with the lemma “a”: a, ala, à.

The 2nd highest number of forms (3) was observed with the lemma “k”: k, ke, ku.

The 3rd highest number of forms (3) was observed with the lemma “nad”: n, nad, nade.

ADP occurs with 6 features: cs-feat/AdpType (145943; 100% instances), cs-feat/Case (145304; 100% instances), cs-feat/Foreign (592; 0% instances), cs-feat/NameType (71; 0% instances), cs-feat/Abbr (23; 0% instances), cs-feat/Aspect (1; 0% instances)

ADP occurs with 18 feature-value pairs: Abbr=Yes, AdpType=Comprep, AdpType=Prep, AdpType=Voc, Aspect=Imp, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Foreign, NameType=Com, NameType=Geo, NameType=Geo,Giv,Sur, NameType=Oth, NameType=Pro, NameType=Sur

ADP occurs with 31 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (50236 tokens). Examples: v, na, o, po, při, za
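The three feature counts (features, feature-value pairs, feature combinations) can be derived from the FEATS column of a CoNLL-U file. The sketch below assumes that column has already been extracted as a list of strings; the function name and the sample values are hypothetical.

```python
from collections import Counter

def feature_stats(feats_column):
    """Count features, feature=value pairs and whole combinations in FEATS strings."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        if feats == "_":
            continue  # assumption: tokens without features are not counted as a combination
        combos[feats] += 1
        for item in feats.split("|"):
            name, _value = item.split("=", 1)
            features[name] += 1
            # A multi-valued item such as NameType=Geo,Giv,Sur stays one pair.
            pairs[item] += 1
    return features, pairs, combos

# Hypothetical FEATS values for four ADP tokens.
sample = [
    "AdpType=Prep|Case=Loc",
    "AdpType=Prep|Case=Loc",
    "AdpType=Voc|Case=Loc",
    "AdpType=Prep|Case=Acc",
]

features, pairs, combos = feature_stats(sample)
print(len(features), len(pairs), len(combos))
print(combos.most_common(1))
```

In this sample there are two features, four feature-value pairs and three combinations, with AdpType=Prep|Case=Loc the most frequent combination.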

Relations

ADP nodes are attached to their parents using 13 different relations: cs-dep/case (143856; 99% instances), cs-dep/mwe (1479; 1% instances), cs-dep/foreign (438; 0% instances), cs-dep/advmod (51; 0% instances), cs-dep/nmod (41; 0% instances), cs-dep/mark (34; 0% instances), cs-dep/conj (16; 0% instances), cs-dep/dep (10; 0% instances), cs-dep/root (10; 0% instances), cs-dep/name (3; 0% instances), cs-dep/dobj (2; 0% instances), cs-dep/nsubj (2; 0% instances), cs-dep/appos (1; 0% instances)

Parents of ADP nodes belong to 14 different parts of speech: NOUN (112932; 77% instances), PROPN (16535; 11% instances), PRON (10551; 7% instances), NUM (2093; 1% instances), ADJ (1921; 1% instances), ADP (1107; 1% instances), ADV (491; 0% instances), VERB (95; 0% instances), DET (91; 0% instances), SYM (85; 0% instances), PART (30; 0% instances), ROOT (10; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
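The relation and parent part-of-speech tallies can be computed by following each ADP token's head pointer. The following is a sketch under stated assumptions: the `Token` tuple, both function names and the sample sentence are hypothetical, and percentages are rounded to whole numbers as they appear to be on this page.

```python
from collections import Counter, namedtuple

Token = namedtuple("Token", "id form upos head deprel")

def attachment_stats(sentences, upos="ADP"):
    """Tally the relation and the parent's UPOS for every token tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sentence in sentences:
        by_id = {tok.id: tok for tok in sentence}
        for tok in sentence:
            if tok.upos != upos:
                continue
            relations[tok.deprel] += 1
            parent = by_id.get(tok.head)  # head 0 has no token: the artificial root
            parent_pos[parent.upos if parent else "ROOT"] += 1
    return relations, parent_pos

def with_percent(counter):
    """Return (key, count, rounded percentage) rows, most frequent first."""
    total = sum(counter.values())
    return [(key, n, round(100 * n / total)) for key, n in counter.most_common()]

# Hypothetical analysis of "na rozdíl od konkurence".
sentence = [
    Token(1, "na", "ADP", 4, "case"),
    Token(2, "rozdíl", "NOUN", 1, "mwe"),
    Token(3, "od", "ADP", 1, "mwe"),
    Token(4, "konkurence", "NOUN", 0, "root"),
]

relations, parent_pos = attachment_stats([sentence])
print(with_percent(relations))
print(with_percent(parent_pos))
```

The ADP parents in the statistics would arise the same way as in this sample: the later words of a multiword preposition attach to its first word.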

143269 (98%) ADP nodes are leaves.

1837 (1%) ADP nodes have one child.

826 (1%) ADP nodes have two children.

11 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 5.
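The child-degree figures partition the ADP tokens, so the four buckets should add up to the token total. The sketch below first checks that against the numbers reported on this page and then shows one way to compute the buckets; the function name and the sample triples are hypothetical.

```python
from collections import Counter

# Bucket counts (leaves, one child, two children, three or more) and ADP token
# totals as reported in the treebank sections of this page.
REPORTED = {
    "UD_Czech": ([143269, 1837, 826, 11], 145943),
    "UD_Czech-CAC": ([46922, 1077, 390, 2], 48391),
    "UD_Czech-CLTT": ([3677, 203, 62], 3942),
}
for name, (buckets, total) in REPORTED.items():
    print(name, sum(buckets) == total)

def child_degree_histogram(sentence, upos="ADP"):
    """sentence: list of (id, upos, head) triples. Bucket `upos` nodes by number of children."""
    n_children = Counter(head for _, _, head in sentence)
    histogram = Counter()
    for tok_id, tag, _ in sentence:
        if tag == upos:
            degree = n_children[tok_id]
            histogram["3+" if degree >= 3 else str(degree)] += 1
    return histogram

# Hypothetical analysis of "na rozdíl od konkurence": na heads rozdíl and od.
sample = [(1, "ADP", 4), (2, "NOUN", 1), (3, "ADP", 1), (4, "NOUN", 0)]
print(child_degree_histogram(sample))
```

In the sample, na has two children and od is a leaf, which is the typical shape of a three-word multiword preposition.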

Children of ADP nodes are attached using 12 different relations: cs-dep/mwe (3420; 97% instances), cs-dep/punct (40; 1% instances), cs-dep/foreign (17; 0% instances), cs-dep/nmod (12; 0% instances), cs-dep/cc (11; 0% instances), cs-dep/conj (11; 0% instances), cs-dep/dep (10; 0% instances), cs-dep/acl (2; 0% instances), cs-dep/advmod (2; 0% instances), cs-dep/dobj (2; 0% instances), cs-dep/advmod:emph (1; 0% instances), cs-dep/case (1; 0% instances)

Children of ADP nodes belong to 10 different parts of speech: NOUN (2339; 66% instances), ADP (1107; 31% instances), PUNCT (40; 1% instances), PROPN (12; 0% instances), CONJ (11; 0% instances), ADJ (9; 0% instances), ADV (4; 0% instances), PRON (3; 0% instances), VERB (3; 0% instances), NUM (1; 0% instances)


Treebank Statistics (UD_Czech-CAC)

There are 71 ADP lemmas (0%), 79 ADP types (0%) and 48391 ADP tokens (10%). Out of 16 observed tags, the rank of ADP is: 7 in number of lemmas, 9 in number of types and 5 in number of tokens.

The 10 most frequent ADP lemmas: v, na, s, z, k, o, pro, do, za, při

The 10 most frequent ADP types: v, na, s, z, o, k, pro, do, ve, za

The 10 most frequent ambiguous lemmas: v (ADP 12769, NOUN 3), s (ADP 3842, PART 13), z (ADP 3690, NOUN 2), bez (ADP 364, NOUN 1), kolem (ADP 142, ADV 3), vedle (ADP 100, ADV 3), místo (NOUN 366, ADP 50, SCONJ 2), de (ADP 29, NOUN 1), a (CONJ 15539, ADP 4), vstříc (ADV 4, ADP 4)

The 10 most frequent ambiguous types: v (ADP 9248, NOUN 3), na (ADP 6587, CONJ 1), s (ADP 3122, PART 13), z (ADP 2831, NOUN 2), se (PRON 7715, ADP 601), kolem (ADP 132, ADV 3, NOUN 1), pomocí (ADP 97, NOUN 18), vzhledem (ADP 75, NOUN 4), vedle (ADP 71, ADV 3), během (ADP 50, NOUN 1)

Morphology

The form / lemma ratio of ADP is 1.112676 (the average of all parts of speech is 2.206260).

The 1st highest number of forms (3) was observed with the lemma “k”: k, ke, ku.

The 2nd highest number of forms (2) was observed with the lemma “bez”: bez, beze.

The 3rd highest number of forms (2) was observed with the lemma “díky”: dík, díky.

ADP occurs with 4 features: cs-feat/AdpType (48391; 100% instances), cs-feat/Case (48249; 100% instances), cs-feat/Foreign (64; 0% instances), cs-feat/NameType (4; 0% instances)

ADP occurs with 13 feature-value pairs: AdpType=Comprep, AdpType=Prep, AdpType=Voc, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Foreign, NameType=Com, NameType=Oth, NameType=Pro

ADP occurs with 21 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (17552 tokens). Examples: v, na, o, při, po

Relations

ADP nodes are attached to their parents using 11 different relations: cs-dep/case (47541; 98% instances), cs-dep/mwe (766; 2% instances), cs-dep/nmod (29; 0% instances), cs-dep/foreign (25; 0% instances), cs-dep/mark (9; 0% instances), cs-dep/advmod (8; 0% instances), cs-dep/advmod:emph (4; 0% instances), cs-dep/conj (4; 0% instances), cs-dep/dobj (3; 0% instances), cs-dep/auxpass:reflex (1; 0% instances), cs-dep/dep (1; 0% instances)

Parents of ADP nodes belong to 13 different parts of speech: NOUN (39935; 83% instances), PRON (3622; 7% instances), PROPN (2265; 5% instances), SYM (608; 1% instances), NUM (587; 1% instances), ADJ (563; 1% instances), ADP (500; 1% instances), ADV (133; 0% instances), PART (130; 0% instances), VERB (21; 0% instances), DET (20; 0% instances), SCONJ (6; 0% instances), CONJ (1; 0% instances)

46922 (97%) ADP nodes are leaves.

1077 (2%) ADP nodes have one child.

390 (1%) ADP nodes have two children.

2 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 4.

Children of ADP nodes are attached using 10 different relations: cs-dep/mwe (1840; 99% instances), cs-dep/nmod (11; 1% instances), cs-dep/cc (3; 0% instances), cs-dep/conj (3; 0% instances), cs-dep/punct (2; 0% instances), cs-dep/advmod (1; 0% instances), cs-dep/amod (1; 0% instances), cs-dep/case (1; 0% instances), cs-dep/dobj (1; 0% instances), cs-dep/foreign (1; 0% instances)

Children of ADP nodes belong to 9 different parts of speech: NOUN (1348; 72% instances), ADP (500; 27% instances), SYM (4; 0% instances), CONJ (3; 0% instances), ADJ (2; 0% instances), PRON (2; 0% instances), PROPN (2; 0% instances), PUNCT (2; 0% instances), ADV (1; 0% instances)


Treebank Statistics (UD_Czech-CLTT)

There are 32 ADP lemmas (1%), 39 ADP types (1%) and 3942 ADP tokens (11%). Out of 15 observed tags, the rank of ADP is: 7 in number of lemmas, 9 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: v, k, podle, na, o, s, z, do, za, pro

The 10 most frequent ADP types: v, podle, na, o, k, s, do, za, z, pro

The 10 most frequent ambiguous lemmas: do (ADP 191, ADJ 6), od (ADP 73, ADJ 4), včetně (ADP 47, ADV 7), pod (ADP 31, ADJ 1), místo (NOUN 2, ADP 1), uvnitř (ADV 3, ADP 1)

The 10 most frequent ambiguous types: do (ADP 186, ADJ 6), od (ADP 55, ADJ 4), včetně (ADP 47, ADV 7), se (PRON 467, ADP 34), pod (ADP 31, ADJ 1), prostřednictvím (ADP 17, NOUN 1), místo (ADP 1, NOUN 1), uvnitř (ADV 3, ADP 1)

Morphology

The form / lemma ratio of ADP is 1.218750 (the average of all parts of speech is 1.764161).

The 1st highest number of forms (2) was observed with the lemma “bez”: bez, beze.

The 2nd highest number of forms (2) was observed with the lemma “k”: k, ke.

The 3rd highest number of forms (2) was observed with the lemma “od”: od, ode.

ADP occurs with 2 features: cs-feat/AdpType (3942; 100% instances), cs-feat/Case (3938; 100% instances)

ADP occurs with 8 feature-value pairs: AdpType=Comprep, AdpType=Prep, AdpType=Voc, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc

ADP occurs with 11 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (1316 tokens). Examples: v, o, na, při, po

Relations

ADP nodes are attached to their parents using 6 different relations: cs-dep/case (3862; 98% instances), cs-dep/mwe (70; 2% instances), cs-dep/auxpass:reflex (5; 0% instances), cs-dep/nmod (3; 0% instances), cs-dep/conj (1; 0% instances), cs-dep/mark (1; 0% instances)

Parents of ADP nodes belong to 10 different parts of speech: NOUN (3399; 86% instances), X (235; 6% instances), PRON (200; 5% instances), ADP (66; 2% instances), ADJ (21; 1% instances), ADV (11; 0% instances), VERB (6; 0% instances), NUM (2; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)

3677 (93%) ADP nodes are leaves.

203 (5%) ADP nodes have one child.

62 (2%) ADP nodes have two children.

The highest child degree of an ADP node is 2.

Children of ADP nodes are attached using 2 different relations: cs-dep/mwe (324; 99% instances), cs-dep/nmod (3; 1% instances)

Children of ADP nodes belong to 3 different parts of speech: NOUN (260; 80% instances), ADP (66; 20% instances), X (1; 0% instances)


ADP in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]