Hyph
: hyphenated compound or part of it
Boolean feature. Is this the first part of a hyphenated compound?
Compound adjectives with hyphens, such as česko-slovenský “Czech-Slovak” get split during tokenization. The last part, slovenský, is an independent adjective with full inflection paradigm. However, the first part, česko, is a form that does not occur elsewhere than in compounds (the independent form would be český).
Yes: it is part of hyphenated compound
Examples
- česko-slovenský “Czecho-Slovak”
Treebank Statistics (UD_Czech)
This feature is language-specific.
It occurs with 1 different values: Yes
.
398 tokens (0%) have a non-empty value of Hyph
.
146 types (0%) occur at least once with a non-empty value of Hyph
.
136 lemmas (0%) occur at least once with a non-empty value of Hyph
.
The feature is used with 1 part-of-speech tags: cs-pos/ADJ (398; 0% instances).
ADJ
398 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Hyph
.
The most frequent other feature values with which ADJ
and Hyph
co-occurred: Degree=EMPTY (398; 100%), Animacy=EMPTY (398; 100%), Case=EMPTY (398; 100%), Number=EMPTY (398; 100%), Gender=EMPTY (398; 100%), Negative=Pos (397; 100%).
ADJ
tokens may have the following values of Hyph
:
Yes
(398; 100% of non-emptyHyph
): česko, izraelsko, německo, rusko, srbsko, americko, matematicko, polsko, maďarsko, anglo
Hyph
seems to be lexical feature of ADJ
. 100% lemmas (136) occur only with one value of Hyph
.
Treebank Statistics (UD_Czech-CAC)
This feature is language-specific.
It occurs with 1 different values: Yes
.
132 tokens (0%) have a non-empty value of Hyph
.
65 types (0%) occur at least once with a non-empty value of Hyph
.
60 lemmas (0%) occur at least once with a non-empty value of Hyph
.
The feature is used with 1 part-of-speech tags: cs-pos/ADJ (132; 0% instances).
ADJ
132 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Hyph
.
The most frequent other feature values with which ADJ
and Hyph
co-occurred: Number=EMPTY (132; 100%), Gender=EMPTY (132; 100%), Negative=Pos (132; 100%), Case=EMPTY (132; 100%), Degree=EMPTY (132; 100%), Animacy=EMPTY (132; 100%).
ADJ
tokens may have the following values of Hyph
:
Yes
(132; 100% of non-emptyHyph
): elektro, marxisticko, Karpatsko, teoreticko, rakousko, československo, dodavatelsko, Koh, francouzsko, hospodářsko
Hyph
seems to be lexical feature of ADJ
. 100% lemmas (60) occur only with one value of Hyph
.
Relations with Agreement in Hyph
The 10 most frequent relations where parent and child node agree in Hyph
:
ADJ –[amod]–> ADJ (3; 100%).
Treebank Statistics (UD_Czech-CLTT)
This feature is language-specific.
It occurs with 1 different values: Yes
.
11 tokens (0%) have a non-empty value of Hyph
.
3 types (0%) occur at least once with a non-empty value of Hyph
.
3 lemmas (0%) occur at least once with a non-empty value of Hyph
.
The feature is used with 1 part-of-speech tags: cs-pos/ADJ (11; 0% instances).
ADJ
11 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Hyph
.
The most frequent other feature values with which ADJ
and Hyph
co-occurred: Negative=Pos (11; 100%), Case=EMPTY (11; 100%), Gender=EMPTY (11; 100%), Number=EMPTY (11; 100%), Animacy=EMPTY (11; 100%), Degree=EMPTY (11; 100%).
ADJ
tokens may have the following values of Hyph
:
Yes
(11; 100% of non-emptyHyph
): do, od, pod