home fi/feat edit page issue tracker

Clitic: clitic

(Please note: this part of the documentation is not yet completed.)

Language-specific feature identifying clitics attached to the word.

Finnish has a number of particle clitics used to express questions, politeness, or focus. UD Finnish captures the presence of these clitics using the Clitic feature, which takes one or more of the following values, with multiple values expressing combinations, for example Clitic=Ko,S for -kos (-ko + -s) as in voikos.

Kin

Expresses focus. Can often be translated into English as also. Forms contrasting pair with -kaan.

Examples

Kaan

Expresses focus in negative contexts. Realized as -kaan or -kään. Forms contrasting pair with -kin.

Examples

Ko

Expresses a question. Realized as -ko or -kö.

Examples

Han

Realized as -han or -hän.

Examples

Pa

Realized as -pa or -pä.

Examples

S

TODO

Examples

Ka

Realized as -ka or -kä. Attached to the negative verb ei, serves also as a conjunction.

Examples

References


Treebank Statistics (UD_Finnish)

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 4 combinations have been observed: Han|Ko, Han|Pa, Ko|S, Pa|S.

1661 tokens (1%) have a non-empty value of Clitic. 977 types (2%) occur at least once with a non-empty value of Clitic. 531 lemmas (2%) occur at least once with a non-empty value of Clitic. The feature is used with 11 part-of-speech tags: fi-pos/VERB (778; 0% instances), fi-pos/ADV (242; 0% instances), fi-pos/NOUN (221; 0% instances), fi-pos/PRON (191; 0% instances), fi-pos/AUX (106; 0% instances), fi-pos/ADJ (69; 0% instances), fi-pos/PROPN (22; 0% instances), fi-pos/SCONJ (12; 0% instances), fi-pos/ADP (10; 0% instances), fi-pos/NUM (9; 0% instances), fi-pos/CONJ (1; 0% instances).

VERB

778 fi-pos/VERB tokens (2% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: InfForm=EMPTY (756; 97%), PartForm=EMPTY (748; 96%), Degree=EMPTY (748; 96%), Case=EMPTY (739; 95%), VerbForm=Fin (726; 93%), Voice=Act (724; 93%), Number=Sing (613; 79%), Person=3 (508; 65%), Mood=Ind (411; 53%).

VERB tokens may have the following values of Clitic:

Paradigm ollaHanHan,KoKaanKinKoKo,SPaPa,S
Case=Nom|Degree=Pos|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actollutkaan
Connegative=Yes|Mood=Cnd|VerbForm=Finolisikaan
Connegative=Yes|Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Finolekkaan
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Finolekaanolekin
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=ActOlisihanOlisikohanolisikinolisikoOlisipa
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=Actolinko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actolenkinolenko
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=Actoletkooletpa
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actonkionks
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=ActolihanolikaanolikinolikoOlikosolipaolipas
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActOnhanonkaanonkinonkoonkosonpaOnpas
Mood=Ind|Number=Plur|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Actolihan
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Actolivathanolivatkinolivatkoolivatpa
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actovathanovatkaanovatkin, olemmekinOvatko
Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=PassOllaas

ADV

242 fi-pos/ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

ADV tokens may have the following values of Clitic:

Paradigm niinHanKaanKinPa
niinhänniinkäänniinkinNiinpä

NOUN

221 fi-pos/NOUN tokens (0% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (153; 69%).

NOUN tokens may have the following values of Clitic:

Paradigm miesHanKaanKin
Case=Gen|Number=Plurmiestenkin
Case=Nom|Number=SingmieshänMieskin
Case=Nom|Number=Sing|Number[psor]=Sing|Person[psor]=1miehenikin
Case=Nom|Number=Sing|Person[psor]=3miehensäkään

Clitic seems to be lexical feature of NOUN. 95% lemmas (184) occur only with one value of Clitic.

PRON

191 fi-pos/PRON tokens (2% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Person=EMPTY (151; 79%), Number=Sing (146; 76%).

PRON tokens may have the following values of Clitic:

Paradigm seHanKaanKinPaS
Case=Ade|Number=Singsilläkin
Case=Ade|Number=Plurniilläkin
Case=Ade|Number=Plur|Style=Collniilki
Case=Ela|Number=Singsiitähänsiitäkinsiitäs
Case=Ela|Number=Plurniistäkin
Case=Gen|Number=SingSenhänsenkäänsenkin
Case=Gen|Number=Plurniidenkin
Case=Ill|Number=Singsiihenkin
Case=Ine|Number=SingSiinäpä
Case=Nom|Number=Singsehänsekäänsekin
Case=Nom|Number=Plurnekin
Case=Par|Number=SingSitähänsitäkäänsitäkin

AUX

106 fi-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Clitic.

The most frequent other feature values with which AUX and Clitic co-occurred: VerbForm=Fin (102; 96%), Voice=Act (98; 92%), Number=Sing (87; 82%), Mood=Ind (76; 72%), Person=3 (72; 68%), Tense=Pres (63; 59%).

AUX tokens may have the following values of Clitic:

Paradigm ollaHanKaanKinKoKo,SPa,S
Case=Nom|Degree=Pos|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actollutkaan
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Finolekaan
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=Actolisikinolisiko
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=Actolinkinolinko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=ActolenkaanOlenkinolenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actoot
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actonks
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Actolikaanolikinoliko
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActOnhanonkinonkoonpas
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actootteko
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Actolivatkin
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actovatkin

ADJ

69 fi-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (52; 75%), Degree=Pos (50; 72%).

ADJ tokens may have the following values of Clitic:

Paradigm hyväKaanKin
Degree=Pos|Number=Singhyvääkin
Degree=Cmp|Number=Singparempaakaanparempaakin
Degree=Cmp|Number=Plurparempiakaan

PROPN

22 fi-pos/PROPN tokens (0% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (21; 95%).

PROPN tokens may have the following values of Clitic:

Paradigm SuomiKaanKin
Case=GenSuomenkaan
Case=IneSuomessakin
Case=NomSuomikin

Clitic seems to be lexical feature of PROPN. 94% lemmas (17) occur only with one value of Clitic.

SCONJ

12 fi-pos/SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Clitic.

SCONJ tokens may have the following values of Clitic:

Paradigm josKinKo
joskinjosko

ADP

10 fi-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADP and Clitic co-occurred: AdpType=Post (6; 60%).

ADP tokens may have the following values of Clitic:

Paradigm jälkeenKaanKin
jälkeenkäänjälkeenkin

NUM

9 fi-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: Number=Sing (9; 100%), NumType=Card (9; 100%).

NUM tokens may have the following values of Clitic:

Paradigm yksiKaanKin
Case=Ablyhdeltäkään
Case=Essyhtenäkin
Case=Nomyksikin
Case=Paryhtäkään

CONJ

1 fi-pos/CONJ tokens (0% of all CONJ tokens) have a non-empty value of Clitic.

CONJ tokens may have the following values of Clitic:


Treebank Statistics (UD_Finnish-FTB)

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 9 combinations have been observed: Han|Ka, Han|Kin, Han|Ko, Han|Pa, Ka|S, Kaan|Ko, Kin|Ko, Ko|S, Pa|S.

2949 tokens (2%) have a non-empty value of Clitic. 1730 types (4%) occur at least once with a non-empty value of Clitic. 762 lemmas (4%) occur at least once with a non-empty value of Clitic. The feature is used with 10 part-of-speech tags: fi-pos/VERB (1673; 1% instances), fi-pos/NOUN (354; 0% instances), fi-pos/PRON (309; 0% instances), fi-pos/ADV (220; 0% instances), fi-pos/PART (119; 0% instances), fi-pos/ADJ (110; 0% instances), fi-pos/DET (78; 0% instances), fi-pos/PROPN (57; 0% instances), fi-pos/NUM (21; 0% instances), fi-pos/ADP (8; 0% instances).

VERB

1673 fi-pos/VERB tokens (4% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: PartForm=EMPTY (1629; 97%), InfForm=EMPTY (1616; 97%), Voice=Act (1585; 95%), Case=EMPTY (1572; 94%), VerbForm=Fin (1571; 94%), Number=Sing (1313; 78%), Mood=Ind (986; 59%), Person=3 (947; 57%).

VERB tokens may have the following values of Clitic:

Paradigm ollaHanHan,KoKaanKinKoKo,SPaPa,SS
Case=Gen|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actolleenkaan
Case=Gen|Number=Sing|PartForm=Pres|VerbForm=Part|Voice=Actolevankaan
Case=Ine|InfForm=2|VerbForm=Inf|Voice=Actollessakaan
Case=Lat|InfForm=1|VerbForm=Inf|Voice=ActollakaanOllakoollapa
Case=Nom|Number=Sing|PartForm=Past|Style=Coll|VerbForm=Part|Voice=Actollukkaanollukkiollukko
Case=Nom|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actollutkaan
Case=Nom|Number=Plur|PartForm=Past|VerbForm=Part|Voice=Actolleetkaanolleetkin
Connegative=Yes|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Actollutkaan
Connegative=Yes|Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actookin
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Actolekaanolekin
Mood=Cnd|Number=Sing|Person=1|VerbForm=Fin|Voice=ActOlisinko
Mood=Cnd|Number=Sing|Person=2|VerbForm=Fin|Voice=ActOlisitpa
Mood=Cnd|Number=Sing|Person=3|Style=Coll|VerbForm=Fin|Voice=ActOiskohanoliskinolisko, oisko
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=ActOlisihanOlisikohanolisikaanolisikinolisikoOlisipa
Mood=Cnd|Number=Plur|Person=2|VerbForm=Fin|Voice=ActOlisitteko
Mood=Cnd|Number=Plur|Person=3|VerbForm=Fin|Voice=Actolisivatko
Mood=Imp|Number=Sing|Person=2|VerbForm=Fin|Voice=Actolekinolepa
Mood=Imp|Number=Sing|Person=3|VerbForm=Fin|Voice=Actolkoonkinolkoonpa
Mood=Ind|Number=Sing|Person=1|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actooks, oonko, olenk, Oonksmä
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=ActolinkinolinkoOlinpa
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=ActOlenhanolenkinolenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actootko, ootsä, Ooksää, oleksä, ook, Ookkonää, oleks
Mood=Ind|Number=Sing|Person=2|Tense=Past|VerbForm=Fin|Voice=ActOlithanOlitkoOlitkos
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=ActOletkohanoletkaanoletkinoletkoOletkosoletpa
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Actolikiioliks, olik
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actonkohaonkionks, onk
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=ActolihanolikohanolikaanolikinolikoOlikosolipaOlipas
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActonhanOnkohanonkaanonkinonkoonkosonpaOnpas
Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actolemmeko
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Past|VerbForm=Fin|Voice=ActOlitteks
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actootteko, Oottekste
Mood=Ind|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin|Voice=Actoletteko
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Actolivatkaan
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actovathanovatkaanovatkinovatko
Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=PassOllaas
Mood=Ind|Tense=Past|VerbForm=Fin|Voice=PassOltiinhanOltiinkin
Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=PassOllaanhanollaanpas
Mood=Pot|Number=Sing|Person=3|VerbForm=Fin|Voice=ActlieneekäänLiekö, lieneekö

NOUN

354 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (263; 74%).

NOUN tokens may have the following values of Clitic:

Paradigm lapsiHanHan,KinKaanKin
Case=Ade|Number=Plurlapsillakin
Case=Ela|Number=SingLapsestakin
Case=Ill|Number=PlurLapsiinhan
Case=Nom|Number=Singlapsikaanlapsikin
Case=Nom|Number=Plurlapsetkin
Case=Par|Number=Plur|Style=Colllapsijakkiihan

Clitic seems to be lexical feature of NOUN. 92% lemmas (249) occur only with one value of Clitic.

PRON

309 fi-pos/PRON tokens (3% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Number=Sing (238; 77%), Person=EMPTY (203; 66%), Case=Nom (178; 58%).

PRON tokens may have the following values of Clitic:

Paradigm seHanKaanKaan,KoKinKoPaPa,S
Case=Adesilläkin
Case=ElasiitähänsiitäkäänSiitäkinSiitäpä
Case=Gensenhänsenkään
Case=IllsiihenkinSiihenkö
Case=Inesiinäkinsiinäpä
Case=Ine|Style=Collsiinähä
Case=NomsehänsekäänsekinseköSepäSepäs
Case=Nom|Style=Collseki
Case=ParSitähänsitäkäänsitäkäänkösitäkinSitäkö

ADV

220 fi-pos/ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADV and Clitic co-occurred: PronType=EMPTY (132; 60%).

ADV tokens may have the following values of Clitic:

Paradigm miksiHanHan,KaHan,KoKoPa
miksihänMiksikähänMiksiköhänMiksikömiksipä

PART

119 fi-pos/PART tokens (2% of all PART tokens) have a non-empty value of Clitic.

PART tokens may have the following values of Clitic:

Paradigm kylläHanKaanKinPaPa,S
_kyllähänkylläkäänkylläkinKylläpäkylläpäs
Style=Collkylhän

ADJ

110 fi-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (70; 64%).

ADJ tokens may have the following values of Clitic:

Paradigm omaKinPa
Case=Ela|Number=Plur|Style=Collomistaki
Case=Nom|Number=SingOmapa
Case=Nom|Number=Pluromatkin

Clitic seems to be lexical feature of ADJ. 95% lemmas (73) occur only with one value of Clitic.

DET

78 fi-pos/DET tokens (2% of all DET tokens) have a non-empty value of Clitic.

The most frequent other feature values with which DET and Clitic co-occurred: Number=Sing (47; 60%).

DET tokens may have the following values of Clitic:

Paradigm tämäHanKaanKinKo
Case=EssTänäkääntänäkin
Case=Gentämänkääntämänkin
Case=Inetässäkin
Case=NomTämähänTämäkäänTämäkö
Case=Par|Style=Colltätäkä

PROPN

57 fi-pos/PROPN tokens (1% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (56; 98%).

PROPN tokens may have the following values of Clitic:

Paradigm suomiKinKo
Case=ElaSuomestakin
Case=GenSuomenkin
Case=IneSuomessakinSuomessako
Case=NomSuomikin

Clitic seems to be lexical feature of PROPN. 98% lemmas (44) occur only with one value of Clitic.

NUM

21 fi-pos/NUM tokens (1% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: NumType=Card (21; 100%), Number=Sing (20; 95%), Case=Nom (15; 71%).

NUM tokens may have the following values of Clitic:

Paradigm yksiKaanKin
Case=Essyhtenäkään
Case=Genyhdenkin
Case=Nomyksikäänyksikin

ADP

8 fi-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

ADP tokens may have the following values of Clitic: