home fi/pos edit page issue tracker

CONJ: coordinating conjunction

A coordinating conjunction is a word that links words or larger constituents without syntactically subordinating one to the other and expresses a semantic relationship between them.

For subordinating conjunctions, see SCONJ.

In correlative (paired) coordinating conjuctions such as sekä - että “both - and” and joko - tai “either - or”, both words are annotated CONJ (see also cc:preconj).

Examples

References


Treebank Statistics (UD_Finnish)

There are 35 CONJ lemmas (0%), 41 CONJ types (0%) and 7355 CONJ tokens (4%). Out of 15 observed tags, the rank of CONJ is: 12 in number of lemmas, 13 in number of types and 8 in number of tokens.

The 10 most frequent CONJ lemmas: ja, tai, mutta, sekä, eli, vaan, että, vai, kuin, niin

The 10 most frequent CONJ types: ja, tai, mutta, sekä, eli, vaan, että, vai, kuin, niin

The 10 most frequent ambiguous lemmas: ja (CONJ 5240, ADV 8), tai (CONJ 646, ADV 1), mutta (CONJ 578, ADV 2, SCONJ 1), eli (CONJ 123, ADV 1), vaan (CONJ 108, ADV 37), että (SCONJ 1297, CONJ 58, INTJ 1), vai (CONJ 52, ADV 2), kuin (SCONJ 428, ADV 49, CONJ 48), niin (ADV 357, CONJ 34, SCONJ 4, INTJ 2), mitä (ADV 14, CONJ 7, SCONJ 1)

The 10 most frequent ambiguous types: ja (CONJ 5154, ADV 8), mutta (CONJ 501, ADV 1), eli (CONJ 113, VERB 6), vaan (CONJ 103, ADV 31), että (SCONJ 1239, CONJ 58, INTJ 1), vai (CONJ 44, ADV 2), kuin (SCONJ 426, ADV 47, CONJ 46), niin (ADV 290, CONJ 34, SCONJ 2, INTJ 2), mut (CONJ 2, PRON 1), mitä (PRON 177, ADV 12, CONJ 4, SCONJ 1)

Morphology

The form / lemma ratio of CONJ is 1.171429 (the average of all parts of speech is 2.036755).

The 1st highest number of forms (3) was observed with the lemma “mutta”: mut, mutt, mutta.

The 2nd highest number of forms (2) was observed with the lemma “joko”: joko, jokö.

The 3rd highest number of forms (2) was observed with the lemma “kuin”: ku, kuin.

CONJ occurs with 3 features: fi-feat/Style (14; 0% instances), fi-feat/Typo (2; 0% instances), fi-feat/Clitic (1; 0% instances)

CONJ occurs with 3 feature-value pairs: Clitic=Ko, Style=Coll, Typo=Yes

CONJ occurs with 4 feature combinations. The most frequent feature combination is _ (7338 tokens). Examples: ja, tai, mutta, sekä, eli, vaan, että, vai, kuin, niin

Relations

CONJ nodes are attached to their parents using 7 different relations: fi-dep/cc (7192; 98% instances), fi-dep/cc:preconj (128; 2% instances), fi-dep/advmod (12; 0% instances), fi-dep/mark (11; 0% instances), fi-dep/mwe (8; 0% instances), fi-dep/root (3; 0% instances), fi-dep/compound:nn (1; 0% instances)

Parents of CONJ nodes belong to 13 different parts of speech: NOUN (2993; 41% instances), VERB (2787; 38% instances), ADJ (619; 8% instances), PROPN (601; 8% instances), ADV (149; 2% instances), NUM (88; 1% instances), PRON (79; 1% instances), SYM (13; 0% instances), X (12; 0% instances), ADP (5; 0% instances), CONJ (4; 0% instances), ROOT (3; 0% instances), AUX (2; 0% instances)

7324 (100%) CONJ nodes are leaves.

26 (0%) CONJ nodes have one child.

3 (0%) CONJ nodes have two children.

2 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 5.

Children of CONJ nodes are attached using 5 different relations: fi-dep/punct (19; 48% instances), fi-dep/mwe (16; 40% instances), fi-dep/nmod (3; 8% instances), fi-dep/compound:nn (1; 3% instances), fi-dep/nsubj (1; 3% instances)

Children of CONJ nodes belong to 5 different parts of speech: PUNCT (19; 48% instances), ADV (13; 33% instances), CONJ (4; 10% instances), NOUN (3; 8% instances), PROPN (1; 3% instances)


Treebank Statistics (UD_Finnish-FTB)

There are 21 CONJ lemmas (0%), 24 CONJ types (0%) and 4776 CONJ tokens (3%). Out of 16 observed tags, the rank of CONJ is: 13 in number of lemmas, 14 in number of types and 9 in number of tokens.

The 10 most frequent CONJ lemmas: ja, mutta, tai, vai, vaan, sekä, eli, että, joko, kuin

The 10 most frequent CONJ types: ja, mutta, tai, vai, vaan, sekä, eli, että, mut, joko

The 10 most frequent ambiguous lemmas: ja (CONJ 3230, PART 2), mutta (CONJ 604, SCONJ 1), vaan (CONJ 126, PART 86), eli (CONJ 55, PART 12), että (SCONJ 1772, CONJ 38, PART 3), kuin (SCONJ 647, CONJ 23), elikkä (CONJ 4, PART 4), joskaan (CONJ 4, SCONJ 1), vaikka (SCONJ 204, PART 73, CONJ 4), joskin (SCONJ 5, CONJ 2)

The 10 most frequent ambiguous types: ja (CONJ 3080, PART 2), vaan (CONJ 126, PART 73), eli (CONJ 55, VERB 5), että (SCONJ 1404, CONJ 38, VERB 2), mut (CONJ 21, PRON 3), kuin (SCONJ 615, CONJ 23), elikkä (CONJ 4, PART 1), joskaan (CONJ 4, SCONJ 1), vaikka (SCONJ 134, PART 65, CONJ 3), joskin (SCONJ 5, CONJ 2)

Morphology

The form / lemma ratio of CONJ is 1.142857 (the average of all parts of speech is 2.044212).

The 1st highest number of forms (3) was observed with the lemma “mutta”: mut, mutt, mutta.

The 2nd highest number of forms (2) was observed with the lemma “vaikka”: vaikk, vaikka.

The 3rd highest number of forms (1) was observed with the lemma “eli”: eli.

CONJ occurs with 1 features: fi-feat/Style (49; 1% instances)

CONJ occurs with 1 feature-value pairs: Style=Coll

CONJ occurs with 2 feature combinations. The most frequent feature combination is _ (4727 tokens). Examples: ja, mutta, tai, vai, vaan, sekä, eli, että, joko, kuin

Relations

CONJ nodes are attached to their parents using 4 different relations: fi-dep/cc (4726; 99% instances), fi-dep/advmod (23; 0% instances), fi-dep/mwe (14; 0% instances), fi-dep/conj (13; 0% instances)

Parents of CONJ nodes belong to 14 different parts of speech: VERB (1974; 41% instances), NOUN (1606; 34% instances), ADJ (533; 11% instances), PROPN (334; 7% instances), ADV (139; 3% instances), PRON (78; 2% instances), NUM (51; 1% instances), PART (15; 0% instances), CONJ (12; 0% instances), ADP (11; 0% instances), SCONJ (10; 0% instances), DET (7; 0% instances), X (4; 0% instances), INTJ (2; 0% instances)

4680 (98%) CONJ nodes are leaves.

94 (2%) CONJ nodes have one child.

1 (0%) CONJ nodes have two children.

1 (0%) CONJ nodes have three or more children.

The highest child degree of a CONJ node is 3.

Children of CONJ nodes are attached using 2 different relations: fi-dep/punct (87; 88% instances), fi-dep/conj (12; 12% instances)

Children of CONJ nodes belong to 2 different parts of speech: PUNCT (87; 88% instances), CONJ (12; 12% instances)


CONJ in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]