SYM: symbol
A symbol is a word-like entity that differs from ordinary words by form, function, or both.
Examples
- $, %, §, ©
- +, −, ×, ÷, =, <, >
- :), ♥‿♥, 😝
- john.doe@universal.org, http://universaldependencies.org/, 1-800-COMPANY
Treebank Statistics (UD_Finnish)
There are 196 SYM lemmas (1%), 198 SYM types (0%) and 458 SYM tokens (0%).
Out of 15 observed tags, the rank of SYM is: 8 in number of lemmas, 9 in number of types and 13 in number of tokens.
The 10 most frequent SYM lemmas: :), %, &, :D, ;), +, 3.Rf3, >, 2.f4, E21
The 10 most frequent SYM types: :), %, &, :D, ;), +, 3.Rf3, >, 2.f4, E21
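Counts of this kind can be reproduced from a treebank file. The following is a minimal sketch, assuming CoNLL-U input (ten tab-separated columns per token line) and assuming that "types" means distinct word forms counted case-sensitively. The names `read_tokens`, `tag_counts` and `_row`, and the toy sample, are hypothetical and not part of any UD tooling.

```python
def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, upos="SYM"):
    """Return (number of lemmas, number of types, number of tokens) for one tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


def _row(tok_id, form, lemma, upos):
    """Build one CoNLL-U token line with placeholder values in the other columns."""
    return "\t".join([tok_id, form, lemma, upos, "_", "_", "0", "dep", "_", "_"])


# Toy sample (hypothetical annotation, for illustration only).
SAMPLE = "\n".join([
    "# text = toy example",
    _row("1", "20", "20", "NUM"),
    _row("2", "°C", "°C", "SYM"),
    _row("3", "°C:ta", "°C", "SYM"),
    _row("4", ":)", ":)", "SYM"),
])

print(tag_counts(SAMPLE))  # (2, 3, 3)
```

In the toy sample the lemma °C has two forms, so the type count exceeds the lemma count by one, the same pattern seen in the SYM figures for UD_Finnish.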
The 10 most frequent ambiguous lemmas: :) (SYM 68, PUNCT 1), % (SYM 37, NOUN 9), & (SYM 21, PROPN 1), + (SYM 16, PROPN 2), °C (SYM 3, NOUN 1), A (NOUN 21, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), K (SYM 1, PROPN 1), V (ADJ 10, NOUN 1, SYM 1), × (PROPN 4, SYM 1)
The 10 most frequent ambiguous types: :) (SYM 68, PUNCT 1), & (SYM 21, PROPN 1), + (SYM 16, PROPN 2), A (NOUN 9, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), V (ADJ 7, NOUN 1, SYM 1), × (PROPN 4, SYM 1)
- :)
- &
- +
  - SYM 16: - Ruisleipä + oivariini + oltermanni maistuu vaan niin hyvältä .
  - PROPN 2: 2. Korvataan liitteessä II olevan II osan 2 kohdan A alakohdan taulukossa 4 sarakkeessa jäljempänä vasemmalla lueteltujen lajien kohdalla olevat merkinnät jäljempänä oikealla olevilla merkinnöillä : Alopecurus pratensis 2 Arrhenatherum elatius 2 Dactylis glomerata 2 Festuca arundinacea 2 Festuca ovina 2 Festuca pratensis 2 Festuca rubra 2 Lolium multiflorum 2 Lolium perenne 2 Lolium × boucheanum 2 Phalaris aquatica 2 Hedysarum coronarium 2 Lotus corniculatus 3 Lupinus albus 2 Lupinus angustifolius 2 Lupinus luteus 2 Medicago sativa 3 Medicago × varia 3 Onobrychis viciifolia 2 Pisum sativum 2 Trifolium alexandrinum 3 Trifolium hybridum 3 Trifolium incarnatum 3 Trifolium resupinatum 3 Trigonella foenum-graecum 2 Vicia faba 2 Vicia pannonica 2 Vicia sativa 2 Vicia villosa 2 Brassica napus var. napobrassica 2 Brassica oleracea convar. acephala var. medullosa + var. viridis 3 Raphanus sativus var. oleiformis 2 .
- A
- B
- V
  - ADJ 7: Hänen isänsä oli kuningas Mithridates V Euergetes .
  - NOUN 1: Siitä kehittyivät kreikkalaisen kirjaimiston digamma ja ypsilon , myöhemmin latinalaisen kirjaimiston F , V ja Y sekä edelleen U ja W .
  - SYM 1: Gliese 581 eli HO Librae on Vaa’an tähdistössä sijaitseva punainen kääpiötähti , jonka spektriluokka on M2,5 V .
- ×
  - PROPN 4: 2. Korvataan liitteessä II olevan II osan 2 kohdan A alakohdan taulukossa 4 sarakkeessa jäljempänä vasemmalla lueteltujen lajien kohdalla olevat merkinnät jäljempänä oikealla olevilla merkinnöillä : Alopecurus pratensis 2 Arrhenatherum elatius 2 Dactylis glomerata 2 Festuca arundinacea 2 Festuca ovina 2 Festuca pratensis 2 Festuca rubra 2 Lolium multiflorum 2 Lolium perenne 2 Lolium × boucheanum 2 Phalaris aquatica 2 Hedysarum coronarium 2 Lotus corniculatus 3 Lupinus albus 2 Lupinus angustifolius 2 Lupinus luteus 2 Medicago sativa 3 Medicago × varia 3 Onobrychis viciifolia 2 Pisum sativum 2 Trifolium alexandrinum 3 Trifolium hybridum 3 Trifolium incarnatum 3 Trifolium resupinatum 3 Trigonella foenum-graecum 2 Vicia faba 2 Vicia pannonica 2 Vicia sativa 2 Vicia villosa 2 Brassica napus var. napobrassica 2 Brassica oleracea convar. acephala var. medullosa + var. viridis 3 Raphanus sativus var. oleiformis 2 .
  - SYM 1: Tämä sopii hyvin yhteen sen kanssa , että tähti on vanha , ikä 7 - 11 × 109 vuotta .
Morphology
The form / lemma ratio of SYM is 1.010204 (the average of all parts of speech is 2.036755).
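The reported ratio is consistent with dividing the number of distinct SYM forms (198 types) by the number of distinct SYM lemmas (196). The page does not state the formula, so this definition is an assumption, and the function name `form_lemma_ratio` is hypothetical. A quick check:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma (assumed definition)."""
    return n_types / n_lemmas


# Counts taken from the UD_Finnish statistics for SYM.
print(f"{form_lemma_ratio(198, 196):.6f}")  # 1.010204
```

A ratio this close to 1 reflects that symbols are almost never inflected; only two lemmas in the data occur with more than one form.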
The highest number of forms (2) was observed with the lemma “SRT#8”: SRT-8, SRT-8:ssa.
The second highest number of forms (2) was observed with the lemma “°C”: °C, °C:ta.
The third highest number of forms (1) was observed with the lemma “#”: #.
SYM occurs with 1 feature: fi-feat/Case (2; 0% instances)
SYM occurs with 2 feature-value pairs: Case=Ine, Case=Par
SYM occurs with 3 feature combinations.
The most frequent feature combination is _, i.e. no features (456 tokens).
Examples: :), %, &, :D, ;), +, 3.Rf3, >, 2.f4, E21
Relations
SYM nodes are attached to their parents using 23 different relations: fi-dep/discourse (118; 26% instances), fi-dep/name (95; 21% instances), fi-dep/nmod (67; 15% instances), fi-dep/dobj (27; 6% instances), fi-dep/appos (26; 6% instances), fi-dep/punct (26; 6% instances), fi-dep/nsubj (19; 4% instances), fi-dep/conj (12; 3% instances), fi-dep/root (11; 2% instances), fi-dep/compound:nn (10; 2% instances), fi-dep/cc (9; 2% instances), fi-dep/nsubj:cop (7; 2% instances), fi-dep/advcl (6; 1% instances), fi-dep/compound (6; 1% instances), fi-dep/remnant (4; 1% instances), fi-dep/dep (3; 1% instances), fi-dep/nummod (3; 1% instances), fi-dep/parataxis (3; 1% instances), fi-dep/acl:relcl (2; 0% instances), fi-dep/advmod (1; 0% instances), fi-dep/amod (1; 0% instances), fi-dep/case (1; 0% instances), fi-dep/vocative (1; 0% instances)
Parents of SYM nodes belong to 10 different parts of speech: VERB (147; 32% instances), NOUN (138; 30% instances), SYM (86; 19% instances), ADJ (33; 7% instances), PROPN (26; 6% instances), ROOT (11; 2% instances), NUM (8; 2% instances), ADV (4; 1% instances), X (3; 1% instances), PRON (2; 0% instances)
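The relation and parent part-of-speech distributions can be tallied in one pass over the dependency trees. This is a minimal sketch, assuming each sentence is already available as a list of `(id, head, upos, deprel)` tuples with integer ids and head 0 for the root; the function name `parent_stats` and the toy sentence are hypothetical. Counting a head of 0 as a parent tagged ROOT is an assumption made to mirror the ROOT entry in the list above.

```python
from collections import Counter


def parent_stats(sentences, upos="SYM"):
    """Count incoming relations and parent tags for all nodes with the given tag.

    sentences: list of sentences, each a list of (id, head, upos, deprel) tuples.
    Returns (relation counter, parent-tag counter).
    """
    relations, parent_tags = Counter(), Counter()
    for sent in sentences:
        tag_of = {tok_id: tag for tok_id, _, tag, _ in sent}
        for _, head, tag, deprel in sent:
            if tag != upos:
                continue
            relations[deprel] += 1
            # Head 0 is the artificial root, counted here as the pseudo-tag ROOT.
            parent_tags[tag_of.get(head, "ROOT")] += 1
    return relations, parent_tags


# Toy sentence (hypothetical annotation, for illustration only).
toy = [[(1, 2, "NUM", "nummod"), (2, 0, "SYM", "root"), (3, 2, "SYM", "conj")]]
relations, parent_tags = parent_stats(toy)
print(dict(relations))    # {'root': 1, 'conj': 1}
print(dict(parent_tags))  # {'ROOT': 1, 'SYM': 1}
```

Both counters sum to the number of SYM tokens, which matches the reported figures: the parent counts listed above add up to 458.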
299 (65%) SYM nodes are leaves.
39 (9%) SYM nodes have one child.
64 (14%) SYM nodes have two children.
56 (12%) SYM nodes have three or more children.
The highest child degree of a SYM node is 13.
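The child-degree figures can be derived by counting, for every SYM node, how many tokens name it as their head. The sketch below assumes sentences given as lists of `(id, head, upos)` tuples with integer ids; `child_degree_histogram` and the toy sentence are hypothetical names and data. The second part only re-derives the rounded percentages from the counts reported above.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="SYM"):
    """Map number of children -> number of nodes with the given tag."""
    hist = Counter()
    for sent in sentences:
        # How many tokens point at each head id within this sentence.
        n_children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1
    return hist


# Toy sentence (hypothetical): token 2 has three children, token 4 is a leaf.
toy = [[(1, 2, "NUM"), (2, 0, "SYM"), (3, 2, "PUNCT"), (4, 2, "SYM")]]
print(dict(child_degree_histogram(toy)))  # {3: 1, 0: 1}

# Consistency check on the reported UD_Finnish counts.
REPORTED = {"0": 299, "1": 39, "2": 64, "3+": 56}
total = sum(REPORTED.values())
shares = {k: round(100 * v / total) for k, v in REPORTED.items()}
print(total, shares)  # 458 {'0': 65, '1': 9, '2': 14, '3+': 12}
```

The four groups add up to the 458 SYM tokens, and the rounded shares agree with the percentages given in the text.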
Children of SYM nodes are attached using 19 different relations: fi-dep/punct (152; 36% instances), fi-dep/name (90; 21% instances), fi-dep/nummod (51; 12% instances), fi-dep/nmod (21; 5% instances), fi-dep/nsubj:cop (18; 4% instances), fi-dep/conj (17; 4% instances), fi-dep/cop (15; 4% instances), fi-dep/cc (13; 3% instances), fi-dep/compound:nn (12; 3% instances), fi-dep/advmod (11; 3% instances), fi-dep/acl:relcl (5; 1% instances), fi-dep/appos (5; 1% instances), fi-dep/compound (4; 1% instances), fi-dep/remnant (4; 1% instances), fi-dep/mark (3; 1% instances), fi-dep/acl (2; 0% instances), fi-dep/amod (2; 0% instances), fi-dep/advcl (1; 0% instances), fi-dep/case (1; 0% instances)
Children of SYM nodes belong to 12 different parts of speech: PUNCT (152; 36% instances), SYM (86; 20% instances), NUM (69; 16% instances), NOUN (57; 13% instances), VERB (24; 6% instances), CONJ (13; 3% instances), ADV (12; 3% instances), ADJ (5; 1% instances), PRON (3; 1% instances), PROPN (3; 1% instances), SCONJ (2; 0% instances), ADP (1; 0% instances)
Treebank Statistics (UD_Finnish-FTB)
There are 6 SYM lemmas (0%), 6 SYM types (0%) and 22 SYM tokens (0%).
Out of 16 observed tags, the rank of SYM is: 16 in number of lemmas, 16 in number of types and 16 in number of tokens.
The 10 most frequent SYM lemmas: %, &, /, +, *, @
The 10 most frequent SYM types: %, &, /, +, *, @
The 10 most frequent ambiguous lemmas: none
The 10 most frequent ambiguous types: none
Morphology
The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 2.044212).
The highest number of forms (1) was observed with the lemma “%”: %.
The second highest number of forms (1) was observed with the lemma “&”: &.
The third highest number of forms (1) was observed with the lemma “*”: *.
SYM does not occur with any features.
Relations
SYM nodes are attached to their parents using 1 relation: fi-dep/dep (22; 100% instances)
Parents of SYM nodes belong to 3 different parts of speech: NOUN (11; 50% instances), PROPN (7; 32% instances), VERB (4; 18% instances)
13 (59%) SYM nodes are leaves.
7 (32%) SYM nodes have one child.
2 (9%) SYM nodes have two children.
The highest child degree of a SYM node is 2.
Children of SYM nodes are attached using 2 different relations: fi-dep/nummod (8; 73% instances), fi-dep/punct (3; 27% instances)
Children of SYM nodes belong to 2 different parts of speech: NUM (8; 73% instances), PUNCT (3; 27% instances)