SYM: symbol
In English, SYM covers the PTB tags NFP (except for lines of separators, which become PUNCT), #, $, and SYM, as well as the percent sign (%).
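The mapping rule above can be sketched as a small function. This is only an illustration, not the conversion tool used for the treebank: the function name `ptb_to_upos_sym` is hypothetical, and the test for a "line of separators" (one separator character repeated three or more times) is an assumption, since the text does not define it.

```python
import re
from typing import Optional

# ASSUMPTION: a "line of separators" is a single character from a small
# set, repeated three or more times (e.g. "-----" or "*****").
SEPARATOR_LINE = re.compile(r"^([-=*_~])\1{2,}$")

def ptb_to_upos_sym(form: str, ptb_tag: str) -> Optional[str]:
    """Return "SYM" or "PUNCT" when the rule applies, otherwise None."""
    if ptb_tag == "NFP":
        # NFP becomes SYM, except for separator lines, which become PUNCT.
        return "PUNCT" if SEPARATOR_LINE.match(form) else "SYM"
    if ptb_tag in {"#", "$", "SYM"}:
        return "SYM"
    if form == "%":
        # The percent sign is SYM regardless of its PTB tag.
        return "SYM"
    return None  # the rule says nothing about other tokens
```

A smiley tagged NFP would come out as SYM, while a row of dashes tagged NFP would come out as PUNCT.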
Treebank Statistics (UD_English)
There are 82 SYM lemmas (0%), 82 SYM types (0%) and 758 SYM tokens (0%).
Out of 17 observed tags, the rank of SYM is: 11 in number of lemmas, 11 in number of types and 17 in number of tokens.
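Counts and ranks of this kind can be derived from (form, lemma, tag) triples. The sketch below assumes that a "type" is a distinct word form and that ranks are taken in descending order of each count; the function names are hypothetical, and the actual generator of these statistics may normalize forms differently.

```python
from collections import defaultdict

def tag_inventory(tokens):
    """tokens: iterable of (form, lemma, upos).
    Returns {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas, types, counts = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)
        types[upos].add(form)   # ASSUMPTION: a type is a distinct form
        counts[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), counts[t]) for t in counts}

def rank_of(tag, inventory, column):
    """1-based rank of `tag` by column 0 (lemmas), 1 (types) or 2 (tokens)."""
    ordered = sorted(inventory, key=lambda t: inventory[t][column], reverse=True)
    return ordered.index(tag) + 1
```

With 17 observed tags, a token rank of 17 means SYM is the least frequent tag by token count.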
The 10 most frequent SYM
lemmas: $, -, :), %, /, +, |, :(, :-), :d
The 10 most frequent SYM
types: $, -, :), %, /, +, |, :(, :-), :D
The 10 most frequent ambiguous lemmas: $ (SYM 294, NOUN 4), - (PUNCT 1651, SYM 117, X 11), :) (SYM 58, PUNCT 2), % (SYM 46, X 1), / (PUNCT 242, SYM 32, X 2), + (SYM 25, CONJ 1), | (SYM 20, PUNCT 1), x (NOUN 10, SYM 6, X 2, ADP 1), … (PUNCT 325, SYM 5), = (PUNCT 5, SYM 4)
The 10 most frequent ambiguous types: $ (SYM 294, NOUN 4), - (PUNCT 1651, SYM 117, X 11), :) (SYM 58, PUNCT 2), % (SYM 46, X 1), / (PUNCT 242, SYM 32, X 2), + (SYM 25, CONJ 1), | (SYM 20, PUNCT 1), x (NOUN 5, SYM 5, X 1), … (PUNCT 325, SYM 5), = (PUNCT 5, SYM 4)
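An ambiguity listing like the one above can be approximated as follows. The sketch treats an item (a lemma or a type) as ambiguous when it occurs with SYM and with at least one other tag, and orders items by their SYM count, which is what the figures above appear to follow. The function name `ambiguous_items` is hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, tag, top=10):
    """pairs: iterable of (item, upos), where item is a lemma or a form.
    Returns up to `top` items seen with `tag` and with another tag,
    each paired with its per-tag counts (most frequent tag first)."""
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    ambiguous = {i: c for i, c in by_item.items() if tag in c and len(c) > 1}
    # ASSUMPTION: items are ranked by how often they carry `tag`.
    ranked = sorted(ambiguous, key=lambda i: ambiguous[i][tag], reverse=True)
    return [(i, ambiguous[i].most_common()) for i in ranked[:top]]
```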
Morphology
The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.173797).
The 1st highest number of forms (1) was observed with the lemma “###”: ###.
The 2nd highest number of forms (1) was observed with the lemma “$”: $.
The 3rd highest number of forms (1) was observed with the lemma “%”: %.
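One plausible reading of the form / lemma ratio is the number of distinct forms summed over all lemmas of a tag, divided by the number of lemmas. The sketch below follows that reading; it is an assumption about how the figure is computed, and the function names are hypothetical.

```python
from collections import defaultdict

def forms_per_lemma(tokens, tag):
    """tokens: iterable of (form, lemma, upos).
    Returns {lemma: set of distinct forms} for the given tag."""
    forms = defaultdict(set)
    for form, lemma, upos in tokens:
        if upos == tag:
            forms[lemma].add(form)
    return forms

def form_lemma_ratio(tokens, tag):
    """Average number of distinct forms per lemma; 0.0 if the tag is unseen."""
    forms = forms_per_lemma(tokens, tag)
    if not forms:
        return 0.0
    return sum(len(f) for f in forms.values()) / len(forms)
```

A ratio of exactly 1 means every SYM lemma was observed with a single form, which is consistent with each of the lemmas listed above having one form.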
SYM occurs with 1 feature: en-feat/Number (48; 6% instances)
SYM occurs with 1 feature-value pair: Number=Sing
SYM occurs with 2 feature combinations.
The most frequent feature combination is _ (710 tokens).
Examples: $, -, :), /, +, |, :(, :-), :D, x
Relations
SYM
nodes are attached to their parents using 24 different relations: en-dep/case (117; 15% instances), en-dep/discourse (113; 15% instances), en-dep/root (106; 14% instances), en-dep/nmod (78; 10% instances), en-dep/punct (70; 9% instances), en-dep/dobj (65; 9% instances), en-dep/compound (56; 7% instances), en-dep/nmod:npmod (24; 3% instances), en-dep/cc (23; 3% instances), en-dep/appos (21; 3% instances), en-dep/list (19; 3% instances), en-dep/conj (18; 2% instances), en-dep/advmod (16; 2% instances), en-dep/parataxis (13; 2% instances), en-dep/nsubjpass (5; 1% instances), en-dep/acl:relcl (2; 0% instances), en-dep/advcl (2; 0% instances), en-dep/ccomp (2; 0% instances), en-dep/nsubj (2; 0% instances), en-dep/nummod (2; 0% instances), en-dep/amod (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/reparandum (1; 0% instances), en-dep/xcomp (1; 0% instances)
Parents of SYM
nodes belong to 13 different parts of speech: NOUN (203; 27% instances), VERB (197; 26% instances), NUM (114; 15% instances), ROOT (106; 14% instances), ADJ (39; 5% instances), PROPN (34; 4% instances), SYM (30; 4% instances), X (17; 2% instances), ADV (11; 1% instances), DET (3; 0% instances), CONJ (2; 0% instances), ADP (1; 0% instances), PRON (1; 0% instances)
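The relation and parent statistics above can be reproduced in outline from the ID, UPOS, HEAD and DEPREL columns of a dependency treebank. In this sketch each sentence is a list of (id, upos, head, deprel) tuples with head 0 for the root; the function names are hypothetical, and the percentages are simply rounded shares of the total.

```python
from collections import Counter

def parent_profile(sentences, tag):
    """Count incoming relations and parent parts of speech for `tag` nodes."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        upos_by_id = {tok_id: upos for tok_id, upos, _, _ in sent}
        for tok_id, upos, head, deprel in sent:
            if upos == tag:
                relations[deprel] += 1
                # Head 0 has no token; report it as the artificial ROOT.
                parent_pos[upos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

def with_percent(counter):
    """List (key, count, rounded percent) in descending order of count."""
    total = sum(counter.values())
    return [(k, n, round(100 * n / total)) for k, n in counter.most_common()]
```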
398 (53%) SYM nodes are leaves.
122 (16%) SYM nodes have one child.
89 (12%) SYM nodes have two children.
149 (20%) SYM nodes have three or more children.
The highest child degree of a SYM node is 11.
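The child-degree breakdown above amounts to counting, for each SYM node, how many tokens name it as their head. A minimal sketch, assuming each sentence is a list of (id, upos, head) tuples with head 0 for the root; the function name is hypothetical.

```python
from collections import Counter

def child_degree_histogram(sentences, tag):
    """Return (buckets, max_degree) for nodes with the given tag.
    Bucket keys are 0, 1, 2 and 3, where 3 stands for "three or more"."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per head id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, upos, _ in sent:
            if upos != tag:
                continue
            degree = n_children[tok_id]
            buckets[min(degree, 3)] += 1
            max_degree = max(max_degree, degree)
    return buckets, max_degree
```

The four buckets partition all SYM nodes, so their counts sum to the token total (398 + 122 + 89 + 149 = 758).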
Children of SYM
nodes are attached using 26 different relations: en-dep/nummod (303; 33% instances), en-dep/punct (167; 18% instances), en-dep/case (80; 9% instances), en-dep/appos (65; 7% instances), en-dep/nmod (56; 6% instances), en-dep/compound (53; 6% instances), en-dep/advmod (38; 4% instances), en-dep/cop (23; 3% instances), en-dep/nsubj (22; 2% instances), en-dep/cc (20; 2% instances), en-dep/det (20; 2% instances), en-dep/conj (19; 2% instances), en-dep/advcl (9; 1% instances), en-dep/amod (7; 1% instances), en-dep/nmod:npmod (5; 1% instances), en-dep/parataxis (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/mark (3; 0% instances), en-dep/acl (2; 0% instances), en-dep/discourse (2; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/aux (1; 0% instances), en-dep/dobj (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/nmod:tmod (1; 0% instances), en-dep/xcomp (1; 0% instances)
Children of SYM
nodes belong to 17 different parts of speech: NUM (355; 39% instances), PUNCT (163; 18% instances), NOUN (130; 14% instances), ADP (81; 9% instances), VERB (40; 4% instances), SYM (30; 3% instances), ADV (29; 3% instances), DET (23; 3% instances), CONJ (20; 2% instances), ADJ (14; 2% instances), PRON (13; 1% instances), PROPN (4; 0% instances), SCONJ (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances), X (1; 0% instances)
Treebank Statistics (UD_English-ESL)
There is 1 SYM lemma (6%), 1 SYM type (6%) and 39 SYM tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 15 in number of lemmas, 15 in number of types and 17 in number of tokens.
The 10 most frequent SYM
lemmas: _
The 10 most frequent SYM
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 15986, VERB 15080, DET 10562, PRON 9758, PUNCT 9580, ADP 8546, ADJ 5857, ADV 5704, AUX 4533, PART 3531, CONJ 3198, SCONJ 2520, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)
The 10 most frequent ambiguous types: _ (NOUN 15986, VERB 15080, DET 10562, PRON 9758, PUNCT 9580, ADP 8546, ADJ 5857, ADV 5704, AUX 4533, PART 3531, CONJ 3198, SCONJ 2520, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 10 different relations: en-dep/dobj (7; 18% instances), en-dep/nmod (7; 18% instances), en-dep/compound (6; 15% instances), en-dep/conj (5; 13% instances), en-dep/nsubj (5; 13% instances), en-dep/punct (3; 8% instances), en-dep/appos (2; 5% instances), en-dep/root (2; 5% instances), en-dep/acl:relcl (1; 3% instances), en-dep/case (1; 3% instances)
Parents of SYM
nodes belong to 7 different parts of speech: NOUN (14; 36% instances), VERB (12; 31% instances), SYM (7; 18% instances), NUM (2; 5% instances), ROOT (2; 5% instances), ADJ (1; 3% instances), PROPN (1; 3% instances)
4 (10%) SYM nodes are leaves.
14 (36%) SYM nodes have one child.
9 (23%) SYM nodes have two children.
12 (31%) SYM nodes have three or more children.
The highest child degree of a SYM node is 13.
Children of SYM
nodes are attached using 13 different relations: en-dep/nummod (35; 41% instances), en-dep/punct (10; 12% instances), en-dep/case (7; 8% instances), en-dep/nmod (7; 8% instances), en-dep/conj (6; 7% instances), en-dep/advmod (4; 5% instances), en-dep/acl:relcl (3; 3% instances), en-dep/cc (3; 3% instances), en-dep/cop (3; 3% instances), en-dep/det (3; 3% instances), en-dep/nsubj (3; 3% instances), en-dep/amod (1; 1% instances), en-dep/appos (1; 1% instances)
Children of SYM
nodes belong to 10 different parts of speech: NUM (35; 41% instances), PUNCT (10; 12% instances), NOUN (8; 9% instances), ADP (7; 8% instances), SYM (7; 8% instances), VERB (7; 8% instances), ADV (4; 5% instances), DET (4; 5% instances), CONJ (3; 3% instances), ADJ (1; 1% instances)
Treebank Statistics (UD_English-LinES)
There is 1 SYM lemma (6%), 2 SYM types (0%) and 6 SYM tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 15 in number of lemmas, 17 in number of types and 17 in number of tokens.
The 10 most frequent SYM
lemmas: _
The 10 most frequent SYM
types: %, -%
The 10 most frequent ambiguous lemmas: _ (NOUN 14939, VERB 11076, PUNCT 10025, ADP 8281, DET 7865, PRON 7793, ADJ 5305, ADV 4610, AUX 3168, PROPN 2792, CONJ 2535, PART 2131, SCONJ 1512, NUM 581, INTJ 159, X 43, SYM 6)
The 10 most frequent ambiguous types: % (SYM 4, NOUN 2), -% (SYM 2, NOUN 1)
Morphology
The form / lemma ratio of SYM
is 2.000000 (the average of all parts of speech is 597.705882).
The 1st highest number of forms (2) was observed with the lemma “_”: %, -%.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 5 different relations: en-dep/advmod (2; 33% instances), en-dep/amod (1; 17% instances), en-dep/appos (1; 17% instances), en-dep/dobj (1; 17% instances), en-dep/nmod (1; 17% instances)
Parents of SYM
nodes belong to 2 different parts of speech: VERB (4; 67% instances), NOUN (2; 33% instances)
0 (0%) SYM nodes are leaves.
0 (0%) SYM nodes have one child.
3 (50%) SYM nodes have two children.
3 (50%) SYM nodes have three or more children.
The highest child degree of a SYM node is 3.
Children of SYM
nodes are attached using 6 different relations: en-dep/nummod (5; 33% instances), en-dep/punct (4; 27% instances), en-dep/case (3; 20% instances), en-dep/det (1; 7% instances), en-dep/mark (1; 7% instances), en-dep/nmod (1; 7% instances)
Children of SYM
nodes belong to 6 different parts of speech: NUM (5; 33% instances), PUNCT (4; 27% instances), ADP (3; 20% instances), ADV (1; 7% instances), DET (1; 7% instances), NOUN (1; 7% instances)