NUM: numeral
This document is a placeholder for the language-specific documentation for NUM.
Treebank Statistics (UD_Spanish)
There are 2276 NUM lemmas (6%), 2383 NUM types (5%) and 11017 NUM tokens (3%).
Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.
The 10 most frequent NUM lemmas: dos, 2010, tres, 0, 3, cuatro, 1, 6, 2, 10
The 10 most frequent NUM types: dos, 2010, tres, 0, cuatro, 3, 1, 2, 10, 4
The 10 most frequent ambiguous lemmas: dos (NUM 592, PROPN 7, NOUN 5, ADP 2, X 2), tres (NUM 231, PROPN 4, NOUN 2), cuatro (NUM 164, NOUN 2, PROPN 1), 6 (NUM 157, NOUN 1), uno (DET 7651, PRON 540, NUM 108, PROPN 2, NOUN 2, ADJ 2, X 1), 2000 (NUM 102, NOUN 1), cinco (NUM 91, PROPN 3, NOUN 2), ii (NUM 70, PROPN 9), i (NUM 52, PROPN 22, CONJ 20, X 16), mil (NUM 47, NOUN 34, PROPN 3)
The 10 most frequent ambiguous types: dos (NUM 569, NOUN 5, X 2, ADP 1), tres (NUM 224, NOUN 1), cuatro (NUM 157, NOUN 1), 2000 (NUM 94, NOUN 1), cinco (NUM 87, NOUN 2), II (NUM 70, PROPN 9), un (DET 3886, NUM 53), I (NUM 52, PROPN 21, X 10), siete (NUM 36, PROPN 1, NOUN 1), una (DET 3269, PRON 181, NUM 35, NOUN 2, X 1)
- dos
  - NUM 569: Una hora más tarde Knowles grabaron « » y otras dos canciones .
  - NOUN 5: Los primeros dos de cada grupo disputan las semifinales .
  - X 2: Tânia Regina dos Santos Silva ( n. 1968 ) es una botánica , y profesora brasileña .
  - ADP 1: Fernando Teixeira dos Santos , ministro portugués de Finanzas , está en Beijing para reunir se con su homólogo Xie Xuren y con el gobernador de el Banco Central , Zhou Xiaochuan .
- tres
- cuatro
- 2000
- cinco
  - NUM 87: ¡ Más bien cinco días ! “
  - NOUN 2: En breves horas el inglés David Beckham podría firmar su nuevo contrato con el Paris Saint - Germain por año y medio , 18 meses en los que este cobra en torno a los 11 millones de euros , unos cinco en el primer año , en el medio y 7 millones por la segunda temporada que completara con los parisinos .
- II
- un
- I
- siete
  - NUM 36: Montoya ganó siete carreras .
  - PROPN 1: También alcanzó el número uno en Nueva Zelanda , el número dos en el Reino Unido y número siete en Australia y seis en Canadá .
  - NOUN 1: Dos grupos de delfines que sumaban nueve ejemplares vararon este sábado en dos diferentes puntos de la costa asturiana , según ha informado la Coordinadora para el Estudio y la Protección de las Especies Marinas ( Cepesma ) , que ha indicado que fueron salvados siete y llevados de nuevo a mar abierto .
- una
  - DET 3269: Ae Fond Kiss … es una película dirigida por Ken Loach .
  - PRON 181: Me parece una de las mejores opciones hoy en día .
  - NUM 35: Me ha sacado de apuros mas de una vez .
  - NOUN 2: Mauricio Cuevas colaboró con 62 en manganas a pie y Juan Dimas acomodó una a caballo de 29 .
  - X 1: Planet of Dinosaurs , es una es una película de ciencia ficción estadounidense de 1978 , dirigida por James K. Shmea .
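The lemma, type and token counts and the ambiguous-type lists above could be reproduced from a treebank file along the following lines. This is only a minimal sketch, assuming the standard tab-separated CoNLL-U layout (FORM, LEMMA and the universal POS tag in columns 2 to 4) and assuming that types are counted as case-sensitive word forms. The `SAMPLE` fragment and the names `read_tokens` and `tag_statistics` are hypothetical and are not part of the tooling that generated this page.

```python
from collections import Counter, defaultdict

# Tiny hand-written CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
1\tdos\tdos\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tcanciones\tcanción\tNOUN\t_\t_\t0\troot\t_\t_

1\tLos\tel\tDET\t_\t_\t2\tdet\t_\t_
2\tdos\tdos\tNOUN\t_\t_\t0\troot\t_\t_

1\tDos\tdos\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tgrupos\tgrupo\tNOUN\t_\t_\t0\troot\t_\t_
"""


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for every ordinary token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def tag_statistics(conllu_text, tag):
    """Count lemmas, types and tokens of `tag`, plus its ambiguous types."""
    lemmas, types, tokens = set(), set(), 0
    tags_by_type = defaultdict(Counter)  # form -> Counter of POS tags
    for form, lemma, upos in read_tokens(conllu_text):
        tags_by_type[form][upos] += 1
        if upos == tag:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    # A type is ambiguous if it occurs with `tag` and at least one other tag.
    ambiguous = {
        form: dict(counts)
        for form, counts in tags_by_type.items()
        if tag in counts and len(counts) > 1
    }
    return {
        "lemmas": len(lemmas),
        "types": len(types),
        "tokens": tokens,
        "ambiguous": ambiguous,
    }


stats = tag_statistics(SAMPLE, "NUM")
print(stats)
```

In the sample, "dos" and "Dos" count as two types of one lemma, and only the lowercase form is ambiguous because it also occurs as NOUN.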
Morphology
The form / lemma ratio of NUM is 1.047012 (the average of all parts of speech is 1.255739).
The 1st highest number of forms (4) was observed with the lemma “2000”: 2,000, 2.000, 2000, 25000.
The 2nd highest number of forms (4) was observed with the lemma “3”: 3, 33, 36, 37.
The 3rd highest number of forms (3) was observed with the lemma “1100”: 1,100, 1.100, 1100.
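The reported form / lemma ratio appears to be the number of NUM types divided by the number of NUM lemmas. That reading is an inference from the figures on this page, not a documented definition, but it matches both treebanks to six decimals. The function name `form_lemma_ratio` below is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Type and lemma counts as reported on this page for each treebank.
ud_spanish = form_lemma_ratio(2383, 2276)
ancora = form_lemma_ratio(1551, 1532)

print(f"UD_Spanish:        {ud_spanish:.6f}")
print(f"UD_Spanish-AnCora: {ancora:.6f}")
```

A ratio this close to 1 means that most numerals occur in a single form, which fits a class made up largely of digit strings.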
NUM occurs with 3 features: es-feat/NumType (11017; 100% instances), es-feat/Number (1627; 15% instances), es-feat/Gender (208; 2% instances)
NUM occurs with 5 feature-value pairs: Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing
NUM occurs with 9 feature combinations.
The most frequent feature combination is NumType=Card (9378 tokens).
Examples: dos, 2010, 0, 3, 1, 2, 10, 4, tres, 5
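The three feature statistics above (features, feature-value pairs and feature combinations) can be derived from the FEATS column of the NUM tokens. The sketch below assumes the usual CoNLL-U encoding, where pairs are joined with `|` and `_` marks an empty feature set. The sample values and the name `feature_statistics` are hypothetical.

```python
from collections import Counter


def feature_statistics(feats_column):
    """Summarize FEATS strings such as 'Gender=Fem|NumType=Card'."""
    feature_counts = Counter()  # feature name -> number of tokens carrying it
    pairs = set()               # distinct feature=value pairs
    combinations = Counter()    # full FEATS string -> number of tokens
    for feats in feats_column:
        combinations[feats] += 1
        if feats == "_":
            continue  # token without any features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            feature_counts[name] += 1
            pairs.add(pair)
    return feature_counts, pairs, combinations


# Hypothetical FEATS values for four NUM tokens.
sample = [
    "NumType=Card",
    "NumType=Card",
    "Gender=Fem|Number=Sing|NumType=Card",
    "Number=Plur|NumType=Card",
]
feature_counts, pairs, combinations = feature_statistics(sample)
print(feature_counts)
print(sorted(pairs))
print(combinations.most_common(1))
```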
Relations
NUM nodes are attached to their parents using 19 different relations: es-dep/nummod (6021; 55% instances), es-dep/nmod (3843; 35% instances), es-dep/appos (454; 4% instances), es-dep/conj (436; 4% instances), es-dep/dep (91; 1% instances), es-dep/nsubj (70; 1% instances), es-dep/dobj (46; 0% instances), es-dep/root (22; 0% instances), es-dep/name (10; 0% instances), es-dep/nsubjpass (5; 0% instances), es-dep/parataxis (5; 0% instances), es-dep/compound (4; 0% instances), es-dep/advmod (3; 0% instances), es-dep/amod (2; 0% instances), es-dep/acl (1; 0% instances), es-dep/advcl (1; 0% instances), es-dep/cc (1; 0% instances), es-dep/det (1; 0% instances), es-dep/iobj (1; 0% instances)
Parents of NUM nodes belong to 15 different parts of speech: NOUN (5084; 46% instances), PROPN (1986; 18% instances), VERB (1872; 17% instances), SYM (1066; 10% instances), NUM (636; 6% instances), X (223; 2% instances), ADJ (67; 1% instances), PRON (33; 0% instances), ROOT (22; 0% instances), ADV (18; 0% instances), ADP (3; 0% instances), DET (3; 0% instances), CONJ (2; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)
5877 (53%) NUM nodes are leaves.
3193 (29%) NUM nodes have one child.
1301 (12%) NUM nodes have two children.
646 (6%) NUM nodes have three or more children.
The highest child degree of a NUM node is 32.
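As a quick consistency check, the four child-degree counts above should add up to the 11017 NUM tokens, and each percentage should be its count's share of that total. The sketch below recomputes both from the figures on this page; the name `degree_shares` and the dictionary keys are hypothetical.

```python
def degree_shares(counts):
    """Return the total and each count's share as a rounded percentage."""
    total = sum(counts.values())
    shares = {label: round(100 * n / total) for label, n in counts.items()}
    return total, shares


# Child-degree counts for NUM nodes in UD_Spanish, as listed above.
ud_spanish = {
    "leaf": 5877,
    "one_child": 3193,
    "two_children": 1301,
    "three_or_more": 646,
}
total, shares = degree_shares(ud_spanish)
print(total, shares)
```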
Children of NUM nodes are attached using 20 different relations: es-dep/case (3106; 38% instances), es-dep/punct (1202; 15% instances), es-dep/nmod (1183; 14% instances), es-dep/det (968; 12% instances), es-dep/advmod (461; 6% instances), es-dep/conj (450; 6% instances), es-dep/cc (345; 4% instances), es-dep/dep (156; 2% instances), es-dep/nummod (115; 1% instances), es-dep/appos (62; 1% instances), es-dep/amod (35; 0% instances), es-dep/cop (26; 0% instances), es-dep/acl:relcl (20; 0% instances), es-dep/nsubj (17; 0% instances), es-dep/acl (11; 0% instances), es-dep/advcl (9; 0% instances), es-dep/compound (7; 0% instances), es-dep/aux (3; 0% instances), es-dep/parataxis (3; 0% instances), es-dep/mark (2; 0% instances)
Children of NUM nodes belong to 16 different parts of speech: ADP (3129; 38% instances), PUNCT (1202; 15% instances), DET (1034; 13% instances), PROPN (947; 12% instances), NUM (636; 8% instances), ADV (423; 5% instances), CONJ (346; 4% instances), NOUN (209; 3% instances), SYM (83; 1% instances), VERB (67; 1% instances), ADJ (45; 1% instances), X (42; 1% instances), PRON (12; 0% instances), AUX (3; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances)
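The parent and child statistics in this section could be gathered per sentence from the HEAD and DEPREL columns of a CoNLL-U file. The following is a minimal sketch under that assumption. The two annotated sentences in `SAMPLE` are hand-made illustrations (the first reuses the example sentence "Montoya ganó siete carreras ." with a hypothetical analysis), and `sentences` and `relation_statistics` are hypothetical names.

```python
from collections import Counter

# Hand-annotated illustration; the analyses are assumptions, not treebank data.
SAMPLE = """\
1\tMontoya\tMontoya\tPROPN\t_\t_\t2\tnsubj\t_\t_
2\tganó\tganar\tVERB\t_\t_\t0\troot\t_\t_
3\tsiete\tsiete\tNUM\t_\tNumType=Card\t4\tnummod\t_\t_
4\tcarreras\tcarrera\tNOUN\t_\t_\t2\tdobj\t_\t_
5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

1\tpelícula\tpelícula\tNOUN\t_\t_\t0\troot\t_\t_
2\tde\tde\tADP\t_\t_\t3\tcase\t_\t_
3\t1978\t1978\tNUM\t_\tNumType=Card\t1\tnmod\t_\t_
"""


def sentences(conllu_text):
    """Yield each sentence as a list of column lists (ordinary tokens only)."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword-token ranges and empty nodes
            sent.append(cols)
    if sent:
        yield sent


def relation_statistics(conllu_text, tag):
    """Count relations and POS tags of parents and children of `tag` nodes."""
    parent_rel, parent_pos = Counter(), Counter()
    child_rel, child_pos = Counter(), Counter()
    for sent in sentences(conllu_text):
        upos = {int(cols[0]): cols[3] for cols in sent}
        upos[0] = "ROOT"  # artificial root node, as in the tables above
        for cols in sent:
            head, deprel = int(cols[6]), cols[7]
            if cols[3] == tag:          # this node is a `tag` node
                parent_rel[deprel] += 1
                parent_pos[upos[head]] += 1
            if upos[head] == tag:       # this node is a child of a `tag` node
                child_rel[deprel] += 1
                child_pos[cols[3]] += 1
    return parent_rel, parent_pos, child_rel, child_pos


parent_rel, parent_pos, child_rel, child_pos = relation_statistics(SAMPLE, "NUM")
print(parent_rel, parent_pos, child_rel, child_pos)
```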
Treebank Statistics (UD_Spanish-AnCora)
There are 1532 NUM lemmas (5%), 1551 NUM types (4%) and 8767 NUM tokens (2%).
Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: dos, ciento, tres, cinco, cuatro, ambos, seis, 20, 30, siete
The 10 most frequent NUM types: dos, ciento, tres, cinco, cuatro, seis, 20, ambos, siete, 30
The 10 most frequent ambiguous lemmas: dos (NUM 940, DET 1), tres (NUM 446, NOUN 1), cinco (NUM 232, NOUN 3), cuatro (NUM 227, NOUN 1), seis (NUM 152, NOUN 1), 20 (NUM 133, SYM 6), 30 (NUM 104, SYM 1, NOUN 1), siete (NUM 102, NOUN 1), medio (NOUN 170, NUM 94, ADJ 38, ADV 7, PROPN 1), 10 (NUM 93, SYM 4, NOUN 2)
The 10 most frequent ambiguous types: dos (NUM 928, NOUN 1), tres (NUM 432, NOUN 1), cinco (NUM 231, NOUN 3), cuatro (NUM 221, NOUN 1), seis (NUM 152, NOUN 1), siete (NUM 102, NOUN 1), 30 (NUM 104, NOUN 1), 10 (NUM 88, NOUN 2), 12 (NUM 86, NOUN 1), ocho (NUM 71, NOUN 7)
- dos
- tres
  - NUM 432: Blackefer lleva tres años en la Red .
  - NOUN 1: Holt ( 1-0 ) , con la brillante labor en el montículo , logró dar descanso a los relevistas de los Astros que últimamente han trabajado mucho , a el cubrir la ruta en que regaló dos bases por bolas y ponchó a tres para registrar el primer triunfo de el año .
- cinco
- cuatro
- seis
- siete
  - NUM 102: Camargo se escapó a falta de siete kilómetros .
  - NOUN 1: El jugador portugués expresó no obstante su confianza en que el Deportivo no precisará de sus servicios para obtener un resultado satisfactorio en el partido de Liga que el próximo domingo , a partir de las siete y media de la tarde , disputará en Riazor ante el Oviedo .
- 30
- 10
- 12
- ocho
Morphology
The form / lemma ratio of NUM is 1.012402 (the average of all parts of speech is 1.501056).
The 1st highest number of forms (5) was observed with the lemma “4”: 4, 4A, 4B, 4to, cuatro.
The 2nd highest number of forms (4) was observed with the lemma “1”: 1, 1ro, un, uno.
The 3rd highest number of forms (3) was observed with the lemma “12”: 12, 12-M, km.12.
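The ranking of lemmas by number of forms can be sketched by grouping the distinct forms under each lemma and sorting by group size. The pairs below are the forms listed above for the lemmas "4", "1" and "12". The name `forms_per_lemma` is hypothetical, and breaking ties alphabetically by lemma is an assumption about how the page orders equal counts.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct forms by lemma and rank lemmas by number of forms."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    # Largest groups first; ties broken alphabetically by lemma (assumption).
    return sorted(forms.items(), key=lambda item: (-len(item[1]), item[0]))


# (form, lemma) pairs taken from the three lemmas listed above.
pairs = [
    ("4", "4"), ("4A", "4"), ("4B", "4"), ("4to", "4"), ("cuatro", "4"),
    ("1", "1"), ("1ro", "1"), ("un", "1"), ("uno", "1"),
    ("12", "12"), ("12-M", "12"), ("km.12", "12"),
]
ranking = forms_per_lemma(pairs)
for lemma, forms in ranking:
    print(lemma, len(forms), sorted(forms))
```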
NUM occurs with 4 features: es-feat/NumForm (5262; 60% instances), es-feat/NumType (4904; 56% instances), es-feat/Number (2866; 33% instances), es-feat/Gender (291; 3% instances)
NUM occurs with 7 feature-value pairs: Gender=Fem, Gender=Masc, NumForm=Digit, NumType=Card, NumType=Frac, Number=Plur, Number=Sing
NUM occurs with 12 feature combinations.
The most frequent feature combination is NumForm=Digit (3862 tokens).
Examples: 20, 30, 10, 15, 12, 18, 24, 16, 50, 17
Relations
NUM nodes are attached to their parents using 20 different relations: es-dep/nummod (5659; 65% instances), es-dep/compound (901; 10% instances), es-dep/nmod (423; 5% instances), es-dep/appos (420; 5% instances), es-dep/advmod (413; 5% instances), es-dep/dobj (308; 4% instances), es-dep/conj (304; 3% instances), es-dep/nsubj (232; 3% instances), es-dep/dep (30; 0% instances), es-dep/root (27; 0% instances), es-dep/mwe (11; 0% instances), es-dep/det (10; 0% instances), es-dep/acl (8; 0% instances), es-dep/advcl (6; 0% instances), es-dep/ccomp (4; 0% instances), es-dep/iobj (4; 0% instances), es-dep/parataxis (4; 0% instances), es-dep/case (1; 0% instances), es-dep/cop (1; 0% instances), es-dep/csubj (1; 0% instances)
Parents of NUM nodes belong to 13 different parts of speech: NOUN (5189; 59% instances), VERB (1148; 13% instances), NUM (888; 10% instances), DET (758; 9% instances), PROPN (306; 3% instances), ADV (209; 2% instances), ADJ (174; 2% instances), PRON (32; 0% instances), ROOT (27; 0% instances), ADP (20; 0% instances), AUX (12; 0% instances), CONJ (2; 0% instances), X (2; 0% instances)
5175 (59%) NUM nodes are leaves.
1642 (19%) NUM nodes have one child.
780 (9%) NUM nodes have two children.
1170 (13%) NUM nodes have three or more children.
The highest child degree of a NUM node is 20.
Children of NUM nodes are attached using 22 different relations: es-dep/case (1764; 23% instances), es-dep/det (1594; 21% instances), es-dep/compound (1127; 15% instances), es-dep/punct (956; 13% instances), es-dep/nmod (884; 12% instances), es-dep/cc (272; 4% instances), es-dep/conj (257; 3% instances), es-dep/amod (220; 3% instances), es-dep/advmod (176; 2% instances), es-dep/appos (143; 2% instances), es-dep/advcl (65; 1% instances), es-dep/cop (47; 1% instances), es-dep/nsubj (40; 1% instances), es-dep/name (26; 0% instances), es-dep/mark (16; 0% instances), es-dep/nummod (14; 0% instances), es-dep/acl (13; 0% instances), es-dep/aux (6; 0% instances), es-dep/parataxis (4; 0% instances), es-dep/dobj (2; 0% instances), es-dep/mwe (1; 0% instances), es-dep/neg (1; 0% instances)
Children of NUM nodes belong to 14 different parts of speech: ADP (1819; 24% instances), DET (1597; 21% instances), NOUN (1082; 14% instances), PUNCT (959; 13% instances), NUM (888; 12% instances), PRON (326; 4% instances), ADJ (254; 3% instances), CONJ (244; 3% instances), ADV (151; 2% instances), PROPN (142; 2% instances), VERB (84; 1% instances), AUX (48; 1% instances), SYM (19; 0% instances), SCONJ (15; 0% instances)