home it/pos edit page issue tracker

PROPN: proper noun

Definition

A proper noun is a noun that is the name (or part of the name) of a unique entity, be it an individual, a place, or an object.

Acronyms of proper nouns, such as UN and NATO, are also tagged as PROPN.

Corresponding language-specific part-of-speech tags

SP: Proper noun

Examples


Treebank Statistics (UD_Italian)

There are 5313 PROPN lemmas (27%), 5358 PROPN types (18%) and 13345 PROPN tokens (5%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Shakespeare, Balzac, Italia, Stati, Europa, San, Roma, Uniti, Albania, Marco

The 10 most frequent PROPN types: Shakespeare, Balzac, Italia, stati, Europa, San, Uniti, Albania, Marco, Roma

The 10 most frequent ambiguous lemmas: de (ADP 37, PROPN 5, X 3, DET 1), Stato (PROPN 40, NOUN 3), Germania (PROPN 28, NOUN 1), a (ADP 7046, NOUN 10, X 8, DET 3, ADV 2, CONJ 1, PROPN 1), Grande (PROPN 13, ADJ 1), Mondiali (PROPN 12, NOUN 2), Internazionale (PROPN 11, ADJ 1), Regione (PROPN 11, NOUN 1), C (PROPN 9, X 1), Ministro (PROPN 9, NOUN 1)

The 10 most frequent ambiguous types: stati (AUX 105, NOUN 46, VERB 25, PROPN 1), de (ADP 42, PROPN 5, X 2), Stato (NOUN 88, PROPN 40), Unione (PROPN 37, NOUN 6), europea (ADJ 31, PROPN 5), Camera (PROPN 23, NOUN 23), nazioni (NOUN 7, PROPN 1), Broglio (PROPN 20, NOUN 2), Nord (PROPN 20, NOUN 3), A (ADP 286, PROPN 19, DET 4)

Morphology

The form / lemma ratio of PROPN is 1.008470 (the average of all parts of speech is 1.491496).

The 1st highest number of forms (2) was observed with the lemma “Aids”: AIDS, Aids.

The 2nd highest number of forms (2) was observed with the lemma “As”: AS, As.

The 3rd highest number of forms (2) was observed with the lemma “Bogotà”: BOGOTÀ, Bogotà.

PROPN occurs with 3 features: it-feat/Degree (1; 0% instances), it-feat/Gender (1; 0% instances), it-feat/Number (1; 0% instances)

PROPN occurs with 3 feature-value pairs: Degree=Abs, Gender=Fem, Number=Plur

PROPN occurs with 3 feature combinations. The most frequent feature combination is _ (13343 tokens). Examples: Shakespeare, Balzac, Italia, stati, Europa, San, Uniti, Albania, Marco, Roma

Relations

PROPN nodes are attached to their parents using 15 different relations: it-dep/nmod (6497; 49% instances), it-dep/name (3125; 23% instances), it-dep/nsubj (1963; 15% instances), it-dep/conj (725; 5% instances), it-dep/dobj (357; 3% instances), it-dep/root (221; 2% instances), it-dep/appos (208; 2% instances), it-dep/nsubjpass (146; 1% instances), it-dep/xcomp (57; 0% instances), it-dep/parataxis (15; 0% instances), it-dep/vocative (13; 0% instances), it-dep/advcl (8; 0% instances), it-dep/ccomp (5; 0% instances), it-dep/acl:relcl (3; 0% instances), it-dep/csubj (2; 0% instances)

Parents of PROPN nodes belong to 17 different parts of speech: NOUN (4653; 35% instances), PROPN (4397; 33% instances), VERB (3585; 27% instances), PRON (251; 2% instances), ROOT (221; 2% instances), ADJ (180; 1% instances), ADV (17; 0% instances), NUM (15; 0% instances), AUX (5; 0% instances), INTJ (4; 0% instances), PUNCT (4; 0% instances), SYM (4; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), CONJ (2; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

4875 (37%) PROPN nodes are leaves.

3604 (27%) PROPN nodes have one child.

2496 (19%) PROPN nodes have two children.

2370 (18%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 38.

Children of PROPN nodes are attached using 30 different relations: it-dep/case (5118; 28% instances), it-dep/name (3192; 18% instances), it-dep/det (3013; 17% instances), it-dep/punct (2634; 15% instances), it-dep/nmod (1178; 7% instances), it-dep/conj (811; 4% instances), it-dep/cc (534; 3% instances), it-dep/amod (410; 2% instances), it-dep/appos (295; 2% instances), it-dep/acl:relcl (217; 1% instances), it-dep/nummod (165; 1% instances), it-dep/acl (163; 1% instances), it-dep/advmod (136; 1% instances), it-dep/cop (42; 0% instances), it-dep/nsubj (25; 0% instances), it-dep/parataxis (23; 0% instances), it-dep/advcl (18; 0% instances), it-dep/det:predet (16; 0% instances), it-dep/mark (11; 0% instances), it-dep/neg (11; 0% instances), it-dep/det:poss (9; 0% instances), it-dep/mwe (8; 0% instances), it-dep/ccomp (5; 0% instances), it-dep/aux (4; 0% instances), it-dep/discourse (3; 0% instances), it-dep/foreign (2; 0% instances), it-dep/vocative (2; 0% instances), it-dep/compound (1; 0% instances), it-dep/dep (1; 0% instances), it-dep/dobj (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: ADP (5082; 28% instances), PROPN (4397; 24% instances), DET (3038; 17% instances), PUNCT (2634; 15% instances), NOUN (980; 5% instances), CONJ (532; 3% instances), VERB (435; 2% instances), ADJ (430; 2% instances), NUM (241; 1% instances), ADV (194; 1% instances), PRON (48; 0% instances), SCONJ (11; 0% instances), SYM (8; 0% instances), X (7; 0% instances), AUX (4; 0% instances), PART (4; 0% instances), INTJ (3; 0% instances)


PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]