Dependencies
Note: nmod, neg, and punct appear in two places.
|
|
|
||||||||||||||||||||||||||||||||||||||||||||
|
|
|
||||||||||||||||||||||||||||||||||||||||||||
|
|
|
acl
: clausal modifier of noun
acl
stands for finite and non-finite clauses that modify a nominal. The acl
relation
contrasts with the advcl relation, which is used for adverbial clauses
that modify a predicate. The head of the acl
relation is the noun
that is modified, and the dependent is the head of the clause that
modifies the noun.
мужчина , о котором я говорил \n man , about whom I have-talked
acl(мужчина, говорил)
acl(man, have-talked)
This relation is also used for optional depictives. The adjective is taken to modify the nominal of which it provides a secondary predication. See u-dep/xcomp for further discussion of resultatives and depictives.
Алена вошла в комнату незамеченной . \n Alena entered to room unnoticed .
acl(Алена, незамеченной)
acl(Alena, unnoticed)
Он нарисовал свою жену обнаженной . \n He painted his wife naked .
acl(жену, обнаженной)
acl(wife, naked)
advcl
: adverbial clause modifier
An adverbial clause modifier is a clause which modifies a verb or other predicate (adjective, etc.), as a modifier not as a core complement. This includes things such as a temporal clause, consequence, conditional clause, purpose clause, etc. The dependent must be clausal (or else it is an advmod) and the dependent is the main predicate of the clause.
Если вы знаете , кто это сделал , вы должны сказать учителю . \n If you know , who this did , you have to-tell the-teacher .
advcl(должны, знаете)
advcl(have, know)
Он спешил , чтобы прийти вовремя . \n He rushed , in-order-to come in-time .
advcl(спешил, прийти)
advcl(rushed, come)
advmod
: adverbial modifier
An adverbial modifier of a word is a (non-clausal) adverb or adverbial phrase that serves to modify the meaning of the word.
We differentiate adverbials realized as adverbs (advmod
) and
adverbials realized by noun phrases or adpositional phrases
(nmod).
генетически модифицированная еда \n genetically modified food
advmod(модифицированная, генетически)
advmod(modified, genetically)
менее часто \n less often
advmod(часто, менее)
advmod(often, less)
amod
: adjectival modifier
An adjectival modifier of an NP is any adjectival phrase that serves to modify the meaning of the NP.
Василий ест куриную лапшу . \n Vasily eats chicken noodles .
amod(лапшу, куриную)
amod(noodles, chicken)
Первое впечатление было хорошим . \n First impression was positive .
amod(впечатление, Первое)
amod(impression, First)
nsubj(хорошим, впечатление)
nsubj(positive, impression)
Exception: if the modifying adjectival word is pronominal (i.e. tagged ru-pos/DET), the relation is det instead of amod
.
все эти ваши новые группы \n all these your new groups
amod(группы, новые)
amod(groups, new)
det(группы, ваши)
det(groups, your)
det(группы, эти)
det(groups, these)
det(группы, все)
det(groups, all)
appos
: appositional modifier
An appositional modifier of a noun is a nominal immediately following the first noun that serves to define or modify that noun. It includes parenthesized examples, as well as defining abbreviations in one of these structures.
Карл Четвертый , римский император и чешский король , царствовал в XIV веке . \n Charles IV , Roman Emperor and Czech king , ruled in 14th century .
appos(Карл, император)
appos(Charles, Emperor)
conj(император, король)
conj(Emperor, king)
Польская Социалистическая Партия ( ПСП ) \n Polish Socialist Party ( PSP )
appos(Партия, ПСП)
appos(Party, PSP)
There is a slight deviation from the universal standard in case of more than one appositive nominal.
Instead of attaching them all to the first noun, all the appositive modifiers are put together in coordination,
then attached as appos
to the modified noun.
This is done regardless whether the appositives are joined by a coordinating conjunction or just a comma.
Пришел Михаил , мой брат и кузен Давида . \n Came Mikhail , my brother and cousin of-David .
appos(Михаил, брат)
appos(Mikhail, brother)
conj(брат, кузен)
conj(brother, cousin)
Михаил , мой брат , кузен Давида , придет завтра . \n Mikhail , my brother , cousin of-David , will-come tomorrow .
appos(Михаил, брат)
appos(Mikhail, brother)
conj(брат, кузен)
conj(brother, cousin)
appos
is also used to link key-value pairs in addresses, signatures, etc. (see also the list label):
Иван Петров , тел . : 8(495)-000-11-22 , e-mail : petrov@mail.ru
name(Иван, Петров)
list(Иван, тел)
list(Иван, e-mail)
appos(тел, 8(495)-000-11-22-7)
appos(e-mail, petrov@mail.ru)
aux
: auxiliary
An auxiliary of a clause is a form of the auxiliary verb быть “to be” used to construct the periphrastic future tense or conditional.
Exception: Auxiliary verb used to construct the passive
voice is not labeled aux
but auxpass.
Note that besides aux
and auxpass
, the verb быть may also act as a copula
and as the main verb.
Завтра я буду выступать в университете . \n Tomorrow I will give-a-talk at the-university .
aux(выступать, буду)
aux(give-a-talk, will)
Если бы я пришел раньше , я бы не пропустил поезд . \n If had I come earlier , I would not have-missed the-train .
aux(пришел, бы-2)
aux(come, had)
aux(пропустил, бы-8)
aux(have-missed, would)
auxpass
: passive auxiliary
A passive auxiliary of a clause is a form of the auxiliary verb быть “to be” used to construct the periphrastic passive voice (in the past or future tense, in the infinitive, imperative or conditional mood).
Кеннеди был убит . \n Kennedy was killed .
auxpass(убит, был)
auxpass(killed, was)
Кеннеди будет убит . \n Kennedy will-be killed .
auxpass(убит, будет)
auxpass(killed, will-be)
Кеннеди не знал , что ему суждено быть убитым . \n Kennedy did-not-anticipate that his fate is to-be killed .
auxpass(убитым, быть)
auxpass(killed, to-be)
case
: case marking
Russian uses a combination of oblique morphological cases and prepositions to express semantic case.
Prepositions in UD are treated as dependents of the noun they attach to. The relation between the noun and the preposition is labeled case
.
В пятницу.Acc по дороге.Dat к деревне.Dat мы разговаривали o событии.Loc . \n On Friday during the-journey to the-village we talked about the-event .
case(пятницу.Acc, В)
case(Friday, On)
case(дороге.Dat, по)
case(the-journey, during)
case(деревне.Dat, к)
case(the-village, to)
case(событии.Loc, o)
case(the-event, about)
cc
: coordinating conjunction
For more on coordination, see the conj relation.
A cc
is the relation between the first conjunct and
the coordinating conjunction delimiting another conjunct.
(Note: different dependency grammars have different treatments of coordination.
We take the first conjunct as the head of the coordination.)
Он старый и мудрый мужчина. \n He-is old and wise man .
cc(старый, и)
cc(old, and)
A coordinating conjunction may also appear at the beginning of a
sentence. This is also called a cc
, and it depends on the root
predicate of the sentence.
(In fact there is a coordination that spans multiple sentences.
We cannot attach a word to the first conjunct because it is in another sentence.
Thus we attach it to the first conjunct available in the current sentence: its main predicate.)
И потом мы ушли . \n And then we left .
cc(ушли, И)
cc(left, And)
У нас есть яблоки , груши , апельсины и бананы . \n By us are apples , pears , oranges and bananas .
nsubj(есть, яблоки)
nsubj(are, apples)
conj(яблоки, груши)
conj(яблоки, апельсины)
conj(яблоки, бананы)
conj(apples, pears)
conj(apples, oranges)
conj(apples, bananas)
cc(яблоки, и)
cc(apples, and)
punct(яблоки, ,-5)
punct(яблоки, ,-7)
punct(apples, ,-17)
punct(apples, ,-19)
ccomp
: clausal complement
A clausal complement of a verb or adjective is a dependent clause which is a core argument. That is, it functions like an object of the verb, or adjective.
Он говорит , что она работает в музее . \n He says , that she works in a museum .
ccomp(говорит, работает)
ccomp(says, works)
Such clausal complements may be finite or nonfinite. However, if the subject of the clausal complement is controlled (that is, must be the same as the higher subject or object, with no other possible interpretation) the appropriate relation is xcomp.
Генерал приказал нам выйти . \n General ordered us to-leave .
ccomp(приказал, выйти)
ccomp(ordered, to-leave)
Мы начали писать тест . \n We started to-write test .
xcomp(начали, писать)
xcomp(started, to-write)
The key difference here is that, while it is possible to interpret the first
sentence to mean that the general will not be leaving, in the second
sentence it is clear that the subject of writing can only be we. This is
what distinguishes ccomp
and xcomp
.
compound
: compound
compound
is one of the relations in UD for compounding.
The rightmost numeral is the head, the other numerals are attached as its modifiers.
Это будет стоить максимум 5 тысяч рублей . \n It will cost at-most 5 thousand rubles .
nummod:gov(рублей, тысяч)
nummod:gov(rubles, thousand)
compound(тысяч, 5-5)
compound(thousand, 5-14)
Это будет стоить пятьдесят пять тысяч рублей . \n It will cost fifty five thousand rubles .
nummod:gov(рублей, тысяч)
nummod:gov(rubles, thousand)
compound(тысяч, пятьдесят)
compound(thousand, fifty)
compound(тысяч, пять)
compound(thousand, five)
conj
: conjunct
A conjunct is the relation between two elements connected by a
coordinating conjunction, such as and, or, or a comma or other punctuation.
We treat coordination asymmetrically in what is known as the Stanford style:
The head of the relation is the first
conjunct and all the other conjuncts depend on it via the conj
relation.
Он старый и мудрый мужчина . \n He-is old and wise man .
conj(старый, мудрый)
conj(old, wise)
У нас есть яблоки , груши , апельсины и бананы . \n By us are apples , pears , oranges and bananas .
nsubj(есть, яблоки)
nsubj(are, apples)
conj(яблоки, груши)
conj(яблоки, апельсины)
conj(яблоки, бананы)
conj(apples, pears)
conj(apples, oranges)
conj(apples, bananas)
cc(яблоки, и)
cc(apples, and)
punct(яблоки, ,-5)
punct(яблоки, ,-7)
punct(apples, ,-17)
punct(apples, ,-19)
Coordinate clauses are treated the same way as coordination of other constituent types:
Он пришел домой , помылся и пошел в кровать . \n He came home , showered-himself and went to bed .
conj(пришел, помылся)
conj(пришел, пошел)
conj(came, showered-himself)
conj(came, went)
punct(пришел, ,-4)
punct(came, ,-15)
cc(пришел, и)
cc(came, and)
See universal/conj for more details on various coordination-related issues.
cop
: copula
A copula is the relation between the nominal predicate and
the copular verb быть “to be” (or its variants бывал, был).
The verb становиться “to become”, despite being counted among copular verbs by some authors,
is not analyzed as cop
.
We normally take a copula as a dependent of its complement (the nominal predicate). The nominal predicate is usually a noun, an adjective or a participle.
Андрей был честным человеком . \n Andrew was an honest man .
cop(человеком, был)
cop(man, was)
csubj
: clausal subject
A clausal subject is a clausal syntactic subject of a clause, i.e., the subject is itself a clause. The governor of this relation might not always be a verb: when the verb is a copular verb, the root of the clause is the complement of the copular verb. The dependent is the main lexical verb or other predicate of the subject clause.
Забыть Ваню трудно . \n To-forget Vanya , it-is-hard .
csubj(трудно, забыть)
csubj(it-is-hard, To-forget)
csubjpass
: clausal passive subject
A clausal passive subject is a clausal syntactic subject of a passive clause.
Мне было сказано , чтобы я все хорошо взвесил . \n To-me it-has-been told , that I it very-well weigh .
csubjpass(сказано, взвесил)
csubjpass(told, weigh)
dep
: unspecified dependency
A dependency is labeled as dep
when a system is unable to
determine a more precise dependency relation between two words. This
may be because of a weird grammatical construction, a limitation in
software, a parser error, or
because of an unresolved long distance dependency.
А к чему – было не понятно . \n And for what – it-was not clear .
dep(понятно, чему)
dep(clear, what)
det
: determiner
The relation determiner (det
) holds between a nominal head and its determiner.
This relation is used for pronominal adjectival modifiers of noun phrases.
Non-pronominal adjectives are tagged ru-pos/ADJ and the relation is labeled amod.
Тот человек уже здесь . \n The man already is-here .
det(человек, Тот)
det(man, The)
Это моя ошибка . \n This-is my mistake .
det(ошибка, моя)
det(mistake, my)
Какая книга тебе нравится больше всего ? \n Which book to-you is-nice most of-all ?
det(книга, Какая)
det(book, Which)
NB: The relation of the pronominal quantifiers сколько, столько to their head is a subtype of the det
relation:
either ru-dep/det:numgov or ru-dep/det:nummod.
det:numgov
: pronominal quantifier governing the case of the noun
Pronominal quantifiers are labeled det:numgov
instead of det
because they normally do not agree with the quantified noun in case
(unlike non-quantifying determiners).
The quantifier requires the counted noun to be in its genitive form. The whole phrase (quantifier + noun) is treated as a singular neuter noun phrase and it can fill roles where nominative, accusative or vocative noun phrases are expected.
In UD the quantifier depends on the quantified noun and the det:numgov
label is used to preserve
the information about case conditions.
Сколько мужчин играло в карты ? \n How-many men played cards ?
det:numgov(мужчин, Сколько)
nsubj(играло, мужчин)
dobj(играло, карты)
case(карты, в)
punct(играло, ?-6)
det:numgov(men, How-many)
nsubj(played, men)
dobj(played, cards)
punct(played, ?-12)
See nummod for a broader discussion of the various situations with quantifiers.
det:nummod
: pronominal quantifier agreeing in case with the noun
Pronominal quantifiers are labeled det:nummod
or det:numgov instead of det
because they normally do not agree with the quantified noun in case
(unlike non-quantifying determiners).
They do agree only if the whole phrase (quantifier + noun) fills a role where
genitive, dative, locative or instrumental noun phrases are expected.
In these situations they are labeled det:nummod
.
Не помню , со сколькими мужчинами я играл в карты . \n Not I-remember , with how-many men I played cards .
neg(помню, Не)
ccomp(помню, играл)
punct(играл, ,-3)
nsubj(играл, я)
dobj(играл, карты)
case(карты, в)
iobj(играл, мужчинами)
case(мужчинами, со)
det:nummod(мужчинами, сколькими)
punct(помню, .-11)
neg(I-remember, Not)
ccomp(I-remember, played)
punct(played, ,-15)
nsubj(played, I)
dobj(played, cards)
iobj(played, men)
case(men, with)
det:nummod(men, how-many)
punct(I-remember, .-22)
See nummod for a broader discussion of the various situations with quantifiers.
discourse
: discourse element
At present we use the discourse
relation for what is labeled AuxO
in the Prague Dependency Treebank:
redundant or emotional items, redundant coreferential pronoun.
čemu že se to zpronevěřily \n what that themselves it they-betrayed
discourse(zpronevěřily, to)
discourse(they-betrayed, it)
(meaning “… what they betrayed”)
dislocated
: dislocated elements
This document is a placeholder for the language-specific documentation
for dislocated
.
dobj
: direct object
The direct object of a verb is the noun phrase that denotes the entity acted upon. Most often the direct object is in the accusative case but there are verbs that require their objects be in other cases (except for nominative and vocative).
Accusative example:
Я купил машину . \n I have-bought car .
dobj(купил, машину)
dobj(have-bought, car)
Genitive example:
Я не боюсь этого . \n I-am not afraid of this .
dobj(боюсь, этого)
dobj(afraid, this)
Dative example:
Но это не соответствует действительности . \n But this not corresponds to-reality .
dobj(соответствует, действительности)
dobj(corresponds, to-reality)
Instrumental example:
Президент руководит страной . \n The President rules the country .
dobj(руководит, страной)
dobj(rules, country)
In general, if there is just one object, it should be labeled dobj
,
regardless of the morphological case or semantic role that it bears. If there are two or more
objects, one of them should be dobj
and the others should be
iobj. In such cases it is necessary to decide what is the most
directly affected object (patient). The one exception is when there is a clausal complement. Then the clausal complement is regarded as a “clausal direct object” and an object nominal will be an iobj.
expl
: expletive: reflexive pronoun with an inherently reflexive verb
Reflexive pronouns (see the feature cs-feat/Reflex) usually replace objects of verbs. However, some verbs are inherently reflexive, i.e. the verb always occurs with a reflexive prounoun, and the pronoun cannot be replaced by a non-reflexive pronoun.
With these verbs, the reflexive pronoun is attached as expl
instead of dobj.
(Note that the expl
relation is first used for this purpose in the UD release 1.2,
to increase parallelism with other languages. In the previous releases this usage of
reflexive se/si was labeled compound:reflex
.)
Martin se bojí zvířat . \n Martin REFLEX fears animals .
expl(bojí, se)
expl(fears, REFLEX)
foreign
: foreign words
We use foreign
to label sequences of foreign words. These are given
a linear analysis: the head is the first token in the foreign phrase.
foreign
does not apply to loanwords or to foreign names.
It applies to quoted foreign text incorporated in a sentence/discourse
of the host language (unless we want to and know how to annotate the
internal structure according to the syntax of the foreign language).
Он придет , когда полетят свиньи ( When pigs fly ) .
foreign(When, pigs)
foreign(When, fly)
goeswith
: goes with
This relation links two parts of a word that are separated in text that is not well edited. The head is in some sense the “main” part, often the second part.
Он остановился чтобы о тдышаться .
goeswith(тдышаться, о)
iobj
: indirect object
The indirect object of a verb is any nominal phrase that is a core argument of the verb but is not its subject or direct object. The prototypical example is the recipient of ditransitive verbs of exchange:
Павел дал Петру две груши . \n Pavel gave Peter two pears .
iobj(дал, Петру)
iobj(gave, Peter)
However, Russian allows other semantic roles as additional objects. The morphological ru-feat/Case of the objects is dictated by verb valency.
In the following Russian example, the verb takes two arguments. One of them is direct object (patient), the other is indirect (addressee). It is parallel to how the English translation would be annotated (where there is no morphological case marking) and also to verbs of giving (cf. a similar sentence, he gave my daughter a class of maths).
Он преподает моей дочери математику . \n He teaches my daughter.Dat maths.Acc .
dobj(преподает, математику)
iobj(преподает, дочери)
dobj(teaches, maths.Acc)
iobj(teaches, daughter.Dat)
In general, if there is just one object, it should be labeled dobj, regardless of the morphological case or semantic role. For example, преподавать “to teach” can take either the subject matter or the recipient as the only object, and in both cases it would be analyzed as the dobj:
Он преподает математику . \n He teaches maths .
dobj(преподает, математику)
dobj(teaches, maths)
Он преподает студентам . \n He teaches students .
dobj(преподает, студентам)
dobj(teaches, students)
The one exception is when there is a clausal complement. Then the clausal complement is regarded as a “clausal direct object” and an object nominal will be an iobj, parallel to the simple ditransitive case:
Она сказала студентам , что сегодня вечером нужно учиться . \n She told students , that today evening they-needed to-study .
iobj(сказала, студентам)
ccomp(сказала, нужно)
iobj(told, students)
ccomp(told, they-needed)
Она рассказала студентам свой план . \n She told students her plan .
iobj(рассказала, студентам)
dobj(рассказала, план)
iobj(told, students)
dobj(told, plan)
If there are two or more objects, one of them should be dobj and the others should be iobj
. In such cases it is necessary to decide what is the most directly affected object (patient), and the Accusative ru-feat/Case has priority over the Dative, Instrumental, and Genitive cases.
list
: list
The list
relation is used for chains of comparable items.
In lists with more than two items, all items of the list should modify the first one.
Informal and web text often contains passages which are meant to be interpreted as lists but are parsed as single sentences.
Email signatures often contain these structures, in the form of contact information:
the different contact information items are labeled as list
; the key-value pair relations are labeled as appos.
However, list
should not be over-used. If a construction can be easily analyzed
using the grammatical relations of standard sentences, such as when there is overt coordination,
then it should be analyzed with these more standard relations, even if it is laid out as a list typographically.
Стив Джонс тел.: 555-9814 e-mail: jones@mail.ru
name(Стив-1, Джонс-2)
list(Стив-1, тел.:-3)
list(Стив-1, e-mail:-5)
appos(тел.:-3, 555-9814-4)
appos(e-mail:-5, jones@mail.ru-6)
mark
: marker
A marker is the subordinating conjunction introducing a finite clause subordinate to another clause. The mark is a dependent of the subordinate clause head.
Я не знал , что ты говоришь по-немецки . \n I didn't know , that you speak German .
mark(говоришь, что)
mark(speak, that)
Мы отправим товар , как только на наш счет поступят деньги . \n We will-dispatch goods , as soon-as at our account arrive money .
mark(поступят, как)
mwe(как, только)
mark(arrive, as)
mwe(as, soon-as)
Note that the subordinating conjunction should not be confused with relative pronouns and adverbs.
These fill a valency slot in the frame of the subordinate predicate,
and are labeled according to their role in the frame, they are not mark
:
Я бы хотел узнать , когда нам поступят деньги на счет . \n I would like to-know , when us arrive money at account .
advmod(поступят, когда)
advmod(arrive, when)
mwe
: multi-word expression
The multi-word expression (modifier) relation is one of the
relations for compounding and it is intended for function words.
In Russian it is used for multi-word prepositions.
The leftmost word is the head and all the other words are attached as its mwe
modifiers.
В отличие от тебя , мне нечего стыдиться . \n In contrast to you , I-have nothing to-be-ashamed .
mwe(В, отличие)
mwe(В, от)
case(тебя, В)
mwe(In, contrast)
mwe(In, to)
case(you, In)
Multi-word prepositions usually form a contiguous span of the sentence. In exceptional cases however, they can be interrupted by emphasizers and other words:
в сравнении , например , с долгом \n in comparison , for-example , to loans
mwe(в, сравнении)
mwe(в, с)
case(долгом, в)
advmod:emph(долгом, например)
mwe(in, comparison)
mwe(in, to)
case(loans, in)
advmod:emph(loans, for-example)
name
: name
name
is one of the relations for compounding in UD.
In Russian it is used to join the first name of a person with the patronymic and the last name.
The leftmost name is always the head and the other name(s) are attached to it.
The relation is not used to attach adjectives to nouns within multi-word names of places, organizations etc.
Президент Владимир Владимирович Путин вчера посетил Минеральные Воды . \n President Vladimir Vladimirovich Putin yesterday visited Mineralnye Vody .
name(Владимир, Владимирович)
name(Vladimir, Vladimirovich)
name(Владимир, Путин)
name(Vladimir, Putin)
nmod(Владимир, Президент)
nmod(Vladimir, President)
amod(Воды, Минеральные)
amod(Vody, Mineralnye)
neg
: negation modifier
The negation modifier is the relation between the negation word не and the word it modifies.
Negation in Russian is most of the time expressed using a bound morpheme (the prefix не-). Occurrences of the morpheme as a separate word are rare in comparison to other languages, yet they exist.
Она не ест рыбу . \n She not eats fish .
neg(ест, не)
neg(eats, not)
Ему был нужен четвероногий , а не двуногий друг . \n To-him was needed quadrupedal , and not bipedal friend .
neg(двуногий, не)
neg(bipedal, not)
Note that the negative determiners (pronouns) are attached as det, not neg
:
Премьер не был никаким дураком . \n The-Prime-Minister not was no fool .
det(дураком, никаким)
det(fool, no)
nmod
: nominal modifier
The nmod
relation is used for nominal modifiers. They depend either
on another noun (group “noun dependents”) or on a predicate (group
“non-core dependents of clausal predicates”).
nmod
is a noun (or noun phrase) functioning as a
non-core (oblique) argument or adjunct.
This means that it functionally corresponds to an adverbial when it attaches to a verb, adjective or other adverb.
But when attaching to a noun, it usually corresponds to a non-agreeing attribute in genitive.
In Russian the nmod
relation covers only those possessives that are expressed using the genitive.
If a possessive adjective is used, the relation is labeled amod.
If a possessive determiner (pronoun) is used, the relation is labeled det.
карта студента \n card of-student
nmod(карта, студента)
nmod(card, of-student)
студенческая карта \n student's card
amod(карта, студенческая)
amod(card, student's)
его карта \n his card
det(карта, его)
det(card, his)
nmod
is also used for temporal nominal modifiers:
Последний раз мы разговаривали зимой . \n Last time we talked in-winter .
nmod(разговаривали, зимой)
nmod(talked, in-winter)
Agreeing attribute of a noun
nmod
noun phrases attached to nouns are usually in the genitive ru-feat/Case and follow the modified noun.
However, there is also a different kind of nmod
that precedes the modified noun and agrees with it in case and number.
A typical example is a title attached to a name of a person. The relation is similar to the name relation
that links the first and the last name, but it is not labeled name
because the title is not part of the name:
российский президент Владимир Владимирович Путин \n Russian president Vladimir Vladimirovich Putin
amod(президент, российский)
amod(president, Russian)
nmod(Владимир, президент)
nmod(Vladimir, president)
name(Владимир, Путин)
name(Vladimir, Putin)
name(Владимир, Владимирович)
name(Vladimir, Vladimirovich)
Note that the same thing can be also expressed using an apposition. In the case of apposition, the title follows the modified name and is separated by a punctuation symbol:
Владимир Путин , российский президент
name(Владимир, Путин)
punct(президент, ,-3)
amod(президент, российский)
appos(Путин, президент)
nsubj
: nominal subject
A nominal subject is a NP which is the syntactic subject of a clause; in Russian, the phrase is in the Nominative Case.
(See csubj for when the subject is clausal. See nsubjpass and csubjpass for when the subject is not the proto-agent argument due to valence changing operations.)
The governor of the nsubj
relation might not always be a verb: when the verb is a copular verb, the root of the clause is the complement of the copular verb, which can be an adjective or noun.
Ленинградскую симфонию написал Шостакович . \n Leningrad Symphony wrote Shostakovich .
nsubj(написал, Шостакович)
nsubj(wrote, Shostakovich)
Автомобиль красный . \n Car is-red .
nsubj(красный, Автомобиль)
nsubj(is-red, Car)
nsubjpass
: passive nominal subject
A passive nominal subject is a noun phrase which is the syntactic subject of a passive clause.
Шварценберг был побеждён Земаном . \n Schwarzenberg was defeated by-Zeman .
nsubjpass(побеждён, Шварценберг)
nsubjpass(defeated, Schwarzenberg)
Reflexive passive (the meaning is “This will be solved tomorrow.”)
Это решится завтра . \n This will-be-solved tomorrow .
nsubjpass(решится, Это)
nsubjpass(will-be-solved, This)
nummod
: numeric modifier
A numeric modifier of a noun is any number phrase that serves to modify the meaning of the noun with a quantity.
Ян съел три стейка . \n Jan ate three steaks .
nummod(стейка, три)
nummod(steaks, three)
Agreement and government with Russian quantifiers
The morphological and syntactic behavior of Russian numerals is a complex matter. Small cardinal numerals один “one”, два “two”, три “three” and четыре “four” agree with the counted noun in case (один also agrees in gender and number; два also agrees in gender). They behave as if they modify the counted noun; they are similar to adjectives in this respect. Examples:
- Один мужчина спал, два мужчины играли в карты. “One man slept, two men played cards.”
- Одна женщина спала, две женщины играли в карты. “One woman slept, two women played cards.”
- Одно чудовище спало, два чудовища играли в карты. “One monster slept, two monsters played cards.”
It is straightforward to convert such dependencies to nummod
:
Одно чудовище спало . \n One monster slept .
nummod(чудовище, Одно)
nsubj(спало, чудовище)
punct(спало, .-4)
nummod(monster, One)
nsubj(slept, monster)
punct(slept, .-9)
The numeral does not govern the noun in all morphological cases. The following table shows the case of the whole phrase (numeral + noun; first column) and the consequences for the case of the parts (note that these numerals have only three distinct morphological forms, resulting in homonymy).
Phrase Case | Example | Numeral Case | Noun Case |
---|---|---|---|
Nom | пять мужчин | Nom | Gen |
Gen | пяти мужчин | Gen | Gen |
Dat | пяти мужчинам | Dat | Dat |
Acc | пять мужчин | Acc | Gen |
Voc | пять мужчин | Voc | Gen |
Loc | пяти мужчинах | Loc | Loc |
Ins | пятью мужчинами | Ins | Ins |
We can say that the noun has the case of the whole phrase if it is dative, locative or instrumental. The numeral then agrees with the noun in case. The numeral forces the noun to the genitive case if the whole phrase is nominative, accusative or vocative (but the vocative usage is rather hypothetical). In genitive, the noun and the numeral agree with each other; but note that the numeral uses its inflected form, as in the other cases where it agrees with the noun.
Pronominal quantifiers behave as high-value numerals and govern the quantifed nouns:
- Сколько мужчин играло в карты? “How many men played cards?”
- Несколько (много, мало) мужчин играло в карты. “Several (many, few) men played cards.”
- Столько мужчин, играющих в карты, я никогда не видел. “I have never seen so many men playing cards.”
The UD conversion uses a structure that is parallel among all the above cases, and also with universal dependencies in other languages. The counted noun is always the head and the numeral is always attached as its modifier. Nevertheless, we use different relation labels to mark situations where the numeral (or quantifier) actually governs the morphological case of the noun. There are four labels used:
Numeric | Pronominal | |
Noun governs | nummod | det:nummod |
Numeral governs | nummod:gov | det:numgov |
Три мужчины играли в карты . \n Three men played cards .
nummod:gov(мужчины, Три)
nsubj(играли, мужчины)
dobj(играли, карты)
case(карты, в)
punct(играли, .-6)
nummod:gov(men, Three)
nsubj(played, men)
dobj(played, cards)
punct(played, .-12)
Пять мужчин играло в карты . \n Five men played cards .
nummod:gov(мужчин, Пять)
nsubj(играло, мужчин)
dobj(играло, карты)
case(карты, в)
punct(играло, .-6)
nummod:gov(men, Five)
nsubj(played, men)
dobj(played, cards)
punct(played, .-12)
Сколько мужчин играло в карты ? \n How-many men played cards ?
det:numgov(мужчин, Сколько)
nsubj(играло, мужчин)
dobj(играло, карты)
case(карты, в)
punct(играло, ?-6)
det:numgov(men, How-many)
nsubj(played, men)
dobj(played, cards)
punct(played, ?-12)
Играл я в карты с пятью мужчинами . \n Played I cards with five men .
nsubj(Играл, я)
dobj(Играл, карты)
case(карты, в)
iobj(Играл, мужчинами)
case(мужчинами, с)
nummod(мужчинами, пятью)
punct(Играл, .-8)
nsubj(Played, I)
dobj(Played, cards)
iobj(Played, men)
case(men, with)
nummod(men, five)
punct(Played, .-16)
Не помню , со сколькими мужчинами я играл в карты . \n Not I-remember , with how-many men I played cards .
neg(помню, Не)
ccomp(помню, играл)
punct(играл, ,-3)
nsubj(играл, я)
dobj(играл, карты)
case(карты, в)
iobj(играл, мужчинами)
case(мужчинами, со)
det:nummod(мужчинами, сколькими)
punct(помню, .-11)
neg(I-remember, Not)
ccomp(I-remember, played)
punct(played, ,-15)
nsubj(played, I)
dobj(played, cards)
iobj(played, men)
case(men, with)
det:nummod(men, how-many)
punct(I-remember, .-22)
Additional remarks
In the typical case, million is tagged as a NOUN in genitive, it is preceded by a smaller number, and it is not followed by smaller numerals (as it is in million five hundred thousand). It is followed by the counted noun and gets the following analysis:
50 миллионов крон \n 50 millions of-crowns
nummod:gov(миллионов, 50-1)
nummod:gov(millions, 50-5)
nmod(миллионов, крон)
nmod(millions, of-crowns)
On the other hand the word тысяча “thousand” may be a noun (на площади были тысячи людей “there were thousands of people in the square”) or a numeral:
максимум 50 тысяч крон \n at-most 50 thousand crowns
advmod:emph(крон, максимум)
nummod:gov(крон, тысяч)
compound(тысяч, 50-2)
advmod:emph(crowns, at-most)
nummod:gov(crowns, thousand)
compound(thousand, 50-7)
Note that the two numeral words in the above example are joined using the compound relation. Also note that the intensifier максимум is attached to the head of the phrase (крон) and not to the number.
Similarly there may be other nodes (such as punctuation) that are attached to the head of the phrase and they are related to the whole phrase rather than directly to the head noun:
( 9 дней ) \n ( 9 days )
punct(дней, (-1)
nummod:gov(дней, 9-2)
punct(дней, )-4)
punct(days, (-6)
nummod:gov(days, 9-7)
punct(days, )-9)
5 минут с установкой \n 5 minutes including adjustment
nummod:gov(минут, 5-1)
nmod(минут, установкой)
case(установкой, с)
nummod:gov(minutes, 5-6)
nmod(minutes, adjustment)
case(adjustment, including)
Dates
Директор предложил расформировать персонал к 31 . 12 . \n Director proposed to-disband the-staff towards 31 st December .
advmod(расформировать, 12)
case(12, к)
punct(12, .-9)
nummod(12, 31-6)
punct(31-6, .-7)
advmod(to-disband, December)
case(December, towards)
punct(December, .-19)
nummod(December, 31-16)
punct(31-16, st)
Numbered objects
House number in address is attached as nummod
to the name of the street:
в доме на Тверской 12 \n in the-house at Tverskaya 12
nmod(доме, Тверской-4)
case(Тверской-4, на-3)
nummod(Тверской-4, 12-5)
nmod(the-house, Tverskaya)
case(Tverskaya, at)
nummod(Tverskaya, 12-11)
nummod:gov
: numeric modifier governing the case of the noun
nummod:gov
differs from nummod
in that the numeral requires the counted noun to be in its genitive form.
The whole phrase (numeral + noun) is treated as a singular neuter noun phrase
and it can fill roles where nominative, accusative or vocative noun phrases are expected.
In UD the numeral is annotated as dependent of the noun and the nummod:gov
label is used to preserve
the information about case conditions.
Пять мужчин играло в карты . \n Five men played cards .
nummod:gov(мужчин, Пять)
nsubj(играло, мужчин)
dobj(играло, карты)
case(карты, в)
punct(играло, .-6)
nummod:gov(men, Five)
nsubj(played, men)
dobj(played, cards)
punct(played, .-12)
See nummod for a broader discussion of the various situations with quantifiers.
parataxis
: parataxis
The parataxis relation (from Greek for “place side by side”) is a relation between a word (often the main predicate of a sentence) and other elements, such as a sentential parenthetical or a clause after a “:” or a “;”, placed side by side without any explicit coordination, subordination, or argument relation with the head word. Parataxis is a discourse-like equivalent of coordination, and so usually obeys an iconic ordering. Hence it is normal for the first part of a sentence to be the head and the second part to be the parataxis dependent, regardless of the headedness properties of the language. But things do get more complicated, such as cases of parentheticals, which appear medially.
An inventory of constructions to which parataxis has been applied
Treatment of reported speech
For this reported speech example:
Тот парень , сказал Вася , ушёл рано утром . \n The guy , said Vasya , left early in-the-morning .
parataxis(ушёл, сказал)
parataxis(left, said)
There are paraphrases that convey essentially the same meaning but with a different syntactic structure. When the reported speech is embedded in a subordinate clause (with or without an overt complementizer что “that”), the subordinate clause is a ccomp of the speech verb. When the reported speech follows the speech verb and is separated by a colon, the reported speech forms a main clause that attaches to the preceding main clause with a parataxis relation, hence with the speech verb as its head. However, when the speech verb occurs as a medial or final parenthetical, the relation is reversed and the speech verb is treated as a parataxis of the reported speech. This analysis is not uncontroversial but follows many authorities, such as Huddleston and Pullum (2002), The Cambridge Grammar of the English Language (see chapter 11, section 9).
Вася сказал , что тот парень ушёл рано утром . \n Vasya said , that the guy left early in-the-morning .
ccomp(сказал, ушёл)
ccomp(said, left)
Вася сказал : " Тот парень ушёл рано утром . " \n Vasya said : " The guy left early in-the-morning . "
parataxis(сказал, ушёл)
parataxis(said, left)
" Тот парень ушёл рано утром , " сказал Вася . \n " The guy left early in-the-morning , " said Vasya .
parataxis(ушёл, сказал)
parataxis(left, said)
" Тот парень , " — сказал Вася , — " ушёл рано утром . " \n " The guy , " — said Vasya , — " left early in-the-morning . "
parataxis(ушёл, сказал)
parataxis(left, said)
An argument for this analysis is that in the cases analyzed as embedding, the entire clause can be further embedded (I was taken aback when John said the guy left early in the morning.), while this is not possible with medial or final placement of the speech verb (*I was taken aback when the guy left early this morning, John said.).
News article bylines
The parataxis
relation should be used to connect the parts of a news article byline.
There does not seem to be a better relation to use.
Вашингтон ( CNN ) :
parataxis(Вашингтон, CNN)
punct
: punctuation
This is used for any piece of punctuation in a clause.
Иди домой ! \n Go home !
punct(Иди, !-3)
punct(Go, !-7)
Tokens with the relation punct
always attach to content words (except in cases of ellipsis) and can never have dependents.
Since punct
is not a normal dependency relation, the usual criteria for determining the head word do not apply.
Instead, we use the following principles:
- A punctuation mark separating coordinated units is attached to the first conjunct.
- A punctuation mark preceding or following a subordinated unit is attached to this unit.
- Within the relevant unit, a punctuation mark is attached at the highest possible node that preserves projectivity.
- Paired punctuation marks (quotes and brackets) should be attached to the same word unless that would create non-projectivity. This word is usually the head of the phrase enclosed in the paired punctuation.
Мы покупаем яблоки , груши , апельсины и бананы . \n We buy apples , pears , oranges and bananas .
dobj(покупаем, яблоки)
dobj(buy, apples)
conj(яблоки, груши)
conj(яблоки, апельсины)
conj(яблоки, бананы)
conj(apples, pears)
conj(apples, oranges)
conj(apples, bananas)
cc(яблоки, и)
cc(apples, and)
punct(яблоки, ,-4)
punct(яблоки, ,-6)
punct(apples, ,-15)
punct(apples, ,-17)
Мужчина , которого ты вчера видел , пришёл снова . \n Man , whom you-have yesterday seen , came again .
punct(видел, ,-2)
punct(seen, ,-13)
punct(видел, ,-7)
punct(seen, ,-18)
punct(пришёл, .-10)
punct(came, .-21)
Аббревиатура напр . означает " например " . \n The-abbreviation eg . means " for-example " .
punct(напр, .-3)
punct(eg, .-12)
punct(например, "-5)
punct(for-example, "-14)
punct(например, "-7)
punct(for-example, "-16)
punct(означает, .-8)
punct(means, .-17)
remnant
: remnant in ellipsis
The remnant
relation is used to provide a satisfactory treatment of certain instances of
ellipsis (in the case of gapping and stripping, where a predicational or verbal
head gets elided).
Павел заказал говядину , а Мария свинину . \n Pavel ordered beef and Maria pork .
nsubj(заказал, Павел)
nsubj(ordered, Pavel)
dobj(заказал, говядину)
dobj(ordered, beef)
remnant(Павел, Мария)
remnant(Pavel, Maria)
remnant(говядину, свинину)
remnant(beef, pork)
reparandum
: overridden disfluency
We use reparandum
to indicate disfluencies overridden in a speech
repair. The disfluency is the dependent of the repair.
Идите напра- налево . \n Go to-the-righ- to-the-left .
reparandum(налево, напра-)
reparandum(to-the-left, to-the-righ-)
root
: root
The root
grammatical relation points to the root of the sentence. A
fake node ROOT
is used as the governor. The ROOT
node is indexed
with 0, since the indexing of real words in the sentence starts at 1.
ROOT Люблю индийскую еду . \n ROOT I-love Indian food .
root(ROOT-1, Люблю)
root(ROOT-7, I-love)
There is just one node with the root
dependency relation in every
tree. If the main predicate is not present (due to
ellipsis)
and there are
multiple orphaned dependents, the leftmost dependent is promoted to
the head (root) position and the other orphans are attached to it.
An example sentence-like segment that lacks the main verb: И что он на это ? “And what [does] he [say] to it?”
ROOT И что он на это ? \n ROOT And what he to it ?
root(ROOT-1, И)
root(ROOT-9, And)
dobj(И, что)
dobj(And, what)
nmod(И, это)
nmod(And, it)
case(это, на)
case(it, to)
nsubj(И, он)
nsubj(And, he)
punct(И, ?-7)
punct(And, ?-15)
vocative
: vocative
The vocative
relation is used to mark dialogue participant addressed in text.
The relation links the addressee’s name to its host sentence.
In Russian, the addressee’s name can also appear in the vocative Case form.
Мужчина , Вы дурак ! \n Sir , you-are a-fool !
vocative(дурак, Мужчина)
vocative(a-fool, Sir)
xcomp
: open clausal complement
An open clausal complement (xcomp
) of a verb or an adjective is a
predicative or clausal complement without its own subject. The
reference of the subject is necessarily determined by an argument
external to the xcomp (normally by the subject of the next higher
clause). This is often referred to as obligatory control.
These complements are always non-finite, and they are
complements (arguments of the higher verb or adjective) rather than
adjuncts/modifiers, such as a purpose clause. The name xcomp
is
borrowed from Lexical-Functional Grammar.
Хочешь плавать ? \n Do-you-want to-swim ?
xcomp(Хочешь, плавать)
xcomp(Do-you-want, to-swim)
Я начал там работать вчера . \n I started there to-work yesterday .
xcomp(начал, работать)
xcomp(started, to-work)
Note that the above condition “without its own subject” does not mean that a
clause is an xcomp
just because its subject is not overt.
The subject must be necessarily inherited from a fixed position in the higher clause.
That is, there should be no available interpretation where the subject of the lower clause may be distinct
from the specified role of the upper clause.
In cases where the missing subject may or must be distinct from a fixed role in the higher clause,
ccomp should be used instead, as below. This includes cases of arbitrary subjects and anaphoric control.
Начальник приказал выкопать ров . \n Boss ordered to-dig ditch .
ccomp(приказал, выкопать)
ccomp(ordered, to-dig)
Pro-drop languages have clauses where the subject is not present as a separate word,
yet it is inherently present (and often deducible from the form of the verb)
and it does not depend on arguments from a higher clause.
Thus in neither of the following two Russian examples is there any overt subject,
yet only the second example contains an xcomp
.
Пишу , потому что я это обещал . \n I-write , because that I-have it promised .
advcl(Пишу, обещал)
advcl(I-write, promised)
Я обещал писать . \n I have-promised to-write .
xcomp(обещал, писать)
xcomp(have-promised, to-write)