Introduction

The Norwegian UD treebank is based on the Bokmål section of the Norwegian Dependency Treebank (NDT), a syntactic treebank of Norwegian. NDT was developed from 2011 to 2014 at the National Library of Norway in collaboration with the Text Laboratory and the Department of Informatics at the University of Oslo. NDT was automatically converted to the UD scheme by Lilja Øvrelid at the University of Oslo.

Acknowledgements

Thanks to Petter Hohle for creating the data splits (train/dev/test). Thanks also to the annotators and other contributors to the original NDT treebank: Per Erik Solberg, Kari Kinn, Pål Kristian Eriksen, Arne Skjærholt, Kristin Hagen and Janne Bondi Johannessen.

References

Kristin Hagen, Janne Bondi Johannessen and Anders Nøklestad. 2000. “A Constraint-based Tagger for Norwegian”. Proceedings of the 17th Scandinavian Conference in Linguistics.

Kari Kinn, Per Erik Solberg and Pål Kristian Eriksen. “NDT Guidelines for Morphological Annotation”. National Library Tech Report.

Per Erik Solberg, Arne Skjærholt, Lilja Øvrelid, Kristin Hagen and Janne Bondi Johannessen. 2014. “The Norwegian Dependency Treebank”. Proceedings of LREC 2014, Reykjavik.