Show simple item record

dc.contributor.authorOepen, Stephan
dc.contributor.authorToutanova, Kristina
dc.contributor.authorShieber, Stuart
dc.contributor.authorManning, Christopher
dc.contributor.authorFlickinger, Dan
dc.contributor.authorBrants, Thorsten
dc.date.accessioned2008-11-10T20:03:08Z
dc.date.issued2002
dc.identifier.citationStephan Oepen, Kristina Toutanova, Stuart M. Shieber, Christopher Manning, Dan Flickinger, and Thorsten Brants. The LinGO redwoods treebank: Motivation and preliminary applications. In Proceedings of the Eighteenth International Conference on Computational Linguistics, Taipei, Taiwan, 2002.en
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:2252613
dc.description.abstractThe LinGO Redwoods initiative is a seed activity in the design and development of a new type of treebank. While several medium- to large-scale treebanks exist for English (and for other major languages), pre-existing publicly available resources exhibit the following limitations: (i) annotation is mono-stratal, either encoding topological (phrase structure) or tectogrammatical (dependency) information, (ii) the depth of linguistic information recorded is comparatively shallow, (iii) the design and format of linguistic representation in the treebank hard-wires a small, predefined range of ways in which information can be extracted from the treebank, and (iv) representations in existing treebanks are static and over the (often year- or decade-long) evolution of a large-scale treebank tend to fall behind the development of the field. LinGO Redwoods aims at the development of a novel treebanking methodology, rich in nature and dynamic both in the ways linguistic data can be retrieved from the treebank in varying granularity and in the constant evolution and regular updating of the treebank itself. Since October 2001, the project is working to build the foundations for this new type of treebank, to develop a basic set of tools for treebank construction and maintenance, and to construct an initial set of 10,000 annotated trees to be distributed together with the tools under an open-source license.en
dc.description.sponsorshipEngineering and Applied Sciencesen
dc.language.isoen_USen
dc.publisherAssociation for Computing Machineryen
dc.relation.isversionofhttp://doi.acm.org/1071884.1071909en
dash.licenseLAA
dc.titleThe LinGO redwoods treebank: Motivation and preliminary applicationsen
dc.relation.journalProceedings of the Eighteenth International Conference on Computational Linguisticsen
dash.depositing.authorShieber, Stuart
dc.identifier.doi10.3115/1071884.1071909
dash.identifier.orcid0000-0002-7733-8195*
dash.contributor.affiliatedGoodridge, Andrew


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record