Rich Linguistic Structure from Large-Scale Web Data

DSpace/Manakin Repository

Rich Linguistic Structure from Large-Scale Web Data

Citable link to this page

 

 
Title: Rich Linguistic Structure from Large-Scale Web Data
Author: Yamangil, Elif
Citation: Yamangil, Elif. 2013. Rich Linguistic Structure from Large-Scale Web Data. Doctoral dissertation, Harvard University.
Full Text & Related Files:
Abstract: The past two decades have shown an unexpected effectiveness of Web-scale data in natural language processing. Even the simplest models, when paired with unprecedented amounts of unstructured and unlabeled Web data, have been shown to outperform sophisticated ones. It has been argued that the effectiveness of Web-scale data has undermined the necessity of sophisticated modeling or laborious data set curation. In this thesis, we argue for and illustrate an alternative view, that Web-scale data not only serves to improve the performance of simple models, but also can allow the use of qualitatively more sophisticated models that would not be deployable otherwise, leading to even further performance gains.
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA
Citable link to this page: http://nrs.harvard.edu/urn-3:HUL.InstRepos:11181110
Downloads of this work:

Show full Dublin Core record

This item appears in the following Collection(s)

 
 

Search DASH


Advanced Search
 
 

Submitters