Towards robust context-sensitive sentence alignment for monolingual corpora

DSpace/Manakin Repository

Towards robust context-sensitive sentence alignment for monolingual corpora

Citable link to this page

. . . . . .

Title: Towards robust context-sensitive sentence alignment for monolingual corpora
Author: Shieber, Stuart; Nelken, Rani

Note: Order does not necessarily reflect citation order of authors.

Citation: Rani Nelken and Stuart M. Shieber. Towards robust context-sensitive sentence alignment for monolingual corpora. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-06), Trento, Italy, 3-7 April 2006.
Full Text & Related Files:
Abstract: Aligning sentences belonging to comparable monolingual corpora has been suggested as a first step towards training text rewriting algorithms, for tasks such as summarization or paraphrasing. We present here a new monolingual sentence alignment algorithm, combining a sentence-based TF*IDF score, turned into a probability distribution using logistic regression, with a global alignment dynamic programming algorithm. Our approach provides a simpler and more robust solution achieving a substantial improvement in accuracy over existing systems.
Published Version: http://www.aclweb.org/anthology-new/E/E06/E06-1021.pdf
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA
Citable link to this page: http://nrs.harvard.edu/urn-3:HUL.InstRepos:2252597

Show full Dublin Core record

This item appears in the following Collection(s)

  • FAS Scholarly Articles [7106]
    Peer reviewed scholarly articles from the Faculty of Arts and Sciences of Harvard University
 
 

Search DASH


Advanced Search
 
 

Submitters