Towards robust context-sensitive sentence alignment for monolingual corpora
MetadataShow full item record
CitationRani Nelken and Stuart M. Shieber. Towards robust context-sensitive sentence alignment for monolingual corpora. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-06), Trento, Italy, 3-7 April 2006.
AbstractAligning sentences belonging to comparable monolingual corpora has been suggested as a first step towards training text rewriting algorithms, for tasks such as summarization or paraphrasing. We present here a new monolingual sentence alignment algorithm, combining a sentence-based TF*IDF score, turned into a probability distribution using logistic regression, with a global alignment dynamic programming algorithm. Our approach provides a simpler and more robust solution achieving a substantial improvement in accuracy over existing systems.
Citable link to this pagehttp://nrs.harvard.edu/urn-3:HUL.InstRepos:2252597
- FAS Scholarly Articles