Comma restoration using constituency information

DSpace/Manakin Repository

Comma restoration using constituency information

Citable link to this page


Title: Comma restoration using constituency information
Author: Tao, Xiaopeng; Shieber, Stuart ORCID  0000-0002-7733-8195

Note: Order does not necessarily reflect citation order of authors.

Citation: Stuart M. Shieber and Xiaopeng Tao. Comma restoration using constituency information. In Proceedings of the 2003 Human Language Technology Conference and Conference of the North American Chapter of the Association for Computational Linguistics, pages 221-227, Edmonton, AB, Canada, 2003.
Full Text & Related Files:
Abstract: Automatic restoration of punctuation from unpunctuated text has application in improving the fluency and applicability of speech recognition systems. We explore the possibility that syntactic information can be used to improve the performance of an HMM-based system for restoring punctuation (specifically, commas) in text. Our best methods reduce sentence error rate substantially - by some 20%, with an additional 8% reduction possible given improvements in extraction of the requisite syntactic information.
Published Version:
Terms of Use: This article is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at
Citable link to this page:
Downloads of this work:

Show full Dublin Core record

This item appears in the following Collection(s)


Search DASH

Advanced Search