Publication:

Pointing Isn't Always Rude: Using Pointer Networks to Improve Word Prediction in a Language Model

Date

2018-06-29

The Harvard community has made this article openly available.

Abstract

One of the problems in natural language generation is that of rare and unknown words. Pointer networks present a method to mitigate this problem by allowing models to reference words in the source text and copy them directly. In this paper I propose applying a pointer mechanism to the log-bilinear language model and analyze its effects relative to the original model. The results show that while the pointer mechanism improves the log-bilinear model's performance on a smaller dataset, it does not improve results on a larger dataset. I therefore present theories for these results and suggest further work to address the shortcomings.

Keywords

Computer Science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service
