Neural Network Models for Hate Speech Classification in Tweets
Abstract
The increase in hate speech on social media in recent years calls for improved detection methods. While traditional techniques rely on manually monitoring hate speech, there is a growing interest in applying machine learning to text classification. Improvements in hate speech classification would have important implications as social media companies such as Twitter, Facebook, and Reddit begin to enforce hate speech regulations. Linear classifiers, support vector machines, and neural networks have shown promising results in hate speech classification, and we expand on research done on a Twitter dataset of 16K annotated tweets to incorporate metadata such as retweet and favorite counts on tweets, as well as user follower and friend counts to improve classification accuracy by 2% on convolutional and recurrent neural network models. We also train hate speech-specific word embeddings to capture the code words appropriated by hate speech culprits to target specific groups of people. Task-specific word embeddings show an additional 2% increase in accuracy on hate speech classification.Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAACitable link to this page
http://nrs.harvard.edu/urn-3:HUL.InstRepos:38811552
Collections
- FAS Theses and Dissertations [6136]
Contact administrator regarding this item (to report mistakes or request changes)