Publication:

Neural Network Models for Hate Speech Classification in Tweets

Loading...
Thumbnail Image

Date

2018-06-29

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Abstract

The increase in hate speech on social media in recent years calls for improved detection methods. While traditional techniques rely on manually monitoring hate speech, there is a growing interest in applying machine learning to text classification. Improvements in hate speech classification would have important implications as social media companies such as Twitter, Facebook, and Reddit begin to enforce hate speech regulations. Linear classifiers, support vector machines, and neural networks have shown promising results in hate speech classification, and we expand on research done on a Twitter dataset of 16K annotated tweets to incorporate metadata such as retweet and favorite counts on tweets, as well as user follower and friend counts to improve classification accuracy by 2% on convolutional and recurrent neural network models. We also train hate speech-specific word embeddings to capture the code words appropriated by hate speech culprits to target specific groups of people. Task-specific word embeddings show an additional 2% increase in accuracy on hate speech classification.

Description

Other Available Sources

Research Data

Keywords

Computer Science, Artificial Intelligence

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories