-
Notifications
You must be signed in to change notification settings - Fork 1
/
Abstract.Rmd
27 lines (21 loc) · 2.41 KB
/
Abstract.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
title: "Decrypting the Alt Right"
subtitle: "Using Natural Language Processing to Identify Alt Right Commentary on Facebook"
author: "Fabio Votta^[Fabio Votta is a graduate Political Science student at the University of Stuttgart and is a tutor for statistics with *R*. Contact: [fabio.votta@gmail.com](mailto:fabio.votta@gmail.com)] and Simon Roth^[Simon Roth is a graduate Political Science student at the University of Stuttgart and Junior Data Scientist at Paraboost. Contact: [nomis.roth@gmx.net](mailto:nomis.roth@gmx.net)]"
date: "10 December 2017"
output: pdf_document
header-includes:
- \usepackage{microtype}
- \usepackage{lmodern}
- \usepackage[ngerman, english]{babel}
---
**Keywords:** *Alt Right, Online Extremism, Natural Language Processing, Sentiment Analysis, Topic Modeling*
\pagenumbering{gobble}
\vspace{.5cm}
This paper aims to track "Alt Right" discourse on Facebook and extract psychometric and linguistic features of their online commentary. More specifically, we identify content that aligns with Alt Right narratives in the Facebook comment sections of relevant media outlets.
As a first step, we gathered 1.3 Million Facebook comments from several Alt Right figureheads (e.g. Milo Yiannopolis, Paul Joseph Watson and others). After text cleaning procedures, we applied sentiment analysis to identify hateful comments, leaving us with a dataset of polarized commentaries. Moreover, we used topic modeling to pinpoint Alt Right related themes (for example "anti-feminism" and "anti-white racism") resulting in a text corpus in which the relevant Alt Right narratives were successfully isolated.
In a last step, we collected 1.1 million Facebook comments on several media outlets (CNN, The New York Times, The Washington Post, ABC, Breitbart and Fox News) in order to compare them to our cleaned Alt Right corpus. We estimated the cosine similarity between the document term matrices to unravel the prevalence of Alt Right discourse on media outlets. We find that Alt Right narratives are mostly found on Fox News and Breitbart, but they can also be tracked on mainstream media outlets such as CNN and The Washington Post. Our results provide a first methodological attempt at classifying Alt Right content and its spread into mainstream media channels, which could be applied across different domains in the future.
\vspace{.5cm}
\begin{center}
\includegraphics[width=0.75\textwidth]{ab_cosine_similarity.png}
\end{center}