Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp012f75rb649
Title: Constructing and Deconstructing an Empowerment-Offensiveness Classifier: Intersections of Big Data, Polarization, and Bias on Twitter During the 2016 Presidential Election
Authors: Kirgios, Erika
Advisors: LaPaugh, Andrea S.
Department: Computer Science
Class Year: 2017
Abstract: Twitter is becoming an increasingly common forum for discussion of politics and current events. Such conversation can often be contentious and polarizing. Using Twitter data from the week of the 2016 presidential election, this paper aims to improve upon previous classifiers trained to detect hate speech by using neural network models and psycholinguistic content. Furthermore, this paper adds complexity to the current literature on offensiveness detection by classifying tweets as empowering and neutral as well as offensive while maintaining high accuracy. Through a sociolinguistic and technosocial lens, we discuss the process of algorithmic construction with online text corpora, merging technical and sociocultural literature to address ethical concerns and sources of algorithmic bias. We also offer a novel tool to detect spam tweets from tweet metadata and content alone rather than relying on user history, achieving high accuracy of 89.9\% and low false positive rate of .3\%.
URI: http://arks.princeton.edu/ark:/88435/dsp012f75rb649
Type of Material: Princeton University Senior Theses
Language: en_US
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File SizeFormat 
written_final_report.pdf4.77 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.