Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp012f75rb649
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorLaPaugh, Andrea S.-
dc.contributor.authorKirgios, Erika-
dc.date.accessioned2017-07-20T13:24:20Z-
dc.date.available2017-07-20T13:24:20Z-
dc.date.created2017-05-05-
dc.date.issued2017-5-5-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp012f75rb649-
dc.description.abstractTwitter is becoming an increasingly common forum for discussion of politics and current events. Such conversation can often be contentious and polarizing. Using Twitter data from the week of the 2016 presidential election, this paper aims to improve upon previous classifiers trained to detect hate speech by using neural network models and psycholinguistic content. Furthermore, this paper adds complexity to the current literature on offensiveness detection by classifying tweets as empowering and neutral as well as offensive while maintaining high accuracy. Through a sociolinguistic and technosocial lens, we discuss the process of algorithmic construction with online text corpora, merging technical and sociocultural literature to address ethical concerns and sources of algorithmic bias. We also offer a novel tool to detect spam tweets from tweet metadata and content alone rather than relying on user history, achieving high accuracy of 89.9\% and low false positive rate of .3\%.en_US
dc.language.isoen_USen_US
dc.titleConstructing and Deconstructing an Empowerment-Offensiveness Classifier: Intersections of Big Data, Polarization, and Bias on Twitter During the 2016 Presidential Electionen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2017en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid960741861-
pu.contributor.advisorid010000279-
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File SizeFormat 
written_final_report.pdf4.77 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.