Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01mw22v8323
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorVanderbei, Robert J-
dc.contributor.authorKabir, Tazim-
dc.date.accessioned2019-08-16T13:55:03Z-
dc.date.available2019-08-16T13:55:03Z-
dc.date.created2019-04-16-
dc.date.issued2019-08-16-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp01mw22v8323-
dc.description.abstractThe United States is a heterogeneous nation, containing diverse regional sub-populations with dif- ferent dialects and political stances. Research has already been conducted about different dialects of the United States, but there existed a gap in research on how dialects influence voting pattern. In this study, we investigate whether someone’s state and vote can be predicted based on their dialect, how states relate into regions based on dialect and voting, and whether dialect and past voting data can be used to predict presidential elections. Using Linear Discriminant Analysis modelling techniques on data from the Harvard Dialect Survey, we find that the dialect data of a survey re- spondent can be predict the respondent’s state with over 80% accuracy and census region with over 93% accuracy. By correlating state dialect and correlating state 1996-2016 election data, we devise a 7-region state classification system that shows that states speak and vote in blocs. Further applying Linear Discriminant Analysis, we show that dialect and past voting behavior can predict the current political leanings of over 96% of states and predict over 94% of swing states. We apply LDA models to predict a close 2020 Republican victory that could easily swing Democrat. Our analysis and modeling techniques could prove useful for voice recognition technology to analyze the geographic location of a speaker and improve targeted advertising based on dialect. Also, our dialect and voting models could help predict future elections, and our analysis could help political organizations target their campaign trails and political rhetoric.en_US
dc.format.mimetypeapplication/pdf-
dc.language.isoenen_US
dc.titleLightning Bugs or Fireflies? A Machine-Learning Approach to U.S. Dialects and Presidential Electionsen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2019en_US
pu.departmentOperations Research and Financial Engineering*
pu.pdf.coverpageSeniorThesisCoverPage-
pu.contributor.authorid960909498-
pu.certificateEngineering and Management Systems Programen_US
Appears in Collections:Operations Research and Financial Engineering, 2000-2020

Files in This Item:
File Description SizeFormat 
KABIR-TAZIM-THESIS.pdf2.63 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.