Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp019z903229x
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorLaPaugh, Andrea-
dc.contributor.authorSwenson, Hannah-
dc.date.accessioned2016-07-01T13:30:44Z-
dc.date.available2016-07-01T13:30:44Z-
dc.date.created2016-04-29-
dc.date.issued2016-07-01-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/dsp019z903229x-
dc.description.abstractNot only are women underrepresented in computer science, but their percentage has been declining since the 1980’s. In order to analyze women’s potential impact on the programming world, I study female contribution and gender dynamics in Open Source Software (OSS) communities. GitHub, an OSS platform that allows for programmers world-wide to collaborate on software projects, is the largest publicly available source of OSS project data. I implement a supervised machine learning algorithm to predict, given certain factors, whether a programming team on GitHub has female members, in the hopes of showing positive correlations between having women on the team and project productivity or success. While my linear regression model predicts whether a team has women to a certain degree of accuracy, it is limited by the training data that is sparse of women. This sparsity in the dataset is consistent with and speaks to the broader issue of female underrepresentation in computer science.en_US
dc.format.extent46 pages*
dc.language.isoen_USen_US
dc.titleWomen’s Impact on GitHub: A Machine Learning Approachen_US
dc.typePrinceton University Senior Theses-
pu.date.classyear2016en_US
pu.departmentComputer Scienceen_US
pu.pdf.coverpageSeniorThesisCoverPage-
Appears in Collections:Computer Science, 1988-2020

Files in This Item:
File SizeFormat 
Swenson_Hannah_2016_Thesis.pdf1.07 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.