Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01kk91fp39h
Title: Are We There Yet? Contextual Effects of Computer Model Judgments of Similarity vs. Human Judgments
Authors: Kong-Johnson, Noe
Advisors: Goldberg, Adele E.
Department: Neuroscience
Certificate Program: Linguistics Program
Class Year: 2019
Abstract: Similarity is used every day to help people organize the world and make classifications and generalizations. Quantitative models used to study similarity, including vector space models, represent words as vectors in multi-dimensional space and determine the similarity between two words by calculating the cosine of the angle between the two vectors. This thesis demonstrates that existing models do not reproduce human judgments of non-transitive aspects of similarity: specifically, the effect of order or context. I investigate if humans judge two words as more similar after a “context” pair of words or after a random pair of words, using Amazon’s Mechanical Turk crowd-sourcing platform. Human judgments are found to be asymmetric, with similarity increased when a relevant context is provided by the context pair. Word2Vec, a well-known vector space model that uses cosine distance to calculate similarity between pairs of words, is unaffected by other comparisons and therefore is unable to capture the effect of context. These results confirm that humans take context into account when judging between words, whereas Word2Vec does not. I suggest a neural circuit, which includes regions implicated in the semantic circuit, that may be involved in the similarity judgment task as performed by humans.
URI: http://arks.princeton.edu/ark:/88435/dsp01kk91fp39h
Type of Material: Princeton University Senior Theses
Language: en
Appears in Collections:Neuroscience, 2017-2020

Files in This Item:
File Description SizeFormat 
KONG-JOHNSON-NOE-THESIS.pdf1.43 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.