Conditioning Language Models for Domain

Arora, Karan

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01h415pd38k

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Narasimhan, Karthik	-
dc.contributor.author	Arora, Karan	-
dc.date.accessioned	2019-07-24T17:51:13Z	-
dc.date.available	2019-07-24T17:51:13Z	-
dc.date.created	2019-05-10	-
dc.date.issued	2019-07-24	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp01h415pd38k	-
dc.description.abstract	We consider a setting in which a language model - given access to some information about an input’s domain - is trained to learn a task over an entire distribution of domains, with the goal of generalizing to inputs from domains that are not in its training data. Drawing inspiration from existing methods outside our problem setting, we develop a mechanism that conditions an operation in a language model to modify its representation of an input based on information about a domain. This mechanism is meant to be trained jointly with the taskperforming model, and makes few assumptions about the model architecture. We perform experiments in which we compare the performance of a model that is augmented with our mechanism to a baseline that is not for language modeling and sentiment analysis tasks. While the conditioning mechanism does not currently provide a performance improvement on real data, experiments with synthetic data suggest that it is capable of doing so, and that some fine-tuning and further experimentation may enable it to work better.	en_US
dc.format.mimetype	application/pdf	-
dc.language.iso	en	en_US
dc.title	Conditioning Language Models for Domain	en_US
dc.type	Princeton University Senior Theses	-
pu.date.classyear	2019	en_US
pu.department	Computer Science	en_US
pu.pdf.coverpage	SeniorThesisCoverPage	-
pu.contributor.authorid	960978935	-
Appears in Collections:	Computer Science, 1988-2020

Files in This Item:

File	Size	Format
ARORA-KARAN-THESIS.pdf	857.02 kB	Adobe PDF	Request a copy

Show simple item record

Search

Browse