PReMVOS with ConvLSTM; Exploiting Recurrence for Video Object Segmentation

Hertan, Freddy

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01rj430740p

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Boumal, Nicolas	-
dc.contributor.advisor	Jha, Niraj	-
dc.contributor.author	Hertan, Freddy	-
dc.date.accessioned	2019-07-25T18:39:52Z	-
dc.date.available	2019-07-25T18:39:52Z	-
dc.date.created	2019-05-06	-
dc.date.issued	2019-07-25	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp01rj430740p	-
dc.description.abstract	In this paper, we present a new method for semi-supervised video object segmentation, PRe- MVOS with ConvLSTM. Given the first frame ground-truth label, our method automatically generates accurate and consistent pixel masks for objects in the rest of the video sequence. In producing these masks, we build heavily upon the state-of-the-art PReMVOS method that won the DAVIS 2018 Video Object Segmentation Challenge and the YouTube-VOS 1st Large-scale Video Object Segmentation Challenge. A new multi-scale convolutional LSTM (ConvLSTM) module is added to the end of Youtube-VOS version of PReMVOS in order to incorporate temporal information about mask predictions in previous frames. Because ConvLSTMs have relatively few parameters, we are able to implement this module with little overhead, adding only 0.06 seconds per frame to the run time. Our method improves the mean J score DAVIS 2017 validation set above the Youtube-VOS version of PReMVOS, and we approach the performance of the much slower DAVIS 2018 version of PReMVOS. We thus bridge the performance gap, at least in terms of J score, between the DAVIS and Youtube-VOS versions of PReMVOS without increasing run time considerably. Our method performs best in the single-object case, boosting the mean J score of the Youtube-VOS version of PReMVOS by 0.55 while only exhibiting a decrease of 0.05 in mean F score. We thus demonstrate empirically that videos contain temporal information that can be used to boost segmentation accuracy.	en_US
dc.format.mimetype	application/pdf	-
dc.language.iso	en	en_US
dc.title	PReMVOS with ConvLSTM; Exploiting Recurrence for Video Object Segmentation	en_US
dc.type	Princeton University Senior Theses	-
pu.date.classyear	2019	en_US
pu.department	Mathematics	en_US
pu.pdf.coverpage	SeniorThesisCoverPage	-
pu.contributor.authorid	961166988	-
pu.certificate	Applications of Computing Program	en_US
Appears in Collections:	Mathematics, 1934-2020

Files in This Item:

File	Description	Size	Format
HERTAN-FREDDY-THESIS.pdf		1.47 MB	Adobe PDF	Request a copy

Show simple item record

Search

Browse