Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01rj430740p
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Boumal, Nicolas | - |
dc.contributor.advisor | Jha, Niraj | - |
dc.contributor.author | Hertan, Freddy | - |
dc.date.accessioned | 2019-07-25T18:39:52Z | - |
dc.date.available | 2019-07-25T18:39:52Z | - |
dc.date.created | 2019-05-06 | - |
dc.date.issued | 2019-07-25 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01rj430740p | - |
dc.description.abstract | In this paper, we present a new method for semi-supervised video object segmentation, PRe- MVOS with ConvLSTM. Given the first frame ground-truth label, our method automatically generates accurate and consistent pixel masks for objects in the rest of the video sequence. In producing these masks, we build heavily upon the state-of-the-art PReMVOS method that won the DAVIS 2018 Video Object Segmentation Challenge and the YouTube-VOS 1st Large-scale Video Object Segmentation Challenge. A new multi-scale convolutional LSTM (ConvLSTM) module is added to the end of Youtube-VOS version of PReMVOS in order to incorporate temporal information about mask predictions in previous frames. Because ConvLSTMs have relatively few parameters, we are able to implement this module with little overhead, adding only 0.06 seconds per frame to the run time. Our method improves the mean J score DAVIS 2017 validation set above the Youtube-VOS version of PReMVOS, and we approach the performance of the much slower DAVIS 2018 version of PReMVOS. We thus bridge the performance gap, at least in terms of J score, between the DAVIS and Youtube-VOS versions of PReMVOS without increasing run time considerably. Our method performs best in the single-object case, boosting the mean J score of the Youtube-VOS version of PReMVOS by 0.55 while only exhibiting a decrease of 0.05 in mean F score. We thus demonstrate empirically that videos contain temporal information that can be used to boost segmentation accuracy. | en_US |
dc.format.mimetype | application/pdf | - |
dc.language.iso | en | en_US |
dc.title | PReMVOS with ConvLSTM; Exploiting Recurrence for Video Object Segmentation | en_US |
dc.type | Princeton University Senior Theses | - |
pu.date.classyear | 2019 | en_US |
pu.department | Mathematics | en_US |
pu.pdf.coverpage | SeniorThesisCoverPage | - |
pu.contributor.authorid | 961166988 | - |
pu.certificate | Applications of Computing Program | en_US |
Appears in Collections: | Mathematics, 1934-2020 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
HERTAN-FREDDY-THESIS.pdf | 1.47 MB | Adobe PDF | Request a copy |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.