Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01pr76f603m
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Yang, Haw | - |
dc.contributor.author | Song, Nancy | - |
dc.date.accessioned | 2017-07-25T14:24:53Z | - |
dc.date.available | 2019-07-01T09:15:51Z | - |
dc.date.created | 2017-04-14 | - |
dc.date.issued | 2017-04-14 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01pr76f603m | - |
dc.description.abstract | Change points, discrete jumps in measurements, are key to understanding the underlying dynamics of a system. Though many methods have been proposed for change point detection, there has been no explicit investigation into the limitations of these methods, specifically of methods that use log-likelihood ratio tests to detect change points in Gaussian distributed time series. This thesis characterizes Gaussian mean change point (GMCP) detection to probe the parameters of its effectiveness and limitations, and further extends the detection method to include changes in both the mean and variance, also known as Gaussian mean and variance change point (GMVCP) detection. Both GMCP and GMVCP detection were found to perform well even when there are subtle changes in the mean and/or variance and a high background-to-noise ratio, and can thus be applied to many different kinds of chemical data that result in time series. However, rapid advancements in technology have made the analysis of large amounts of data infeasible due to time and memory constraints. As a possible solution, this thesis applies parallel computing, the simultaneous processing of segments of a dataset, to change point detection. The parallel algorithm proposed here includes a novel data segmentation scheme and is evaluated by applying it to GMCP detection. Time decreased superlinearly with respect to the number of processes, and detection accuracy was minimally affected, suggesting that this parallelization algorithm is a valid method that addresses the problem of analyzing "big data." | en_US |
dc.language.iso | en_US | en_US |
dc.title | The Characterization and Parallelization of Change Point Detection in Gaussian Distributed Time Series | en_US |
dc.type | Princeton University Senior Theses | - |
pu.embargo.terms | 2019-07-01 | - |
pu.date.classyear | 2017 | en_US |
pu.department | Chemistry | en_US |
pu.pdf.coverpage | SeniorThesisCoverPage | - |
pu.contributor.authorid | 960738474 | - |
pu.contributor.advisorid | 960504780 | - |
pu.certificate | Applications of Computing Program | en_US |
pu.mudd.walkin | yes | en_US |
Appears in Collections: | Chemistry, 1926-2020 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
NS_Thesis_Signed.pdf | 15.51 MB | Adobe PDF | Request a copy |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.