Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01h989r5537
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | Moretti, Christopher | -
dc.contributor.author | Smith, Jamie | -
dc.date.accessioned | 2015-06-26T16:31:34Z | -
dc.date.available | 2015-06-26T16:31:34Z | -
dc.date.created | 2015-04-30 | -
dc.date.issued | 2015-06-26 | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01h989r5537 | -
dc.description.abstract | Most automatic speaker recognition systems perform poorly on data with low signal-to-noise ratios (SNRs). In this paper, we analyze the performance of Spear, an open-source, comprehensive speaker recognition toolkit, on speech utterances from the TIMIT corpus injected with background noise. We suggest and implement changes for the voice activity detection (VAD) and feature extraction steps of the Spear toolchain to improve its overall noise robustness. Specifically, we propose replacing Spear's simple VAD, which classifies frame-level energy into two groups, with a new VAD, which uses a posteriori signal-to-noise weighted energy distance. For feature extraction, we consider the effectiveness of using gammatone frequency cepstral coefficients (GFCCs) instead of traditional mel-scale frequency cepstral coefficients (MFCCs). We prove the superiority of GFCCs for data with low SNRs by incorporating GFCC feature extraction in the Spear toolchain and then testing it on the noisy TIMIT data. Then, we further propose a new, modified version of MFCCs that is even more noise-robust than GFCCs. | en_US
dc.format.extent | 60 pages | *
dc.language.iso | en_US | en_US
dc.title | Improving Noise Robustness in Automatic Speaker Recognition Systems | en_US
dc.type | Princeton University Senior Theses | -
pu.date.classyear | 2015 | en_US
pu.department | Computer Science | en_US
pu.pdf.coverpage | SeniorThesisCoverPage | -
Appears in Collections: Computer Science, 1988-2020

Files in This Item:
File | Size | Format
PUTheses2015-Smith_Jamie.pdf | 413.62 kB | Adobe PDF


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.