Please use this identifier to cite or link to this item:
http://arks.princeton.edu/ark:/88435/dsp01r207tp37c
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | August, David I | en_US |
dc.contributor.author | Zhang, Yun | en_US |
dc.contributor.other | Computer Science Department | en_US |
dc.date.accessioned | 2012-08-01T19:35:34Z | - |
dc.date.available | 2012-08-01T19:35:34Z | - |
dc.date.issued | 2012 | en_US |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/dsp01r207tp37c | - |
dc.description.abstract | Transient faults are emerging as a critical reliability concern for modern microprocessors. Recently, microprocessors have been designed with lower voltage level ,smaller and faster transistors enabled by improved fabrication technology. A combination of increased density of transistors on chip, reduced noise margin of each transistor, and voltage scaling are making hardware systems more susceptible to transient faults than ever. Both hardware or software solutions have been proposed for transient fault tolerance. The hardware approach typically adds redundant hardware modules to the system, thus requiring extra chip area as well as higher hardware design and verification cost. In ad- dition, the scope and mechanism of fault tolerance are hardwired at design time, which could be suboptimal with the change of deployment environment. Unlike hardware solu- tions, software-only techniques do not require any specialized hardware extensions and are more flexible with the scope of protection and the change of environment. However, even the best-performing software-only fault tolerance techniques incur significant performance cost. The overhead of prior work comes from doubled register usage, frequent inter-core communication, or barrier synchronizations. These factors prevent existing software tech- niques from being adopted widely. To address these problems, this dissertation proposes Runtime Software-only Speculative Fault Tolerance (RSFT). The key insights behind this dissertation are: (1) not all values are equally important. Transient faults may alter a transistor's value, which is never used. Only the values that will affect the externally visible behavior of a program must be verified before being used; (2) Value speculation can efficiently remove data dependences introduced by cross checking values produced in the program and its redundant copy with high confidence, thus significantly improves program runtime performance. RSFT serves as a virtual layer between the application and the underlying platform. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Princeton, NJ : Princeton University | en_US |
dc.relation.isformatof | The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the <a href=http://catalog.princeton.edu> library's main catalog </a> | en_US |
dc.subject | Fault Tolerance | en_US |
dc.subject | Runtime | en_US |
dc.subject | Speculation | en_US |
dc.subject.classification | Computer science | en_US |
dc.title | Runtime Speculative Software-Only Fault Tolerance | en_US |
dc.type | Academic dissertations (Ph.D.) | en_US |
pu.projectgrantnumber | 690-2143 | en_US |
Appears in Collections: | Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Zhang_princeton_0181D_10157.pdf | 803.27 kB | Adobe PDF | View/Download |
Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.