A Self-Checking Hardware Journal for a Fault-Tolerant Processor Architecture

Joint Authors

Ramazani, Abbas
Dandache, Abbas
Amin, Mohsin
Diou, Camille
Monteiro, Fabrice

Source

International Journal of Reconfigurable Computing

Issue

Vol. 2011, Issue 2011 (31 Dec. 2011), pp.1-15, 15 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2011-06-28

Country of Publication

Egypt

No. of Pages

15

Main Subjects

Information Technology and Computer Science

Abstract EN

We introduce a specialized self-checking hardware journal being used as a centerpiece in our design strategy to build a processor tolerant to transient faults.

Fault tolerance here relies on the use of error detection techniques in the processor core together with journalization and rollback execution to recover from erroneous situations.

Effective rollback recovery is possible thanks to using a hardware journal and chosing a stack computing architecture for the processor core instead of the usual RISC or CISC.

The main objective of the journalization and the hardware self-checking journal is to prevent data not yet validated to be sent to the main memory, and allow to fast rollback execution on faulty situations.

The main memory, supposed to be fault secure in our model, only contains valid (uncorrupted) data obtained from fault-free computations.

Error control coding techniques are used both in the processor core to detect errors and in the HW journal to protect the temporarily stored data from possible changes induced by transient faults.

Implementation results on an FPGA of the Altera Stratix-II family show clearly the relevance of the approach, both in terms of performance/area tradeoff and fault tolerance effectiveness, even for high error rates.

American Psychological Association (APA)

Amin, Mohsin& Ramazani, Abbas& Monteiro, Fabrice& Diou, Camille& Dandache, Abbas. 2011. A Self-Checking Hardware Journal for a Fault-Tolerant Processor Architecture. International Journal of Reconfigurable Computing،Vol. 2011, no. 2011, pp.1-15.
https://search.emarefa.net/detail/BIM-511743

Modern Language Association (MLA)

Amin, Mohsin…[et al.]. A Self-Checking Hardware Journal for a Fault-Tolerant Processor Architecture. International Journal of Reconfigurable Computing No. 2011 (2011), pp.1-15.
https://search.emarefa.net/detail/BIM-511743

American Medical Association (AMA)

Amin, Mohsin& Ramazani, Abbas& Monteiro, Fabrice& Diou, Camille& Dandache, Abbas. A Self-Checking Hardware Journal for a Fault-Tolerant Processor Architecture. International Journal of Reconfigurable Computing. 2011. Vol. 2011, no. 2011, pp.1-15.
https://search.emarefa.net/detail/BIM-511743

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-511743