Conference proceeding
Communication and Migration Energy Aware Design Space Exploration for Multicore Systems with Intermittent Faults
DESIGN, AUTOMATION & TEST IN EUROPE, pp 1631-1636
01 Jan 2013
Abstract
Shrinking transistor geometries, aggressive voltage scaling and higher operating frequencies have negatively impacted the dependability of embedded multicore systems. Most existing research works on fault-tolerance have focused on transient and permanent faults of cores. Intermittent faults are a separate class of defects resulting from on-chip temperature, pressure and voltage variations and lasting for a few cycles to several seconds or more. Operations of cores impacted by intermittent faults are suspended during these cycles but come back alive when conditions become favorable.
This paper proposes a technique to model the availability of multiprocessor systems-on-chip (MPSoCs) with intermittent and reparable device defects. This model is based on Markov chain with stochastic fault distribution and can be applied even for permanent faults. Based on this model, a design space pruning technique is proposed to select a set of task mappings (with variable resource usage), which minimizes the task communication energy while satisfying the MPSoC availability constraint. Moreover, task migration overhead is also minimized, which is an important consideration for frequently occurring intermittent and temperature related faults, where prolonged system downtime during task re-mapping is not desired. Experiments conducted with real-life and synthetic application task graphs demonstrate that the proposed technique minimizes communication energy by 30% and reduces migration overhead by 50% as compared to the existing approaches.
Metrics
Details
- Title
- Communication and Migration Energy Aware Design Space Exploration for Multicore Systems with Intermittent Faults
- Creators
- Anup Das - Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, SingaporeAkash Kumar - Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, SingaporeBharadwaj Veeravalli - Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
- Publication Details
- DESIGN, AUTOMATION & TEST IN EUROPE, pp 1631-1636
- Series
- Design Automation and Test in Europe Conference and Exhibition
- Publisher
- Assoc Computing Machinery
- Number of pages
- 6
- Grant note
- R-263-000-655-133 / Singapore Ministry of Education; Ministry of Education, Singapore
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Electrical and Computer Engineering
- Web of Science ID
- WOS:000415129400316
- Scopus ID
- 2-s2.0-84885614344
- Other Identifier
- 991019295295704721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Web of Science research areas
- Automation & Control Systems
- Computer Science, Hardware & Architecture
- Engineering, Electrical & Electronic