SHF:Small: Benchmarking of Transient and Intermittent Errors and Their Application to Microarchitecture

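Basic information

  • Award number:
    1219186
  • Principal investigator:
  • Amount:
    $400,000
  • Host institution:
  • Host institution country:
    United States
  • Project type:
    Standard Grant
  • Fiscal year:
    2012
  • Funding country:
    United States
  • Period:
    2012-08-01 to 2016-07-31
  • Project status:
    Completed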

Project abstract

Computing infrastructure has been a driving force for our socio-economic progress in the past several decades. From drug discovery to space exploration, every scientific and engineering domain relies on computer systems to accurately analyze complex datasets. Historically, computational accuracy has been taken for granted in all these disciplines, but this notion is changing. While rapidly shrinking transistor dimensions lead to exponential power and performance benefits, the trend is also creating several unwanted side effects in computer system reliability. There are two types of errors that will become prevalent in the near future: (1) multi-bit soft errors, where alpha particles and neutrons cause multiple bits to flip at the same time, and (2) intermittent errors that occur due to stress accumulation over the lifetime of a computer. Thus, it is critical to benchmark the impact of these errors on the lifetime of a computer chip. Only when the impact is accurately measured is it possible to judiciously deploy solutions to improve reliability. Since any protection scheme comes with a cost, it is necessary to understand when a particular protection scheme being considered, such as parity or a single-error-correcting double-error-detecting (SEC-DED) code, is too much or too little. This project presents two solutions for benchmarking multi-bit soft errors and intermittent errors. This project will develop a unified methodology to benchmark the impacts of single-bit and multi-bit soft errors on caches protected with an arbitrary protection scheme, such as an interleaved, block-level, or word-level error-correcting code. Such a benchmarking framework will significantly enhance a computer designer's ability to objectively evaluate the performance, power, and reliability tradeoffs of various protection schemes proposed for protecting caches. This research also develops a methodology to benchmark the vulnerability of an instruction set architecture (ISA) to intermittent errors.
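As a rough illustration of why the choice of cache protection scheme matters under multi-bit soft errors, the sketch below classifies the outcome of a burst of physically adjacent bit flips for parity and for a single-error-correcting double-error-detecting code, with and without bit interleaving. It is not the project's methodology: the function names, the mapping of physical bit i to codeword i % interleave, and the four outcome labels are all assumptions made for this example.

```python
from collections import Counter

# Outcome labels ordered from best to worst (illustrative, not from the project).
SEVERITY = ["clean", "corrected", "detected", "unprotected"]


def flips_per_codeword(start, burst_len, interleave):
    """Count how many bits of a contiguous burst land in each codeword.

    Assumption: physical bit i belongs to codeword (i % interleave), so
    interleave == 1 means no interleaving at all.
    """
    return Counter((start + k) % interleave for k in range(burst_len))


def secded_outcome(n_flips):
    """SEC-DED corrects one flipped bit and detects two per codeword."""
    if n_flips == 0:
        return "clean"
    if n_flips == 1:
        return "corrected"
    if n_flips == 2:
        return "detected"
    return "unprotected"  # 3+ flips may be miscorrected or missed


def parity_outcome(n_flips):
    """A single parity bit detects an odd number of flips and misses an even number."""
    if n_flips == 0:
        return "clean"
    return "detected" if n_flips % 2 == 1 else "unprotected"


def burst_outcome(start, burst_len, interleave, scheme):
    """Return the worst per-codeword outcome caused by one burst."""
    counts = flips_per_codeword(start, burst_len, interleave)
    outcomes = [scheme(n) for n in counts.values()]
    return max(outcomes, key=SEVERITY.index, default="clean")


if __name__ == "__main__":
    for degree in (1, 2, 4):
        print(degree, burst_outcome(0, 4, degree, secded_outcome))
```

Under these assumptions, a 4-bit burst defeats SEC-DED without interleaving, is only detected with 2-way interleaving, and is fully corrected with 4-way interleaving. A benchmarking framework would weigh that gain against the extra area, power, and latency that interleaving costs.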
Each instruction in an ISA specification is enhanced to quantify the amount of stress that it is expected to cause on the underlying microarchitecture of a chip. The stress level information from the ISA is combined with operating conditions of the chip to continuously monitor intermittent error probability during application execution. Any unwanted degradation in chip reliability is then tackled by software exception handlers, which trigger redundant execution of vulnerable code. Broader societal impact will result from these research solutions. Benchmarking is essential to objectively evaluate the cost-benefit tradeoffs of various solutions currently being proposed to tackle reliability concerns. Without benchmarking, building a system to meet reliability specifications is a guessing game. By providing the right set of tools to initiate just-in-time error correction and recovery mechanisms, a computer designer can significantly lower the cost of providing reliable computations.
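The ISA-level monitoring idea described above can be sketched in a few lines. Everything specific here is hypothetical and chosen only for illustration: the per-opcode stress weights, the temperature and voltage scaling, the saturating probability formula, and the 5% threshold. None of them come from the project itself.

```python
import math

# Hypothetical per-opcode stress weights in arbitrary units (an assumption).
STRESS = {"add": 1.0, "mul": 3.0, "div": 6.0, "load": 2.0}


def condition_factor(temp_c, vdd, ref_temp_c=60.0, ref_vdd=1.0):
    """Illustrative scaling: stress grows with temperature and supply voltage.

    This is a stand-in for a real device-aging model, not a physical law.
    """
    return math.exp((temp_c - ref_temp_c) / 30.0) * (vdd / ref_vdd) ** 2


def error_probability(accumulated_stress, scale=1000.0):
    """Map accumulated stress to a probability in [0, 1) (assumed form)."""
    return 1.0 - math.exp(-accumulated_stress / scale)


def first_vulnerable_index(trace, temp_c, vdd, threshold=0.05):
    """Return the index of the first instruction at which the estimated
    intermittent error probability exceeds the threshold, or None.

    In the scheme the abstract describes, crossing the threshold is the point
    where a software exception handler would trigger redundant execution.
    """
    factor = condition_factor(temp_c, vdd)
    accumulated = 0.0
    for index, opcode in enumerate(trace):
        accumulated += STRESS[opcode] * factor
        if error_probability(accumulated) > threshold:
            return index
    return None


if __name__ == "__main__":
    trace = ["div"] * 10
    print(first_vulnerable_index(trace, temp_c=60.0, vdd=1.0))
    print(first_vulnerable_index(trace, temp_c=90.0, vdd=1.0))
```

With these made-up numbers, the same instruction trace crosses the threshold earlier at a higher temperature. That is the kind of interaction between ISA-level stress information and operating conditions that the abstract describes.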

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Murali Annavaram

A privacy mechanism for mobile-based urban traffic monitoring
  • DOI:
    10.1016/j.pmcj.2014.12.007
  • Publication date:
    2015-07-01
  • Journal:
  • Impact factor:
  • Authors:
    Chi Wang;Hua Liu;Kwame-Lante Wright;Bhaskar Krishnamachari;Murali Annavaram
  • Corresponding author:
    Murali Annavaram
Differentially Private Next-Token Prediction of Large Language Models

Other grants by Murali Annavaram

SHF: Small: ML Accelerator Cohort Architecture
  • Award number:
    2224319
  • Fiscal year:
    2022
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant
Student Travel Support for the 2018 International Symposium on Computer Architecture (ISCA)
  • Award number:
    1812942
  • Fiscal year:
    2018
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant
SHF:Small: Accelerating Graph Analytics Through Coordinated Storage, Memory and Computing Advances
  • Award number:
    1719074
  • Fiscal year:
    2017
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant
IEEE International Symposium on Workload Characterization (IISWC) Student Subsidy Proposal
  • Award number:
    1104542
  • Fiscal year:
    2011
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant
CAREER: From Nonstop-Monitoring to Nano-ISA: An Adaptive Multi-Dimensional Framework for Processor Reliability
  • Award number:
    0954211
  • Fiscal year:
    2010
  • Funding amount:
    $400,000
  • Project type:
    Continuing Grant
CSR-PSCE,SM: Trade-offs Between Static Power, Performance and Reliability in Future Chip Multiprocessors
  • Award number:
    0834799
  • Fiscal year:
    2008
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant
CSR-PSCE,SM: A Holistic Design Approach to Reliability Using 3D Stacked
  • Award number:
    0834798
  • Fiscal year:
    2008
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant
CT-ISG: A Game Theoretic Framework for Privacy Preservation in Community-Based Mobile Applications
  • Award number:
    0831545
  • Fiscal year:
    2008
  • Funding amount:
    $400,000
  • Project type:
    Standard Grant

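Similar grants from the National Natural Science Foundation of China

Screening of small-molecule inhibitors targeting Treg-FOXP3 and their role and mechanism in lung cancer immunotherapy
  • Award number:
    32370966
  • Approval year:
    2023
  • Funding amount:
    CNY 500,000
  • Project type:
    General Program
Epigenetic mechanisms by which small-molecule activation of YAP induces chromatin plasticity and promotes cardiac progenitor cell reprogramming
  • Award number:
    82304478
  • Approval year:
    2023
  • Funding amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund
Construction and mechanism of action of microglia-targeting biomimetic glycyrrhizic acid nanoparticles: a new therapeutic strategy for sepsis-associated encephalopathy
  • Award number:
    82302422
  • Approval year:
    2023
  • Funding amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund
Role and mechanism of microglial pyroptosis mediated by the HMGB1/TLR4/Cathepsin B pathway in hypoxic-ischemic encephalopathy in neonatal rats
  • Award number:
    82371712
  • Approval year:
    2023
  • Funding amount:
    CNY 490,000
  • Project type:
    General Program
Role and mechanism of small cysteine-free proteins in regulating the insecticidal activity of biocontrol fungi
  • Award number:
    32372613
  • Approval year:
    2023
  • Funding amount:
    CNY 500,000
  • Project type:
    General Program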

Similar overseas grants

Bioorthogonal probe development for highly parallel in vivo imaging
  • Award number:
    10596786
  • Fiscal year:
    2023
  • Funding amount:
    $400,000
  • Project type:
Development and evaluation of a combined X-ray transmission and diffraction imaging system for pathology
  • Award number:
    10699271
  • Fiscal year:
    2023
  • Funding amount:
    $400,000
  • Project type:
HealthyU-Latinx: A Technology-based Tool for addressing Health Literacy in Latinx Secondary Students and their Families
  • Award number:
    10699830
  • Fiscal year:
    2023
  • Funding amount:
    $400,000
  • Project type:
A Novel Sublingual Vaccine to Prevent Neisseria Gonorrhoeae Infection
  • Award number:
    10699065
  • Fiscal year:
    2023
  • Funding amount:
    $400,000
  • Project type:
EFFICIENT DIFFERENTIATION, SCALE-UP, AND MATURATION OF IPS DERIVED CARDIOMYOCYTES
  • Award number:
    10761485
  • Fiscal year:
    2023
  • Funding amount:
    $400,000
  • Project type: