Collaborative Research: CSR: Small: Cross-layer learning-based Energy-Efficient and Resilient NoC design for Multicore Systems
协作研究:CSR:小型:基于跨层学习的多核系统节能和弹性 NoC 设计
基本信息
- 批准号:2321224
- 负责人:
- 金额:$ 37.5万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-10-01 至 2026-09-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
The proliferation of multiple cores on the chip has signaled the advent of communication-centric, rather than computation-centric systems. Consequently, the design of low latency, high bandwidth, power-efficient, and reliable Network-on-Chips (NoCs) is proving to be one of the most critical challenges to achieving the performance potential of future multicore systems. However, as multicores are facilitating an enormous integration capacity, rapid transistor scaling has led to a steady degradation of the device and circuit reliability: unpredictable device behavior will undeniably increase and will result in a significant increase in faults (both permanent and transient), and hardware failures. The ramifications for the NoC are immense: a single fault in the NoC may paralyze the working of the entire chip. While considerable efforts are undertaken to tackle the reliability challenge of NoCs, most current solutions concentrate on local optimizations within the entire NoC abstractions (e.g., circuit, message, and network layers). These solutions tend to possess limited knowledge of the overall system and are therefore reactive in behavior, making worst-case assumptions and overprovisioning, and as a result, they introduce significant power, area, and performance overheads while not completely solving the reliability challenge.This research project tackles the critical NoC reliability challenge by developing a comprehensive, cooperative, and adaptive multi-layer approach for designing reliable NoCs from fault-susceptible components, with globally-optimized power, performance, and costs. To achieve this research goal, this project is organized into four interrelated research tasks. First, this research project conducts a comprehensive study of the fundamental mechanisms that underlie the reliability issues across NoC abstractions. A detailed analysis of the dynamic interactions of NoC abstractions and design trade-offs. Second, the research project develops a cross-layer NoC architecture for resilient on-chip communication with machine-learning-based optimization. Third, this research project aims to incorporate the application layer and off-chip communications and develops a holistic design framework that can automatically capture and adapt to the various computation and communication requirements of different applications with optimized performance, power, and reliability. Finally, the project evaluates the designed framework by developing a cycle-accurate simulation framework and an FPGA prototype. This project will significantly advance the fundamental understanding of the interplay between the NoC and the rest of the components on the chip (cores, memory, etc.) as well as design tradeoffs between performance, power, reliability, and cost in future massively defective nanometer technologies. The developed NoC framework will benefit future multi-core architectures and computing systems with system-level performance and reliability improvements.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
芯片上多个核心的扩散表明以通信为中心而不是以计算为中心的系统的出现。因此,低潜伏期,较高的带宽,效力和可靠的网络(NOC)的设计被证明是实现未来多核心系统的性能潜力的最关键挑战之一。但是,随着多学院正在促进巨大的集成能力,快速晶体管缩放导致设备的稳定下降和电路可靠性:不可预测的设备行为将不可否认地增加,并且会导致故障(永久和瞬态)和硬件故障的故障显着增加。 NOC的后果是巨大的:NOC中的单个故障可能会使整个芯片的工作瘫痪。尽管为应对NOC的可靠性挑战而竭尽全力,但当前大多数解决方案都集中在整个NOC抽象(例如电路,消息和网络层)中的本地优化。这些解决方案倾向于对整个系统具有有限的知识,因此在行为上具有反应性,使最坏的假设和过度覆盖能力,因此,它们引入了重要的能力,区域和性能间接费用,而不是完全解决可靠性挑战。本研究项目通过与全面的合作和适应性的构建方法来应对重要的NOC可靠性挑战,以实现合作和适应性的构图,并确定了适度的构图,该方法是在适当的挑战中,该研究项目的设计项目是合作,适度的,该方法的设计项目,并确定了适应性的构图。全球优化的功率,性能和成本。为了实现这一研究目标,该项目被组织成四个相互关联的研究任务。首先,该研究项目对基本机制进行了全面研究,这些基本机制是NOC抽象跨越可靠性问题的基础。对NOC抽象和设计权衡的动态相互作用的详细分析。其次,研究项目开发了一种跨层NOC体系结构,用于具有基于机器学习的优化的弹性芯片通信。第三,该研究项目旨在结合应用层和芯片外通信,并开发一个整体设计框架,可以自动捕获并适应具有优化性能,功率和可靠性的不同应用程序的各种计算和通信要求。最后,该项目通过开发循环精确的仿真框架和FPGA原型来评估设计的框架。该项目将大大提高对NOC与芯片(内核,内存等)中其余组件之间相互作用的基本理解,以及在未来的大量缺陷纳米技术中的性能,功率,能力,可靠性和成本之间的设计权衡。开发的NOC框架将使未来的多核架构和计算系统受益于系统级的性能和可靠性改进。该奖项反映了NSF的法定任务,并被认为是值得通过基金会的知识分子优点和更广泛影响的评估评估标准来通过评估来获得支持的。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

暂无数据
数据更新时间:2024-06-01
Ahmed Louri的其他基金
Collaborative Research: DESC: Type II: Multi-Function Cross-Layer Electro-Optic Fabrics for Reliable and Sustainable Computing Systems
合作研究:DESC:II 型:用于可靠和可持续计算系统的多功能跨层电光织物
- 批准号:23246442324644
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Standard GrantStandard Grant
Collaborative Research: SHF: Medium: EPIC: Exploiting Photonic Interconnects for Resilient Data Communication and Acceleration in Energy-Efficient Chiplet-based Architectures
合作研究:SHF:中:EPIC:利用光子互连实现基于节能 Chiplet 的架构中的弹性数据通信和加速
- 批准号:23115432311543
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
SHF: Small: Holistic Design of High-performance and Energy-efficient Accelerators for Graph Neural Networks
SHF:小型:图神经网络高性能、高能效加速器的整体设计
- 批准号:21319462131946
- 财政年份:2021
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Standard GrantStandard Grant
Collaborative Research: SHF: Medium: Neural-Network-based Stochastic Computing Architectures with applications to Machine Learning
合作研究:SHF:中:基于神经网络的随机计算架构及其在机器学习中的应用
- 批准号:19539801953980
- 财政年份:2020
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
SHF: Medium: Collaborative Research: Photonic Neural Network Accelerators for Energy-efficient Heterogeneous Multicore Architectures
SHF:媒介:协作研究:用于节能异构多核架构的光子神经网络加速器
- 批准号:19011651901165
- 财政年份:2019
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
SHF: Small: Collaborative Research: Integrated Framework for System-Level Approximate Computing
SHF:小型:协作研究:系统级近似计算的集成框架
- 批准号:18124951812495
- 财政年份:2018
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Standard GrantStandard Grant
SHF: Medium: Collaborative Research: Machine Learning Enabled Network-on-Chip Architectures Optimized for Energy, Performance and Reliability
SHF:中:协作研究:支持机器学习的片上网络架构,针对能源、性能和可靠性进行了优化
- 批准号:17029801702980
- 财政年份:2017
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
SHF: Small: Collaborative Research: Power-Efficient and Reliable 3D Stacked Reconfigurable Photonic Network-on-Chips for Scalable Multicore Architectures
SHF:小型:协作研究:用于可扩展多核架构的高效且可靠的 3D 堆叠可重构光子片上网络
- 批准号:15470341547034
- 财政年份:2015
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Standard GrantStandard Grant
SHF: Small: Collaborative Research: A Holistic Design Methodology for Fault-Tolerant and Robust Network-on-Chips (NoCs) Architectures
SHF:小型:协作研究:容错和鲁棒片上网络 (NoC) 架构的整体设计方法
- 批准号:15470351547035
- 财政年份:2015
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Standard GrantStandard Grant
XPS: FULL: CCA: Collaborative Research: SPARTA: a Stream-based Processor And Run-Time Architecture
XPS:完整:CCA:协作研究:SPARTA:基于流的处理器和运行时架构
- 批准号:15470361547036
- 财政年份:2015
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Standard GrantStandard Grant
相似国自然基金
善行得善果?后疫情时代嵌入式和边缘式CSR对员工幸福感的跨层影响研究
- 批准号:72102183
- 批准年份:2021
- 资助金额:24.00 万元
- 项目类别:青年科学基金项目
善行得善果?后疫情时代嵌入式和边缘式CSR对员工幸福感的跨层影响研究
- 批准号:
- 批准年份:2021
- 资助金额:30 万元
- 项目类别:
基于脊髓突触可塑性探讨“调气”电针远端腧穴干预CSR模型大鼠的中枢镇痛效应及机制研究
- 批准号:82160934
- 批准年份:2021
- 资助金额:34 万元
- 项目类别:地区科学基金项目
利用输运模型和机器学习方法研究CSR能区的低温高密核物质
- 批准号:
- 批准年份:2020
- 资助金额:50 万元
- 项目类别:联合基金项目
基于兰州HIRFL-CSR装置对轻原子核的团簇结构及晕结构的理论研究
- 批准号:
- 批准年份:2020
- 资助金额:60 万元
- 项目类别:联合基金项目
相似海外基金
Collaborative Research: CSR: Medium: Scaling Secure Serverless Computing on Heterogeneous Datacenters
协作研究:CSR:中:在异构数据中心上扩展安全无服务器计算
- 批准号:23122062312206
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
Collaborative Research: CSR: Medium: Architecting GPUs for Practical Homomorphic Encryption-based Computing
协作研究:CSR:中:为实用的同态加密计算构建 GPU
- 批准号:23122762312276
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
Collaborative Research: CSR: Medium: Fortuna: Characterizing and Harnessing Performance Variability in Accelerator-rich Clusters
合作研究:CSR:Medium:Fortuna:表征和利用富含加速器的集群中的性能变异性
- 批准号:23126892312689
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
Collaborative Research: CSR: Medium: Fortuna: Characterizing and Harnessing Performance Variability in Accelerator-rich Clusters
合作研究:CSR:Medium:Fortuna:表征和利用富含加速器的集群中的性能变异性
- 批准号:24012442401244
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant
Collaborative Research: CSR: Small: Caphammer: A New Security Exploit in Energy Harvesting Systems and its Countermeasures
合作研究:CSR:小型:Caphammer:能量收集系统的新安全漏洞及其对策
- 批准号:23146812314681
- 财政年份:2023
- 资助金额:$ 37.5万$ 37.5万
- 项目类别:Continuing GrantContinuing Grant