CAREER: A Hardware and Software Architecture for Data-Centric Parallel Computing

职业:以数据为中心的并行计算的硬件和软件架构

基本信息

项目摘要

Energy efficiency is the key challenge facing computer systems. To improveperformance under a limited energy budget, systems are becoming increasinglyparallel, featuring many smaller and simpler cores, and heterogeneous,featuring cores specialized for certain tasks. Even with these improvements,two critical challenges remain. First, without reducing data movement, memoryaccesses and communication will dominate energy consumption. Thus, limitingdata movement must become a primary design objective. Second, these systemswill be highly complex, and will need powerful abstractions to shieldprogrammers from this complexity. Current systems are designed in acomputation-centric way that is a poor match for these challenges. Memoryhierarchies are hardware-managed and opaque to software, which needlesslyincreases data movement; and runtimes lack the proper hardware mechanisms andsoftware policies to manage heterogeneous resources efficiently.This research project takes a holistic approach to addressing these challenges, byco-designing an architecture and runtime system that efficiently run dynamicparallel applications on systems with heterogeneous cores and memories.Redesigning hardware to be directly exploited by a dynamic runtime enables (a)many more opportunities to reduce data movement, (b) better usage ofheterogeneous resources, and (c) much faster adaptation to changing applicationneeds and available resources. Three key components underlie this design.First, a scalable memory system incorporates combinations of heterogeneousmemory technologies to improve efficiency, and exposes them to software, whichcan divide these physical memories into many virtual cache and memoryhierarchies to finely control data placement. Second, specialized programmableengines orchestrate communication among cores, accelerate intensive runtimefunctions such as load balancing, and monitor how tasks use hardware resourcesto guide runtime decisions. Third, a hardware-accelerated runtime leveragesthis novel architectural support to place data and computation to minimize datamovement, use the most suitable core for each task, and quickly respond tochanging application needs. This runtime targets a high-level programming modelthat lets programmers express fine-grained and irregular task, data, andpipeline parallelism. These techniques build on an analytical design approachthat makes hardware easy to understand and predict, and enables runtimes tonavigate multi-dimensional tradeoffs efficiently.If successful, this project will make heterogeneous systems more efficient,more broadly applicable, and easier to program. It will especially benefitapplications with dynamic and fine-grained parallelism, advancing key emergingdomains where these workloads are pervasive, such as graph analytics and onlinedata-intensive services. In addition, the infrastructure developed as part ofthis project will be publicly released, enabling others to build on the resultsof this work.
能源效率是计算机系统面临的主要挑战。为了在有限的能量预算下提高性能,系统变得越来越并行,具有许多更小、更简单的内核,以及专门用于某些任务的异构内核。即使有了这些改进,仍然存在两个关键挑战。首先,在不减少数据移动的情况下,内存访问和通信将主导能源消耗。因此,限制数据移动必须成为主要设计目标。其次,这些系统将非常复杂,并且需要强大的抽象来保护程序员免受这种复杂性的影响。当前的系统是以计算为中心的方式设计的,无法应对这些挑战。内存层次结构由硬件管理,对软件不透明,这不必要地增加了数据移动;和运行时缺乏适当的硬件机制和软件策略来有效管理异构资源。该研究项目采用整体方法来解决这些挑战,通过共同设计架构和运行时系统,在具有异构核心和内存的系统上有效运行动态并行应用程序。重新设计硬件由动态运行时直接利用可以(a)有更多机会减少数据移动,(b)更好地利用异构资源,以及(c)更快地适应不断变化的应用程序需求和可用资源。该设计基于三个关键组件。首先,可扩展的内存系统结合了异构内存技术来提高效率,并将它们暴露给软件,软件可以将这些物理内存划分为许多虚拟缓存和内存层次结构,以精细地控制数据放置。其次,专门的可编程引擎协调内核之间的通信,加速负载平衡等密集的运行时功能,并监控任务如何使用硬件资源来指导运行时决策。第三,硬件加速运行时利用这种新颖的架构支持来放置数据和计算,以最大限度地减少数据移动,为每个任务使用最合适的核心,并快速响应不断变化的应用程序需求。该运行时的目标是高级编程模型,使程序员能够表达细粒度和不规则的任务、数据和管道并行性。这些技术建立在分析设计方法的基础上,使硬件易于理解和预测,并使运行时能够有效地进行多维权衡。如果成功,该项目将使异构系统更高效、更广泛适用且更易于编程。它将特别有利于具有动态和细粒度并行性的应用程序,推动这些工作负载普遍存在的关键新兴领域,例如图形分析和在线数据密集型服务。此外,作为该项目一部分开发的基础设施将公开发布,使其他人能够在这项工作的成果的基础上进行开发。

项目成果

期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
PHI: Architectural Support for Synchronization- and Bandwidth-Efficient Commutative Scatter Updates
PHI:对同步和带宽高效的可交换分散更新的架构支持
Harmonizing Speculative and Non-Speculative Execution in Architectures for Ordered Parallelism
协调有序并行架构中的推测和非推测执行
Safecracker: Leaking Secrets through Compressed Caches
Safecracker:通过压缩缓存泄露秘密
Leveraging Caches to Accelerate Hash Tables and Memoization
利用缓存加速哈希表和记忆化
Livia: Data-Centric Computing Throughout the Memory Hierarchy
Livia:整个内存层次结构中以数据为中心的计算
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Daniel Sanchez Martin其他文献

Daniel Sanchez Martin的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Daniel Sanchez Martin', 18)}}的其他基金

Collaborative Research: PPoSS: LARGE: A Full-Stack Architecture for Sparse Computation
协作研究:PPoSS:LARGE:稀疏计算的全栈架构
  • 批准号:
    2217099
  • 财政年份:
    2022
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
SHF: Small: A Scalable Architecture for Ubiquitous Parallelism
SHF:小型:无处不在的并行性的可扩展架构
  • 批准号:
    1814969
  • 财政年份:
    2018
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
SHF:Small:Scalable Memory Hierarchies with Fine-Grained QoS Guarantees
SHF:Small:具有细粒度 QoS 保证的可扩展内存层次结构
  • 批准号:
    1318384
  • 财政年份:
    2013
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant

相似国自然基金

面向软件定义芯片的硬件真随机数发生器关键技术研究
  • 批准号:
    62104129
  • 批准年份:
    2021
  • 资助金额:
    20 万元
  • 项目类别:
    青年科学基金项目
泛在系统软件的软硬件协同设计方法及其结构模型和运行机理
  • 批准号:
  • 批准年份:
    2020
  • 资助金额:
    293 万元
  • 项目类别:
    重点项目
面向应用安全的特权级硬件挖掘研究
  • 批准号:
    61902374
  • 批准年份:
    2019
  • 资助金额:
    27.0 万元
  • 项目类别:
    青年科学基金项目
基于NVRAM的高性能软件栈关键技术研究
  • 批准号:
    61902405
  • 批准年份:
    2019
  • 资助金额:
    26.0 万元
  • 项目类别:
    青年科学基金项目
软件定义硬件中数据密集型应用的高性能映射技术研究
  • 批准号:
    61804017
  • 批准年份:
    2018
  • 资助金额:
    27.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

CAREER: Data-Driven Hardware and Software Techniques to Enable Sustainable Data Center Services
职业:数据驱动的硬件和软件技术,以实现可持续的数据中心服务
  • 批准号:
    2340042
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
CAREER: Efficient Large Language Model Inference Through Codesign: Adaptable Software Partitioning and FPGA-based Distributed Hardware
职业:通过协同设计进行高效的大型语言模型推理:适应性软件分区和基于 FPGA 的分布式硬件
  • 批准号:
    2339084
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
CAREER: Enabling Scalable and Resilient Quantum Computer Architectures through Synergistic Hardware-Software Co-Design
职业:通过协同硬件软件协同设计实现可扩展且有弹性的量子计算机架构
  • 批准号:
    2340267
  • 财政年份:
    2024
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
CAREER: Enabling Robust and Adaptive Architectures through a Decoupled Security-Centric Hardware/Software Stack
职业:通过解耦的以安全为中心的硬件/软件堆栈实现鲁棒性和自适应架构
  • 批准号:
    2238548
  • 财政年份:
    2023
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
CAREER: Countering Emerging Software Threats with Adaptive Hardening, Debloating, and Hardware-assisted Protection
职业:通过自适应强化、反膨胀和硬件辅助保护来应对新兴软件威胁
  • 批准号:
    2238467
  • 财政年份:
    2023
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了