Collaborative Research: SI2-SSI: EVOLVE: Enhancing the Open MPI Software for Next Generation Architectures and Applications

合作研究:SI2-SSI:EVOLVE:增强下一代架构和应用的开放式 MPI 软件

基本信息

  • 批准号:
    1663887
  • 负责人:
  • 金额:
    $ 30.88万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2017
  • 资助国家:
    美国
  • 起止时间:
    2017-06-01 至 2022-05-31
  • 项目状态:
    已结题

项目摘要

For nearly two decades, the Message Passing Interface (MPI) has been an essential part of the High-Performance Computing ecosystem and consequently a key enabler for important scientific breakthroughs. It is a fundamental building block for most large-scale simulations from physics, chemistry, biology, material sciences as engineering. Open MPI is an open source implementation of the MPI specification, widely used and adopted by the research community as well as industry. The Open MPI library is jointly developed and maintained by a consortium of academic institutions, national labs and industrial partners. It is installed on virtually all large-scale computer systems in the US as well as in the rest of the world. The goal of this project is to enhance and modernize the Open MPI library in the context of the ongoing evolution of modern computer systems, and to ensure its future operability on all upcoming architectures. We aim at implementing fundamental software techniques that can be used in many-core systems to execute MPI-based parallel applications more efficiently, and to tolerate process and memory failures at all scales, from current systems, up to the extreme scales expected before the end of the decade.Open MPI is an open source implementation of the Message Passing Interface (MPI) specification. The MPI API is currently being extended to consider the needs of application developers in terms of efficiency, productivity and resilience. The project will also support academic involvement in the design, development and evaluation of the Open MPI software, and ensure academic presence in the MPI Forum. The goal of this proposal is to enhance the Open MPI software library, focusing on two aspects: (1) Extend Open MPI to support new features of the MPI specification. Open MPI will continue to support all new features of current and upcoming MPI specifications. The two most significant areas within the context of this proposal are (a) extensions to better support hybrid programming models and (b) support for fault tolerance in MPI applications. To improve support for hybrid programming models, the MPI Forum is currently considering introducing the notion of MPI Endpoints, which could be used by different threads of an MPI rank to instantiate multiple separate communication contexts. The goal within this project is to develop an implementation of endpoints to support effective hybrid programming model, and to extend the concept to other aspects of parallel applications such as File I/O operations. One of the project partners (UTK) leads the current proposal in the MPI Forum to expose failures and ensure the continuation of the execution of MPI applications. In the context of this SSI proposal, the goal is to harden, improve, and expand the support of the existing ULFM implementation in Open MPI and thus enable end-users to design application-specific resilience approaches for future platforms. (2) Enhance the Open MPI core to support new architectures and improve scalability. While Open MPI has demonstrated very good scalability in the past, there is significant work to be done to ensure similarly good performance on future architectures. Specifically, we propose a groundbreaking rework of the startup environment that will improve process launch scalability, increase support for asynchronous progress of operations, enable support for accelerators, and reduce sensitivity to system noise. The project would also enhance the support for File I/O operations as part of the Open MPI package by expanding our work on highly scalable collective I/O operations through delegation and exploring the utilization of burst buffers as temporary storage.
近二十年来,消息传递接口 (MPI) 一直是高性能计算生态系统的重要组成部分,因此也是重要科学突破的关键推动者。它是物理、化学、生物学、材料科学和工程领域大多数大规模模拟的基本构建模块。 Open MPI 是 MPI 规范的开源实现,被研究界和行业广泛使用和采用。 Open MPI 库由学术机构、国家实验室和工业合作伙伴组成的联盟共同开发和维护。它几乎安装在美国以及世界其他地区的所有大型计算机系统上。该项目的目标是在现代计算机系统不断发展的背景下增强和现代化 Open MPI 库,并确保其未来在所有即将推出的架构上的可操作性。我们的目标是实现可用于多核系统的基本软件技术,以更有效地执行基于 MPI 的并行应用程序,并容忍所有规模的进程和内存故障,从当前系统到最终预期的极端规模Open MPI 是消息传递接口 (MPI) 规范的开源实现。 MPI API 目前正在扩展,以考虑应用程序开发人员在效率、生产力和弹性方面的需求。该项目还将支持学术界参与 Open MPI 软件的设计、开发和评估,并确保学术界在 MPI 论坛中的存在。该提案的目标是增强Open MPI软件库,重点关注两个方面:(1)扩展Open MPI以支持MPI规范的新功能。 Open MPI 将继续支持当前和即将推出的 MPI 规范的所有新功能。该提案中两个最重要的领域是(a)更好地支持混合编程模型的扩展和(b)对 MPI 应用程序中容错的支持。为了改善对混合编程模型的支持,MPI 论坛目前正在考虑引入 MPI 端点的概念,MPI 级别的不同线程可以使用它来实例化多个单独的通信上下文。该项目的目标是开发端点的实现来支持有效的混合编程模型,并将概念扩展到并行应用程序的其他方面,例如文件 I/O 操作。项目合作伙伴之一 (UTK) 在 MPI 论坛中主导了当前的提案,以揭露故障并确保 MPI 应用程序的继续执行。在此 SSI 提案的背景下,目标是强化、改进和扩展对 Open MPI 中现有 ULFM 实施的支持,从而使最终用户能够为未来平台设计特定于应用程序的弹性方法。 (2)增强Open MPI核心以支持新架构并提高可扩展性。虽然 Open MPI 过去已展现出非常好的可扩展性,但要确保在未来架构上具有同样良好的性能,还有大量工作要做。具体来说,我们建议对启动环境进行突破性的改造,以提高流程启动的可扩展性,增加对异步操作进度的支持,启用对加速器的支持,并降低对系统噪声的敏感性。该项目还将通过委托扩展我们对高度可扩展的集体 I/O 操作的工作,并探索使用突发缓冲区作为临时存储,从而增强对作为 Open MPI 包一部分的文件 I/O 操作的支持。

项目成果

期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
On Overlapping Communication and File I/O in Collective Write Operation
Parallel I/O on Compressed Data Files: Semantics, Algorithms, and Performance Evaluation
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Edgar Gabriel其他文献

Edgar Gabriel的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Edgar Gabriel', 18)}}的其他基金

SI2-SSE: Collaborative Research: ADAPT: Next Generation Message Passing Interface (MPI) Library - Open MPI
SI2-SSE:协作研究:ADAPT:下一代消息传递接口 (MPI) 库 - 开放 MPI
  • 批准号:
    1339763
  • 财政年份:
    2013
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Standard Grant
SI2-SSI: Collaborative Research: A Glass Box Approach to Enabling Open, Deep Interactions in the HPC Toolchain
SI2-SSI:协作研究:在 HPC 工具链中实现开放、深度交互的玻璃盒方法
  • 批准号:
    1148052
  • 财政年份:
    2012
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Standard Grant
II-NEW: A Heterogeneous Testbed for Exploring Emerging HPC Tools, Programming Languages, and Applications
II-新:用于探索新兴 HPC 工具、编程语言和应用程序的异构测试平台
  • 批准号:
    0958464
  • 财政年份:
    2010
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Continuing Grant
CAREER: Dynamic Run-Time Optimization of Parallel, Adaptive and Hybrid Applications
职业:并行、自适应和混合应用程序的动态运行时优化
  • 批准号:
    0846002
  • 财政年份:
    2009
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Continuing Grant

相似国自然基金

离子型稀土渗流-应力-化学耦合作用机理与溶浸开采优化研究
  • 批准号:
    52364012
  • 批准年份:
    2023
  • 资助金额:
    32 万元
  • 项目类别:
    地区科学基金项目
亲环蛋白调控作物与蚜虫互作分子机制的研究
  • 批准号:
    32301770
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
基于金属-多酚网络衍生多相吸波体的界面调控及电磁响应机制研究
  • 批准号:
    52302362
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
职场网络闲逛行为的作用结果及其反馈效应——基于行为者和观察者视角的整合研究
  • 批准号:
    72302108
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
EIF6负调控Dicer活性促进EV71复制的分子机制研究
  • 批准号:
    32300133
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
  • 批准号:
    2039142
  • 财政年份:
    2020
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Standard Grant
SI2-SSI: Collaborative Research: Einstein Toolkit Community Integration and Data Exploration
SI2-SSI:协作研究:Einstein Toolkit 社区集成和数据探索
  • 批准号:
    2114580
  • 财政年份:
    2020
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Continuing Grant
Collaborative Research: SI2-SSI: Expanding Volunteer Computing
合作研究:SI2-SSI:扩展志愿者计算
  • 批准号:
    2001752
  • 财政年份:
    2019
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Standard Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743178
  • 财政年份:
    2018
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Continuing Grant
Collaborative Research: NISC SI2-S2I2 Conceptualization of CFDSI: Model, Data, and Analysis Integration for End-to-End Support of Fluid Dynamics Discovery and Innovation
合作研究:NISC SI2-S2I2 CFDSI 概念化:模型、数据和分析集成,用于流体动力学发现和创新的端到端支持
  • 批准号:
    1743185
  • 财政年份:
    2018
  • 资助金额:
    $ 30.88万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了