SHF:Medium:Collaborative Research:A comprehensive methodology to pursue reproducible accuracy in ensemble scientific simulations on multi- and many-core platforms
SHF:中:协作研究:在多核和众核平台上追求集合科学模拟的可重复精度的综合方法
基本信息
- 批准号:1841552
- 负责人:
- 金额:$ 15.73万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2018
- 资助国家:美国
- 起止时间:2018-06-01 至 2019-05-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Ensemble simulations of scientific phenomena typically run for weeks or even months on high-performance computing clusters. The already high level of concurrency of these computing environments is expected to significantly increase in the near future, causing simulations to suffer not only from numerical errors due to limited arithmetic precision but also from the non-determinism in the execution associated with multithreading. Ultimately this trend can compromise the simulation results and break the scientific community's trust in ensemble simulations. This project tackles this problem and defines a methodology to enable the reproducible accuracy of large ensemble simulations on exascale platforms that include multi- and many-core processors. This project moves along two major fronts. First, the investigators identify common sources of accuracy errors and study their accumulation, propagation, and runtime effects in a controlled environment. This phase includes three research activities: (i) generating code motifs that model those computations that may lead to accuracy errors; (ii) providing multiple implementations of these motifs, called code inspectors, targeting different parallel platforms; and (iii) evaluating the accuracy and runtime of these implementations using a variety of datasets and stress conditions. Second, by installing these code inspectors in real scientific code bases, the investigators study their behavior in uncertain environments. This phase includes two research activities: (i) prioritizing code segments based on quantitative impact scores and matching segments to inspector motifs; and (ii) finding the optimal code inspector implementations and patching the code with them so as to optimize the overall result variance. The applications targeted in this project are deterministic chaotic applications including n-body atomic system simulations and astrophysical simulations.
科学现象的集合模拟通常在高性能计算集群上运行数周甚至数月。这些计算环境已经很高的并发性在不久的将来会显着增加,这会导致模拟不仅因算术精度有限而遭受数值错误,而且会遭受与多线程相关的执行中的非确定性。最终,这种趋势可以损害模拟结果,并打破科学界对集合模拟的信任。该项目解决了此问题,并定义了一种方法,以使大型集合模拟在包括多核处理器在内的Exascale平台上的可重复准确性。 该项目沿两个主要战线移动。首先,研究人员确定了准确性错误的常见来源,并研究其在受控环境中的积累,传播和运行时效应。此阶段包括三个研究活动:(i)生成代码图案,以建模可能导致准确性错误的计算; (ii)提供这些图案的多个实现,称为代码检查员,以不同的并行平台为目标; (iii)使用各种数据集和应力条件评估这些实现的准确性和运行时间。其次,通过将这些代码检查员安装在实际的科学代码库中,研究人员在不确定的环境中研究了其行为。此阶段包括两项研究活动:(i)根据定量影响得分和匹配范围与检查员图案相匹配的代码段的优先级; (ii)找到最佳代码检查器实现并使用它们修补代码,以优化总体结果差异。该项目中针对的应用程序是确定性混乱应用,包括N体原子系统模拟和天体物理模拟。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Michela Taufer其他文献
Enhancing Scientific Research with FAIR Digital Objects in the National Science Data Fabric
利用国家科学数据结构中的 FAIR 数字对象加强科学研究
- DOI:
- 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Michela Taufer;Heberth Martinez;Jakob Luettgau;Lauren Whitnah;G. Scorzelli;P. Newell;Aashish Panta;P. Bremer;Douglas Fils;Christine R. Kirkpatrick;V. Pascucci;Kathryn Mohror;J. Shalf - 通讯作者:
J. Shalf
Integrating FAIR Digital Objects (FDOs) into the National Science Data Fabric (NSDF) to Revolutionize Dataflows for Scientific Discovery
将 FAIR 数字对象 (FDO) 集成到国家科学数据结构 (NSDF) 中,彻底改变科学发现的数据流
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Michela Taufer;Heberth Martinez;Jakob Luettgau;Lauren Whitnah;†. GiorgioScorzelli;†. PaniaNewel;Aashish Panta;Timo Bremer;§. DougFils;¶. ChristineR.Kirkpatrick;Nina McCurdy;V. Pascucci;U. Knoxville;†. U.Utah;R. LLNL ‡;Research Center - 通讯作者:
Research Center
Michela Taufer的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Michela Taufer', 18)}}的其他基金
EAGER: A Comprehensive Approach for Generating, Sharing, Searching, and Using High-Resolution Terrain Parameters
EAGER:生成、共享、搜索和使用高分辨率地形参数的综合方法
- 批准号:
2334945 - 财政年份:2023
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: SHF: Small: Model-driven Design and Optimization of Dataflows for Scientific Applications
协作研究:SHF:小型:科学应用数据流的模型驱动设计和优化
- 批准号:
2331152 - 财政年份:2023
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
SHF: Small: Methods, Workflows, and Data Commons for Reducing Training Costs in Neural Architecture Search on High-Performance Computing Platforms
SHF:小型:降低高性能计算平台上神经架构搜索训练成本的方法、工作流程和数据共享
- 批准号:
2223704 - 财政年份:2022
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: Elements: SENSORY: Software Ecosystem for kNowledge diScOveRY - a data-driven framework for soil moisture applications
协作研究:要素:SENSORY:知识发现的软件生态系统 - 土壤湿度应用的数据驱动框架
- 批准号:
2103845 - 财政年份:2021
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: PPoSS: Planning: Performance Scalability, Trust, and Reproducibility: A Community Roadmap to Robust Science in High-throughput Applications
协作研究:PPoSS:规划:性能可扩展性、信任和可重复性:高通量应用中稳健科学的社区路线图
- 批准号:
2028923 - 财政年份:2020
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: EAGER: Advancing Reproducibility in Multi-Messenger Astrophysics
合作研究:EAGER:提高多信使天体物理学的可重复性
- 批准号:
2041977 - 财政年份:2020
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
SHF: Medium: Collaborative Research: ANACIN-X: Analysis and modeling of Nondeterminism and Associated Costs in eXtreme scale applications
SHF:中:协作研究:ANACIN-X:极端规模应用中的非确定性和相关成本的分析和建模
- 批准号:
1900888 - 财政年份:2019
- 资助金额:
$ 15.73万 - 项目类别:
Continuing Grant
Collaborative: EAGER: Exploring and Advancing the State of the Art in Robust Science in Gravitational Wave Physics
合作:EAGER:探索和推进引力波物理学稳健科学的最新技术
- 批准号:
1841399 - 财政年份:2018
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative: EAGER: Exploring and Advancing the State of the Art in Robust Science in Gravitational Wave Physics
合作:EAGER:探索和推进引力波物理学稳健科学的最新技术
- 批准号:
1823372 - 财政年份:2018
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: In Situ Data Analytics for Next Generation Molecular Dynamics Workflows
BIGDATA:IA:协作研究:下一代分子动力学工作流程的原位数据分析
- 批准号:
1841758 - 财政年份:2018
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
相似国自然基金
复合低维拓扑材料中等离激元增强光学响应的研究
- 批准号:12374288
- 批准年份:2023
- 资助金额:52 万元
- 项目类别:面上项目
基于管理市场和干预分工视角的消失中等企业:特征事实、内在机制和优化路径
- 批准号:72374217
- 批准年份:2023
- 资助金额:41.00 万元
- 项目类别:面上项目
托卡马克偏滤器中等离子体的多尺度算法与数值模拟研究
- 批准号:12371432
- 批准年份:2023
- 资助金额:43.5 万元
- 项目类别:面上项目
中等质量黑洞附近的暗物质分布及其IMRI系统引力波回波探测
- 批准号:12365008
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
中等垂直风切变下非对称型热带气旋快速增强的物理机制研究
- 批准号:42305004
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Collaborative Research: SHF: Medium: Differentiable Hardware Synthesis
合作研究:SHF:媒介:可微分硬件合成
- 批准号:
2403134 - 财政年份:2024
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: SHF: Medium: Enabling Graphics Processing Unit Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的图形处理单元性能仿真
- 批准号:
2402804 - 财政年份:2024
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: SHF: Medium: Tiny Chiplets for Big AI: A Reconfigurable-On-Package System
合作研究:SHF:中:用于大人工智能的微型芯片:可重新配置的封装系统
- 批准号:
2403408 - 财政年份:2024
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: SHF: Medium: Toward Understandability and Interpretability for Neural Language Models of Source Code
合作研究:SHF:媒介:实现源代码神经语言模型的可理解性和可解释性
- 批准号:
2423813 - 财政年份:2024
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant
Collaborative Research: SHF: Medium: Enabling GPU Performance Simulation for Large-Scale Workloads with Lightweight Simulation Methods
合作研究:SHF:中:通过轻量级仿真方法实现大规模工作负载的 GPU 性能仿真
- 批准号:
2402806 - 财政年份:2024
- 资助金额:
$ 15.73万 - 项目类别:
Standard Grant