CSR: Small: Accelerating Data Intensive Scientific Workflows with Consistency Contracts
CSR:小:通过一致性合同加速数据密集型科学工作流程
基本信息
- 批准号:2317556
- 负责人:
- 金额:$ 59.97万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-10-01 至 2026-09-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Advanced discovery in scientific computing increasingly depends upon the successful execution of complex workflows that combine multiple applications together to run in concert on a large high performance cluster. A widespread challenge in this setting is the performance of the shared parallel filesystem. Because each file system interaction has very different needs in terms of performance and consistency, the filesystem is obliged to follow the most conservative approach to handle the worst case. As a result, peak performance is rarely achieved. We propose "consistency contracts" as the solution to this problem. This novel approach requires the workflow as a whole to declare its intended uses of the file system at the start of each execution, allowing the runtime system to perform a variety of optimizations. This project will evaluate the concept of consistency contracts by constructing an experimental system (Pledge) that enables and enforces contracts on existing data intensive workflows, with minimal disruption to current practice. We hypothesize that this approach will significantly improve performance for data intensive scientific applications running on high performance clusters,and has the potential to be more widely applied. Our focus on consistency results from our observation that today's shared parallel filesystem is asked to fill multiple roles: moving large files, delivering complex software trees, providing buffers between tasks, and providing synchronization between tasks. Current filesystems provide the most conservative sequential consistency to handle the worst case. Rather than depend upon the shared filesystem to perform last-minute runtime arbitration of every individual filesystem operation, we argue that the workflow as a whole should declare its intentions for the duration of the execution, indicating the paths, access modes, and consistency requirements needed for the entire workflow run. With a contract in hand, the runtime system can then perform a variety of optimizations that exploit the internal storage and I/O capacity of the cluster as a whole, for example utilizing a streamlined approach for read-only access. We hypothesize that workflow-level consistency management will yield higher effective I/O bandwidth and transaction rates than strict global consistency management for data intensive scientific applications running on high performance clusters. These improved I/O rates will translate into faster end-to-end runtimes and fewer unexpected performance failures for end users and system administrators.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
科学计算中的高级发现越来越多地取决于成功执行复杂的工作流,这些工作流将多个应用程序结合在一起,以在大型高性能集群上共同运行。在这种情况下,广泛的挑战是共享并行文件系统的性能。 由于每个文件系统交互在性能和一致性方面都有不同的需求,因此文件系统必须遵循最保守的方法来处理最坏情况。 结果,很少实现峰值性能。我们建议“一致性合同”作为解决此问题的解决方案。 这种新颖的方法要求整个工作流程在每个执行开始时声明其预期的文件系统用途,从而允许运行时系统执行各种优化。该项目将通过构建实验系统(质疑)来评估一致性合同的概念,该系统能够在现有数据密集型工作流程上实现和执行合同,而对当前实践的中断最小。我们假设这种方法将显着提高在高性能集群上运行的数据密集型科学应用程序的性能,并且有可能更广泛地应用。我们关注的关注一致性结果是,我们的观察结果是,当今的共享并行文件系统被要求填补多个角色:移动大文件,提供复杂的软件树,提供任务之间的缓冲区以及在任务之间提供同步。当前文件系统提供了处理最坏情况的最保守的顺序一致性。 与其依靠共享文件系统执行每个单个文件系统操作的最后一刻运行时仲裁,我们认为整个工作流程应在执行期间声明其意图,表明整个工作流程运行所需的路径,访问模式以及一致性要求。 借助合同,运行时系统可以执行各种优化,以利用整个集群的内部存储和I/O容量,例如使用流线方法来读取仅读取访问。我们假设工作流程级的一致性管理将产生比严格的全球一致性管理的有效I/O带宽和交易率更高的数据密集型科学应用程序,以高性能集群运行。这些提高的I/O利率将转化为更快的端到端运行时间,对于最终用户和系统管理员而言,意外的性能失败较少。该奖项反映了NSF的法定任务,并被认为是值得通过基金会的知识分子优点和更广泛影响的评估评估来审查标准的。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

暂无数据
数据更新时间:2024-06-01
Douglas Thain其他文献
Poster: Robust Meta-Workflow Management with Mufasa
海报:使用 Mufasa 进行稳健的元工作流程管理
- DOI:
- 发表时间:20222022
- 期刊:
- 影响因子:0
- 作者:Ben Lyons;Douglas ThainBen Lyons;Douglas Thain
- 通讯作者:Douglas ThainDouglas Thain
共 1 条
- 1
Douglas Thain的其他基金
CSSI Elements: DataSwarm: A User-Level Framework for Data Intensive Scientific Applications
CSSI 元素:DataSwarm:数据密集型科学应用程序的用户级框架
- 批准号:19313481931348
- 财政年份:2019
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
REU Site: Data Intensive Scientific Computing
REU 站点:数据密集型科学计算
- 批准号:15603631560363
- 财政年份:2016
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
SI2-SSE: Scaling up Science on Cyberinfrastructure with the Cooperative Computing Tools
SI2-SSE:利用协作计算工具扩大网络基础设施科学规模
- 批准号:16424091642409
- 财政年份:2016
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
Collaborative Research: Software Sustainability: an SI^2 PI Workshop
协作研究:软件可持续性:SI^2 PI 研讨会
- 批准号:14191321419132
- 财政年份:2014
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
SI2-SSE: Connecting Cyberinfrastructure with the Cooperative Computing Tools
SI2-SSE:将网络基础设施与协作计算工具连接起来
- 批准号:11483301148330
- 财政年份:2012
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
Collaborative Research: II-New: Distributed Research Testbed (DiRT)
协作研究:II-新:分布式研究测试台 (DiRT)
- 批准号:08550470855047
- 财政年份:2009
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
CAREER: Data Intensive Grid Computing on Active Storage Clusters
职业:活动存储集群上的数据密集型网格计算
- 批准号:06432290643229
- 财政年份:2007
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Continuing GrantContinuing Grant
HECURADeconstructing Clusters for High End Biometric Applications
HECURA解构高端生物识别应用的集群
- 批准号:06214340621434
- 财政年份:2007
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
SGER: Enabling Electronic Self-Defense with Dynamic Identities
SGER:通过动态身份实现电子自卫
- 批准号:05490870549087
- 财政年份:2005
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
相似国自然基金
小胶质细胞NOX2在下丘脑促觉醒神经肽Orexin介导的β淀粉样蛋白加速阿尔茨海默病进程中的作用及分子机制研究
- 批准号:
- 批准年份:2019
- 资助金额:55 万元
- 项目类别:面上项目
小分子多肽跨膜输运序列选择性及其物理机制的理论研究
- 批准号:11804151
- 批准年份:2018
- 资助金额:27.0 万元
- 项目类别:青年科学基金项目
基于深度学习的小物体检测及其异构计算技术研究
- 批准号:61872200
- 批准年份:2018
- 资助金额:64.0 万元
- 项目类别:面上项目
内皮抗衰老蛋白SIRT1调控组织因子在巨细胞病毒隐性感染协同高脂血症加速脑小血管血栓形成中的作用
- 批准号:81801384
- 批准年份:2018
- 资助金额:21.0 万元
- 项目类别:青年科学基金项目
磁层和近地太阳风里小尺度磁结构的观测研究
- 批准号:41774153
- 批准年份:2017
- 资助金额:70.0 万元
- 项目类别:面上项目
相似海外基金
CC* INTEGRATION-SMALL: ADIABATIC MICROSERVICE LEVEL LOAD BALANCED FORWARDING ON PISA SWITCH FOR ACCELERATING URGENT PROCESSES IN SCIENCE DATA CENTER NETWORKS
CC* 集成小型:PISA 交换机上的绝热微服务级负载平衡转发,用于加速科学数据中心网络中的紧急进程
- 批准号:23467292346729
- 财政年份:2024
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Standard GrantStandard Grant
Tele-FootX: Virtually Supervised Tele-Exercise Platform for Accelerating Plantar Wound Healing
Tele-FootX:用于加速足底伤口愈合的虚拟监督远程锻炼平台
- 批准号:1070132410701324
- 财政年份:2023
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:
New Technologies for Accelerating the Discovery and Characterization of Neuroactives that Address Substance Use Disorders
加速发现和表征解决药物使用障碍的神经活性物质的新技术
- 批准号:1068075410680754
- 财政年份:2023
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:
Accelerating Functional Maturation of Human iPSC-Derived Astrocytes
加速人 iPSC 衍生的星形胶质细胞的功能成熟
- 批准号:1069950510699505
- 财政年份:2023
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:
SBIR Phase II: Accelerating R&D through Streamlined Machine Learning Algorithms for Small Data Applications in Advanced Manufacturing
SBIR 第二阶段:加速 R
- 批准号:23250452325045
- 财政年份:2023
- 资助金额:$ 59.97万$ 59.97万
- 项目类别:Cooperative AgreementCooperative Agreement