PPoSS: Planning: Cross-Layer Design for Cost-Effective HPC in the Cloud
PPoSS:规划:云中经济高效 HPC 的跨层设计
基本信息
- 批准号:2028929
- 负责人:
- 金额:$ 25万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-10-01 至 2022-09-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Many high-performance computing (HPC) applications of national importance (e.g., nuclear simulations, climate modeling, drug discovery, epidemiology, and finance) process enormous datasets and have significant resource demands and strict performance/accuracy/power constraints. Ever-changing hardware elements (e.g., emerging new compute elements) and systems software (continuous fixes to operating systems, compilers and runtime systems) make hosting such HPC applications in locally-managed compute platforms increasingly less attractive. A promising alternate approach is to host these applications in the cloud. However, making legacy HPC applications cloud-ready and identifying the best blend of cloud services for a given application are significant challenges that need to be addressed. In this project, a holistic, cross-layer approach is taken to address the problem of securely mounting such HPC applications in the cloud with high efficiency, low cost, and good performance. A key distinguishing aspect of this project is that it combines both compile-time and run-time innovations and makes contributions to both client and cloud-provider sides. This project spans the following five complementary thrusts, all of which are made challenging by the increasing complexity and scale of the HPC applications of interest, and by the complexity of cloud service offerings and application service-level objectives: (i) characterizing HPC application behavior on myriad cloud infrastructural options; (ii) compiler support for HPC application cloudization; (iii) novel programming language support -- Object-as-a-Service (OaaS); (iv) workload placement and scheduling support; and (v) systems software support for PaaS/SaaS on heterogeneous hardware. The ultimate goal of this project is to devise systematic methodologies for mapping HPC applications to different types of services (spanning IaaS, SaaS, FaaS, OaaS) in multi/hybrid-cloud. This research facilitates improvements in the costs of running HPC applications. This project also enables easy transitioning of HPC applications from one cloud to another and provides data for cloud architecture designers to tune their systems better for current and future HPC workloads. In addition to its technical contributions, this project involves various educational and outreach activities as well. In particular, a new graduate curriculum for cloud computing focusing on HPC applications is created and freely disseminated. Finally, the code being developed and experimental results collected are documented and open-sourced.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
国家重要性(例如核模拟,气候建模,药物发现,流行病学和金融)的许多高性能计算(HPC)应用程序的应用程序庞大的数据集,并且具有巨大的资源需求以及严格的性能/准确性/功率约束。不断变化的硬件元素(例如,新兴的新计算元素)和系统软件(连续修复操作系统,编译器和运行时系统)使在本地管理的计算平台中托管此类HPC应用程序的吸引力越来越小。 一种有希望的替代方法是将这些应用程序托管在云中。 但是,将旧版HPC应用程序云准备好并确定给定应用程序的云服务的最佳融合是需要解决的重大挑战。在这个项目中,采用了整体,跨层的方法来解决以高效率,低成本和良好的性能将此类HPC应用程序牢固地安装在云中的问题。该项目的一个关键区别是,它结合了编译时间和运行时创新,并为客户和云提供者提供贡献。该项目跨越了以下五个互补推力,所有这些项目都是由于感兴趣的HPC应用程序的复杂性和规模日益增加,以及云服务产品的复杂性和应用程序服务级别的目标的挑战。 (ii)编译器支持HPC应用程序云; (iii)新颖的编程语言支持-Abs-As-A-Service(OAAS); (iv)工作负载安排和调度支持; (v)系统软件支持PAAS/SaaS在异质硬件上。 该项目的最终目的是将系统方法论映射到多类/混合云中的不同类型的服务(跨越IaaS,SaaS,FAAS,OAAS)。 这项研究促进了运行HPC应用程序成本的改善。该项目还可以轻松地将HPC应用程序从一个云转换为另一个云,并为云体系结构设计人员提供数据,以更好地调整其系统,以实现当前和将来的HPC工作负载。除了其技术贡献外,该项目还涉及各种教育和外展活动。特别是,创建并自由传播了针对HPC应用程序的云计算的新的研究生课程。 最后,正在制定的代码和收集的实验结果记录和开源。该奖项反映了NSF的法定任务,并使用基金会的知识分子优点和更广泛的影响审查标准,被认为值得通过评估来获得支持。
项目成果
期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
SHOWAR: Right-Sizing And Efficient Scheduling of Microservices
- DOI:10.1145/3472883.3486999
- 发表时间:2021-11
- 期刊:
- 影响因子:0
- 作者:A. F. Baarzi;G. Kesidis
- 通讯作者:A. F. Baarzi;G. Kesidis
Splice: An Automated Framework for Cost-and Performance-Aware Blending of Cloud Services
- DOI:10.1109/ccgrid54584.2022.00021
- 发表时间:2022-05
- 期刊:
- 影响因子:0
- 作者:Myungjun Son;S. Mohanty;Jashwant Raj Gunasekaran;Aman Jain;M. Kandemir;G. Kesidis;B. Urgaonkar
- 通讯作者:Myungjun Son;S. Mohanty;Jashwant Raj Gunasekaran;Aman Jain;M. Kandemir;G. Kesidis;B. Urgaonkar
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Mahmut Kandemir其他文献
A case for core-assisted bottleneck acceleration in GPUs
GPU 中核心辅助瓶颈加速的案例
- DOI:
- 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Nandita Vijaykumar;Gennady Pekhimenko;Adwait Jog;A. Bhowmick;Rachata Ausavarungnirun;Chita R. Das;Mahmut Kandemir;T. Mowry;O. Mutlu - 通讯作者:
O. Mutlu
Time-constrained optimization of multi-AUV cooperative mine detection
多AUV协同探雷的时间约束优化
- DOI:
10.1109/oceans.2008.5151971 - 发表时间:
2008 - 期刊:
- 影响因子:0
- 作者:
R. Prins;Mahmut Kandemir - 通讯作者:
Mahmut Kandemir
Mahmut Kandemir的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Mahmut Kandemir', 18)}}的其他基金
Collaborative Research: CNS Core: Small: Resource-efficient, Strongly Consistent Replication for the Cloud
合作研究:CNS 核心:小型:资源高效、强一致性的云复制
- 批准号:
2149389 - 财政年份:2022
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
SaTC: CORE: Small: Automatic Software Patching against Microarchitectual Attacks
SaTC:核心:小型:针对微架构攻击的自动软件修补
- 批准号:
1956032 - 财政年份:2020
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
SHF: Small: Characterizing and Optimizing 3D NAND Flash
SHF:小型:表征和优化 3D NAND 闪存
- 批准号:
1908793 - 财政年份:2019
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Frameworks: Re-Engineering Galaxy for Performance, Scalability and Energy Efficiency
框架:重新设计 Galaxy 以提高性能、可扩展性和能源效率
- 批准号:
1931531 - 财政年份:2019
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
XPS: FULL: A Fresh Look at Near Data Computing: Coordinated Data and Computation Government
XPS:完整:近数据计算的新视角:协调数据和计算政府
- 批准号:
1629129 - 财政年份:2016
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
CSR: Medium: Collaborative Research: Enabling GPUs as First-Class Computing Engines
CSR:媒介:协作研究:使 GPU 成为一流的计算引擎
- 批准号:
1409095 - 财政年份:2014
- 资助金额:
$ 25万 - 项目类别:
Continuing Grant
XPS: FULL:CCA: Extracting Scalable Parallelism by Relaxing the Contracts across the System Stack
XPS:FULL:CCA:通过放松整个系统堆栈的契约来提取可扩展的并行性
- 批准号:
1439021 - 财政年份:2014
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
SHF: Medium: Breaking the Physical Divide between Computation and NAND-Flash Storage
SHF:媒介:打破计算和 NAND 闪存存储之间的物理鸿沟
- 批准号:
1302557 - 财政年份:2013
- 资助金额:
$ 25万 - 项目类别:
Continuing Grant
SHF: Medium: Automatic Control Driven Resource Management in Chip Multiprocessors
SHF:中:芯片多处理器中自动控制驱动的资源管理
- 批准号:
0963839 - 财政年份:2010
- 资助金额:
$ 25万 - 项目类别:
Continuing Grant
Collaborative Research: Adaptive Techniques for Achieving End-to-End QoS in the I/O Stack on Petascale Multiprocessors
协作研究:在千万级多处理器上的 I/O 堆栈中实现端到端 QoS 的自适应技术
- 批准号:
0937949 - 财政年份:2009
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
相似国自然基金
交流微网逆变器BP神经网络和凸二次规划模型预测控制研究
- 批准号:52377195
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
面向绿色出行的智能路径计算与规划技术研究
- 批准号:62372194
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
抛光机器人柔性变刚度并联执行器宏微协调运动规划与主被动柔顺控制
- 批准号:52305016
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
乡村聚落空间分异机制及规划调控研究——以浙江地区为例
- 批准号:52378067
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
变刚度S形进气道曲线纤维路径规划及协同铺丝工艺研究
- 批准号:52305026
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:规划:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
- 批准号:
2217028 - 财政年份:2022
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:规划:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
- 批准号:
2217086 - 财政年份:2022
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:规划:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
- 批准号:
2247309 - 财政年份:2022
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:规划:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
- 批准号:
2217010 - 财政年份:2022
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: PPoSS: Planning: Cross-layer Coordination and Optimization for Scalable and Sparse Tensor Networks (CROSS)
合作研究:PPoSS:规划:可扩展和稀疏张量网络的跨层协调和优化(CROSS)
- 批准号:
2217020 - 财政年份:2022
- 资助金额:
$ 25万 - 项目类别:
Standard Grant