AF: Small: Communication-Aware Algorithms for Dynamic Allocation of Heterogeneous Resources
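Basic information
- Award number: 2335187
- Principal investigator:
- Amount: $599,400
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2024
- Funding country: United States
- Period: 2024-02-15 to 2027-01-31
- Project status: Ongoing
- Source:
- Keywords:
Project abstract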
Modern computer infrastructure systems need to manage vast heterogeneous resources and run computations that are complex, distributed, and dynamic. Often, the distributed nature of the computations demands efficient communication; dynamic resources and computations demand online solutions without full knowledge of the future. For instance, scalable artificial intelligence (AI) requires the effective mapping of dynamic neural network computations into networks of computing devices. This project concerns the design of efficient and effective algorithms for scheduling large-scale computations in distributed infrastructure such as cloud systems and datacenter networks. The expected outcomes of the project are solutions for more effective processing of large AI tasks and resource allocation policies in cloud computing systems, with improved performance for business operations and mission-critical systems. The integrated educational component of the project includes training undergraduate and doctoral students in infrastructure algorithms, curriculum development, and outreach to engage high school students and inspire them to explore careers in math and computing.
This project has two major thrusts. The first concerns the scheduling of precedence-constrained jobs and computation graphs in distributed networks and reconfigurable machines. This is motivated by the fact that as computational workloads get larger and more complex, it is often necessary to distribute many communicating jobs across a large network of devices. These devices may have different speeds, different computing capabilities, and restrictions on which jobs they can execute due to resource and security concerns. The second thrust concerns the online migration of computations and servers in a distributed system in response to dynamic requests. One motivation comes from data-intensive applications that generate significant network traffic; to enable efficient communication among processes dispersed across many clusters, distributed systems are increasingly reconfigurable and strategically migrate processes to reduce communication. The presented problems include communication-aware scheduling of precedence-constrained jobs in networks with general delays, scheduling of splittable jobs in reconfigurable machines, minimum-stretch embedding of graphs, heterogeneous variants of the classic online k-server problem, and online balanced graph partitioning. The technical approaches include new linear-programming-based techniques that address communication and topological constraints, and new methods in online algorithms. This project also explores new frameworks for studying these problems, including learning-augmented algorithms through predictions of processing times and communication needs, reconfigurable architectures, and mobile ad hoc networks with heterogeneous devices.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Rajmohan Rajaraman
Stability of the rotating Skyrmion.
- DOI:
- Year published: 1986
- Journal:
- Impact factor: 0
- Authors: Rajmohan Rajaraman; H. Sommermann; Jochen Wambach; H. Wyld
- Corresponding author: H. Wyld
Other grants by Rajmohan Rajaraman
AF: Small: Embedding Distributed Computations and Flows in Networks
- Award number: 1909363
- Fiscal year: 2019
- Amount: $599,400
- Project type: Standard Grant
AF: Small: Network Algorithms Under Adversarial and Stochastic Uncertainty
- Award number: 1422715
- Fiscal year: 2014
- Amount: $599,400
- Project type: Standard Grant
ICES: Large: Collaborative: The Role of Space, Time and Information in Controlling Epidemics
- Award number: 1216038
- Fiscal year: 2012
- Amount: $599,400
- Project type: Standard Grant
How Blissful is Ignorance? The Role of Obliviousness in Network Optimization
- Award number: 0635119
- Fiscal year: 2006
- Amount: $599,400
- Project type: Standard Grant
CAREER: Algorithms for Organizing and Scheduling Distributed Resources
- Award number: 9983901
- Fiscal year: 2000
- Amount: $599,400
- Project type: Continuing Grant
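Similar NSFC (National Natural Science Foundation of China) grants
Screening of small-molecule inhibitors targeting Treg FOXP3 and their role and mechanism in lung cancer immunotherapy
- Award number: 32370966
- Year approved: 2023
- Amount: CNY 500,000
- Project type: General Program
Epigenetic mechanisms by which chemical small-molecule activation of YAP induces chromatin plasticity to promote cardiac progenitor cell reprogramming
- Award number: 82304478
- Year approved: 2023
- Amount: CNY 300,000
- Project type: Young Scientists Fund
Construction and mechanism of action of microglia-targeting biomimetic glycyrrhizic acid nanoparticles: a new therapeutic strategy for sepsis-associated encephalopathy
- Award number: 82302422
- Year approved: 2023
- Amount: CNY 300,000
- Project type: Young Scientists Fund
Role and mechanism of microglial pyroptosis mediated by the HMGB1/TLR4/Cathepsin B pathway in hypoxic-ischemic encephalopathy in neonatal rats
- Award number: 82371712
- Year approved: 2023
- Amount: CNY 490,000
- Project type: General Program
Role and mechanism of small cysteine-free proteins in regulating the insecticidal activity of biocontrol fungi
- Award number: 32372613
- Year approved: 2023
- Amount: CNY 500,000
- Project type: General Program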
Similar overseas grants
AF: Small: Polynomials, Communication, and Query Complexity
- Award number: 2220232
- Fiscal year: 2022
- Amount: $599,400
- Project type: Standard Grant
AF: Small: Fundamental Questions in Communication and Computation Regarding Edit Type String Measures
- Award number: 2127575
- Fiscal year: 2021
- Amount: $599,400
- Project type: Standard Grant
AF: Small: Challenges in Communication Complexity and Pseudorandomness
- Award number: 2007682
- Fiscal year: 2020
- Amount: $599,400
- Project type: Standard Grant
AF: Small: Intermediate models between communication complexity and query complexity
- Award number: 2006443
- Fiscal year: 2020
- Amount: $599,400
- Project type: Standard Grant
CCF: AF: Small: Algorithms, Parallelism and Communication Efficiency in Shortest Path Computations
- Award number: 2008241
- Fiscal year: 2020
- Amount: $599,400
- Project type: Standard Grant