CAREER: Rethinking PIM-Assisted GPU Computing for Multi-Tenant Artificial Intelligence

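Basic Information

  • Award number:
    2239638
  • Principal investigator:
  • Amount:
    $539,000
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Continuing Grant
  • Fiscal year:
    2023
  • Funding country:
    United States
  • Project period:
    2023-08-01 to 2028-07-31
  • Project status:
    Ongoing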

Project Abstract

Artificial intelligence (AI) systems have entered the “multi-tenant” era, where multiple deep neural network (DNN) models are executed simultaneously. This involves concurrent deployment, computation, and interaction of multiple DNN models, increasing computational complexity and triggering new challenges: (1) How can a scalable and flexible computing architecture be realized that can adaptively host heterogeneous and concurrent DNN models? (2) How can computing flexibility requirements in multi-tenant DNN scenarios be met? (3) How can an efficient, end-to-end toolchain for building next-generation AI applications be realized in this context? This project addresses these challenges through three research thrusts: Thrust 1 investigates a novel processing-in-memory (PIM)-assisted graphics processing unit (GPU) architecture with innovative multi-tenant support, addressing important resource contention and model interaction issues. Thrust 2 explores dedicated GPU- and PIM-oriented scheduling techniques to enhance the platform’s performance. Finally, Thrust 3 further enhances the multi-tenant AI application development cycle with algorithm optimization and code deployment support.
With the successful completion of these thrusts, this project can achieve breakthroughs in modern AI computing and support the next generation of AI applications. The proposed techniques have the potential to accelerate AI design and deployment, spurring even wider AI utilization. This can contribute to application areas of societal importance, including autonomous driving, metaverse immersion, smart agriculture, and industrial infrastructure.
This project will also benefit students, and by consequence society, by incorporating research results within relevant courses, increasing the participation of women and other underrepresented groups in computing, and sharing research results with researchers, companies, and government agencies.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Project Outputs

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Chenchen Liu

Estimation of antenna phase center offset for BDS IGSO and MEO satellites
  • DOI:
    10.1007/s10291-018-0716-z
  • Published:
    2018-02
  • Journal:
  • Impact factor:
    4.9
  • Authors:
    Guanwen Huang;Xingyuan Yan;Qin Zhang;Chenchen Liu;Le Wang;Zhiwei Qin
  • Corresponding author:
    Zhiwei Qin
Joint relay-user selection in energy harvesting relay network with direct link
  • DOI:
    10.1016/j.phycom.2018.03.014
  • Published:
    2018-06
  • Journal:
  • Impact factor:
    2.2
  • Authors:
    Chenchen Liu;Tiejun Lv
  • Corresponding author:
    Tiejun Lv
Design techniques of eNVM-enabled neuromorphic computing systems
Towards Accurate and High-Speed Spiking Neuromorphic Systems with Data Quantization-Aware Deep Networks
Integrative analysis of the metabolomes and transcriptomes of Ebola virus-infected cells: Uncovering pathways related to hepatic apoptosis
  • DOI:
    10.1016/j.gendis.2024.101377
  • Published:
    2025-03-01
  • Journal:
  • Impact factor:
  • Authors:
    Chenchen Liu;Zengguo Cao;Cheng Peng;Fangxu Li;Zixi Chen;Xinghai Zhang;Xiaoying Jia;Jinge Zhou;Wenting Mao;Entao Li;Gengfu Xiao;Sandra Chiu
  • Corresponding author:
    Sandra Chiu

Other grants by Chenchen Liu

CRII:CSR: Enabling High-Performance Deep Learning Computing System via Software and Hardware Co-Optimized Reconfiguration
  • Award number:
    1850393
  • Fiscal year:
    2019
  • Funding amount:
    $539,000
  • Award type:
    Standard Grant
CRII:CSR: Enabling High-Performance Deep Learning Computing System via Software and Hardware Co-Optimized Reconfiguration
  • Award number:
    1939380
  • Fiscal year:
    2019
  • Funding amount:
    $539,000
  • Award type:
    Standard Grant

Similar overseas grants

Care and Repair: Rethinking Contemporary Curation for Conditions of Crisis
  • Award number:
    DP240102206
  • Fiscal year:
    2024
  • Funding amount:
    $539,000
  • Award type:
    Discovery Projects
A Brave New World for Japanese Shakespeare Adaptations: Rethinking Shakespeare Studies through Adaptations
  • Award number:
    23K21920
  • Fiscal year:
    2024
  • Funding amount:
    $539,000
  • Award type:
    Grant-in-Aid for Scientific Research (B)
PROTSENS Rethinking Alternative PROTein Extraction: Decoding SENsory-Protein Extraction Relationships
  • Award number:
    EP/Z000785/1
  • Fiscal year:
    2024
  • Funding amount:
    $539,000
  • Award type:
    Fellowship
Caring Communities 1800-present: Rethinking Children's Social Care
  • Award number:
    MR/X034968/1
  • Fiscal year:
    2024
  • Funding amount:
    $539,000
  • Award type:
    Fellowship
High-rise landscapes: The afterlives of tower block 'failure' and rethinking urban futures
  • Award number:
    MR/Y003586/1
  • Fiscal year:
    2024
  • Funding amount:
    $539,000
  • Award type:
    Fellowship