SHF: Small: ML Accelerator Cohort Architecture
SHF:小型:ML 加速器群组架构
基本信息
- 批准号:2224319
- 负责人:
- 金额:$ 60万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-10-01 至 2025-09-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Machine learning (ML) models play an increasingly crucial role in people's daily lives: from autonomous driving, health care, data center management, to machine translation. To deal with the range of usages, model diversity has been growing: from convolutional neural networks, to embedding table-based recommendation systems, to graph neural networks to natural language processing. As model usage embeds in sensitive domains, such as speech- and emotion-recognition engines, data privacy is also a key requirement for the next generation ML hardware. As the model diversity has grown, the compute and communication demands also vary widely, both across models and within a model. And new privacy protecting execution models such as multi-party computing (MPC) demand fundamentally different hardware support. Hence, there is a need to rethink the design of ML hardware accelerators for the new era of privacy-preserving heterogeneous model usage. To address these concerns, this project focuses on the development of the ML Accelerator Cohort Architecture (MLACA). MLACA is collection of heterogeneous ML accelerator tiles that work in unison to adapt to the dynamically changing ML execution demands. MLACA uses multiple research thrusts to achieve its goals: the first thrust focuses on building a heterogeneous MLACA compute fabric that supports a wide range of dense and sparse execution paradigms, including novel support for private computing. The second thrust focuses on MLACA's memory and acceleration fabric that performs in-memory indexing acceleration for embedding tables, prefetching that uses perfect future knowledge of training data. MLACA's communication thrust focuses on distributed training acceleration, using techniques such as dynamic tensor decomposition that tradeoff computation and communication costs. The runtime system thrust manages MLACA fabric allocation across competing ML models to maximize the resource utilization and improve power efficiency. Technology transition is planned through strong industry collaborations with the USC-Meta center and Intel's Private AI Institute. This research uses NSF’s research experience for undergraduate funding, and USC's internal SURE and K-12 SHINE programs to engage high school students and teachers, and undergraduate students in preparing them for a career in ML systems design.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
机器学习(ML)模型在人们的日常生活中起着越来越重要的作用:从自主驾驶,医疗保健,数据中心管理到机器翻译。为了处理使用范围,模型多样性一直在增长:从卷积神经元网络到嵌入基于表的建议系统,再到将神经元网络绘制到自然语言处理。随着嵌入在敏感领域(例如语音和情感识别引擎)中的模型使用情况,数据隐私也是下一代ML硬件的关键要求。随着模型多样性的增长,跨模型和模型中的计算和通信需求也差异很大。以及保护执行模型(例如多方计算(MPC))的新隐私要求从根本上要求不同的硬件支持。因此,有必要重新考虑ML硬件加速器的设计,以实现隐私保护的新时代的异质模型使用。为了解决这些问题,该项目着重于ML加速器队列体系结构(MLACA)的开发。 MLACA是一致工作的异质ML加速器瓷砖的收集,以适应动态变化的ML执行要求。 MLACA使用多个研究推力来实现其目标:第一个推力重点是构建异质的MLACA计算织物,该织物支持广泛的密集和稀疏执行范式,包括对私人计算的新支持。第二个推力着重于MLACA的内存和加速织物,该结构执行内存索引加速度以嵌入表格,预购使用培训数据的完美未来知识。 MLACA的沟通推力专注于分布式培训加速,使用诸如权衡计算和通信成本的动态张量分解等技术。运行时系统推力管理跨竞争ML模型的MLACA织物分配,以最大程度地利用资源利用并提高功率效率。技术过渡是通过与USC-META中心和英特尔的私人AI研究所的强大行业合作计划的。这项研究利用NSF的研究经验用于本科资金,以及USC的内部确保和K-12 Shine计划,以吸引高中生和老师,并在本科生中为他们准备ML Systems Design的职业。这一奖项反映了NSF的法定任务,并通过使用该基金会的知识优点和广泛的影响来评估NSF的法定任务。
项目成果
期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
LAORAM: A Look Ahead ORAM Architecture for Training Large Embedding Tables
- DOI:10.1145/3579371.3589111
- 发表时间:2021-07
- 期刊:
- 影响因子:0
- 作者:Rachit Rajat;Yongqin Wang;M. Annavaram
- 通讯作者:Rachit Rajat;Yongqin Wang;M. Annavaram
Characterization of MPC-based Private Inference for Transformer-based Models
- DOI:10.1109/ispass55109.2022.00025
- 发表时间:2022-05
- 期刊:
- 影响因子:0
- 作者:Yongqin Wang;G. Suh;Wenjie Xiong;Benjamin Lefaudeux;Brian Knott;M. Annavaram;Hsien-Hsin S. Lee
- 通讯作者:Yongqin Wang;G. Suh;Wenjie Xiong;Benjamin Lefaudeux;Brian Knott;M. Annavaram;Hsien-Hsin S. Lee
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Murali Annavaram其他文献
A privacy mechanism for mobile-based urban traffic monitoring
- DOI:
10.1016/j.pmcj.2014.12.007 - 发表时间:
2015-07-01 - 期刊:
- 影响因子:
- 作者:
Chi Wang;Hua Liu;Kwame-Lante Wright;Bhaskar Krishnamachari;Murali Annavaram - 通讯作者:
Murali Annavaram
Differentially Private Next-Token Prediction of Large Language Models
大型语言模型的差分隐私下一个标记预测
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
James Flemings;Meisam Razaviyayn;Murali Annavaram - 通讯作者:
Murali Annavaram
Murali Annavaram的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Murali Annavaram', 18)}}的其他基金
Student Travel Support for the 2018 International Symposium on Computer Architecture (ISCA)
2018 年计算机体系结构国际研讨会 (ISCA) 学生旅行支持
- 批准号:
1812942 - 财政年份:2018
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
SHF:Small: Accelerating Graph Analytics Through Coordinated Storage, Memory and Computing Advances
SHF:Small:通过协调存储、内存和计算进步加速图形分析
- 批准号:
1719074 - 财政年份:2017
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
SHF:Small: Benchmarking of Transient and Intermittent Errors and Their Application to Microarchitecture
SHF:Small:瞬态和间歇性错误的基准测试及其在微架构中的应用
- 批准号:
1219186 - 财政年份:2012
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
IEEE International Symposium on Workload Characterization (IISWC) Student Subsidy Proposal
IEEE 国际工作负载表征研讨会 (IISWC) 学生资助提案
- 批准号:
1104542 - 财政年份:2011
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
CAREER: From Nonstop-Monitoring to Nano-ISA: An Adaptive Multi-Dimensional Framework for Processor Reliability
职业生涯:从不间断监控到 Nano-ISA:处理器可靠性的自适应多维框架
- 批准号:
0954211 - 财政年份:2010
- 资助金额:
$ 60万 - 项目类别:
Continuing Grant
CSR-PSCE,SM: Trade-offs Between Static Power, Performance and Reliability in Future Chip Multiprocessors
CSR-PSCE,SM:未来芯片多处理器静态功耗、性能和可靠性之间的权衡
- 批准号:
0834799 - 财政年份:2008
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
CSR-PSCE,SM: A Holistic Design Approach to Reliability Using 3D Stacked
CSR-PSCE,SM:使用 3D 堆叠的可靠性整体设计方法
- 批准号:
0834798 - 财政年份:2008
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
CT-ISG: A Game Theoretic Framework for Privacy Preservation in Community-Based Mobile Applications
CT-ISG:基于社区的移动应用程序中隐私保护的博弈论框架
- 批准号:
0831545 - 财政年份:2008
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
相似国自然基金
靶向Treg-FOXP3小分子抑制剂的筛选及其在肺癌免疫治疗中的作用和机制研究
- 批准号:32370966
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
化学小分子激活YAP诱导染色质可塑性促进心脏祖细胞重编程的表观遗传机制研究
- 批准号:82304478
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
靶向小胶质细胞的仿生甘草酸纳米颗粒构建及作用机制研究:脓毒症相关性脑病的治疗新策略
- 批准号:82302422
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
HMGB1/TLR4/Cathepsin B途径介导的小胶质细胞焦亡在新生大鼠缺氧缺血脑病中的作用与机制
- 批准号:82371712
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
小分子无半胱氨酸蛋白调控生防真菌杀虫活性的作用与机理
- 批准号:32372613
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
相似海外基金
NeTS: Small: ML-Driven Online Traffic Analysis at Multi-Terabit Line Rates
NeTS:小型:ML 驱动的多太比特线路速率在线流量分析
- 批准号:
2331111 - 财政年份:2024
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
CSR: Small: Latency-controlled Reduction of Data Center Expenses for Handling Bursty ML Inference Requests
CSR:小:通过延迟控制减少数据中心处理突发 ML 推理请求的费用
- 批准号:
2336886 - 财政年份:2024
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
Implementing a coupled system of integrative ML modeling and data validation for elucidating microglial therapeutic targets in neurodegenerative disease
实施集成机器学习建模和数据验证的耦合系统,以阐明神经退行性疾病中的小胶质细胞治疗靶点
- 批准号:
10699794 - 财政年份:2023
- 资助金额:
$ 60万 - 项目类别:
Collaborative Research: CNS Core: Small: Optimizing Large-Scale Heterogeneous ML Platforms
合作研究:CNS Core:小型:优化大规模异构机器学习平台
- 批准号:
2146909 - 财政年份:2022
- 资助金额:
$ 60万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: Optimizing Large-Scale Heterogeneous ML Platforms
合作研究:CNS Core:小型:优化大规模异构机器学习平台
- 批准号:
2146814 - 财政年份:2022
- 资助金额:
$ 60万 - 项目类别:
Standard Grant