III: Small: Towards the Foundations of Training Deep Neural Networks: New Theory and Algorithms

III:小:迈向训练深度神经网络的基础:新理论和算法

基本信息

  • 批准号:
    2008981
  • 负责人:
  • 金额:
    $ 50万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-10-01 至 2024-09-30
  • 项目状态:
    已结题

项目摘要

Deep learning has achieved tremendous successes in the past decade. Despite these empirical successes, the theoretical understanding of deep learning is still largely falling behind. There exists a huge gap between the empirical successes of deep learning and conventional optimization and machine learning theories. This project aims to bridge this gap by establishing the theoretical foundations of deep learning to understand why and how it works, and use this theory to develop new models and algorithms. The expected outcome of this project includes new theories and the state-of-the-art approaches for deep learning. The project will push the frontier of deep learning and train next-generation researchers and practitioners in artificial intelligence. Research demonstrations and lab tours will be given to K-12 school students by showing the wide range of applications of AI and their connection to society, to motivate them to pursue a STEM discipline.This project consists of two synergistic research thrusts: (1) understanding the optimization dynamics of training algorithms such as stochastic gradient descent for deep learning models, and deriving algorithm-dependent generalization error bounds to assess their generalization performance; and (2) developing a new suite of faster training algorithms for deep learning, as well as principled neural architecture search algorithms guided by the generalization error bounds to design better neural network models. To evaluate the developed approaches, both theoretical analyses and extensive experimental evaluations will be performed on real-world benchmarks including but not limited to image classification and natural language processing. The open source software and course materials developed in this project will be made publicly available to the broader community, to help engineers and scientists better understand and apply deep learning.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在过去的十年中,深度学习取得了巨大的成功。尽管取得了这些经验的成功,但对深度学习的理论理解仍在很大程度上落后。深度学习的经验成功与传统优化的经验成功与机器学习理论之间存在巨大差距。该项目旨在通过建立深度学习的理论基础来了解其原因和方式,并使用该理论来开发新的模型和算法来弥合这一差距。该项目的预期结果包括新理论和深度学习的最新方法。该项目将推动人工智能领域的深度学习和培训下一代研究人员和从业人员的前沿。 Research demonstrations and lab tours will be given to K-12 school students by showing the wide range of applications of AI and their connection to society, to motivate them to pursue a STEM discipline.This project consists of two synergistic research thrusts: (1) understanding the optimization dynamics of training algorithms such as stochastic gradient descent for deep learning models, and deriving algorithm-dependent generalization error bounds to assess their generalization performance; (2)为深度学习开发了一套新的更快的培训算法,以及以概括误差界限为指导的原则性神经体系结构搜索算法,以设计更好的神经网络模型。为了评估开发的方法,将对包括但不限于图像分类和自然语言处理在内的实际基准进行理论分析和广泛的实验评估。该项目中开发的开源软件和课程材料将公开向更广泛的社区公开使用,以帮助工程师和科学家更好地理解和应用深入学习。该奖项反映了NSF的法定任务,并被认为是通过基金会的知识分子的智力优点和更广泛影响的审查标准来通过评估来获得支持的。

项目成果

期刊论文数量(38)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Provable Robustness of Adversarial Training for Learning Halfspaces with Noise
The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift
  • DOI:
    10.48550/arxiv.2208.01857
  • 发表时间:
    2022-08
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Jingfeng Wu;Difan Zou;V. Braverman;Quanquan Gu;S. Kakade
  • 通讯作者:
    Jingfeng Wu;Difan Zou;V. Braverman;Quanquan Gu;S. Kakade
How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks?
  • DOI:
  • 发表时间:
    2019-11
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Zixiang Chen;Yuan Cao;Difan Zou;Quanquan Gu
  • 通讯作者:
    Zixiang Chen;Yuan Cao;Difan Zou;Quanquan Gu
Proxy Convexity: A Unified Framework for the Analysis of Neural Networks Trained by Gradient Descent
  • DOI:
  • 发表时间:
    2021-06
  • 期刊:
  • 影响因子:
    20.6
  • 作者:
    Spencer Frei;Quanquan Gu
  • 通讯作者:
    Spencer Frei;Quanquan Gu
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
理解深度学习中的专家混合层
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Quanquan Gu其他文献

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
来自对抗性反馈的上下文决斗强盗的近乎最优算法
  • DOI:
    10.48550/arxiv.2404.10776
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Qiwei Di;Jiafan He;Quanquan Gu
  • 通讯作者:
    Quanquan Gu
Different patterns of gray matter density in early- and middle-late-onset Parkinson’s disease a voxel-based morphometry study
早发和中晚发帕金森病灰质密度的不同模式:基于体素的形态测量研究
  • DOI:
    10.1007/s11682-017-9745-4
  • 发表时间:
    2017
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Min Xuan;Xiaojun Guan;Peiyu Huang;Zhujing Shen;Quanquan Gu;Xinfeng Yu;Xiaojun Xu;Wei Luo;Minming Zhang
  • 通讯作者:
    Minming Zhang
Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent
使用随机梯度下降匹配 k 稀疏奇偶校验问题的统计查询下界
  • DOI:
    10.48550/arxiv.2404.12376
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Yiwen Kou;Zixiang Chen;Quanquan Gu;S. Kakade
  • 通讯作者:
    S. Kakade
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
用于文本到图像生成的扩散模型的自玩微调
  • DOI:
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Huizhuo Yuan;Zixiang Chen;Kaixuan Ji;Quanquan Gu
  • 通讯作者:
    Quanquan Gu
Iterative Teacher-Aware Learning
迭代式教师意识学习
  • DOI:
  • 发表时间:
    2021
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Luyao Yuan;Dongruo Zhou;Junhong Shen;Jingdong Gao;Jeffrey L. Chen;Quanquan Gu;Y. Wu;Song
  • 通讯作者:
    Song

Quanquan Gu的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Quanquan Gu', 18)}}的其他基金

Collaborative Research: Towards the Foundation of Approximate Sampling-Based Exploration in Sequential Decision Making
协作研究:为顺序决策中基于近似采样的探索奠定基础
  • 批准号:
    2323113
  • 财政年份:
    2023
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
CPS: Medium: Collaborative Research: Provably Safe and Robust Multi-Agent Reinforcement Learning with Applications in Urban Air Mobility
CPS:中:协作研究:可证明安全且鲁棒的多智能体强化学习及其在城市空中交通中的应用
  • 批准号:
    2312094
  • 财政年份:
    2023
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
CIF: Small: Collaborative Research: Rank Aggregation with Heterogeneous Information Sources: Efficient Algorithms and Fundamental Limits
CIF:小型:协作研究:异构信息源的排名聚合:高效算法和基本限制
  • 批准号:
    1911168
  • 财政年份:
    2019
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
III: Small: Collaborative Research: High-Dimensional Machine Learning Methods for Personalized Cancer Genomics
III:小:协作研究:个性化癌症基因组学的高维机器学习方法
  • 批准号:
    1903202
  • 财政年份:
    2018
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
BIGDATA: F: Collaborative Research: Taming Big Networks via Embedding
BIGDATA:F:协作研究:通过嵌入驯服大网络
  • 批准号:
    1855099
  • 财政年份:
    2018
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
CAREER: Scaling Up Knowledge Discovery in High-Dimensional Data Via Nonconvex Statistical Optimization
职业:通过非凸统计优化扩大高维数据中的知识发现
  • 批准号:
    1906169
  • 财政年份:
    2018
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
BIGDATA: F: Collaborative Research: Taming Big Networks via Embedding
BIGDATA:F:协作研究:通过嵌入驯服大网络
  • 批准号:
    1741342
  • 财政年份:
    2018
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
III: Small: Collaborative Learning with Incomplete and Noisy Knowledge
III:小:知识不完整且有噪音的协作学习
  • 批准号:
    1904183
  • 财政年份:
    2018
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
III: Small: Collaborative Research: High-Dimensional Machine Learning Methods for Personalized Cancer Genomics
III:小:协作研究:个性化癌症基因组学的高维机器学习方法
  • 批准号:
    1717206
  • 财政年份:
    2017
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
CAREER: Scaling Up Knowledge Discovery in High-Dimensional Data Via Nonconvex Statistical Optimization
职业:通过非凸统计优化扩大高维数据中的知识发现
  • 批准号:
    1652539
  • 财政年份:
    2017
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant

相似国自然基金

TIM-4调控小胶质细胞向吞噬型转化促进蛛网膜下腔出血后血液清除的作用及机制
  • 批准号:
    82301485
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
EGFR突变的肺腺癌向小细胞肺癌转变的分子机制及干预策略
  • 批准号:
    82341002
  • 批准年份:
    2023
  • 资助金额:
    200 万元
  • 项目类别:
    专项基金项目
巨噬细胞A20调控小管上皮细胞胞葬在AKI向CKD转变中的作用机制探讨
  • 批准号:
    82270728
  • 批准年份:
    2022
  • 资助金额:
    50 万元
  • 项目类别:
    面上项目
小胶质细胞外泌体调控卒中后星形胶质细胞亚型向神经干细胞转化的机制研究
  • 批准号:
    82271320
  • 批准年份:
    2022
  • 资助金额:
    52 万元
  • 项目类别:
    面上项目
小尺度场向电流时空分布特征及与沉降粒子关系的研究
  • 批准号:
    42174191
  • 批准年份:
    2021
  • 资助金额:
    59.00 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: IIS-III: Small Towards Fair Outlier Detection
协作研究:IIS-III:小到公平的异常值检测
  • 批准号:
    2310481
  • 财政年份:
    2023
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
III: Small: A New Machine Learning Paradigm Towards Effective yet Efficient Foundation Graph Learning Models
III:小型:一种新的机器学习范式,实现有效且高效的基础图学习模型
  • 批准号:
    2321504
  • 财政年份:
    2023
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
III: Small: Towards Highly Accurate Map Services
III:小:迈向高精度地图服务
  • 批准号:
    2203553
  • 财政年份:
    2022
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
III: Small: Towards Explainable Personalization
III:小:迈向可解释的个性化
  • 批准号:
    2007492
  • 财政年份:
    2020
  • 资助金额:
    $ 50万
  • 项目类别:
    Continuing Grant
III: Small: Towards Explainable Recommendation Systems
III:小:迈向可解释的推荐系统
  • 批准号:
    1910154
  • 财政年份:
    2019
  • 资助金额:
    $ 50万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了