CIF: Medium: Collaborative Research: Theory of Optimization Geometry and Algorithms for Neural Networks
CIF:媒介:协作研究:神经网络优化几何理论和算法
基本信息
- 批准号:1900145
- 负责人:
- 金额:$ 40万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-10-01 至 2024-09-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Deep learning has attracted a significant amount of interest in recent years due to its widespread applicability in computer vision, artificial intelligence and natural language processing, alongside recent strides in autonomous driving. The theoretical underpinnings behind such success, however, remain elusive to a large extent, hindering its further adoption in other applications. This project aims to advance the theoretical foundations of training neural networks in terms of optimization landscape and algorithmic efficacy, which in turn should have a measurable impact on the practice of deep learning by providing guiding principles for network design, algorithm selection, hyperparameter tuning, and adversarial training. This project adopts an interdisciplinary approach fusing ideas from machine learning, optimization, statistical signal processing, high-dimensional statistics, nonparametric statistics, and information theory. This project will likewise develop courses and tutorials on theoretical foundations of large-scale machine learning and provide extensive training opportunities for students at all levels.This project aims to develop a comprehensive theory to characterize the optimization landscape and geometry of loss functions and algorithmic regularizations of major neural network training problems, and explore how the network architecture---including depth, width, and activation functions---affect these properties, thus providing guidelines for the design of algorithms to train these networks more efficiently with theoretical performance guarantees. The project will explore the geometric properties and their impact on the optimization performance in training multi-layer neural networks, auto-encoders, generative adversarial networks, and adversarial training involving non-convex and saddle-point problems.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
近年来,由于其在计算机视觉,人工智能和自然语言处理中的广泛适用性以及最近在自主驾驶方面的进步,深度学习引起了极大的兴趣。但是,这种成功背后的理论基础在很大程度上仍然难以捉摸,阻碍了其在其他应用中的进一步采用。该项目旨在从优化景观和算法效力方面推进训练神经网络的理论基础,这反过来又应通过为网络设计,算法选择,超级参数调整和对抗性培训提供指导性的指导原理来对深度学习实践产生可衡量的影响。该项目采用了一种跨学科的方法,从机器学习,优化,统计信号处理,高维统计,非参数统计和信息理论融合了思想。 This project will likewise develop courses and tutorials on theoretical foundations of large-scale machine learning and provide extensive training opportunities for students at all levels.This project aims to develop a comprehensive theory to characterize the optimization landscape and geometry of loss functions and algorithmic regularizations of major neural network training problems, and explore how the network architecture---including depth, width, and activation functions---affect these properties, thus providing设计算法设计指南,可以通过理论性能保证更有效地培训这些网络。该项目将探索几何特性及其对培训多层神经网络,自动编码器,生成对抗网络以及涉及非convex和鞍点问题的对抗性培训的影响的影响。该奖项反映了NSF的法定任务,并通过评估基金会的范围,反映了NSF的法定任务,并已被评估了基金会的范围。
项目成果
期刊论文数量(26)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms
- DOI:
- 发表时间:2020-04
- 期刊:
- 影响因子:0
- 作者:Tengyu Xu;Zhe Wang-;Yingbin Liang
- 通讯作者:Tengyu Xu;Zhe Wang-;Yingbin Liang
Deterministic policy gradient: Convergence analysis
- DOI:
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Huaqing Xiong;Tengyu Xu;Lin Zhao;Yingbin Liang;Wei Zhang
- 通讯作者:Huaqing Xiong;Tengyu Xu;Lin Zhao;Yingbin Liang;Wei Zhang
History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms
- DOI:
- 发表时间:2019-10
- 期刊:
- 影响因子:0
- 作者:Kaiyi Ji;Zhe Wang-;Bowen Weng;Yi Zhou;W. Zhang;Yingbin Liang
- 通讯作者:Kaiyi Ji;Zhe Wang-;Bowen Weng;Yi Zhou;W. Zhang;Yingbin Liang
Convergence of Meta-Learning with Task-Specific Adaptation over Partial Parameters
- DOI:
- 发表时间:2020-06
- 期刊:
- 影响因子:0
- 作者:Kaiyi Ji;J. Lee;Yingbin Liang;H. Poor
- 通讯作者:Kaiyi Ji;J. Lee;Yingbin Liang;H. Poor
Generalized-Smooth Nonconvex Optimization is As Efficient As Smooth Nonconvex Optimization
- DOI:
- 发表时间:2023-03
- 期刊:
- 影响因子:0
- 作者:Ziyi Chen;Yi Zhou;Yingbin Liang;Zhaosong Lu
- 通讯作者:Ziyi Chen;Yi Zhou;Yingbin Liang;Zhaosong Lu
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Yingbin Liang其他文献
On the Equivalence of Two Achievable Regions for the Broadcast Channel
广播频道两个可达到区域的等效性
- DOI:
10.1109/tit.2010.2090236 - 发表时间:
2011 - 期刊:
- 影响因子:2.5
- 作者:
Yingbin Liang;G. Kramer;H. Poor - 通讯作者:
H. Poor
Capacity bounds for a class of cognitive interference channels with state
一类具有状态的认知干扰信道的容量界限
- DOI:
10.1109/allerton.2011.6120223 - 发表时间:
2011 - 期刊:
- 影响因子:0
- 作者:
Ruchen Duan;Yingbin Liang - 通讯作者:
Yingbin Liang
A New Perspective of Proximal Gradient Algorithms
近端梯度算法的新视角
- DOI:
- 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Yi Zhou;Yingbin Liang;Lixin Shen - 通讯作者:
Lixin Shen
State-Dependent Gaussian Interference Channels: Can State Be Fully Canceled?
状态相关的高斯干扰通道:状态可以完全取消吗?
- DOI:
10.1109/tit.2016.2530665 - 发表时间:
2016 - 期刊:
- 影响因子:2.5
- 作者:
Ruchen Duan;Yingbin Liang;S. Shamai - 通讯作者:
S. Shamai
A note on inexact gradient and Hessian conditions for cubic regularized Newton's method
关于三次正则牛顿法的不精确梯度和 Hessian 条件的说明
- DOI:
10.1016/j.orl.2019.01.009 - 发表时间:
2019 - 期刊:
- 影响因子:0
- 作者:
Zhe Wang;Yi Zhou;Yingbin Liang;Guanghui Lan - 通讯作者:
Guanghui Lan
Yingbin Liang的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Yingbin Liang', 18)}}的其他基金
RINGS: A Deep Reinforcement Learning Enabled Large-scale UAV Network with Distributed Navigation, Mobility Control, and Resilience
RINGS:深度强化学习支持的大规模无人机网络,具有分布式导航、移动控制和弹性
- 批准号:
2148253 - 财政年份:2022
- 资助金额:
$ 40万 - 项目类别:
Continuing Grant
Collaborative Research: CCSS: Learning to Optimize: From New Algorithms to New Theory
合作研究:CCSS:学习优化:从新算法到新理论
- 批准号:
2113860 - 财政年份:2021
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
Collaborative Research: SCALE MoDL: Adaptivity of Deep Neural Networks
合作研究:SCALE MoDL:深度神经网络的适应性
- 批准号:
2134145 - 财政年份:2021
- 资助金额:
$ 40万 - 项目类别:
Continuing Grant
CIF: Small: Collaborative Research: Acceleration Algorithms for Large-scale Nonconvex Optimization
CIF:小型:协作研究:大规模非凸优化的加速算法
- 批准号:
1909291 - 财政年份:2019
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
CIF: Small: Collaborative Research: Network Event Detection with Multistream Observations
CIF:小型:协作研究:通过多流观察进行网络事件检测
- 批准号:
1801855 - 财政年份:2017
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
CIF: Medium: Collaborative Research: Nonconvex Optimization for High-Dimensional Signal Estimation: Theory and Fast Algorithms
CIF:中:协作研究:高维信号估计的非凸优化:理论和快速算法
- 批准号:
1761506 - 财政年份:2017
- 资助金额:
$ 40万 - 项目类别:
Continuing Grant
CIF: Medium: Collaborative Research: Nonconvex Optimization for High-Dimensional Signal Estimation: Theory and Fast Algorithms
CIF:中:协作研究:高维信号估计的非凸优化:理论和快速算法
- 批准号:
1704169 - 财政年份:2017
- 资助金额:
$ 40万 - 项目类别:
Continuing Grant
Management of Mobile Phone Sensing via Sparse Learning
通过稀疏学习管理手机传感
- 批准号:
1818904 - 财政年份:2017
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
CIF: Small: Collaborative Research: Secret Key Generation Under Resource Constraints
CIF:小型:协作研究:资源限制下的密钥生成
- 批准号:
1801846 - 财政年份:2017
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
CIF: Small: Collaborative Research: Secret Key Generation Under Resource Constraints
CIF:小型:协作研究:资源限制下的密钥生成
- 批准号:
1618127 - 财政年份:2016
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
相似国自然基金
复合低维拓扑材料中等离激元增强光学响应的研究
- 批准号:12374288
- 批准年份:2023
- 资助金额:52 万元
- 项目类别:面上项目
基于管理市场和干预分工视角的消失中等企业:特征事实、内在机制和优化路径
- 批准号:72374217
- 批准年份:2023
- 资助金额:41.00 万元
- 项目类别:面上项目
托卡马克偏滤器中等离子体的多尺度算法与数值模拟研究
- 批准号:12371432
- 批准年份:2023
- 资助金额:43.5 万元
- 项目类别:面上项目
中等质量黑洞附近的暗物质分布及其IMRI系统引力波回波探测
- 批准号:12365008
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
中等垂直风切变下非对称型热带气旋快速增强的物理机制研究
- 批准号:42305004
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Collaborative Research: CIF: Medium: Snapshot Computational Imaging with Metaoptics
合作研究:CIF:Medium:Metaoptics 快照计算成像
- 批准号:
2403122 - 财政年份:2024
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
Collaborative Research: CIF-Medium: Privacy-preserving Machine Learning on Graphs
合作研究:CIF-Medium:图上的隐私保护机器学习
- 批准号:
2402815 - 财政年份:2024
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
Collaborative Research: CIF-Medium: Privacy-preserving Machine Learning on Graphs
合作研究:CIF-Medium:图上的隐私保护机器学习
- 批准号:
2402817 - 财政年份:2024
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
Collaborative Research: CIF-Medium: Privacy-preserving Machine Learning on Graphs
合作研究:CIF-Medium:图上的隐私保护机器学习
- 批准号:
2402816 - 财政年份:2024
- 资助金额:
$ 40万 - 项目类别:
Standard Grant
Collaborative Research: CIF:Medium:Theoretical Foundations of Compositional Learning in Transformer Models
合作研究:CIF:Medium:Transformer 模型中组合学习的理论基础
- 批准号:
2403074 - 财政年份:2024
- 资助金额:
$ 40万 - 项目类别:
Standard Grant