CAREER: Securing Deep Reinforcement Learning

Basic Information

  • Award number:
    2045948
  • Principal investigator:
  • Amount:
    $554,500
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Continuing Grant
  • Fiscal year:
    2021
  • Funding country:
    United States
  • Project period:
    2021-10-01 to 2022-04-30
  • Project status:
    Completed

Project Abstract

Like many other deep learning techniques, deep reinforcement learning is vulnerable to adversarial attacks. In reinforcement learning, an adversarial attack manipulates an agent's sensory observations, confusing it. Recent research has demonstrated that an even more practical attack is possible: instead of implicitly assuming the attacker has full control over the agent's sensory system, the new type of attack deploys an adversarial agent that manipulates the target agent's environment, triggering it to react in undesired ways. Compared with attacks that alter sensory observations, this new attack is more difficult to counteract. First, the methods commonly used to robustify other deep learning techniques (e.g., adversarial training) are no longer suitable for deep reinforcement learning. Second, given a reinforcement learning agent, there are few technical approaches for scrutinizing the agent and unveiling its flaws. This project intends to address these two significant problems by integrating and expanding upon a series of technical approaches from explainable AI, adversarial training, and formal verification in conjunction with program synthesis. The basic idea is first to learn an adversarial agent guided by explainable AI. Using this learned agent, we then unveil the weaknesses of target agents and adversarially train them accordingly. We evaluate the enhanced agents through a robustness check; if a strengthened agent fails the adversary-resistance check, we fall back on formal verification and program synthesis techniques. With this unified solution, reinforcement learning model developers can identify the policy flaws of reinforcement learning agents and effectively remediate their weaknesses. This project will provide a stack of technical solutions for scrutinizing and robustifying deep reinforcement learning.
If successful, the project will significantly advance the field of AI security (adversarial training and adversarial policy learning) and contribute to the field of machine learning (explainable AI and verified AI). In addition, this project has the potential to significantly improve the security of reinforcement learning applications. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
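The pipeline the abstract describes, learning an adversarial agent against a frozen target, adversarially retraining the target, then re-checking its robustness, can be sketched in toy form. The sketch below is purely illustrative and is not the project's actual method: the "environment" is a one-dimensional control game invented for this example, and simple random hill climbing stands in for the deep RL updates.

```python
import random

def rollout(victim_w, adv_w, steps=50, seed=0):
    """Toy two-agent episode: the victim tries to hold state x near 0
    while the adversarial agent pushes x away through the shared environment."""
    rng = random.Random(seed)
    x, victim_return = 0.0, 0.0
    for _ in range(steps):
        x += adv_w * rng.uniform(0.5, 1.0)  # adversary acts in the environment
        x -= victim_w * x                   # victim's corrective action
        victim_return -= x * x              # victim is rewarded for staying near 0
    return victim_return

def improve(score_fn, w, iters=200, step=0.1, seed=1):
    """Random hill climbing as a stand-in for an RL policy update:
    returns a parameter scoring at least as high as the input."""
    rng = random.Random(seed)
    best = score_fn(w)
    for _ in range(iters):
        cand = w + rng.uniform(-step, step)
        s = score_fn(cand)
        if s > best:
            w, best = cand, s
    return w

victim, adversary = 0.1, 0.0
for rnd in range(3):
    # 1) learn an adversarial agent that minimizes the frozen victim's return
    adversary = improve(lambda a: -rollout(victim, a), adversary)
    # 2) adversarially retrain the victim against the learned adversary
    victim = improve(lambda v: rollout(v, adversary), victim)
    # 3) robustness check: how well does the retrained victim now hold up?
    print(rnd, round(rollout(victim, adversary), 3))
```

In the project's actual setting, `rollout` would be episodes in a real environment, `improve` would be a deep RL algorithm, and the fallback to formal verification and program synthesis when the robustness check fails is not modeled here.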

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other Publications by Xinyu Xing

CGRED: class guided random early discarding
This paper is included in the Proceedings of the 30th USENIX Security Symposium.
  • Authors:
    Xian Wu; Wenbo Guo; Hua Wei; Xinyu Xing
  • Corresponding author:
    Xinyu Xing
UCognito: Private Browsing without Tears
ShadowBound: Efficient Heap Memory Protection Through Advanced Metadata Management and Customized Compiler Optimization
Automated Finite State Machine Extraction

Other Grants by Xinyu Xing

Collaborative Research: SaTC: CORE: Small: Towards Label Enrichment and Refinement to Harden Learning-based Security Defenses
  • Award number:
    2055320
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Standard Grant
Collaborative Research: SaTC: CORE: Small: Towards Label Enrichment and Refinement to Harden Learning-based Security Defenses
  • Award number:
    2225225
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Standard Grant
CAREER: Securing Deep Reinforcement Learning
  • Award number:
    2225234
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Continuing Grant
SaTC: CORE: Small: Towards Locating Memory Corruption Vulnerability with Core Dump
  • Award number:
    2219379
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Standard Grant
SaTC: CORE: Small: Collaborative: Towards Facilitating Kernel Vulnerability Reproduction by Fusing Crowd and Machine Generated Data
  • Award number:
    2221122
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Standard Grant
SaTC: CORE: Small: Collaborative: Towards Facilitating Kernel Vulnerability Reproduction by Fusing Crowd and Machine Generated Data
  • Award number:
    1954466
  • Fiscal year:
    2020
  • Funding amount:
    $554,500
  • Award type:
    Standard Grant
SaTC: CORE: Small: Towards Locating Memory Corruption Vulnerability with Core Dump
  • Award number:
    1718459
  • Fiscal year:
    2017
  • Funding amount:
    $554,500
  • Award type:
    Standard Grant

Similar NSFC Grants

Recycling of carrier-regenerable immobilized lipase in biodiesel production from waste oil
  • Award number:
    22369014
  • Year approved:
    2023
  • Funding amount:
    ¥320,000
  • Award type:
    Regional Science Fund Project
Theory and methods for fast ambiguity resolution in inertial/visual/HD-map-aided multi-frequency, multi-constellation PPP under urban occlusion
  • Award number:
    42374016
  • Year approved:
    2023
  • Funding amount:
    ¥520,000
  • Award type:
    General Program
Metabolic flux regulation and mechanisms of microbial CO2 fixation for succinate synthesis
  • Award number:
    22378166
  • Year approved:
    2023
  • Funding amount:
    ¥500,000
  • Award type:
    General Program
Water-exit stability and hydrodynamic load mechanisms of fixed-wing sea-air cross-domain vehicles
  • Award number:
    52371327
  • Year approved:
    2023
  • Funding amount:
    ¥500,000
  • Award type:
    General Program
Design, manufacture, and behavioral mechanisms of superlubricious articulating interfaces and biological fixation interfaces in artificial joints
  • Award number:
    52335004
  • Year approved:
    2023
  • Funding amount:
    ¥2,300,000
  • Award type:
    Key Program

Similar Overseas Grants

Securing the wireless networks with deep learning and intelligent reflecting surfaces
  • Award number:
    572247-2022
  • Fiscal year:
    2022
  • Funding amount:
    $554,500
  • Award type:
    University Undergraduate Student Research Awards
CAREER: Securing Cyberspace: Gaining Deep Insights into the Online Underground Ecosystem
  • Award number:
    2203261
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Continuing Grant
CAREER: Securing Deep Reinforcement Learning
  • Award number:
    2225234
  • Fiscal year:
    2021
  • Funding amount:
    $554,500
  • Award type:
    Continuing Grant
A Daniell cell using a flat diaphragm-fixing device, and further research on its development
  • Award number:
    20H00963
  • Fiscal year:
    2020
  • Funding amount:
    $554,500
  • Award type:
    Grant-in-Aid for Encouragement of Scientists