CAREER: Temporal Causal Reinforcement Learning and Control for Autonomous and Swarm Cyber-Physical Systems

职业:自治和群体网络物理系统的时间因果强化学习和控制

基本信息

  • 批准号:
    2339774
  • 负责人:
  • 金额:
    $ 54.98万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2024
  • 资助国家:
    美国
  • 起止时间:
    2024-03-01 至 2029-02-28
  • 项目状态:
    未结题

项目摘要

Understanding the root cause of behavior is imperative for informed decision-making and preventing ineffective or biased policies. Currently, most AI-based learning and control modules embedded in cyber-physical systems (CPS) rely on statistical correlation rather than causality for decision-making. This not only results in incorrect decisions but also hinders the interpretability of learning, limiting transferability and scalability. This CAREER proposal aims to bridge the gap between causal inference and the growing capabilities of reinforcement learning (RL) in CPS. The proposed methods are transformative to a wide range of CPS applications, enabling more efficient and effective decision-making processes in autonomous and swarm CPS such as self-driving cars, drones, industrial robots, and swarm robots.This NSF CAREER proposal proposes a set of temporal causal RL and control approaches for CPS by leveraging the reasoning capabilities of temporal logics and causal diagrams in single-agent, multi-agent, and swarm system settings. The tools we develop will be implemented on multiple CPS testbeds and integrated with the proposed education plan. The proposed algorithms have the following unique and innovative features. Firstly, we will develop computationally efficient tools that can discover temporal causal knowledge from both observational and interventional data of a CPS in performing RL to improve the sampling efficiency and transferability. Secondly, we will develop multi-agent RL approaches for CPS in cooperative, non-cooperative, and incomplete information stochastic game environments where temporal causal knowledge is discovered in a distributed way for expediting RL. Lastly, we will develop scalable RL-based control methods for swarm systems utilizing temporal causal reasoning over agent-level features and swarm-level features such as densities and generalized moments. The education plan will impact the next generation of CPS and AI engineers and researchers through AI-assisted adaptive and interactive teaching, temporal-logic-based educational games, online interactive educational website design for temporal causal RL, and workshops and webinars with industrial partners.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
了解行为的根本原因对于明智的决策和预防无效或有偏见的政策至关重要。当前,大多数基于AI的学习和控制模块嵌入在网络物理系统中(CPS),依赖于统计相关性而不是因果关系来制定决策。这不仅会导致决策不正确,而且还阻碍了学习的解释性,限制可转移性和可扩展性。这项职业建议旨在弥合因果推断与CPS中增强学习能力(RL)增长能力之间的差距。 The proposed methods are transformative to a wide range of CPS applications, enabling more efficient and effective decision-making processes in autonomous and swarm CPS such as self-driving cars, drones, industrial robots, and swarm robots.This NSF CAREER proposal proposes a set of temporal causal RL and control approaches for CPS by leveraging the reasoning capabilities of temporal logics and causal diagrams in single-agent,多机构和群体系统设置。我们开发的工具将在多个CPS测试床上实施,并与拟议的教育计划集成。所提出的算法具有以下独特和创新的特征。首先,我们将开发计算高效的工具,这些工具可以从CPS的观察数据和介入数据中发现暂时性因果知识,以提高采样效率和可传递性。其次,我们将开发用于合作,非合作性和不完整信息随机游戏环境中CP的多代理RL方法,其中以分布式方式发现了为加快RL的分布式因果知识。最后,我们将利用代理级特征和群体级特征(例如密度和广义瞬间)开发基于RL RL的控制方法,用于群体系统。教育计划将通过AI辅助适应性和互动教学,基于时间逻辑的教育游戏,在线互动式教育网站设计,以及与工业合作伙伴的研讨会和网络研讨会来影响下一代CPS和AI工程师和研究人员,并影响了NSF的法定任务和审查企业的范围。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Zhe Xu其他文献

Dioxin Emission Concentration Forecasting Model for MSWI Process with Random Forest-Based Transfer Learning
基于随机森林的迁移学习的垃圾焚烧过程二恶英排放浓度预测模型
A Compliant Biomimetic Artificial Finger for Anthropomorphic Robotic Hands via 3 D Rapid Prototyping
通过 3D 快速原型制作用于拟人机器人手的顺应仿生人造手指
  • DOI:
  • 发表时间:
    2012
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Zhe Xu;Vikash Kumar;Evangelos A. Theodorou;Y. Matsuoka;E. Todorov
  • 通讯作者:
    E. Todorov
Inefficiency analysis of the scheduling game on limited identical machines with activation costs
有限同机上具有激活成本的调度博弈无效率分析
  • DOI:
    10.1016/j.ipl.2015.10.006
  • 发表时间:
    2016-04
  • 期刊:
  • 影响因子:
    0.5
  • 作者:
    Fang Xie;Yuzhong Zhang;Qingguo Bai;Zhe Xu
  • 通讯作者:
    Zhe Xu
Hierarchical Control for Head-to-Head Autonomous Racing
面对面自动赛车的分层控制
  • DOI:
    10.55417/fr.2024002
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Rishabh Saumil Thakkar;Aryaman Singh Samyal;David Fridovich;Zhe Xu;U. Topcu
  • 通讯作者:
    U. Topcu
A Case of Primary Hepatic Extranodal Marginal Zone B-cell Lymphoma of Mucosa-associated Lymphoid Tissue (MALT) type Treated by Radiofrequency Ablation (RFA) and Literature Review
射频消融(RFA)治疗原发性肝结外边缘区B细胞粘膜相关淋巴组织(MALT)型淋巴瘤一例并文献复习
  • DOI:
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Zhe Xu
  • 通讯作者:
    Zhe Xu

Zhe Xu的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Zhe Xu', 18)}}的其他基金

CPS: Small: Neuro-Symbolic Learning and Control with High-Level Knowledge Inference
CPS:小型:具有高级知识推理的神经符号学习和控制
  • 批准号:
    2304863
  • 财政年份:
    2023
  • 资助金额:
    $ 54.98万
  • 项目类别:
    Standard Grant

相似国自然基金

拷贝数突变致良性癫痫伴中央颞区棘波语言障碍的认知心理学及神经影 像学研究
  • 批准号:
    82371201
  • 批准年份:
    2023
  • 资助金额:
    47 万元
  • 项目类别:
    面上项目
光遗传学调控中缝正中核五羟色胺能神经元改善颞叶癫痫焦虑共病的机制研究
  • 批准号:
    82371450
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目
m6A识别蛋白YTHDC2通过星形胶质细胞LCN2调控IKK/NF-κB/TNFα轴在颞叶癫痫的作用及机制研究
  • 批准号:
    82371463
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目
微针递送装载VEGF/PDGF-BB基因的腺相关病毒促进烟雾病高龄小鼠颞肌贴敷术后侧支血管生成的研究
  • 批准号:
    82301480
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
基于“额叶保持,颞叶损伤”假说的成功认知老化与轻度认知障碍脑功能影像机制研究
  • 批准号:
    32300924
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Linking basal forebrain and entorhinal cortex vulnerability to preclinical Alzheimer's disease
将基底前脑和内嗅皮层的脆弱性与临床前阿尔茨海默病联系起来
  • 批准号:
    10506801
  • 财政年份:
    2022
  • 资助金额:
    $ 54.98万
  • 项目类别:
Linking basal forebrain and entorhinal cortex vulnerability to preclinical Alzheimer's disease
将基底前脑和内嗅皮层的脆弱性与临床前阿尔茨海默病联系起来
  • 批准号:
    10677886
  • 财政年份:
    2022
  • 资助金额:
    $ 54.98万
  • 项目类别:
Neural mechanisms of eye gaze perception in schizophrenia
精神分裂症眼睛注视感知的神经机制
  • 批准号:
    9918764
  • 财政年份:
    2016
  • 资助金额:
    $ 54.98万
  • 项目类别:
Neural mechanisms of eye gaze perception in schizophrenia
精神分裂症眼睛注视感知的神经机制
  • 批准号:
    9178692
  • 财政年份:
    2016
  • 资助金额:
    $ 54.98万
  • 项目类别:
Neural mechanisms of eye gaze perception in schizophrenia
精神分裂症眼睛注视感知的神经机制
  • 批准号:
    9301044
  • 财政年份:
    2016
  • 资助金额:
    $ 54.98万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了