Collaborative Research: SLES: Verifying and Enforcing Safety Constraints in AI-based Sequential Generation
合作研究:SLES:验证和执行基于人工智能的顺序生成中的安全约束
基本信息
- 批准号:2331967
- 负责人:
- 金额:$ 26万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-10-01 至 2026-09-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Artificial intelligence (AI) has achieved transformative impacts on various complex real-world challenges. Among its applications, sequential data are prevalent in many critical usages of AI when it directly engages with its users. Self-driving cars rely on AI to process sequences of sensor data from cameras and radars, and make a sequence of real-time decisions to ensure safe driving. Healthcare monitoring systems use AI to analyze sequences of patient health data, such as blood pressure, heart rate, and others, to detect anomalies and predict potential health issues. Chatbots utilize AI to understand natural language and generate safe, fair, and appropriate text responses as sequences of words and sentences. The sequential data produced by AI make its behavior hard to characterize because of the complex dependencies within the sequence, and a careless application of AI in these scenarios may lead to harmful consequences, such as a collision of an autonomous vehicle or the generation of biased or toxic texts. This project aims to study the safety of AI under scenarios with sequential data, provide assurance for its behavior in mission-critical environments, and ensure AI-based sequential generation can adhere to safety constraints and social norms. Ultimately, this research will help with reducing unexpected AI failures, preventing bias and discrimination in AI technologies, aligning AI systems with human values and societal norms, and building up public trust for AI-enabled applications.The technical contributions of this project consist of three thrusts. The first thrust develops a formal verification framework for assuring the safety of AI models for sequential generation tasks with rigorous mathematical guarantees. It includes a series of innovative verification algorithms for bound propagation and branch-and-bound for general non-linear functions involved in sequential generation models. These new verification methods will be integrated into the alpha-beta-CROWN neural network verifier, a well-known open-source toolbox developed by investigators. The second thrust involves training and inference algorithms that ensure sequential generation models comply with specified safety constraints, with a unique probabilistic framework that decomposes a safety constraint into action-level components and enforces them at each generation step. This approach can be integrated with model training to improve the safety performance of sequential generation models using posterior regularization techniques. Lastly, the third thrust aims to integrate the formal verification and constrained generation components above and apply them to three important real-world applications: safety of text generation, safety and stability of controlled systems, and robust AI-generated text detectors. This project will also result in tools to the broader AI community, including the alpha-beta-CROWN neural network verifier, and the shared data and benchmarks developed to evaluate the safety of sequential generation models.This project is supported by a partnership with the NSF and Open Philanthropy.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
人工智能 (AI) 对各种复杂的现实世界挑战产生了变革性影响。在其应用中,当人工智能直接与用户互动时,序列数据在人工智能的许多关键用途中普遍存在。自动驾驶汽车依靠人工智能处理来自摄像头和雷达的传感器数据序列,并做出一系列实时决策以确保安全驾驶。医疗保健监测系统使用人工智能来分析患者健康数据序列,例如血压、心率等,以检测异常并预测潜在的健康问题。聊天机器人利用人工智能来理解自然语言,并以单词和句子序列的形式生成安全、公平和适当的文本响应。由于序列中存在复杂的依赖关系,人工智能产生的序列数据使其行为难以表征,而人工智能在这些场景中的粗心应用可能会导致有害的后果,例如自动驾驶车辆的碰撞或产生有偏见的或有毒的文字。该项目旨在研究人工智能在序列数据场景下的安全性,为其在关键任务环境中的行为提供保证,并确保基于人工智能的序列生成能够遵守安全约束和社会规范。最终,这项研究将有助于减少意外的人工智能故障,防止人工智能技术中的偏见和歧视,使人工智能系统与人类价值观和社会规范保持一致,并建立公众对人工智能应用程序的信任。该项目的技术贡献包括三个推力。第一个重点是开发一个正式的验证框架,以确保具有严格数学保证的顺序生成任务的人工智能模型的安全性。它包括一系列创新的验证算法,用于顺序生成模型中涉及的一般非线性函数的边界传播和分支定界。这些新的验证方法将被集成到 alpha-beta-CROWN 神经网络验证器中,这是研究人员开发的著名开源工具箱。第二个重点涉及训练和推理算法,确保顺序生成模型符合指定的安全约束,并使用独特的概率框架将安全约束分解为操作级组件,并在每个生成步骤中强制执行。这种方法可以与模型训练相结合,以提高使用后验正则化技术的顺序生成模型的安全性能。最后,第三个重点旨在集成上述形式验证和约束生成组件,并将其应用于三个重要的现实应用:文本生成的安全性、受控系统的安全性和稳定性以及强大的人工智能生成文本检测器。该项目还将为更广泛的人工智能社区提供工具,包括 alpha-beta-CROWN 神经网络验证器,以及为评估顺序生成模型的安全性而开发的共享数据和基准。该项目得到了与 NSF 合作的支持该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Huan Zhang其他文献
Decoration of heparin and bovine serum albumin on polysulfone membrane assisted via polydopamine strategy for hemodialysis
聚多巴胺策略辅助下聚砜膜上肝素和牛血清白蛋白的修饰用于血液透析
- DOI:
10.1080/09205063.2016.1169479 - 发表时间:
2016-04-22 - 期刊:
- 影响因子:0
- 作者:
B. Xie;Ranran Zhang;Huan Zhang;Anxiu Xu;Yi Deng;Yalin Lv;F. Deng;Shicheng Wei - 通讯作者:
Shicheng Wei
Experimental and numerical study on the heat transfer performance of the radiant floor heating condenser with composite phase change material
复合相变材料地板辐射采暖冷凝器传热性能实验与数值研究
- DOI:
10.1016/j.applthermaleng.2022.118749 - 发表时间:
2022-06-01 - 期刊:
- 影响因子:6.4
- 作者:
T. Jiang;Chenxiao Zheng;Shijun You;Huan Zhang;Zhenjing Wu;Yaran Wang;Shen Wei - 通讯作者:
Shen Wei
Thermal properties of lauric acid/high‐density polyethylene form‐stabilized phase change materials doped with hybrid carbon nanofillers
掺杂杂化碳纳米填料的月桂酸/高密度聚乙烯稳定相变材料的热性能
- DOI:
10.1002/pc.27242 - 发表时间:
2023-01-26 - 期刊:
- 影响因子:5.2
- 作者:
Huan Zhang;Tingwei Fu;Wenze Wang;G. Fang - 通讯作者:
G. Fang
A Vertically Modularized Reconfigurable Wireless Power Transfer System: Architecture, Modeling, and Design
垂直模块化可重构无线功率传输系统:架构、建模和设计
- DOI:
10.1109/tpel.2022.3208315 - 发表时间:
2023-02-01 - 期刊:
- 影响因子:6.7
- 作者:
Huan Zhang;Yaoxia Shao;Ning Kang;Haojun Qin;Chengbin Ma;Meilin Liu - 通讯作者:
Meilin Liu
BronchusNet: Region and Structure Prior Embedded Representation Learning for Bronchus Segmentation and Classification
BronchusNet:用于支气管分割和分类的区域和结构先验嵌入式表示学习
- DOI:
10.48550/arxiv.2205.06947 - 发表时间:
2022-05-14 - 期刊:
- 影响因子:0
- 作者:
Wenhao Huang;Haifan Gong;Huan Zhang;Yu Wang;Haofeng Li;Guanbin Li;H. Shen - 通讯作者:
H. Shen
Huan Zhang的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
相似国自然基金
脂质合成调控细胞膜稳态介导贝莱斯芽孢杆菌耐受乙醇胁迫机制研究
- 批准号:32372284
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
哈茨木霉正向突变株协同贝莱斯芽孢杆菌SQR9发挥植物促生与生防功能的机制研究
- 批准号:32302679
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
mtrB基因调控色氨酸合成介导贝莱斯芽孢杆菌耐受乙醇胁迫的机制研究
- 批准号:32302023
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
贝莱斯芽孢杆菌VII型分泌系统分泌蛋白YukE介导的根系铁泄漏促进其根际定殖的机制研究
- 批准号:32370135
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
贝莱斯芽胞杆菌调控根际菌根真菌Sebacina vermifera协同促进金线莲生长的机制研究
- 批准号:32270296
- 批准年份:2022
- 资助金额:54 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: SLES: Guaranteed Tubes for Safe Learning across Autonomy Architectures
合作研究:SLES:跨自治架构安全学习的保证管
- 批准号:
2331879 - 财政年份:2024
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Guaranteed Tubes for Safe Learning across Autonomy Architectures
合作研究:SLES:跨自治架构安全学习的保证管
- 批准号:
2331878 - 财政年份:2024
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Bridging offline design and online adaptation in safe learning-enabled systems
协作研究:SLES:在安全的学习系统中桥接离线设计和在线适应
- 批准号:
2331880 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Safety under Distributional Shift in Learning-Enabled Power Systems
合作研究:SLES:学习型电力系统分配转变下的安全性
- 批准号:
2331776 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant
Collaborative Research: SLES: Safe Distributional-Reinforcement Learning-Enabled Systems: Theories, Algorithms, and Experiments
协作研究:SLES:安全的分布式强化学习系统:理论、算法和实验
- 批准号:
2331782 - 财政年份:2023
- 资助金额:
$ 26万 - 项目类别:
Standard Grant