S&AS: FND: COLLAB: Learning from Stories: Practical Value Alignment and Taskability for Autonomous Systems
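Basic Information
- Award number: 1849262
- Principal investigator:
- Amount: $308,700
- Host institution:
- Host institution country: United States
- Project category: Standard Grant
- Fiscal year: 2019
- Funding country: United States
- Period: 2019-06-01 to 2023-05-31
- Project status: Completed
- Source:
- Keywords: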
Project Abstract
In the near future we are likely to see increasingly capable autonomous systems operating in proximity to humans and immersed in society. As these systems become more sophisticated, they will interact increasingly with humans. With this increased human-agent interaction comes an increased obligation to ensure that autonomous systems do not cause even unintentional harm to a human. Creating systems that cannot intentionally or unintentionally harm humans is not an easy task. This is because there are infinitely many undesirable outcomes that can be achieved in an open world, making it impossible to instruct these systems to avoid each one. If the desired behavior cannot be directly specified, then it must be learned. Past approaches to learning these types of behaviors have focused on learning from human examples, but these methods are unlikely to scale. This research uses natural language explanations of behavior as a scalable alternative for training autonomous agents for safe operation. Naturalistic descriptions contain vast amounts of information about sociocultural norms, which makes them rich sources for such training. Enabling systems to better understand and learn from such descriptions will enable human operators to more naturally specify goals or tasks for the agent to complete.
This research explores the concept of learning via natural language descriptions of desired behavior. This technique uses procedural knowledge contained in natural language explanations to help train autonomous agents. Concretely, this approach learns utility functions that can be used to guide autonomous agents towards behaviors that are aligned with the description used for training. To accomplish this, researchers will create computational models capable of extracting both knowledge about sociocultural norms and procedural knowledge from naturally occurring corpora. These models will then be used to create behavior policies that are both aligned with sociocultural norms and procedurally plausible. To further ensure that these models can be practically deployed, researchers will enable their models to incorporate a "human in the loop" to provide online feedback about the quality of these learned behavior policies in terms of their social acceptability and appropriateness. Safeguards will also be investigated to protect the learned behavior policies against the effects of adversarial or malicious training examples.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project Outcomes
Journal articles (5)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Fabula Entropy Indexing: Objective Measures of Story Coherence
- DOI:
- Published: 2021-01
- Journal:
- Impact factor: 0
- Authors: Castricato, Louis; Frazier, Spencer; Balloch, Jonathan; Riedl, Mark
- Corresponding author: Riedl, Mark
Learning Norms from Stories: A Prior for Value Aligned Agents
- DOI:
- Published: 2020-01
- Journal:
- Impact factor: 0
- Authors: Frazier, Spencer; Al Nahian, Md Sultan; Riedl, Mark; Harrison, Brent
- Corresponding author: Harrison, Brent
Reducing Non-Normative Text Generation from Language Models
- DOI: 10.18653/v1/2020.inlg-1.43
- Published: 2020-11-01
- Journal:
- Impact factor: 23.6
- Authors: Xiangyu Peng; Siyan Li; Spencer Frazier; Mark O. Riedl
- Corresponding author: Mark O. Riedl
Tell Me A Story Like I'm Five: Story Generation via Question Answering
- DOI:
- Published: 2021-01
- Journal:
- Impact factor: 0
- Authors: Castricato, Louis; Frazier, Spencer; Balloch, Jonathan; Riedl, Mark
- Corresponding author: Riedl, Mark
Playing Text-Based Games with Common Sense
- DOI:
- Published: 2020-01
- Journal:
- Impact factor: 0
- Authors: Dambekodi, Sahith; Frazier, Spencer; Ammanabrolu, Prithviraj; Riedl, Mark
- Corresponding author: Riedl, Mark
Other publications by Mark Riedl
Creating Suspenseful Stories: Iterative Planning with Large Language Models
- DOI: 10.48550/arxiv.2402.17119
- Published: 2024-02-27
- Journal:
- Impact factor: 0
- Authors: Kaige Xie; Mark Riedl
- Corresponding author: Mark Riedl
Other grants by Mark Riedl
I-Corps: Aging in Place with Artificial Intelligence-Powered Augmented Reality
- Award number: 2406592
- Fiscal year: 2024
- Funding amount: $308,700
- Project category: Standard Grant
Exploring Artificial Intelligence-enhanced Electronic Design Process Logs: Empowering High School Engineering Teachers
- Award number: 2119135
- Fiscal year: 2021
- Funding amount: $308,700
- Project category: Standard Grant
FW-HTF-RL: Collaborative Research: Future expert work in the age of "black box", data-intensive, and algorithmically augmented healthcare
- Award number: 1928586
- Fiscal year: 2019
- Funding amount: $308,700
- Project category: Standard Grant
CHS: Small: Scientific Design of Interactive Human Computation Systems
- Award number: 1525967
- Fiscal year: 2015
- Funding amount: $308,700
- Project category: Standard Grant
CAREER: Combining Crowdsourcing and Computational Creativity to Enable Narrative Generation for Education, Training, and Healthcare
- Award number: 1350339
- Fiscal year: 2014
- Funding amount: $308,700
- Project category: Continuing Grant
MAJOR: Assistive Artificial Intelligence to Support Creative Filmmaking in Computer Animation
- Award number: 1002748
- Fiscal year: 2010
- Funding amount: $308,700
- Project category: Standard Grant
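Similar NSFC (National Natural Science Foundation of China) grants
Mechanism by which Qizhi Weitong Fang intervenes in mechanical-stimulation-induced visceral hypersensitivity of the gastric mucosa in FD, studied via Piezo2 fast signaling
- Award number: 82305136
- Approval year: 2023
- Funding amount: CNY 300,000
- Project category: Young Scientists Fund
Mechanism by which abdominal tuina regulates gastric motility in FD, studied via Piezo-protein-mediated SCF/c-kit-JAK-STAT signaling that promotes proliferation of interstitial cells of Cajal
- Award number:
- Approval year: 2022
- Funding amount: CNY 300,000
- Project category: Young Scientists Fund
Targeted "starvation" therapy for brain metastases of triple-negative breast cancer using the dual-display engineered phage fd388-BH-WV
- Award number:
- Approval year: 2022
- Funding amount: CNY 300,000
- Project category: Young Scientists Fund
Study of the mechanism by which Chaihu Shugan San regulates gut microbiota-mitochondria crosstalk in FD and inhibits ICC oxeiptosis via the KEAP1/PGAM5/AIFM1 pathway to promote gastric motility
- Award number:
- Approval year: 2021
- Funding amount: CNY 550,000
- Project category: General Program
Modeling of total ionizing dose effects in UTBB FD-SOI transistors under multiple constraints
- Award number:
- Approval year: 2020
- Funding amount: CNY 640,000
- Project category: General Program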
Similar overseas grants
S&AS:FND:COLLAB: Planning Coordinated Event Observation for Structured Narratives
- Award number: 2313929
- Fiscal year: 2022
- Funding amount: $308,700
- Project category: Standard Grant
S&AS: FND: COLLAB: Planning and Control of Heterogeneous Robot Teams for Ocean Monitoring
- Award number: 2311967
- Fiscal year: 2022
- Funding amount: $308,700
- Project category: Standard Grant
S&AS: FND: COLLAB: Adaptable Vehicular Sensing and Control for Fleet-Oriented Systems in Smart Cities
- Award number: 1849238
- Fiscal year: 2019
- Funding amount: $308,700
- Project category: Standard Grant
S&AS: FND: COLLAB: Learning from Stories: Practical Value Alignment and Taskability for Autonomous Systems
- Award number: 1849231
- Fiscal year: 2019
- Funding amount: $308,700
- Project category: Standard Grant
S&AS: FND: COLLAB: Planning Coordinated Event Observation for Structured Narratives
- Award number: 1849249
- Fiscal year: 2019
- Funding amount: $308,700
- Project category: Standard Grant