RI: Small: New tools for studying structural and inductive bias in NLP models
RI:小:研究 NLP 模型中的结构和归纳偏差的新工具
基本信息
- 批准号:2128145
- 负责人:
- 金额:$ 50万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2021
- 资助国家:美国
- 起止时间:2021-10-01 至 2024-09-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Modern natural language processing systems, based on neural networks trained using large amounts of text, are a key part of the infrastructure of the nation and the world. These systems power practical tools like machine translation, web search, or automatic question answering, as well as research tools that help scientists and policy makers. These language processing models have made enormous progress in many ways, yet systems still fail unexpectedly, their successes cannot be explained, and their blind spots lead to biases. This project develops new tools for studying language models: why they work as well as they do, what their limitations are, and what distortions they introduce into language understanding, with the goal of improved systems and helping mitigate negative impacts on society.This project develops and investigates four kinds of new analytic tools for studying the inductive biases of language models - the structural tendencies that determine what they can learn. The structural transfer-learning paradigm involves training language models on artificial languages that can be manipulated, to see which structural aspects improve performance on natural language. The challenge-task paradigm brings humans in the loop to develop new evaluations to study why and how language processing systems fail, such as on aspect of language that change over time. The new theoretical framework of sensitivity models the complexity of language processing tasks by measuring how responsive the classification is to minor changes in the input, demonstrating which tasks or examples are easy or hard. And new tools are introduced to measure how embeddings of words introduce structural distortions - exaggerations or understatements in word relationships - that can cause models to fail. Understanding the limitations of technology and what makes one system better or one task or dataset harder than another is a crucial step toward building better language processing systems.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
现代自然语言处理系统基于使用大量文本训练的神经网络,是国家和世界基础设施的关键部分。这些系统为机器翻译、网络搜索或自动问答等实用工具以及为科学家和政策制定者提供帮助的研究工具提供支持。这些语言处理模型在许多方面取得了巨大进步,但系统仍然会意外失败,它们的成功无法解释,而且它们的盲点会导致偏差。该项目开发用于研究语言模型的新工具:它们为什么有效、它们的局限性以及它们给语言理解带来的扭曲,目的是改进系统并帮助减轻对社会的负面影响。该项目开发并研究了四种新的分析工具,用于研究语言模型的归纳偏差 - 决定它们可以学到什么的结构倾向。结构迁移学习范式涉及在可操作的人工语言上训练语言模型,以了解哪些结构方面可以提高自然语言的性能。挑战任务范式让人类参与开发新的评估,以研究语言处理系统失败的原因和方式,例如随时间变化的语言方面。新的敏感性理论框架通过测量分类对输入微小变化的响应程度来模拟语言处理任务的复杂性,展示哪些任务或示例是简单或困难的。还引入了新工具来衡量单词嵌入如何引入结构扭曲(单词关系中的夸大或轻描淡写),这可能导致模型失败。了解技术的局限性以及是什么使一个系统更好或一个任务或数据集比另一个系统更难,是构建更好的语言处理系统的关键一步。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值进行评估,被认为值得支持以及更广泛的影响审查标准。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Daniel Jurafsky其他文献
ReFT: Representation Finetuning for Language Models
ReFT:语言模型的表示微调
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
Zhengxuan Wu;Aryaman Arora;Zheng Wang;Atticus Geiger;Daniel Jurafsky;Christopher D. Manning;Christopher Potts - 通讯作者:
Christopher Potts
How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis
LLM 的谈判能力如何?
- DOI:
10.48550/arxiv.2402.05863 - 发表时间:
2024-02-08 - 期刊:
- 影响因子:0
- 作者:
Federico Bianchi;P. Chia;Mert Yüksekgönül;Jacopo Tagliabue;Daniel Jurafsky;James Zou - 通讯作者:
James Zou
Psych-E: Configurable Response Generation using Personality Traits and Pragmatics
Psych-E:使用个性特征和语用学生成可配置的响应
- DOI:
10.18653/v1/2020.findings-emnlp.22 - 发表时间:
2024-09-14 - 期刊:
- 影响因子:0
- 作者:
Sumanth Dathathri;Andrea Madotto;Janice Lan;Eric Frank;Piero Molino;J. Yosinski;J. Devlin;Ming;Kenton Lee;Emily Dinan;Stephen Roller;Kurt Shuster;A. Dix;Janet Finlay;G. Abowd;Matt Gardner;Joel Grus;Oyvind Mark Neumann;Pradeep Tafjord;Nelson F Dasigi;Matthew Liu;Matej Gjurkovi´c;Mladen Karan;Iva Vukojevi´c;Karthik Gopalakrishnan;Behnam Hedayatnia;Qingrui Chen;Anna Gottardi;Sanjeev Kwatra;Anu;Raefer Venkatesh;Gabriel Dilek;Hakkani;Seokhwan Kim;Yang Liu;Mihail Eric;P. Micikevicius;Sharan Narang;Jonah Alben;Yixin Nie;Mary Williamson;Mohit Bansal;Douwe;Kishore Papineni;S. Roukos;Todd Ward;Hannah Rashkin;David Reitter;Gaurav Singh;Tomar;Zhancheng Ren;Qi;Xiaolei Diao;Naman Goyal;Da Ju;Yinhan Liu;Jing Xu;Myle Ott;Eric M. Smith;Y;J. Weston;Sougata Saha;Souvik Das;Elizabeth Soper;A. Stolcke;K. Ries;N. Coccaro;Elizabeth Shriberg;Rebecca Bates;Daniel Jurafsky;Paul Taylor;Rachel Martin;Ashish Vaswani;Noam M. Shazeer;Niki Parmar;Thomas Wolf;Lys;re Debut;re;Victor Sanh;Julien Chaumond;Clement Delangue;Anthony Moi;Pierric Cistac;Tim Rault;Rémi Louf;Morgan Funtow;Joe Davison;Sam Shleifer;Patrick von Platen;Clara Ma;Yacine Jernite;J. Plu;Canwen Xu;Teven Le Scao;Sylvain Gugger;Mariama Drame - 通讯作者:
Mariama Drame
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
CausalGym:对语言任务的因果可解释性方法进行基准测试
- DOI:
10.48550/arxiv.2402.12560 - 发表时间:
2024-02-19 - 期刊:
- 影响因子:0
- 作者:
Aryaman Arora;Daniel Jurafsky;Christopher Potts - 通讯作者:
Christopher Potts
Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models
多语言 BERT 有口音:评估英语对多语言模型流畅性的影响
- DOI:
- 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Isabel Papadimitriou;Kezia Lopez;Daniel Jurafsky - 通讯作者:
Daniel Jurafsky
Daniel Jurafsky的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Daniel Jurafsky', 18)}}的其他基金
RI: Medium: Deep Understanding: Integrating Neural and Symbolic Models of Meaning
RI:中:深度理解:整合意义的神经模型和符号模型
- 批准号:
1514268 - 财政年份:2015
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
RI: Small: Learning Meaning and Grammar from Interaction, Context, and the World
RI:小:从互动、情境和世界中学习意义和语法
- 批准号:
1216875 - 财政年份:2012
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
RI-Small: Unsupervised Learning of Meaning
RI-Small:无监督意义学习
- 批准号:
0811974 - 财政年份:2008
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
Modeling Pronunciation Variation for Universal Access to Speech Understanding
为普遍获得语音理解而建模发音变化
- 批准号:
9978025 - 财政年份:1999
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
CAREER: Spoken Lexical Processing in Humans and Machines
职业:人类和机器的口语词汇处理
- 批准号:
9733067 - 财政年份:1998
- 资助金额:
$ 50万 - 项目类别:
Continuing Grant
SGER: Using Text Coherence and Verbal Valence in Long- Distance N-grams
SGER:在长距离 N 元语法中使用文本连贯性和语言效价
- 批准号:
9704046 - 财政年份:1997
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
相似国自然基金
基于免疫多肽组学对小细胞肺癌新靶点STMN1抗原表位的解析及在TCR-T治疗中的应用研究
- 批准号:82303772
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
AMPK信号传递介导加州新小绥螨对高温适应的调控机制
- 批准号:32302425
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于多时序CT影像与病理WSI的非小细胞肺癌新辅助免疫治疗疗效预测研究
- 批准号:82360356
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
PHLDA3通过ALDH1A1调控非小细胞肺癌干性促进新辅助化疗耐药的作用和机制研究
- 批准号:82302950
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于液滴微流控开展非小细胞肺癌新抗原特异性TCR的可视化分析及其识别机制的研究
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
RI: Small: New Directions in Probabilistic Deep Learning: Exponential Families, Bayesian Nonparametrics and Empirical Bayes
RI:小:概率深度学习的新方向:指数族、贝叶斯非参数和经验贝叶斯
- 批准号:
2127869 - 财政年份:2021
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
RI: Small: A Study of New Aggregate Losses for Machine Learning
RI:小:机器学习新总损失的研究
- 批准号:
2008532 - 财政年份:2020
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
RI: Small: A Study of New Aggregate Losses for Machine Learning
RI:小:机器学习新总损失的研究
- 批准号:
2103450 - 财政年份:2020
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
RI: Small: A Study of New Aggregate Losses for Machine Learning
RI:小:机器学习新总损失的研究
- 批准号:
2103450 - 财政年份:2020
- 资助金额:
$ 50万 - 项目类别:
Standard Grant
RI: Small: Robotic Path Planning to Reveal Wireless Rays - A New Foundation for the Optimization of Networked Robotic Operations
RI:小型:揭示无线射线的机器人路径规划 - 优化网络机器人操作的新基础
- 批准号:
2008449 - 财政年份:2020
- 资助金额:
$ 50万 - 项目类别:
Standard Grant