The role of noise in information-theoretic models of sentence comprehension and production
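Basic information
- Award number: 1534318
- Principal investigator:
- Amount: $366,900
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2015
- Funding country: United States
- Period: 2015-09-01 to 2020-02-29
- Project status: Completed
- Source:
- Keywords:
Project abstract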
Human language as it is produced and understood is full of errors: people make speech errors, they make typographical errors when typing or texting, and there is often background noise that makes it impossible to perceive words accurately. Given the noisy nature of human language in practice, it is surprising that people can understand one another so well. The question of how people can communicate given noise is not yet solved, and is the focus of our work. Understanding how humans can understand noisy language is critical for two reasons. First, language technologies must be capable of processing noisy language input: translation services need to account for errors in the text being translated; search engines need to process noisily generated web content. Evidence concerning how humans understand language in noise can lead to improvements in the design of language technologies. In addition, until dialogue systems can produce coherent language responses (likely decades away), any practical application of such systems must be designed with an understanding of how humans deal with noisy or confusing language input. Second, on the clinical side, understanding how humans understand language that might contain errors will provide insights into language comprehension disorders. Recent research has shown that individuals with aphasia appear to assume the presence of more errors in the input than healthy participants, and thus show stronger reliance on their prior beliefs about the world when interpreting language. Applications of this work may lead to more efficient diagnosis and treatment options for such patients.
The goals of the proposed research are twofold. First, the researchers will investigate noise in the process of language comprehension, where noise falls into three categories: (a) deletions, such that the listener or reader might miss something that was intended; (b) insertions, such that the producer might accidentally insert something; and (c) swaps, such that the producer might accidentally switch elements in the stream. Second, the researchers will investigate an information-theoretic approach to memory in sentence production, where memory is a source of potential errors in language use. Recent human vision research suggests that memory capacity is best modeled as a limitation on the complexity of the representations, in terms of information-theoretic units called "bits". Simple representations require very few bits of information, but complex representations require many. The proposed research extends this idea to language, such that high-frequency words and phrases such as "the boy sees the girl" should be stored easily in memory, while less frequent components such as "the woman who the man met was tall" should be difficult to store in memory.
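The three noise categories and the notion of "bits" in the abstract above can be made concrete with a small sketch. This is only an illustration, not the researchers' model: the function names are hypothetical, noise is applied to word lists at hand-picked positions, and the two probabilities are invented for the example. The bit cost uses the standard information-theoretic definition, -log2(p).

```python
import math


def delete(words, i):
    """Deletion: the word at position i is lost."""
    return words[:i] + words[i + 1:]


def insert(words, i, word):
    """Insertion: an unintended word appears at position i."""
    return words[:i] + [word] + words[i:]


def swap(words, i):
    """Swap: the words at positions i and i + 1 trade places."""
    out = list(words)
    out[i], out[i + 1] = out[i + 1], out[i]
    return out


def bits(probability):
    """Information content in bits of an event with this probability."""
    return -math.log2(probability)


intended = "the boy sees the girl".split()

deleted = delete(intended, 1)            # drops "boy"
inserted = insert(intended, 2, "often")  # adds "often" before "sees"
swapped = swap(intended, 1)              # exchanges "boy" and "sees"

# Hypothetical probabilities, chosen only to show the direction of the effect:
# a frequent structure costs fewer bits than a rare one.
frequent_bits = bits(1 / 8)
rare_bits = bits(1 / 1024)

print(deleted, inserted, swapped, frequent_bits, rare_bits)
```

Under these assumed probabilities the frequent structure costs 3 bits and the rare one 10 bits, which matches the abstract's claim that less frequent material should be harder to hold in memory.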
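Project outcomes
Number of journal articles (0)
Number of monographs (0)
Number of research awards (0)
Number of conference papers (0)
Number of patents (0)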
Other publications by Edward Gibson
Assessing the inferential strength of epistemic must
- DOI:
- Publication year: 2023
- Journal:
- Impact factor: 2.1
- Authors: Giuseppe Ricciardi; Rachel Ryskin; Edward Gibson
- Corresponding author: Edward Gibson
Variation in spatial concepts: Different frames of reference on different axes
- DOI:
- Publication year: 2021
- Journal:
- Impact factor: 0
- Authors: Benjamin Pitt; Alexandra Carstensen; Edward Gibson; Steven T. Piantadosi
- Corresponding author: Steven T. Piantadosi
Concepts Are Restructured During Language Contact: The Birth of Blue and Other Color Concepts in Tsimane’-Spanish Bilinguals
- DOI:
- Publication year: 2023
- Journal:
- Impact factor: 0
- Authors: Saima Malik; Kyle Mahowald; Bevil R. Conway; Edward Gibson
- Corresponding author: Edward Gibson
Recent Advances in Imaging of Barrett’s Esophagus
- DOI:
- Publication year: 2018
- Journal:
- Impact factor: 0
- Authors: Shekhar Sharma; Edward Gibson; N. Uedo; Rajvinder Singh
- Corresponding author: Rajvinder Singh
Preassessment Procedures for Learning Disabled Children
- DOI: 10.1177/002221948601900904
- Publication year: 1986
- Journal:
- Impact factor: 3
- Authors: R. Harrington; Edward Gibson
- Corresponding author: Edward Gibson
Other grants by Edward Gibson
Evaluating meaning-based explanations of syntactic island effects cross-linguistically
- Award number: 2020840
- Fiscal year: 2020
- Amount: $366,900
- Project type: Standard Grant
Expanding the reach, impact and sustainability of ToyBox Study Malaysia: a kindergarten-based healthy behaviour intervention
- Award number: MR/V00607X/1
- Fiscal year: 2020
- Amount: $366,900
- Project type: Research Grant
Improving healthy energy balance- and obesity-related behaviours among preschoolers in Malaysia: feasibility of adapting the ToyBox-Study
- Award number: MR/P013805/1
- Fiscal year: 2017
- Amount: $366,900
- Project type: Research Grant
Workshop on Language Processing and Language Evolution: Special Session at the 2017 CUNY Conference on Human Sentence Processing
- Award number: 1629983
- Fiscal year: 2016
- Amount: $366,900
- Project type: Standard Grant
Doctoral Dissertation Research: A Communicative Perspective on Quantitative Syntax
- Award number: 1551543
- Fiscal year: 2016
- Amount: $366,900
- Project type: Standard Grant
Doctoral Dissertation Research: Investigating cognitive and communicative pressures on natural language lexicons
- Award number: 1451173
- Fiscal year: 2015
- Amount: $366,900
- Project type: Standard Grant
Doctoral Dissertation: Investigating the role of grammatical representation in language learnability
- Award number: 1420785
- Fiscal year: 2014
- Amount: $366,900
- Project type: Standard Grant
Doctoral Dissertation Research: Causal Representations in Children's Transitive Sentences
- Award number: 1227892
- Fiscal year: 2012
- Amount: $366,900
- Project type: Standard Grant
Origins of Numerical Competence: Assessment of Number Sense in Piraha
- Award number: 1022684
- Fiscal year: 2010
- Amount: $366,900
- Project type: Standard Grant
Doctoral Dissertation Research: Discovering Semantic Primitives
- Award number: 1025309
- Fiscal year: 2010
- Amount: $366,900
- Project type: Standard Grant
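Similar NSFC (National Natural Science Foundation of China) grants
Investor heterogeneous beliefs under information noise, theory and regulatory applications: evidence from securities-market asset price and trading behavior data
- Award number: 72103141
- Approval year: 2021
- Amount: CNY 300,000
- Project type: Young Scientists Fund
Noise or information? A study of stock forums based on attention and social network theory
- Award number: 71673318
- Approval year: 2016
- Amount: CNY 480,000
- Project type: General Program
Volatility relationships between the CSI 300 stock index futures and spot markets, based on microstructure noise analysis
- Award number: 71661028
- Approval year: 2016
- Amount: CNY 275,000
- Project type: Regional Science Fund
Quantum correlations and quantum information in cavity QED under environmental noise
- Award number: 11364006
- Approval year: 2013
- Amount: CNY 500,000
- Project type: Regional Science Fund
Properties and evolution of entangled states of Heisenberg spin chain systems in a non-uniform magnetic field under the influence of environmental noise
- Award number: 10664004
- Approval year: 2006
- Amount: CNY 250,000
- Project type: Regional Science Fund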
Similar overseas grants
Multi-scale feedbacks for robust organ development
- Award number: 10687672
- Fiscal year: 2023
- Amount: $366,900
- Project type:
Hybrid Intelligence for Trustable Diagnosis And Patient Management of Prostate Cancer (HIT-PIRADS)
- Award number: 10611212
- Fiscal year: 2023
- Amount: $366,900
- Project type:
Multidimensional brain connectome features of depression and anxiety
- Award number: 10571512
- Fiscal year: 2023
- Amount: $366,900
- Project type:
Quantifying the Brain Metabolism Underlying Task-Based BOLD Imaging
- Award number: 10432379
- Fiscal year: 2022
- Amount: $366,900
- Project type: