Collaborative Research: CompCog: Psychological, Computational, and Neural Adequacy in a Deep Learning Model of Human Speech Recognition
合作研究:CompCog:人类语音识别深度学习模型中的心理、计算和神经充分性
基本信息
- 批准号:2043950
- 负责人:
- 金额:$ 17.89万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2021
- 资助国家:美国
- 起止时间:2021-06-01 至 2025-05-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Computer technology for speech recognition has advanced to an amazing degree over the past decade. Many of us use it daily -- to dictate text messages on smart phones or to navigate automated phone systems. As good as these systems are, humans still outperform them in complex, crowded, and noisy acoustic environments. If more were known concerning how humans adapt to these challenging situations, speech technology might be made more adaptive and robust. For example, computer systems for speech recognition use complex "deep learning" networks that often need to be trained in ways that are very different from how humans learn language. Although neural network models aimed at simulating human language processing are much simpler, which allows scientists to develop hypotheses about how human language processing works, they don't use real speech as input. Instead, they use phonetic features that are more like text than speech and so fail to address the core problem of how humans map the acoustics of speech to words. This research program focuses on bridging the gap between the complex artificial neural network models used in current technologies for speech recognition and the simpler neural network models used to investigate how humans actually perceive speech.This research program builds on a new neural network model for speech that aims to achieve high recognition accuracy on many words produced by several speakers. Crucially, the model can do this with minimal complexity (using many fewer layers than commercial speech recognition systems), which allows researchers to understand the computations it performs. The research plans include extending the model to a large vocabulary, training on naturalistic speech, and adding biologically plausible preprocessing modeled on the human auditory pathways. The model will be compared with key aspects of human spoken word recognition behavior as well as with human neural responses to spoken speech. The work has the potential to generate new insights to advance speech technology by making it more robust in challenging environments, with potential impact on speech technology used for health, law, education, and the automatic captioning that makes speech accessible to the deaf and hard of hearing. In addition, individuals ranging from high school students to Ph.D. students will be part of the research team and will have rich research experiences that will promote development of technical skills useful for careers in academic research or a variety of non-academic careers.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在过去的十年里,用于语音识别的计算机技术已经发展到了惊人的程度。我们中的许多人每天都使用它——在智能手机上听写短信或导航自动电话系统。尽管这些系统非常好,但在复杂、拥挤和嘈杂的声学环境中,人类的表现仍然优于它们。如果更多地了解人类如何适应这些具有挑战性的情况,语音技术可能会变得更具适应性和鲁棒性。例如,用于语音识别的计算机系统使用复杂的“深度学习”网络,这些网络通常需要以与人类学习语言的方式非常不同的方式进行训练。尽管旨在模拟人类语言处理的神经网络模型要简单得多,这使得科学家能够提出有关人类语言处理如何工作的假设,但它们并不使用真实的语音作为输入。相反,他们使用的语音特征更像文本而不是语音,因此无法解决人类如何将语音声学映射到单词的核心问题。该研究计划的重点是弥合当前语音识别技术中使用的复杂人工神经网络模型与用于研究人类如何实际感知语音的更简单的神经网络模型之间的差距。该研究计划建立在一种新的语音神经网络模型的基础上旨在对多个说话者产生的许多单词实现高识别准确率。至关重要的是,该模型可以以最小的复杂性(使用比商业语音识别系统少得多的层)来做到这一点,这使得研究人员能够理解它执行的计算。研究计划包括将模型扩展到大词汇量、自然语音训练,以及添加仿照人类听觉路径的生物学上合理的预处理。该模型将与人类口语单词识别行为的关键方面以及人类对口语的神经反应进行比较。这项工作有可能产生新的见解,通过使其在充满挑战的环境中更加稳健,来推进语音技术的发展,对用于健康、法律、教育的语音技术以及使聋哑人和残障人士能够理解语音的自动字幕产生潜在影响。听力。此外,从高中生到博士的个人。学生将成为研究团队的一部分,并将拥有丰富的研究经验,这将促进对学术研究职业或各种非学术职业有用的技术技能的发展。该奖项反映了 NSF 的法定使命,并被认为值得通过以下方式支持:使用基金会的智力价值和更广泛的影响审查标准进行评估。
项目成果
期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A tale of two lexica: Investigating computational pressures on word representation with neural networks
两个词汇的故事:利用神经网络研究单词表示的计算压力
- DOI:10.3389/frai.2023.1062230
- 发表时间:2023-03
- 期刊:
- 影响因子:4
- 作者:Avcu, Enes;Hwang, Michael;Brown, Kevin Scott;Gow, David W.
- 通讯作者:Gow, David W.
Investigating the Extent to which Distributional Semantic Models Capture a Broad Range of Semantic Relations
研究分布式语义模型捕获广泛语义关系的程度
- DOI:10.1111/cogs.13291
- 发表时间:2023-05
- 期刊:
- 影响因子:2.5
- 作者:Brown, Kevin S.;Yee, Eiling;Joergensen, Gitte;Troyer, Melissa;Saltzman, Elliot;Rueckl, Jay;Magnuson, James S.;McRae, Ken
- 通讯作者:McRae, Ken
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Kevin Brown其他文献
Slow Light Imaging Spectroscopy: Sensitivity of the Instrument Function to Optical Thickness and Gate Delay
慢光成像光谱:仪器功能对光厚度和门延迟的灵敏度
- DOI:
10.2514/6.2024-0807 - 发表时间:
2024-01-04 - 期刊:
- 影响因子:0
- 作者:
A. Abbasszadehrad;Jason M. Meyers;Kevin Brown;J. Bak;James Creel;Arthur Dogariu;Richard B. Miles - 通讯作者:
Richard B. Miles
Quantification of the Stapedial Reflex Reveals Delayed Responses in Autism
镫骨反射的量化揭示了自闭症的延迟反应
- DOI:
10.1002/aur.1297 - 发表时间:
2013-10-01 - 期刊:
- 影响因子:4.7
- 作者:
Richard Lukose;Kevin Brown;Carol M. Barber;Randy Kulesza - 通讯作者:
Randy Kulesza
IODP Exp. 348ライザー掘削から明らかになったスプレー断層上盤の応力状態と微小変形
IODP Exp. 348 隔水管开挖揭示的张开断层上层应力状态和微小变形
- DOI:
- 发表时间:
2014 - 期刊:
- 影响因子:0
- 作者:
山本由弦・林為人・山本裕二・曽根大貴・Pierre Henry・大坪誠;Kevin Brown; Ana Crespo;IODP Expedition 348研究者 - 通讯作者:
IODP Expedition 348研究者
Language Models are Few-shot Learners for Prognostic Prediction
语言模型是用于预后预测的小样本学习者
- DOI:
10.48550/arxiv.2302.12692 - 发表时间:
2023-02-24 - 期刊:
- 影响因子:0
- 作者:
Zekai Chen;Mariann Micsinai Balan;Kevin Brown - 通讯作者:
Kevin Brown
Model sensitivity and robustness in the estimation of larval transport: A study of particle tracking parameters
幼虫运输估计中的模型灵敏度和鲁棒性:粒子跟踪参数的研究
- DOI:
10.1016/j.jmarsys.2013.03.004 - 发表时间:
2013-06-01 - 期刊:
- 影响因子:2.8
- 作者:
R. D. Simons;D. Siegel;Kevin Brown - 通讯作者:
Kevin Brown
Kevin Brown的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Kevin Brown', 18)}}的其他基金
HSI Planning Project: Examining Inclusion and Other Variables of STEM Retention at a Faith-Based, Residential University
HSI 规划项目:检查一所基于信仰的寄宿大学的包容性和 STEM 保留的其他变量
- 批准号:
2345328 - 财政年份:2024
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
CRCNS US-Spain Research Proposal: Collaborative Research: Tracking and modeling the neurobiology of multilingual speech recognition
CRCNS 美国-西班牙研究提案:合作研究:跟踪和建模多语言语音识别的神经生物学
- 批准号:
2207747 - 财政年份:2022
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Collaborative Research: Modeling Spatiotemporal Control of EGFR-ERK Signaling in Gene-edited Cell Systems
合作研究:基因编辑细胞系统中 EGFR-ERK 信号传导的时空控制建模
- 批准号:
1906161 - 财政年份:2018
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Collaborative Research: Modeling Spatiotemporal Control of EGFR-ERK Signaling in Gene-edited Cell Systems
合作研究:基因编辑细胞系统中 EGFR-ERK 信号传导的时空控制建模
- 批准号:
1715342 - 财政年份:2017
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
DCL: NSF INCLUDES Conference - Collaborative Research: Envisioning Impact
DCL:NSF 包括会议 - 协作研究:展望影响
- 批准号:
1650289 - 财政年份:2016
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Experimental and theoretical investigation of rock friction at seismic slip rates
地震滑移率下岩石摩擦的实验和理论研究
- 批准号:
0838255 - 财政年份:2009
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
A Moored Geodetic Seafloor Monitoring System (GEOCE)
系泊海底大地监测系统 (GEOCE)
- 批准号:
0551363 - 财政年份:2007
- 资助金额:
$ 17.89万 - 项目类别:
Continuing Grant
Hydrologic Monitoring at the Nantroseize OOST Zone: A Pilot Study
Nantroseize OOST 区的水文监测:试点研究
- 批准号:
0646811 - 财政年份:2007
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
SGER - Hydrologic Monitoring of Leaking Coseismically Active Thrusts: A Pilot Study in the Nankai Subduction Zone
SGER - 同震活跃逆冲泄漏的水文监测:南开俯冲带的试点研究
- 批准号:
0628676 - 财政年份:2006
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Next Generation Flux Meters For Enhanced Geochemical And High-Resolution Real-Time Benthic Flux Monitoring
用于增强地球化学和高分辨率实时底栖通量监测的下一代通量计
- 批准号:
0241998 - 财政年份:2003
- 资助金额:
$ 17.89万 - 项目类别:
Continuing Grant
相似国自然基金
IGF-1R调控HIF-1α促进Th17细胞分化在甲状腺眼病发病中的机制研究
- 批准号:82301258
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
CTCFL调控IL-10抑制CD4+CTL旁观者激活促口腔鳞状细胞癌新辅助免疫治疗抵抗机制研究
- 批准号:82373325
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
RNA剪接因子PRPF31突变导致人视网膜色素变性的机制研究
- 批准号:82301216
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
血管内皮细胞通过E2F1/NF-kB/IL-6轴调控巨噬细胞活化在眼眶静脉畸形中的作用及机制研究
- 批准号:82301257
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于多元原子间相互作用的铝合金基体团簇调控与强化机制研究
- 批准号:52371115
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: CompCog: Modeling Search within the Mental Lexicon
合作研究:CompCog:心理词典中的建模搜索
- 批准号:
2235362 - 财政年份:2023
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Collaborative Research: CompCog: Modeling Search within the Mental Lexicon
合作研究:CompCog:心理词典中的建模搜索
- 批准号:
2235363 - 财政年份:2023
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Collaborative Research: CompCog: RI: Medium: Understanding human planning through AI-assisted analysis of a massive chess dataset
合作研究:CompCog:RI:中:通过人工智能辅助分析海量国际象棋数据集了解人类规划
- 批准号:
2312374 - 财政年份:2023
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Collaborative Research: CompCog: RI: Medium: Understanding human planning through AI-assisted analysis of a massive chess dataset
合作研究:CompCog:RI:中:通过人工智能辅助分析海量国际象棋数据集了解人类规划
- 批准号:
2312373 - 财政年份:2023
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant
Collaborative Research: CompCog: Adversarial Collaborative Research on Intuitive Physical Reasoning
协作研究:CompCog:直观物理推理的对抗性协作研究
- 批准号:
2121009 - 财政年份:2021
- 资助金额:
$ 17.89万 - 项目类别:
Standard Grant