Auditory Scene Analysis with Complex Sounds
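Basic Information
- Grant number: 9759900
- Principal investigator:
- Amount: $374,200
- Host institution:
- Host institution country: United States
- Project category:
- Fiscal year: 2016
- Funding country: United States
- Project period: 2016-09-01 to 2021-08-31
- Project status: Completed
- Source: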
- Keywords: Acoustics; Address; Age; Algorithms; Area; Attention; Auditory; Auditory system; Behavioral; Binaural; Brain; Characteristics; Cochlear Implants; Communication; Competence; Complex; Cues; Dependence; Development; Devices; Disease; Ear; Engineering; Exhibits; Frequencies; Frustration; Goals; Grouping; Hearing; Hearing Aids; Human; Implant; Knowledge; Measures; Methods; Modeling; Music; Noise; Performance; Process; Property; Prosthesis; Psychoacoustics; Reporting; Research; Restaurants; Role; Signal Transduction; Social isolation; Source; Speech; Speech Intelligibility; Stimulus; Stream; Structure; System; Technology; Testing; Time; Voice; Work; auditory comprehension; automated speech recognition; design; developmental disease; experimental study; hearing impairment; improved; insight; interest; normal hearing; novel; remediation; segregation; sound; speech recognition; speech synthesis; success; vocal cord
Project Abstract
PROJECT SUMMARY / ABSTRACT
Perhaps the most pervasive problem faced by listeners with hearing impairment or cochlear implants is the
difficulty of recognizing speech and other sounds in the presence of competing sound sources, as when
conversing at a restaurant. This difficulty in “sound segregation” – hearing a particular sound of interest when it
is embedded in a mixture of other sounds – often leads to frustration and social isolation, and is not adequately
addressed by current hearing aids and implants. Sound segregation difficulties are also commonly reported in
developmental auditory disorders. The long-term goal of the proposed research is to reveal the basis of sound
segregation and to provide insights that will facilitate improved prosthetic devices and remediation strategies,
as well as more effective machine systems for processing sounds, e.g. for automatic speech recognition. The
development of more effective devices, technologies, and therapies is currently limited by an incomplete
understanding of the factors that underlie sound segregation by normal-hearing listeners. In particular, little is
known about sound segregation with complex naturalistic sounds, in part because much of the research in this
area has been conducted using simple signals that are impoverished relative to the sounds listeners normally
encounter. We propose to enrich the understanding of sound segregation with three sets of experiments that
use novel sound synthesis methods to manipulate properties of natural speech and other sounds and test their
role in segregation with behavioral experiments in human listeners. Aim 1 manipulates the classically proposed
grouping cue provided by harmonic frequency relations and investigates the mechanisms underlying their
effect. Aim 2 investigates the role of prior knowledge of voice and speech structure on segregation, and should
help to explain why some voices are easier or harder to segregate than others. Aim 3 investigates the role of
attentive tracking in the segregation of sounds from mixtures, and will explore the factors that facilitate tracking
or cause it to fail. The results will reveal the mechanisms underlying sound segregation by the healthy auditory
system, and will provide insights into the factors that limit auditory comprehension in the presence of multiple
sound sources, hopefully suggesting new strategies for signal enhancement, prosthetic devices, and
behavioral remediation.
Project Outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Josh H McDermott
Commonality and variation in mental representations of music revealed by a cross-cultural comparison of rhythm priors in 15 countries
- DOI: 10.1038/s41562-023-01800-9
- Publication date: 2024-03-04
- Journal:
- Impact factor: 29.9
- Authors: Nori Jacoby; Rainer Polak; Jessica A Grahn; Daniel J. Cameron; Kyung Myun Lee; Ricardo A. Godoy; E. Undurraga; Tomás Huanca; Timon Thalwitzer; Noumouké Doumbia; Daniel Goldberg; E. Margulis; Patrick C M Wong; Luis Jure; M. Rocamora; S. Fujii; Patrick E. Savage; Jun Ajimi; Rei Konno; Sho Oishi; Kelly Jakubowski; Andre Holzapfel; Esra Mungan; Ece Kaya; Preeti Rao; M. A. Rohit; Suvarna Alladi; Bronwyn Tarr; Manuel Anglada; Peter M C Harrison; Malinda J. McPherson; Sophie Dolan; Alex Durango; Josh H McDermott
- Corresponding author: Josh H McDermott
A PERCEPTUALLY INSPIRED GENERATIVE MODEL OF RIGID-BODY CONTACT SOUNDS
- DOI:
- Publication date: 2019
- Journal:
- Impact factor: 0
- Authors: James Traer; M. Cusimano; Josh H McDermott
- Corresponding author: Josh H McDermott
Other grants by Josh H McDermott
Computational Cognitive Neuroscience of Human Auditory Cortex
- Grant number: 10468917
- Fiscal year: 2019
- Amount: $374,200
- Project category:
Computational Cognitive Neuroscience of Human Auditory Cortex
- Grant number: 9797408
- Fiscal year: 2019
- Amount: $374,200
- Project category:
Computational Cognitive Neuroscience of Human Auditory Cortex
- Grant number: 10246259
- Fiscal year: 2019
- Amount: $374,200
- Project category:
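Similar grants from the National Natural Science Foundation of China (NSFC)
Ontology-driven spatial semantic modeling of address data and address matching methods
- Grant number: 41901325
- Approval year: 2019
- Amount: CNY 220,000
- Project category: Young Scientists Fund
Research on neuromorphic vision object recognition algorithms driven by spatiotemporal sequences
- Grant number: 61906126
- Approval year: 2019
- Amount: CNY 240,000
- Project category: Young Scientists Fund
Research on memory-safety defense techniques for objects targeted by memory attacks
- Grant number: 61802432
- Approval year: 2018
- Amount: CNY 250,000
- Project category: Young Scientists Fund
Research on optimized address mapping table design and memory access optimization for large-capacity solid-state drives
- Grant number: 61802133
- Approval year: 2018
- Amount: CNY 230,000
- Project category: Young Scientists Fund
Research on IP address-driven multipath routing and traffic transmission control
- Grant number: 61872252
- Approval year: 2018
- Amount: CNY 640,000
- Project category: General Program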
Similar overseas grants
Miniaturized AD/ADRD Microphysiological Systems Platform for High-throughput Screening
- Grant number: 10761587
- Fiscal year: 2023
- Amount: $374,200
- Project category:
Ready to CONNECT: Conversation and Language in Autistic Teens
- Grant number: 10807563
- Fiscal year: 2023
- Amount: $374,200
- Project category:
Improving communication outcomes in children with cleft palate in rural India
- Grant number: 10741579
- Fiscal year: 2023
- Amount: $374,200
- Project category:
Effects of deep brain stimulation (DBS) on laryngeal function and associated behaviors in Parkinson Disease
- Grant number: 10735930
- Fiscal year: 2023
- Amount: $374,200
- Project category:
Multimodal dMRI, MRS and MEG studies of language impairment in low-verbal ASD
- Grant number: 10636420
- Fiscal year: 2023
- Amount: $374,200
- Project category: