RI: Small: Collaborative Research: 'Houston We Have A Solution': Novel Speech Processing Advancements for Analysis of Large Asynchronous Multi-Channel Audio Corpora
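Basic Information
- Award number: 1219130
- Principal investigator:
- Amount: $365.2K
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2012
- Funding country: United States
- Period: 2012-09-01 to 2016-12-31
- Status: Completed
- Source:
- Keywords:
Project Abstract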
This project is focused on developing new speech processing techniques which will transform access to large, asynchronous, multi-channel, and diverse collections of multimedia materials. In particular, the algorithms developed are being employed to create a novel multi-source and multi-scale event reconstruction system that brings together the massive archives of the Apollo lunar missions, to create experiential interaction with historical materials. Specific research advancements are focused on state-of-the-art acoustic environment analysis, speech recognition including keyword spotting, speaker identification under adverse conditions, multimodal content alignment, and automated linking for events and entities from spoken content. Specifically, the research is developing: (i) new techniques for noise- and channel-robust acoustic processing, exploiting missing-features concepts with novel feature extraction and compensation techniques, (ii) a new articulatory framework for speech recognition for robustness to variations in speech production, (iii) environmental "sniffing" techniques to automatically characterize acoustic environments to improve robustness, and (iv) automatic detection of novel task-specific audio events. Since the data is asynchronous, unique speech analytics techniques are being formulated to address the large number of "local loop" intercom circuits in the NASA Mission Control Center, audio recorded onboard the two Apollo spacecraft during specific mission events, and space-to-ground radio circuits. The specific speech, language, and knowledge extraction advancements will be integrated into a new automated evaluation model that reflects specific challenges encountered in the event reconstruction task. This platform will be deployed and evaluated by actual users from the Science and Engineering Education Center (SEEC) of the University of Texas at Dallas.
Integration of robust speech processing algorithms with event reconstruction systems will have a direct and immediate impact on education, society, and government organizations. Working with NASA's Apollo mission data allows for the development of speech technology for challenging audio that contains severe communication channel artifacts, cross-talk/static/tones, and low signal-to-noise ratios. The software being developed in this project will be made available to any non-profit organization for use in audio/video search (download with training modules). Students working on senior design teams will also develop a Contact Science station to be deployed in Dallas, TX and overseen by the University of Texas at Dallas Science and Engineering Education Center to illustrate and assess student use of the advancements. As a lasting legacy of the project, the team includes eminent historians of human space flight, who will explore opportunities to deploy this event reconstruction system in a museum setting where it can support both scholarship and public engagement, and we will make the system itself available on an open-source basis to support other researchers.
Project Outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by John Hansen
Springer Publisher
- DOI:
- Publication year: 2004
- Journal:
- Impact factor: 0
- Authors: Huseyin Abut; John Hansen; Kazuya Takeda (Eds.)
- Corresponding author: Kazuya Takeda (Eds.)
An energy and power-aware approach to high-level synthesis of asynchronous systems
- DOI: 10.1109/iccad.2010.5654169
- Publication year: 2010
- Journal:
- Impact factor: 0
- Authors: John Hansen; Montek Singh
- Corresponding author: Montek Singh
Pedometer Use as Motivation for Physical Activity in Cardiac Tele-Rehabilitation
- DOI: 10.5334/ijic.2288
- Publication year: 2015
- Journal:
- Impact factor: 2.4
- Authors: C. Thorup; Mette Grønkjær; H. Spindler; J. Andreasen; John Hansen; B. Dinesen; Gitte Nielsen; E. E. Sørensen
- Corresponding author: E. E. Sørensen
Concurrency-Enhancing Transformations for Asynchronous Behavioral Specifications: A Data-Driven Approach
- DOI:
- Publication year: 2008
- Journal:
- Impact factor: 0
- Authors: John Hansen; Montek Singh
- Corresponding author: Montek Singh
〔書評〕森本あんり著『アメリカ・キリスト教史-理念によって建てられた国の軌跡』(新教出版社)
[Book review] Anri Morimoto, "A History of American Christianity: The Trajectory of a Nation Built on Ideals" (Shinkyo Shuppansha)
- DOI:
- Publication year: 2006
- Journal:
- Impact factor: 0
- Authors: Huseyin Abut; John Hansen; Kazuya Takeda (Eds.); 有田 節子; 畠山 寛; 幸野 稔; 遠藤泰生
- Corresponding author: 遠藤泰生
Other grants by John Hansen
COLLABORATIVE RESEARCH: Social-Emotional Analysis of the Language Environment (SEAL): Key Word & Phrase Spotting in Early Childhood Care Settings
- Award number: 2234916
- Fiscal year: 2023
- Amount: $365.2K
- Award type: Standard Grant
EAGER: Collaborative Research: Second Language Speech Production: Formulation of Objective Speech Intelligibility Measures and Learner-Specific Feedback
- Award number: 2140415
- Fiscal year: 2021
- Amount: $365.2K
- Award type: Standard Grant
CCRI: Medium: Developing a Multi-Channel Naturalistic Audio Corpora for the Natural Language Processing Research Community
- Award number: 2016725
- Fiscal year: 2020
- Amount: $365.2K
- Award type: Standard Grant
Workshops on NASA Apollo Mission Audio as a Community Research Resource
- Award number: 1943365
- Fiscal year: 2019
- Amount: $365.2K
- Award type: Standard Grant
Collaborative Research: CSL-MultiAD: Assessing Collaborative STEM Learning through Rich Information Flow based on Multi-Sensor Audio Diarization
- Award number: 1918032
- Fiscal year: 2019
- Amount: $365.2K
- Award type: Standard Grant
Doctoral Dissertation Research in DRMS: The Consumer Logic of Anti-Government Antagonism
- Award number: 1357620
- Fiscal year: 2014
- Amount: $365.2K
- Award type: Standard Grant
Collaborative Research: SBE Alliance: Great Lakes Alliance for the Social and Behavioral Sciences (GLASS)
- Award number: 0750599
- Fiscal year: 2007
- Amount: $365.2K
- Award type: Continuing Grant
Collaborative Research: Primary Elections for U.S. State and Federal Offices: A Comprehensive Database and Analysis
- Award number: 0617555
- Fiscal year: 2006
- Amount: $365.2K
- Award type: Standard Grant
SBE Collaborative Research: Northwestern AGEP for SBE
- Award number: 0549069
- Fiscal year: 2005
- Amount: $365.2K
- Award type: Standard Grant
Interactions of the TCR co-receptors and p56LCK in an Ectothermic Model
- Award number: 0453924
- Fiscal year: 2004
- Amount: $365.2K
- Award type: Continuing Grant
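Similar NSFC (National Natural Science Foundation of China) grants
Response and molecular mechanism of small RNA regulation of type I-F CRISPR-Cas adaptive immunity
- Award number: 32000033
- Approval year: 2020
- Amount: CNY 240,000
- Award type: Young Scientists Fund
Mechanism by which small RNAs regulate the biocontrol function of Bacillus amyloliquefaciens FZB42
- Award number: 31972324
- Approval year: 2019
- Amount: CNY 580,000
- Award type: General Program
Mechanism by which Streptococcus mutans small RNAs link LuxS quorum sensing and biofilm formation
- Award number: 81900988
- Approval year: 2019
- Amount: CNY 210,000
- Award type: Young Scientists Fund
Function and mechanism of key gut bacterial small RNAs in the onset and progression of Crohn's disease
- Award number: 31870821
- Approval year: 2018
- Amount: CNY 560,000
- Award type: General Program
Molecular mechanism of pigeon milk secretion, analyzed using small RNA sequencing
- Award number: 31802058
- Approval year: 2018
- Amount: CNY 260,000
- Award type: Young Scientists Fund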
Similar overseas grants
Collaborative Research: RI: Small: Foundations of Few-Round Active Learning
- Award number: 2313131
- Fiscal year: 2023
- Amount: $365.2K
- Award type: Standard Grant
Collaborative Research: RI: Small: Deep Constrained Learning for Power Systems
- Award number: 2345528
- Fiscal year: 2023
- Amount: $365.2K
- Award type: Standard Grant
Collaborative Research: RI: Small: Motion Fields Understanding for Enhanced Long-Range Imaging
- Award number: 2232298
- Fiscal year: 2023
- Amount: $365.2K
- Award type: Standard Grant
Collaborative Research: RI: Small: End-to-end Learning of Fair and Explainable Schedules for Court Systems
- Award number: 2232055
- Fiscal year: 2023
- Amount: $365.2K
- Award type: Standard Grant
Collaborative Research: RI: Small: End-to-end Learning of Fair and Explainable Schedules for Court Systems
- Award number: 2232054
- Fiscal year: 2023
- Amount: $365.2K
- Award type: Standard Grant