RI: Small: Toward Efficient and Robust Dynamic Scene Understanding Based on Visual Correspondences
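Basic information
- Award number: 2310254
- Principal investigator:
- Amount: $593,900
- Host institution:
- Host institution country: United States
- Project category: Standard Grant
- Fiscal year: 2023
- Funding country: United States
- Start and end dates: 2023-08-01 to 2026-07-31
- Project status: Ongoing
- Source:
- Keywords: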
Project abstract
Finding correspondences is a fundamental problem in computer vision; visual correspondences provide useful cues for a machine to understand its dynamic surroundings in a manner similar to what humans do. For instance, as an agent moves around, it may learn that objects that are far away like mountains typically do not move much, whereas nearby buildings and bushes appear to move rapidly in the environments as the agent changes position relative to them. Although significant advances have been made in solving various forms of visual correspondence problems, different correspondence models maintain different designs despite their inherent similarity, making the effective design principles and the learned representations difficult to transfer from one problem to another. In response to this challenge, this project aims to solve disparate visual correspondence problems with a unified model. In doing so, the project will also address two practical aspects of implementation of the developed models in scenarios with diverse visual appearance and significant resource constraints. These advances are expected to unlock novel applications and improve dynamic scene understanding in the areas of Augmented Reality, sports broadcasting, sports analytics, robotics, etc. The project outcomes may also unveil new markets and economic opportunities through solutions that augment cognitive and physical abilities of users in their daily lives. The team of researchers will actively integrate proposed research into the curriculum development and attract undergraduate researchers to the project. This project is particularly well-suited for outreach activities to broaden participation of underrepresented and K-12 students, by connecting abstract technical concepts with tangible research demonstrations.
The project has three tightly connected thrusts, presenting fundamental advances in correspondence determination, in applications of these correspondences, and in making these algorithms efficient and robust in deployment. Concretely, first, a unified model to solve all the visual correspondence problems, ranging from 2D to 3D, will be developed, taking advantage of recent progress of the Transformer model and self-supervised learning from large-scale unlabeled data. The Transformer model naturally captures the correspondences of candidates with less inductive bias, making it a better choice to learn from the large-scale data and improve accuracy of data-poor domains when transferred from data-rich ones. Second, with the correspondences, novel applications will be unlocked to advance dynamic scene understanding, particularly for slow-motion video synthesis and robotic obstacle avoidance. Finally, the investigators will study mechanisms to improve efficiency and robustness when deploying the models on edge computing devices. The developed algorithms will be rigorously evaluated on standard benchmarks and in real-world deployment on edge devices.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Huaizu Jiang
Half&Half: New Tasks and Benchmarks for Studying Visual Common Sense
- DOI:
- Publication date: 2024-09-14
- Journal:
- Impact factor: 0
- Authors: Ashish Singh; Hang Su; SouYoung Jin; Huaizu Jiang; Chetan Manjesh; Geng Luo; Ziwei He; Li Hong; E. Learned; Rosie Cowell
- Corresponding author: Rosie Cowell
In Defense of Grid Features for Visual Question Answering
- DOI: 10.1109/cvpr42600.2020.01028
- Publication date: 2020-01-10
- Journal:
- Impact factor: 0
- Authors: Huaizu Jiang; Ishan Misra; Marcus Rohrbach; E. Learned; Xinlei Chen
- Corresponding author: Xinlei Chen
Online Multi-Target Tracking with Unified Handling of Complex Scenarios
- DOI:
- Publication date: 2015
- Journal:
- Impact factor: 10.6
- Authors: Huaizu Jiang; Jinjun Wang; Yihong Gong; Na Rong; Zhenhua Chai; Nanning Zheng
- Corresponding author: Nanning Zheng
Supplementary Material: In Defense of Grid Features for Visual Question Answering
- DOI: 10.1007/s10456-020-09746-6
- Publication date: 2024-09-13
- Journal:
- Impact factor: 9.8
- Authors: Huaizu Jiang; Ishan Misra; Marcus Rohrbach; E. Learned; Xinlei Chen
- Corresponding author: Xinlei Chen
Probabilistic salient object contour detection based on superpixels
- DOI: 10.1109/icip.2013.6738632
- Publication date: 2013-09-01
- Journal:
- Impact factor: 0
- Authors: Huaizu Jiang; Yang Wu; Zejian Yuan
- Corresponding author: Zejian Yuan
Similar NSFC (National Natural Science Foundation of China) grants
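Regulatory role and mechanism of ALKBH5-mediated SOCS3 m6A demethylation in the inflammatory activation of microglia after traumatic brain injury
- Award number: 82301557
- Approval year: 2023
- Funding amount: CNY 300,000
- Project category: Young Scientists Fund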
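Functional study of miPEP, a small peptide encoded by miRNA precursors, in grapevine resistance to low-temperature stress
- Award number:
- Approval year: 2023
- Funding amount: CNY 500,000
- Project category: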
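Mechanism by which PKM2 SUMOylation regulates the drug-resistance niche mediated by non-small cell lung cancer-initiating cells
- Award number: 82372852
- Approval year: 2023
- Funding amount: CNY 490,000
- Project category: General Program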
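Translatomics-based study of the mechanism by which PELRM, a peptide encoded by lncRNA H19, promotes microglial activation and mediates the relief of postoperative knee pain by contralateral (Juci) electroacupuncture
- Award number: 82305399
- Approval year: 2023
- Funding amount: CNY 300,000
- Project category: Young Scientists Fund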
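Role and mechanism of CLDN6-high tumor cell subpopulations in the development of resistance to ICB therapy in non-small cell lung cancer
- Award number: 82373364
- Approval year: 2023
- Funding amount: CNY 490,000
- Project category: General Program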
Similar overseas grants
SaTC: CORE: Small: Toward Privacy Equity through Contextual Understanding of Self-Disclosure
- Award number: 2247723
- Fiscal year: 2023
- Funding amount: $593,900
- Project category: Standard Grant
CIF: Small: Toward a Modern Theory of Compression: Manifold Sources and Learned Compressors
- Award number: 2306278
- Fiscal year: 2023
- Funding amount: $593,900
- Project category: Standard Grant
AF: Small: RUI: Toward High-Performance Block Krylov Subspace Algorithms for Solving Large-Scale Linear Systems
- Award number: 2327619
- Fiscal year: 2023
- Funding amount: $593,900
- Project category: Standard Grant
Toward Clinical Trial: AXL-STAT3 Targeting of Lung Tumor Microenvironments
- Award number: 10660429
- Fiscal year: 2023
- Funding amount: $593,900
- Project category: