RI: Small: Toward Efficient and Robust Dynamic Scene Understanding Based on Visual Correspondences

RI:小:基于视觉对应的高效、鲁棒的动态场景理解

基本信息

  • 批准号:
    2310254
  • 负责人:
  • 金额:
    $ 59.39万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2023
  • 资助国家:
    美国
  • 起止时间:
    2023-08-01 至 2026-07-31
  • 项目状态:
    未结题

项目摘要

Finding correspondences is a fundamental problem in computer vision; visual correspondences provide useful cues for a machine to understand its dynamic surroundings in a manner similar to what humans do. For instance, as an agent moves around, it may learn that objects that are far away like mountains typically do not move much, whereas nearby buildings and bushes appear to move rapidly in the environments as the agent changes position relative to them. Although significant advances have been made in solving various forms of visual correspondence problems, different correspondence models maintain different designs despite their inherent similarity, making the effective design principles and the learned representations difficult to transfer from one problem to another. In response to this challenge, this project aims to solve disparate visual correspondence problems with a unified model. In doing so, the project will also address two practical aspects of implementation of the developed models in scenarios with diverse visual appearance and significant resource constraints. These advances are expected to unlock novel applications and improve dynamic scene understanding in the areas of Augmented Reality, sports broadcasting, sports analytics, robotics, etc. The project outcomes may also unveil new markets and economic opportunities through solutions that augment cognitive and physical abilities of users in their daily lives. The team of researchers will actively integrate proposed research into the curriculum development and attract undergraduate researchers to the project. This project is particularly well-suited for outreach activities to broaden participation of underrepresented and K-12 students, by connecting abstract technical concepts with tangible research demonstrations.The project has three tightly connected thrusts, presenting fundamental advances in correspondence determination, in applications of these correspondences, and in making these algorithms efficient and robust in deployment. Concretely, first, a unified model to solve all the visual correspondence problems, ranging from 2D to 3D, will be developed, taking advantage of recent progress of the Transformer model and self-supervised learning from large-scale unlabeled data. The Transformer model naturally captures the correspondences of candidates with less inductive bias, making it a better choice to learn from the large-scale data and improve accuracy of data-poor domains when transferred from data-rich ones. Second, with the correspondences, novel applications will be unlocked to advance dynamic scene understanding, particularly for slow-motion video synthesis and robotic obstacle avoidance. Finally, the investigators will study mechanisms to improve efficiency and robustness when deploying the models on edge computing devices. The developed algorithms will be rigorously evaluated on standard benchmarks and in real-world deployment on edge devices.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
寻找对应关系是计算机视觉中的一个基本问题;视觉对应为机器以类似于人类的方式理解其动态环境提供了有用的线索。例如,当智能体四处移动时,它可能会了解到像山脉这样遥远的物体通常不会移动太多,而当智能体相对于它们改变位置时,附近的建筑物和灌木丛似乎在环境中快速移动。尽管在解决各种形式的视觉对应问题方面已经取得了重大进展,但不同的对应模型尽管具有固有的相似性,但仍保持着不同的设计,使得有效的设计原则和学习到的表示难以从一个问题转移到另一个问题。为了应对这一挑战,该项目旨在通过统一的模型解决不同的视觉对应问题。在此过程中,该项目还将解决在具有不同视觉外观和严重资源限制的场景中实施所开发模型的两个实际问题。这些进步预计将在增强现实、体育广播、体育分析、机器人等领域解锁新的应用并提高动态场景理解。项目成果还可能通过增强认知和身体能力的解决方案揭示新的市场和经济机会。用户在日常生活中。研究人员团队将积极将拟议的研究融入课程开发中,并吸引本科生研究人员参与该项目。该项目特别适合开展外展活动,通过将抽象的技术概念与具体的研究演示联系起来,扩大代表性不足的学生和 K-12 学生的参与。该项目具有三个紧密相连的主旨,展示了对应确定方面的根本性进展,以及这些方面的应用。通信,并使这些算法在部署中高效且稳健。具体而言,首先,将利用 Transformer 模型的最新进展和大规模无标签数据的自监督学习,开发一个解决从 2D 到 3D 的所有视觉对应问题的统一模型。 Transformer 模型自然地捕获了具有较少归纳偏差的候选者的对应关系,使其成为从大规模数据中学习并在从数据丰富的域转移时提高数据贫乏域的准确性的更好选择。其次,通过这些对应关系,将解锁新颖的应用程序以促进动态场景理解,特别是慢动作视频合成和机器人避障。最后,研究人员将研究在边缘计算设备上部署模型时提高效率和鲁棒性的机制。开发的算法将根据标准基准和边缘设备的实际部署进行严格评估。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力优点和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Huaizu Jiang其他文献

HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
HouseCrafter:使用 2D 扩散模型将平面图提升为 3D 场景
  • DOI:
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Hieu T. Nguyen;Yiwen Chen;Vikram S. Voleti;Varun Jampani;Huaizu Jiang
  • 通讯作者:
    Huaizu Jiang
Weakly Supervised Learning for Salient Object Detection using Background Images
  • DOI:
  • 发表时间:
    2015-01
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Huaizu Jiang
  • 通讯作者:
    Huaizu Jiang
Weakly Supervised Learning for Salient Object Detection
Self-Supervised Depth Learning Improves Semantic Segmentation
自监督深度学习改善语义分割
  • DOI:
  • 发表时间:
    2017
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Huaizu Jiang
  • 通讯作者:
    Huaizu Jiang

Huaizu Jiang的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

相似国自然基金

单细胞分辨率下的石杉碱甲介导小胶质细胞极化表型抗缺血性脑卒中的机制研究
  • 批准号:
    82304883
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
小分子无半胱氨酸蛋白调控生防真菌杀虫活性的作用与机理
  • 批准号:
    32372613
  • 批准年份:
    2023
  • 资助金额:
    50 万元
  • 项目类别:
    面上项目
诊疗一体化PS-Hc@MB协同训练介导脑小血管病康复的作用及机制研究
  • 批准号:
    82372561
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目
非小细胞肺癌MECOM/HBB通路介导血红素代谢异常并抑制肿瘤起始细胞铁死亡的机制研究
  • 批准号:
    82373082
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目
FATP2/HILPDA/SLC7A11轴介导肿瘤相关中性粒细胞脂代谢重编程影响非小细胞肺癌放疗免疫的作用和机制研究
  • 批准号:
    82373304
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目

相似海外基金

CNS Core: Small: Toward Opportunistic, Fast, and Robust In-Cache AI Acceleration at the Edge
CNS 核心:小型:在边缘实现机会主义、快速且稳健的缓存内 AI 加速
  • 批准号:
    2228028
  • 财政年份:
    2023
  • 资助金额:
    $ 59.39万
  • 项目类别:
    Standard Grant
Toward Clinical Trial: AXL-STAT3 Targeting of Lung Tumor Microenvironments
走向临床试验:AXL-STAT3 靶向肺肿瘤微环境
  • 批准号:
    10660429
  • 财政年份:
    2023
  • 资助金额:
    $ 59.39万
  • 项目类别:
Toward synthetic chemically defined mRNA for human therapeutics
用于人类治疗的合成化学定义的 mRNA
  • 批准号:
    10649299
  • 财政年份:
    2023
  • 资助金额:
    $ 59.39万
  • 项目类别:
HCC: Small: Toward Computational Modeling of Autism Spectrum Disorder: Multimodal Data Collection, Fusion, and Phenotyping
HCC:小型:自闭症谱系障碍的计算模型:多模式数据收集、融合和表型分析
  • 批准号:
    2401748
  • 财政年份:
    2023
  • 资助金额:
    $ 59.39万
  • 项目类别:
    Standard Grant
Thirty Years of Retailing and Distribution System in Japan and the UK ; Toward the International Dissemination of Common Research Methods and the Construction of Relevant Theories
日本和英国三十年的零售和分销系统;
  • 批准号:
    23H00861
  • 财政年份:
    2023
  • 资助金额:
    $ 59.39万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了