III: Small: Collaborative Research: Algorithms, systems, and theories for exploiting data dependencies in crowdsourcing
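Basic information
- Award number: 2007941
- Principal investigator:
- Amount: $250,000
- Host institution:
- Host institution country: United States
- Project type: Standard Grant
- Fiscal year: 2020
- Funding country: United States
- Duration: 2020-10-01 to 2024-09-30
- Project status: Completed
- Source:
- Keywords:
Project abstract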
Data that encode knowledge are abundantly available in many domains, such as biomedical research, online commerce, open government, education, and public health. Machine learning is a powerful tool for discovering novel knowledge from data and for helping individuals and organizations make informed decisions. However, machine learning needs to be bootstrapped by human-annotated knowledge, which can be expensive to obtain and may contain human errors. The team of researchers discovers and exploits the dependencies in the data, using novel methodologies to significantly reduce cost and noise when providing critical knowledge for machine learning. The research outputs, including algorithms, systems, and theories, are sufficiently generic to benefit many domains where machine learning is applicable. By conducting this fundamental research, the team will train undergraduate and graduate students for the nation's STEM workforce.
The researchers will collaborate to develop algorithms, systems, and theories for reducing cost and noise when annotating dependent data, termed "structured annotations", to provide supervision knowledge for machine learning. While the dependencies can make data annotation costly and error-prone, the researchers view the dependencies as a useful inductive bias for selective and accurate annotation. In particular, the research team proposes a human-in-the-loop system to aid the construction of proper probabilistic graphical models that encode the dependencies. The project team combines contextual and multi-armed bandits with scalable graph inference algorithms to reduce labeling costs. Based on the graphical bandits, the team addresses budget allocation when the label of the same data point is queried repeatedly for robustness. With noisy human annotations, the team formulates optimization problems and algorithms to jointly infer the annotator competences and the ground truth labels of the data. From the theoretical perspective, the project will advance active learning in crowdsourcing settings with more realistic noise distributions and will analyze regret in structured annotations. The project will result in datasets, algorithms, and a testbed system that benefit not only the core machine learning research community but also many domains that use machine learning.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
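The idea of jointly inferring annotator competences and ground truth labels can be illustrated with a minimal sketch. This is not the project's actual formulation, which the abstract does not specify; it is a generic iterative weighted-voting scheme, assuming independent categorical labels and ignoring data dependencies. The function name `infer_labels`, the smoothing constants, and the toy data are all hypothetical.

```python
from collections import defaultdict


def infer_labels(annotations, n_iter=10):
    """Alternate between estimating labels and annotator competence.

    annotations: list of (item, annotator, label) triples.
    Returns (labels, weights): the estimated label per item and the
    estimated competence (smoothed agreement rate) per annotator.
    """
    annotators = {a for _, a, _ in annotations}
    weights = {a: 1.0 for a in annotators}  # assumption: start with equal trust
    labels = {}
    for _ in range(n_iter):
        # Step 1: estimate each item's label by a competence-weighted vote.
        votes = defaultdict(lambda: defaultdict(float))
        for item, annotator, label in annotations:
            votes[item][label] += weights[annotator]
        # Sorting the candidate labels makes tie-breaking deterministic.
        labels = {
            item: max(sorted(tally), key=tally.get)
            for item, tally in votes.items()
        }
        # Step 2: re-estimate competence as the smoothed fraction of an
        # annotator's labels that agree with the current estimates.
        agree = defaultdict(int)
        total = defaultdict(int)
        for item, annotator, label in annotations:
            total[annotator] += 1
            agree[annotator] += int(labels[item] == label)
        weights = {a: (agree[a] + 1) / (total[a] + 2) for a in annotators}
    return labels, weights


# Toy data (hypothetical): a1 and a2 are mostly consistent, a3 is noisy.
annotations = [
    ("i1", "a1", "cat"), ("i1", "a2", "cat"), ("i1", "a3", "dog"),
    ("i2", "a1", "dog"), ("i2", "a2", "dog"), ("i2", "a3", "dog"),
    ("i3", "a1", "cat"), ("i3", "a2", "cat"), ("i3", "a3", "dog"),
    ("i4", "a1", "dog"), ("i4", "a3", "cat"),
]
labels, weights = infer_labels(annotations)
print(labels)
print(weights)
```

In this toy run, item `i4` starts as a tie between two annotators and is resolved once the noisy annotator's weight drops, which is the basic reason for estimating competence and labels together rather than taking a plain majority vote.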
Project outcomes
Journal papers (9)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
SagDRE: Sequence-Aware Graph-Based Document-Level Relation Extraction with Adaptive Margin Loss
- DOI: 10.1145/3534678.3539304
- Published: 2022-08
- Journal:
- Impact factor: 0
- Authors: Ying Wei; Qi Li
- Corresponding authors: Ying Wei; Qi Li
Relation-Aware Network with Attention-Based Loss for Few-Shot Knowledge Graph Completion
- DOI: 10.1007/978-3-031-33380-4_8
- Published: 2023-06
- Journal:
- Impact factor: 0
- Authors: Qiao Qiao-Qiao; Yuepei Li; Kang Zhou; Qi Li
- Corresponding authors: Qiao Qiao-Qiao; Yuepei Li; Kang Zhou; Qi Li
CPTAM: Constituency Parse Tree Aggregation Method
- DOI: 10.1137/1.9781611977172.71
- Published: 2022
- Journal:
- Impact factor: 0
- Authors: Kulkarni, Adithya; Sabetpour, Nasim; Markin, Alexey; Eulenstein, Oliver; Li, Qi
- Corresponding author: Li, Qi
Data Poisoning Attacks and Defenses to Crowdsourcing Systems
- DOI: 10.1145/3442381.3450066
- Published: 2021-02
- Journal:
- Impact factor: 0
- Authors: Minghong Fang; Minghao Sun; Qi Li; N. Gong; Jinhua Tian; Jia Liu
- Corresponding authors: Minghong Fang; Minghao Sun; Qi Li; N. Gong; Jinhua Tian; Jia Liu
Open-Domain Aspect-Opinion Co-Mining with Double-Layer Span Extraction
- DOI: 10.1145/3534678.3539386
- Published: 2022-08
- Journal:
- Impact factor: 0
- Authors: Mohna Chakraborty; Adithya Kulkarni; Qi Li
- Corresponding authors: Mohna Chakraborty; Adithya Kulkarni; Qi Li
Other publications by Qi Li
Competing orders in s-wave and p-wave superconductors
- DOI:
- Published: 2008
- Journal:
- Impact factor: 0
- Authors: Qi Li
- Corresponding author: Qi Li
Improving the representation of convective heat transfer in an urban canopy model
- DOI:
- Published: 2018
- Journal:
- Impact factor: 0
- Authors: Qi Li; Jiachuan Yang; Zhi; E. Bou‐Zeid
- Corresponding author: E. Bou‐Zeid
Delay Allograft Rejection in Lymphopenic Hosts by Downregulating the Homeostatic Proliferation of CD8+ T Cells
- DOI:
- Published: 2010
- Journal:
- Impact factor: 0
- Authors: D. Zecher; Qi Li; M. Oberbarnscheidt; A. Demetris; W. Shlomchik; D. Rothstein; Fadi G Lakkis
- Corresponding author: Fadi G Lakkis
Try before You Buy: Privacy-preserving Data Evaluation on Cloud-based Machine Learning Data Marketplace
- DOI:
- Published: 2021
- Journal:
- Impact factor: 0
- Authors: Qiyang Song; Jiahao Cao; Kun Sun; Qi Li; Ke Xu
- Corresponding author: Ke Xu
Microstrip-to-Waveguide Transition with Bias Tee for Photodiode Packaging at W-Band
- DOI:
- Published: 2022
- Journal:
- Impact factor: 0
- Authors: C. Nie; Qi Li; Yuan Yao; Ting Zhang; Yaohui Yang
- Corresponding author: Yaohui Yang
Other grants by Qi Li
AccelNet-Design: A Global Network of Networks of Integrated Urban Services (GNNIUS) for Healthy and Smart Cities
- Award number: 2301858
- Fiscal year: 2023
- Amount: $250,000
- Project type: Standard Grant
CAREER: Achieving Quality Information Extraction from Scientific Documents with Heterogeneous Weak Supervisions
- Award number: 2237831
- Fiscal year: 2023
- Amount: $250,000
- Project type: Standard Grant
CAREER: Multi-Scalar Transport and Similarity in the Urban Boundary Layer
- Award number: 2143664
- Fiscal year: 2022
- Amount: $250,000
- Project type: Continuing Grant
Collaborative Research: Geoengineering of Urban Green Infrastructure to Improve Outdoor Livability
- Award number: 2028842
- Fiscal year: 2020
- Amount: $250,000
- Project type: Standard Grant
Collaborative Research: CAS-MNP--Precursors of Long-Distance Aerial Transport of Microplastics from Urban Environments
- Award number: 2028644
- Fiscal year: 2020
- Amount: $250,000
- Project type: Standard Grant
Design and Characterization of Two-Dimensional Electron Gas with Strong Spin-Orbit Coupling Based on Transition Metal Oxides
- Award number: 1905833
- Fiscal year: 2019
- Amount: $250,000
- Project type: Standard Grant
Multiferroic Tunnel Junction with Active Dual Layer Barrier
- Award number: 1411166
- Fiscal year: 2014
- Amount: $250,000
- Project type: Standard Grant
Interfacial Electromagnetic Coupling in Multiferroic Tunnel Junctions
- Award number: 1207474
- Fiscal year: 2012
- Amount: $250,000
- Project type: Continuing Grant
III: Small: An Automatic Framework for Processing Drosophila Embryonic Images
- Award number: 1016668
- Fiscal year: 2010
- Amount: $250,000
- Project type: Standard Grant
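Similar NSFC grants
Mechanism of huperzine A-mediated microglial polarization phenotypes against ischemic stroke at single-cell resolution
- Award number: 82304883
- Approval year: 2023
- Amount: CNY 300,000
- Project type: Young Scientists Fund
Role and mechanism of small cysteine-free proteins in regulating the insecticidal activity of biocontrol fungi
- Award number: 32372613
- Approval year: 2023
- Amount: CNY 500,000
- Project type: General Program
Role and mechanism of theranostic PS-Hc@MB combined with training in mediating the rehabilitation of cerebral small vessel disease
- Award number: 82372561
- Approval year: 2023
- Amount: CNY 490,000
- Project type: General Program
Mechanism by which the MECOM/HBB pathway in non-small cell lung cancer mediates abnormal heme metabolism and inhibits ferroptosis of tumor-initiating cells
- Award number: 82373082
- Approval year: 2023
- Amount: CNY 490,000
- Project type: General Program
Role and mechanism of the FATP2/HILPDA/SLC7A11 axis in mediating lipid metabolic reprogramming of tumor-associated neutrophils and its effect on radiotherapy immunity in non-small cell lung cancer
- Award number: 82373304
- Approval year: 2023
- Amount: CNY 490,000
- Project type: General Program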
Similar overseas grants
Collaborative Research: III: Small: High-Performance Scheduling for Modern Database Systems
- Award number: 2322973
- Fiscal year: 2024
- Amount: $250,000
- Project type: Standard Grant
Collaborative Research: III: Small: High-Performance Scheduling for Modern Database Systems
- Award number: 2322974
- Fiscal year: 2024
- Amount: $250,000
- Project type: Standard Grant
Collaborative Research: III: Small: A DREAM Proactive Conversational System
- Award number: 2336769
- Fiscal year: 2024
- Amount: $250,000
- Project type: Standard Grant
Collaborative Research: III: Small: A DREAM Proactive Conversational System
- Award number: 2336768
- Fiscal year: 2024
- Amount: $250,000
- Project type: Standard Grant
III: Small: Multiple Device Collaborative Learning in Real Heterogeneous and Dynamic Environments
- Award number: 2311990
- Fiscal year: 2023
- Amount: $250,000
- Project type: Standard Grant