Collaborative Research: CIF: Small: Sequential Decision Making Under Uncertainty With Submodular Rewards
合作研究:CIF:小:不确定性下的顺序决策与子模奖励
基本信息
- 批准号:2149617
- 负责人:
- 金额:$ 25万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-03-01 至 2025-02-28
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
Many companies, government agencies, and individuals make sequences of challenging decisions over time, for which they must choose from among many possible options, may have limited knowledge about the outcomes of their decisions, and will receive limited feedback. For example, search engines and content providers make decisions for what sets of websites, products, or media to recommend each time a user logs on to their system or submits a query, in some cases having limited knowledge of the users’ underlying preferences. If users' privacy is protected, then only users' past actions, such as which links or media were selected by earlier users, will be available as feedback to inform the search engine or content provider on what to recommend next. This project aims to develop provably good strategies that decision makers can use in such settings, aiding their decision making under uncertainty and with limited feedback. This project will also develop strategies for the more challenging setting where multiple decision makers must coordinate with each other on such problems, but have limited communication available to do so. Furthermore, this project will support undergraduate and graduate research training, as well as graduate-level course development, in machine learning and artificial intelligence, preparing students for careers in advanced technical fields.The goal of this project is to develop novel, provably good strategies for solving sequential decision problems (multi-armed bandit problems) when the actions available have a combinatorial structure (such as choosing subsets of products to recommend), the rewards have a diminishing returns property (submodularity), and there is no side-information available -- the only feedback comes from the reward itself. The proposed work builds on the rich literature of multi-armed bandits and of submodular optimization. The technical aims of the project are divided into two thrusts. The first thrust focuses on developing algorithms and identifying their regret bounds for combinatorial multi-armed bandit problems with submodular rewards and no additional feedback. The second thrust extends those strategies and regret analyses to a decentralized setting, where multiple agents coordinate to solve combinatorial multi-armed bandit problems, despite limited resources for communication.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
随着时间的推移,许多公司、政府机构和个人会做出一系列具有挑战性的决策,他们必须从许多可能的选项中进行选择,他们对其决策结果的了解可能有限,并且收到的反馈也有限。内容提供商在用户每次登录其系统或提交查询时决定推荐哪些网站、产品或媒体,在某些情况下对用户的潜在偏好了解有限。那么只有用户过去的操作,例如选择了哪些链接或媒体早期用户提供的信息将作为反馈,告知搜索引擎或内容提供商接下来要推荐什么。该项目旨在开发决策者可以在此类环境中使用的可证明的良好策略,帮助他们在不确定性和反馈有限的情况下做出决策。该项目还将针对更具挑战性的环境制定策略,在这种情况下,多个决策者必须在此类问题上相互协调,但可用的沟通有限。此外,该项目还将支持本科生和研究生以及研究生的研究培训。机器学习和人工智能方面的水平课程开发,为学生做好准备该项目的目标是开发新颖的、可证明良好的策略来解决顺序决策问题(多臂老虎机问题),当可用的操作具有组合结构(例如选择要推荐的产品子集)时,奖励具有递减收益属性(子模块性),并且没有可用的辅助信息——唯一的反馈来自奖励本身。所提出的工作建立在多臂老虎机和子模块优化目标的丰富文献之上。该项目分为两个重点。第一个重点是开发算法并确定具有子模块奖励的组合多臂强盗问题的遗憾界限,第二个重点将这些策略和遗憾分析扩展到分散的环境中。尽管沟通资源有限,但多个代理协调解决组合多臂老虎机问题。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力优点和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback
全老虎机反馈下非单调随机子模最大化的随机贪婪学习
- DOI:
- 发表时间:2023-04
- 期刊:
- 影响因子:0
- 作者:Fourati, Fares;Aggarwal, Vaneet;Quinn, Christopher John;Alouini, Mohamed
- 通讯作者:Alouini, Mohamed
Fractional Budget Allocation for Influence Maximization
影响力最大化的部分预算分配
- DOI:10.1109/cdc49753.2023.10384250
- 发表时间:2023-12-13
- 期刊:
- 影响因子:0
- 作者:A. Umrawal;Vaneet Aggarwal;Christopher J. Quinn
- 通讯作者:Christopher J. Quinn
An explore-then-commit algorithm for submodular maximization under full-bandit feedback
全强盗反馈下子模最大化的探索然后提交算法
- DOI:
- 发表时间:2024-09-13
- 期刊:
- 影响因子:0
- 作者:G. Nie;Mridul Agarwal;A. Umrawal;V. Aggarwal;Christopher J. Quinn
- 通讯作者:Christopher J. Quinn
Size-constrained k-submodular maximization in near-linear time
近线性时间内尺寸约束的 k 子模最大化
- DOI:
- 发表时间:2023-07
- 期刊:
- 影响因子:0
- 作者:Nie, Guanyu;Zhu, Yanhui;Nadew, Yididiya Y.;Basu, Samik;Pavan, A.;Quinn, Christopher John
- 通讯作者:Quinn, Christopher John
A Community-Aware Framework for Social Influence Maximization
社会影响力最大化的社区意识框架
- DOI:10.1109/tetci.2023.3251362
- 发表时间:2022-07-18
- 期刊:
- 影响因子:5.3
- 作者:A. Umrawal;Christopher J. Quinn;V. Aggarwal
- 通讯作者:V. Aggarwal
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Christopher Quinn其他文献
Body image self-consciousness, sexting, and sexual satisfaction among midlife Canadians
加拿大中年人的身体形象自我意识、色情短信和性满意度
- DOI:
- 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Tasha A. Falconer;Christopher Quinn;R. Milhausen - 通讯作者:
R. Milhausen
Sameness and Difference in Psychological Research on Consensually Non-Monogamous Relationships: The Need for Invariance and Equivalence Testing
双方同意的非一夫一妻关系心理学研究的相同性和差异:不变性和等价性测试的需要
- DOI:
10.1007/s10508-020-01794-9 - 发表时间:
2020-08-28 - 期刊:
- 影响因子:3.8
- 作者:
J. Sakaluk;Christopher Quinn;A. Fisher;Connor E. Leshner;Ella Huber;J. Wood - 通讯作者:
J. Wood
Redundancy and Concept Analysis for Code-trained Language Models
- DOI:
- 发表时间:
2023-05-01 - 期刊:
- 影响因子:0
- 作者:
Arushi Sharma;Zefu Hu;Christopher Quinn;A. Jannesari - 通讯作者:
A. Jannesari
Sounding the Alarm: What Clinicians Need to Know about Physical, Emotional, and Cognitive Recovery After Venoarterial Extracorporeal Membrane Oxygenation*
敲响警钟:临床医生需要了解静脉动脉体外膜氧合后身体、情绪和认知恢复的知识*
- DOI:
10.1097/ccm.0000000000005900 - 发表时间:
2023-05-01 - 期刊:
- 影响因子:8.8
- 作者:
Kelly Higa;K. Mayer;Christopher Quinn;Lindsey E Jubina;A. Suarez;K. Colborn;S. Jolley;K. Enfield;J. Zwischenberger;C. Sevin;Jessica Y. Rove - 通讯作者:
Jessica Y. Rove
A Thematic Analysis of Men’s Sexual Compliance With Unwanted, Non-Coercive Sex
男性对不想要的非强迫性行为的性依从性的主题分析
- DOI:
- 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Christopher Quinn;Melissa K. Goncalves;D. Kennett;Ashley C. Grant - 通讯作者:
Ashley C. Grant
Christopher Quinn的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Christopher Quinn', 18)}}的其他基金
CRII: RI: Efficient Structure Learning and Approximation of Networks of Causally Interacting Processes
CRII:RI:因果交互过程网络的有效结构学习和逼近
- 批准号:
1566513 - 财政年份:2016
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
相似国自然基金
基于肿瘤病理图片的靶向药物敏感生物标志物识别及统计算法的研究
- 批准号:82304250
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
肠道普拉梭菌代谢物丁酸抑制心室肌铁死亡改善老龄性心功能不全的机制研究
- 批准号:82300430
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
社会网络关系对公司现金持有决策影响——基于共御风险的作用机制研究
- 批准号:72302067
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
面向图像目标检测的新型弱监督学习方法研究
- 批准号:62371157
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
面向开放域对话系统信息获取的准确性研究
- 批准号:62376067
- 批准年份:2023
- 资助金额:51 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: NSF-AoF: CIF: Small: AI-assisted Waveform and Beamforming Design for Integrated Sensing and Communication
合作研究:NSF-AoF:CIF:小型:用于集成传感和通信的人工智能辅助波形和波束成形设计
- 批准号:
2326622 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: NSF-AoF: CIF: Small: AI-assisted Waveform and Beamforming Design for Integrated Sensing and Communication
合作研究:NSF-AoF:CIF:小型:用于集成传感和通信的人工智能辅助波形和波束成形设计
- 批准号:
2326621 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: CIF: Small: Mathematical and Algorithmic Foundations of Multi-Task Learning
协作研究:CIF:小型:多任务学习的数学和算法基础
- 批准号:
2343600 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: CIF: Medium: Snapshot Computational Imaging with Metaoptics
合作研究:CIF:Medium:Metaoptics 快照计算成像
- 批准号:
2403123 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: NSF-AoF: CIF: Small: AI-assisted Waveform and Beamforming Design for Integrated Sensing and Communication
合作研究:NSF-AoF:CIF:小型:用于集成传感和通信的人工智能辅助波形和波束成形设计
- 批准号:
2326622 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant