Collaborative Research: SaTC: CORE: Medium: PREMED: Privacy-Preserving and Robust Computational Phenotyping using Multisite EHR Data

合作研究:SaTC:核心:中:PREMED:使用多站点 EHR 数据的隐私保护和鲁棒计算表型分析

基本信息

  • 批准号:
    2124104
  • 负责人:
  • 金额:
    $ 90万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2021
  • 资助国家:
    美国
  • 起止时间:
    2021-10-01 至 2025-09-30
  • 项目状态:
    未结题

项目摘要

Tensor analysis offers an effective approach to convert massive Electronic Health Records (EHRs) into meaningful and interpretable clinical concepts, or phenotypes, such as diseases and disease subtypes. It can cluster patients into subgroups and capture the interactions between multiple attributes (e.g., specific procedures used to treat a disease), enabling precision medicine. Effective phenotyping needs to be supported by a large number of diverse samples to avoid potential population bias. A major challenge is how to derive phenotypes jointly across multiple institutions, while preserving individual patients' privacy at each site. The goal of this project is to develop a federated tensor factorization framework for Privacy-preserving, Robust, and Efficient computational phenotyping using Multisite EHR Data (PREMED). While many techniques have been developed for federated learning for each of these goals, their synergy has not been well studied. Communication-efficient techniques such as compression have an intrinsic benefit to privacy (smaller disclosure risks) and robustness (smaller adversarial impact) due to the compressed and obfuscated communication. Further, federated tensor factorization presents unique challenges due to its multi-factor structure and unsupervised nature. The project aims to exploit the synergy between efficiency, privacy, and robustness and address the three interrelated challenges with a holistic approach, while utilizing the multi-factor structure of tensor factorization. The research outcome will allow institutions to jointly perform computational phenotyping using their privacy-protected data effectively and efficiently. This project includes a set of interrelated objectives including: (1) developing communication-efficient techniques for federated tensor factorization such as local Stochastic Gradient Descent (SGD) to reduce communication frequency; and multi-level compression methods to reduce per-round communication leveraging the multi-factor structure of tensor factorization; (2) developing privacy-preserving federated tensor factorization methods by exploiting the intrinsic privacy benefit of the communication-efficient techniques; and privacy-preserving input synthesization methods that offer more versatility; and (3) developing robust statistical aggregation methods for handling potential Byzantine failures and malicious sites by utilizing the intrinsic robustness benefit of the communication-efficient techniques; and robust learning-based aggregation methods for sparse settings based on truth inference and adaptive site valuation approaches. The project includes case studies using real EHR data from Emory and UTHealth for phenotype discovery and phenotype-based predictive studies in the context of Alzheimer's Disease and Sepsis. The project also includes a set of synergistic activities including organization of multi-site computational phenotyping challenges; development of collaborative sidecar courses; and active involvement of undergraduates, women and underrepresented groups.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
张量分析提供了一种有效方法,可将大量电子健康记录 (EHR) 转换为有意义且可解释的临床概念或表型,例如疾病和疾病亚型。它可以将患者分为亚组并捕获多个属性之间的相互作用(例如用于治疗疾病的特定程序),从而实现精准医疗。有效的表型分析需要大量不同样本的支持,以避免潜在的群体偏差。一个主要的挑战是如何跨多个机构联合得出表型,同时保护每个站点的个体患者的隐私。该项目的目标是使用多站点 EHR 数据 (PREMED) 开发一个联合张量分解框架,用于保护隐私、稳健且高效的计算表型。虽然针对这些目标中的每一个目标都开发了许多联邦学习技术,但它们的协同作用尚未得到充分研究。由于压缩和混淆的通信,诸如压缩之类的通信高效技术对隐私(较小的泄露风险)和鲁棒性(较小的对抗性影响)具有内在的好处。此外,联合张量分解由于其多因素结构和无监督性质而提出了独特的挑战。该项目旨在利用效率、隐私和鲁棒性之间的协同作用,并通过整体方法解决三个相互关联的挑战,同时利用张量分解的多因素结构。研究成果将使各机构能够利用其受隐私保护的数据有效且高效地联合执行计算表型分析。该项目包括一系列相互关联的目标,包括:(1)开发用于联合张量分解的通信高效技术,例如局部随机梯度下降(SGD)以降低通信频率;以及多级压缩方法,利用张量分解的多因子结构来减少每轮通信; (2)通过利用通信高效技术的内在隐私优势,开发保护隐私的联合张量分解方法;以及提供更多多功能性的保护隐私的输入合成方法; (3) 利用通信高效技术固有的鲁棒性优势,开发鲁棒的统计聚合方法来处理潜在的拜占庭故障和恶意站点;以及基于真值推理和自适应站点评估方法的稀疏设置的基于学习的鲁棒聚合方法。该项目包括使用埃默里大学和 UTHealth 的真实 EHR 数据进行案例研究,以发现阿尔茨海默病和脓毒症的表型和基于表型的预测研究。该项目还包括一系列协同活动,包括组织多位点计算表型挑战;开发协作边车课程;该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(22)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Echo of Neighbors: Privacy Amplification for Personalized Private Federated Learning with Shuffle Model
邻居的回声:使用 Shuffle 模型实现个性化私人联邦学习的隐私放大
Classification Auto-Encoder Based Detector Against Diverse Data Poisoning Attacks
基于分类自动编码器的检测器抵御各种数据中毒攻击
Efficient Sampling Approaches to Shapley Value Approximation
Shapley 值近似的高效采样方法
Comparison of time series clustering methods for identifying novel subphenotypes of patients with infection
识别感染患者新亚表型的时间序列聚类方法比较
  • DOI:
    10.1093/jamia/ocad063
  • 发表时间:
    2023-04
  • 期刊:
  • 影响因子:
    6.4
  • 作者:
    Bhavani, Sivasubramanium V;Xiong, Li;Pius, Abish;Semler, Matthew;Qian, Edward T;Verhoef, Philip A;Robichaux, Chad;Coopersmith, Craig M;Churpek, Matthew M
  • 通讯作者:
    Churpek, Matthew M
PubMed Author-assigned Keyword Extraction (PubMedAKE) Benchmark
PubMed 作者指定的关键词提取 (PubMedAKE) 基准
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Li Xiong其他文献

Virus evolutionary genetic algorithm for task collaboration of logistics distribution
物流配送任务协同的病毒进化遗传算法
  • DOI:
    10.1117/12.664555
  • 发表时间:
    2005-12-22
  • 期刊:
  • 影响因子:
    5.6
  • 作者:
    Fanghua Ning;Zichen Chen;Li Xiong
  • 通讯作者:
    Li Xiong
Coordinated optimization control strategy of hydropower and thermal power AGC units
水电、火电AGC机组协调优化控制策略
  • DOI:
    10.1016/j.egyr.2023.04.202
  • 发表时间:
    2024-09-14
  • 期刊:
  • 影响因子:
    5.2
  • 作者:
    Zhencheng Liang;Guangzhen Lu;Ling Li;Bin Li;Yixin Zhuo;Yude Yang;Guangming Li;Cuiyun Luo;Yangtian Ning;Li Xiong
  • 通讯作者:
    Li Xiong
Synthesis and evaluation of amide side-chain modified Agomelatine analogues as potential antidepressant-like agents.
酰胺侧链修饰的阿戈美拉汀类似物作为潜在抗抑郁药的合成和评价。
  • DOI:
    10.1016/j.bmcl.2014.02.065
  • 发表时间:
    2014-04-01
  • 期刊:
  • 影响因子:
    2.7
  • 作者:
    Ying Chang;Weiyi Pi;W. Ang;Yuanyuan Liu;Chunlong Li;Jiajia Zheng;Li Xiong;Tao Yang;Youfu Luo
  • 通讯作者:
    Youfu Luo
A New Learning Algorithm for Imbalanced Data—PCBoost
一种针对不平衡数据的新学习算法——PCBoost
Nectin-2 and DDX3 are biomarkers for metastasis and poor prognosis of squamous cell/adenosquamous carcinomas and adenocarcinoma of gallbladder.
Nectin-2和DDX3是鳞状细胞/腺鳞癌和胆囊腺癌转移和不良预后的生物标志物。

Li Xiong的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Li Xiong', 18)}}的其他基金

NSF Student Travel Support for 2022 ACM International Conference on Information and Management (CIKM)
NSF 学生参加 2022 年 ACM 国际信息与管理会议 (CIKM) 的旅行支持
  • 批准号:
    2232829
  • 财政年份:
    2022
  • 资助金额:
    $ 90万
  • 项目类别:
    Standard Grant
SCC-IRG JST: Hyperlocal Risk Monitoring and Pandemic Preparedness through Privacy-Enhanced Mobility and Social Interactions Analysis
SCC-IRG JST:通过隐私增强的移动性和社交互动分析进行超本地风险监控和流行病防范
  • 批准号:
    2125530
  • 财政年份:
    2021
  • 资助金额:
    $ 90万
  • 项目类别:
    Continuing Grant
SCC-PG: JST: Privacy-enhanced data-driven health monitoring for smart and connected senior communities
SCC-PG:JST:针对智能互联老年社区的隐私增强型数据驱动健康监测
  • 批准号:
    1952192
  • 财政年份:
    2020
  • 资助金额:
    $ 90万
  • 项目类别:
    Standard Grant
RAPID: Collaborative: REACT: Real-time Contact Tracing and Risk Monitoring via Privacy-enhanced Mobile Tracking
RAPID:协作:REACT:通过隐私增强型移动跟踪进行实时接触者追踪和风险监控
  • 批准号:
    2027783
  • 财政年份:
    2020
  • 资助金额:
    $ 90万
  • 项目类别:
    Standard Grant
I-Corps: iCloak: Privacy Preserving Individual Location Sharing
I-Corps:iCloak:隐私保护个人位置共享
  • 批准号:
    1619679
  • 财政年份:
    2016
  • 资助金额:
    $ 90万
  • 项目类别:
    Standard Grant
TWC: Small: Rigorous and Customizable Spatiotemporal Privacy for Location Based Applications
TWC:小型:基于位置的应用程序的严格且可定制的时空隐私
  • 批准号:
    1618932
  • 财政年份:
    2016
  • 资助金额:
    $ 90万
  • 项目类别:
    Standard Grant
TC: Small: Adaptive Differentially Private Data Release
TC:小型:自适应差分隐私数据发布
  • 批准号:
    1117763
  • 财政年份:
    2011
  • 资助金额:
    $ 90万
  • 项目类别:
    Standard Grant

相似国自然基金

IGF-1R调控HIF-1α促进Th17细胞分化在甲状腺眼病发病中的机制研究
  • 批准号:
    82301258
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
CTCFL调控IL-10抑制CD4+CTL旁观者激活促口腔鳞状细胞癌新辅助免疫治疗抵抗机制研究
  • 批准号:
    82373325
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目
RNA剪接因子PRPF31突变导致人视网膜色素变性的机制研究
  • 批准号:
    82301216
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
血管内皮细胞通过E2F1/NF-kB/IL-6轴调控巨噬细胞活化在眼眶静脉畸形中的作用及机制研究
  • 批准号:
    82301257
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
基于多元原子间相互作用的铝合金基体团簇调控与强化机制研究
  • 批准号:
    52371115
  • 批准年份:
    2023
  • 资助金额:
    50 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: SaTC: CORE: Medium: Differentially Private SQL with flexible privacy modeling, machine-checked system design, and accuracy optimization
协作研究:SaTC:核心:中:具有灵活隐私建模、机器检查系统设计和准确性优化的差异化私有 SQL
  • 批准号:
    2317232
  • 财政年份:
    2024
  • 资助金额:
    $ 90万
  • 项目类别:
    Continuing Grant
Collaborative Research: NSF-BSF: SaTC: CORE: Small: Detecting malware with machine learning models efficiently and reliably
协作研究:NSF-BSF:SaTC:核心:小型:利用机器学习模型高效可靠地检测恶意软件
  • 批准号:
    2338302
  • 财政年份:
    2024
  • 资助金额:
    $ 90万
  • 项目类别:
    Continuing Grant
Collaborative Research: SaTC: CORE: Medium: Using Intelligent Conversational Agents to Empower Adolescents to be Resilient Against Cybergrooming
合作研究:SaTC:核心:中:使用智能会话代理使青少年能够抵御网络诱骗
  • 批准号:
    2330940
  • 财政年份:
    2024
  • 资助金额:
    $ 90万
  • 项目类别:
    Continuing Grant
Collaborative Research: SaTC: CORE: Medium: Using Intelligent Conversational Agents to Empower Adolescents to be Resilient Against Cybergrooming
合作研究:SaTC:核心:中:使用智能会话代理使青少年能够抵御网络诱骗
  • 批准号:
    2330941
  • 财政年份:
    2024
  • 资助金额:
    $ 90万
  • 项目类别:
    Continuing Grant
Collaborative Research: SaTC: CORE: Medium: Differentially Private SQL with flexible privacy modeling, machine-checked system design, and accuracy optimization
协作研究:SaTC:核心:中:具有灵活隐私建模、机器检查系统设计和准确性优化的差异化私有 SQL
  • 批准号:
    2317233
  • 财政年份:
    2024
  • 资助金额:
    $ 90万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了