Privacy Challenges of Genomic Data-Sharing Beacons and Solutions
基因组数据共享信标和解决方案的隐私挑战
基本信息
- 批准号:10674031
- 负责人:
- 金额:$ 30.19万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-08-01 至 2024-07-31
- 项目状态:已结题
- 来源:
- 关键词:AdoptedAlgorithmsAllelesBehaviorBudgetsCommunitiesDataData SetDevelopmentEffectivenessEnsureEquilibriumExtravasationGame TheoryGenomeGoalsGraphHIVIndividualLarge-Scale SequencingMathematicsMedicalMedicineMethodsModelingParticipantPersonsPhenotypePoliciesPrivacyProcessPropertyProtocols documentationResearchResearch ProposalsRiskRisk AssessmentRunningSecureSingle Nucleotide PolymorphismSourceStandardizationSystemTechniquesTimeUncertaintyUpdateWorkdata sharingdesigngenomic datanovelpreservationprivacy preservationprivacy protectionreconstructionresponserisk sharingsimulationtooltrustworthinessusability
项目摘要
Abstract. Availability of very large genomic datasets promises a revolution in medicine. However, it
has been shown that it is not straightforward to ensure anonymity of the participants in such datasets.
Sharing data in a privacy-preserving way stands as a major bottleneck in front of the medical progress.
Recently, a community-driven protocol has been widely adopted for sharing genomic data. So called
“genomic data-sharing beacon protocol” aims to provide a secure, easy to implement, and
standardized interface for data sharing by only allowing yes/no queries on the presence of specific
alleles in the dataset. Previously deemed robust against privacy threats, beacon protocol was recently
shown to be vulnerable against membership inference attacks despite its stringent policy. Currently,
there is no way to systematically assess beacons' privacy risks for neither the genome donors nor the
beacon operators. This cast doubts on usability of beacons from both parties' point of views. Setting
up a beacon is risky for beacon operators because of repercussions of possible breaches.
Furthermore, for the donors who lack technical background to comprehend the risk, it is often safer to
opt-out. Thus, a comprehensive understanding of the system's pitfalls and briefing the genome donors
and the beacon operators on potential threats are important issues to overcome to move forward. In
this proposal, we aim at (i) detecting and analyzing vulnerabilities of the genomic data-sharing
beacons, (ii) providing risk quantification tools for both the donors and data owners to inform both
parties on possible risks, and (iii) generating countermeasures against these vulnerabilities. We
provide extensive preliminary work on possible vulnerabilities of the beacon system and potential
countermeasures. For the first time, we will investigate the information leakage due to beacon updates,
which will guide beacon admins on when and how to update the content of the beacon. As the second
goal, we will design risk quantification algorithms to assess the risk and inform both the genome
donors and beacon operators on possible risks of sharing data. This will be the first attempt at helping
beacon operators and participants make informed decisions. We project that if this project is realized,
beacon system will be transparent in terms of privacy risks, which will reinstate the trustworthiness of
the system and increase its usability. This in turn will tear down the borders that stand in the way of
sharing genomic data and enable all downstream research that will benefit from larger data sizes. Our
final goal is to focus on countermeasures to protect sensitive information. We observe that current
approaches fail to protect the privacy of individuals and provide high data utility at the same time. We
will implement novel differential privacy and game theory-based techniques to ensure privacy-
preserving data sharing with high data utility.
摘要:超大型基因组数据集的出现预示着医学领域的一场革命。
事实证明,确保此类数据集中参与者的匿名性并不简单。
以保护隐私的方式共享数据是医学进步的主要瓶颈。
最近,社区驱动的协议已被广泛采用来共享基因组数据。
“基因组数据共享信标协议”旨在提供一个安全、易于实施、
用于数据共享的标准化接口,仅允许对特定数据是否存在进行是/否查询
数据集中的等位基因以前被认为能够抵御隐私威胁,最近信标协议被认为是强大的。
现在,
没有办法系统地评估基因组捐赠者和基因组的信标隐私风险。
从双方的角度来看,这对信标的可用性产生了怀疑。
由于可能的违规行为会产生影响,对信标运营商来说,建立信标是有风险的。
此外,对于缺乏技术背景来了解风险的捐助者来说,通常更安全的做法是
因此,全面了解该系统的缺陷并向基因组捐赠者通报情况。
潜在威胁的信标运营商是向前发展需要克服的重要问题。
该提案的目的是(i)检测和分析基因组数据共享的漏洞
信标,(ii) 为捐助者和数据所有者提供风险量化工具,以告知双方
各方可能存在的风险,以及 (iii) 针对这些漏洞制定对策。
就信标系统可能存在的漏洞和潜在风险提供广泛的前期工作
我们将首次调查因信标更新而导致的信息泄露,
这将指导信标管理员何时以及如何更新信标的内容。
目标,我们将设计风险量化算法来评估风险并告知基因组
捐赠者和信标运营商共享数据可能存在的风险,这将是帮助的第一次尝试。
我们预计,如果该项目得以实现,信标运营商和参与者将做出明智的决定。
信标系统在隐私风险方面将是透明的,这将恢复人们的可信度
系统并提高其可用性,这反过来又会打破阻碍的边界。
共享基因组数据并使所有下游研究能够从更大的数据量中受益。
我们观察到当前的情况,最终目标是集中于保护敏感信息的对策。
这些方法无法在保护个人隐私的同时提供较高的数据效用。
将实施新颖的差分隐私和基于博弈论的技术来确保隐私-
保持数据共享,数据利用率高。
项目成果
期刊论文数量(12)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Towards Robust Fingerprinting of Relational Databases by Mitigating Correlation Attacks.
通过减轻相关攻击实现关系数据库的鲁棒指纹识别。
- DOI:
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Ji, Tianxi;Ayday, Erman;Yilmaz, Emre;Li, Pan
- 通讯作者:Li, Pan
Tuning Privacy-Utility Tradeoff in Genomic Studies Using Selective SNP Hiding.
使用选择性 SNP 隐藏调整基因组研究中的隐私与实用性权衡。
- DOI:
- 发表时间:2023-04
- 期刊:
- 影响因子:0
- 作者:Alserr, Nour Almadhoun;Kale, Gulce;Mutlu, Onur;Tastan, Oznur;Ayday, Erman
- 通讯作者:Ayday, Erman
Genome Reconstruction Attacks Against Genomic Data-Sharing Beacons.
针对基因组数据共享信标的基因组重建攻击。
- DOI:
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Ayoz, Kerem;Ayday, Erman;Cicek, A Ercument
- 通讯作者:Cicek, A Ercument
Privacy-preserving federated genome-wide association studies via dynamic sampling.
通过动态采样进行隐私保护联合全基因组关联研究。
- DOI:
- 发表时间:2023-10-03
- 期刊:
- 影响因子:0
- 作者:Wang, Xinyue;Dervishi, Leonard;Li, Wentao;Ayday, Erman;Jiang, Xiaoqian;Vaidya, Jaideep
- 通讯作者:Vaidya, Jaideep
The effect of kinship in re-identification attacks against genomic data sharing beacons.
亲属关系在针对基因组数据共享信标的重新识别攻击中的影响。
- DOI:
- 发表时间:2020-12-30
- 期刊:
- 影响因子:0
- 作者:Ayoz, Kerem;Aysen, Miray;Ayday, Erman;Cicek, A Ercument
- 通讯作者:Cicek, A Ercument
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Erman Ayday其他文献
Erman Ayday的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Erman Ayday', 18)}}的其他基金
Accelerating Genomic Data Sharing and Collaborative Research with Privacy Protection
通过隐私保护加速基因组数据共享和协作研究
- 批准号:
10735407 - 财政年份:2023
- 资助金额:
$ 30.19万 - 项目类别:
Privacy Challenges of Genomic Data-Sharing Beacons and Solutions
基因组数据共享信标和解决方案的隐私挑战
- 批准号:
10031275 - 财政年份:2020
- 资助金额:
$ 30.19万 - 项目类别:
Privacy Challenges of Genomic Data-Sharing Beacons and Solutions
基因组数据共享信标和解决方案的隐私挑战
- 批准号:
10443776 - 财政年份:2020
- 资助金额:
$ 30.19万 - 项目类别:
Privacy Challenges of Genomic Data-Sharing Beacons and Solutions
基因组数据共享信标和解决方案的隐私挑战
- 批准号:
10223439 - 财政年份:2020
- 资助金额:
$ 30.19万 - 项目类别:
相似国自然基金
基于肿瘤病理图片的靶向药物敏感生物标志物识别及统计算法的研究
- 批准号:82304250
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
多模态高层语义驱动的深度伪造检测算法研究
- 批准号:62306090
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
高精度海表反照率遥感算法研究
- 批准号:42376173
- 批准年份:2023
- 资助金额:51 万元
- 项目类别:面上项目
基于新型深度学习算法和多组学研究策略鉴定非编码区剪接突变在肌萎缩侧索硬化症中的分子机制
- 批准号:82371878
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
基于深度学习与水平集方法的心脏MR图像精准分割算法研究
- 批准号:62371156
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
相似海外基金
Precision mapping of regulatory causal variants by expression CROPseq
通过表达 CROPseq 精确绘制调控因果变异
- 批准号:
10095869 - 财政年份:2021
- 资助金额:
$ 30.19万 - 项目类别:
Single cell, whole genome analysis of the aging human cardiomyocyte
衰老人类心肌细胞的单细胞、全基因组分析
- 批准号:
10326403 - 财政年份:2021
- 资助金额:
$ 30.19万 - 项目类别:
Single cell, whole genome analysis of the aging human cardiomyocyte
衰老人类心肌细胞的单细胞、全基因组分析
- 批准号:
10548738 - 财政年份:2021
- 资助金额:
$ 30.19万 - 项目类别:
Precision mapping of regulatory causal variants by expression CROPseq
通过表达 CROPseq 精确绘制调控因果变异
- 批准号:
10557093 - 财政年份:2021
- 资助金额:
$ 30.19万 - 项目类别:
Precision mapping of regulatory causal variants by expression CROPseq
通过表达 CROPseq 精确绘制调控因果变异
- 批准号:
10341085 - 财政年份:2021
- 资助金额:
$ 30.19万 - 项目类别: