TC:Large:Collaborative Research:Anonymizing Textual Data and its Impact on Utility

TC:大型:协作研究:匿名文本数据及其对实用性的影响

基本信息

  • 批准号:
    1011984
  • 负责人:
  • 金额:
    $ 35.7万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2010
  • 资助国家:
    美国
  • 起止时间:
    2010-09-01 至 2015-08-31
  • 项目状态:
    已结题

项目摘要

Data Protection laws that exempt data that is not individually identifiable have led to an explosion in anonymization research. Unfortunately, how well current de-identification and anonymization techniques control risks to privacy and confidentiality is not well understood. Neither is the usefulness of anonymized data for real-world applications. The project addresses anonymization on three fronts: 1) Textual data, even when explicit identifiers are removed (names, dates, locations), can contain highly identifiable information. For example, a sample of chief complaint fields from the Indiana Network for Patient Care (INPC) found several instances of "phantom limb pain". Amputees can be visually identifiable, but the HIPAA Safe Harbor rules do not list this as "identifying information". Any policy explicitly listing all types of identifying data is likely to fail. Through a joint effort with computer science and linguistics, the project is developing new methods to remove specific details from text while preserving meaning, eliminating such highly identifiable information without a priori knowledge of what would be identifying. 2) Current anonymization research is based on unproven measures of identifiability. Through a re-identification challenge on synthetic data (but based on real healthcare data), the project is evaluating the efficacy of these measures. Interdisciplinary teams of students are given challenge problems - anonymized data with hypothetical healthcare data - and asked to make (hypothetical) inferences about health information of individuals. The results can be used to calibrate the effectiveness of different anonymization measures. 3) The utility of anonymized data has been a concern among research: Does anonymized data provide credible research results? By partnering with healthcare studies at the Kinsey Institute and Purdue University School of Nursing, the project is comparing analyses on original data with analyses on anonymized data, and evaluating the impact of types of anonymization on research results. A related issue is determining the impact on data collection: Are individuals more candid in their responses if they know data will be anonymized? Outcomes are broadening the scope of research that can be performed on anonymized data, while ensuring that researchers know when access to individually identifiable data (with attendant restrictions and safeguards) is needed. Through these tasks, the project is advancing our ability to utilize the wealth of data we now collect for the benefit of society, while ensuring individual privacy is protected. For further information see the project web site at the URL: http://projects.cerias.purdue.edu/TextAnon
豁免无法单独识别的数据的数据保护定律导致匿名研究爆炸。不幸的是,目前的去识别和匿名技术如何控制隐私和机密性的风险。匿名数据对现实世界应用程序的有用性也不是。该项目在三个方面解决了匿名:1)即使删除了显式标识符(名称,日期,位置),文本数据也可能包含高度可识别的信息。例如,印第安纳州患者护理网络(INPC)的主要投诉领域样本发现了几个“幻影肢体疼痛”的实例。可以在视觉上识别amputees,但是HIPAA安全港规则并未将其列为“识别信息”。任何明确列出所有标识数据的策略都可能失败。通过与计算机科学和语言学的共同努力,该项目正在开发新的方法,以从文本中删除特定细节,同时保留含义,消除这种高度可识别的信息,而无需先验地了解要识别的内容。 2)当前的匿名研究基于未经证实的可识别性措施。通过对合成数据的重新识别挑战(但基于实际的医疗保健数据),该项目正在评估这些措施的功效。跨学科的学生团队有挑战问题 - 带有假设医疗保健数据的匿名数据 - 并要求对个人健康信息进行(假设)推论。结果可用于校准不同匿名度量的有效性。 3)匿名数据的效用在研究中一直是一个关注:匿名数据是否提供可靠的研究结果?通过与金西研究所和普渡大学护理学院的医疗保健研究合作,该项目正在将原始数据的分析与匿名数据的分析进行比较,并评估匿名类型对研究结果的影响。一个相关的问题是确定对数据收集的影响:如果个人知道数据会被匿名化,他们的响应是否更加坦率?结果正在扩大可以在匿名数据上执行的研究范围,同时确保研究人员知道何时需要访问单独识别的数据(带有限制和保障措施)。通过这些任务,该项目正在促进我们利用现在收集的大量数据的能力,同时确保保护个人隐私。有关更多信息

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Wei Jiang其他文献

The Effect of Mo Addition on Electrocatalytic Activity and Stability of Fe-Co-P-C Metallic Glasses for Hydrogen Evolution
Mo添加对Fe-Co-P-C金属玻璃析氢电催化活性和稳定性的影响
  • DOI:
    10.1149/1945-7111/ac1313
  • 发表时间:
    2021-07
  • 期刊:
  • 影响因子:
    3.9
  • 作者:
    Fabao Zhang;Wenbing Shan;Qingzhuo Hu;Wei Jiang;Dongdong Li;Bo Zhang
  • 通讯作者:
    Bo Zhang
Green Practices and Customer Evaluations of the Service Experience: The Moderating Roles of External Environmental Factors and Firm Characteristics
绿色实践与客户对服务体验的评价:外部环境因素和企业特征的调节作用
  • DOI:
    10.1007/s10551-022-05044-x
  • 发表时间:
    2022-01
  • 期刊:
  • 影响因子:
    6.1
  • 作者:
    Wei Jiang;Liwen Wang;Kevin Zheng Zhou
  • 通讯作者:
    Kevin Zheng Zhou
Overcoming oxygen heterogeneity of tumor microenvironments to boost cancer immunotherapy by oxygen-switchable ROS/RNS nanogenerators
通过氧可切换 ROS/RNS 纳米发电机克服肿瘤微环境的氧异质性以促进癌症免疫治疗
  • DOI:
    10.1016/j.nantod.2022.10
  • 发表时间:
    2023-02
  • 期刊:
  • 影响因子:
    17.4
  • 作者:
    Wei Jiang;Zhiqiang Shen;Zixuan Guo;Qin Wang;Quan Li;Jinming Hu;Yucai Wang
  • 通讯作者:
    Yucai Wang
Mapping ecosystem service value in Germany
Enhancement of surface discharge in catalyst pores in dielectric barrier discharges
介质阻挡放电中催化剂孔内表面放电的增强
  • DOI:
    10.1063/1.5082568
  • 发表时间:
    2019
  • 期刊:
  • 影响因子:
    3.2
  • 作者:
    Jian-Guo Gu;Ya Zhang;Ming-Xiang Gao;Hong-Yu Wang;Quan-Zhi Zhang;Lin Yi;Wei Jiang
  • 通讯作者:
    Wei Jiang

Wei Jiang的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Wei Jiang', 18)}}的其他基金

TWC SBE: Medium: Collaborative: Building a Privacy-Preserving Social Networking Platform from a Technological and Sociological Perspective
TWC SBE:媒介:协作:从技术和社会学角度构建保护隐私的社交网络平台
  • 批准号:
    1855391
  • 财政年份:
    2018
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Standard Grant
TWC SBE: Medium: Collaborative: Building a Privacy-Preserving Social Networking Platform from a Technological and Sociological Perspective
TWC SBE:媒介:协作:从技术和社会学角度构建保护隐私的社交网络平台
  • 批准号:
    1564101
  • 财政年份:
    2016
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Standard Grant
I-Corps: An Outsourced and Completely Private Social Network: You&Me
I-Corps:一个外包且完全私有的社交网络:您
  • 批准号:
    1522781
  • 财政年份:
    2015
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Standard Grant
Collaborative Research: Predatory Lending, Predatory Borrowing, and the Mortgage Crisis: Evidence from Loan-Level Data from a Large Bank
合作研究:掠夺性贷款、掠夺性借款和抵押危机:来自大型银行贷款水平数据的证据
  • 批准号:
    0851428
  • 财政年份:
    2009
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
STTR Phase I: Fully Embedded Optical Interconnect Layers Based on Molded Polymer Lightwave Components for Large Field Size Printed Circuit Boards
STTR 第一阶段:用于大面积印刷电路板的基于模制聚合物光波组件的完全嵌入式光学互连层
  • 批准号:
    0539538
  • 财政年份:
    2006
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Standard Grant
CAREER: Data Quality Management through Statistical Quality Control and Data Mining
职业:通过统计质量控制和数据挖掘进行数据质量管理
  • 批准号:
    0542881
  • 财政年份:
    2006
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
Role of Cdk Phosphorylation of HsCdc6 in DNA Replication
HsCdc6 Cdk 磷酸化在 DNA 复制中的作用
  • 批准号:
    0233887
  • 财政年份:
    2002
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
Role of Cdk Phosphorylation of HsCdc6 in DNA Replication
HsCdc6 Cdk 磷酸化在 DNA 复制中的作用
  • 批准号:
    0078432
  • 财政年份:
    2000
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant

相似国自然基金

基于大塑性变形晶粒细化的背压触变反挤压锡青铜偏析行为调控研究
  • 批准号:
    52365047
  • 批准年份:
    2023
  • 资助金额:
    32 万元
  • 项目类别:
    地区科学基金项目
面向大跨度结构的高强多孔骨料内养护UHPC徐变性能与模型研究
  • 批准号:
    52308231
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
基于深度光学的大视场高分辨宽景深小型化显微成像
  • 批准号:
    62301293
  • 批准年份:
    2023
  • 资助金额:
    10 万元
  • 项目类别:
    青年科学基金项目
基于气体多通腔多模非线性效应的大能量可调谐光源的研究
  • 批准号:
    12374318
  • 批准年份:
    2023
  • 资助金额:
    52 万元
  • 项目类别:
    面上项目
二维氮化钼/磷化钼面内异质结构催化材料的设计合成及大电流密度析氢性能研究
  • 批准号:
    22379116
  • 批准年份:
    2023
  • 资助金额:
    50 万元
  • 项目类别:
    面上项目

相似海外基金

TC: Large: Collaborative Research: Facilitating Free and Open Access to Information on the Internet
TC:大型:合作研究:促进互联网上信息的自由和开放获取
  • 批准号:
    1540066
  • 财政年份:
    2015
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
TC: Large: Collaborative Research: Facilitating Free and Open Access to Information on the Internet
TC:大型:合作研究:促进互联网上信息的自由和开放获取
  • 批准号:
    1111734
  • 财政年份:
    2012
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
TC: Large: Collaborative Research: Facilitating Free and Open Access to Information on the Internet
TC:大型:合作研究:促进互联网上信息的自由和开放获取
  • 批准号:
    1111539
  • 财政年份:
    2012
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
TC: Large: Collaborative Research: Facilitating Free and Open Access to Information on the Internet
TC:大型:合作研究:促进互联网上信息的自由和开放获取
  • 批准号:
    1111723
  • 财政年份:
    2012
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
TC: Large: Collaborative Research: Practical Secure Two-Party Computation: Techniques, Tools, and Applications
TC:大型:协作研究:实用安全两方计算:技术、工具和应用
  • 批准号:
    1111781
  • 财政年份:
    2011
  • 资助金额:
    $ 35.7万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了