CAREER: Exploring and Exploiting Data-Centric Modeling for Fairness in Machine Learning
职业:探索和利用以数据为中心的建模以实现机器学习的公平性
基本信息
- 批准号:2239257
- 负责人:
- 金额:$ 54.77万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-05-01 至 2028-04-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
This project will lead to advances in dealing with data challenges to facilitate fairness in machine learning, promote broad utilization of machine-learning algorithms in high-stake applications, and ensure a fair and transparent decision-making process for future information systems. While machine-learning methods have achieved success in real-world applications, they often suffer from biases and show discrimination towards certain demographics especially in high-stakes applications, which risks significant harm to both society and individuals. Existing work focuses on “model-centric” computational approaches that build models while overlooking the importance of data quality. To tackle the challenges raised by the lack of high quality data and the lack of a comprehensive understanding of fairness in all its respects, this project will integrate model-centric with “data-centric” modeling, which systematically engineers the data needed for a fair decision-making process. The successful outcome of this multidisciplinary research will lead to effective and efficient algorithms that enhance the generalizability and trustworthiness of learned models, and improve the fairness of algorithms deployed in real-world systems in health informatics and disaster resilience. The education programs of this project will play an integral part in training the next generation of the U.S. workforce with critical Responsible Artificial Intelligence (RAI) technologies and attract and retain diverse members of the future workforce in STEM. The research goal of this project is to develop a computational framework for tackling data challenges in fairness through data-centric fairness mitigation solutions that explore and exploit data and prior knowledge. Complementing existing studies focusing on model-centric or data-driven approaches, this project investigates a novel research direction that systematically explores a data-centric fairness mitigation framework. Specifically, the research objectives include: (1) to explore and extract data characteristics on instances, features and a representative subset of examples in terms of fairness, allowing that fairness definitions and metrics may vary across real-world applications; (2) to expand and refine prior knowledge to guide the discrimination-mitigation process via instance augmentation, feature set expansion, and measurement redefinition perspectives; (3) to leverage interpretable and interactive data and prior knowledge as a key element for further improving fairness modeling; and (4) to demonstrate effectiveness on real-world applications including healthcare informatics and disaster resilience. The educational objectives are: (1) to incorporate responsible artificial intelligence (RAI) into curriculum design via integrating research findings and case studies into current and new courses; (2) to enhance public interest in and awareness of RAI by organizing data challenges and broadcasting information on social media platforms; and (3) to attract and retain women and underrepresented minorities to ensure a diverse future STEM workforce.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
该项目将在处理数据挑战方面取得进展,以促进机器学习的公平性,促进机器学习算法在高风险应用中的广泛使用,并确保未来信息系统的公平和透明的决策过程。尽管学习方法在现实世界的应用中取得了成功,但它们常常存在偏见,并对某些人群表现出歧视,尤其是在高风险的应用中,这可能对社会和个人造成重大损害,现有的工作重点是“以模型为中心”的计算方法。在构建模型的同时忽视数据的重要性为了应对缺乏高质量数据和缺乏对公平性各个方面的全面理解所带来的挑战,该项目将以模型为中心,进行“以数据为中心”的建模,系统地设计公平性所需的数据。这项多学科研究的成功成果将产生有效且高效的算法,增强学习模型的通用性和可信度,并提高在健康信息学和灾难恢复方面现实世界系统中部署的算法的公平性。该项目的教育计划将发挥不可或缺的作用参与使用关键的负责任人工智能 (RAI) 技术培训下一代美国劳动力,并吸引和留住 STEM 领域未来劳动力的多元化成员。该项目的研究目标是开发一个计算框架,以应对公平性受损的数据挑战。通过探索和利用数据和先验知识的以数据为中心的公平性缓解解决方案,该项目研究了一个新的研究方向,系统地探索以数据为中心的公平性缓解框架。研究目标包括:(1)在公平性方面探索和提取实例、特征和代表性示例子集的数据特征,允许公平性定义和度量在实际应用中可能有所不同;(2)扩展和完善先验知识通过实例增强、特征集扩展和测量重新定义视角来指导减少歧视过程;(3) 利用可解释和交互式数据和先验知识作为进一步改进公平性建模的关键要素;(4) 证明真实的有效性; - 世界应用,包括医疗信息学和灾难教育目标是:(1)通过将研究成果和案例研究融入当前和新课程,将负责任的人工智能(RAI)纳入课程设计;(2)通过组织数据挑战,增强公众对 RAI 的兴趣和认识。在社交媒体平台上传播信息;(3) 吸引和留住女性并反映代表性不足的少数群体,以确保未来 STEM 劳动力的多元化。该奖项是 NSF 的法定使命,通过使用基金会的智力优势和更广泛的评估,被认为值得支持。影响审查标准。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Na Zou其他文献
Retiring $Δ$DP: New Distribution-Level Metrics for Demographic Parity
退休 $Δ$DP:人口平等的新分配水平指标
- DOI:
- 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Xiaotian Han;Zhimeng Jiang;Hongye Jin;Zirui Liu;Na Zou;Qifan Wang;Xia Hu - 通讯作者:
Xia Hu
PolyJet 3D Printing: Predicting Color by Multilayer Perceptron Neural Network
PolyJet 3D 打印:通过多层感知器神经网络预测颜色
- DOI:
10.1016/j.stlm.2022.100049 - 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Xingjian Wei;Na Zou;Li Zeng;Zhijian Pei - 通讯作者:
Zhijian Pei
A Data Adaptive Biological Sequence Representation for Supervised Learning
用于监督学习的数据自适应生物序列表示
- DOI:
10.1007/s41666-018-0038-5 - 发表时间:
2018 - 期刊:
- 影响因子:5.9
- 作者:
Hande Cakin;Berk Gorgulu;M. Baydogan;Na Zou;Jing Li - 通讯作者:
Jing Li
Identification of the hybrids between Lilium brownii and L. davidii using fluorescence in situ hybridization (FISH)
使用荧光原位杂交 (FISH) 鉴定布朗百合和戴维百合之间的杂交种
- DOI:
10.17660/actahortic.2019.1237.13 - 发表时间:
2019-04 - 期刊:
- 影响因子:0
- 作者:
Like Wu;Wei Zheng;Kongzhong Xiao;Jie Zeng;Luomin Cui;Hui Li;Yanmei Liu;Na Zou;Junhuo Cai;Shujun Zhou - 通讯作者:
Shujun Zhou
Towards Assumption-free Bias Mitigation
迈向无假设偏见缓解
- DOI:
- 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Chia;Yu;Kwei;Xiaotian Han;Xia Hu;Na Zou - 通讯作者:
Na Zou
Na Zou的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Na Zou', 18)}}的其他基金
Collaborative Research: III: Medium: Towards Effective Detection and Mitigation for Shortcut Learning: A Data Modeling Framework
协作研究:III:媒介:针对捷径学习的有效检测和缓解:数据建模框架
- 批准号:
2310262 - 财政年份:2023
- 资助金额:
$ 54.77万 - 项目类别:
Standard Grant
III: Medium: Collaborative Research: Towards Effective Interpretation of Deep Learning: Prediction, Representation, Modeling and Utilization
III:媒介:协作研究:走向深度学习的有效解释:预测、表示、建模和利用
- 批准号:
1900990 - 财政年份:2019
- 资助金额:
$ 54.77万 - 项目类别:
Continuing Grant
相似国自然基金
肠道病毒抑制Wnt/β-catenin信号通路的机制解析与治疗手段探索
- 批准号:82302500
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
氟-18标记咪唑啉I2受体显像剂的研究及其在AD早期诊断中的应用探索
- 批准号:22306014
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
面向资源受限嵌入式系统的深度神经网络优化和软硬件架构协同探索
- 批准号:62372183
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
“物质-代谢-效应”模式探索瑶药石柑子介导TLR4/NF-κB通路抗RA作用的药效物质基础及作用机制
- 批准号:82304866
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
构建神经系统成纤维细胞多组学图谱探索其在神经系统发育中的功能
- 批准号:32371023
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
相似海外基金
Exploring and exploiting Type IV pili for DNA and antibiotic uptake
探索和利用 IV 型菌毛进行 DNA 和抗生素吸收
- 批准号:
478582 - 财政年份:2023
- 资助金额:
$ 54.77万 - 项目类别:
Operating Grants
Exploring and exploiting new representations for multivariate extremes
探索和利用多元极值的新表示
- 批准号:
EP/X010449/1 - 财政年份:2023
- 资助金额:
$ 54.77万 - 项目类别:
Research Grant
Exploring and Exploiting Epigenetic Plant Immunity
探索和利用表观遗传植物免疫
- 批准号:
BB/W015250/1 - 财政年份:2023
- 资助金额:
$ 54.77万 - 项目类别:
Research Grant
From Sensing to Collaboration: Engineering, Exploring and Exploiting the Building Blocks of Embodied Intelligence - An EPSRC Programme Grant
从感知到协作:工程、探索和利用体现智能的构建模块 - EPSRC 计划资助
- 批准号:
EP/V000748/1 - 财政年份:2021
- 资助金额:
$ 54.77万 - 项目类别:
Research Grant
Giglets: Exploring and exploiting opportunities in Canada and the Americas
Giglets:探索和利用加拿大和美洲的机会
- 批准号:
10017938 - 财政年份:2021
- 资助金额:
$ 54.77万 - 项目类别:
Collaborative R&D