BIGDATA: IA: Collaborative Research: Asynchronous Distributed Machine Learning Framework for Multi-Site Collaborative Brain Big Data Mining
BIGDATA:IA:协作研究:用于多站点协作大脑大数据挖掘的异步分布式机器学习框架
基本信息
- 批准号:2348159
- 负责人:
- 金额:$ 78万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-09-01 至 2024-09-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Recent advances in multimodal brain imaging and high throughput genotyping and sequencing techniques provide exciting new opportunities to ultimately improve our understanding of brain structure and neural dynamics, their genetic architecture, and their influences on cognition and behavior. However, data privacy and security issues have inhibited data sharing across institutes. Emerging multi-site collaborative data analysis can address these issues and facilitate data and computing resource sharing. In collaborative data analysis, the participating institutes keep their own data, which are analyzed and computed locally, and only share the computed results by communicating with a server. The server communicates with all institutes and updates the local models such that the trained machine learning models indirectly use all data and are shared with all institutes. Although some distributed/parallel computation techniques were recently proposed to address big data mining problems, most of them are synchronous models. Asynchronous distributed learning methods are much more efficient, because they allow the server to update the model with information from only one worker node without waiting for slow worker nodes in each round. However, the convergence analysis for the asynchronous distributed algorithms is much more difficult due to the inconsistent variables update across nodes. Thus, it is challenging to design efficient distributed machine learning algorithms for collaborative big data analysis. The research objective of this project is to address the computational challenges in the emerging multi-site collaborative data mining for brain big data. This project seeks to harness the opportunities of designing new efficient asynchronous distributed machine learning algorithms with rigorous theoretical foundations for multi-site collaborative brain big data mining, creating large-scale computational strategies and effective software tools to reveal sophisticated relationships among heterogeneous brain data. This project designs the asynchronous distributed machine learning and principled big data mining models to conduct the comprehensive study of brain imaging genomics and connectomics. Specifically, the principal investigators investigate: 1) collaborative genotype and phenotype association study using new asynchronous doubly stochastic proximal gradient algorithms; 2) communication-efficient multi-site collaborative data integration models to integrate imaging genomics data for predicting outcomes of interest; 3) collaborative deep learning algorithm speedup by the asynchronous distributed algorithms with applications in temporal cognitive change prediction; and 4) new graph convolutional deep learning models for brain network mining. It is innovative to integrate new distributed machine learning and data-intensive computing with brain imaging genomics and connectomics that hold great promise for a systems biology of the brain. The developed methods and tools impact other neuroimaging, genomics, and neuroscience research, and enable investigators working on brain science to effectively test their scientific hypotheses. This project will also facilitate the development of novel educational tools.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
多模式脑成像和高通量基因分型和测序技术的最新进展提供了令人兴奋的新机会,最终提高我们对大脑结构和神经动力学、其遗传结构及其对认知和行为影响的理解。然而,数据隐私和安全问题阻碍了机构之间的数据共享。新兴的多站点协作数据分析可以解决这些问题并促进数据和计算资源共享。在协作数据分析中,参与机构保留自己的数据,这些数据在本地进行分析和计算,并且仅通过与服务器通信来共享计算结果。服务器与所有机构通信并更新本地模型,以便训练后的机器学习模型间接使用所有数据并与所有机构共享。尽管最近提出了一些分布式/并行计算技术来解决大数据挖掘问题,但大多数都是同步模型。异步分布式学习方法效率更高,因为它们允许服务器仅使用来自一个工作节点的信息来更新模型,而无需在每一轮中等待慢速工作节点。然而,由于节点间变量更新不一致,异步分布式算法的收敛分析变得更加困难。因此,设计用于协作大数据分析的高效分布式机器学习算法具有挑战性。该项目的研究目标是解决新兴的脑大数据多站点协作数据挖掘中的计算挑战。该项目旨在利用设计新的高效异步分布式机器学习算法的机会,为多站点协作大脑大数据挖掘提供严格的理论基础,创建大规模计算策略和有效的软件工具来揭示异构大脑数据之间的复杂关系。该项目设计异步分布式机器学习和原则性大数据挖掘模型,以进行脑成像基因组学和连接组学的综合研究。具体来说,主要研究人员研究:1)使用新的异步双随机近端梯度算法进行协作基因型和表型关联研究; 2)高效通信的多站点协作数据集成模型,用于集成成像基因组学数据以预测感兴趣的结果; 3)通过异步分布式算法加速协作深度学习算法,并应用于时间认知变化预测; 4)用于脑网络挖掘的新图卷积深度学习模型。将新的分布式机器学习和数据密集型计算与脑成像基因组学和连接组学相结合是创新的,这为大脑系统生物学带来了巨大的希望。所开发的方法和工具影响其他神经影像学、基因组学和神经科学研究,并使从事脑科学研究的研究人员能够有效地测试他们的科学假设。该项目还将促进新型教育工具的开发。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Heng Huang其他文献
Supervised Intra-embedding of Fisher Vectors for Histopathology Image Classification
用于组织病理学图像分类的 Fisher 向量的监督内嵌入
- DOI:
10.1007/978-3-319-66179-7_12 - 发表时间:
2017-09-10 - 期刊:
- 影响因子:0
- 作者:
Yang Song;Hang Chang;Heng Huang;Weidong (Tom) Cai - 通讯作者:
Weidong (Tom) Cai
An End-to-end Model of Predicting Diverse Ranking On Heterogeneous Feeds
预测异构提要多样化排名的端到端模型
- DOI:
- 发表时间:
2018-06-01 - 期刊:
- 影响因子:0
- 作者:
Zizhe Gao;Zhengxia Gao;Heng Huang;Zhuoren Jiang;Yuliang Yan - 通讯作者:
Yuliang Yan
N,S-Chelating triazole-thioether palladium for the one-pot synthesis of biaryls
N,S-螯合三唑硫醚钯用于联芳基化合物的一锅法合成
- DOI:
10.1071/ch22116 - 发表时间:
2022-11-10 - 期刊:
- 影响因子:1.1
- 作者:
Q. Yan;Heng Huang;Xiang Si - 通讯作者:
Xiang Si
A Compact 16-Channel Neural Signal Recorder with Wireless Power and Data Transmission
具有无线供电和数据传输功能的紧凑型 16 通道神经信号记录仪
- DOI:
10.1109/iscas46773.2023.10181720 - 发表时间:
2023-05-21 - 期刊:
- 影响因子:0
- 作者:
Heng Huang;Deng Luo;Wei Song;Milin Zhang;Zhihua Wang;Guolin Li - 通讯作者:
Guolin Li
Fast Vehicle Identification in Surveillance via Ranked Semantic Sampling Based Embedding
通过基于排序语义采样的嵌入实现监控中的快速车辆识别
- DOI:
10.24963/ijcai.2018/514 - 发表时间:
2018-07-01 - 期刊:
- 影响因子:0
- 作者:
Feng Zheng;Xin Miao;Heng Huang - 通讯作者:
Heng Huang
Heng Huang的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Heng Huang', 18)}}的其他基金
SCH: INT: New Machine Learning Framework to Conduct Anesthesia Risk Stratification and Decision Support for Precision Health
SCH:INT:用于进行麻醉风险分层和精准健康决策支持的新机器学习框架
- 批准号:
2347604 - 财政年份:2023
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: New Machine Learning Empowered Nanoinformatics System for Advancing Nanomaterial Design
合作研究:III:媒介:新的机器学习赋能纳米信息学系统,促进纳米材料设计
- 批准号:
2347592 - 财政年份:2023
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
Collaborative Research: CCRI: New: A Scalable Hardware and Software Environment Enabling Secure Multi-party Learning
协作研究:CCRI:新:可扩展的硬件和软件环境支持安全的多方学习
- 批准号:
2347617 - 财政年份:2023
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
Collaborative Research: PPoSS: LARGE: Co-designing Hardware, Software, and Algorithms to Enable Extreme-Scale Machine Learning Systems
协作研究:PPoSS:大型:共同设计硬件、软件和算法以实现超大规模机器学习系统
- 批准号:
2348306 - 财政年份:2023
- 资助金额:
$ 78万 - 项目类别:
Continuing Grant
A New Machine Learning Framework for Single-Cell Multi-Omics Bioinformatics
单细胞多组学生物信息学的新机器学习框架
- 批准号:
2405416 - 财政年份:2023
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
III: Medium: Collaborative Research: Integrating Large-Scale Machine Learning and Edge Computing for Collaborative Autonomous Vehicles
III:媒介:协作研究:集成大规模机器学习和边缘计算以实现协作自动驾驶汽车
- 批准号:
2348169 - 财政年份:2023
- 资助金额:
$ 78万 - 项目类别:
Continuing Grant
Collaborative Research: PPoSS: LARGE: Co-designing Hardware, Software, and Algorithms to Enable Extreme-Scale Machine Learning Systems
协作研究:PPoSS:大型:共同设计硬件、软件和算法以实现超大规模机器学习系统
- 批准号:
2217003 - 财政年份:2022
- 资助金额:
$ 78万 - 项目类别:
Continuing Grant
Collaborative Research: CCRI: New: A Scalable Hardware and Software Environment Enabling Secure Multi-party Learning
协作研究:CCRI:新:可扩展的硬件和软件环境支持安全的多方学习
- 批准号:
2213701 - 财政年份:2022
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
A New Machine Learning Framework for Single-Cell Multi-Omics Bioinformatics
单细胞多组学生物信息学的新机器学习框架
- 批准号:
2225775 - 财政年份:2022
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: New Machine Learning Empowered Nanoinformatics System for Advancing Nanomaterial Design
合作研究:III:媒介:新的机器学习赋能纳米信息学系统,促进纳米材料设计
- 批准号:
2211492 - 财政年份:2022
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
相似国自然基金
甘蓝型油菜BnaA01.IA调控花序结构的分子机制解析
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
Ia型超新星抛射物元素丰度与时域观测特征相关性研究
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
年轻Ia型超新星遗迹在湍动背景场中的数值模拟研究
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
大豆GmCPSF73-Ia调控侧根发育的分子机制
- 批准号:
- 批准年份:2021
- 资助金额:58 万元
- 项目类别:面上项目
核因子IA通过调节破骨细胞分化影响骨稳态和骨量的作用及其机制研究
- 批准号:82072389
- 批准年份:2020
- 资助金额:55 万元
- 项目类别:面上项目
相似海外基金
BIGDATA: IA: Collaborative Research: Intelligent Solutions for Navigating Big Data from the Arctic and Antarctic
BIGDATA:IA:协作研究:导航北极和南极大数据的智能解决方案
- 批准号:
2308649 - 财政年份:2022
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
BigData:IA:Collaborative Research: TIMES: A tensor factorization platform for spatio-temporal data
BigData:IA:协作研究:TIMES:时空数据张量分解平台
- 批准号:
2034479 - 财政年份:2020
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Protecting Yourself from Wildfire Smoke: Big Data Driven Adaptive Air Quality Prediction Methodologies
大数据:IA:协作研究:保护自己免受野火烟雾的侵害:大数据驱动的自适应空气质量预测方法
- 批准号:
1838024 - 财政年份:2019
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Intelligent Solutions for Navigating Big Data from the Arctic and Antarctic
BIGDATA:IA:协作研究:导航北极和南极大数据的智能解决方案
- 批准号:
1947584 - 财政年份:2019
- 资助金额:
$ 78万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Asynchronous Distributed Machine Learning Framework for Multi-Site Collaborative Brain Big Data Mining
BIGDATA:IA:协作研究:用于多站点协作大脑大数据挖掘的异步分布式机器学习框架
- 批准号:
1837964 - 财政年份:2019
- 资助金额:
$ 78万 - 项目类别:
Standard Grant