Collaborative Research: Statistical Methods for Integrated Analysis of High-Throughput Biomedical Data
合作研究:高通量生物医学数据综合分析的统计方法
基本信息
- 批准号:1661802
- 负责人:
- 金额:$ 17.35万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2016
- 资助国家:美国
- 起止时间:2016-09-01 至 2019-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Technological advances have led to a rapid proliferation of high-throughput "omics" data in medicine that hold the key to clinically effective personalized medicine. To realize this goal, statistical and computational tools to mine this data and discover biomarkers, drug targets, disrupted disease networks, and disease sub-types are urgently needed. There are, however, two primary factors which make the development of such statistical tools challenging. First, many high-throughput genomic technologies produce varied heterogeneous data, which include continuous data (microarrays, methylation arrays), count data (RNA-sequencing), and binary/categorical data (SNPs, CNV). These varied data sets do not always satisfy typical distributional assumptions imposed by standard high-dimensional statistical models. Second, in order for scientists to leverage all of their data and understand the complete molecular basis of disease, these varied omics data sets need to be combined into a single multivariate statistical model. This proposal seeks to address these two issues with a new statistical framework for integrated analysis of multiple sets of high-dimensional data measured on the same group of subjects. The key statistical approach uses the theory of exponential family distributions to generalize two foundational high-dimensional statistical frameworks, principal components analysis (PCA) and graphical models, so as to jointly analyze transcriptional, epi-genomics and functional genomics data. This research will be applied to high-throughput cancer genomics data and lead to new methods to (a) discover molecular cancer sub-types along with their genomic signatures and (b) build a holistic network model of disease. By leveraging information across all the different types available of genomic biomarkers, the proposed methods will have the potential to make scientific discoveries critical for personalized medicine. The proposed work will also be broadly applicable to integrating multiple sets of "omics" data, including genomics, proteomics, metabolomics, and imaging. Beyond medicine, the theoretical framework and statistical methods will make significant advances in the theory of exponential families, statistical learning, and the emerging field of integrative analysis as well as have broad applicability in other disciplines such as engineering and security. All results will be disseminated through publications, conferences, and open-source software; this research will also provide training and educational opportunities for doctoral and postdoctoral scholars.
技术进步导致了医学中高通量“ OMICS”数据的快速扩散,这些数据是临床有效的个性化医学的关键。为了实现这一目标,迫切需要迫切需要开采这些数据并发现生物标志物,药物靶标,疾病网络和疾病子类型的统计和计算工具。但是,有两个主要因素使这种统计工具的开发具有挑战性。首先,许多高通量基因组技术产生各种异质数据,其中包括连续数据(微阵列,甲基化阵列),计数数据(RNA序列)和二进制/分类数据(SNP,CNP,CNV)。这些多样化的数据集并不总是满足标准高维统计模型施加的典型分布假设。其次,为了使科学家利用其所有数据并了解疾病的完整分子基础,这些多样的OMIC数据集需要合并为一个多元统计模型。 该提案旨在通过一个新的统计框架解决这两个问题,以综合分析在同一受试者组上测量的多组高维数据。关键的统计方法使用指数式的家庭分布理论来概括两个基础高维统计框架,主要成分分析(PCA)和图形模型,以共同分析转录,Epi-genomics和功能基因组学数据。 这项研究将应用于高通量的癌症基因组学数据,并导致(a)发现分子癌亚型及其基因组特征,以及(b)建立整体疾病网络模型。 通过利用所有可用类型的基因组生物标志物的信息,提出的方法将有可能使科学发现对个性化医学至关重要。 所提出的工作还将广泛地用于整合多组“ OMICS”数据,包括基因组学,蛋白质组学,代谢组学和成像。 除了医学外,理论框架和统计方法还将在指数级家庭,统计学习和综合分析的新兴领域中取得重大进步,并在工程和安全等其他学科中具有广泛的适用性。 所有结果将通过出版物,会议和开源软件传播;这项研究还将为博士和博士后学者提供培训和教育机会。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Pradeep Ravikumar其他文献
Ordinal Graphical Models: A Tale of Two Approaches
序数图形模型:两种方法的故事
- DOI:
10.5555/3305890.3306018 - 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
A. Suggala;Eunho Yang;Pradeep Ravikumar - 通讯作者:
Pradeep Ravikumar
XMRF: an R package to fit Markov Networks to high-throughput genetics data
XMRF:一个 R 包,用于使马尔可夫网络适应高通量遗传学数据
- DOI:
- 发表时间:
2015 - 期刊:
- 影响因子:0
- 作者:
Ying;Genevera I. Allen;Yulia Baker;Eunho Yang;Pradeep Ravikumar;Zhandong Liu - 通讯作者:
Zhandong Liu
Heavy-tailed Streaming Statistical Estimation
重尾流统计估计
- DOI:
10.48550/arxiv.2108.11483 - 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Che-Ping Tsai;Adarsh Prasad;Sivaraman Balakrishnan;Pradeep Ravikumar - 通讯作者:
Pradeep Ravikumar
Learning Graphs with a Few Hubs - Supplementary
用几个中心学习图 - 补充
- DOI:
- 发表时间:
2010 - 期刊:
- 影响因子:0
- 作者:
Rashish Tandon;Pradeep Ravikumar - 通讯作者:
Pradeep Ravikumar
Sample based Explanations via Generalized Representers
通过广义代表进行基于样本的解释
- DOI:
10.48550/arxiv.2310.18526 - 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Che;Chih;Pradeep Ravikumar - 通讯作者:
Pradeep Ravikumar
Pradeep Ravikumar的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Pradeep Ravikumar', 18)}}的其他基金
RI: Medium: Foundations of Self-Supervised Learning Through the Lens of Probabilistic Generative Models
RI:媒介:通过概率生成模型的视角进行自我监督学习的基础
- 批准号:
2211907 - 财政年份:2022
- 资助金额:
$ 17.35万 - 项目类别:
Standard Grant
Collaborative Research: RI: Medium: A Rigorous, General Framework for Tractable Learning of Large-Scale DAGs from Data
协作研究:RI:Medium:从数据中轻松学习大规模 DAG 的严格通用框架
- 批准号:
1955532 - 财政年份:2020
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
RI: Small: Non-parametric Machine Learning in the Age of Deep and High-Dimensional Models
RI:小:深度和高维模型时代的非参数机器学习
- 批准号:
1909816 - 财政年份:2019
- 资助金额:
$ 17.35万 - 项目类别:
Standard Grant
Collaborative Research: Physics-Based Machine Learning for Sub-Seasonal Climate Forecasting
合作研究:基于物理的机器学习用于次季节气候预测
- 批准号:
1934584 - 财政年份:2019
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
CAREER: A New Neat Framework for Statistical Machine Learning
职业:统计机器学习的新简洁框架
- 批准号:
1661755 - 财政年份:2016
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
BIGDATA: F: DKA: Collaborative Research: High-Dimensional Statistical Machine Learning for Spatio-Temporal Climate Data
BIGDATA:F:DKA:协作研究:时空气候数据的高维统计机器学习
- 批准号:
1664720 - 财政年份:2016
- 资助金额:
$ 17.35万 - 项目类别:
Standard Grant
BIGDATA: F: DKA: Collaborative Research: High-Dimensional Statistical Machine Learning for Spatio-Temporal Climate Data
BIGDATA:F:DKA:协作研究:时空气候数据的高维统计机器学习
- 批准号:
1447574 - 财政年份:2014
- 资助金额:
$ 17.35万 - 项目类别:
Standard Grant
Collaborative Research: Statistical Methods for Integrated Analysis of High-Throughput Biomedical Data
合作研究:高通量生物医学数据综合分析的统计方法
- 批准号:
1264033 - 财政年份:2013
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
RI: Small: Collaborative Research: Statistical ranking theory without a canonical loss
RI:小:协作研究:没有典型损失的统计排名理论
- 批准号:
1320894 - 财政年份:2013
- 资助金额:
$ 17.35万 - 项目类别:
Standard Grant
CAREER: A New Neat Framework for Statistical Machine Learning
职业:统计机器学习的新简洁框架
- 批准号:
1149803 - 财政年份:2012
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
相似国自然基金
统计过程监测理论在肠道菌群健康指数构建过程中的适用性研究
- 批准号:32372345
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
基于统计分布的超灵敏/单分子表面增强光谱定量检测研究
- 批准号:12374028
- 批准年份:2023
- 资助金额:53 万元
- 项目类别:面上项目
环境混合污染物的健康效应统计分析方法研究
- 批准号:82373690
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
基于语义增强多点地质统计的三维地质体精细建模方法研究
- 批准号:42372345
- 批准年份:2023
- 资助金额:54 万元
- 项目类别:面上项目
SMR#3841 高维统计分析研究的机遇和挑战
- 批准号:12381240133
- 批准年份:2023
- 资助金额:2.00 万元
- 项目类别:国际(地区)合作研究与交流项目
相似海外基金
Collaborative Research: Urban Vector-Borne Disease Transmission Demands Advances in Spatiotemporal Statistical Inference
合作研究:城市媒介传播疾病传播需要时空统计推断的进步
- 批准号:
2414688 - 财政年份:2024
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
Collaborative Research: IMR: MM-1A: Scalable Statistical Methodology for Performance Monitoring, Anomaly Identification, and Mapping Network Accessibility from Active Measurements
合作研究:IMR:MM-1A:用于性能监控、异常识别和主动测量映射网络可访问性的可扩展统计方法
- 批准号:
2319592 - 财政年份:2023
- 资助金额:
$ 17.35万 - 项目类别:
Continuing Grant
Collaborative Research: Enabling Hybrid Methods in the NIMBLE Hierarchical Statistical Modeling Platform
协作研究:在 NIMBLE 分层统计建模平台中启用混合方法
- 批准号:
2332442 - 财政年份:2023
- 资助金额:
$ 17.35万 - 项目类别:
Standard Grant
Characterizing the genetic etiology of delayed puberty with integrative genomic techniques
利用综合基因组技术表征青春期延迟的遗传病因
- 批准号:
10663605 - 财政年份:2023
- 资助金额:
$ 17.35万 - 项目类别:
Integrative Data Science Approach to Advance Care Coordination of ADRD by Primary Care Providers
综合数据科学方法促进初级保健提供者对 ADRD 的护理协调
- 批准号:
10722568 - 财政年份:2023
- 资助金额:
$ 17.35万 - 项目类别: