Informatics platform for mammalian gene regulation at isoform-level

异构体水平的哺乳动物基因调控信息学平台

基本信息

  • 批准号:
    8658144
  • 负责人:
  • 金额:
    $ 33.72万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2013
  • 资助国家:
    美国
  • 起止时间:
    2013-05-02 至 2016-04-30
  • 项目状态:
    已结题

项目摘要

DESCRIPTION (provided by applicant): In recent years, the notion of "one gene makes one protein that functions in one signaling pathway" in mammalian cells has been shown to be overly simplistic. Recent evidence suggests that more than 50% of the human genes produce multiple protein isoforms, through alternative splicing and alternative usage of transcription initiation and/or termination. Notably, the disruption of many of these genes is implicated in cancer and several neuropsychiatric disorders. For majority of human genes the resulting multiple protein isoforms are functionally different and can participate in different signaling pathways. However, nearly after a decade since the completion of the human genome draft sequence, we still assume "gene" as the basic functional unit in a cell. We argue that the isoform-level gene products - "transcript variants" and "protein isoforms" are the basic functional units in a mammalian cell, and accordingly, the informatics resources for managing and analyzing gene regulation data in mammalian cells should adopt "gene isoform centric" rather than "gene centric" approaches. We propose to build an informatics platform for understanding gene regulation at isoform-level by developing statistically rigorous bioinformatics resources for processing Next-Generation Sequencing (NGS) data. Recently, computational approaches that combine seemingly disparate experimental data have been successful in developing concise gene regulation models and transcriptional modules. We plan to extend these methodologies to perform integrative analysis of multiple high-throughput data sets currently generated across different laboratories, including ours at Wistar, into computational models to predict different transcriptional isoforms of mammalian genes and protein-DNA interactions at isoform level. We will apply innovative statistical modeling approaches that combine state-of-the-art meta-classification algorithms, such as Na¿ve Bayes Tree, Bagging and LogitBoost, with Random Forest feature selection to classify different types of target promoters with good classification accuracy and reduced instability, in order to predict gene promoters and infer the protein-DNA interactions from ChIP-seq data. The computational models and the derived information will be integrated into a novel database, which will serve as an in silico platform for transcriptional regulation studies. This will be completed by pursuing the following aims, (1) Develop statistically rigorous novel algorithms and bioinformatics pipelines to identify the orthologous promoters, corresponding transcript variants and protein isoforms that are conserved between human and mouse, (2) develop novel algorithms and informatics pipelines for integrative analysis of NGS datasets to estimate the activity and expression of both known and novel promoters and their transcript variants, in various tissues, developmental stages, and disease conditions, and (3) develop a web-accessible database for integrating the information generated. The novel bioinformatics methods developed by this project will help in silico discovery and research for accelerating the linkage of phenotypic and genomic information, at gene-isoform level.
描述(由申请人提供):近年来,在哺乳动物细胞中“一个基因产生一种在一种信号通路中发挥作用的蛋白质”的概念已被证明过于简单化。最近的证据表明,超过 50% 的人类基因。通过选择性剪接和转录起始和/或终止的选择性使用,产生多种蛋白质同种型。值得注意的是,许多这些基因的破坏与癌症和由此产生的几种神经精神疾病有关。多种蛋白质同工型在功能上有所不同,并且可以参与不同的信号传导途径。然而,自人类基因组草图序列完成近十年后,我们仍然认为“基因”是细胞中的基本功能单位。水平基因产物——“转录变体”和“蛋白质亚型”是基本功能 因此,管理和分析哺乳动物细胞基因调控数据的信息学资源应采用“以基因异构体为中心”而不是“以基因为中心”的方法,我们建议建立一个信息学平台来理解异构体的基因调控。最近,看似不同的实验数据的计算方法已成功开发出简洁的基因调控模型和转录模块。我们计划将这些方法扩展到对当前在不同实验室(包括我们在 Wistar 的实验室)生成的多个高通量数据集进行综合分析,以预测哺乳动物基因的不同转录异构体和异构体水平上的蛋白质-DNA 相互作用。应用创新的统计建模方法,结合最先进的元分类算法,例如 Na¿ ve Bayes Tree、Bagging 和 LogitBoost,通过随机森林特征选择对不同类型的目标启动子进行分类,具有良好的分类精度和降低的不稳定性,以便预测基因启动子并从 ChIP-seq 数据推断蛋白质-DNA 相互作用的计算模型。所得信息将被整合到一个新的数据库中,该数据库将作为转录调控研究的计算机平台,这将通过追求以下目标来完成:(1)开发严格的新算法和生物信息学管道。识别人类和小鼠之间保守的直系同源启动子、相应的转录本变体和蛋白质亚型,(2) 开发新的算法和信息学管道,用于 NGS 数据集的综合分析,以估计已知和新型启动子及其转录本的活性和表达(3) 开发一个可通过网络访问的数据库来整合生成的信息,该项目开发的新型生物信息学方法将有助于计算机发现和研究。用于在基因亚型水平上加速表型和基因组信息的联系。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

RAMANA V DAVULURI其他文献

RAMANA V DAVULURI的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('RAMANA V DAVULURI', 18)}}的其他基金

Developing novel deep-learning based methods for deciphering non-coding gene regulatory code
开发基于深度学习的新型方法来破译非编码基因调控密码
  • 批准号:
    10451673
  • 财政年份:
    2021
  • 资助金额:
    $ 33.72万
  • 项目类别:
Developing novel deep-learning based methods for deciphering non-coding gene regulatory code
开发基于深度学习的新型方法来破译非编码基因调控密码
  • 批准号:
    10615784
  • 财政年份:
    2021
  • 资助金额:
    $ 33.72万
  • 项目类别:
Informatics Platform for Mammalian Gene Regulation at Isoform-level
异构体水平的哺乳动物基因调控信息学平台
  • 批准号:
    10273985
  • 财政年份:
    2020
  • 资助金额:
    $ 33.72万
  • 项目类别:
Informatics Platform for Mammalian Gene Regulation at Isoform-level
异构体水平的哺乳动物基因调控信息学平台
  • 批准号:
    9922347
  • 财政年份:
    2013
  • 资助金额:
    $ 33.72万
  • 项目类别:
Informatics Platform for Mammalian Gene Regulation at Isoform-level
异构体水平的哺乳动物基因调控信息学平台
  • 批准号:
    8843951
  • 财政年份:
    2013
  • 资助金额:
    $ 33.72万
  • 项目类别:
Bioinformatics Facility
生物信息学设施
  • 批准号:
    7945001
  • 财政年份:
    2009
  • 资助金额:
    $ 33.72万
  • 项目类别:
Genomewide discovery & analysis of alternative promoters
全基因组发现
  • 批准号:
    7371108
  • 财政年份:
    2006
  • 资助金额:
    $ 33.72万
  • 项目类别:
Genomewide discovery & analysis of alternative promoters
全基因组发现
  • 批准号:
    7678211
  • 财政年份:
    2006
  • 资助金额:
    $ 33.72万
  • 项目类别:
Genomewide discovery & analysis of alternative promoters
全基因组发现
  • 批准号:
    7226994
  • 财政年份:
    2006
  • 资助金额:
    $ 33.72万
  • 项目类别:
Genomewide discovery & analysis of alternative promoters
全基因组发现
  • 批准号:
    7580978
  • 财政年份:
    2006
  • 资助金额:
    $ 33.72万
  • 项目类别:

相似国自然基金

随机阻尼波动方程的高效保结构算法研究
  • 批准号:
    12301518
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
大规模黎曼流形稀疏优化算法及应用
  • 批准号:
    12371306
  • 批准年份:
    2023
  • 资助金额:
    43.5 万元
  • 项目类别:
    面上项目
基于任意精度计算架构的量子信息处理算法硬件加速技术研究
  • 批准号:
    62304037
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
分布式非凸非光滑优化问题的凸松弛及高低阶加速算法研究
  • 批准号:
    12371308
  • 批准年份:
    2023
  • 资助金额:
    43.5 万元
  • 项目类别:
    面上项目
基于物理信息神经网络的雷达回波资料反演蒸发波导算法研究
  • 批准号:
    42305048
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Computational tools and resources to study alternative splicing and mRNA isoform variation
研究选择性剪接和 mRNA 亚型变异的计算工具和资源
  • 批准号:
    10669330
  • 财政年份:
    2022
  • 资助金额:
    $ 33.72万
  • 项目类别:
Methods for improving clinical diagnostic by detection, prediction, interpretation and prioritization of aberrant transcriptome variations
通过异常转录组变异的检测、预测、解释和优先排序来改进临床诊断的方法
  • 批准号:
    10227951
  • 财政年份:
    2020
  • 资助金额:
    $ 33.72万
  • 项目类别:
Methods for improving clinical diagnostic by detection, prediction, interpretation and prioritization of aberrant transcriptome variations
通过异常转录组变异的检测、预测、解释和优先排序来改进临床诊断的方法
  • 批准号:
    10033447
  • 财政年份:
    2020
  • 资助金额:
    $ 33.72万
  • 项目类别:
Methods for improving clinical diagnostic by detection, prediction, interpretation and prioritization of aberrant transcriptome variations
通过异常转录组变异的检测、预测、解释和优先排序来改进临床诊断的方法
  • 批准号:
    10451556
  • 财政年份:
    2020
  • 资助金额:
    $ 33.72万
  • 项目类别:
Methods for improving clinical diagnostic by detection, prediction, interpretation and prioritization of aberrant transcriptome variations
通过异常转录组变异的检测、预测、解释和优先排序来改进临床诊断的方法
  • 批准号:
    10674723
  • 财政年份:
    2020
  • 资助金额:
    $ 33.72万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了