Biomedical Data Translator Technical Feasibility Assessment and Architecture Design
生物医学数据转换器技术可行性评估和架构设计
基本信息
- 批准号:9540181
- 负责人:
- 金额:$ 65.11万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2016
- 资助国家:美国
- 起止时间:2016-09-23 至 2018-06-30
- 项目状态:已结题
- 来源:
- 关键词:ArchitectureArtificial IntelligenceBiologicalBiological ModelsBiological ProcessCell SurvivalCell physiologyCellsClinical TrialsCommunitiesComplexComputerized Medical RecordCoupledCurrent Procedural Terminology CodesDNA Sequence AlterationDataData SetDiagnosisDiagnosticDiseaseFunctional disorderGene Expression ProfileGenerationsGeneticGoalsHumanHuman GeneticsIndividualKnowledgeLinkMethodologyModelingMolecularMutationNatureOrganPatientsPatternPharmaceutical PreparationsPhenotypePhosphorylationProbabilityProcessed GenesPropertyPublishingQuality ControlResearchResearch PersonnelResolutionResourcesSemanticsSourceSymptomsTerminologyTissuesTranslatingTranslation Processbasebiological systemschemical geneticsdata integrationdesigndrug candidateexperimental studygenetic associationhuman diseaseinsightinteroperabilitynovelphenotypic biomarkerpre-clinicalprotein functionresponsesmall moleculetreatment effect
项目摘要
A fundamental challenge to translate insights between biomedical researchers, who study
biological mechanisms, and clinicians, who diagnose patient symptoms, is that many links
between biological processes and disease pathophysiology are poorly understood. A
comprehensive Biomedical Translator must enable chains of inference across objects as
diverse as genetic mutations, molecular effects, tissue-specific expression patterns,
cellular processes, organ phenotypes, disease states, patient symptoms, and drug
responses, a challenge beyond the scope of any one organization.
Fortunately, many individual links in this chain have been made by experiments yielding
statistical connections between individual data types. High-throughput perturbation screens
link chemical and genetic perturbations to cellular phenotypes such as gene-expression
patterns, cell survival, or changes in phosphorylation. Genetic association studies link
mutations to human disease or intermediate phenotypes and biomarkers. Electronic medical
records (EMR) link diseases or human phenotypes to diagnostic or current procedural
terminology (CPT) codes, and clinical trials link the impact of drugs and drug candidates on
disease states.
In principle, incorporating these links into chains of inference could translate results between
the full set of data types within them. In practice, each link is maintained by experts with
domain-specific experiments, semantic terminology, and methodological standards. While a key
challenge faced by a global Biomedical Translator is to establish consistent standards across
these existing data types, a more important goal is to develop a principled and robust
framework to (a) model biological systems and experimental approaches to investigate them;
(b) organize knowledge about biological mechanism and disease; and (c) incorporate
diverse datasets that serve as windows into the underlying and unknown state of nature.
We propose to implement a Biomedical Translator as a probabilistic graphical model, a
paradigm from artificial intelligence (AI) research. Just as separate research communities form
weakly coupled parts of the translation process, graphical models allow global inferences from
weakly coupled “nodes”. These inferences require each node to publish only probability
distributions, enabling interoperability without necessarily having global entity-resolution
standards, and benefit from paradigms for quality control, fault tolerance, and relevance
assessment common in AI research. We hypothesize that a limited number of APIs,
implemented as probability computations by communities around the world, would yield a
Biomedical Translator as an emergent property of weakly coupled knowledge sources.
From basic properties of graphical models, such a Translator could probabilistically translate
among any data types connected within it, allowing for relatively complex query concepts. For
example: What cellular processes in which tissues are impacted in a patient-based EMR? What
genetic mutations sensitize cells to small-molecule treatment effects? Which small molecules
mimic genetic “experiments of nature” that protect against disease?
To illustrate the value of these resources and our architectural paradigm, we propose a
demonstration project to implement a Biomedical Translator supporting queries between
small molecules, biological processes, genes, and disease. The demonstration project will
provide a valuable first step to confront key data-integration and organizational challenges and
will enable previously impossible queries, such as identifying small molecules that perturb the
same biological processes implicated by human genetics in a disease context. In this capacity,
such Translator could realistically identify existing drugs for known symptoms (i.e., repurposing),
but could more broadly serve as an engine for hypothesis generation and
biological discovery, suggesting pre-clinical small molecules to develop based on their
observed biological activity, or providing heretofore novel links between cellular protein function
and disease pathophysiology.
在生物医学研究人员之间转化见解的基本挑战
生物学机制和诊断患者症状的忠诚者是许多环节
生物过程和疾病病理生理学之间的关系知之甚少。
全面的生物医学翻译器必须启用跨对象的推理链
基因突变、分子效应、组织特异性表达模式等多种多样,
细胞过程、器官表型、疾病状态、患者症状和药物
响应,这是一项超出任何组织范围的挑战。
幸运的是,这个链条中的许多单独的环节都是通过实验实现的
高通量扰动屏幕之间的统计联系。
将化学和遗传扰动与细胞表型(例如基因表达)联系起来
模式、细胞存活或磷酸化的变化遗传关联研究链接。
人类疾病或中间表型和生物标志物的突变。
记录 (EMR) 将疾病或人类表型与诊断或当前程序联系起来
术语(CPT)代码和临床试验将药物和候选药物的影响联系起来
疾病状态。
将这些链接纳入原则推理链可以将结果转化为
在实践中,每个链接都由专家维护。
特定领域的实验、语义术语和方法标准是关键。
全球生物医学翻译人员面临的挑战是在各个领域建立一致的标准
这些现有的数据类型,更重要的目标是开发一个有原则的、健壮的
(a) 生物系统模型和研究它们的实验方法的框架;
(b) 整理有关生物机制和疾病的知识;以及 (c) 纳入其中
不同的数据集作为了解潜在的和未知的自然状态的窗口。
我们建议将生物医学翻译器实现为概率图形模型,
正如独立研究团体的形成一样。
翻译过程中的弱耦合部分,图形模型允许从全局推断
这些推论要求每个节点仅发布概率。
发行版,无需全局实体解析即可实现互操作性
标准,并受益于质量控制、容错和相关性的范式
我们勇敢地面对人工智能研究中常见的评估,API 数量有限,
由世界各地的社区以概率计算的形式实现,将产生
生物医学翻译器作为弱耦合知识源的新兴属性。
从图模型的基本属性来看,这样的翻译器可以概率性地翻译
在其中连接的任何数据类型之间,允许相对复杂的查询概念。
例如:在基于患者的 EMR 中,组织的哪些细胞过程受到影响?
基因突变使细胞对小分子治疗效果敏感?
模仿遗传“自然实验”来预防疾病?
为了说明这些资源和我们的架构范例的价值,我们提出了一个
实施生物医学翻译器的示范项目,支持之间的查询
小分子、生物过程、基因和疾病。
为应对关键数据集成和组织挑战提供了宝贵的第一步,
将实现以前不可能的查询,例如识别干扰的小分子
在这种能力下,人类遗传学所涉及的相同生物过程。
这样的翻译器可以现实地识别现有药物的已知症状(即重新利用),
但可以更广泛地充当假设生成和
生物学发现,建议根据其特性开发临床前小分子
观察到的生物活性,或提供迄今为止细胞蛋白质功能之间的新联系
和疾病病理生理学。
项目成果
期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Application of MCAT questions as a testing tool and evaluation metric for knowledge graph-based reasoning systems.
应用 MCAT 问题作为基于知识图的推理系统的测试工具和评估指标。
- DOI:
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Fecho, Karamarie;Balhoff, James;Bizon, Chris;Byrd, William E;Hang, Sui;Koslicki, David;Rensi, Stefano E;Schmitt, Patrick L;Wawer, Mathias J;Williams, Mark;Ahalt, Stanley C
- 通讯作者:Ahalt, Stanley C
Knowledge Beacons: Web services for data harvesting of distributed biomedical knowledge.
知识信标:用于收集分布式生物医学知识数据的 Web 服务。
- DOI:
- 发表时间:2021
- 期刊:
- 影响因子:3.7
- 作者:Hannestad, Lance M;Dančík, Vlado;Godden, Meera;Suen, Imelda W;Huellas;Good, Benjamin M;Mungall, Christopher J;Bruskiewich, Richard M
- 通讯作者:Bruskiewich, Richard M
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
PAUL ANDREW CLEMONS其他文献
PAUL ANDREW CLEMONS的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('PAUL ANDREW CLEMONS', 18)}}的其他基金
Translating novel cancer targets and mechanisms from the CTD^2 Network using molecular glues
使用分子胶从 CTD^2 网络转化新的癌症靶点和机制
- 批准号:
10505307 - 财政年份:2022
- 资助金额:
$ 65.11万 - 项目类别:
Translating novel cancer targets and mechanisms from the CTD^2 Network using molecular glues
使用分子胶从 CTD^2 网络转化新的癌症靶点和机制
- 批准号:
10704124 - 财政年份:2022
- 资助金额:
$ 65.11万 - 项目类别:
Translating novel cancer targets and mechanisms from the CTD^2 Network using molecular glues
使用分子胶从 CTD^2 网络转化新的癌症靶点和机制
- 批准号:
10704124 - 财政年份:2022
- 资助金额:
$ 65.11万 - 项目类别:
A Translator Knowledge Provider for Systems Chemical Biology
系统化学生物学翻译知识提供者
- 批准号:
10548044 - 财政年份:2020
- 资助金额:
$ 65.11万 - 项目类别:
A Translator Knowledge Provider for Systems Chemical Biology
系统化学生物学翻译知识提供者
- 批准号:
10332543 - 财政年份:2020
- 资助金额:
$ 65.11万 - 项目类别:
Informing synthetic decision making using cheminformatic and bioinformatic profil
使用化学信息学和生物信息学分析为综合决策提供信息
- 批准号:
7696771 - 财政年份:2008
- 资助金额:
$ 65.11万 - 项目类别:
General data-analysis tools to relate chemical diversity
关联化学多样性的通用数据分析工具
- 批准号:
7476648 - 财政年份:2005
- 资助金额:
$ 65.11万 - 项目类别:
General data-analysis tools:cCemical Diversity (RMI)
通用数据分析工具:cChemical Diversity (RMI)
- 批准号:
7032046 - 财政年份:2005
- 资助金额:
$ 65.11万 - 项目类别:
General data-analysis tools to relate chemical diversity
关联化学多样性的通用数据分析工具
- 批准号:
7125582 - 财政年份:2005
- 资助金额:
$ 65.11万 - 项目类别:
Small-molecule fluorophores: screening for specific protein or RNA binding
小分子荧光团:筛选特定蛋白质或 RNA 结合
- 批准号:
6941848 - 财政年份:2004
- 资助金额:
$ 65.11万 - 项目类别:
相似国自然基金
水光谱与超声生物成像人工智能方法研究
- 批准号:22374082
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
基于多时序多模态MR特征和人工智能技术构建骨肉瘤预后的早期预测模型及其生物学可解释性研究
- 批准号:82373108
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
基于肠道微生物多组学的人工智能技术对尿毒症血液透析患者心血管风险预测的研究
- 批准号:
- 批准年份:2022
- 资助金额:53 万元
- 项目类别:面上项目
人工智能协同多源遥感的城市单木地上生物量估算研究
- 批准号:
- 批准年份:2021
- 资助金额:55 万元
- 项目类别:面上项目
基于人工智能辅助自动拉曼分选/测序耦合的微生物组单细胞功能解析方法学研究
- 批准号:
- 批准年份:2021
- 资助金额:59 万元
- 项目类别:面上项目
相似海外基金
UBIQUIBODY PLATFORM FOR TARGETED DEGRADATION OF ONCOGENIC FUSION PROTEINS
用于靶向降解致癌融合蛋白的 Ubiquibody 平台
- 批准号:
10806354 - 财政年份:2023
- 资助金额:
$ 65.11万 - 项目类别:
Identifying molecular traits associated with extreme human longevity using an AI based integrative approach
使用基于人工智能的综合方法识别与人类极端长寿相关的分子特征
- 批准号:
10745015 - 财政年份:2023
- 资助金额:
$ 65.11万 - 项目类别:
CRCNS: Dense longitudinal neuroimaging to evaluate learning in childhood
CRCNS:密集纵向神经影像评估儿童学习情况
- 批准号:
10835136 - 财政年份:2023
- 资助金额:
$ 65.11万 - 项目类别:
AIM-AI: an Actionable, Integrated and Multiscale genetic map of Alzheimer's disease via deep learning
AIM-AI:通过深度学习绘制阿尔茨海默病的可操作、集成和多尺度遗传图谱
- 批准号:
10668829 - 财政年份:2023
- 资助金额:
$ 65.11万 - 项目类别:
SCH: Al-driven Flexible Electronics for Cardiac Organoid Maturation
SCH:用于心脏类器官成熟的铝驱动柔性电子器件
- 批准号:
10816899 - 财政年份:2023
- 资助金额:
$ 65.11万 - 项目类别: