II: Data Cooperatives: Rapid and Incremental Data Sharing with Applications to Bioinformatics

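Basic information

  • Award number:
    0513778
  • Principal investigator:
  • Amount:
    $1.2953 million
  • Host institution:
  • Host institution country:
    United States
  • Project type:
    Standard Grant
  • Fiscal year:
    2005
  • Funding country:
    United States
  • Project period:
    2005-07-15 to 2009-06-30
  • Project status:
    Completed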

Project abstract

Generic tools and technologies will be developed for creating and maintaining data cooperatives, confederations whose purpose is distributed data sharing, to overcome the difficulties encountered in sharing information in the life sciences, specifically in bioinformatics.

Large-scale data sharing has been a long-standing goal of the bioinformatics field, pursued largely through data integration efforts. However, conventional approaches to data integration lack the flexibility and adaptability needed to make the existing and future plethora of data accessible and usable to typical biologists while keeping it rapidly extensible to new concepts, domains, and types of queries, and thus fostering new research developments. The main reasons are that (1) different biologists work with different types of data and at differing levels of abstraction; (2) schemas in the bioinformatics world are typically large and complex; (3) queries and mappings may "break" without warning because of asynchronous updates; (4) it is logistically, economically, and politically difficult to operate centralized data integration facilities.

In response to these difficulties, data cooperatives emphasize: decentralization for both scalability and flexibility; incremental development of resources such as schemas, mappings, and queries; rapid discovery mechanisms for finding the resources relevant to a topic; and tolerance for intermittent participation of members and for approximate consistency of mappings.

More specifically, the technical goals of the proposal include: (1) collaboratively developed yellow pages of biological topics; (2) schema templates, which capture the part of the structure of data pertaining to a specific interest and also function as visual templates from which a query form is created; (3) incremental specification of mappings; (4) reasoning about uncertainty in mappings by measuring their degree of reliability with statistical tools and using it in query answering; (5) multi-path answering for queries, with caching and replication, in a large-scale data cooperative where the participation of individual members may not always be assured.

Data cooperatives will have broader impact through applications in a variety of scientific and industrial fields, but it is in bioinformatics that they are likely to have an immediate and significant impact. Therefore, a specific data cooperative will be developed as a biological testbed for evaluating the proposed technologies. This testbed is based on a small set of databases that are already collaborating and exchanging data related to Plasmodium falciparum. Broader impact will also be achieved through the proposed educational initiatives, specifically through a "computational orchestra" bioinformatics course, which will expose students to data integration issues through project work, and a workshop for the Greater Philadelphia Bioinformatics Alliance (GPBA). Minority involvement will also be encouraged through a GPBA internship program.
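
Technical goals (4) and (5) can be illustrated with a minimal sketch. It is not the project's method: the abstract does not say how reliability is measured or how paths are chosen. The sketch assumes each mapping carries an estimated reliability in [0, 1], that errors along a chain of mappings are independent (so reliabilities multiply), and that a query may only use a path whose member databases are all currently online. The names `Mapping`, `path_reliability`, and `best_available_path` and the members "A", "B", "C" are hypothetical.

```python
from dataclasses import dataclass
from math import prod


@dataclass(frozen=True)
class Mapping:
    source: str         # member database the mapping translates from
    target: str         # member database the mapping translates to
    reliability: float  # assumed estimate in [0, 1] that the mapping is correct


def path_reliability(path):
    """Reliability of a chain of mappings, assuming independent errors."""
    return prod(m.reliability for m in path)


def members_on(path):
    """All member databases that a path of mappings passes through."""
    return {m.source for m in path} | {m.target for m in path}


def best_available_path(paths, online):
    """Return the most reliable path whose members are all online, or None."""
    usable = [p for p in paths if members_on(p) <= online]
    return max(usable, key=path_reliability, default=None)


# Two hypothetical routes from member A to member C.
direct = [Mapping("A", "C", 0.6)]
via_b = [Mapping("A", "B", 0.9), Mapping("B", "C", 0.9)]
paths = [direct, via_b]
```

With all three members online, `best_available_path(paths, {"A", "B", "C"})` picks the two-step route through B, because its combined reliability (about 0.81) exceeds that of the direct mapping (0.6). If B is offline, the function falls back to the direct mapping, which is the kind of tolerance for intermittent participation the abstract describes.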

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Susan Davidson

"Conversations: Rauschenberg in China"
  • DOI:
  • Publication date:
    2016
  • Journal:
  • Impact factor:
    0
  • Authors:
    Hiroko Ikegami;David White;Helen Hsu;Susan Davidson
  • Corresponding author:
    Susan Davidson
Cause and Effect: The Relationship Between Acne and Self-Esteem in the Adolescent Years
  • DOI:
    10.1016/j.nurpra.2008.01.021
  • Publication date:
    2008-09-01
  • Journal:
  • Impact factor:
  • Authors:
    Sandra L. Hedden;Susan Davidson;Christine B. Smith
  • Corresponding author:
    Christine B. Smith
Transcultural Intertwinements in East Asian Art and Culture, 1920s-1950s
  • DOI:
  • Publication date:
    2018
  • Journal:
  • Impact factor:
    0
  • Authors:
    Hiroko Ikegami;David White;Helen Hsu;Susan Davidson;加治屋健司;Hiroko Ikegami;辻泰岳;Hiroko Ikegami
  • Corresponding author:
    Hiroko Ikegami
「中原佑介の韓国」
(Yusuke Nakahara's Korea)
  • DOI:
  • Publication date:
    2016
  • Journal:
  • Impact factor:
    0
  • Authors:
    Hiroko Ikegami;David White;Helen Hsu;Susan Davidson;加治屋健司
  • Corresponding author:
    加治屋健司
Past Disquiet: Artists, International Solidarity and Museums-in-Exile
  • DOI:
  • Publication date:
    2018
  • Journal:
  • Impact factor:
    0
  • Authors:
    Hiroko Ikegami;David White;Helen Hsu;Susan Davidson;加治屋健司;Hiroko Ikegami;辻泰岳;Hiroko Ikegami;Izumi Nakajima
  • Corresponding author:
    Izumi Nakajima

Other grants held by Susan Davidson

III: Medium: Collaborative Research: Citing Structured and Evolving Data
  • Award number:
    1302212
  • Fiscal year:
    2013
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
BPC-DP: Penn COMP-ACT: A College Service Learning Course to Promote and Increase COMPutational Thinking and ACTivities in Afterschool and Summer Programs
  • Award number:
    0940511
  • Fiscal year:
    2010
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
III-COR-Medium: Providing Provenance through Workflows and Database Transformations
  • Award number:
    0803524
  • Fiscal year:
    2008
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
SEIII: Workshop on Information Integration
  • Award number:
    0632541
  • Fiscal year:
    2006
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
Collaborative Research: SEI+II ProtocolDB: Archiving and Querying Scientific Protocols, Data and Provenance
  • Award number:
    0612177
  • Fiscal year:
    2006
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
Preserving Constraints in XML Data Exchange
  • Award number:
    0415810
  • Fiscal year:
    2005
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
DLI Phase-2: Data Provenance
  • Award number:
    9817444
  • Fiscal year:
    1999
  • Award amount:
    $1.2953 million
  • Project type:
    Continuing Grant
Maintaining Curated View Databases
  • Award number:
    9975206
  • Fiscal year:
    1999
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
A Deterministic Model for Semistructured and Structured Data
  • Award number:
    9977408
  • Fiscal year:
    1999
  • Award amount:
    $1.2953 million
  • Project type:
    Standard Grant
Mediated Access to Biological Databases and Applications
  • Award number:
    9402292
  • Fiscal year:
    1994
  • Award amount:
    $1.2953 million
  • Project type:
    Continuing Grant

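Similar grants from the National Natural Science Foundation of China

Research on precise decoupling methods for the circulating tumor DNA fraction in peripheral blood sequencing data from liver cancer
  • Award number:
    62303271
  • Approval year:
    2023
  • Award amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund
Construction and optimization of constraint models for inverting high-precision surface mass change models from satellite gravity data
  • Award number:
    42304097
  • Approval year:
    2023
  • Award amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund
Theory and methods for hybrid physics- and data-driven multimodal visual inspection of complex curved surfaces
  • Award number:
    52375516
  • Approval year:
    2023
  • Award amount:
    CNY 500,000
  • Project type:
    General Program
Multiphysics twin of proton exchange membrane fuel cells based on reduced-order models co-driven by data and mechanism
  • Award number:
    52306112
  • Approval year:
    2023
  • Award amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund
Multiply robust estimation for longitudinal data under missing at random
  • Award number:
    12361057
  • Approval year:
    2023
  • Award amount:
    CNY 270,000
  • Project type:
    Regional Science Fund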

Similar overseas grants

An innovative platform using ML/AI to analyse farm data and deliver insights to improve farm performance, increasing farm profitability by 5-10%
  • Award number:
    10093235
  • Fiscal year:
    2024
  • Award amount:
    $1.2953 million
  • Project type:
    Collaborative R&D
Seamless integration of Financial data into ESG data
  • Award number:
    10099890
  • Fiscal year:
    2024
  • Award amount:
    $1.2953 million
  • Project type:
    Collaborative R&D
Patient Lifestyle and Disease Data Interactium (PaLaDIn)
  • Award number:
    10103989
  • Fiscal year:
    2024
  • Award amount:
    $1.2953 million
  • Project type:
    EU-Funded
Patient Lifestyle and Disease Data Interactium (PaLaDIn)
  • Award number:
    10105921
  • Fiscal year:
    2024
  • Award amount:
    $1.2953 million
  • Project type:
    EU-Funded
Treecle - data and automation to unlock woodland creation in the UK to achieve net zero
  • Award number:
    10111492
  • Fiscal year:
    2024
  • Award amount:
    $1.2953 million
  • Project type:
    SME Support