Compressive Structural BioInformatics: High Efficiency 3D Structure Compression
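Basic information
- Grant number: 8870891
- Principal investigator:
- Award amount: $459,500
- Host institution:
- Host institution country: United States
- Project type:
- Fiscal year: 2015
- Funding country: United States
- Project period: 2015-06-01 to 2018-05-31
- Project status: Completed
- Source: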
- Keywords: 3-Dimensional; Adopted; Adoption; Algorithms; Archives; Benchmarking; Big Data; Binding; Bioinformatics; Capsid; Chimera organism; Client; Collaborations; Complex; Computer software; Computers; Cryoelectron Microscopy; Data; Data Compression; Data Files; Databases; Development; Discipline; Disease; Drug effect disorder; Genomics; HIV; Imagery; Internet; Java; Knowledge; Libraries; Life; Medicine; Memory; Methods; Molecular Machines; Nucleic Acids; Performance; Process; Programming Languages; Property; Protein Analysis; Proteins; Pythons; Research; Resolution; Roentgen Rays; Running; Satellite Communications; Scientist; Shapes; Side; Speed; Stream; Structure; Students; Support System; System; Tablets; Techniques; Telephone; Television; Time; computerized data processing; dalton; distributed data; genome sequencing; handheld mobile device; laptop; memory process; molecular assembly/self assembly; open source; operation; public health relevance; structural biology; three dimensional structure; tool; transmission process; trend
Project abstract
DESCRIPTION (provided by applicant): The Protein Data Bank (PDB) archive has doubled in size since 2008 and exceeded 100,000 entries in 2014. At the same time, the size and complexity of structures are increasing dramatically; for example, the recently determined structure of the HIV capsid contains about 2.5 million atoms. The emerging techniques of integrative Structural Biology are starting to determine structures of molecular machines in the mega-Dalton range by combining cryo-Electron Microscopy, Small-Angle X-ray Scattering, X-ray, and NMR at increasingly higher resolution. Interactive visualization of large complexes exceeds the available network bandwidth and memory of typical scientists' desktops, laptops, or mobile devices. Large-scale structural analyses and queries of the archive have become a Big Data challenge. To make these structures accessible to all scientists, educators, and students, new ways of representing these data are required. In domains such as high-definition television, satellite communication, and video or audio streaming, high-efficiency compression has been key to delivering interactive media to phones, tablets, laptops, and desktops. A similar trend has emerged in the handling of whole genome sequence data. An entire discipline, "Compressive Genomics", has been developed to deal with data compression and the development of algorithms to process these data. This proposal introduces the concept of "Compressive Structural Bioinformatics", a set of compression algorithms, applications, and workflows that analyze and visualize large structures and large sets of structures at an unprecedented speed (100-1000 fold speedup) and with minimal client-side overhead. The aims of this project are:
1. Develop a compact and extensible representation of 3-D biomolecular structures.
2. Enable interactive visualization of large complexes by reducing network bandwidth and enabling data streaming.
3. Enable large-scale analyses of the PDB archive for I/O bound workflows.
4. Develop open source software libraries.
Through collaboration with developers of widely used visualization applications and distributed data-parallel workflow systems, the new techniques will be implemented and benchmarked, and reference implementations will be provided in several programming languages for easy adoption. It is expected that these new "Compressive Structural Bioinformatics" tools will enable transformative research as intended by the NIH's Big Data to Knowledge initiative.
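The abstract does not say how the compact representation would be built. As a rough illustration only, and not the project's actual format, the sketch below shows one common way to shrink atomic coordinates: store them as fixed-point integers, keep only the difference between consecutive values, and deflate the result. The function names, the `PRECISION` constant, and the 32-bit little-endian layout are all assumptions made for this example.

```python
import struct
import zlib

# Assumption: keep 3 decimal places, matching the precision of PDB-format coordinates.
PRECISION = 1000


def encode_coords(values):
    """Quantize floats to integers, delta-encode them, and deflate the packed bytes."""
    ints = [round(v * PRECISION) for v in values]
    deltas = []
    previous = 0
    for current in ints:
        # Neighbouring atoms are close in space, so differences are small numbers
        # whose bytes repeat often and compress well.
        deltas.append(current - previous)
        previous = current
    packed = struct.pack(f"<{len(deltas)}i", *deltas)
    return zlib.compress(packed, 9)


def decode_coords(blob):
    """Invert encode_coords: inflate, unpack, and accumulate the deltas."""
    packed = zlib.decompress(blob)
    deltas = struct.unpack(f"<{len(packed) // 4}i", packed)
    values = []
    running = 0
    for delta in deltas:
        running += delta
        values.append(running / PRECISION)
    return values


if __name__ == "__main__":
    # Hypothetical x-coordinates (in angstroms) of consecutive atoms.
    xs = [12.345, 12.401, 13.002, 13.250, 12.987, -1.500]
    blob = encode_coords(xs)
    print(len(blob), "compressed bytes for", len(xs), "values")
    print(decode_coords(blob))
```

The encoding is lossy beyond three decimal places, which is why the precision is an explicit constant. On an input this small the zlib header outweighs any savings; any benefit would only show up on structures with many atoms.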
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Peter W Rose
A targeted approach significantly increases the identification rate of patients with undiagnosed haemochromatosis
- DOI:
- Year: 2003
- Journal:
- Impact factor: 11.1
- Authors: E. Cadet; D. Capron; A. S. Perez; S. N. Crepin; S. Arlot; J. P. Ducroix; M. Dautréaux; P. Fardellone; P. Leflon; A. Merryweather; K. Livesey; J. J. Pointon; Peter W Rose; Jean Harcourt; Jon Emery; Jon Emery; J. M. Sueur; R. Feyt; Kathryn J. H. Robson; Jacques Rochette
- Corresponding author: Jacques Rochette
Diagnosis in general practice
- DOI: 10.1136/bmj.b1187
- Year: 2009
- Journal:
- Impact factor: 0
- Authors: Carl J Heneghan; Paul P Glasziou; M. Thompson; Peter W Rose; J. Balla; Daniel Lasserson; C. Scott; Rafael Perera
- Corresponding author: Rafael Perera
Referral of patients with a family history of breast/ovarian cancer--GPs' knowledge and expectations.
- DOI: 10.1093/fampra/18.5.487
- Year: 2001
- Journal:
- Impact factor: 2.2
- Authors: Peter W Rose; Eila Watson; Patricia Yudkin; Jon Emery; Michael Murphy; Alice Fuller; A. Lucassen
- Corresponding author: A. Lucassen
IMI-CDE: an interactive interface for collaborative mapping of study variables to common data elements
- DOI:
- Year: 2022
- Journal:
- Impact factor: 0
- Authors: Shiqiang Tao; Wei–Chun Chou; Jianfu Li; Jingcheng Du; Pritham M Ram; Rashmie Abeysinghe; Hua Xu; Xiaoqian Jiang; Peter W Rose; L. Ohno; Guoqiang Zhang
- Corresponding author: Guoqiang Zhang
Other grants held by Peter W Rose
Compressive Structural BioInformatics: High Efficiency 3D Structure Compression
- Grant number: 9242471
- Fiscal year: 2015
- Award amount: $459,500
- Project type:
Compressive Structural BioInformatics: High Efficiency 3D Structure Compression
- Grant number: 9070726
- Fiscal year: 2015
- Award amount: $459,500
- Project type:
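Similar National Natural Science Foundation of China (NSFC) grants
Long-term, specific modification of functional plasticity in the adult animal visual system using a novel visual-electrical stimulation pairing paradigm
- Grant number: 32371047
- Year approved: 2023
- Award amount: 500,000 yuan
- Project type: General Program
Bridging the digital divide for older adults: decision processes, objective barriers, and coping strategies in older adults' adoption of digital technology
- Grant number: 72303205
- Year approved: 2023
- Award amount: 300,000 yuan
- Project type: Young Scientists Fund
Improving ablation rate measurement by suppressing fluid motion and using a dual-energy-spectrum method
- Grant number: 12305261
- Year approved: 2023
- Award amount: 300,000 yuan
- Project type: Young Scientists Fund
Tunnel lining crack detection using a Transformer with multiple sparse self-attention mechanisms
- Grant number: 62301339
- Year approved: 2023
- Award amount: 300,000 yuan
- Project type: Young Scientists Fund
Policy incentives, information transmission, and mechanisms for increasing farm households' adoption of rooftop photovoltaic technology
- Grant number: 72304103
- Year approved: 2023
- Award amount: 300,000 yuan
- Project type: Young Scientists Fund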
Similar overseas grants
Optimization of electromechanical monitoring of engineered heart tissues
- Grant number: 10673513
- Fiscal year: 2023
- Award amount: $459,500
- Project type:
Engineering 3D Osteosarcoma Models to Elucidate Biology and Inform Drug Discovery
- Grant number: 10564801
- Fiscal year: 2023
- Award amount: $459,500
- Project type:
Motion-Resistant Background Subtraction Angiography with Deep Learning: Real-Time, Edge Hardware Implementation and Product Development
- Grant number: 10602275
- Fiscal year: 2023
- Award amount: $459,500
- Project type: