BioHDF - Open Binary File Standards for Bioinformatics
BioHDF - 生物信息学开放二进制文件标准
基本信息
- 批准号:6992995
- 负责人:
- 金额:$ 14.28万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2005
- 资助国家:美国
- 起止时间:2005-09-05 至 2006-08-31
- 项目状态:已结题
- 来源:
- 关键词:DNAartificial intelligencebioengineering /biomedical engineeringbioinformaticscomputational biologycomputer program /softwarecomputer system design /evaluationfunctional /structural genomicsgenetic mappinggenetic polymorphismmathematicsmolecular biology information systemnucleic acid sequencesingle nucleotide polymorphismvirus genetics
项目摘要
DESCRIPTION (provided by applicant): Geospiza Inc. and the National Center for Supercomputing Applications (NCSA) are creating a standards based software framework around NCSA's Heirarchical Data Format (HDF5). The envisioned framework will integrate algorithms important in DNA and protein sequence analysis to create scalable high throughput software systems which will be accessed using new graphical user interfaces (GUIs) to provide researchers with new views of their data to finish sequencing projects in large-scale genome sequencing, microbial genome sequencing, viral epidemiology, polymorphism detection, phylogenetic analysis, multi-locus sequence typing, confirmatory sequencing, and EST analysis.
In our vision, algorithms will be either integrated into the system to directly read and write from HDF5 project files, or they will communicate with project files via filter programs that produce standardized XML formatted data. Through this model, a scalable solution will support different applications of DNA sequencing, fulfilling the many needs and requirements expressed by the medical research community now and into the future. As the first step in this process we will, define requirements for editing and versioning data in DNA sequencing, research and propose data models for the computational phases of DNA sequencing and annotating DNA sequence data using existing standards, create a prototype application for DNA sequencing based SNP discovery, and engage the bioinformatics community for BioHDF adoption.
In the past ten years the cost of sequencing DNA has dropped over 1000 fold and the amount of raw sequence data, entering our national repositories is doubling every 12 months. DNA sequencing is fundamental to biological research activities such as genomics, systems biology, and clinical medicine. Proposals are being sought to decrease sequencing costs by two orders of magnitude through technology refinements with an ultimate vision of developing technology to sequence human genome equivalents for $1000 each. The amount of data that will be produced through these endeavors is unimaginable. However, the $1,000 genome will not advance medical research unless we integrate all phases of the DNA sequencing process and treat the creation, management, finishing, analysis, and sharing of the data as common goals.
描述(由申请人提供):Geospiza Inc.和国家超级计算应用程序中心(NCSA)正在围绕NCSA的继承人结构数据格式(HDF5)创建基于标准的软件框架。 The envisioned framework will integrate algorithms important in DNA and protein sequence analysis to create scalable high throughput software systems which will be accessed using new graphical user interfaces (GUIs) to provide researchers with new views of their data to finish sequencing projects in large-scale genome sequencing, microbial genome sequencing, viral epidemiology, polymorphism detection, phylogenetic analysis, multi-locus sequence typing,确认性测序和EST分析。
在我们的愿景中,算法要么集成到系统中以直接从HDF5项目文件读取和写入,要么通过产生标准化的XML格式数据的过滤器程序与项目文件进行通信。通过此模型,可扩展的解决方案将支持DNA测序的不同应用,从而满足医学研究界和未来医学研究界表达的许多需求和要求。作为此过程的第一步,我们将在DNA测序中定义和使用现有标准的DNA测序和注释DNA序列数据的数据模型编辑和版本的数据模型,为基于DNA测序的SNP发现创建原型的DNA序列数据,并为BioInforms Communition提供了生物源性及其对生物HDF的生物学群体的应用。
在过去的十年中,测序DNA的成本下降了1000倍,而原始序列数据的量则下降了,进入我们的国家存储库每12个月增加一倍。 DNA测序是基因组学,系统生物学和临床医学等生物学研究活动的基础。通过技术的改进,提出的提案通过两种数量级来降低测序成本,并具有开发技术的最终愿景,以每人1000美元的价格对人类基因组等效序列进行序列。通过这些努力产生的数据数量是不可想象的。但是,除非我们整合DNA测序过程的所有阶段并处理数据的创建,管理,分析,分析和共享数据作为共同目标,否则1,000美元的基因组将不会进步医学研究。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
TODD M SMITH其他文献
TODD M SMITH的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('TODD M SMITH', 18)}}的其他基金
Second Generation DNA Sequence Management Tools
第二代 DNA 序列管理工具
- 批准号:
6622259 - 财政年份:2000
- 资助金额:
$ 14.28万 - 项目类别:
Second Generation DNA Sequence Management Tools
第二代 DNA 序列管理工具
- 批准号:
6912979 - 财政年份:2000
- 资助金额:
$ 14.28万 - 项目类别:
Second Generation DNA Sequence Management Tools
第二代 DNA 序列管理工具
- 批准号:
6444292 - 财政年份:2000
- 资助金额:
$ 14.28万 - 项目类别:
SECOND GENERATION OF DNA SEQUENCE MANAGEMENT TOOLS
第二代 DNA 序列管理工具
- 批准号:
6211967 - 财政年份:2000
- 资助金额:
$ 14.28万 - 项目类别:
SECOND GENERATION EST CLUSTER AND ANALYSIS TOOLS
第二代EST集群和分析工具
- 批准号:
6017182 - 财政年份:1999
- 资助金额:
$ 14.28万 - 项目类别:
相似国自然基金
人工智能驱动的营销模式和消费者行为研究
- 批准号:72332006
- 批准年份:2023
- 资助金额:165 万元
- 项目类别:重点项目
基于“人工智能算法+高精度遥感数据”的棉花表型信息识别及解析
- 批准号:32360436
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
巴氏杀菌乳中金黄色葡萄球菌和肠毒素A风险预测和溯源的人工智能模型构建研究
- 批准号:32302241
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
制造企业人工智能工作场景下员工AI认同影响机制与员工主动行为内在机理研究
- 批准号:72362025
- 批准年份:2023
- 资助金额:27 万元
- 项目类别:地区科学基金项目
基于原子贡献与人工智能的萃取精馏溶剂分子设计研究
- 批准号:22308037
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
A Comprehensive Structural and Dynamic Map of DNA Duplex Ends
DNA 双链体末端的全面结构和动态图
- 批准号:
7024252 - 财政年份:2006
- 资助金额:
$ 14.28万 - 项目类别:
Vortex Tubed Thermocycler with Intelligent Software
带智能软件的涡流管热循环仪
- 批准号:
6810083 - 财政年份:2004
- 资助金额:
$ 14.28万 - 项目类别:
Vortex Tubed Thermocycler with Intelligent Software
带智能软件的涡流管热循环仪
- 批准号:
7243612 - 财政年份:2004
- 资助金额:
$ 14.28万 - 项目类别:
Vortex Tubed Thermocycler with Intelligent Software
带智能软件的涡流管热循环仪
- 批准号:
6914863 - 财政年份:2004
- 资助金额:
$ 14.28万 - 项目类别:
AC IMPEDANCE SPECTROSCOPY FOR HIV DNA DETECTION
用于 HIV DNA 检测的交流阻抗光谱
- 批准号:
2005782 - 财政年份:1997
- 资助金额:
$ 14.28万 - 项目类别: