Collaborative Research: CNS Core: Small: Efficient Ways to Enlarge Practical DNA Storage Capacity by Integrating Bio-Computer Technologies
合作研究:中枢神经系统核心:小型:通过集成生物计算机技术扩大实用 DNA 存储容量的有效方法
基本信息
- 批准号:2204656
- 负责人:
- 金额:$ 30万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-07-15 至 2025-06-30
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
The world's digital data increases immensely each year. By 2025, it will reach 175 Zettabytes (ZB). Most human activities are recorded in digital format today. However, data recorded in digital media cannot last very long. Therefore, valuable data cannot be preserved today with our current storage technologies and devices for a long duration (beyond 15 years). The capacity of existing storage media cannot keep up with the growth of the amount of digital data. Also, all storage devices could become obsolete within several years, so the data stored are vulnerable as they perish as time goes by. Therefore, synthetic deoxyribonucleic acid (DNA) becomes an attractive alternative storage medium due to its high density and long durability. These characteristics of DNA storage make it a great candidate for archival storage. However, the preliminary study of the project indicates the practical DNA storage tube capacity based on current technologies is only around 250GB, which is much less than the expected capacity. The major reason is that primer-payload collisions in DNA storage can drastically reduce the number of usable primers in a tube as the data payload size increases. The use of primers is essential for random access to DNA data. In this project, an interdisciplinary team is formed to investigate both bio and storage approaches that can improve the scalability of DNA storage. Among the many factors that can scale up DNA storage, the project plans to investigate the following questions: 1) How to identify more primers for a primer library to be used in DNA storage? 2) Given a primer library, how to efficiently allocate payload data to avoid primer-payload collisions to increase DNA storage capacity? and 3) How to effectively use a popular technique called data deduplication in data backup applications to further increase the storage capability of DNA storage? With a deep understanding of molecular biology and computer storage technologies and systems, this interdisciplinary team fosters several innovative ways of understanding the fundamental issues of DNA storage and will develop necessary genome engineering, sequencing techniques, software, and new algorithms to optimize the process of converting the world's digital data to DNA storage for archiving and preserving today's valuable digital data for hundreds of years in the future. The goal of storing the world's digital data in DNA storage to preserve all human activities can move one step closer with this project. The potential research outcomes of the project include fostering the advancement of bioscience and storage technologies, preserving human activities in DNA storage for hundreds of years, and facilitating fundamental understanding, identifying tradeoffs, and creating efficient ways of scaling up DNA storage. The project will provide an ideal inter-disciplinary thinking, hands-on learning, and development environment to teach computer science and electrical and computer engineering graduate and undergraduate students important system building and experimental skills that are critical for today's and the future IT workforce. The research outcomes of the project will be incorporated into the classroom teaching of the team members, for both class projects and the core courses in computer science and electrical and computer engineering. The team plans to include the obtained research results in a new course on Storage Technologies /Systems for Big Data for students in a Data Science Program, as well as in undergraduate senior design and directed research studies. The team plans to disseminate the research advances to industrial collaborators, and through publications, presentations, and public release of research data, software tools, and prototype systems to the research community. The team is committed to recruiting underrepresented undergraduate and graduate students to the project. Research results will be made quickly available to the general public and disseminated via websites and open source repositories like GitHub.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
世界数字数据每年都在大幅增加。到 2025 年,它将达到 175 泽字节 (ZB)。如今,大多数人类活动都以数字格式记录。然而,记录在数字媒体中的数据不能保存很长时间。因此,目前的存储技术和设备无法长期(超过 15 年)保存有价值的数据。现有存储介质的容量无法跟上数字数据量的增长。此外,所有存储设备都可能在几年内变得过时,因此存储的数据很容易受到攻击,因为它们会随着时间的推移而消失。因此,合成脱氧核糖核酸(DNA)由于其高密度和长耐用性而成为一种有吸引力的替代存储介质。 DNA 存储的这些特性使其成为档案存储的绝佳选择。但该项目的初步研究表明,基于现有技术的实际DNA存储管容量仅为250GB左右,远低于预期容量。主要原因是,随着数据有效负载大小的增加,DNA 存储中的引物有效负载碰撞会大大减少试管中可用引物的数量。引物的使用对于随机获取 DNA 数据至关重要。在该项目中,成立了一个跨学科团队来研究可以提高 DNA 存储可扩展性的生物和存储方法。在能够扩大DNA存储规模的众多因素中,该项目计划研究以下问题:1)如何为用于DNA存储的引物库识别更多引物? 2)给定一个引物库,如何有效地分配有效负载数据以避免引物与有效负载碰撞,从而增加DNA存储容量? 3)如何在数据备份应用中有效利用流行的重复数据删除技术来进一步提高DNA存储的存储能力? 凭借对分子生物学和计算机存储技术和系统的深入了解,这个跨学科团队培育了几种理解 DNA 存储基本问题的创新方法,并将开发必要的基因组工程、测序技术、软件和新算法来优化转换过程将世界各地的数字数据存储到 DNA 存储中,以便在未来数百年中归档和保存当今有价值的数字数据。通过该项目,将世界数字数据存储在 DNA 存储中以保存所有人类活动的目标又向前迈进了一步。该项目的潜在研究成果包括促进生物科学和存储技术的进步、在 DNA 存储中保存人类活动数百年、促进基本理解、确定权衡以及创建扩大 DNA 存储规模的有效方法。该项目将提供理想的跨学科思维、实践学习和开发环境,教授计算机科学、电气和计算机工程研究生和本科生重要的系统构建和实验技能,这些技能对当今和未来的 IT 劳动力至关重要。该项目的研究成果将纳入团队成员的课堂教学,包括课堂项目以及计算机科学和电气与计算机工程的核心课程。该团队计划将获得的研究成果纳入数据科学项目学生的大数据存储技术/系统新课程以及本科生高级设计和定向研究中。该团队计划向工业合作者传播研究进展,并通过出版物、演示和向研究界公开发布研究数据、软件工具和原型系统。该团队致力于招募代表性不足的本科生和研究生参与该项目。研究结果将快速向公众提供,并通过网站和 GitHub 等开源存储库传播。该奖项反映了 NSF 的法定使命,并通过使用基金会的智力价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Work-in-Progress: ExpCache: Online-Learning based Cache Replacement Policy for Non-Volatile Memory
正在进行中的工作:ExpCache:基于在线学习的非易失性内存缓存替换策略
- DOI:10.1109/cases55004.2022.00010
- 发表时间:2022-10
- 期刊:
- 影响因子:0
- 作者:Yang, Jinfeng;Li, Bingzhe;Yuan, Jianjun;Shen, Zhaoyan;Du, David;Lilja, David
- 通讯作者:Lilja, David
DP-DNA: A Digital Pattern-Aware DNA Encoding Scheme to Improve Encoding Density of DNA Storage
DP-DNA:一种提高 DNA 存储编码密度的数字模式感知 DNA 编码方案
- DOI:10.1109/mascots59514.2023.10387655
- 发表时间:2023-10
- 期刊:
- 影响因子:0
- 作者:Li, Bingzhe;Ou, Li;Yuan, Bo;Du, David H.C.
- 通讯作者:Du, David H.C.
SMRTS: A Performance and Cost-Effectiveness Optimized SSD-SMR Tiered File System with Data Deduplication
SMRTS:具有重复数据删除功能的性能和成本效益优化的 SSD-SMR 分层文件系统
- DOI:10.1109/iccd58817.2023.00050
- 发表时间:2023-11-06
- 期刊:
- 影响因子:0
- 作者:Zhichao Cao;Hao Wen;Fenggang Wu;David Hung
- 通讯作者:David Hung
Machine Learning-based Adaptive Migration Algorithm for Hybrid Storage Systems
基于机器学习的混合存储系统自适应迁移算法
- DOI:10.1109/nas55553.2022.9925545
- 发表时间:2022-10-01
- 期刊:
- 影响因子:0
- 作者:Milan M. Shetti;Bingzhe Li;D. Du
- 通讯作者:D. Du
Machine Learning-based Adaptive Migration Algorithm for Hybrid Storage Systems
基于机器学习的混合存储系统自适应迁移算法
- DOI:
- 发表时间:2022-10
- 期刊:
- 影响因子:0
- 作者:. Milan Shetti; Bingzhe Li
- 通讯作者:Bingzhe Li
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
David Du其他文献
Rendering Color Information Using Haptic Feedback
使用触觉反馈渲染颜色信息
- DOI:
- 发表时间:
2005 - 期刊:
- 影响因子:0
- 作者:
S. Chakrabarti;S. Pramanik;David Du;Rajashree Paul - 通讯作者:
Rajashree Paul
Learning Agent-based Modeling with LLM Companions: Experiences of Novices and Experts Using ChatGPT & NetLogo Chat
与法学硕士同伴学习基于代理的建模:新手和专家使用 ChatGPT 的经验
- DOI:
10.1145/3613904.3642377 - 发表时间:
2024-01-30 - 期刊:
- 影响因子:0
- 作者:
John Chen;Xi Lu;Michael Rejtig;David Du;Ruth Bagley;Michael S. Horn;Uri Wilensky - 通讯作者:
Uri Wilensky
Scalable Computing : Practice and Experience
可扩展计算:实践和经验
- DOI:
10.1016/j.jaac.2019.07.003 - 发表时间:
2007 - 期刊:
- 影响因子:13.3
- 作者:
Dazhang Gu;Marcin Paprzycki;Alexander Denisjuk;Shahram Rahimi;Hong Shen;Domenico Talia;P. Myszkowski;Peter Arbenz;Dorothy Bollman;Luigi Brugnano;F. Desprez;David Du;Y. Fet;Len Freeman;I. Gladwell;Andrzej Goscinski;Emilio Hernádez;Universidad Simón Bolívar;J. Van katwijk;David Keyes;Vadim Kotov;Thomas Ludwig;S. Margenov;Oscar Naím;Lalit Patnaik;D. Petcu;Siang Wun Song;Karl Boleslaw;Szymanski;R. Trobec;C. Tropper;P. Tvrdík;M. Vajtersic;L. Welch;Janusz Zalewski;Daniel Janies;P. Goloboff;Diego Pol;J. Sanford;J. Baumgardner;W. Brewer;P. Gibson;W. ReMine;A. Rose;Eric A. Stahlberg;Iris Meier;E. List;D. Berryman;A. J. Palmer;E. Jane Gosney;S. Okada;B. Kelder;J. Lichtenberg;J. Kopchick;Guruprasad Nagaraja;Lisa Gandy;B. Gupta;M. Wainer;D. Lewis;A. Sedighi - 通讯作者:
A. Sedighi
Age-dependent Decline of Copper Clearance at the Blood-Cerebrospinal Fluid Barrier.
血脑脊液屏障铜清除率随年龄的下降。
- DOI:
- 发表时间:
2021 - 期刊:
- 影响因子:3.4
- 作者:
Luke L. Liu;David Du;Wei Zheng;Yanshu Zhang - 通讯作者:
Yanshu Zhang
David Du的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('David Du', 18)}}的其他基金
CSR: Small: Heterogeneous Storage Systems with Emerging Technologies for Solving Big Data Problems
CSR:小型:利用新兴技术解决大数据问题的异构存储系统
- 批准号:
1812537 - 财政年份:2018
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR: Small: Efficient Usage of Shingled Magnetic Recording (SMR)Drives
CSR:小型:叠瓦式磁记录 (SMR) 驱动器的高效使用
- 批准号:
1525617 - 财政年份:2015
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
I/UCRC Phase II: Center on Intelligent Storage
I/UCRC二期:智能存储中心
- 批准号:
1439622 - 财政年份:2014
- 资助金额:
$ 30万 - 项目类别:
Continuing Grant
Travel Support for the 43rd Annual International Conference on Parallel Processing (ICPP-2014)
第 43 届并行处理国际会议 (ICPP-2014) 的差旅支持
- 批准号:
1438816 - 财政年份:2014
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR: Small: Collaborative Research: Software Defined Energy Adaptation in Large Scale Data Centers
CSR:小型:协作研究:大型数据中心的软件定义能源适应
- 批准号:
1421913 - 财政年份:2014
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
II-NEW: One Cloud Does Not Fit All: Minnesota Integrated Cloud Systems Research Testbed (MiST)
II-新:单一云并不能满足所有需求:明尼苏达州集成云系统研究测试平台 (MiST)
- 批准号:
1305237 - 财政年份:2013
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
NeTS: Small: Information Dissemination in Vehicular Networks for Reducing Traffic Congestion
NeTS:小型:车载网络中的信息传播以减少交通拥堵
- 批准号:
1217572 - 财政年份:2012
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR: Small: Prediction-Based Data Placement for New Memory and Storage Hierarchies
CSR:小型:新内存和存储层次结构的基于预测的数据放置
- 批准号:
1217569 - 财政年份:2012
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Travel Support for International Conference on Distributed Computing Systems (ICDCS) 2011
2011 年国际分布式计算系统会议 (ICDCS) 差旅支持
- 批准号:
1113505 - 财政年份:2011
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
CSR:Small: Efficient FTL Buffer Management for High-Performance Solid State Drives
CSR:Small:适用于高性能固态硬盘的高效 FTL 缓冲区管理
- 批准号:
1115471 - 财政年份:2011
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
相似国自然基金
LncMOB3A-2编码多肽在肠外致病性大肠杆菌入侵中枢神经系统中的作用机制研究
- 批准号:32302954
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
S100A9作为万古霉素儿童中枢神经系统抗感染个体化治疗预测因子的机制研究和量效分析
- 批准号:82304631
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
失重效应影响中枢神经系统药物脑空间分布及药动学的机制和调控研究
- 批准号:82373939
- 批准年份:2023
- 资助金额:48 万元
- 项目类别:面上项目
染色质重塑因子CHD3调控中枢神经系统少突胶质细胞发育的机制研究
- 批准号:82301950
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于人体镜像中枢神经系统和信任度的假肢互适应机制研究
- 批准号:62363006
- 批准年份:2023
- 资助金额:31 万元
- 项目类别:地区科学基金项目
相似海外基金
Collaborative Research: CISE-MSI: RCBP-RF: CNS: ESD4CDaT - Efficient System Design for Cancer Detection and Treatment
合作研究:CISE-MSI:RCBP-RF:CNS:ESD4CDaT - 癌症检测和治疗的高效系统设计
- 批准号:
2318573 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Small: Accelerating Serverless Cloud Network Performance
协作研究:CNS 核心:小型:加速无服务器云网络性能
- 批准号:
2229454 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Center of Biomedical Research Excellence in CNS Metabolism
中枢神经系统代谢生物医学卓越研究中心
- 批准号:
10557542 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Collaborative Research: CNS Core: Small: A Compilation System for Mapping Deep Learning Models to Tensorized Instructions (DELITE)
合作研究:CNS Core:Small:将深度学习模型映射到张量化指令的编译系统(DELITE)
- 批准号:
2230945 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Standard Grant
Collaborative Research: CNS Core: Medium: Movement of Computation and Data in Splitkernel-disaggregated, Data-intensive Systems
合作研究:CNS 核心:媒介:Splitkernel 分解的数据密集型系统中的计算和数据移动
- 批准号:
2406598 - 财政年份:2023
- 资助金额:
$ 30万 - 项目类别:
Continuing Grant