Algorithms to Improve the Efficiency of Data Compression and Caching on Wide-Area Networks

Basic information

Project abstract

The goal of this research project is to develop new lossless text compression algorithms and software tools that incorporate compression into the MIME/HTML standards. The approach consists of first encoding the text with the help of a dictionary, to exploit the natural redundancy of the language, and then compressing the encoded text with a pre-existing compression algorithm. The encoding scheme depends on the specific characteristics of the compression algorithm; bzip2, which is based on the Burrows-Wheeler transform, is used as the main backend compression algorithm. The project develops a basic understanding of how the encoding schemes and the compression algorithms interact. The performance of the algorithms is measured using both compression and communication metrics. Infrastructure tools that use dynamic caching of dictionaries are developed to embed compression into the MIME/HTML standards. This research will have an impact on the future of information technology by providing methods for data delivery systems in which communication bandwidth is at a premium and archival storage is an exponentially costly endeavor. The new lossless text compression algorithms are expected to improve the compression ratio by 5 to 10% over the best known pre-existing compression algorithms, which might translate into a reduction of more than 50% of the text traffic on the Internet. The experimental research is linked to educational goals through rapid dissemination of results in reports, conference and journal papers, doctoral dissertations and master's theses, and through transfer of the research knowledge into the graduate curriculum. Software tools developed under this grant will be shared via a web site.
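The two-stage approach described in the abstract (dictionary encoding followed by a standard compressor) can be illustrated with a minimal Python sketch. It is not the project's actual encoding scheme: the five-word dictionary `WORDS`, the escape character `ESC`, and the function names are all assumptions made for illustration, and the sketch makes no claim about the compression ratio it achieves. Only the backend, Python's standard `bz2` module, corresponds to the bzip2 algorithm named in the abstract.

```python
import bz2
import re

# Hypothetical shared dictionary (an assumption for this sketch): a real
# system would use a large, language-specific word list known to both ends.
WORDS = ["the", "compression", "algorithm", "text", "dictionary"]

# Assumed escape character that marks a dictionary code; it must not
# occur in the input text, otherwise decoding would be ambiguous.
ESC = "\x01"

# Map each dictionary word to a two-character code: ESC + one letter.
ENCODE = {word: ESC + chr(ord("A") + i) for i, word in enumerate(WORDS)}
DECODE = {code: word for word, code in ENCODE.items()}

# Split text into alternating runs of letters and non-letters, so that
# joining the tokens reproduces the input exactly.
TOKEN = re.compile(r"[A-Za-z]+|[^A-Za-z]+")


def encode_text(text: str) -> str:
    """Replace dictionary words with short codes (stage 1)."""
    if ESC in text:
        raise ValueError("input contains the reserved escape character")
    return "".join(ENCODE.get(tok, tok) for tok in TOKEN.findall(text))


def decode_text(encoded: str) -> str:
    """Invert encode_text by expanding every ESC-prefixed code."""
    out = []
    i = 0
    while i < len(encoded):
        if encoded[i] == ESC:
            out.append(DECODE[encoded[i:i + 2]])
            i += 2
        else:
            out.append(encoded[i])
            i += 1
    return "".join(out)


def compress(text: str) -> bytes:
    """Dictionary-encode the text, then compress it with bzip2 (stage 2)."""
    return bz2.compress(encode_text(text).encode("utf-8"))


def decompress(blob: bytes) -> str:
    """Undo bzip2 compression, then undo the dictionary encoding."""
    return decode_text(bz2.decompress(blob).decode("utf-8"))


if __name__ == "__main__":
    sample = "the dictionary helps the compression algorithm shrink the text"
    assert decompress(compress(sample)) == sample
```

Because both stages are invertible, the round trip is lossless as long as sender and receiver hold the same dictionary, which is where the dynamic dictionary caching mentioned in the abstract would come in.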

Project outcomes

Journal papers (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

No data available

Data last updated: 2024-06-01

Other publications by Amar Mukherjee

Higher Compression from Burrows-Wheeler Transform for DNA Sequence
  • DOI:
  • Year published:
    2017
  • Journal:
  • Impact factor:
    0
  • Authors:
    A. Gérard;Trujilla Lobo;Chun Li;Huan Liu;Junhong Liu;Yuping Qin;Zhifu Wangb;Jolanta Kawulok;Jouni Sirén;Niko Välimäki;Veli Mäkinen;Rahul Vishwakarma;Newsha Amiri;S. Wandelt;Marc Bux;Yong Zhang;Amar Mukherjee;M. Powell
  • Corresponding author:
    M. Powell
An efficient algorithm for detection of road-like structures in satellite images
ICASE/LaRC Symposium on Visualizing Time-Varying Data
  • DOI:
  • Year published:
    1996
  • Journal:
  • Impact factor:
    0
  • Authors:
    D. Banks;T. Crockett;K. Stacy;bullet Hampton;Virginia K Stacy;N. Max;B. Becker;D. Banks;Mississippi;T. Crockett;Kathy Stacy;D. Banks;K. Stacy;Mary Adams;T. Crockett;Kwan;K. Severance;Lambertus Hesselink;R. Crawfis;Lawrence;Chuck Hansen;Duane Melson;L. Treinish;R. Haimes;Massachusetts;N. Max;Velvin Watson;Randy L. Ribler;Anup Mathur;Marc Abrams;Pak Chnng Wong;R. D. Bergeron;Will H Scullin;T. T. Kwan;Daniel A Reed;Eric J Davies;William B Cowan;B. Becket;Vineet Goel;Amar Mukherjee;R. Moorhead;Zhifan Zhu;Kelly Gaither;John Vanderzwagg;Tzi;William Mattson;Rick Angelini;Larry Matthias;Paula Detweiler;James Patten;G. Erlebacher;Richard J Schwartz;T. Crockett;William J Bent;R. Wilmoth;Bart A Singer;Patricia J. Crossno;M. Cheng;M. Livny;R. Ramakrishnan;Will Bene;Bart A Singer
  • Corresponding author:
    Bart A Singer

Other grants held by Amar Mukherjee

U.S.-New Zealand and Australia Collaboration on Research for Data Compression
  • Award number:
    0331188
  • Fiscal year:
    2004
  • Funding amount:
    $230,000
  • Project type:
    Standard Grant
ITR Collaborative Research: Compressed Search and Retrieval for Very Large Text and Image Repositories
  • Award number:
    0312724
  • Fiscal year:
    2003
  • Funding amount:
    $230,000
  • Project type:
    Standard Grant
Collaborative: Compressed Domain Search for Text and Images by Sorted Contexts
  • Award number:
    0207819
  • Fiscal year:
    2002
  • Funding amount:
    $230,000
  • Project type:
    Continuing Grant
CISE Research Instrumentation: Equipment for Research in Special Purpose VLSI Architecture
  • Award number:
    9222798
  • Fiscal year:
    1993
  • Funding amount:
    $230,000
  • Project type:
    Standard Grant
String Processing Computer Architecture
  • Award number:
    8005096
  • Fiscal year:
    1980
  • Funding amount:
    $230,000
  • Project type:
    Standard Grant
Parallel Processing, Multiprocessing and Computer Organization
  • Award number:
    7604763
  • Fiscal year:
    1976
  • Funding amount:
    $230,000
  • Project type:
    Standard Grant

Similar grants from the National Natural Science Foundation of China (NSFC)

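Transition metal complex-polymer composite systems for improving cooperative catalytic efficiency
  • Award number:
    22371063
  • Year approved:
    2023
  • Funding amount:
    CNY 500,000
  • Project type:
    General Program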
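Analysis and improvement of regional energy-saving and emission-reduction efficiency under an "energy-environment-health" framework: the case of the Yangtze River Delta
  • Award number:
    72304115
  • Year approved:
    2023
  • Funding amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund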
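Significantly improving the efficiency of copper phenylacetylide-based materials in removing high-concentration organic pollutants, and the mechanism behind the enhancement
  • Award number:
    22376168
  • Year approved:
    2023
  • Funding amount:
    CNY 500,000
  • Project type:
    General Program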
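Fundamental research on developing antimony sulfide solar cells and improving their conversion efficiency using a "liquid-phase atomic layer deposition" method
  • Award number:
    62374065
  • Year approved:
    2023
  • Funding amount:
    CNY 480,000
  • Project type:
    General Program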
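Understanding the efficiency mechanisms of the 3D design process for office buildings, driven by time-series data, and methods for improving it
  • Award number:
    52308002
  • Year approved:
    2023
  • Funding amount:
    CNY 300,000
  • Project type:
    Young Scientists Fund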

Similar overseas grants

ViMuSe - a video-based AI music recommendation engine to improve creative efficiency and diversity.
  • Award number:
    10104871
  • Fiscal year:
    2024
  • Funding amount:
    $230,000
  • Project type:
    Collaborative R&D
Early life nutrition to improve feed efficiency in commercial dairy goats
  • Award number:
    LP220100426
  • Fiscal year:
    2023
  • Funding amount:
    $230,000
  • Project type:
    Linkage Projects
Power Management of Train Traffic to Improve Energy Efficiency and Suppress Peak Power and Negative Sequence Current
  • Award number:
    23K03822
  • Fiscal year:
    2023
  • Funding amount:
    $230,000
  • Project type:
    Grant-in-Aid for Scientific Research (C)
Developing and validating a training program to improve domain-specific working memory efficiency in second language.
  • Award number:
    23K17499
  • Fiscal year:
    2023
  • Funding amount:
    $230,000
  • Project type:
    Grant-in-Aid for Challenging Research (Exploratory)
An innovative all-in-one fleet operations solution using AI technology to improve efficiency, reduce operating costs by 15% and lower carbon emissions
  • Award number:
    83001607
  • Fiscal year:
    2023
  • Funding amount:
    $230,000
  • Project type:
    Innovation Loans