CAREER: Enable Robust Virtualized Hosting Infrastructures via Coordinated Learning, Recovery, and Diagnosis
职业:通过协调学习、恢复和诊断实现强大的虚拟化托管基础设施
基本信息
- 批准号:1149445
- 负责人:
- 金额:$ 45万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2012
- 资助国家:美国
- 起止时间:2012-01-01 至 2018-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Large-scale virtualized hosting infrastructures have become the fundamental platforms for many real world systems such as cloud computing infrastructures, enterprise data centers, educational computing platforms, and data-intensive computing systems. However, due to their inherent complexity and sharing nature, hosting infrastructures are prone to various performance anomalies caused by software bugs or hardware failures. The overarching objective of this proposal is to systematically explore innovative runtime reliability management techniques for large-scale virtualized hosting infrastructures. Our research focuses on handling performance anomalies in distributed systems that are often very difficult to reproduce offline. The PI plans to combine the power of online learning, knowledge-driven first-response recovery, and in-situ diagnosis to handle unexpected system anomalies more efficiently and effectively. She aims at transforming the runtime system anomaly management from a trial-and-error guessing game into an efficient knowledge-driven self-healing process. Techniques developed in this project will generate significant impact on improving the reliability and diagnosability of many real world hosting infrastructure systems. The PI plans to develop new course modules and programming courseware based on the research prototype developed in this project. The prototype developed in this project will be applied to the Virtual Computing Lab at NCSU, an educational computing platform for K-12, community colleges and universities. She will also disseminate her results and collected data broadly through publication and technology transfer. Developed software artifacts and experimental datasets will also be released for public use.
大规模虚拟化托管基础设施已成为云计算基础设施、企业数据中心、教育计算平台和数据密集型计算系统等许多现实世界系统的基础平台。然而,由于其固有的复杂性和共享性质,托管基础设施很容易出现由软件错误或硬件故障引起的各种性能异常。 该提案的总体目标是系统地探索大规模虚拟化托管基础设施的创新运行时可靠性管理技术。我们的研究重点是处理分布式系统中的性能异常,这些异常通常很难离线重现。 PI 计划将在线学习、知识驱动的第一响应恢复和现场诊断的力量结合起来,更高效地处理意外的系统异常情况。她的目标是将运行时系统异常管理从试错猜谜游戏转变为高效的知识驱动的自我修复过程。该项目开发的技术将对提高许多现实世界托管基础设施系统的可靠性和可诊断性产生重大影响。 PI计划基于本项目开发的研究原型开发新的课程模块和编程课件。该项目开发的原型将应用于北卡罗来纳州立大学的虚拟计算实验室,这是一个面向 K-12、社区学院和大学的教育计算平台。她还将通过出版和技术转让广泛传播她的研究结果和收集的数据。开发的软件工件和实验数据集也将发布供公众使用。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Xiaohui Gu其他文献
Aerial Reconfigurable Intelligent Surface-Assisted Terrestrial Communications
空中可重构智能地面辅助地面通信
- DOI:
10.1109/iotm.001.2300141 - 发表时间:
2024-03-01 - 期刊:
- 影响因子:0
- 作者:
Xiaohui Gu;Wei Duan;Guoan Zhang;Miaowen Wen;Jaeho Choi;Pin - 通讯作者:
Pin
A Study of Security Vulnerabilities on Docker Hub
Docker Hub安全漏洞研究
- DOI:
10.1145/3029806.3029832 - 发表时间:
2017-03-22 - 期刊:
- 影响因子:0
- 作者:
Rui Shu;Xiaohui Gu;W. Enck - 通讯作者:
W. Enck
Fatigue life prediction of high‐speed train bearings based on the generalized linear cumulative damage theory
基于广义线性累积损伤理论的高速列车轴承疲劳寿命预测
- DOI:
10.1111/ffe.13984 - 发表时间:
2023-03-07 - 期刊:
- 影响因子:0
- 作者:
He‐Fei Li;Jia Wei;Shaochang Li;Yong;Xiaohui Gu;Ze‐Chao Liu;Shao - 通讯作者:
Shao
DScope: Detecting Real-World Data Corruption Hang Bugs in Cloud Server Systems
DScope:检测云服务器系统中真实的数据损坏挂起错误
- DOI:
10.1145/3267809.3267844 - 发表时间:
2018-10-11 - 期刊:
- 影响因子:0
- 作者:
Ting Dai;Jingzhu He;Xiaohui Gu;Shan Lu;Peipei Wang - 通讯作者:
Peipei Wang
Cooperative vehicular networks over Nakagami-m fading: Joint power control and spectrum scheduling
Nakagami-m 衰落下的协作车载网络:联合功率控制和频谱调度
- DOI:
10.1016/j.comnet.2023.110052 - 发表时间:
2023-10-01 - 期刊:
- 影响因子:0
- 作者:
Guoan Zhang;Wei Cao;Xiaohui Gu - 通讯作者:
Xiaohui Gu
Xiaohui Gu的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Xiaohui Gu', 18)}}的其他基金
CSR: Medium:Collaborative Research:Holistic, Cross-Site, Hybrid System Anomaly Debugging for Large Scale Hosting Infrastructures
CSR:中:协作研究:大规模托管基础设施的整体、跨站点、混合系统异常调试
- 批准号:
1513942 - 财政年份:2015
- 资助金额:
$ 45万 - 项目类别:
Continuing Grant
CSR:Small: Online System Anomaly Prediction and Diagnosis for Large-Scale Hosting Infrastructures
CSR:Small:大规模托管基础设施的在线系统异常预测与诊断
- 批准号:
0915567 - 财政年份:2009
- 资助金额:
$ 45万 - 项目类别:
Standard Grant
CSR: Small: Collaborative Research: Hybrid Opportunistic Computing for Green Clouds
CSR:小型:协作研究:绿色云的混合机会计算
- 批准号:
0915861 - 财政年份:2009
- 资助金额:
$ 45万 - 项目类别:
Continuing Grant
相似海外基金
Development of novel methods to enable robust comparison of real-world Progression Free Survival (rwPFS) and Clinical Trial PFS in Multiple Myeloma
开发新方法以对多发性骨髓瘤的真实无进展生存期 (rwPFS) 和临床试验 PFS 进行稳健比较
- 批准号:
10797236 - 财政年份:2023
- 资助金额:
$ 45万 - 项目类别:
Methods to enable robust and efficient use of genetic summary data
能够稳健、高效地使用遗传摘要数据的方法
- 批准号:
10653969 - 财政年份:2020
- 资助金额:
$ 45万 - 项目类别:
Methods to enable robust and efficient use of genetic summary data
能够稳健、高效地使用遗传摘要数据的方法
- 批准号:
10462613 - 财政年份:2020
- 资助金额:
$ 45万 - 项目类别:
Methods to enable robust and efficient use of genetic summary data
能够稳健、高效地使用遗传摘要数据的方法
- 批准号:
10251150 - 财政年份:2020
- 资助金额:
$ 45万 - 项目类别:
NRI: FND: Robust Learning of Sequential Motion from Human Demonstrations to Enable Robot-Guided Exercise Training
NRI:FND:从人体演示中稳健地学习顺序运动,以实现机器人引导的运动训练
- 批准号:
1830597 - 财政年份:2019
- 资助金额:
$ 45万 - 项目类别:
Standard Grant