Collaborative Research: OAC Core: Small: Anomaly Detection and Performance Optimization for End-to-End Data Transfers at Scale
协作研究:OAC 核心:小型:大规模端到端数据传输的异常检测和性能优化
基本信息
- 批准号:2007789
- 负责人:
- 金额:$ 27.5万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-08-01 至 2024-01-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Despite continuous efforts and investments to upgrade the networking infrastructure of research and education institutions to meet the needs of large-scale science applications, the data transfers on these networks often perform very poorly. Understanding the underlying reasons for poor transfer performance is important yet challenging due to the sophisticated design of today's cyberinfrastructures. This project offers a set of novel models and algorithms to detect and mitigate performance issues of data transfers in research networks. The proposed suite of tools helps researchers and system administrators to pinpoint the root cause of performance problems of data transfers so that necessary actions can be taken swiftly to minimize their impact on ongoing transfers. The project will also integrate the research into all levels of education, including science projects with K-12 students, development of new curriculum modules for graduate- and undergraduate-level courses, and summer workshops specifically for minority groups.Understanding the true underlying reasons for poor transfer performance is key to mitigating them and delivering the promised transfer speeds. However, the involvement of multiple end systems, dynamically changing background traffic, and the complexity of today's networking infrastructures turns it into a complicated and time-consuming process. This project develops a novel anomaly-detection and performance-optimization framework for end-to-end data transfers at scale. The framework helps to predict, understand, diagnose, and optimize wide-area file transfers in today's extreme-scale cyberinfrastructures. To achieve this goal, it derives deep-neural-network-based predictive models that can relate transfer settings to throughput. These models are then used to estimate the optimal configuration for new transfers. The framework also gathers performance metrics for end-system and network resources periodically to keep track of system utilization. When a transfer anomaly is detected, the collected metrics are fed into anomaly-classification models to identify the root causes. Once the underlying reasons of performance problems are identified, the framework launches a real-time optimization process to reconfigure the transfer settings such that the impact of anomalies can be alleviated.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
尽管不断努力和投资来升级研究和教育机构的网络基础设施,以满足大规模科学应用的需求,但这些网络上的数据传输往往表现很差。由于当今网络基础设施的设计复杂,了解传输性能不佳的根本原因非常重要,但也具有挑战性。该项目提供了一组新颖的模型和算法来检测和缓解研究网络中数据传输的性能问题。拟议的工具套件可帮助研究人员和系统管理员查明数据传输性能问题的根本原因,以便迅速采取必要的措施,最大限度地减少对正在进行的传输的影响。 该项目还将把研究整合到各级教育中,包括 K-12 学生的科学项目、为研究生和本科生课程开发新课程模块,以及专门针对少数群体的夏季研讨会。传输性能差是缓解这些问题并提供所承诺的传输速度的关键。然而,多个终端系统的参与、动态变化的后台流量以及当今网络基础设施的复杂性使其成为一个复杂且耗时的过程。 该项目开发了一种新颖的异常检测和性能优化框架,用于大规模端到端数据传输。该框架有助于预测、理解、诊断和优化当今超大规模网络基础设施中的广域文件传输。为了实现这一目标,它派生了基于深度神经网络的预测模型,可以将传输设置与吞吐量联系起来。然后使用这些模型来估计新传输的最佳配置。该框架还定期收集终端系统和网络资源的性能指标,以跟踪系统利用率。当检测到传输异常时,收集的指标将被输入异常分类模型以识别根本原因。一旦确定了性能问题的根本原因,该框架就会启动实时优化流程来重新配置传输设置,从而减轻异常的影响。该奖项反映了 NSF 的法定使命,并通过使用评估结果被认为值得支持。基金会的智力价值和更广泛的影响审查标准。
项目成果
期刊论文数量(9)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Learning Transfers via Transfer Learning
通过迁移学习进行学习迁移
- DOI:
- 发表时间:2021-01
- 期刊:
- 影响因子:0
- 作者:Arifuzzaman, MD;Arslan, Engin
- 通讯作者:Arslan, Engin
Swift and Accurate End-to-End Throughput Measurements for High-Speed Networks
快速、准确的高速网络端到端吞吐量测量
- DOI:
- 发表时间:2022-01
- 期刊:
- 影响因子:0
- 作者:Arifuzzaman, Md;Arslan, Engin
- 通讯作者:Arslan, Engin
Use Only What You Need: Judicious Parallelism For File Transfers in High Performance Networks
仅使用您需要的:高性能网络中文件传输的明智并行性
- DOI:10.1145/3577193.3593722
- 发表时间:2023-06-21
- 期刊:
- 影响因子:0
- 作者:Md. Arifuzzaman;Engin Arslan
- 通讯作者:Engin Arslan
Falcon: Fair and Efficient Online File Transfer Optimization
Falcon:公平高效的在线文件传输优化
- DOI:10.1109/tpds.2023.3282872
- 发表时间:2023-08-01
- 期刊:
- 影响因子:5.3
- 作者:Md. Arifuzzaman;B. Bockelman;James Basney;Engin Arslan
- 通讯作者:Engin Arslan
Reliable Wide-Area Data Transfers for Streaming Workflows
适用于流式工作流程的可靠广域数据传输
- DOI:10.1109/tpds.2022.3158673
- 发表时间:2022-01
- 期刊:
- 影响因子:5.3
- 作者:Sapkota, Hemanta;Arslan, Engin
- 通讯作者:Arslan, Engin
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Engin Arslan其他文献
Scalable Quantum Repeater Deployment Modeling
可扩展的量子中继器部署建模
- DOI:
10.48550/arxiv.2305.09855 - 发表时间:
2023-05-16 - 期刊:
- 影响因子:0.5
- 作者:
Tasdiqul Islam;Engin Arslan - 通讯作者:
Engin Arslan
Reliable Wide-Area Data Transfers for Streaming Workflows
适用于流式工作流程的可靠广域数据传输
- DOI:
10.1109/tpds.2022.3158673 - 发表时间:
2024-09-13 - 期刊:
- 影响因子:5.3
- 作者:
Hemanta Sapkota;Engin Arslan - 通讯作者:
Engin Arslan
Application-Level Optimization of Big Data Transfers through Pipelining, Parallelism and Concurrency
通过管道、并行性和并发性对大数据传输进行应用级优化
- DOI:
10.1109/tcc.2015.2415804 - 发表时间:
2024-09-14 - 期刊:
- 影响因子:6.5
- 作者:
E. Yildirim;Engin Arslan;Jangyoung Kim;T. Kosar - 通讯作者:
T. Kosar
Deep learning for the security of software-defined networks: a review
软件定义网络安全的深度学习:综述
- DOI:
10.1007/s10586-023-04069-9 - 发表时间:
2023-07-15 - 期刊:
- 影响因子:0
- 作者:
Roya Taheri;Habib Ahmed;Engin Arslan - 通讯作者:
Engin Arslan
RIVAChain: Blockchain-based Integrity Verification for File Transfers
- DOI:
10.1109/bigdata50022.2020.9378235 - 发表时间:
2020-12-10 - 期刊:
- 影响因子:0
- 作者:
Ahmed Alhussen;Engin Arslan - 通讯作者:
Engin Arslan
Engin Arslan的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Engin Arslan', 18)}}的其他基金
Elements: Adaptive End-to-End Parallelism for Distributed Science Workflows
要素:分布式科学工作流程的自适应端到端并行性
- 批准号:
2427408 - 财政年份:2024
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
CAREER: Efficient and Reliable Data Transfer Services for Next Generation Research Networks
职业:为下一代研究网络提供高效可靠的数据传输服务
- 批准号:
2348281 - 财政年份:2023
- 资助金额:
$ 27.5万 - 项目类别:
Continuing Grant
Collaborative Research: OAC Core: Small: Anomaly Detection and Performance Optimization for End-to-End Data Transfers at Scale
协作研究:OAC 核心:小型:大规模端到端数据传输的异常检测和性能优化
- 批准号:
2412329 - 财政年份:2023
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
CAREER: Efficient and Reliable Data Transfer Services for Next Generation Research Networks
职业:为下一代研究网络提供高效可靠的数据传输服务
- 批准号:
2145742 - 财政年份:2022
- 资助金额:
$ 27.5万 - 项目类别:
Continuing Grant
Elements: Adaptive End-to-End Parallelism for Distributed Science Workflows
要素:分布式科学工作流程的自适应端到端并行性
- 批准号:
2209955 - 财政年份:2022
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
CRII: OAC: Online Optimization of End-to-End Data Transfers in High Performance Networks
CRII:OAC:高性能网络中端到端数据传输的在线优化
- 批准号:
1850353 - 财政年份:2019
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
相似国自然基金
基于肿瘤病理图片的靶向药物敏感生物标志物识别及统计算法的研究
- 批准号:82304250
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
肠道普拉梭菌代谢物丁酸抑制心室肌铁死亡改善老龄性心功能不全的机制研究
- 批准号:82300430
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
社会网络关系对公司现金持有决策影响——基于共御风险的作用机制研究
- 批准号:72302067
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
面向图像目标检测的新型弱监督学习方法研究
- 批准号:62371157
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
面向开放域对话系统信息获取的准确性研究
- 批准号:62376067
- 批准年份:2023
- 资助金额:51 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: OAC Core: CropDL - Scheduling and Checkpoint/Restart Support for Deep Learning Applications on HPC Clusters
合作研究:OAC 核心:CropDL - HPC 集群上深度学习应用的调度和检查点/重启支持
- 批准号:
2403088 - 财政年份:2024
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
Collaborative Research: OAC Core: CropDL - Scheduling and Checkpoint/Restart Support for Deep Learning Applications on HPC Clusters
合作研究:OAC 核心:CropDL - HPC 集群上深度学习应用的调度和检查点/重启支持
- 批准号:
2403090 - 财政年份:2024
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
Collaborative Research: OAC Core: Distributed Graph Learning Cyberinfrastructure for Large-scale Spatiotemporal Prediction
合作研究:OAC Core:用于大规模时空预测的分布式图学习网络基础设施
- 批准号:
2403313 - 财政年份:2024
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
Collaborative Research: OAC Core: Large-Scale Spatial Machine Learning for 3D Surface Topology in Hydrological Applications
合作研究:OAC 核心:水文应用中 3D 表面拓扑的大规模空间机器学习
- 批准号:
2414185 - 财政年份:2024
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant
Collaborative Research: OAC Core: Learning AI Surrogate of Large-Scale Spatiotemporal Simulations for Coastal Circulation
合作研究:OAC Core:学习沿海环流大规模时空模拟的人工智能替代品
- 批准号:
2402946 - 财政年份:2024
- 资助金额:
$ 27.5万 - 项目类别:
Standard Grant