SPX: Collaborative Research: NG4S: A Next-generation Geo-distributed Scalable Stateful Stream Processing System
SPX:合作研究:NG4S:下一代地理分布式可扩展状态流处理系统
基本信息
- 批准号:1919126
- 负责人:
- 金额:$ 29.93万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2019
- 资助国家:美国
- 起止时间:2019-10-01 至 2022-01-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Our society increasingly relies on applications that process streaming data across geo-distributed sites, such as making business decisions from marketing data, identifying spam campaigns in social network streams, and analyzing genome datasets in different labs and countries to track the sources of potential epidemics. State-of-art solutions for these needs are centered around stateless stream processing. This project advances stream processing to enable next-generation streaming applications to store and update state along with computation, therefore processing live data streams in a timely fashion from massive and geo-distributed datasets. Existing systems are mainly designed for stateless stream processing in intra-datacenter settings and do not scale well for running stream applications that contain large distributed states. This project breaks the traditional abstractions of a centralized architecture and hashtable-based stateless operators, redefining them with a new decentralized architecture and new memory-efficient stateful operators, which enables novel approaches to improve overall system performance and scalability. This project builds a next-generation geo-distributed scalable stateful stream processing system that will significantly improve the scalability of stream processing systems. This work includes three primary research directions. (1) At the architecture level, a new decentralized 'many masters/many workers' architecture will be proposed, which provides each master with maximum independence. (2) At the operator level, a new in-memory data structure will be designed and implemented to store application state and minimize the memory overhead so as to handle 'big data' requirements. (3) A new shard-based parallel recovery mechanism will be proposed to handle failures and stragglers in a scalable way. All three parts of the project will be prototyped and implemented on a widely adopted stream processing system (Apache Storm).This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
我们的社会越来越依赖于处理跨地理分布站点的流数数据的应用程序,例如从营销数据中做出业务决策,识别社交网络流中的垃圾邮件活动,并分析不同实验室和国家 /地区的基因组数据集以跟踪潜在流行病的来源。满足这些需求的最新解决方案围绕无状态流处理。该项目推进了流处理,以启用下一代流媒体应用程序与计算一起存储和更新状态,因此从大型和地理分布的数据集中及时处理实时数据流。现有系统主要是为无状态的流化处理设置中的无状态流处理而设计的,对于运行包含大型分布式状态的流式应用程序的扩展不佳。该项目打破了集中式体系结构和基于Hashtable的无状态运营商的传统抽象,通过新的分散体系结构和新的记忆有效的状态运营商重新定义了它们,这使得可以提高整体系统性能和可扩展性。 该项目构建了下一代地理分配的可扩展状态流处理系统,该系统将显着提高流处理系统的可扩展性。这项工作包括三个主要研究方向。 (1)在体系结构层面上,将提出一个新的分散的“许多大师/许多工人的体系结构,为每个主人提供最大的独立性。 (2)在操作员级别上,将设计和实施一种新的内存数据结构,以存储应用程序状态并最大程度地减少内存开销,以便处理“大数据”要求。 (3)将提出一种新的基于碎片的平行恢复机制,以可扩展的方式处理故障和散落者。该项目的所有三个部分将在广泛采用的流处理系统(Apache Storm)上进行原型化和实施。该奖项反映了NSF的法定任务,并使用基金会的知识分子优点和更广泛的影响审查标准,认为值得通过评估来获得支持。
项目成果
期刊论文数量(3)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
FP4S: Fragment-based Parallel State Recovery for Stateful Stream Applications
- DOI:10.1109/ipdps47924.2020.00116
- 发表时间:2020-05
- 期刊:
- 影响因子:0
- 作者:Pinchao Liu;Hailu Xu;D. D. Silva-D.;Qingyang Wang;Sarker Tanzir Ahmed;Liting Hu
- 通讯作者:Pinchao Liu;Hailu Xu;D. D. Silva-D.;Qingyang Wang;Sarker Tanzir Ahmed;Liting Hu
SR3: Customizable Recovery for Stateful Stream Processing Systems
- DOI:10.1145/3423211.3425681
- 发表时间:2020-12
- 期刊:
- 影响因子:0
- 作者:Hailu Xu;Pinchao Liu;Susana Cruz-Diaz;D. D. Silva-D.;Liting Hu
- 通讯作者:Hailu Xu;Pinchao Liu;Susana Cruz-Diaz;D. D. Silva-D.;Liting Hu
DART: A Scalable and Adaptive Edge Stream Processing Engine
DART:可扩展的自适应边缘流处理引擎
- DOI:
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Liu, Pinchao;Silva, Dilma Da;Hu, Liting.
- 通讯作者:Hu, Liting.
共 3 条
- 1
Liting Hu其他文献
Project Hoover: auto-scaling streaming map-reduce applications
Project Hoover:自动缩放流式 Map-Reduce 应用程序
- DOI:10.1145/2378356.237835910.1145/2378356.2378359
- 发表时间:20122012
- 期刊:
- 影响因子:0
- 作者:Rajalakshmi Ramesh;Liting Hu;K. SchwanRajalakshmi Ramesh;Liting Hu;K. Schwan
- 通讯作者:K. SchwanK. Schwan
RBAY: A Scalable and Extensible Information Plane for Federating Distributed Datacenter Resources
RBAY:用于联合分布式数据中心资源的可扩展且可扩展的信息平面
- DOI:10.1109/icdcs.2017.4210.1109/icdcs.2017.42
- 发表时间:20172017
- 期刊:
- 影响因子:0
- 作者:Xin Chen;Liting Hu;D. Blough;M. Kozuch;M. WolfXin Chen;Liting Hu;D. Blough;M. Kozuch;M. Wolf
- 通讯作者:M. WolfM. Wolf
Max orientation coverage: efficient path planning to avoid collisions in the CNC milling of 3D objects
最大方向覆盖范围:有效的路径规划,以避免 3D 对象 CNC 铣削中的碰撞
- DOI:
- 发表时间:20202020
- 期刊:
- 影响因子:0
- 作者:Xin Chen;Thomas M. Tucker;T. Kurfess;R. Vuduc;Liting HuXin Chen;Thomas M. Tucker;T. Kurfess;R. Vuduc;Liting Hu
- 通讯作者:Liting HuLiting Hu
Decaffe: DHT Tree-Based Online Federated Fake News Detection
Decaffe:基于 DHT 树的在线联合假新闻检测
- DOI:
- 发表时间:20232023
- 期刊:
- 影响因子:0
- 作者:Cheng;Liting HuCheng;Liting Hu
- 通讯作者:Liting HuLiting Hu
Breaking Down Hadoop Distributed File Systems Data Analytics Tools: Apache Hive vs. Apache Pig vs. Pivotal HWAQ
分解 Hadoop 分布式文件系统数据分析工具:Apache Hive、Apache Pig 与 Pivotal HWAQ
- DOI:10.1109/cloud.2017.11710.1109/cloud.2017.117
- 发表时间:20172017
- 期刊:
- 影响因子:0
- 作者:Xin Chen;Liting Hu;Liangqi Liu;Jing Chang;Diana Leante BoneXin Chen;Liting Hu;Liangqi Liu;Jing Chang;Diana Leante Bone
- 通讯作者:Diana Leante BoneDiana Leante Bone
共 12 条
- 1
- 2
- 3
Liting Hu的其他基金
CNS Core: Small: Core Scheduling Techniques and Programming Abstractions for Scalable Serverless Edge Computing Engine
CNS Core:小型:可扩展无服务器边缘计算引擎的核心调度技术和编程抽象
- 批准号:23229192322919
- 财政年份:2024
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
OAC Core: A Scalable and Deployable Container Orchestration Cyber Infrastructure Toolkit for Deploying Big Data Analytics Applications in Public Cloud
OAC Core:用于在公共云中部署大数据分析应用程序的可扩展和可部署的容器编排网络基础设施工具包
- 批准号:23137382313738
- 财政年份:2023
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
OAC Core: A Scalable and Deployable Container Orchestration Cyber Infrastructure Toolkit for Deploying Big Data Analytics Applications in Public Cloud
OAC Core:用于在公共云中部署大数据分析应用程序的可扩展和可部署的容器编排网络基础设施工具包
- 批准号:22122562212256
- 财政年份:2022
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
SPX: Collaborative Research: NG4S: A Next-generation Geo-distributed Scalable Stateful Stream Processing System
SPX:合作研究:NG4S:下一代地理分布式可扩展状态流处理系统
- 批准号:22028592202859
- 财政年份:2022
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
CAREER: Scalable and Adaptive Edge Stream Processing
职业:可扩展和自适应边缘流处理
- 批准号:23137372313737
- 财政年份:2022
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Continuing GrantContinuing Grant
CAREER: Scalable and Adaptive Edge Stream Processing
职业:可扩展和自适应边缘流处理
- 批准号:22056772205677
- 财政年份:2021
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Continuing GrantContinuing Grant
CAREER: Scalable and Adaptive Edge Stream Processing
职业:可扩展和自适应边缘流处理
- 批准号:19430711943071
- 财政年份:2020
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Continuing GrantContinuing Grant
相似国自然基金
数智背景下的团队人力资本层级结构类型、团队协作过程与团队效能结果之间关系的研究
- 批准号:72372084
- 批准年份:2023
- 资助金额:40 万元
- 项目类别:面上项目
在线医疗团队协作模式与绩效提升策略研究
- 批准号:72371111
- 批准年份:2023
- 资助金额:41 万元
- 项目类别:面上项目
面向人机接触式协同作业的协作机器人交互控制方法研究
- 批准号:62373044
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
基于数字孪生的颅颌面人机协作智能手术机器人关键技术研究
- 批准号:82372548
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
A-型结晶抗性淀粉调控肠道细菌协作产丁酸机制研究
- 批准号:32302064
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
SPX: Collaborative Research: Automated Synthesis of Extreme-Scale Computing Systems Using Non-Volatile Memory
SPX:协作研究:使用非易失性存储器自动合成超大规模计算系统
- 批准号:24089252408925
- 财政年份:2023
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
SPX: Collaborative Research: Scalable Neural Network Paradigms to Address Variability in Emerging Device based Platforms for Large Scale Neuromorphic Computing
SPX:协作研究:可扩展神经网络范式,以解决基于新兴设备的大规模神经形态计算平台的可变性
- 批准号:24015442401544
- 财政年份:2023
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
SPX: Collaborative Research: Intelligent Communication Fabrics to Facilitate Extreme Scale Computing
SPX:协作研究:促进超大规模计算的智能通信结构
- 批准号:24121822412182
- 财政年份:2023
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
SPX: Collaborative Research: Cross-stack Memory Optimizations for Boosting I/O Performance of Deep Learning HPC Applications
SPX:协作研究:用于提升深度学习 HPC 应用程序 I/O 性能的跨堆栈内存优化
- 批准号:23186282318628
- 财政年份:2022
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant
SPX: Collaborative Research: NG4S: A Next-generation Geo-distributed Scalable Stateful Stream Processing System
SPX:合作研究:NG4S:下一代地理分布式可扩展状态流处理系统
- 批准号:22028592202859
- 财政年份:2022
- 资助金额:$ 29.93万$ 29.93万
- 项目类别:Standard GrantStandard Grant