喵ID:tw664L免责声明

Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

基于极值理论的 ML 算法的最坏情况收敛时间

基本信息

DOI:
--
发表时间:
2024
期刊:
2024 IEEE/ACM 3rd International Conference on AI Engineering – Software Engineering for AI (CAIN)
影响因子:
--
通讯作者:
Sriram Sankaranarayanan
中科院分区:
文献类型:
--
作者: Saeid Tizpaz;Sriram Sankaranarayanan研究方向: -- MeSH主题词: --
关键词: --
来源链接:pubmed详情页地址

文献摘要

This paper leverages the statistics of extreme values to predict the worst-case convergence times of machine learning algorithms. Timing is a critical non-functional property of ML systems, and providing the worst-case converge times is essential to guarantee the availability of ML and its services. However, timing properties such as worst-case convergence times (WCCT) are difficult to verify since (1) they are not encoded in the syntax or semantics of underlying programming languages of AI, (2) their evaluations depend on both algorithmic implementations and underlying systems, and (3) their measurements involve uncertainty and noise. Therefore, prevalent formal methods and statistical models fail to provide rich information on the amounts and likelihood of WCCT.Our key observation is that the timing information we seek represents the extreme tail of execution times. Therefore, extreme value theory (EVT), a statistical discipline that focuses on understanding and predicting the distribution of extreme values in the tail of outcomes, provides an ideal framework to model and analyze WCCT in the training and inference phases of ML paradigm. Building upon the mathematical tools from EVT, we propose a practical framework to predict the worst-case timing properties of ML. Over a set of linear ML training algorithms, we show that EVT achieves a better accuracy for predicting WCCTs than relevant statistical methods such as the Bayesian factor. On the set of larger machine learning training algorithms and deep neural network inference, we show the feasibility and usefulness of EVT models to accurately predict WCCTs, their expected return periods, and their likelihood.
本文利用极值的统计数据来预测机器学习算法的最坏情况。时间安排是ML系统的关键非功能性属性,提供最差的案例收敛时间对于确保ML及其服务的可用性至关重要。但是,很难验证定时属性(例如最差的融合时间(WCCT)),因为(1)它们未在AI的基础编程语言的语法或语法中编码,(2)他们的评估取决于算法实现和基础实现系统,(3)它们的测量涉及不确定性和噪声。因此,普遍的形式方法和统计模型未能提供有关WCCT的数量和可能性的丰富信息。我们的主要观察结果是,我们寻求的时序信息代表了执行时间的极端尾巴。因此,极端价值理论(EVT)是一种统计学科,侧重于理解和预测结果尾部中极值的分布,为在ML范式的训练和推理阶段中建模和分析WCCT提供了理想的框架。在EVT的数学工具的基础上,我们提出了一个实用的框架,以预测ML的最坏情况定时属性。在一系列线性ML训练算法上,我们表明EVT比相关统计方法(例如贝叶斯因素)实现了预测WCCT的精度。在较大的机器学习训练算法和深度神经网络推断上,我们展示了EVT模型准确预测WCCT,其预期返回期以及其可能性的可行性和实用性。
参考文献(1)
被引文献(0)

数据更新时间:{{ references.updateTime }}

Sriram Sankaranarayanan
通讯地址:
--
所属机构:
--
电子邮件地址:
--
免责声明免责声明
1、猫眼课题宝专注于为科研工作者提供省时、高效的文献资源检索和预览服务;
2、网站中的文献信息均来自公开、合规、透明的互联网文献查询网站,可以通过页面中的“来源链接”跳转数据网站。
3、在猫眼课题宝点击“求助全文”按钮,发布文献应助需求时求助者需要支付50喵币作为应助成功后的答谢给应助者,发送到用助者账户中。若文献求助失败支付的50喵币将退还至求助者账户中。所支付的喵币仅作为答谢,而不是作为文献的“购买”费用,平台也不从中收取任何费用,
4、特别提醒用户通过求助获得的文献原文仅用户个人学习使用,不得用于商业用途,否则一切风险由用户本人承担;
5、本平台尊重知识产权,如果权利所有者认为平台内容侵犯了其合法权益,可以通过本平台提供的版权投诉渠道提出投诉。一经核实,我们将立即采取措施删除/下架/断链等措施。
我已知晓