Synthesis and Verification in Markov Game Structures
马尔可夫博弈结构的综合与验证
基本信息
- 批准号:EP/H046623/1
- 负责人:
- 金额:$ 42.75万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2010
- 资助国家:英国
- 起止时间:2010 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
To meet the objectives of our project, we will divide our research into five work packages.The first work package is devoted to representing the control problems that we want to approach and to identify benchmarks and case studies as guidelines for relevant demands and measures of success for the applied aspects of our project. The starting point for our models will be- a generalisation of interactive Markov chains to 2.5 player games, a model in which the decisions of the different players are physically separated by assigning them to different states, and- a generalisation of Markov decision processes to Markov games, a model in which the decisions of both (or, more generally, of all) players are entangled and represented in the same node.We will extend these models by representations of the observational and computational power of the controllers under consideration, and formalisations of the--simple--objectives we want to meet.Additionally, we will develop benchmarks and case studies to guide the applied aspects of our project, and to root it in different communities--in particular in engineering and IT--by reflecting their respective demands.Work packages two, three, and four from the theoretic core of our work. Our second work package will address the simple question of constructing controllers with complete information, while a third work package will address the generalisation of these techniques to controllers with incomplete information but, for distributed controllers, equivalent information about the system state.Different to discrete systems, the abstraction (or restricted observability) of time plays a paramount role when considering incomplete information of these systems. This particular type of abstraction has proven to often simplify the construction of optimal strategies: The construction of optimal time-abstract strategies (and the proof of their existence) is much simpler than the construction of time dependent ones.The fourth work package refers to the extension of these results to distributed schedulers with different observational power.For work packages two, three, and four, we will study the decidability of quantitative and qualitative safety and reachability properties.In a fifth work package we will focus on algorithmic aspects like the development and selection of appropriate data structures of the model checking and optimisation problems, and develop prototype implementations that solve as a proof-of-concept for their applicability for a selection of the developed approaches. These proof-of-concept implementations will also play an important role in determining the applicability and potential of the techniques developed in the project on the target implementations defined in the first work package, and as means to communicate our results for dissemination and exploitation purposes.
为了实现我们项目的目标,我们将研究将研究包分为五个工作包。第一个工作包致力于代表我们想要解决的控制问题,并确定基准和案例研究作为对我们项目所应用方面成功的相关要求和成功措施的指南。我们的模型的起点将是 - 将交互式马尔可夫链条概括为2.5个玩家游戏,在这种模型中,不同玩家的决策通过将它们分配给不同的状态而在物理上分开,并且 - 马尔可夫决策过程对马尔可夫游戏的概括,这是一个模型,在这些模型中,这两种模型都在同一范围内代表这些模型。正在考虑的 - 简单的 - 目标 - 我们想遇到的目标。在此方面,我们将开发基准和案例研究,以指导项目的应用方面,并将其植根于不同社区的工程和IT,尤其是通过反映他们各自的需求。我们的第二个工作包将解决使用完整信息构建控制器的简单问题,而第三个工作包将解决这些技术对控制器的概括,但对于分布式控制器,对于分布式控制器,有关系统状态的等效信息,与离散系统不同,抽象(或限制性观察力)在考虑这些系统不完整的信息时扮演着最重要的角色。事实证明,这种特殊类型的抽象经常简化最佳策略的构建:最佳的抽吸策略(以及其存在的证明)比依赖时间依赖的构建的构建要简单得多。第四个工作套餐指的是这些结果的庞大涉及分配的调度程序,以分配不同的观察力和质量的质量,我们将质量和四个属性。第五工作包我们将重点介绍算法方面,例如模型检查和优化问题的适当数据结构的开发和选择,并开发原型实现,这些实现是为其选择开发方法的适用性的概念证明。这些概念验证实现也将在确定项目中定义的目标实现中开发的技术的适用性和潜力方面发挥重要作用,并作为将我们的结果传播以进行传播和开发目的的方式。
项目成果
期刊论文数量(10)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Finding Approximate Nash Equilibria of Bimatrix Games via Payoff Queries
通过支付查询找到 Bimatrix 博弈的近似纳什均衡
- DOI:10.1145/2956579
- 发表时间:2016
- 期刊:
- 影响因子:1.2
- 作者:Fearnley J
- 通讯作者:Fearnley J
Reachability in two-clock timed automata is PSPACE-complete
- DOI:10.1016/j.ic.2014.12.004
- 发表时间:2013-02
- 期刊:
- 影响因子:0
- 作者:John Fearnley;M. Jurdzinski
- 通讯作者:John Fearnley;M. Jurdzinski
CTL* synthesis via LTL synthesis
通过 LTL 合成进行 CTL* 合成
- DOI:10.4204/eptcs.260.4
- 发表时间:2017
- 期刊:
- 影响因子:0
- 作者:Bloem R
- 通讯作者:Bloem R
ISCASMC: A Web-Based Probabilistic Model Checker
- DOI:10.1007/978-3-319-06410-9_22
- 发表时间:2014-01-01
- 期刊:
- 影响因子:0
- 作者:Hahn, Ernst Moritz;Li, Yi;Zhang, Lijun
- 通讯作者:Zhang, Lijun
An ordered approach to solving parity games in quasi-polynomial time and quasi-linear space
- DOI:10.1007/s10009-019-00509-3
- 发表时间:2019-06-01
- 期刊:
- 影响因子:1.5
- 作者:Fearnley, John;Jain, Sanjay;Wojtczak, Dominik
- 通讯作者:Wojtczak, Dominik
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Sven Schewe其他文献
Hydrogen permeation and embrittlement behavior of ferritic SOEC/SOFC interconnect candidates
铁素体 SOEC/SOFC 互连候选材料的氢渗透和脆化行为
- DOI:
10.1016/j.ijhydene.2024.03.337 - 发表时间:
2024 - 期刊:
- 影响因子:7.2
- 作者:
David Kniep;Sven Schewe;Mario Rudolphi;Mathias Christian Galetz - 通讯作者:
Mathias Christian Galetz
Sven Schewe的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Sven Schewe', 18)}}的其他基金
TRUSTED: SecuriTy SummaRies for SecUre SofTwarE Development
值得信赖:安全软件开发的安全摘要
- 批准号:
EP/X03688X/1 - 财政年份:2023
- 资助金额:
$ 42.75万 - 项目类别:
Research Grant
Below the Branches of Universal Trees
普世树枝下
- 批准号:
EP/X017796/1 - 财政年份:2023
- 资助金额:
$ 42.75万 - 项目类别:
Research Grant
Valuation Structures for Infinite Duration Games
无限期游戏的估值结构
- 批准号:
EP/Y027663/1 - 财政年份:2023
- 资助金额:
$ 42.75万 - 项目类别:
Fellowship
Reinforcement Learning for Finite Horizons (ReLeaF)
有限视野强化学习 (ReLeaF)
- 批准号:
EP/X021513/1 - 财政年份:2022
- 资助金额:
$ 42.75万 - 项目类别:
Fellowship
Solving Parity Games in Theory and Practice
从理论和实践中解决平价博弈
- 批准号:
EP/P020909/1 - 财政年份:2017
- 资助金额:
$ 42.75万 - 项目类别:
Research Grant
相似国自然基金
可控硫醇交换辅助金团簇活性结构位点确认及其构效关系研究
- 批准号:22301001
- 批准年份:2023
- 资助金额:30.00 万元
- 项目类别:青年科学基金项目
APOBEC特征性突变作为卵巢透明细胞癌免疫治疗生物标志物的确认研究
- 批准号:82303968
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
针对严重耐药革兰阴性菌的药物新靶标确认及先导分子发现研究
- 批准号:82330110
- 批准年份:2023
- 资助金额:220 万元
- 项目类别:重点项目
Fcγ受体驱动Ntrk1活化诱导巨噬细胞对抗体依赖性细胞吞噬获得性耐受的机制研究和靶标确认
- 批准号:82373892
- 批准年份:2023
- 资助金额:48 万元
- 项目类别:面上项目
NRG-1/miR143-3p通路调控奥氮平治疗作用抵抗机制与干预靶点确认
- 批准号:82373854
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
相似海外基金
Agent-based Intelligent Messaging Systems, Natural Language Generation, Markov Decision Processes, Verification, Machine Learning
基于代理的智能消息系统、自然语言生成、马尔可夫决策过程、验证、机器学习
- 批准号:
520347-2017 - 财政年份:2019
- 资助金额:
$ 42.75万 - 项目类别:
Collaborative Research and Development Grants
Agent-based Intelligent Messaging Systems, Natural Language Generation, Markov Decision Processes, Verification, Machine Learning
基于代理的智能消息系统、自然语言生成、马尔可夫决策过程、验证、机器学习
- 批准号:
520347-2017 - 财政年份:2018
- 资助金额:
$ 42.75万 - 项目类别:
Collaborative Research and Development Grants
Theory and its Verification of Decision under the Interaction between Individuals and Groups
个体与群体互动下的决策理论及其验证
- 批准号:
16K01265 - 财政年份:2016
- 资助金额:
$ 42.75万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Model-checking quantum Markov chains: towards verification techniques for quantum cryptographic systems
模型检查量子马尔可夫链:面向量子密码系统的验证技术
- 批准号:
DP130102764 - 财政年份:2013
- 资助金额:
$ 42.75万 - 项目类别:
Discovery Projects
Modeling of Internet Traffic Sources and Its Verification by Measurement
互联网流量源建模及其测量验证
- 批准号:
13650443 - 财政年份:2001
- 资助金额:
$ 42.75万 - 项目类别:
Grant-in-Aid for Scientific Research (C)